A structural-alphabet-based strategy for finding structural motifs across protein families
Wu, Chih Yuan; Chen, Yao Chi; Lim, Carmay
2010-01-01
Proteins with insignificant sequence and overall structure similarity may still share locally conserved contiguous structural segments; i.e. structural/3D motifs. Most methods for finding 3D motifs require a known motif to search for other similar structures or functionally/structurally crucial residues. Here, without requiring a query motif or essential residues, a fully automated method for discovering 3D motifs of various sizes across protein families with different folds based on a 16-letter structural alphabet is presented. It was applied to structurally non-redundant proteins bound to DNA, RNA, obligate/non-obligate proteins as well as free DNA-binding proteins (DBPs) and proteins with known structures but unknown function. Its usefulness was illustrated by analyzing the 3D motifs found in DBPs. A non-specific motif was found with a ‘corner’ architecture that confers a stable scaffold and enables diverse interactions, making it suitable for binding not only DNA but also RNA and proteins. Furthermore, DNA-specific motifs present ‘only’ in DBPs were discovered. The motifs found can provide useful guidelines in detecting binding sites and computational protein redesign. PMID:20525797
Regad, Leslie; Martin, Juliette; Camproux, Anne-Claude
2011-06-20
One strategy for protein function annotation is to search for particular structural motifs that are known to be shared by proteins with a given function. Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into one-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in a few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs such as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in binding small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins. PMID:21689388
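The idea of scoring seven-letter words for over-representation within a superfamily can be illustrated with a minimal sketch. It assumes loops have already been encoded as structural-alphabet strings; the function names, the toy strings and the simple frequency-ratio score are all assumptions made for illustration, and the ratio is only a stand-in for the advanced pattern statistics the abstract refers to.

```python
from collections import Counter


def count_words(sequences, k=7):
    """Count every overlapping k-letter word in a list of encoded strings."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def enrichment(loops_by_superfamily, k=7, pseudocount=1.0):
    """Return {(superfamily, word): ratio of within-family to background frequency}.

    A plain frequency ratio with a pseudocount; NOT the statistic used in the paper.
    """
    per_family = {sf: count_words(seqs, k) for sf, seqs in loops_by_superfamily.items()}
    background = Counter()
    for counts in per_family.values():
        background.update(counts)
    total_bg = sum(background.values())

    scores = {}
    for sf, counts in per_family.items():
        total_sf = sum(counts.values())
        for word, n in counts.items():
            freq_sf = (n + pseudocount) / (total_sf + pseudocount)
            freq_bg = (background[word] + pseudocount) / (total_bg + pseudocount)
            scores[(sf, word)] = freq_sf / freq_bg
    return scores


# Made-up encoded loops; the letters are arbitrary placeholders, not real HMM-SA output.
toy_loops = {
    "sf_A": ["ABCDEFGH", "ABCDEFGA"],
    "sf_B": ["HGFEDCBA"],
}
scores = enrichment(toy_loops)
```

A word with a ratio well above 1 in one superfamily and absent elsewhere would be a candidate superfamily-specific motif; a word scoring above 1 in many superfamilies would be a candidate ubiquitous motif. A real analysis would need a proper significance test and multiple-testing control.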
Niv, Masha Y.; Skrabanek, Lucy; Roberts, Richard J.; Scheraga, Harold A.; Weinstein, Harel
2008-01-01
Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered efforts at specificity engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CCGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering. PMID:17972284
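Regular expression matching with optional secondary-structure constraints can be sketched as follows. This is not Scan2S or its pattern syntax: the function name, the one-letter secondary-structure codes (H, E, C), the spacer length in the toy pattern and the toy sequence are all assumptions chosen for illustration.

```python
import re


def scan_with_ss(sequence, secondary, seq_pattern, ss_constraints):
    """Yield (start, matched_text) for regex hits that also satisfy SS constraints.

    sequence       -- amino-acid string
    secondary      -- same-length string of secondary-structure codes
    seq_pattern    -- ordinary regular expression over the sequence
    ss_constraints -- {offset within the match: set of allowed SS codes}
    """
    if len(sequence) != len(secondary):
        raise ValueError("sequence and secondary-structure strings must be the same length")
    # Wrapping the pattern in a lookahead lets overlapping hits be reported.
    for m in re.finditer(f"(?=({seq_pattern}))", sequence):
        start, text = m.start(), m.group(1)
        if all(offset < len(text) and secondary[start + offset] in allowed
               for offset, allowed in ss_constraints.items()):
            yield start, text


# Toy data (hypothetical). The pattern is a simplified PD...(D/E)XK-like expression.
toy_seq = "AAPDGSEVKAAPDGSDVKAA"
toy_ss = "CCCCCEEEECCCCCHHHHCC"
pattern = r"PD.{2,4}[DE].K"

# Require the acidic residue (offset 4 in these hits) to sit in a strand ("E").
constrained = list(scan_with_ss(toy_seq, toy_ss, pattern, {4: {"E"}}))
unconstrained = list(scan_with_ss(toy_seq, toy_ss, pattern, {}))
```

Here the sequence pattern alone matches twice, while the structural constraint keeps only the hit whose acidic residue lies in a strand. Constraints keyed by fixed offsets are a simplification; patterns with variable-length spacers would need constraints tied to capture groups instead.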
RNA 3D Structural Motifs: Definition, Identification, Annotation, and Database Searching
NASA Astrophysics Data System (ADS)
Nasalean, Lorena; Stombaugh, Jesse; Zirbel, Craig L.; Leontis, Neocles B.
Structured RNA molecules resemble proteins in the hierarchical organization of their global structures, folding and broad range of functions. Structured RNAs are composed of recurrent modular motifs that play specific functional roles. Some motifs direct the folding of the RNA or stabilize the folded structure through tertiary interactions. Others bind ligands or proteins or catalyze chemical reactions. Therefore, it is desirable, starting from the RNA sequence, to be able to predict the locations of recurrent motifs in RNA molecules. Conversely, the potential occurrence of one or more known 3D RNA motifs may indicate that a genomic sequence codes for a structured RNA molecule. To identify known RNA structural motifs in new RNA sequences, precise structure-based definitions are needed that specify the core nucleotides of each motif and their conserved interactions. By comparing instances of each recurrent motif and applying base pair isostericity relations, one can identify neutral mutations that preserve its structure and function in the contexts in which it occurs.
Chemical Space Mapping and Structure-Activity Analysis of the ChEMBL Antiviral Compound Set.
Klimenko, Kyrylo; Marcou, Gilles; Horvath, Dragos; Varnek, Alexandre
2016-08-22
Curation, standardization and data fusion of the antiviral information present in the ChEMBL public database led to the definition of a robust data set, providing an association of antiviral compounds to seven broadly defined antiviral activity classes. Generative topographic mapping (GTM) subjected to evolutionary tuning was then used to produce maps of the antiviral chemical space, providing an optimal separation of compound families associated with the different antiviral classes. The ability to pinpoint the specific spots occupied (responsibility patterns) on a map by various classes of antiviral compounds opened the way for a GTM-supported search for privileged structural motifs, typical for each antiviral class. The privileged locations of antiviral classes were analyzed in order to highlight underlying privileged common structural motifs. Unlike in classical medicinal chemistry, where privileged structures are, almost always, predefined scaffolds, privileged structural motif detection based on GTM responsibility patterns has the decisive advantage of being able to automatically capture the nature (the "resolution detail": scaffold, detailed substructure, pharmacophore pattern, etc.) of the relevant structural motifs. Responsibility patterns were found to represent underlying structural motifs of various natures, from very fuzzy (groups of various "interchangeable" similar scaffolds), to the classical scenario in medicinal chemistry (underlying motif actually being the scaffold), to very precisely defined motifs (specifically substituted scaffolds).
Kshirsagar, Rucha; Khan, Krishnendu; Joshi, Mamata V; Hosur, Ramakrishna V; Muniyappa, K
2017-05-23
A plethora of evidence suggests that different types of DNA quadruplexes are widely present in the genome of all organisms. The existence of a growing number of proteins that selectively bind and/or process these structures underscores their biological relevance. Moreover, G-quadruplex DNA has been implicated in the alignment of four sister chromatids by forming parallel guanine quadruplexes during meiosis; however, the underlying mechanism is not well defined. Here we show that a G/C-rich motif associated with a meiosis-specific DNA double-strand break (DSB) in Saccharomyces cerevisiae folds into G-quadruplex, and the C-rich sequence complementary to the G-rich sequence forms an i-motif. The presence of G-quadruplex or i-motif structures upstream of the green fluorescent protein-coding sequence markedly reduces the levels of gfp mRNA expression in S. cerevisiae cells, with a concomitant decrease in green fluorescent protein abundance, and blocks primer extension by DNA polymerase, thereby demonstrating the functional significance of these structures. Surprisingly, although S. cerevisiae Hop1, a component of synaptonemal complex axial/lateral elements, exhibits strong affinity to G-quadruplex DNA, it displays a much weaker affinity for the i-motif structure. However, the Hop1 C-terminal but not the N-terminal domain possesses strong i-motif binding activity, implying that the C-terminal domain has a distinct substrate specificity. Additionally, we found that Hop1 promotes intermolecular pairing between G/C-rich DNA segments associated with a meiosis-specific DSB site. Our results support the idea that the G/C-rich motifs associated with meiosis-specific DSBs fold into intramolecular G-quadruplex and i-motif structures, both in vitro and in vivo, thus revealing an important link between non-B form DNA structures and Hop1 in meiotic chromosome synapsis and recombination. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
TFBSshape: a motif database for DNA shape features of transcription factor binding sites
Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W.; Gordân, Raluca; Rohs, Remo
2014-01-01
Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein–DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone. PMID:24214955
Stewart, H.; Bingham, R.J.; White, S. J.; Dykeman, E. C.; Zothner, C.; Tuplin, A. K.; Stockley, P. G.; Twarock, R.; Harris, M.
2016-01-01
The specific packaging of the hepatitis C virus (HCV) genome is hypothesised to be driven by Core-RNA interactions. To identify the regions of the viral genome involved in this process, we used SELEX (systematic evolution of ligands by exponential enrichment) to identify RNA aptamers which bind specifically to Core in vitro. Comparison of these aptamers to multiple HCV genomes revealed the presence of a conserved terminal loop motif within short RNA stem-loop structures. We postulated that interactions of these motifs, as well as sub-motifs which were present in HCV genomes at statistically significant levels, with the Core protein may drive virion assembly. We mutated 8 of these predicted motifs within the HCV infectious molecular clone JFH-1, thereby producing a range of mutant viruses predicted to possess altered RNA secondary structures. RNA replication and viral titre were unaltered in viruses possessing only one mutated structure. However, infectivity titres were decreased in viruses possessing a higher number of mutated regions. This work thus identified multiple novel RNA motifs which appear to contribute to genome packaging. We suggest that these structures act as cooperative packaging signals to drive specific RNA encapsidation during HCV assembly. PMID:26972799
Motivated Proteins: A web application for studying small three-dimensional protein motifs
Leader, David P; Milner-White, E James
2009-01-01
Background Small loop-shaped motifs are common constituents of the three-dimensional structure of proteins. Typically they comprise between three and seven amino acid residues, and are defined by a combination of dihedral angles and hydrogen bonding partners. The most abundant of these are αβ-motifs, asx-motifs, asx-turns, β-bulges, β-bulge loops, β-turns, nests, niches, Schellmann loops, ST-motifs, ST-staples and ST-turns. We have constructed a database of such motifs from a range of high-quality protein structures and built a web application as a visual interface to this. Description The web application, Motivated Proteins, provides access to these 12 motifs (with 48 sub-categories) in a database of over 400 representative proteins. Queries can be made for specific categories or sub-categories of motif, motifs in the vicinity of ligands, motifs which include part of an enzyme active site, overlapping motifs, or motifs which include a particular amino acid sequence. Individual proteins can be specified, or, where appropriate, motifs for all proteins listed. The results of queries are presented in textual form as an (X)HTML table, and may be saved as parsable plain text or XML. Motifs can be viewed and manipulated either individually or in the context of the protein in the Jmol applet structural viewer. Cartoons of the motifs imposed on a linear representation of protein secondary structure are also provided. Summary information for the motifs is available, as are histograms of amino acid distribution, and graphs of dihedral angles at individual positions in the motifs. Conclusion Motivated Proteins is a publicly and freely accessible web application that enables protein scientists to study small three-dimensional motifs without requiring knowledge of either Structured Query Language or the underlying database schema. PMID:19210785
Finding the target sites of RNA-binding proteins
Li, Xiao; Kazan, Hilal; Lipshitz, Howard D; Morris, Quaid D
2014-01-01
RNA–protein interactions differ from DNA–protein interactions because of the central role of RNA secondary structure. Some RNA-binding domains (RBDs) recognize their target sites mainly by their shape and geometry and others are sequence-specific but are sensitive to secondary structure context. A number of small- and large-scale experimental approaches have been developed to measure RNAs associated in vitro and in vivo with RNA-binding proteins (RBPs). Generalizing outside of the experimental conditions tested by these assays requires computational motif finding. Often RBP motif finding is done by adapting DNA motif finding methods; but modeling secondary structure context leads to better recovery of RBP-binding preferences. Genome-wide assessment of mRNA secondary structure has recently become possible, but these data must be combined with computational predictions of secondary structure before they add value in predicting in vivo binding. There are two main approaches to incorporating structural information into motif models: supplementing primary sequence motif models with preferred secondary structure contexts (e.g., MEMERIS and RNAcontext) and directly modeling secondary structure recognized by the RBP using stochastic context-free grammars (e.g., CMfinder and RNApromo). The former better reconstruct known binding preferences for sequence-specific RBPs but are not suitable for modeling RBPs that recognize shape and geometry of RNAs. Future work in RBP motif finding should incorporate interactions between multiple RBDs and multiple RBPs in binding to RNA. WIREs RNA 2014, 5:111–130. doi: 10.1002/wrna.1201 PMID:24217996
Conservation of the Human Integrin-Type Beta-Propeller Domain in Bacteria
Chouhan, Bhanupratap; Denesyuk, Alexander; Heino, Jyrki; Johnson, Mark S.; Denessiouk, Konstantin
2011-01-01
Integrins are heterodimeric cell-surface receptors with key functions in cell-cell and cell-matrix adhesion. Integrin α and β subunits are present throughout the metazoans, but it is unclear whether the subunits predate the origin of multicellular organisms. Several component domains have been detected in bacteria, one of which, a specific 7-bladed β-propeller domain, is a unique feature of the integrin α subunits. Here, we describe a structure-derived motif, which incorporates key features of each blade from the X-ray structures of human αIIbβ3 and αVβ3, includes elements of the FG-GAP/Cage and Ca2+-binding motifs, and is specific only for the metazoan integrin domains. Separately, we searched all available sequences from bacteria and unicellular eukaryotic organisms for metazoan integrin-type β-propeller domains, requiring seven repeats, corresponding to the seven blades of the β-propeller domain, with the newly found structure-derived motif present in every repeat. As a result, among 47 available genomes of unicellular eukaryotes we could not find a single instance of seven repeats with the motif. Several sequences contained three repeats, a predicted transmembrane segment, and a short cytoplasmic motif associated with some integrins, but otherwise differ from the metazoan integrin α subunits. Among the available bacterial sequences, we found five examples containing seven sequential metazoan integrin-specific motifs within the seven repeats. The motifs differ in having one Ca2+-binding site per repeat, whereas metazoan integrins have three or four sites. The bacterial sequences are more conserved in terms of motif conservation and loop length, suggesting that the structure is more regular and compact than the example structures from human integrins. Although the bacterial examples are not full-length integrins, the full-length metazoan-type 7-bladed β-propeller domains are present, and sometimes two tandem copies are found. PMID:22022374
Kinjo, Akira R; Nakamura, Haruki
2013-01-01
Protein functions are mediated by interactions between proteins and other molecules. One useful approach to analyze protein functions is to compare and classify the structures of interaction interfaces of proteins. Here, we describe the procedures for compiling a database of interface structures and efficiently comparing the interface structures. To do so requires a good understanding of the data structures of the Protein Data Bank (PDB). Therefore, we also provide a detailed account of the PDB exchange dictionary necessary for extracting data that are relevant for analyzing interaction interfaces and secondary structures. We identify recurring structural motifs by classifying similar interface structures, and we define a coarse-grained representation of supersecondary structures (SSS) which represents a sequence of two or three secondary structure elements including their relative orientations as a string of four to seven letters. By examining the correspondence between structural motifs and SSS strings, we show that no SSS string has a particularly high propensity to be found in interaction interfaces in general, indicating that any SSS can be used as a binding interface. When individual structural motifs are examined, there are some SSS strings that have high propensity for particular groups of structural motifs. In addition, it is shown that while the SSS strings found in particular structural motifs for nonpolymer and protein interfaces are as abundant as in other structural motifs that belong to the same subunit, structural motifs for nucleic acid interfaces exhibit somewhat stronger preference for SSS strings. In regard to protein folds, many motif-specific SSS strings were found across many folds, suggesting that SSS may be a useful description to investigate the universality of ligand binding modes.
Maurer-Stroh, Sebastian; Gao, He; Han, Hao; Baeten, Lies; Schymkowitz, Joost; Rousseau, Frederic; Zhang, Louxin; Eisenhaber, Frank
2013-02-01
Data mining in protein databases, derivatives of more fundamental protein 3D structure and sequence databases, has considerable untapped potential for the discovery of sequence motif-structural motif-function relationships, as the finding of the U-shape (Huf-Zinc) motif, originally a small student's project, exemplifies. The metal ion zinc is critically involved in universal biological processes, ranging from protein-DNA complexes and transcription regulation to enzymatic catalysis and metabolic pathways. Proteins have evolved a series of motifs to specifically recognize and bind zinc ions. Many of these, so-called zinc fingers, are structurally independent globular domains with discontinuous binding motifs made up of residues mostly far apart in sequence. Through a systematic approach starting from the BRIX structure fragment database, we discovered that there exists another predictable subset of zinc-binding motifs that not only have a conserved continuous sequence pattern but also share a characteristic local conformation, despite being included in totally different overall folds. While this does not allow general prediction of all Zn binding motifs, an HMM-based web server, Huf-Zinc, is available for prediction of these novel, as well as conventional, zinc finger motifs in protein sequences. The Huf-Zinc webserver can be freely accessed through this URL (http://mendel.bii.a-star.edu.sg/METHODS/hufzinc/).
Bandyopadhyay, Deepak; Huan, Jun; Prins, Jan; Snoeyink, Jack; Wang, Wei; Tropsha, Alexander
2009-11-01
Protein function prediction is one of the central problems in computational biology. We present a novel automated protein structure-based function prediction method using libraries of local residue packing patterns that are common to most proteins in a known functional family. Critical to this approach is the representation of a protein structure as a graph where residue vertices (residue name used as a vertex label) are connected by geometrical proximity edges. The approach employs two steps. First, it uses a fast subgraph mining algorithm to find all occurrences of family-specific labeled subgraphs for all well-characterized protein structural and functional families. Second, it queries a new structure for occurrences of a set of motifs characteristic of a known family, using a graph index to speed up Ullmann's subgraph isomorphism algorithm. The confidence of function inference from structure depends on the number of family-specific motifs found in the query structure compared with their distribution in a large non-redundant database of proteins. This method can assign a new structure to a specific functional family in cases where sequence alignments, sequence patterns, structural superposition and active site templates fail to provide accurate annotation.
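The graph representation described above (residue-labeled vertices joined by proximity edges) and a motif query against it can be sketched in a few lines. Everything specific here is an assumption for illustration: the 8.0 distance cutoff, the use of one representative point per residue, the function names and the toy coordinates. The search is a brute-force enumeration, standing in for the indexed subgraph isomorphism algorithm named in the abstract, and is only practical for very small motifs.

```python
from itertools import combinations, permutations
from math import dist


def contact_graph(residues, cutoff=8.0):
    """Build a labeled graph from [(residue_name, (x, y, z)), ...].

    Returns (labels, edges): labels[i] is the residue name of vertex i and
    edges is a set of frozenset({i, j}) for residue pairs within the cutoff.
    """
    labels = [name for name, _ in residues]
    edges = {frozenset((i, j))
             for (i, (_, a)), (j, (_, b)) in combinations(enumerate(residues), 2)
             if dist(a, b) <= cutoff}
    return labels, edges


def find_motif(labels, edges, motif_labels, motif_edges):
    """Return every tuple of vertex indices that matches the labeled motif.

    Position k of each returned tuple is the residue assigned to motif vertex k.
    Brute force over ordered vertex tuples; fine for toy sizes only.
    """
    hits = []
    for cand in permutations(range(len(labels)), len(motif_labels)):
        if any(labels[r] != m for r, m in zip(cand, motif_labels)):
            continue
        if all(frozenset((cand[u], cand[v])) in edges for u, v in motif_edges):
            hits.append(cand)
    return hits


# Hypothetical coordinates: three residues packed together and one far away.
toy = [("HIS", (0.0, 0.0, 0.0)), ("ASP", (5.0, 0.0, 0.0)),
       ("SER", (0.0, 5.0, 0.0)), ("GLY", (30.0, 0.0, 0.0))]
labels, edges = contact_graph(toy)

# Motif: SER in contact with HIS, and HIS in contact with ASP.
hits = find_motif(labels, edges, ["SER", "HIS", "ASP"], [(0, 1), (1, 2)])
```

Counting how many family-specific motifs of this kind occur in a query structure, relative to their frequency in a background set, is the kind of evidence the abstract describes for assigning a functional family.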
SSMART: Sequence-structure motif identification for RNA-binding proteins.
Munteanu, Alina; Mukherjee, Neelanjan; Ohler, Uwe
2018-06-11
RNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in eukaryotic genomes, and each recognizes its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized. We developed SSMART, an RNA motif finder that simultaneously models the primary sequence and the structural properties of the RNA target sites. The sequence-structure motifs are represented as consensus strings over a degenerate alphabet, extending the IUPAC codes for nucleotides to account for secondary structure preferences. Evaluation on synthetic data showed that SSMART is able to recover both sequence and structure motifs implanted into 3'UTR-like sequences, for various degrees of structured/unstructured binding sites. In addition, we successfully used SSMART on high-throughput in vivo and in vitro data, showing that we not only recover the known sequence motif, but also gain insight into the structural preferences of the RBP. Availability: SSMART is freely available at https://ohlerlab.mdc-berlin.de/software/SSMART_137/. Supplementary data are available at Bioinformatics online.
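A consensus string over a degenerate alphabet that also carries a structure preference can be illustrated with a small matcher. The IUPAC nucleotide codes below are the standard ones (with U for RNA); the structure codes (P for paired, U for unpaired, N for either), the pairing of one structure code with each consensus position, and the toy RNA are hypothetical conventions invented for this sketch and are not SSMART's actual alphabet.

```python
# Standard IUPAC degenerate nucleotide codes, written for RNA.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "U": "U",
    "R": "AG", "Y": "CU", "S": "CG", "W": "AU", "K": "GU", "M": "AC",
    "B": "CGU", "D": "AGU", "H": "ACU", "V": "ACG", "N": "ACGU",
}
# Hypothetical structure codes: P = paired, U = unpaired, N = no preference.
STRUCT = {"P": "P", "U": "U", "N": "PU"}


def matches_at(consensus, seq, struct, start):
    """True if every (nucleotide code, structure code) pair matches at `start`."""
    return all(seq[start + k] in IUPAC[nt] and struct[start + k] in STRUCT[st]
               for k, (nt, st) in enumerate(consensus))


def scan_consensus(consensus, seq, struct):
    """Return all start positions where the sequence-structure consensus matches."""
    if len(seq) != len(struct):
        raise ValueError("sequence and structure annotation must be the same length")
    return [i for i in range(len(seq) - len(consensus) + 1)
            if matches_at(consensus, seq, struct, i)]


# Toy RNA and per-base pairing annotation (made up).
rna = "GGUGCAUGUACC"
pairing = "PPUUUUPPUUPP"

# "UGYA" where the first three positions must be unpaired.
motif = [("U", "U"), ("G", "U"), ("Y", "U"), ("A", "N")]
sequence_only = [(nt, "N") for nt, _ in motif]

structured_hits = scan_consensus(motif, rna, pairing)
sequence_hits = scan_consensus(sequence_only, rna, pairing)
```

The sequence-only consensus matches at two positions, while the structure-aware version keeps only the site whose first three bases are unpaired, which is the kind of distinction a joint sequence-structure motif is meant to capture.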
Tran, Tuan; Disney, Matthew D.
2012-01-01
RNA is an important therapeutic target but information about RNA-ligand interactions is limited. Here we report a screening method that probes over 3,000,000 combinations of RNA motif-small molecule interactions to identify the privileged RNA structures and chemical spaces that interact. Specifically, a small molecule library biased for binding RNA was probed for binding to over 70,000 unique RNA motifs in a high throughput solution-based screen. The RNA motifs that specifically bind each small molecule were identified by microarray-based selection. In this library-versus-library or multidimensional combinatorial screening approach, hairpin loops (amongst a variety of RNA motifs) were the preferred RNA motif space that binds small molecules. Furthermore, it was shown that indole, 2-phenyl indole, 2-phenyl benzimidazole, and pyridinium chemotypes allow for specific recognition of RNA motifs. Since targeting RNA with small molecules is an extremely challenging area, these studies provide new information on RNA-ligand interactions that has many potential uses. PMID:23047683
Composition-dependent stability of the medium-range order responsible for metallic glass formation
Zhang, Feng; Ji, Min; Fang, Xiao-Wei; ...
2014-09-18
The competition between the characteristic medium-range order corresponding to amorphous alloys and that in ordered crystalline phases is central to phase selection and morphology evolution under various processing conditions. We examine the stability of a model glass system, Cu–Zr, by comparing the energetics of various medium-range structural motifs over a wide range of compositions using first-principles calculations. Furthermore, we focus specifically on motifs that represent possible building blocks for competing glassy and crystalline phases, and we employ a genetic algorithm to efficiently identify the energetically favored decorations of each motif for specific compositions. These results show that a Bergman-type motif with crystallization-resisting icosahedral symmetry is energetically most favorable in the composition range 0.63 < xCu < 0.68, and is the underlying motif for one of the three optimal glass-forming ranges observed experimentally for this binary system (Li et al., 2008). This work establishes an energy-based methodology to evaluate specific medium-range structural motifs which compete with stable crystalline nuclei in deeply undercooled liquids.
Structural basis for genome wide recognition of 5-bp GC motifs by SMAD transcription factors.
Martin-Malpartida, Pau; Batet, Marta; Kaczmarska, Zuzanna; Freier, Regina; Gomes, Tiago; Aragón, Eric; Zou, Yilong; Wang, Qiong; Xi, Qiaoran; Ruiz, Lidia; Vea, Angela; Márquez, José A; Massagué, Joan; Macias, Maria J
2017-12-12
Smad transcription factors activated by TGF-β or by BMP receptors form trimeric complexes with Smad4 to target specific genes for cell fate regulation. The CAGAC motif has been considered as the main binding element for Smad2/3/4, whereas Smad1/5/8 have been thought to preferentially bind GC-rich elements. However, chromatin immunoprecipitation analysis in embryonic stem cells showed extensive binding of Smad2/3/4 to GC-rich cis-regulatory elements. Here, we present the structural basis for specific binding of Smad3 and Smad4 to GC-rich motifs in the goosecoid promoter, a nodal-regulated differentiation gene. The structures revealed a 5-bp consensus sequence GGC(GC)|(CG) as the binding site for both TGF-β and BMP-activated Smads and for Smad4. These 5GC motifs are highly represented as clusters in Smad-bound regions genome-wide. Our results provide a basis for understanding the functional adaptability of Smads in different cellular contexts, and their dependence on lineage-determining transcription factors to target specific genes in TGF-β and BMP pathways.
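The 5-bp consensus reported above lends itself to a simple sequence scan. The following is a minimal sketch, assuming that GGC(GC)|(CG) is read as "GGCGC or GGCCG"; the function names are hypothetical and are not taken from the paper.

```python
import re

# Assumed reading of the consensus GGC(GC)|(CG): GGCGC or GGCCG.
# The lookahead lets overlapping sites be reported as well.
FIVE_GC = re.compile(r"(?=(GGC(?:GC|CG)))")
MOTIF_LEN = 5

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_5gc_sites(seq):
    """Return sorted (start, strand, site) tuples for both strands.

    `start` is always the 0-based offset on the forward strand.
    """
    seq = seq.upper()
    hits = [(m.start(), "+", m.group(1)) for m in FIVE_GC.finditer(seq)]
    rc = reverse_complement(seq)
    for m in FIVE_GC.finditer(rc):
        # Map the offset on the reverse strand back to forward coordinates.
        start = len(seq) - (m.start() + MOTIF_LEN)
        hits.append((start, "-", m.group(1)))
    return sorted(hits)
```

Counting such hits in windows along a genome would be one way to look for the clusters mentioned in the abstract, although the authors' own procedure is not described here.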
RNA Bricks—a database of RNA 3D motifs and their interactions
Chojnowski, Grzegorz; Waleń, Tomasz; Bujnicki, Janusz M.
2014-01-01
The RNA Bricks database (http://iimcb.genesilico.pl/rnabricks), stores information about recurrent RNA 3D motifs and their interactions, found in experimentally determined RNA structures and in RNA–protein complexes. In contrast to other similar tools (RNA 3D Motif Atlas, RNA Frabase, Rloom) RNA motifs, i.e. ‘RNA bricks’ are presented in the molecular environment, in which they were determined, including RNA, protein, metal ions, water molecules and ligands. All nucleotide residues in RNA bricks are annotated with structural quality scores that describe real-space correlation coefficients with the electron density data (if available), backbone geometry and possible steric conflicts, which can be used to identify poorly modeled residues. The database is also equipped with an algorithm for 3D motif search and comparison. The algorithm compares spatial positions of backbone atoms of the user-provided query structure and of stored RNA motifs, without relying on sequence or secondary structure information. This enables the identification of local structural similarities among evolutionarily related and unrelated RNA molecules. Besides, the search utility enables searching ‘RNA bricks’ according to sequence similarity, and makes it possible to identify motifs with modified ribonucleotide residues at specific positions. PMID:24220091
Helix-packing motifs in membrane proteins.
Walters, R F S; DeGrado, W F
2006-09-12
The fold of a helical membrane protein is largely determined by interactions between membrane-imbedded helices. To elucidate recurring helix-helix interaction motifs, we dissected the crystallographic structures of membrane proteins into a library of interacting helical pairs. The pairs were clustered according to their three-dimensional similarity (rmsd ≤ 1.5 Å), allowing 90% of the library to be assigned to clusters consisting of at least five members. Surprisingly, three quarters of the helical pairs belong to one of five tightly clustered motifs whose structural features can be understood in terms of simple principles of helix-helix packing. Thus, the universe of common transmembrane helix-pairing motifs is relatively simple. The largest cluster, which comprises 29% of the library members, consists of an antiparallel motif with left-handed packing angles, and it is frequently stabilized by packing of small side chains occurring every seven residues in the sequence. Right-handed parallel and antiparallel structures show a similar tendency to segregate small residues to the helix-helix interface but spaced at four-residue intervals. Position-specific sequence propensities were derived for the most populated motifs. These structural and sequential motifs should be quite useful for the design and structural prediction of membrane proteins.
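Clustering structures under an rmsd cutoff, as described above, can be illustrated with a small sketch. It assumes that the coordinate sets are already optimally superposed and have matching atom order, and it uses a greedy "leader" scheme chosen only for brevity; the abstract does not say which clustering algorithm the authors used, and the names below are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two pre-superposed coordinate lists."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


def leader_cluster(structures, cutoff=1.5):
    """Greedy clustering: each structure joins the first cluster whose
    leader (its first member) lies within `cutoff`, else starts a new one.

    Returns a list of clusters, each a list of indices into `structures`.
    """
    clusters = []
    for index, structure in enumerate(structures):
        for members in clusters:
            if rmsd(structures[members[0]], structure) <= cutoff:
                members.append(index)
                break
        else:
            clusters.append([index])
    return clusters
```

The result of a leader scheme depends on input order, so a hierarchical method would be a reasonable alternative when reproducibility matters.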
Takeda, Ryuta; Petrov, Anton I.; Leontis, Neocles B.; Ding, Biao
2011-01-01
Cell-to-cell trafficking of RNA is an emerging biological principle that integrates systemic gene regulation, viral infection, antiviral response, and cell-to-cell communication. A key mechanistic question is how an RNA is specifically selected for trafficking from one type of cell into another type. Here, we report the identification of an RNA motif in Potato spindle tuber viroid (PSTVd) required for trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana leaves. This motif, called loop 6, has the sequence 5′-CGA-3′...5′-GAC-3′ flanked on both sides by cis Watson-Crick G/C and G/U wobble base pairs. We present a three-dimensional (3D) structural model of loop 6 that specifies all non-Watson-Crick base pair interactions, derived by isostericity-based sequence comparisons with 3D RNA motifs from the RNA x-ray crystal structure database. The model is supported by available chemical modification patterns, natural sequence conservation/variations in PSTVd isolates and related species, and functional characterization of all possible mutants for each of the loop 6 base pairs. Our findings and approaches have broad implications for studying the 3D RNA structural motifs mediating trafficking of diverse RNA species across specific cellular boundaries and for studying the structure-function relationships of RNA motifs in other biological processes. PMID:21258006
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Yong; Kovach, Amanda; Suino-Powell, Kelly
2008-07-23
The functional interaction between the peroxisome proliferator-activated receptor γ (PPARγ) and its coactivator PGC-1α is crucial for the normal physiology of PPARγ and its pharmacological response to antidiabetic treatment with rosiglitazone. Here we report the crystal structure of the PPARγ ligand-binding domain bound to rosiglitazone and to a large PGC-1α fragment that contains two LXXLL-related motifs. The structure reveals critical contacts mediated through the first LXXLL motif of PGC-1α and the PPARγ coactivator binding site. Through a combination of biochemical and structural studies, we demonstrate that the first LXXLL motif is the most potent among all nuclear receptor coactivator motifs tested, and only this motif of the two LXXLL-related motifs in PGC-1α is capable of binding to PPARγ. Our studies reveal that the strong interaction of PGC-1α and PPARγ is mediated through both hydrophobic and specific polar interactions. Mutations within the context of the full-length PGC-1α indicate that the first PGC-1α motif is necessary and sufficient for PGC-1α to coactivate PPARγ in the presence or absence of rosiglitazone. These results provide a molecular basis for specific recruitment and functional interplay between PPARγ and PGC-1α in glucose homeostasis and adipocyte differentiation.
Efficacy of function specific 3D-motifs in enzyme classification according to their EC-numbers.
Rahimi, Amir; Madadkar-Sobhani, Armin; Touserkani, Rouzbeh; Goliaei, Bahram
2013-11-07
Due to the increasing number of protein structures with unknown function originating from structural genomics projects, protein function prediction has become an important subject in bioinformatics. Among diverse function prediction methods, searching unknown protein structures for known 3D-motifs that are associated with functional elements is one of the most biologically meaningful. Homologous enzymes inherit such motifs in their active sites from common ancestors. However, slight differences in the properties of these motifs result in variation in the reactions and substrates of the enzymes. In this study, we examined the possibility of discriminating highly related active site patterns according to their EC-numbers by 3D-motifs. For each EC-number, the spatial arrangement of an active site that has the minimum average distance to other active sites with the same function was selected as a representative 3D-motif. In order to characterize the motifs, various points in active site elements were tested. The results demonstrated the possibility of predicting the full EC-number of enzymes by 3D-motifs. However, the discriminating power of 3D-motifs varies among different enzyme families and depends on selecting the appropriate points and features. © 2013 Elsevier Ltd. All rights reserved.
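Choosing the active site with the minimum average distance to the others amounts to picking a medoid. A minimal sketch follows, assuming a precomputed symmetric distance matrix between the active sites of one EC-number; the distance measure itself is not specified in the abstract, and the function name is hypothetical.

```python
def medoid_index(distances):
    """Return the index of the item with the smallest mean distance to all others.

    `distances` is a square matrix (list of lists) of pairwise distances.
    Ties are resolved in favour of the lowest index.
    """
    n = len(distances)
    if n == 0:
        raise ValueError("distance matrix is empty")
    if any(len(row) != n for row in distances):
        raise ValueError("distance matrix must be square")
    if n == 1:
        return 0

    def mean_distance(i):
        # Average over the other items only, skipping the zero self-distance.
        return sum(distances[i][j] for j in range(n) if j != i) / (n - 1)

    return min(range(n), key=mean_distance)
```

For example, with three sites whose pairwise distances are 1, 4 and 2, the middle site has the lowest mean distance and would serve as the representative.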
Alenton, Rod Russel R; Koiwai, Keiichiro; Miyaguchi, Kohei; Kondo, Hidehiro; Hirono, Ikuo
2017-04-04
C-type lectins (CTLs) are calcium-dependent carbohydrate-binding proteins known to assist the innate immune system as pattern recognition receptors (PRRs). The binding specificity of CTLs lies in the motif of their carbohydrate recognition domain (CRD); the tripeptide motifs EPN and QPD bind to mannose and galactose, respectively. However, variants of these motifs were discovered, including a QAP sequence reported in shrimp and believed to have the same carbohydrate specificity as QPD. Here, we characterized a novel C-type lectin (MjGCTL) possessing a CRD with a QAP motif. The recombinant MjGCTL has a calcium-dependent agglutinating capability against both Gram-negative and Gram-positive bacteria, and its sugar specificity did not involve either mannose or galactose. In an encapsulation assay, agarose beads coated with rMjGCTL were immediately encapsulated from 0 h followed by melanization at 4 h post-incubation with hemocytes. These results confirm that MjGCTL functions as a classical CTL. The structure of the QAP motif and the carbohydrate specificity of rMjGCTL were found to differ from those of both EPN and QPD, suggesting that QAP is a new motif. Furthermore, MjGCTL acts as a PRR binding to hemocytes to activate their adherent state and initiate encapsulation. PMID:28374848
Moriuchi, Hiromi; Unno, Hideaki; Goda, Shuichiro; Tateno, Hiroaki; Hirabayashi, Jun; Hatakeyama, Tomomitsu
2015-07-01
CEL-I is a galactose/N-acetylgalactosamine-specific C-type lectin isolated from the sea cucumber Cucumaria echinata. Its carbohydrate-binding site contains a QPD (Gln-Pro-Asp) motif, which is generally recognized as the galactose specificity-determining motif in the C-type lectins. In our previous study, replacement of the QPD motif by an EPN (Glu-Pro-Asn) motif led to a weak binding affinity for mannose. Therefore, we examined the effects of an additional mutation in the carbohydrate-binding site on the specificity of the lectin. Trp105 of EPN-CEL-I was replaced by a histidine residue using site-directed mutagenesis, and the binding affinity of the resulting mutant, EPNH-CEL-I, was examined by sugar-polyamidoamine dendrimer assay, isothermal titration calorimetry, and glycoconjugate microarray analysis. Tertiary structure of the EPNH-CEL-I/mannose complex was determined by X-ray crystallographic analysis. Sugar-polyamidoamine dendrimer assay and glycoconjugate microarray analysis revealed a drastic change in the specificity of EPNH-CEL-I from galactose/N-acetylgalactosamine to mannose. The association constant of EPNH-CEL-I for mannose was determined to be 3.17 × 10³ M⁻¹ at 25°C. Mannose specificity of EPNH-CEL-I was achieved by stabilization of the binding of mannose in a correct orientation, in which the EPN motif can form proper hydrogen bonds with 3- and 4-hydroxy groups of the bound mannose. Specificity of CEL-I can be engineered by mutating a limited number of amino acid residues in addition to the QPD/EPN motifs. Versatility of the C-type carbohydrate-recognition domain structure in the recognition of various carbohydrate chains could become a promising platform to develop novel molecular recognition proteins. Copyright © 2015 Elsevier B.V. All rights reserved.
Chiu, Yi-Yuan; Lin, Chun-Yu; Lin, Chih-Ta; Hsu, Kai-Cheng; Chang, Li-Zen; Yang, Jinn-Moon
2012-01-01
Discovering a compound that inhibits multiple proteins (i.e. polypharmacological targets) is a new paradigm for complex diseases (e.g. cancers and diabetes). In general, polypharmacological proteins often share similar local binding environments and motifs. With the exponential growth in the number of protein structures, finding similar structural binding motifs (pharma-motifs) is an urgent task for drug discovery (e.g. side effects and new uses for old drugs) and protein functions. We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against a protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs, which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% of interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% of the polypharmacological targets of each protein-ligand complex are annotated with the same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play key roles in protein functions. SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservation of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery. PMID:23281852
Sites of instability in the human TCF3 (E2A) gene adopt G-quadruplex DNA structures in vitro
Williams, Jonathan D.; Fleetwood, Sara; Berroyer, Alexandra; Kim, Nayun; Larson, Erik D.
2015-01-01
The formation of highly stable four-stranded DNA, called G-quadruplex (G4), promotes site-specific genome instability. G4 DNA structures fold from repetitive guanine sequences, and increasing experimental evidence connects G4 sequence motifs with specific gene rearrangements. The human transcription factor 3 (TCF3) gene (also termed E2A) is subject to genetic instability associated with severe disease, most notably a common translocation event t(1;19) associated with acute lymphoblastic leukemia. The sites of instability in TCF3 are not randomly distributed, but focused to certain sequences. We asked if G4 DNA formation could explain why TCF3 is prone to recombination and mutagenesis. Here we demonstrate that sequences surrounding the major t(1;19) break site and a region associated with copy number variations both contain G4 sequence motifs. The motifs identified readily adopt G4 DNA structures that are stable enough to interfere with DNA synthesis in physiological salt conditions in vitro. When introduced into the yeast genome, TCF3 G4 motifs promoted gross chromosomal rearrangements in a transcription-dependent manner. Our results provide a molecular rationale for the site-specific instability of human TCF3, suggesting that G4 DNA structures contribute to oncogenic DNA breaks and recombination. PMID:26029241
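As an illustration of what a "G4 sequence motif" search can look like, here is a minimal sketch using a widely used heuristic pattern: four runs of at least three guanines separated by loops of one to seven nucleotides. This pattern is an assumption brought in for illustration; the abstract does not state which definition the authors applied, and the function name is hypothetical.

```python
import re

# Heuristic G4 motif (an assumption, not taken from the abstract):
# G{3,} N{1,7} G{3,} N{1,7} G{3,} N{1,7} G{3,}
G4_PATTERN = re.compile(
    r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}"
)


def find_g4_motifs(seq):
    """Return (start, end) spans of non-overlapping G4 motif matches.

    Only the G-rich strand is scanned; to cover the opposite strand,
    call the function again on the reverse complement.
    """
    return [m.span() for m in G4_PATTERN.finditer(seq.upper())]
```

A match only indicates sequence potential; whether a structure actually forms has to be tested experimentally, as the study above does in vitro.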
SMARTIV: combined sequence and structure de-novo motif discovery for in-vivo RNA binding data.
Polishchuk, Maya; Paz, Inbal; Yakhini, Zohar; Mandel-Gutfreund, Yael
2018-05-25
Gene expression regulation is highly dependent on binding of RNA-binding proteins (RBPs) to their RNA targets. Growing evidence supports the notion that both RNA primary sequence and its local secondary structure play a role in specific Protein-RNA recognition and binding. Despite the great advance in high-throughput experimental methods for identifying sequence targets of RBPs, predicting the specific sequence and structure binding preferences of RBPs remains a major challenge. We present a novel webserver, SMARTIV, designed for discovering and visualizing combined RNA sequence and structure motifs from high-throughput RNA-binding data, generated from in-vivo experiments. The uniqueness of SMARTIV is that it predicts motifs from enriched k-mers that combine information from ranked RNA sequences and their predicted secondary structure, obtained using various folding methods. Consequently, SMARTIV generates Position Weight Matrices (PWMs) in a combined sequence and structure alphabet with assigned P-values. SMARTIV concisely represents the sequence and structure motif content as a single graphical logo, which is informative and easy for visual perception. SMARTIV was examined extensively on a variety of high-throughput binding experiments for RBPs from different families, generated from different technologies, showing consistent and accurate results. Finally, SMARTIV is a user-friendly webserver, highly efficient in run-time and freely accessible via http://smartiv.technion.ac.il/.
Reynolds, Kimberly A
2015-01-06
In this issue of Structure, Lanouette and colleagues use a combination of computation and experiment to define a specificity motif for the lysine methyltransferase SMYD2. Using this motif, they predict and experimentally verify four new SMYD2 substrates. Copyright © 2015 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Fernandez-Chamorro, Javier; Lozano, Gloria; Garcia-Martin, Juan Antonio; Ramajo, Jorge; Dotu, Ivan; Clote, Peter; Martinez-Salas, Encarnacion
2016-04-01
The function of Internal Ribosome Entry Site (IRES) elements is intimately linked to their RNA structure. Viral IRES elements are organized in modular domains consisting of one or more stem-loops that harbor conserved RNA motifs critical for internal initiation of translation. A conserved motif is the pyrimidine-tract located upstream of the functional initiation codon in type I and II picornavirus IRES. By computationally designing synthetic RNAs to fold into a structure that sequesters the polypyrimidine tract in a hairpin, we establish a correlation between predicted inaccessibility of the pyrimidine tract and IRES activity, as determined in both in vitro and in vivo systems. Our data supports the hypothesis that structural sequestration of the pyrimidine-tract within a stable hairpin inactivates IRES activity, since the stronger the stability of the hairpin the higher the inhibition of protein synthesis. Destabilization of the stem-loop immediately upstream of the pyrimidine-tract also decreases IRES activity. Our work introduces a hybrid computational/experimental method to determine the importance of structural motifs for biological function. Specifically, we show the feasibility of using the software RNAiFold to design synthetic RNAs with particular sequence and structural motifs that permit subsequent experimental determination of the importance of such motifs for biological function.
2011-01-01
Background Transcription factors (TFs) play a central role in regulating gene expression by interacting with cis-regulatory DNA elements associated with their target genes. Recent surveys have examined the DNA binding specificities of most Saccharomyces cerevisiae TFs, but a comprehensive evaluation of their data has been lacking. Results We analyzed in vitro and in vivo TF-DNA binding data reported in previous large-scale studies to generate a comprehensive, curated resource of DNA binding specificity data for all characterized S. cerevisiae TFs. Our collection comprises DNA binding site motifs and comprehensive in vitro DNA binding specificity data for all possible 8-bp sequences. Investigation of the DNA binding specificities within the basic leucine zipper (bZIP) and VHT1 regulator (VHR) TF families revealed unexpected plasticity in TF-DNA recognition: intriguingly, the VHR TFs, newly characterized by protein binding microarrays in this study, recognize bZIP-like DNA motifs, while the bZIP TF Hac1 recognizes a motif highly similar to the canonical E-box motif of basic helix-loop-helix (bHLH) TFs. We identified several TFs with distinct primary and secondary motifs, which might be associated with different regulatory functions. Finally, integrated analysis of in vivo TF binding data with protein binding microarray data lends further support for indirect DNA binding in vivo by sequence-specific TFs. Conclusions The comprehensive data in this curated collection allow for more accurate analyses of regulatory TF-DNA interactions, in-depth structural studies of TF-DNA specificity determinants, and future experimental investigations of the TFs' predicted target genes and regulatory roles. PMID:22189060
Will, Katrin; Warnecke, Gabriele; Wiesmüller, Lisa; Deppert, Wolfgang
1998-01-01
Mutant, but not wild-type p53 binds with high affinity to a variety of MAR-DNA elements (MARs), suggesting that MAR-binding of mutant p53 relates to the dominant-oncogenic activities proposed for mutant p53. MARs recognized by mutant p53 share AT richness and contain variations of an AATATATTT “DNA-unwinding motif,” which enhances the structural dynamics of chromatin and promotes regional DNA base-unpairing. Mutant p53 specifically interacted with MAR-derived oligonucleotides carrying such unwinding motifs, catalyzing DNA strand separation when this motif was located within a structurally labile sequence environment. Addition of GC-clamps to the respective MAR-oligonucleotides or introducing mutations into the unwinding motif strongly reduced DNA strand separation, but supported the formation of tight complexes between mutant p53 and such oligonucleotides. We conclude that the specific interaction of mutant p53 with regions of MAR-DNA with a high potential for base-unpairing provides the basis for the high-affinity binding of mutant p53 to MAR-DNA. PMID:9811860
CircularLogo: A lightweight web application to visualize intra-motif dependencies.
Ye, Zhenqing; Ma, Tao; Kalmbach, Michael T; Dasari, Surendra; Kocher, Jean-Pierre A; Wang, Liguo
2017-05-22
The sequence logo has been widely used to represent DNA or RNA motifs for more than three decades. Despite its intelligibility and intuitiveness, the traditional sequence logo is unable to display the intra-motif dependencies and therefore is insufficient to fully characterize nucleotide motifs. Many methods have been developed to quantify the intra-motif dependencies, but fewer tools are available for visualization. We developed CircularLogo, a web-based interactive application, which is able to not only visualize the position-specific nucleotide consensus and diversity but also display the intra-motif dependencies. Applying CircularLogo to HNF6 binding sites and tRNA sequences demonstrated its ability to show intra-motif dependencies and intuitively reveal biomolecular structure. CircularLogo is implemented in JavaScript and Python based on the Django web framework. The program's source code and user's manual are freely available at http://circularlogo.sourceforge.net . CircularLogo web server can be accessed from http://bioinformaticstools.mayo.edu/circularlogo/index.html . CircularLogo is an innovative web application that is specifically designed to visualize and interactively explore intra-motif dependencies.
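For context, the traditional sequence logo that the abstract contrasts itself with scales each position by its information content, which treats positions independently. A minimal sketch of that per-position calculation follows, assuming equal-length aligned DNA sites and omitting the small-sample correction; it does not reproduce CircularLogo's dependency display, and the function name is hypothetical.

```python
import math
from collections import Counter


def information_content(aligned_sites, alphabet="ACGT"):
    """Return the information content (bits) at each alignment position.

    IC = log2(alphabet size) - Shannon entropy of the observed letters.
    No small-sample correction is applied in this sketch.
    """
    if not aligned_sites:
        raise ValueError("at least one site is required")
    length = len(aligned_sites[0])
    if any(len(site) != length for site in aligned_sites):
        raise ValueError("all sites must have the same length")

    max_bits = math.log2(len(alphabet))
    total = len(aligned_sites)
    bits = []
    for position in range(length):
        counts = Counter(site[position] for site in aligned_sites)
        entropy = -sum(
            (count / total) * math.log2(count / total)
            for count in counts.values()
        )
        bits.append(max_bits - entropy)
    return bits
```

Because each column is scored on its own, two positions that always co-vary look no different from two independent ones, which is the gap that dependency-aware visualizations aim to fill.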
A flexible motif search technique based on generalized profiles.
Bucher, P; Karplus, K; Moeri, N; Hofmann, K
1996-03-01
A flexible motif search technique is presented which has two major components: (1) a generalized profile syntax serving as a motif definition language; and (2) a motif search method specifically adapted to the problem of finding multiple instances of a motif in the same sequence. The new profile structure, which is the core of the generalized profile syntax, combines the functions of a variety of motif descriptors implemented in other methods, including regular expression-like patterns, weight matrices, previously used profiles, and certain types of hidden Markov models (HMMs). The relationship between generalized profiles and other biomolecular motif descriptors is analyzed in detail, with special attention to HMMs. Generalized profiles are shown to be equivalent to a particular class of HMMs, and conversion procedures in both directions are given. The conversion procedures provide an interpretation for local alignment in the framework of stochastic models, allowing for clear, simple significance tests. A mathematical statement of the motif search problem defines the new method exactly without linking it to a specific algorithmic solution. Part of the definition includes a new definition of disjointness of alignments.
SARNAclust: Semi-automatic detection of RNA protein binding motifs from immunoprecipitation data
Dotu, Ivan; Adamson, Scott I.; Coleman, Benjamin; Fournier, Cyril; Ricart-Altimiras, Emma; Eyras, Eduardo
2018-01-01
RNA-protein binding is critical to gene regulation, controlling fundamental processes including splicing, translation, localization and stability, and aberrant RNA-protein interactions are known to play a role in a wide variety of diseases. However, molecular understanding of RNA-protein interactions remains limited; in particular, identification of RNA motifs that bind proteins has long been challenging, especially when such motifs depend on both sequence and structure. Moreover, although RNA binding proteins (RBPs) often contain more than one binding domain, algorithms capable of identifying more than one binding motif simultaneously have not been developed. In this paper we present a novel pipeline to determine binding peaks in crosslinking immunoprecipitation (CLIP) data, to discover multiple possible RNA sequence/structure motifs among them, and to experimentally validate such motifs. At the core is a new semi-automatic algorithm SARNAclust, the first unsupervised method to identify and deconvolve multiple sequence/structure motifs simultaneously. SARNAclust computes similarity between sequence/structure objects using a graph kernel, providing the ability to isolate the impact of specific features through the bulge graph formalism. Application of SARNAclust to synthetic data shows its capability of clustering 5 motifs at once with a V-measure value of over 0.95, while GraphClust achieves only a V-measure of 0.083 and RNAcontext cannot detect any of the motifs. When applied to existing eCLIP sets, SARNAclust finds known motifs for SLBP and HNRNPC and novel motifs for several other RBPs such as AGGF1, AKAP8L and ILF3. We demonstrate an experimental validation protocol, a targeted Bind-n-Seq-like high-throughput sequencing approach that relies on RNA inverse folding for oligo pool design, that can validate the components within the SLBP motif. Finally, we use this protocol to experimentally interrogate the SARNAclust motif predictions for protein ILF3. 
Our results support a newly identified partially double-stranded UUUUUGAGA motif similar to that known for the splicing factor HNRNPC. PMID:29596423
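The V-measure values quoted in this abstract score a predicted clustering against known motif labels. As an illustration only, the sketch below implements the standard V-measure definition (the harmonic mean of homogeneity and completeness, both derived from conditional entropies). It is not SARNAclust code; the function names and the toy label lists are assumptions made for the example.

```python
from collections import Counter
from math import log


def _entropy(labels):
    """Shannon entropy (natural log) of a list of labels."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def _conditional_entropy(target, given):
    """H(target | given) estimated from paired label lists."""
    n = len(target)
    joint = Counter(zip(target, given))
    given_counts = Counter(given)
    return -sum((c / n) * log(c / given_counts[g]) for (_, g), c in joint.items())


def v_measure(true_classes, clusters):
    """Harmonic mean of homogeneity and completeness (equal weighting)."""
    h_c = _entropy(true_classes)
    h_k = _entropy(clusters)
    homogeneity = 1.0 if h_c == 0 else 1.0 - _conditional_entropy(true_classes, clusters) / h_c
    completeness = 1.0 if h_k == 0 else 1.0 - _conditional_entropy(clusters, true_classes) / h_k
    if homogeneity + completeness == 0:
        return 0.0
    return 2 * homogeneity * completeness / (homogeneity + completeness)


# Toy example (invented labels): a clustering that matches the true motifs exactly,
# and one that is unrelated to them.
perfect = v_measure([0, 0, 1, 1], [5, 5, 9, 9])
unrelated = v_measure([0, 0, 1, 1], [0, 1, 0, 1])
```

A perfect clustering scores 1 regardless of how the cluster IDs are named, while a clustering that carries no information about the true classes scores close to 0.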
Sequence, Structure, and Context Preferences of Human RNA Binding Proteins.
Dominguez, Daniel; Freese, Peter; Alexis, Maria S; Su, Amanda; Hochman, Myles; Palden, Tsultrim; Bazile, Cassandra; Lambert, Nicole J; Van Nostrand, Eric L; Pratt, Gabriel A; Yeo, Gene W; Graveley, Brenton R; Burge, Christopher B
2018-06-07
RNA binding proteins (RBPs) orchestrate the production, processing, and function of mRNAs. Here, we present the affinity landscapes of 78 human RBPs using an unbiased assay that determines the sequence, structure, and context preferences of these proteins in vitro by deep sequencing of bound RNAs. These data enable construction of "RNA maps" of RBP activity without requiring crosslinking-based assays. We found an unexpectedly low diversity of RNA motifs, implying frequent convergence of binding specificity toward a relatively small set of RNA motifs, many with low compositional complexity. Offsetting this trend, however, we observed extensive preferences for contextual features distinct from short linear RNA motifs, including spaced "bipartite" motifs, biased flanking nucleotide composition, and bias away from or toward RNA structure. Our results emphasize the importance of contextual features in RNA recognition, which likely enable targeting of distinct subsets of transcripts by different RBPs that recognize the same linear motif. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
2012-01-01
Background GDSL esterases/lipases are a newly discovered subclass of lipolytic enzymes that are very important and attractive research subjects because of their multifunctional properties, such as broad substrate specificity and regiospecificity. Compared with the current knowledge regarding these enzymes in bacteria, our understanding of the plant GDSL enzymes is very limited, although the GDSL gene family includes numerous members in many fully sequenced plant genomes. Only two genes from a large rice GDSL esterase/lipase gene family were previously characterised, and the majority of the members remain unknown. In the present study, we describe the rice OsGELP (Oryza sativa GDSL esterase/lipase protein) gene family at the genomic and proteomic levels, and use this knowledge to provide insights into the multifunctionality of the rice OsGELP enzymes. Results In this study, an extensive bioinformatics analysis identified 114 genes in the rice OsGELP gene family. A complete overview of this family in rice is presented, including the chromosome locations, gene structures, phylogeny, and protein motifs. Among the OsGELPs and the plant GDSL esterase/lipase proteins of known functions, 41 motifs were found that represent the core secondary structure elements or appear specifically in different phylogenetic subclades. The specification and distribution of the identified putative conserved clade-common and clade-specific peptide motifs, and their location on the predicted protein three-dimensional structure, may signify their functional roles. Potentially important regions for substrate specificity are highlighted, in accordance with the protein three-dimensional model and the location of the phylogeny-specific conserved motifs. The differential expression of some representative genes was confirmed by quantitative real-time PCR. The phylogenetic analysis, together with protein motif architectures and the expression profiling, was used to predict the possible biological functions of the rice OsGELP genes. Conclusions Our current genomic analysis, for the first time, presents fundamental information on the organization of the rice OsGELP gene family. By combining the genomic, phylogenetic, microarray expression, protein motif distribution, and protein structure analyses, we were able to create a supported basis for the functional prediction of many members of the rice GDSL esterase/lipase family. The present study provides a platform for the selection of candidate genes for further detailed functional study. PMID:22793791
Common fold in helix–hairpin–helix proteins
Shao, Xuguang; Grishin, Nick V.
2000-01-01
Helix–hairpin–helix (HhH) is a widespread motif involved in non-sequence-specific DNA binding. The majority of HhH motifs function as DNA-binding modules; however, some of them are used to mediate protein–protein interactions or have acquired enzymatic activity by incorporating catalytic residues (DNA glycosylases). From sequence and structural analysis of HhH-containing proteins we conclude that most HhH motifs are integrated as a part of a five-helical domain, termed (HhH)2 domain here. It typically consists of two consecutive HhH motifs that are linked by a connector helix and displays pseudo-2-fold symmetry. (HhH)2 domains show clear structural integrity and a conserved hydrophobic core composed of seven residues, one residue from each α-helix and each hairpin, and deserve recognition as a distinct protein fold. In addition to known HhH in the structures of RuvA, RadA, MutY and DNA-polymerases, we have detected new HhH motifs in sterile alpha motif and barrier-to-autointegration factor domains, the α-subunit of Escherichia coli RNA-polymerase, DNA-helicase PcrA and DNA glycosylases. Statistically significant sequence similarity of HhH motifs and pronounced structural conservation argue for homology between (HhH)2 domains in different protein families. Our analysis helps to clarify how non-symmetric protein motifs bind to the double helix of DNA through the formation of a pseudo-2-fold symmetric (HhH)2 functional unit. PMID:10908318
The K-turn motif in riboswitches and other RNA species
Lilley, David M.J.
2014-01-01
The kink turn is a widespread structure motif that introduces a tight bend into the axis of duplex RNA. This generally functions to mediate tertiary interactions, and to serve as a specific protein binding site. K-turns or closely related structures are found in at least seven different riboswitch structures, where they function as key architectural elements that help generate the ligand binding pocket. This article is part of a Special Issue entitled: Riboswitches. PMID:24798078
2011-01-01
Background Mapping protein primary sequences to their three-dimensional folds, referred to as the 'second genetic code', remains an unsolved scientific problem. A crucial part of the problem concerns the geometrical specificity in side chain association leading to densely packed protein cores, a hallmark of correctly folded native structures. Thus, any model of packing within proteins should constitute an indispensable component of protein folding and design. Results In this study an attempt has been made to find, characterize and classify recurring patterns in the packing of side chain atoms within a protein which sustains its native fold. The interaction of side chain atoms within the protein core has been represented as a contact network based on the surface complementarity and overlap between associating side chain surfaces. Some network topologies definitely appear to be preferred and they have been termed 'packing motifs', analogous to super secondary structures in proteins. Study of the distribution of these motifs reveals the ubiquitous presence of typical smaller graphs, which appear to get linked or coalesce to give larger graphs, reminiscent of the nucleation-condensation model in protein folding. One such frequently occurring motif, the three-residue clique, also envisaged as the unit of clustering, was invariably found in regions of dense packing. Finally, topological measures based on surface contact networks appeared to be effective in discriminating sequences native to a specific fold amongst a set of decoys. Conclusions Out of innumerable topological possibilities, only a finite number of specific packing motifs are actually realized in proteins. This small number of motifs could serve as a basis set in the construction of larger networks. Of these, the triplet clique exhibits distinct preference both in terms of composition and geometry. PMID:21605466
The Thiamine-Pyrophosphate-Motif
NASA Technical Reports Server (NTRS)
Ciszak, Ewa; Dominiak, Paulina
2004-01-01
Thiamin pyrophosphate (TPP), a derivative of vitamin B1, is a cofactor for enzymes performing catalysis in pathways of energy production including the well known decarboxylation of α-keto acid dehydrogenases followed by transketolation. TPP-dependent enzymes constitute a structurally and functionally diverse group exhibiting multimeric subunit organization, multiple domains and two chemically equivalent catalytic centers. Annotation of functional TPP-dependent enzymes, therefore, has not been trivial due to low sequence similarity related to this complex organization. Our approach to analysis of structures of known TPP-dependent enzymes reveals for the first time features common to this group, which we have termed the TPP-motif. The TPP-motif consists of specific spatial arrangements of structural elements and their specific contacts to provide for a flip-flop, or alternate site, enzymatic mechanism of action. Analysis of structural elements entrained in the flip-flop action displayed by TPP-dependent enzymes reveals a novel definition of the common amino acid sequences. These sequences allow for annotation of TPP-dependent enzymes, thus advancing functional proteomics. Further details of three-dimensional structures of TPP-dependent enzymes will be discussed.
Structural basis of RNA folding and recognition in an AMP-RNA aptamer complex.
Jiang, F; Kumar, R A; Jones, R A; Patel, D J
1996-07-11
The catalytic properties of RNA and its well known role in gene expression and regulation are the consequence of its unique solution structures. Identification of the structural determinants of ligand recognition by RNA molecules is of fundamental importance for understanding the biological functions of RNA, as well as for the rational design of RNA sequences with specific catalytic activities. Towards this latter end, Szostak et al. used in vitro selection techniques to isolate RNA sequences ('aptamers') containing a high-affinity binding site for ATP, the universal currency of cellular energy, and then used this motif to engineer ribozymes with polynucleotide kinase activity. Here we present the solution structure, as determined by multidimensional NMR spectroscopy and molecular dynamics calculations, of both uniformly and specifically 13C-, 15N-labelled 40-mer RNA containing the ATP-binding motif complexed with AMP. The aptamer adopts an L-shaped structure with two nearly orthogonal stems, each capped proximally by a G·G mismatch pair, binding the AMP ligand at their junction in a GNRA-like motif.
NoFold: RNA structure clustering without folding or alignment.
Middleton, Sarah A; Kim, Junhyong
2014-11-01
Structures that recur across multiple different transcripts, called structure motifs, often perform a similar function; for example, they may recruit a specific RNA-binding protein that then regulates translation, splicing, or subcellular localization. Identifying common motifs between coregulated transcripts may therefore yield significant insight into their binding partners and mechanism of regulation. However, as most methods for clustering structures are based on folding individual sequences or doing many pairwise alignments, this results in a tradeoff between speed and accuracy that can be problematic for large-scale data sets. Here we describe a novel method for comparing and characterizing RNA secondary structures that does not require folding or pairwise alignment of the input sequences. Our method uses the idea of constructing a distance function between two objects by their respective distances to a collection of empirical examples or models, which in our case consists of 1973 Rfam family covariance models. Using this as a basis for measuring structural similarity, we developed a clustering pipeline called NoFold to automatically identify and annotate structure motifs within large sequence data sets. We demonstrate that NoFold can simultaneously identify multiple structure motifs with an average sensitivity of 0.80 and precision of 0.98 and generally exceeds the performance of existing methods. We also perform a cross-validation analysis of the entire set of Rfam families, achieving an average sensitivity of 0.57. We apply NoFold to identify motifs enriched in dendritically localized transcripts and report 213 enriched motifs, including both known and novel structures. © 2014 Middleton and Kim; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Eyal, Zohar; Matzov, Donna; Krupkin, Miri; Wekselman, Itai; Paukner, Susanne; Zimmerman, Ella; Rozenberg, Haim; Bashan, Anat; Yonath, Ada
2015-01-01
The emergence of bacterial multidrug resistance to antibiotics threatens to cause regression to the preantibiotic era. Here we present the crystal structure of the large ribosomal subunit from Staphylococcus aureus, a versatile Gram-positive aggressive pathogen, and its complexes with the known antibiotics linezolid and telithromycin, as well as with a new, highly potent pleuromutilin derivative, BC-3205. These crystal structures shed light on specific structural motifs of the S. aureus ribosome and the binding modes of the aforementioned antibiotics. Moreover, by analyzing the ribosome structure and comparing it with those of nonpathogenic bacterial models, we identified some unique internal and peripheral structural motifs that may be potential candidates for improving known antibiotics and for use in the design of selective antibiotic drugs against S. aureus. PMID:26464510
Characteristic motifs for families of allergenic proteins
Ivanciuc, Ovidiu; Garcia, Tzintzuni; Torres, Miguel; Schein, Catherine H.; Braun, Werner
2008-01-01
The identification of potential allergenic proteins is usually done by scanning a database of allergenic proteins and locating known allergens with a high sequence similarity. However, there is no universally accepted cut-off value for sequence similarity to indicate potential IgE cross-reactivity. Further, overall sequence similarity may be less important than discrete areas of similarity in proteins with homologous structure. To identify such areas, we first classified all allergens and their subdomains in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/) to their closest protein families as defined in Pfam, and identified conserved physicochemical property motifs characteristic of each group of sequences. Allergens populate only a small subset of all known Pfam families, as all allergenic proteins in SDAP could be grouped to only 130 (of 9318 total) Pfams, and 31 families contain more than four allergens. Conserved physicochemical property motifs for the aligned sequences of the most populated Pfam families were identified with the PCPMer program suite and catalogued in the webserver Motif-Mate (http://born.utmb.edu/motifmate/summary.php). We also determined specific motifs for allergenic members of a family that could distinguish them from non-allergenic ones. These allergen specific motifs should be most useful in database searches for potential allergens. We found that sequence motifs unique to the allergens in three families (seed storage proteins, Bet v 1, and tropomyosin) overlap with known IgE epitopes, thus providing evidence that our motif based approach can be used to assess the potential allergenicity of novel proteins. PMID:18951633
Singh, D D; Saikrishnan, K; Kumar, Prashant; Surolia, A; Sekar, K; Vijayan, M
2005-10-01
The crystal structure of a complex of methyl-alpha-D-mannoside with banana lectin from Musa paradisiaca reveals two primary binding sites in the lectin, unlike in other lectins with beta-prism I fold which essentially consists of three Greek key motifs. It has been suggested that the fold evolved through successive gene duplication and fusion of an ancestral Greek key motif. In other lectins, all from dicots, the primary binding site exists on one of the three motifs in the three-fold symmetric molecule. Banana is a monocot, and the three motifs have not diverged enough to obliterate sequence similarity among them. Two Greek key motifs in it carry one primary binding site each. A common secondary binding site exists on the third Greek key. Modelling shows that both the primary sites can support 1-2, 1-3, and 1-6 linked mannosides with the second residue interacting in each case primarily with the secondary binding site. Modelling also readily leads to a bound branched mannopentose with the nonreducing ends of the two branches anchored at the two primary binding sites, providing a structural explanation for the lectin's specificity for branched alpha-mannans. A comparison of the dimeric banana lectin with other beta-prism I fold lectins, provides interesting insights into the variability in their quaternary structure.
NASA Astrophysics Data System (ADS)
Parry, Christian S.; Gorski, Jack; Stern, Lawrence J.
2003-03-01
The stable binding of processed foreign peptide to a class II major histocompatibility (MHC) molecule and subsequent presentation to a T cell receptor is a central event in immune recognition and regulation. Polymorphic residues on the floor of the peptide binding site form pockets that anchor peptide side chains. These and other residues in the helical wall of the groove determine the specificity of each allele and define a motif. Allele-specific motifs allow the prediction of epitopes from the sequence of pathogens. There are, however, known epitopes that do not satisfy these motifs: anchor motifs are not adequate for predicting epitopes as there are apparently major and minor motifs. We present crystallographic studies into the nature of the interactions that govern the binding of these so-called nonconforming peptides. We would like to understand the role of the P10 pocket and find out whether the peptides that do not obey the consensus anchor motif bind in the canonical conformation observed in prior structures of class II MHC-peptide complexes. HLA-DRB3*0101 complexed with peptide crystallized in unit cell 92.10 x 92.10 x 248.30 (90, 90, 90), P41212, and the diffraction data are reliable to 2.2 Å. We are complementing our studies with dynamical long time simulations to answer these questions, particularly the interplay of the anchor motifs in peptide binding, the range of protein and ligand conformations, and water hydration structures.
Jenkins, Janelle E.; Sampath, Sujatha; Butler, Emily; Kim, Jihyun; Henning, Robert W.; Holland, Gregory P.; Yarger, Jeffery L.
2013-01-01
This study provides a detailed secondary structural characterization of major ampullate dragline silk from Latrodectus hesperus (black widow) spiders. X-ray diffraction results show that the structure of black widow major ampullate silk fibers is comprised of stacked β-sheet nanocrystallites oriented parallel to the fiber axis and an amorphous region with oriented (anisotropic) and isotropic components. The combination of two-dimensional (2D) 13C-13C through-space and through-bond solid-state NMR experiments provides chemical shifts that are used to determine detailed information about amino acid motif secondary structure in black widow spider dragline silk. Individual amino acids are incorporated into different repetitive motifs that make up the majority of this protein-based biopolymer. From the solid-state NMR measurements, we assign distinct secondary conformations to each repetitive amino acid motif and hence to the amino acids that make up the motifs. Specifically, alanine is incorporated in β-sheet (poly(Ala)n and poly(Gly-Ala)), 3₁-helix (poly(Gly-Gly-Xaa)), and α-helix (poly(Gln-Gln-Ala-Tyr)) components. Glycine is determined to be in β-sheet (poly(Gly-Ala)) and 3₁-helical (poly(Gly-Gly-Xaa)) regions, while serine is present in β-sheet (poly(Gly-Ala-Ser)), 3₁-helix (poly(Gly-Gly-Ser)), and β-turn (poly(Gly-Pro-Ser)) structures. These various motif-specific secondary structural elements are quantitatively correlated to the primary amino acid sequence of major ampullate spidroin 1 and 2 (MaSp1 and MaSp2) and are shown to form a self-consistent model for black widow dragline silk. PMID:24024617
Protein–DNA Interactions: The Story so Far and a New Method for Prediction
Jones, Susan; Thornton, Janet M.
2003-01-01
This review describes methods for the prediction of DNA binding function, and specifically summarizes a new method using 3D structural templates. The new method features the HTH motif that is found in approximately one-third of DNA-binding protein families. A library of 3D structural templates of HTH motifs was derived from proteins in the PDB. Templates were scanned against complete protein structures and the optimal superposition of a template on a structure calculated. Significance thresholds in terms of a minimum root mean squared deviation (rmsd) of an optimal superposition, and a minimum motif accessible surface area (ASA), have been calculated. In this way, it is possible to scan the template library against proteins of unknown function to make predictions about DNA-binding functionality.
Zhang, Yi; Berghaus, Melanie; Klein, Sean; Jenkins, Kelly; Zhang, Siwen; McCallum, Scott A; Morgan, Joel E; Winter, Roland; Barrick, Doug; Royer, Catherine A
2018-04-27
Many repeat proteins contain capping motifs, which serve to shield the hydrophobic core from solvent and maintain structural integrity. While the role of capping motifs in enhancing the stability and structural integrity of repeat proteins is well documented, their contribution to folding cooperativity is not. Here we examined the role of capping motifs in defining the folding cooperativity of the leucine-rich repeat protein, pp32, by monitoring the pressure- and urea-induced unfolding of an N-terminal capping motif (N-cap) deletion mutant, pp32-∆N-cap, and a C-terminal capping motif destabilization mutant pp32-Y131F/D146L, using residue-specific NMR and small-angle X-ray scattering. Destabilization of the C-terminal capping motif resulted in higher cooperativity for the unfolding transition compared to wild-type pp32, as these mutations render the stability of the C-terminus similar to that of the rest of the protein. In contrast, deletion of the N-cap led to strong deviation from two-state unfolding. In both urea- and pressure-induced unfolding, residues in repeats 1-3 of pp32-ΔN-cap lost their native structure first, while the C-terminal half was more stable. The residue-specific free energy changes in all regions of pp32-ΔN-cap were larger in urea compared to high pressure, indicating a less cooperative destabilization by pressure. Moreover, in contrast to complete structural disruption of pp32-ΔN-cap at high urea concentration, its pressure unfolded state remained compact. The contrasting effects of the capping motifs on folding cooperativity arise from the differential local stabilities of pp32, whereas the contrasting effects of pressure and urea on the pp32-ΔN-cap variant arise from their distinct mechanisms of action. Copyright © 2018 Elsevier Ltd. All rights reserved.
Knabe, Johannes F; Nehaniv, Chrystopher L; Schilstra, Maria J
2008-01-01
Methods that analyse the topological structure of networks have recently become quite popular. Whether motifs (subgraph patterns that occur more often than in randomized networks) have specific functions as elementary computational circuits has been cause for debate. As the question is difficult to resolve with currently available biological data, we approach the issue using networks that abstractly model natural genetic regulatory networks (GRNs) which are evolved to show dynamical behaviors. Specifically one group of networks was evolved to be capable of exhibiting two different behaviors ("differentiation") in contrast to a group with a single target behavior. In both groups we find motif distribution differences within the groups to be larger than differences between them, indicating that evolutionary niches (target functions) do not necessarily mold network structure uniquely. These results show that variability operators can have a stronger influence on network topologies than selection pressures, especially when many topologies can create similar dynamics. Moreover, analysis of motif functional relevance by lesioning did not suggest that motifs were of greater importance to the functioning of the network than arbitrary subgraph patterns. Only when drastically restricting network size, so that one motif corresponds to a whole functionally evolved network, was preference for particular connection patterns found. This suggests that in non-restricted, bigger networks, entanglement with the rest of the network hinders topological subgraph analysis.
QuadBase2: web server for multiplexed guanine quadruplex mining and visualization
Dhapola, Parashar; Chowdhury, Shantanu
2016-01-01
DNA guanine quadruplexes or G4s are non-canonical DNA secondary structures which affect genomic processes like replication, transcription and recombination. G4s are computationally identified by specific nucleotide motifs which are also called putative G4 (PG4) motifs. Despite the general relevance of these structures, there is currently no tool available that allows batch queries and genome-wide analysis of these motifs in a user-friendly interface. QuadBase2 (quadbase.igib.res.in) presents a completely reinvented web server version of the previously published QuadBase database. QuadBase2 enables users to mine PG4 motifs in up to 178 eukaryotes through the EuQuad module. This module interfaces with the Ensembl Compara database to allow users to mine PG4 motifs in the orthologues of genes of interest across eukaryotes. PG4 motifs can be mined across genes and their promoter sequences in 1719 prokaryotes through the ProQuad module. This module includes a feature that allows genome-wide mining of PG4 motifs and their visualization as circular histograms. TetraplexFinder, the module for mining PG4 motifs in user-provided sequences, is now capable of handling up to 20 MB of data. QuadBase2 is a comprehensive PG4 motif mining tool that further expands the configurations and algorithms for mining PG4 motifs in a user-friendly way. PMID:27185890
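The abstract does not say which sequence pattern QuadBase2 applies, so the following is only a sketch of how PG4 mining is commonly done: it assumes the widely used pattern of four runs of at least three guanines separated by loops of 1 to 7 nucleotides. The pattern, the function name and the test sequence are assumptions for illustration, not the server's actual configuration.

```python
import re

# Assumed pattern: G3+ N1-7 G3+ N1-7 G3+ N1-7 G3+ (not taken from QuadBase2 itself).
PG4_PATTERN = re.compile(r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}")


def find_pg4(seq):
    """Return (start, end, motif) for non-overlapping PG4 matches on the given strand."""
    seq = seq.upper()
    return [(m.start(), m.end(), m.group()) for m in PG4_PATTERN.finditer(seq)]


# Invented example sequence containing one putative G4 motif.
hits = find_pg4("ttGGGAGGGTGGGAGGGtt")
```

Only the given strand is scanned here; a fuller search would also scan the reverse complement (equivalently, look for C-runs), and the loop and run lengths would normally be exposed as parameters.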
Petrov, Artem; Arzhanik, Vladimir; Makarov, Gennady; Koliasnikov, Oleg
2016-08-01
Antibodies are a family of proteins responsible for antigen recognition. The computational modeling of the interaction between an antigen and an antibody is very important when a crystallographic structure is unavailable. In this research, we have discovered a correlation between the amino acid sequence of an antibody and its specific binding characteristics, using the example of a novel conserved binding motif that consists of four residues: Arg H52, Tyr H33, Thr H59, and Glu H61. These residues are specifically oriented in the binding site and interact with each other in a specific manner. The residues of the binding motif interact strictly with negatively charged groups of antigens and form a binding complex. The mechanism of interaction and the characteristics of the complex were also determined. The results of this research can be used to increase the accuracy of computational antibody-antigen interaction modeling and for post-modeling quality control of the modeled structures.
Kieken, Fabien; Jović, Marko; Tonelli, Marco; Naslavsky, Naava; Caplan, Steve; Sorgen, Paul L
2009-01-01
Eps15 homology (EH)-domain containing proteins are regulators of endocytic membrane trafficking. EH-domain binding to proteins containing the tripeptide NPF has been well characterized, but recent studies have shown that EH-domains are also able to interact with ligands containing DPF or GPF motifs. We demonstrate that the three motifs interact in a similar way with the EH-domain of EHD1, with the NPF motif having the highest affinity due to the presence of an intermolecular hydrogen bond. The weaker affinity for the DPF and GPF motifs suggests that if complex formation occurs in vivo, they may require high ligand concentrations, the presence of successive motifs and/or specific flanking residues. PMID:19798736
Finding specific RNA motifs: Function in a zeptomole world?
Knight, Rob; Yarus, Michael
2003-01-01
We have developed a new method for estimating the abundance of any modular (piecewise) RNA motif within a longer random region. We have used this method to estimate the size of the active motifs available to modern SELEX experiments (picomoles of unique sequences) and to a plausible RNA World (zeptomoles of unique sequences: 1 zmole = 602 sequences). Unexpectedly, activities such as specific isoleucine binding are almost certainly present in zeptomoles of molecules, and even ribozymes such as self-cleavage motifs may appear (depending on assumptions about the minimal structures). The number of specified nucleotides is not the only important determinant of a motif’s rarity: The number of modules into which it is divided, and the details of this division, are also crucial. We propose three maxims for easily isolated motifs: the Maxim of Minimization, the Maxim of Multiplicity, and the Maxim of the Median. These maxims together state that selected motifs should be small and composed of as many separate, equally sized modules as possible. For evenly divided motifs with four modules, the largest accessible activity in picomole scale (1–1000 pmole) pools of length 100 is about 34 nucleotides; while for zeptomole scale (1–1000 zmole) pools it is about 20 specific nucleotides (50% probability of occurrence). This latter figure includes some ribozymes and aptamers. Consequently, an RNA metabolism apparently could have begun with only zeptomoles of RNA molecules. PMID:12554865
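The conversion "1 zmole = 602 sequences" quoted above follows directly from Avogadro's number. A minimal sketch of that arithmetic is given below; the constant is the exact SI value, while the function and variable names are illustrative only.

```python
AVOGADRO = 6.02214076e23  # molecules per mole (exact SI definition)
ZEPTO = 1e-21             # SI prefix zepto
PICO = 1e-12              # SI prefix pico


def molecule_count(moles):
    """Number of molecules in the given amount of substance."""
    return moles * AVOGADRO


per_zeptomole = molecule_count(1 * ZEPTO)  # about 602 molecules
per_picomole = molecule_count(1 * PICO)    # about 6.0e11 molecules
```

The nine orders of magnitude between the two scales are what separate a modern SELEX pool from the zeptomole "RNA World" pool discussed in the abstract.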
Conserved binding of GCAC motifs by MEC-8, couch potato, and the RBPMS protein family
Soufari, Heddy
2017-01-01
Precise regulation of mRNA processing, translation, localization, and stability relies on specific interactions with RNA-binding proteins whose biological function and target preference are dictated by their preferred RNA motifs. The RBPMS family of RNA-binding proteins is defined by a conserved RNA recognition motif (RRM) domain found in metazoan RBPMS/Hermes and RBPMS2, Drosophila couch potato, and MEC-8 from Caenorhabditis elegans. In order to determine the parameters of RNA sequence recognition by the RBPMS family, we have first used the N-terminal domain from MEC-8 in binding assays and have demonstrated a preference for two GCAC motifs optimally separated by >6 nucleotides (nt). We have also determined the crystal structure of the dimeric N-terminal RRM domain from MEC-8 in the unbound form, and in complex with an oligonucleotide harboring two copies of the optimal GCAC motif. The atomic details reveal the molecular network that provides specificity to all four bases in the motif, including multiple hydrogen bonds to the initial guanine. Further studies with human RBPMS, as well as Drosophila couch potato, confirm a general preference for this double GCAC motif by other members of the protein family and the presence of this motif in known targets. PMID:28003515
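The reported preference for two GCAC motifs separated by more than 6 nt can be turned into a simple sequence scan. The sketch below is an assumption-laden illustration: it treats "separated by" as the number of nucleotides between the end of the first GCAC and the start of the second, and the function name and example sequences are invented.

```python
import re


def gcac_pairs(rna, min_gap=7):
    """Return (first_start, second_start, gap) for GCAC pairs with gap >= min_gap.

    gap counts the nucleotides between the two motifs; min_gap=7 encodes the
    assumed reading of '>6 nucleotides'.
    """
    rna = rna.upper()
    starts = [m.start() for m in re.finditer(r"(?=GCAC)", rna)]
    pairs = []
    for i, first in enumerate(starts):
        for second in starts[i + 1:]:
            gap = second - (first + 4)
            if gap >= min_gap:
                pairs.append((first, second, gap))
    return pairs


# Invented oligonucleotides: a 7-nt spacer qualifies, a 6-nt spacer does not.
good = gcac_pairs("GCAC" + "U" * 7 + "GCAC")
bad = gcac_pairs("GCAC" + "U" * 6 + "GCAC")
```

Such a scan only flags candidate sites; it says nothing about affinity, which the study measured experimentally.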
Rules for the recognition of dilysine retrieval motifs by coatomer
Ma, Wenfu; Goldberg, Jonathan
2013-01-01
Cytoplasmic dilysine motifs on transmembrane proteins are captured by coatomer α-COP and β′-COP subunits and packaged into COPI-coated vesicles for Golgi-to-ER retrieval. Numerous ER/Golgi proteins contain K(x)Kxx motifs, but the rules for their recognition are unclear. We present crystal structures of α-COP and β′-COP bound to a series of naturally occurring retrieval motifs—encompassing KKxx, KxKxx and non-canonical RKxx and viral KxHxx sequences. Binding experiments show that α-COP and β′-COP have generally the same specificity for KKxx and KxKxx, but only β′-COP recognizes the RKxx signal. Dilysine motif recognition involves lysine side-chain interactions with two acidic patches. Surprisingly, however, KKxx and KxKxx motifs bind differently, with their lysine residues transposed at the binding patches. We derive rules for retrieval motif recognition from key structural features: the reversed binding modes, the recognition of the C-terminal carboxylate group which enforces lysine positional context, and the tolerance of the acidic patches for non-lysine residues. PMID:23481256
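The motif classes named in this abstract are all defined relative to the C-terminus, so they can be expressed as end-anchored patterns. The sketch below classifies the tail of a protein sequence into those classes; the regular expressions are a direct reading of the motif names, while the function name and the example sequences are hypothetical and not taken from the paper.

```python
import re

# End-anchored patterns; 'x' in the motif names is any residue ('.' in the regex).
RULES = [
    ("KKxx", re.compile(r"KK..$")),
    ("KxKxx", re.compile(r"K.K..$")),
    ("RKxx", re.compile(r"RK..$")),
    ("KxHxx", re.compile(r"K.H..$")),
]


def classify_c_terminus(protein_seq):
    """Return the names of all retrieval-motif classes matched by the C-terminus."""
    seq = protein_seq.upper().strip()
    return [name for name, pattern in RULES if pattern.search(seq)]


# Invented tails for illustration.
examples = {
    "MAAAKKTN": classify_c_terminus("MAAAKKTN"),
    "MAAKAKTN": classify_c_terminus("MAAKAKTN"),
    "MAAARKTN": classify_c_terminus("MAAARKTN"),
}
```

A tail such as KKKxx matches both KKxx and KxKxx, which is why the function returns a list rather than a single label. Pattern matching alone does not capture the structural rules derived in the paper, such as the role of the C-terminal carboxylate.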
BlockLogo: visualization of peptide and sequence motif conservation
Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian; Sun, Jing; Schönbach, Christian; Reinherz, Ellis L.; Zhang, Guang Lan; Brusic, Vladimir
2013-01-01
BlockLogo is a web-server application for visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, selection of motif positions, type of sequence, and output format definition. The output has BlockLogo along with the sequence logo, and a table of motif frequencies. We deployed BlockLogo as an online application and have demonstrated its utility through examples that show visualization of T-cell epitopes and B-cell epitopes (both continuous and discontinuous). Our additional example shows a visualization and analysis of structural motifs that determine specificity of peptide binding to HLA-DR molecules. The BlockLogo server also employs selected experimentally validated prediction algorithms to enable on-the-fly prediction of MHC binding affinity to 15 common HLA class I and class II alleles as well as visual analysis of discontinuous epitopes from multiple sequence alignments. It enables the visualization and analysis of structural and functional motifs that are usually described as regular expressions. It provides a compact view of discontinuous motifs composed of distant positions within biological sequences. BlockLogo is available at: http://research4.dfci.harvard.edu/cvc/blocklogo/ and http://methilab.bu.edu/blocklogo/ PMID:24001880
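The idea of block entropy can be illustrated with a short sketch: the selected alignment columns are read together as one block per sequence, and Shannon entropy is computed over the block frequencies. The function names are hypothetical, and the handling of gaps, weighting, and small-sample effects in the actual server is not described here, so this should be read as an assumption-laden approximation.

```python
from collections import Counter
from math import log2

def block_frequencies(alignment, positions):
    """Frequency of each block, i.e. the residues at the selected
    (0-based, possibly discontinuous) positions joined per sequence."""
    blocks = ["".join(seq[p] for p in positions) for seq in alignment]
    total = len(blocks)
    return {block: n / total for block, n in Counter(blocks).items()}

def block_entropy(alignment, positions):
    """Shannon entropy (bits) of the block distribution."""
    freqs = block_frequencies(alignment, positions)
    return -sum(f * log2(f) for f in freqs.values())

# Toy alignment; columns 0 and 3 form a discontinuous motif.
toy = ["ACDE", "ACDF", "GCDE", "ACDE"]
print(block_frequencies(toy, [0, 3]))
print(block_entropy(toy, [0, 3]))
```

Treating the positions jointly, rather than column by column, is what lets a discontinuous motif be summarized as a single unit.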
Structure of a putative acetyltransferase (PA1377) from Pseudomonas aeruginosa
DOE Office of Scientific and Technical Information (OSTI.GOV)
Davies, Anna M.; Tata, Renée; Chauviac, François-Xavier
2008-05-01
The crystal structure of an acetyltransferase encoded by the gene PA1377 from Pseudomonas aeruginosa has been determined at 2.25 Å resolution. Comparison with a related acetyltransferase revealed a structural difference in the active site that was taken to reflect a difference in substrate binding and/or specificity between the two enzymes. Gene PA1377 from Pseudomonas aeruginosa encodes a 177-amino-acid conserved hypothetical protein of unknown function. The structure of this protein (termed pitax) has been solved in space group I222 to 2.25 Å resolution. Pitax belongs to the GCN5-related N-acetyltransferase family and contains all four sequence motifs conserved among family members. The β-strand structure in one of these motifs (motif A) is disrupted, which is believed to affect binding of the substrate that accepts the acetyl group from acetyl-CoA.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Porebski, Przemyslaw J.; Klimecka, Maria; Chruszcz, Maksymilian
2012-07-11
Dethiobiotin synthetase (DTBS) is involved in the biosynthesis of biotin in bacteria, fungi, and plants. As humans lack this pathway, DTBS is a promising antimicrobial drug target. We determined structures of DTBS from Helicobacter pylori (hpDTBS) bound with cofactors and a substrate analog, and described its unique characteristics relative to other DTBS proteins. Comparison with bacterial DTBS orthologs revealed considerable structural differences in nucleotide recognition. The C-terminal region of DTBS proteins, which contains two nucleotide-recognition motifs, differs greatly among DTBS proteins from different species. The structure of hpDTBS revealed that this protein is unique and does not contain a C-terminal region containing one of the motifs. The single nucleotide-binding motif in hpDTBS is similar to its counterpart in GTPases; however, isothermal titration calorimetry binding studies showed that hpDTBS has a strong preference for ATP. The structural determinants of ATP specificity were assessed with X-ray crystallographic studies of hpDTBS-ATP and hpDTBS-GTP complexes. The unique mode of nucleotide recognition in hpDTBS makes this protein a good target for H. pylori-specific inhibitors of the biotin synthesis pathway.
[Structure and evolution of the eukaryotic FANCJ-like proteins].
Wuhe, Jike; Zefeng, Wu; Sanhong, Fan; Xuguang, Xi
2015-02-01
The FANCJ-like protein family is a class of ATP-dependent helicases that can catalytically unwind duplex DNA along the 5'-3' direction. It is involved in the processes of DNA damage repair, homologous recombination and G-quadruplex DNA unwinding, and plays a critical role in maintaining genome integrity. In this study, we systematically analyzed FANCJ-like proteins from 47 eukaryotic species and discussed their sequence diversity, origin and evolution, motif organization patterns and spatial structure differences. Four members of FANCJ-like proteins, including XPD, CHL1, RTEL1 and FANCJ, were found in eukaryotes, but some of them were seriously deficient in most fungi and some insects. For example, the Zygomycota fungi lost RTEL1, Basidiomycota and Ascomycota fungi lost RTEL1 and FANCJ, and Diptera insects lost FANCJ. FANCJ-like proteins contain canonical motor domains HD1 and HD2, and the HD1 domain further integrates with three unique domains Fe-S, Arch and Extra-D. Fe-S and Arch domains are relatively conserved in all members of the family, but the Extra-D domain is lost in XPD and differs among the remaining members. There are 7, 10 and 2 specific motifs found in the three unique domains respectively, while 5 and 12 specific motifs are found in the HD1 and HD2 domains in addition to the conserved motifs reported previously. By analyzing the arrangement pattern of these specific motifs, we found that RTEL1 and FANCJ are closer to each other and share two specific motifs, Vb2 and Vc, in the HD2 domain, which are likely related to their G-quadruplex DNA unwinding activity. The evolutionary evidence showed that FANCJ-like proteins originated from a helicase that has an HD1 domain with inserted Fe-S and Arch domains. Through three successive gene duplication events followed by specialization, eukaryotes finally came to possess the current four members of FANCJ-like proteins.
Tomar, Navneet; Mishra, Akhilesh; Mrinal, Nirotpal; Jayaram, B.
2016-01-01
Transcription factors (TFs) bind at multiple sites in the genome and regulate expression of many genes. Regulating TF binding in a gene-specific manner remains a formidable challenge in drug discovery because the same binding motif may be present at multiple locations in the genome. Here, we present Onco-Regulon (http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm), an integrated database of regulatory motifs of cancer genes combined with Unique Sequence-Predictor (USP), a software suite that identifies unique sequences for each of these regulatory DNA motifs at the specified position in the genome. USP works by extending a given DNA motif in the 5′→3′ direction, the 3′→5′ direction, or both, adding one nucleotide at each step, and calculates the frequency of each extended motif in the genome with the Frequency Counter programme. This step is iterated until the frequency of the extended motif in the genome becomes unity. Thus, for each given motif, we get three possible unique sequences. The Closest Sequence Finder program predicts off-target drug binding in the genome. Inclusion of DNA-protein structural information further makes Onco-Regulon a highly informative repository for gene-specific drug development. We believe that Onco-Regulon will help researchers to design drugs that, in theory, bind to an exclusive site in the genome with no off-target effects. Database URL: http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm PMID:27515825
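The extension loop described for USP can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the published software: the function names are hypothetical, only one strand of a toy genome string is searched, and in the "both" mode one nucleotide is added on each side per step.

```python
def count_occurrences(genome, seq):
    """Count (possibly overlapping) occurrences of seq in genome."""
    count, pos = 0, genome.find(seq)
    while pos != -1:
        count += 1
        pos = genome.find(seq, pos + 1)
    return count

def extend_until_unique(genome, start, end, direction="right"):
    """Extend the motif occupying genome[start:end] one nucleotide per step
    ("left" = towards 5', "right" = towards 3', or "both") until it occurs
    exactly once. Returns the unique sequence, or None if the genome edge
    is reached first."""
    while count_occurrences(genome, genome[start:end]) > 1:
        moved = False
        if direction in ("left", "both") and start > 0:
            start -= 1
            moved = True
        if direction in ("right", "both") and end < len(genome):
            end += 1
            moved = True
        if not moved:
            return None
    return genome[start:end]

toy_genome = "ACGTTACGTA"  # the motif ACG occurs at positions 0 and 5
for mode in ("left", "right", "both"):
    print(mode, extend_until_unique(toy_genome, 5, 8, mode))
```

Running the three modes on the same site gives the three candidate unique sequences per motif that the abstract mentions. Recounting from scratch at every step is quadratic and only suitable for toy inputs; a genome-scale version would need an index.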
Smith, Robert A; Anderson, Donovan J; Preston, Bradley D
2006-07-01
Human immunodeficiency virus type 1 (HIV-1) reverse transcriptase (RT) contains four structural motifs (A, B, C, and D) that are conserved in polymerases from diverse organisms. Motif B interacts with the incoming nucleotide, the template strand, and key active-site residues from other motifs, suggesting that motif B is an important determinant of substrate specificity. To examine the functional role of this region, we performed "random scanning mutagenesis" of 11 motif B residues and screened replication-competent mutants for altered substrate analog sensitivity in culture. Single amino acid replacements throughout the targeted region conferred resistance to lamivudine and/or hypersusceptibility to zidovudine (AZT). Substitutions at residue Q151 increased the sensitivity of HIV-1 to multiple nucleoside analogs, and a subset of these Q151 variants was also hypersusceptible to the pyrophosphate analog phosphonoformic acid (PFA). Other AZT-hypersusceptible mutants were resistant to PFA and are therefore phenotypically similar to PFA-resistant variants selected in vitro and in infected patients. Collectively, these data show that specific amino acid replacements in motif B confer broad-spectrum hypersusceptibility to substrate analog inhibitors. Our results suggest that motif B influences RT-deoxynucleoside triphosphate interactions at multiple steps in the catalytic cycle of polymerization.
NASA Astrophysics Data System (ADS)
Zimmermann, Nils E. R.; Horton, Matthew K.; Jain, Anubhav; Haranczyk, Maciej
2017-11-01
Structure-property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal close packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.
Structural and functional analysis of the GABARAP interaction motif (GIM)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rogov, Vladimir V.; Stolz, Alexandra; Ravichandran, Arvind C.
Through the canonical LC3 interaction motif (LIR), [W/F/Y]–X1–X2–[I/L/V], protein complexes are recruited to autophagosomes to perform their functions as either autophagy adaptors or receptors. How these adaptors/receptors selectively interact with either LC3 or GABARAP families remains unclear. Herein, we determine the range of selectivity of 30 known core LIR motifs towards individual LC3s and GABARAPs. From these, we define a GABARAP Interaction Motif (GIM) sequence ([W/F]–[V/I]–X2–V) that the adaptor protein PLEKHM1 tightly conforms to. Using biophysical and structural approaches, we show that the PLEKHM1–LIR is indeed 11–fold more specific for GABARAP than LC3B. Selective mutation of the X1 and X2 positions either completely abolished the interaction with all LC3 and GABARAPs or increased PLEKHM1–GIM selectivity 20–fold towards LC3B. Finally, we show that conversion of p62/SQSTM1, FUNDC1 and FIP200 LIRs into our newly defined GIM, by introducing two valine residues, enhances their interaction with endogenous GABARAP over LC3B. In conclusion, the identification of a GABARAP–specific interaction motif will aid the identification and characterization of the expanding array of autophagy receptor and adaptor proteins and their in vivo functions.
Structural and functional analysis of the GABARAP interaction motif (GIM)
Rogov, Vladimir V.; Stolz, Alexandra; Ravichandran, Arvind C.; ...
2017-06-27
Through the canonical LC3 interaction motif (LIR), [W/F/Y]–X1–X2–[I/L/V], protein complexes are recruited to autophagosomes to perform their functions as either autophagy adaptors or receptors. How these adaptors/receptors selectively interact with either LC3 or GABARAP families remains unclear. Herein, we determine the range of selectivity of 30 known core LIR motifs towards individual LC3s and GABARAPs. From these, we define a GABARAP Interaction Motif (GIM) sequence ([W/F]–[V/I]–X2–V) that the adaptor protein PLEKHM1 tightly conforms to. Using biophysical and structural approaches, we show that the PLEKHM1–LIR is indeed 11–fold more specific for GABARAP than LC3B. Selective mutation of the X1 and X2 positions either completely abolished the interaction with all LC3 and GABARAPs or increased PLEKHM1–GIM selectivity 20–fold towards LC3B. Finally, we show that conversion of p62/SQSTM1, FUNDC1 and FIP200 LIRs into our newly defined GIM, by introducing two valine residues, enhances their interaction with endogenous GABARAP over LC3B. In conclusion, the identification of a GABARAP–specific interaction motif will aid the identification and characterization of the expanding array of autophagy receptor and adaptor proteins and their in vivo functions.
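The two consensus patterns quoted in this abstract translate directly into regular expressions. The sketch below is illustrative only: the names `LIR`, `GIM`, and `find_motifs` are hypothetical, the peptides are toy strings rather than real protein sequences, and a sequence match says nothing about measured selectivity.

```python
import re

# Consensus patterns from the abstract; "." stands for X1 or X2 (any residue).
# The lookahead lets overlapping candidate sites be reported.
LIR = re.compile(r"(?=([WFY]..[ILV]))")   # [W/F/Y]-X1-X2-[I/L/V]
GIM = re.compile(r"(?=([WF][VI].V))")     # [W/F]-[V/I]-X2-V

def find_motifs(peptide, pattern):
    """Return (0-based start, matched tetrapeptide) for every site."""
    return [(m.start(), m.group(1)) for m in pattern.finditer(peptide.upper())]

for toy in ("AAWVNVAA", "AAYTELAA"):
    print(toy, "LIR:", find_motifs(toy, LIR), "GIM:", find_motifs(toy, GIM))
```

Because the GIM pattern is a restriction of the LIR pattern, every GIM hit is also an LIR hit, while the reverse does not hold.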
Cellular automata simulation of topological effects on the dynamics of feed-forward motifs
Apte, Advait A; Cain, John W; Bonchev, Danail G; Fong, Stephen S
2008-01-01
Background Feed-forward motifs are important functional modules in biological and other complex networks. The functionality of feed-forward motifs and other network motifs is largely dictated by the connectivity of the individual network components. While studies on the dynamics of motifs and networks are usually devoted to the temporal or spatial description of processes, this study focuses on the relationship between the specific architecture and the overall rate of the processes of the feed-forward family of motifs, including double and triple feed-forward loops. The search for the most efficient network architecture could be of particular interest for regulatory or signaling pathways in biology, as well as in computational and communication systems. Results Feed-forward motif dynamics were studied using cellular automata and compared with differential equation modeling. The number of cellular automata iterations needed for a 100% conversion of a substrate into a target product was used as an inverse measure of the transformation rate. Several basic topological patterns were identified that order the specific feed-forward constructions according to the rate of dynamics they enable. At the same number of network nodes and constant other parameters, the bi-parallel and tri-parallel motifs provide higher network efficacy than single feed-forward motifs. Additionally, a topological property of isodynamicity was identified for feed-forward motifs where different network architectures resulted in the same overall rate of the target production. Conclusion It was shown for classes of structural motifs with feed-forward architecture that network topology affects the overall rate of a process in a quantitatively predictable manner. 
These fundamental results can be used as a basis for simulating larger networks as combinations of smaller network modules with implications on studying synthetic gene circuits, small regulatory systems, and eventually dynamic whole-cell models. PMID:18304325
DOE Office of Scientific and Technical Information (OSTI.GOV)
Han, S.; Tainer, J.A.
2001-08-01
ADP-ribosylation is a widely occurring and biologically critical covalent chemical modification process in pathogenic mechanisms, intracellular signaling systems, DNA repair, and cell division. The reaction is catalyzed by ADP-ribosyltransferases, which transfer the ADP-ribose moiety of NAD to a target protein with nicotinamide release. A family of bacterial toxins and eukaryotic enzymes has been termed the mono-ADP-ribosyltransferases, in distinction to the poly-ADP-ribosyltransferases, which catalyze the addition of multiple ADP-ribose groups to the carboxyl terminus of eukaryotic nucleoproteins. Despite the limited primary sequence homology among the different ADP-ribosyltransferases, a central cleft bearing the NAD-binding pocket formed by the two perpendicular β-sheet core has been remarkably conserved between bacterial toxins and eukaryotic mono- and poly-ADP-ribosyltransferases. The majority of bacterial toxins and eukaryotic mono-ADP-ribosyltransferases are characterized by conserved His and catalytic Glu residues. In contrast, Diphtheria toxin, Pseudomonas exotoxin A, and eukaryotic poly-ADP-ribosyltransferases are characterized by conserved Arg and catalytic Glu residues. The NAD-binding core of a binary toxin and a C3-like toxin family identified an ARTT motif (ADP-ribosylating turn-turn motif) that is implicated in substrate specificity and recognition by structural and mutagenic studies. Here we apply structure-based sequence alignment and comparative structural analyses of all known structures of ADP-ribosyltransferases to suggest that this ARTT motif is functionally important in many ADP-ribosylating enzymes that bear a NAD binding cleft as characterized by conserved Arg and catalytic Glu residues. 
Overall, structure-based sequence analysis reveals common core structures and conserved active sites of ADP-ribosyltransferases to support similar NAD binding mechanisms but differing mechanisms of target protein binding via sequence variations within the ARTT motif structural framework. Thus, we propose here that the ARTT motif represents an experimentally testable general recognition motif region for many ADP-ribosyltransferases and thereby potentially provides a unified structural understanding of substrate recognition in ADP-ribosylation processes.
Membrane Curvature Sensing by Amphipathic Helices Is Modulated by the Surrounding Protein Backbone.
Doucet, Christine M; Esmery, Nina; de Saint-Jean, Maud; Antonny, Bruno
2015-01-01
Membrane curvature is involved in numerous biological pathways like vesicle trafficking, endocytosis or nuclear pore complex assembly. In addition to its topological role, membrane curvature is sensed by specific proteins, enabling the coordination of biological processes in space and time. Amongst membrane curvature sensors are the ALPS (Amphipathic Lipid Packing Sensors). ALPS motifs are short peptides with peculiar amphipathic properties. They are found in proteins targeted to distinct curved membranes, mostly in the early secretory pathway. For instance, the ALPS motif of the golgin GMAP210 binds trafficking vesicles, while the ALPS motif of Nup133 targets nuclear pores. It is not clear if, besides curvature sensitivity, ALPS motifs also provide target specificity, or if other domains in the surrounding protein backbone are involved. To elucidate this aspect, we studied the subcellular localization of ALPS motifs outside their natural protein context. The ALPS motifs of GMAP210 or Nup133 were grafted on artificial fluorescent probes. Importantly, ALPS motifs are held in different positions and these contrasting architectures were mimicked by the fluorescent probes. The resulting chimeras recapitulated the original proteins' localization, indicating that ALPS motifs are sufficient to specifically localize proteins. Modulating the electrostatic or hydrophobic content of the Nup133 ALPS motif modified its avidity for cellular membranes but did not change its organelle targeting properties. In contrast, the structure of the backbone surrounding the helix strongly influenced targeting. In particular, introducing an artificial coiled-coil between ALPS and the fluorescent protein increased membrane curvature sensitivity. This coiled-coil domain also provided membrane curvature sensitivity to the amphipathic helix of Sar1. The degree of curvature sensitivity within the coiled-coil context remains correlated to the natural curvature sensitivity of the helices. 
This suggests that the chemistry of ALPS motifs is a key parameter for membrane curvature sensitivity, which can be further modulated by the surrounding protein backbone.
Ni2+-binding RNA motifs with an asymmetric purine-rich internal loop and a G-A base pair.
Hofmann, H P; Limmer, S; Hornung, V; Sprinzl, M
1997-01-01
RNA molecules with high affinity for immobilized Ni2+ were isolated from an RNA pool with 50 randomized positions by in vitro selection-amplification. The selected RNAs preferentially bind Ni2+ and Co2+ over other cations from first series transition metals. Conserved structure motifs, comprising about 15 nt, were identified that are likely to represent the Ni2+ binding sites. Two conserved motifs contain an asymmetric purine-rich internal loop and probably a mismatch G-A base pair. The structure of one of these motifs was studied with proton NMR spectroscopy and formation of the G-A pair at the junction of helix and internal loop was demonstrated. Using Ni2+ as a paramagnetic probe, a divalent metal ion binding site near this G-A base pair was identified. Ni2+ ions bound to this motif exert a specific stabilization effect. We propose that small asymmetric purine-rich loops that contain a G-A interaction may represent a divalent metal ion binding site in RNA. PMID:9409620
SA-Mot: a web server for the identification of motifs of interest extracted from protein loops
Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude
2011-01-01
The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr. PMID:21665924
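The "structural word" notion amounts to sliding a fixed-length window over a loop that has already been encoded as a string of structural-alphabet letters. The sketch below shows only that counting step; the function name, the default word length of four letters, and the toy string are assumptions for illustration, and the statistics SA-Mot computes on top of the counts are not reproduced.

```python
from collections import Counter

def structural_words(sa_sequence, word_length=4):
    """Count every overlapping word of word_length letters in a
    structural-alphabet string (one letter per local backbone state)."""
    last_start = len(sa_sequence) - word_length
    return Counter(sa_sequence[i:i + word_length]
                   for i in range(last_start + 1))

# Toy encoded loop, not real HMM-SA output.
print(structural_words("ABCABCA", word_length=3))
```

Comparing such counts across a loop data set is the starting point for asking which words recur more often than expected.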
SA-Mot: a web server for the identification of motifs of interest extracted from protein loops.
Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude
2011-07-01
The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr.
Identification of sequence-structure RNA binding motifs for SELEX-derived aptamers.
Hoinka, Jan; Zotenko, Elena; Friedman, Adam; Sauna, Zuben E; Przytycka, Teresa M
2012-06-15
Systematic Evolution of Ligands by EXponential Enrichment (SELEX) represents a state-of-the-art technology to isolate single-stranded (ribo)nucleic acid fragments, named aptamers, which bind to a molecule (or molecules) of interest via specific structural regions induced by their sequence-dependent fold. This powerful method has applications in designing protein inhibitors, molecular detection systems, therapeutic drugs and antibody replacement among others. However, full understanding and consequently optimal utilization of the process has lagged behind its wide application due to the lack of dedicated computational approaches. At the same time, the combination of SELEX with novel sequencing technologies is beginning to provide the data that will allow the examination of a variety of properties of the selection process. To close this gap we developed Aptamotif, a computational method for the identification of sequence-structure motifs in SELEX-derived aptamers. To increase the chances of identifying functional motifs, Aptamotif uses an ensemble-based approach. We validated the method using two published aptamer datasets containing experimentally determined motifs of increasing complexity. We were able to recreate the authors' findings to a high degree, thus proving the capability of our approach to identify binding motifs in SELEX data. Additionally, using our new experimental dataset, we illustrate the application of Aptamotif to elucidate several properties of the selection process.
Zimmermann, Nils E. R.; Horton, Matthew K.; Jain, Anubhav; ...
2017-11-13
Structure–property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors, as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal close packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zimmermann, Nils E. R.; Horton, Matthew K.; Jain, Anubhav
Structure–property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors, as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal close packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.
NASA Astrophysics Data System (ADS)
Susan, Anju; Joshi, Kavita
2014-04-01
Melting in finite size systems is an interesting but complex phenomenon. Many factors affect melting and, owing to their interdependencies, it is a challenging task to rationalize their roles in the phase transition. In this work, we demonstrate how the structural motif of the ground state influences the melting transition in small clusters. Here, we report a case with clusters of aluminum and gallium having the same number of atoms and valence electrons and a similar structural motif of the ground state, but drastically different melting temperatures. We have employed Born-Oppenheimer molecular dynamics to simulate the solid-like to liquid-like transition in these clusters. Our simulations have reproduced the experimental trends fairly well. Further, the detailed analysis of isomers has brought out the role of the ground state structure and underlying electronic structure in the finite temperature behavior of these clusters. For both clusters, the isomers accessible before the cluster melts show striking similarities and are strongly influenced by the structural motif of the ground state. Further, the shape of the heat capacity curve is similar in both cases, but the transition is more spread out for Al36, which is consistent with the observed isomerization pattern. Our simulations also suggest a way to characterize the transition region on the basis of the accessibility of the ground state at a specific temperature.
Zheng, Heping; Shabalin, Ivan G.; Handing, Katarzyna B.; Bujnicki, Janusz M.; Minor, Wladek
2015-01-01
The ubiquitous presence of magnesium ions in RNA has long been recognized as a key factor governing RNA folding, and is crucial for many diverse functions of RNA molecules. In this work, Mg2+-binding architectures in RNA were systematically studied using a database of RNA crystal structures from the Protein Data Bank (PDB). Due to the abundance of poorly modeled or incorrectly identified Mg2+ ions, the set of all sites was comprehensively validated and filtered to identify a benchmark dataset of 15 334 ‘reliable’ RNA-bound Mg2+ sites. The normalized frequencies by which specific RNA atoms coordinate Mg2+ were derived for both the inner and outer coordination spheres. A hierarchical classification system of Mg2+ sites in RNA structures was designed and applied to the benchmark dataset, yielding a set of 41 types of inner-sphere and 95 types of outer-sphere coordinating patterns. This classification system has also been applied to describe six previously reported Mg2+-binding motifs and detect them in new RNA structures. Investigation of the most populous site types resulted in the identification of seven novel Mg2+-binding motifs, and all RNA structures in the PDB were screened for the presence of these motifs. PMID:25800744
NASA Astrophysics Data System (ADS)
Krishnan, Gopi; Verheijen, Marcel A.; Ten Brink, Gert H.; Palasantzas, George; Kooi, Bart J.
2013-05-01
Nowadays bimetallic nanoparticles (NPs) have emerged as key materials for important modern applications in nanoplasmonics, catalysis, biodiagnostics, and nanomagnetics. Consequently the control of bimetallic structural motifs with specific shapes provides increasing functionality and selectivity for related applications. However, producing bimetallic NPs with well controlled structural motifs still remains a formidable challenge. Hence, we present here a general methodology for gas phase synthesis of bimetallic NPs with distinctively different structural motifs ranging at a single particle level from a fully mixed alloy to core-shell, to onion (multi-shell), and finally to a Janus/dumbbell, with the same overall particle composition. These concepts are illustrated for Mo-Cu NPs, where the precise control of the bimetallic NPs with various degrees of chemical ordering, including different shapes from spherical to cube, is achieved by tailoring the energy and thermal environment that the NPs experience during their production. The initial state of NP growth, either in the liquid or in the solid state phase, has important implications for the different structural motifs and shapes of synthesized NPs. Finally we demonstrate that we are able to tune the alloying regime, for the otherwise bulk immiscible Mo-Cu, by achieving an increase of the critical size, below which alloying occurs, closely up to an order of magnitude. It is discovered that the critical size of the NP alloy is not only affected by controlled tuning of the alloying temperature but also by the particle shape. Electronic supplementary information (ESI) available: Experimental details including schematics of the gas phase synthesis set up, target arrangement, synthesis condition for various structures, and TEM images of alloy, core-shell and Mo-Cu-Mo onion nanoparticles. See DOI: 10.1039/c3nr00565h
Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas
Petrov, Anton I.; Zirbel, Craig L.; Leontis, Neocles B.
2013-01-01
The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access. PMID:23970545
Selective integrin endocytosis is driven by interactions between the integrin α-chain and AP2
De Franceschi, Nicola; Arjonen, Antti; Elkhatib, Nadia; Denessiouk, Konstantin; Wrobel, Antoni G; Wilson, Thomas A; Pouwels, Jeroen; Montagnac, Guillaume; Owen, David J; Ivaska, Johanna
2016-01-01
Integrins are heterodimeric cell-surface adhesion molecules comprising one of 18 possible α-chains and one of 8 possible β-chains. They control a range of cell functions in a matrix- and ligand-specific manner. Integrins can be internalised by clathrin-mediated endocytosis (CME) through β subunit-based motifs found in all integrin heterodimers. However, whether specific integrin heterodimers can be selectively endocytosed was unknown. Here, we found that a subset of α subunits contain an evolutionarily conserved and functional YxxΦ motif directing integrins to selective internalisation by the most abundant endocytic clathrin adaptor, AP2. We determined the structure of the human integrin α4-tail motif in complex with the AP2 C-µ2 subunit and confirmed the interaction by isothermal titration calorimetry. Mutagenesis of the motif impaired selective heterodimer endocytosis and attenuated integrin-mediated cell migration. We propose that integrins evolved to enable selective integrin-receptor turnover in response to changing matrix conditions. PMID:26779610
Selective integrin endocytosis is driven by interactions between the integrin α-chain and AP2.
De Franceschi, Nicola; Arjonen, Antti; Elkhatib, Nadia; Denessiouk, Konstantin; Wrobel, Antoni G; Wilson, Thomas A; Pouwels, Jeroen; Montagnac, Guillaume; Owen, David J; Ivaska, Johanna
2016-02-01
Integrins are heterodimeric cell-surface adhesion molecules comprising one of 18 possible α-chains and one of eight possible β-chains. They control a range of cell functions in a matrix- and ligand-specific manner. Integrins can be internalized by clathrin-mediated endocytosis (CME) through β subunit-based motifs found in all integrin heterodimers. However, whether specific integrin heterodimers can be selectively endocytosed was unknown. Here, we found that a subset of α subunits contain an evolutionarily conserved and functional YxxΦ motif directing integrins to selective internalization by the most abundant endocytic clathrin adaptor, AP2. We determined the structure of the human integrin α4-tail motif in complex with the AP2 C-μ2 subunit and confirmed the interaction by isothermal titration calorimetry. Mutagenesis of the motif impaired selective heterodimer endocytosis and attenuated integrin-mediated cell migration. We propose that integrins evolved to enable selective integrin-receptor turnover in response to changing matrix conditions.
Substrate specificity and reaction kinetics of an X-motif ribozyme
LAZAREV, DENIS; PUSKARZ, IZABELA; BREAKER, RONALD R.
2003-01-01
The X-motif is an in vitro-selected ribozyme that catalyzes RNA cleavage by an internal phosphoester transfer reaction. This ribozyme class is distinguished by the fact that it emerged as the dominant clone among at least 12 different classes of ribozymes when in vitro selection was conducted to favor the isolation of high-speed catalysts. We have examined the structural and kinetic properties of the X-motif in order to provide a framework for its application as an RNA-cleaving agent and to explore how this ribozyme catalyzes phosphoester transfer with a predicted rate constant that is similar to those exhibited by the four natural self-cleaving ribozymes. The secondary structure of the X-motif includes four stem elements that form a central unpaired junction. In a bimolecular format, two of these base-paired arms define the substrate specificity of the ribozyme and can be changed to target different RNAs for cleavage. The requirements for nucleotide identity at the cleavage site are GD, where D = G, A, or U and cleavage occurs between the two nucleotides. The ribozyme has an absolute requirement for a divalent cation cofactor and exhibits kinetic behavior that is consistent with the obligate binding of at least two metal ions. PMID:12756327
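The cleavage-site rule stated in this abstract (a G followed by D, where D = G, A, or U, with cleavage between the two nucleotides) can be illustrated with a short scan of a target sequence. The following is only a minimal sketch: the function name and the example sequence are hypothetical, the conversion of T to U is an added convenience, and the sketch ignores the base-paired substrate-binding arms that the abstract says define specificity.

```python
import re


def candidate_cleavage_sites(rna: str) -> list[int]:
    """Return 1-based positions p of each G that is followed by G, A, or U.

    Cleavage would fall between nucleotide p and nucleotide p + 1.
    Hypothetical helper: it applies only the GD dinucleotide rule.
    """
    # Assumption: DNA-style input is accepted and read as RNA.
    rna = rna.upper().replace("T", "U")
    # The lookahead does not consume D, so overlapping sites (e.g. GGG) are all found.
    return [m.start() + 1 for m in re.finditer(r"G(?=[GAU])", rna)]


if __name__ == "__main__":
    print(candidate_cleavage_sites("AGGCUGACGU"))
```

A real target search would additionally require that the flanking sequence can pair with the two ribozyme arms, which is not modeled here.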
ssHMM: extracting intuitive sequence-structure motifs from high-throughput RNA-binding protein data
Krestel, Ralf; Ohler, Uwe; Vingron, Martin; Marsico, Annalisa
2017-01-01
RNA-binding proteins (RBPs) play an important role in RNA post-transcriptional regulation and recognize target RNAs via sequence-structure motifs. The extent to which RNA structure influences protein binding in the presence or absence of a sequence motif is still poorly understood. Existing RNA motif finders either take the structure of the RNA only partially into account, or employ models which are not directly interpretable as sequence-structure motifs. We developed ssHMM, an RNA motif finder based on a hidden Markov model (HMM) and Gibbs sampling which fully captures the relationship between RNA sequence and secondary structure preference of a given RBP. Compared to previous methods which output separate logos for sequence and structure, it directly produces a combined sequence-structure motif when trained on a large set of sequences. ssHMM’s model is visualized intuitively as a graph and facilitates biological interpretation. ssHMM can be used to find novel bona fide sequence-structure motifs of uncharacterized RBPs, such as the one presented here for the YY1 protein. ssHMM reaches a high motif recovery rate on synthetic data, it recovers known RBP motifs from CLIP-Seq data, and scales linearly on the input size, being considerably faster than MEMERIS and RNAcontext on large datasets while being on par with GraphProt. It is freely available on GitHub and as a Docker image. PMID:28977546
Classification and assessment tools for structural motif discovery algorithms.
Badr, Ghada; Al-Turaiki, Isra; Mathkour, Hassan
2013-01-01
Motif discovery is the problem of finding recurring patterns in biological data. Patterns can be sequential, mainly when discovered in DNA sequences. They can also be structural (e.g. when discovering RNA motifs). Finding common structural patterns helps to gain a better understanding of the mechanism of action (e.g. post-transcriptional regulation). Unlike DNA motifs, which are sequentially conserved, RNA motifs exhibit conservation in structure, which may be common even if the sequences are different. Over the past few years, hundreds of algorithms have been developed to solve the sequential motif discovery problem, while less work has been done for the structural case. In this paper, we survey, classify, and compare different algorithms that solve the structural motif discovery problem, where the underlying sequences may be different. We highlight their strengths and weaknesses. We start by proposing a benchmark dataset and a measurement tool that can be used to evaluate different motif discovery approaches. Then, we proceed by proposing our experimental setup. Finally, results are obtained using the proposed benchmark to compare available tools. To the best of our knowledge, this is the first attempt to compare tools solely designed for structural motif discovery. Results show that the accuracy of discovered motifs is relatively low. The results also suggest a complementary behavior among tools where some tools perform well on simple structures, while other tools are better for complex structures. We have classified and evaluated the performance of available structural motif discovery tools. In addition, we have proposed a benchmark dataset with tools that can be used to evaluate newly developed tools.
Pomel, S; Rodrigo, J; Hendra, F; Cavé, C; Loiseau, P M
2012-02-01
Leishmaniases are tropical and sub-tropical diseases for which classical drugs (i.e. antimonials) exhibit toxicity and drug resistance. This situation calls for new chemical series with antileishmanial activity. This work analyzes the structure of a validated target in Leishmania: the GDP-mannose pyrophosphorylase (GDP-MP), an enzyme involved in glycosylation and essential for amastigote survival. By comparing 3D homology models of human and L. infantum GDP-MP, we identified (i) a common motif of amino acids that binds to the mannose moiety of the substrate and, interestingly, (ii) a motif that is specific to the catalytic site of the parasite enzyme. This motif could then be used to design compounds that specifically inhibit the leishmanial GDP-MP, without any effect on the human homolog.
Han, S; Arvai, A S; Clancy, S B; Tainer, J A
2001-01-05
Clostridium botulinum C3 exoenzyme inactivates the small GTP-binding protein family Rho by ADP-ribosylating asparagine 41, which depolymerizes the actin cytoskeleton. C3 thus represents a major family of the bacterial toxins that transfer the ADP-ribose moiety of NAD to specific amino acids in acceptor proteins to modify key biological activities in eukaryotic cells, including protein synthesis, differentiation, transformation, and intracellular signaling. The 1.7 Å resolution C3 exoenzyme structure establishes the conserved features of the core NAD-binding beta-sandwich fold with other ADP-ribosylating toxins despite little sequence conservation. Importantly, the central core of the C3 exoenzyme structure is distinguished by the absence of an active site loop observed in many other ADP-ribosylating toxins. Unlike the ADP-ribosylating toxins that possess the active site loop near the central core, the C3 exoenzyme replaces the active site loop with an alpha-helix, alpha3. Moreover, structural and sequence similarities with the catalytic domain of vegetative insecticidal protein 2 (VIP2), an actin ADP-ribosyltransferase, unexpectedly implicate two adjacent, protruding turns, which join beta5 and beta6 of the toxin core fold, as a novel recognition specificity motif for this newly defined toxin family. Turn 1 evidently positions the solvent-exposed, aromatic side-chain of Phe209 to interact with the hydrophobic region of Rho adjacent to its GTP-binding site. Turn 2 evidently both places the Gln212 side-chain for hydrogen bonding to recognize Rho Asn41 for nucleophilic attack on the anomeric carbon of NAD ribose and holds the key Glu214 catalytic side-chain in the adjacent catalytic pocket. This proposed bipartite ADP-ribosylating toxin turn-turn (ARTT) motif places the VIP2 and C3 toxin classes into a single ARTT family characterized by analogous target protein recognition via turn 1 aromatic and turn 2 hydrogen-bonding side-chain moieties.
Turn 2 centrally anchors the catalytic Glu214 within the ARTT motif, and furthermore distinguishes the C3 toxin class by a conserved turn 2 Gln and the VIP2 binary toxin class by a conserved turn 2 Glu for appropriate target side-chain hydrogen-bonding recognition. Taken together, these structural results provide a molecular basis for understanding the coupled activity and recognition specificity for C3 and for the newly defined ARTT toxin family, which acts in the depolymerization of the actin cytoskeleton. This beta5 to beta6 region of the toxin fold represents an experimentally testable and potentially general recognition motif region for other ADP-ribosylating toxins that have a similar beta-structure framework. Copyright 2001 Academic Press.
NASA Astrophysics Data System (ADS)
Wei Poh, Zhong; Heng Gan, Chin; Lee, Eric J.; Guo, Suxian; Yip, George W.; Lam, Yulin
2015-09-01
Glycosaminoglycans (GAGs) regulate many important physiological processes. A pertinent issue to address is whether GAGs encode important functional information via introduction of position specific sulfate groups in the GAG structure. However, procurement of pure, homogenous GAG motifs to probe the “sulfation code” is a challenging task due to isolation difficulty and structural complexity. To this end, we devised a versatile synthetic strategy to obtain all the 16 theoretically possible sulfation patterns in the chondroitin sulfate (CS) repeating unit; these include rare but potentially important sulfated motifs which have not been isolated earlier. Biological evaluation indicated that CS sulfation patterns had differing effects for different breast cancer cell types, and the greatest inhibitory effect was observed for the most aggressive, triple negative breast cancer cell line MDA-MB-231.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Teplova,M.; Yuan, Y.; Phan, A.
2006-01-01
The nuclear phosphoprotein La was identified as an autoantigen in patients with systemic lupus erythematosus and Sjogren's syndrome. La binds to and protects the UUUOH 3' termini of nascent RNA polymerase III transcripts from exonuclease digestion. We report the 1.85 Å crystal structure of the N-terminal domain of human La, consisting of La and RRM1 motifs, bound to r(U1-G2-C3-U4-G5-U6-U7-U8-U9OH). The U7-U8-U9OH 3' end, in a splayed-apart orientation, is sequestered within a basic and aromatic amino acid-lined cleft between the La and RRM1 motifs. The specificity-determining U8 residue bridges both motifs, in part through unprecedented targeting of the β sheet edge, rather than the anticipated face, of the RRM1 motif. Our structural observations, supported by mutation studies of both La and RNA components, illustrate the principles behind RNA sequestration by a rheumatic disease autoantigen, whereby the UUUOH 3' ends of nascent RNA transcripts are protected during downstream processing and maturation events.
Teplova, Marianna; Yuan, Yu-Ren; Phan, Anh Tuân; Malinina, Lucy; Ilin, Serge; Teplov, Alexei; Patel, Dinshaw J
2006-01-06
The nuclear phosphoprotein La was identified as an autoantigen in patients with systemic lupus erythematosus and Sjogren's syndrome. La binds to and protects the UUU(OH) 3' termini of nascent RNA polymerase III transcripts from exonuclease digestion. We report the 1.85 Å crystal structure of the N-terminal domain of human La, consisting of La and RRM1 motifs, bound to r(U1-G2-C3-U4-G5-U6-U7-U8-U9OH). The U7-U8-U9OH 3' end, in a splayed-apart orientation, is sequestered within a basic and aromatic amino acid-lined cleft between the La and RRM1 motifs. The specificity-determining U8 residue bridges both motifs, in part through unprecedented targeting of the beta sheet edge, rather than the anticipated face, of the RRM1 motif. Our structural observations, supported by mutation studies of both La and RNA components, illustrate the principles behind RNA sequestration by a rheumatic disease autoantigen, whereby the UUU(OH) 3' ends of nascent RNA transcripts are protected during downstream processing and maturation events.
Role of sequence encoded κB DNA geometry in gene regulation by Dorsal
Mrinal, Nirotpal; Tomar, Archana; Nagaraju, Javaregowda
2011-01-01
Many proteins of the Rel family can act as both transcriptional activators and repressors. However, the mechanism that distinguishes the ‘activator/repressor’ functions of Rel proteins such as Dorsal (the Drosophila homologue of mammalian NFκB) is not understood. Using genomic, biophysical and biochemical approaches, we demonstrate that the underlying principle of this functional specificity lies in the ‘sequence-encoded structure’ of the κB-DNA. We show that Dorsal-binding motifs exist in distinct activator and repressor conformations. Molecular dynamics of DNA-Dorsal complexes revealed that repressor κB-motifs typically have an A-tract and a flexible conformation that facilitates interaction with co-repressors. The deformable structure of repressor motifs is due to changes in the hydrogen bonding of the A:T pair in the ‘A-tract’ core. The sixth nucleotide in the nonameric κB-motif, ‘A’ (A6) in the repressor motifs and ‘T’ (T6) in the activator motifs, is critical for this functional specificity, as an A6 → T6 mutation transformed the flexible repressor conformation into a rigid activator conformation. These results highlight that ‘sequence encoded κB DNA-geometry’ regulates gene expression by exerting an allosteric effect on the binding of Rel proteins, which in turn regulates interaction with co-regulators. Further, we identified and characterized putative repressor motifs in Dl-target genes, which can potentially aid in functional annotation of the Dorsal gene regulatory network. PMID:21890896
Identification of sequence–structure RNA binding motifs for SELEX-derived aptamers
Hoinka, Jan; Zotenko, Elena; Friedman, Adam; Sauna, Zuben E.; Przytycka, Teresa M.
2012-01-01
Motivation: Systematic Evolution of Ligands by EXponential Enrichment (SELEX) represents a state-of-the-art technology to isolate single-stranded (ribo)nucleic acid fragments, named aptamers, which bind to a molecule (or molecules) of interest via specific structural regions induced by their sequence-dependent fold. This powerful method has applications in designing protein inhibitors, molecular detection systems, therapeutic drugs and antibody replacement among others. However, full understanding and consequently optimal utilization of the process has lagged behind its wide application due to the lack of dedicated computational approaches. At the same time, the combination of SELEX with novel sequencing technologies is beginning to provide the data that will allow the examination of a variety of properties of the selection process. Results: To close this gap we developed Aptamotif, a computational method for the identification of sequence–structure motifs in SELEX-derived aptamers. To increase the chances of identifying functional motifs, Aptamotif uses an ensemble-based approach. We validated the method using two published aptamer datasets containing experimentally determined motifs of increasing complexity. We were able to recreate the authors' findings to a high degree, thus demonstrating the capability of our approach to identify binding motifs in SELEX data. Additionally, using our new experimental dataset, we illustrate the application of Aptamotif to elucidate several properties of the selection process. Contact: przytyck@ncbi.nlm.nih.gov, Zuben.Sauna@fda.hhs.gov PMID:22689764
Detection of core-periphery structure in networks based on 3-tuple motifs
NASA Astrophysics Data System (ADS)
Ma, Chuang; Xiang, Bing-Bing; Chen, Han-Shuang; Small, Michael; Zhang, Hai-Feng
2018-05-01
Detecting mesoscale structure, such as community structure, is of vital importance for analyzing complex networks. Recently, a new mesoscale structure, core-periphery (CP) structure, has been identified in many real-world systems. In this paper, we propose an effective algorithm for detecting CP structure based on a 3-tuple motif. In this algorithm, we first define a 3-tuple motif in terms of the patterns of edges as well as the property of nodes, and then a motif adjacency matrix is constructed based on the 3-tuple motif. Finally, the problem is converted into finding the cluster with the smallest motif conductance. Our algorithm works well for different CP structures, including single or multiple CP structures and local or global CP structures. Results on synthetic and empirical networks validate the high performance of our method.
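The two building blocks named in this abstract, a motif adjacency matrix and a conductance computed on it, can be sketched for a simpler stand-in motif. The example below assumes an undirected triangle motif, because the abstract does not spell out the paper's 3-tuple motif (which also depends on node properties); the function names and the toy graph are hypothetical and are not taken from the paper.

```python
import numpy as np


def triangle_motif_adjacency(A: np.ndarray) -> np.ndarray:
    """W[i, j] = number of triangles that contain the edge (i, j).

    Assumption: the graph is undirected and simple; the triangle is a
    stand-in for the paper's 3-tuple motif, which is not reproduced here.
    """
    A = (np.asarray(A) > 0).astype(int)  # binarize; astype returns a copy
    np.fill_diagonal(A, 0)               # ignore self-loops
    # (A @ A)[i, j] counts common neighbours; masking by A keeps real edges only.
    return (A @ A) * A


def motif_conductance(W: np.ndarray, S: set[int]) -> float:
    """Edge-weighted conductance of node set S on the motif adjacency matrix W."""
    in_S = np.zeros(W.shape[0], dtype=bool)
    in_S[list(S)] = True
    cut = W[in_S][:, ~in_S].sum()        # motif weight crossing the boundary
    vol_S = W[in_S].sum()                # weighted degree sum inside S
    vol_rest = W[~in_S].sum()            # weighted degree sum outside S
    denom = min(vol_S, vol_rest)
    return float("inf") if denom == 0 else float(cut / denom)


def toy_graph() -> np.ndarray:
    """A 4-clique on nodes 0-3 with a triangle 3-4-5 attached at node 3."""
    A = np.zeros((6, 6), dtype=int)
    edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3), (3, 4), (3, 5), (4, 5)]
    for i, j in edges:
        A[i, j] = A[j, i] = 1
    return A


if __name__ == "__main__":
    W = triangle_motif_adjacency(toy_graph())
    print(W)
    print(motif_conductance(W, {0, 1, 2, 3}))
```

Finding the cluster with the smallest conductance is usually done with a spectral sweep rather than by enumerating sets; that step is omitted here to keep the sketch short.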
Krystkowiak, Izabella; Manguy, Jean; Davey, Norman E
2018-06-05
There is a pressing need for in silico tools that can aid in the identification of the complete repertoire of protein binding (SLiMs, MoRFs, miniMotifs) and modification (moiety attachment/removal, isomerization, cleavage) motifs. We have created PSSMSearch, an interactive web-based tool for rapid statistical modeling, visualization, discovery and annotation of protein motif specificity determinants to discover novel motifs in a proteome-wide manner. PSSMSearch analyses proteomes for regions with significant similarity to a motif specificity determinant model built from a set of aligned motif-containing peptides. Multiple scoring methods are available to build a position-specific scoring matrix (PSSM) describing the motif specificity determinant model. This model can then be modified by a user to add prior knowledge of specificity determinants through an interactive PSSM heatmap. PSSMSearch includes a statistical framework to calculate the significance of specificity determinant model matches against a proteome of interest. PSSMSearch also includes the SLiMSearch framework's annotation, motif functional analysis and filtering tools to highlight relevant discriminatory information. Additional tools to annotate statistically significant shared keywords and GO terms, or experimental evidence of interaction with a motif-recognizing protein have been added. Finally, PSSM-based conservation metrics have been created for taxonomic range analyses. The PSSMSearch web server is available at http://slim.ucd.ie/pssmsearch/.
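The core idea described here, building a position-specific scoring matrix from aligned motif-containing peptides and scanning sequences for similar regions, can be illustrated with a small self-contained sketch. It assumes one particular scoring scheme (log-odds against a uniform background with a pseudocount), which is only one of many possible methods and is not claimed to be what PSSMSearch uses; the peptides, the sequence, and the function names are made up for illustration, and no significance statistics are computed.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_pssm(peptides, pseudocount=1.0):
    """Build a log-odds PSSM (one dict per column) from equal-length peptides.

    Assumptions: uniform background frequencies and an additive pseudocount.
    """
    width = len(peptides[0])
    if any(len(p) != width for p in peptides):
        raise ValueError("peptides must be aligned to the same length")
    background = 1.0 / len(AMINO_ACIDS)
    total = len(peptides) + pseudocount * len(AMINO_ACIDS)
    pssm = []
    for col in range(width):
        counts = Counter(p[col] for p in peptides)
        pssm.append({
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        })
    return pssm


def scan(sequence, pssm):
    """Score every window of the sequence; return (start, window, score), best first."""
    width = len(pssm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        if all(aa in pssm[0] for aa in window):  # skip non-standard residues
            score = sum(pssm[i][aa] for i, aa in enumerate(window))
            hits.append((start, window, score))
    return sorted(hits, key=lambda h: h[2], reverse=True)


if __name__ == "__main__":
    toy_peptides = ["YQRL", "YKSL", "YTRF", "YEKI"]  # hypothetical aligned peptides
    for hit in scan("MAAYQRLGGS", build_pssm(toy_peptides))[:3]:
        print(hit)
```

Ranking by raw score is only a starting point; a proteome-wide search needs a statistical model of how often such scores arise by chance.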
Solution structure of CEH-37 homeodomain of the nematode Caenorhabditis elegans
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moon, Sunjin; Lee, Yong Woo; Kim, Woo Taek
Highlights: • We have determined solution structures of CEH-37 homeodomain. • CEH-37 HD has a compact α-helical structure with HTH DNA binding motif. • Solution structure of CEH-37 HD shares its molecular topology with that of the homeodomain proteins. • Residues in the N-terminal region and HTH motif are important in binding to Caenorhabditis elegans telomeric DNA. • CEH-37 could play an important role in telomere function via DNA binding. Abstract: The nematode Caenorhabditis elegans protein CEH-37 belongs to the paired OTD/OTX family of homeobox-containing homeodomain proteins. CEH-37 shares sequence similarity with homeodomain proteins, although it specifically binds to double-stranded C. elegans telomeric DNA, which is unusual for homeodomain proteins. Here, we report the solution structure of CEH-37 homeodomain and molecular interaction with double-stranded C. elegans telomeric DNA using nuclear magnetic resonance (NMR) spectroscopy. NMR structure shows that CEH-37 homeodomain is composed of a flexible N-terminal region and three α-helices with a helix-turn-helix (HTH) DNA binding motif. Data from size-exclusion chromatography and fluorescence spectroscopy reveal that CEH-37 homeodomain interacts strongly with double-stranded C. elegans telomeric DNA. NMR titration experiments identified residues responsible for specific binding to nematode double-stranded telomeric DNA. These results suggest that the C. elegans homeodomain protein CEH-37 could play an important role in telomere function via DNA binding.
Effect of C(60) fullerene on the duplex formation of i-motif DNA with complementary DNA in solution.
Jin, Kyeong Sik; Shin, Su Ryon; Ahn, Byungcheol; Jin, Sangwoo; Rho, Yecheol; Kim, Heesoo; Kim, Seon Jeong; Ree, Moonhor
2010-04-15
The structural effects of fullerene on i-motif DNA were investigated by characterizing the structures of fullerene-free and fullerene-bound i-motif DNA, in the presence of cDNA and in solutions of varying pH, using circular dichroism and synchrotron small-angle X-ray scattering. To facilitate a direct structural comparison between the i-motif and duplex structures in response to pH stimulus, we developed atomic scale structural models for the duplex and i-motif DNA structures, and for the C(60)/i-motif DNA hybrid associated with the cDNA strand, assuming that the DNA strands are present in an ideal right-handed helical conformation. We found that fullerene shifted the pH-induced conformational transition between the i-motif and the duplex structure, possibly due to the hydrophobic interactions between the terminal fullerenes and between the terminal fullerenes and an internal TAA loop in the DNA strand. The hybrid structure showed a dramatic reduction in cyclic hysteresis.
Identifying DNA-binding proteins using structural motifs and the electrostatic potential
Shanahan, Hugh P.; Garcia, Mario A.; Jones, Susan; Thornton, Janet M.
2004-01-01
Robust methods to detect DNA-binding proteins from structures of unknown function are important for structural biology. This paper describes a method for identifying such proteins that have (i) a solvent-accessible structural motif necessary for DNA binding and (ii) a positive electrostatic potential in the binding region. We focus on three structural motifs: helix–turn–helix (HTH), helix–hairpin–helix (HhH) and helix–loop–helix (HLH). We find that the combination of these variables detects 78% of proteins with an HTH motif, which is a substantial improvement over previous work based purely on structural templates and is comparable to more complex methods of identifying DNA-binding proteins. Similar true positive fractions are achieved for the HhH and HLH motifs. We see evidence of wide evolutionary diversity for DNA-binding proteins with an HTH motif, and much smaller diversity for those with an HhH or HLH motif. PMID:15356290
Peña, Maria J; Darvill, Alan G; Eberhard, Stefan; York, William S; O'Neill, Malcolm A
2008-11-01
Xyloglucan is a well-characterized hemicellulosic polysaccharide that is present in the cell walls of all seed-bearing plants. The cell walls of avascular and seedless vascular plants are also believed to contain xyloglucan. However, these xyloglucans have not been structurally characterized. This lack of information is an impediment to understanding changes in xyloglucan structure that occurred during land plant evolution. In this study, xyloglucans were isolated from the walls of avascular (liverworts, mosses, and hornworts) and seedless vascular plants (club and spike mosses and ferns and fern allies). Each xyloglucan was fragmented with a xyloglucan-specific endo-glucanase and the resulting oligosaccharides then structurally characterized using NMR spectroscopy, MALDI-TOF and electrospray mass spectrometry, and glycosyl-linkage and glycosyl residue composition analyses. Our data show that xyloglucan is present in the cell walls of all major divisions of land plants and that these xyloglucans have several common structural motifs. However, these polysaccharides are not identical because specific plant groups synthesize xyloglucans with unique structural motifs. For example, the moss Physcomitrella patens and the liverwort Marchantia polymorpha synthesize XXGGG- and XXGG-type xyloglucans, respectively, with sidechains that contain a beta-D-galactosyluronic acid and a branched xylosyl residue. By contrast, hornworts synthesize XXXG-type xyloglucans that are structurally homologous to the xyloglucans synthesized by many seed-bearing and seedless vascular plants. Our results increase our understanding of the evolution, diversity, and function of structural motifs in land-plant xyloglucans and provide support to the proposal that hornworts are sisters to the vascular plants.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Suhkmann; Zhang, Ziming; Upchurch, Sean
2004-04-16
ARID is a homologous family of DNA-binding domains that occur in DNA-binding proteins from a wide variety of species, ranging from yeast to nematodes, insects, mammals and plants. SWI1, a member of the SWI/SNF protein complex that is involved in chromatin remodeling during transcription, contains the ARID motif. The ARID domain of human SWI1 (also known as p270) does not select for a specific DNA sequence from a random sequence pool. The lack of sequence specificity shown by the SWI1 ARID domain stands in contrast to the other characterized ARID domains, which recognize specific AT-rich sequences. We have solved the three-dimensional structure of human SWI1 ARID using solution NMR methods. In addition, we have characterized non-specific DNA binding by the SWI1 ARID domain. Results from this study indicate that a flexible long internal loop in the ARID motif is likely to be important for sequence-specific DNA recognition. The structure of the human SWI1 ARID domain also represents a distinct structural subfamily. Studies of ARID indicate that the boundaries of the DNA-binding structural and functional domains can extend beyond the sequence-homologous region in a homologous family of proteins. Structural studies of homologous domains such as the ARID family of DNA-binding domains should provide information to better predict the boundaries of structural and functional domains in structural genomic studies. Key Words: ARID, SWI1, NMR, structural genomics, protein-DNA interaction.
Johnson, Glynis; Moore, Samuel W
2013-09-01
Short linear motifs confer evolutionary flexibility on proteins as they can be added with relative ease allowing the acquisition of new functions. Such motifs may mediate a variety of signalling functions. The adhesion-mediating Leu-Arg-Glu (LRE) motif is enriched in laminin beta 2, and has been observed in other proteins, including members of the carboxylesterase/cholinesterase family. It acts as a stop signal for growing axons in the developing neuromuscular junction, binding to the voltage-gated calcium channel. In this bioinformatic analysis, we have investigated the presence of the motif in proteins of the neuromuscular junction, and have also examined its structural position and potential for ligand interaction, as well as phylogenetic conservation, in the carboxylesterase/cholinesterase family. The motif was observed to occur with a significantly higher frequency than expected in the UniProt/Swiss-Prot database, as well as in four individual species (human, mouse, Caenorhabditis elegans and Drosophila melanogaster). Examination of its presence in neuromuscular junction proteins showed it to be enriched in certain proteins of the synaptic basement membrane, including laminin, agrin, acetylcholinesterase and tenascin. A highly significant enrichment was observed in cytoskeletal proteins, particularly intermediate filament proteins and members of the spectrin family. In the carboxylesterase/cholinesterase family, the motif was observed in four conserved positions in the protein structure. It is present in the majority of mammalian acetylcholinesterases, as well as acetylcholinesterases from electric fish and a number of invertebrates. In insects, it is present in the ace-2, rather than in the synaptic ace-1, enzyme. It is also observed in the cholinesterase-like adhesion molecules (neuroligins, neurotactin and glutactin). It is never seen in butyrylcholinesterases, which do not mediate cell adhesion. 
In conclusion, the significant enrichment of the motif in certain classes of protein, as well as its conserved presence and structural positioning in one protein family, suggests that it has specific functions both in cell adhesion in the neuromuscular junction and in maintaining the structural integrity of the cytoskeleton. Copyright © 2013 Elsevier Inc. All rights reserved.
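The comparison of observed and expected motif frequency described in this abstract can be illustrated with a minimal sketch. It assumes a simple null model in which residues are independent and drawn from the pooled amino acid composition of the input sequences; the abstract does not state which statistical model the authors used, so the function names and the toy sequences below are hypothetical.

```python
from collections import Counter


def observed_count(sequences, motif="LRE"):
    """Count (possibly overlapping) occurrences of motif across all sequences."""
    k = len(motif)
    return sum(
        1
        for seq in sequences
        for i in range(len(seq) - k + 1)
        if seq[i:i + k] == motif
    )


def expected_count(sequences, motif="LRE"):
    """Expected count under an independent-residue null model (an assumption)."""
    composition = Counter("".join(sequences))
    total = sum(composition.values())
    # Probability that a random window spells the motif.
    p = 1.0
    for residue in motif:
        p *= composition[residue] / total
    # Number of windows in which the motif could start.
    windows = sum(max(len(seq) - len(motif) + 1, 0) for seq in sequences)
    return p * windows


# Hypothetical toy sequences, not real proteins.
toy = ["MLREALREK", "GGLRE"]
print(observed_count(toy), expected_count(toy))
```

A ratio of observed to expected well above 1 would hint at enrichment, although a real analysis would need a proper significance test and a database-scale background.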
Boehm, Elizabeth M.; Powers, Kyle T.; Kondratick, Christine M.; Spies, Maria; Houtman, Jon C. D.; Washington, M. Todd
2016-01-01
Y-family DNA polymerases, such as polymerase η, polymerase ι, and polymerase κ, catalyze the bypass of DNA damage during translesion synthesis. These enzymes are recruited to sites of DNA damage by interacting with the essential replication accessory protein proliferating cell nuclear antigen (PCNA) and the scaffold protein Rev1. In most Y-family polymerases, these interactions are mediated by one or more conserved PCNA-interacting protein (PIP) motifs that bind in a hydrophobic pocket on the front side of PCNA as well as by conserved Rev1-interacting region (RIR) motifs that bind in a hydrophobic pocket on the C-terminal domain of Rev1. Yeast polymerase η, a prototypical translesion synthesis polymerase, binds both PCNA and Rev1. It possesses a single PIP motif but not an RIR motif. Here we show that the PIP motif of yeast polymerase η mediates its interactions both with PCNA and with Rev1. Moreover, the PIP motif of polymerase η binds in the hydrophobic pocket on the Rev1 C-terminal domain. We also show that the RIR motif of human polymerase κ and the PIP motif of yeast Msh6 bind both PCNA and Rev1. Overall, these findings demonstrate that PIP motifs and RIR motifs have overlapping specificities and can interact with both PCNA and Rev1 in structurally similar ways. These findings also suggest that PIP motifs are a more versatile protein interaction motif than previously believed. PMID:26903512
Stewart, Mikaela; Dunlap, Tori; Dourlain, Elizabeth; Grant, Bryce; McFail-Isom, Lori
2013-01-01
The fine conformational subtleties of DNA structure modulate many fundamental cellular processes including gene activation/repression, cellular division, and DNA repair. Most of these cellular processes rely on the conformational heterogeneity of specific DNA sequences. Factors including those structural characteristics inherent in the particular base sequence as well as those induced through interaction with solvent components combine to produce fine DNA structural variation including helical flexibility and conformation. Cation-pi interactions between solvent cations or their first hydration shell waters and the faces of DNA bases form sequence selectively and contribute to DNA structural heterogeneity. In this paper, we detect and characterize the binding patterns found in cation-pi interactions between solvent cations and DNA bases in a set of high resolution x-ray crystal structures. Specifically, we found that monovalent cations (Tl⁺) and the polarized first hydration shell waters of divalent cations (Mg²⁺, Ca²⁺) form cation-pi interactions with DNA bases stabilizing unstacked conformations. When these cation-pi interactions are combined with electrostatic interactions a pattern of specific binding motifs is formed within the grooves. PMID:23940752
An intercalation-locked parallel-stranded DNA tetraplex
Tripathi, S.; Zhang, D.; Paukstelis, P. J.
2015-01-27
DNA has proved to be an excellent material for nanoscale construction because complementary DNA duplexes are programmable and structurally predictable. However, in the absence of Watson–Crick pairings, DNA can be structurally more diverse. Here, we describe the crystal structures of d(ACTCGGATGAT) and the brominated derivative, d(AC BrUCGGA BrUGAT). These oligonucleotides form parallel-stranded duplexes with a crystallographically equivalent strand, resulting in the first examples of DNA crystal structures that contain four different symmetric homo base pairs. Two of the parallel-stranded duplexes are coaxially stacked in opposite directions and locked together to form a tetraplex through intercalation of the 5'-most A–A base pairs between adjacent G–G pairs in the partner duplex. The intercalation region is a new type of DNA tertiary structural motif with similarities to the i-motif. ¹H–¹H nuclear magnetic resonance and native gel electrophoresis confirmed the formation of a parallel-stranded duplex in solution. Finally, we modified specific nucleotide positions and added d(GAY) motifs to oligonucleotides and were readily able to obtain similar crystals. This suggests that this parallel-stranded DNA structure may be useful in the rational design of DNA crystals and nanostructures.
Rapid search for tertiary fragments reveals protein sequence–structure relationships
Zhou, Jianfu; Grigoryan, Gevorg
2015-01-01
Finding backbone substructures from the Protein Data Bank that match an arbitrary query structural motif, composed of multiple disjoint segments, is a problem of growing relevance in structure prediction and protein design. Although numerous protein structure search approaches have been proposed, methods that address this specific task without additional restrictions and on practical time scales are generally lacking. Here, we propose a solution, dubbed MASTER, that is both rapid, enabling searches over the Protein Data Bank in a matter of seconds, and provably correct, finding all matches below a user-specified root-mean-square deviation cutoff. We show that despite the potentially exponential time complexity of the problem, running times in practice are modest even for queries with many segments. The ability to explore naturally plausible structural and sequence variations around a given motif has the potential to synthesize its design principles in an automated manner; so we go on to illustrate the utility of MASTER to protein structural biology. We demonstrate its capacity to rapidly establish structure–sequence relationships, uncover the native designability landscapes of tertiary structural motifs, identify structural signatures of binding, and automatically rewire protein topologies. Given the broad utility of protein tertiary fragment searches, we hope that providing MASTER in an open-source format will enable novel advances in understanding, predicting, and designing protein structure. PMID:25420575
Hyperactive antifreeze proteins from longhorn beetles: some structural insights.
Kristiansen, Erlend; Wilkens, Casper; Vincents, Bjarne; Friis, Dennis; Lorentzen, Anders Blomkild; Jenssen, Håvard; Løbner-Olesen, Anders; Ramløv, Hans
2012-11-01
This study reports on structural characteristics of hyperactive antifreeze proteins (AFPs) from two species of longhorn beetles. In Rhagium mordax, eight unique mRNAs coding for five different mature AFPs were identified from cold-hardy individuals. These AFPs are apparently homologous to a previously characterized AFP from the closely related species Rhagium inquisitor, and consist of six identifiable repeats of a putative ice-binding motif TxTxTxT spaced irregularly apart by segments varying in length from 13 to 20 residues. Circular dichroism spectra show that the AFPs from both species have a high content of β-sheet and low levels of α-helix and random coil. Theoretical predictions of residue-specific secondary structure locate these β-sheets within the putative ice-binding motifs and the central parts of the segments separating them, consistent with an overall β-helical structure with the ice-binding motifs stacked in a β-sheet on one side of the coil. Molecular dynamics models based on these findings show that these AFPs would be energetically stable in a β-helical conformation. Copyright © 2012 Elsevier Ltd. All rights reserved.
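Locating the TxTxTxT repeats and the lengths of the segments between them can be sketched with a regular expression. This is only an illustration, assuming that "x" stands for any residue; the toy sequence is hypothetical and is not one of the reported AFPs.

```python
import re

# Lookahead so that overlapping occurrences are all reported.
TXT_MOTIF = re.compile(r"(?=(T.T.T.T))")


def find_txtxtxt(sequence):
    """Return 0-based start positions of every T-x-T-x-T-x-T match."""
    return [m.start() for m in TXT_MOTIF.finditer(sequence.upper())]


def spacer_lengths(starts, motif_len=7):
    """Lengths of the segments separating consecutive motif occurrences."""
    return [b - (a + motif_len) for a, b in zip(starts, starts[1:])]


# Hypothetical toy sequence: two motifs separated by a 13-residue spacer.
toy = "MK" + "TATCTGT" + "A" * 13 + "TSTNTQT" + "GG"
starts = find_txtxtxt(toy)
print(starts, spacer_lengths(starts))
```

`spacer_lengths` is meaningful only when the reported occurrences do not overlap; overlapping hits would give negative values and would need to be merged first.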
I-motif DNA structures are formed in the nuclei of human cells
NASA Astrophysics Data System (ADS)
Zeraati, Mahdi; Langley, David B.; Schofield, Peter; Moye, Aaron L.; Rouet, Romain; Hughes, William E.; Bryan, Tracy M.; Dinger, Marcel E.; Christ, Daniel
2018-06-01
Human genome function is underpinned by the primary storage of genetic information in canonical B-form DNA, with a second layer of DNA structure providing regulatory control. I-motif structures are thought to form in cytosine-rich regions of the genome and to have regulatory functions; however, in vivo evidence for the existence of such structures has so far remained elusive. Here we report the generation and characterization of an antibody fragment (iMab) that recognizes i-motif structures with high selectivity and affinity, enabling the detection of i-motifs in the nuclei of human cells. We demonstrate that the in vivo formation of such structures is cell-cycle and pH dependent. Furthermore, we provide evidence that i-motif structures are formed in regulatory regions of the human genome, including promoters and telomeric regions. Our results support the notion that i-motif structures provide key regulatory roles in the genome.
Behura, Susanta K; Severson, David W
2015-02-01
We present a detailed genome-wide comparative study of motif mismatches of microsatellites among 20 insect species representing five taxonomic orders. The results show that varying proportions (∼15-46%) of microsatellites identified in these species are imperfect in motif structure, and that they also vary in chromosomal distribution within genomes. It was observed that the genomic abundance of imperfect repeats is significantly associated with the length and number of motif mismatches of microsatellites. Furthermore, microsatellites with a higher number of mismatches tend to have lower abundance in the genome, suggesting that sequence heterogeneity of repeat motifs is a key determinant of genomic abundance of microsatellites. This relationship seems to be a general feature of microsatellites even in unrelated species such as yeast, roundworm, mouse and human. We provide a mechanistic explanation of the evolutionary link between motif heterogeneity and genomic abundance of microsatellites by examining the patterns of motif mismatches and allele sequences of single-nucleotide polymorphisms identified within microsatellite loci. Using Drosophila Reference Genetic Panel data, we further show that pattern of allelic variation modulates motif heterogeneity of microsatellites, and provide estimates of allele age of specific imperfect microsatellites found within protein-coding genes. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
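The notion of a motif mismatch in an imperfect microsatellite can be made concrete with a short sketch. It assumes that mismatches are substitutions only (no insertions or deletions) and that the repeat tract is already aligned to the start of the motif; the abstract does not describe the authors' detection procedure, so the function names here are hypothetical.

```python
def perfect_tract(motif, length):
    """Perfect tandem repeat of `motif`, truncated to `length` bases."""
    repeats = length // len(motif) + 1
    return (motif * repeats)[:length]


def motif_mismatches(tract, motif):
    """Number of positions where `tract` deviates from the perfect repeat."""
    reference = perfect_tract(motif.upper(), len(tract))
    return sum(a != b for a, b in zip(tract.upper(), reference))


def is_imperfect(tract, motif):
    """True if the tract contains at least one substitution mismatch."""
    return motif_mismatches(tract, motif) > 0


# Toy example: a (CA)n tract with a single G substitution.
print(motif_mismatches("CACACGCACA", "CA"), is_imperfect("CACACGCACA", "CA"))
```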
Di Scala, Coralie; Baier, Carlos J; Evans, Luke S; Williamson, Philip T F; Fantini, Jacques; Barrantes, Francisco J
2017-01-01
Cholesterol is a ubiquitous neutral lipid, which finely tunes the activity of a wide range of membrane proteins, including neurotransmitter and hormone receptors and ion channels. Given the scarcity of available X-ray crystallographic structures and the even fewer in which cholesterol sites have been directly visualized, application of in silico computational methods remains a valid alternative for the detection and thermodynamic characterization of cholesterol-specific sites in functionally important membrane proteins. The membrane-embedded segments of the paradigm neurotransmitter receptor for acetylcholine display a series of cholesterol consensus domains (which we have coined "CARC"). The CARC motif exhibits a preference for the outer membrane leaflet and its mirror motif, CRAC, for the inner one. Some membrane proteins possess the double CARC-CRAC sequences within the same transmembrane domain. In addition to in silico molecular modeling, the affinity, concentration dependence, and specificity of the cholesterol-recognition motif-protein interaction have recently found experimental validation in other biophysical approaches like monolayer techniques and nuclear magnetic resonance spectroscopy. From the combined studies, it becomes apparent that the CARC motif is now more firmly established as a high-affinity cholesterol-binding domain for membrane-bound receptors and remarkably conserved along phylogenetic evolution. © 2017 Elsevier Inc. All rights reserved.
Reid, Korey M; Sunanda, Punnepalli; Raghothama, S; Krishnan, V V
2017-11-01
Intrinsically disordered proteins (IDPs) lack a well-defined 3D structure under physiological conditions, yet the inherent disorder, represented by an ensemble of conformations, plays a critical role in many cellular and regulatory processes. Nucleoporins, or Nups, are the proteins found in the nuclear pore complex (NPC). The central pore of the NPC is occupied by Nups, which have phenylalanine-glycine domain repeats and are intrinsically disordered, and therefore are termed FG-Nups. These FG-domain repeats differ in cohesiveness, ranging from least (FG) to most (GLFG) cohesive. The designed FG-Nup is a 25 AA model peptide containing a noncohesive FG-motif flanked by two cohesive GLFG-motifs (WT peptide). Complete NMR-based ensemble characterization of this peptide, along with a control peptide with an F>A substitution (MU peptide), is discussed. Ensemble characterization of the NMR-determined models suggests that neither peptide has a consistent secondary structure and that both remain disordered. Nonetheless, the role of cohesive elements mediated by the GLFG motifs is evident in the WT ensemble of structures, which is more compact than that of the MU peptide. The approach presented here allows an alternate way to investigate the specific roles of distinct amino acid motifs that translate into the long-range organization of the ensemble of structures and in general on the nature of IDPs. © 2017 Wiley Periodicals, Inc.
Mlýnský, Vojtěch; Bussi, Giovanni
2018-01-18
The function of RNA molecules usually depends on their overall fold and on the presence of specific structural motifs. Chemical probing methods are routinely used in combination with nearest-neighbor models to determine RNA secondary structure. Among the available methods, SHAPE is relevant due to its capability to probe all RNA nucleotides and the possibility to be used in vivo. However, the structural determinants for SHAPE reactivity and its mechanism of reaction are still unclear. Here molecular dynamics simulations and enhanced sampling techniques are used to predict the accessibility of nucleotide analogs and larger RNA structural motifs to SHAPE reagents. We show that local RNA reconformations are crucial in allowing reagents to reach the 2'-OH group of a particular nucleotide and that sugar pucker is a major structural factor influencing SHAPE reactivity.
Novel functions of CCM1 delimit the relationship of PTB/PH domains.
Zhang, Jun; Dubey, Pallavi; Padarti, Akhil; Zhang, Aileen; Patel, Rinkal; Patel, Vipulkumar; Cistola, David; Badr, Ahmed
2017-10-01
Three NPXY motifs and one FERM domain in CCM1 make it a versatile scaffold protein for tethering signaling components together within the CCM signaling complex (CSC). The cellular role of the CCM1 protein remains inadequately understood. Phosphotyrosine binding (PTB) and pleckstrin homology (PH) domains have been recognized as structurally related but functionally distinct domains. We used molecular cloning, protein binding assays and RT-qPCR to identify novel cellular partners of CCM1 and its cellular expression patterns. We also screened candidate PTB/PH proteins and then combined structural simulation with current X-ray crystallography and NMR data to define the essential structure of the PTB/PH domain for NPXY binding and the relationship among the PTB, PH and FERM domain(s). We identified a group of 28 novel cellular partners of CCM1, all of which contain either PTB or PH domain(s), and developed a novel classification system for these PTB/PH proteins based on their relationship with different NPXY motifs of CCM1. Our results demonstrated that CCM1 binds a wide spectrum of PTB/PH proteins and maintains its specificity for certain PTB/PH domains through selective combination of its three NPXY motifs. We also demonstrated that CCM1 can be assembled into oligomers through intermolecular interaction between the F3 lobe of its FERM domain and one of the three NPXY motifs. Despite being embedded in the FERM domain as the F3 lobe, the F3 module acts as a fully functional PH domain to interact with the NPXY motif. The most salient finding of the study was that PTB and PH domains are structurally and functionally comparable, suggesting that the PTB domain likely evolved from the PH domain with polymorphic structural additions at its N-terminus. A new β1A-strand of the PTB domain was discovered and a new minimum structural requirement of the PTB/PH domain for NPXY motif binding was determined. 
Based on our data, a novel theory of structure, function and relationship of PTB, PH and FERM domains has been proposed, which extends the importance of the NPXY-PTB/PH interaction on the CSC signaling and/or other cell receptors with great potential pointing to new therapeutic strategies. The study provides new insight into the structural characteristics of PTB/PH domains, essential structural elements of PTB/PH domain required for NPXY motif-binding, and function and relationship among PTB, PH and FERM domains. Copyright © 2017 Elsevier B.V. All rights reserved.
Identifying the scale-dependent motifs in atmospheric surface layer by ordinal pattern analysis
NASA Astrophysics Data System (ADS)
Li, Qinglei; Fu, Zuntao
2018-07-01
Ramp-like structures in various atmospheric surface layer time series have long been studied, but the presence of finer-scale motifs embedded within larger-scale ramp-like structures has largely been overlooked in the literature. Here a novel, objective and well-adapted methodology, ordinal pattern analysis, is adopted to study the finer-scale motifs in atmospheric boundary-layer (ABL) time series. The results show that the motifs represented by different ordinal patterns cluster, and that 6 dominant motifs out of the full set of 24 account for about 45% of the time series at particular scales, which indicates the larger contribution of finer-scale motifs to the series. Further analysis indicates that motif statistics are similar under stable and unstable conditions at larger scales, but large discrepancies are found at smaller scales, and the frequencies of motifs "1234" and/or "4321" are somewhat higher under stable conditions than under unstable conditions. Under stable conditions, the occurrence frequencies of motifs "1234" and "4321" change greatly: the occurrence frequency of motif "1234" decreases from nearly 24% to 4.5% as the scale factor increases, while that of motif "4321" changes nonlinearly with increasing scale. These large differences in how the dominant motifs change with scale can be taken as an indicator to quantify flow structure changes under different stability conditions, and a motif entropy can be defined from only the 6 dominant motifs to quantify this time-scale independent property of the motifs. All these results suggest that the scale of the finer-scale motifs should be carefully taken into consideration in the interpretation of turbulence coherent structures.
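The 24 motifs mentioned in this abstract correspond to the 4! possible rank orderings of four samples. The following is a minimal sketch of how such ordinal-pattern frequencies might be computed. It assumes that the scale factor acts as a sampling lag between the four points of each window, which is one common convention but is not confirmed by the abstract; the function names are hypothetical.

```python
from collections import Counter
from itertools import permutations


def ordinal_pattern(window):
    """Rank-based label of a window, e.g. (0.1, 0.5, 0.7, 0.9) -> '1234'."""
    # Indices sorted by value; Python's sort is stable, so ties keep time order.
    order = sorted(range(len(window)), key=lambda i: window[i])
    ranks = [0] * len(window)
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    return "".join(str(r) for r in ranks)


def motif_frequencies(series, order=4, lag=1):
    """Relative frequency of every ordinal pattern of the given order."""
    span = (order - 1) * lag
    if len(series) <= span:
        raise ValueError("series is too short for this order and lag")
    counts = Counter(
        ordinal_pattern(series[i:i + span + 1:lag])
        for i in range(len(series) - span)
    )
    total = sum(counts.values())
    labels = ("".join(map(str, p)) for p in permutations(range(1, order + 1)))
    return {label: counts.get(label, 0) / total for label in labels}


# A purely increasing ramp contains only the motif "1234".
ramp = [0.1 * k for k in range(20)]
freqs = motif_frequencies(ramp, order=4, lag=2)
print(freqs["1234"], freqs["4321"])
```

Repeating the call over a range of `lag` values would give the kind of scale-dependent motif statistics the abstract discusses, under the stated assumption about what the scale factor means.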
Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins
Kinjo, Akira R.; Nakamura, Haruki
2012-01-01
Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. PMID:22347478
Schuetz, Anja; Min, Jinrong; Allali-Hassani, Abdellah; Schapira, Matthieu; Shuen, Michael; Loppnau, Peter; Mazitschek, Ralph; Kwiatkowski, Nick P.; Lewis, Timothy A.; Maglathin, Rebecca L.; McLean, Thomas H.; Bochkarev, Alexey; Plotnikov, Alexander N.; Vedadi, Masoud; Arrowsmith, Cheryl H.
2008-01-01
Histone deacetylases (HDACs) are protein deacetylases that play a role in repression of gene transcription and are emerging targets in cancer therapy. Here, we characterize the structure and enzymatic activity of the catalytic domain of human HDAC7 (cdHDAC7). Although HDAC7 normally exists as part of a multiprotein complex, we show that cdHDAC7 has a low level of deacetylase activity which can be inhibited by known HDAC inhibitors. The crystal structures of human cdHDAC7 and its complexes with two hydroxamate inhibitors are the first structures of the catalytic domain of class IIa HDACs and demonstrate significant differences with previously reported class I and class IIb-like HDAC structures. We show that cdHDAC7 has an additional class IIa HDAC-specific zinc binding motif adjacent to the active site which is likely to participate in substrate recognition and protein-protein interaction and may provide a site for modulation of activity. Furthermore, a different active site topology results in modified catalytic properties and in an enlarged active site pocket. Our studies provide mechanistic insights into class IIa HDACs and facilitate the design of specific modulators. PMID:18285338
Ligand binding by repeat proteins: natural and designed
Grove, Tijana Z; Cortajarena, Aitziber L; Regan, Lynne
2012-01-01
Repeat proteins contain tandem arrays of small structural motifs. As a consequence of this architecture, they adopt non-globular, extended structures that present large, highly specific surfaces for ligand binding. Here we discuss recent advances toward understanding the functional role of this unique modular architecture. We showcase specific examples of natural repeat proteins interacting with diverse ligands and also present examples of designed repeat protein–ligand interactions. PMID:18602006
Poust, Sean; Yoon, Isu; Adams, Paul D.; ...
2014-10-06
Acyltransferases determine which extender units are incorporated into polyketide and fatty acid products. Thus, the ping-pong acyltransferase mechanism utilizes a serine in a conserved GHSxG motif. However, the role of the conserved histidine in this motif is poorly understood. We observed that a histidine to alanine mutation (H640A) in the GHSxG motif of the malonyl-CoA specific yersiniabactin acyltransferase results in an approximately seven-fold higher hydrolysis rate over the wildtype enzyme, while retaining transacylation activity. We propose two possibilities for the reduction in hydrolysis rate: either H640 structurally stabilizes the protein by hydrogen bonding with a conserved asparagine in the ferredoxin-like subdomain of the protein, or a water-mediated hydrogen bond between H640 and the malonyl moiety stabilizes the malonyl-O-AT ester intermediate.
Molecular Signaling Network Motifs Provide a Mechanistic Basis for Cellular Threshold Responses
Bhattacharya, Sudin; Conolly, Rory B.; Clewell, Harvey J.; Kaminski, Norbert E.; Andersen, Melvin E.
2014-01-01
Background: Increasingly, there is a move toward using in vitro toxicity testing to assess human health risk due to chemical exposure. As with in vivo toxicity testing, an important question for in vitro results is whether there are thresholds for adverse cellular responses. Empirical evaluations may show consistency with thresholds, but the main evidence has to come from mechanistic considerations. Objectives: Cellular response behaviors depend on the molecular pathway and circuitry in the cell and the manner in which chemicals perturb these circuits. Understanding circuit structures that are inherently capable of resisting small perturbations and producing threshold responses is an important step towards mechanistically interpreting in vitro testing data. Methods: Here we have examined dose–response characteristics for several biochemical network motifs. These network motifs are basic building blocks of molecular circuits underpinning a variety of cellular functions, including adaptation, homeostasis, proliferation, differentiation, and apoptosis. For each motif, we present biological examples and models to illustrate how thresholds arise from specific network structures. Discussion and Conclusion: Integral feedback, feedforward, and transcritical bifurcation motifs can generate thresholds. Other motifs (e.g., proportional feedback and ultrasensitivity) produce responses where the slope in the low-dose region is small and stays close to the baseline. Feedforward control may lead to nonmonotonic or hormetic responses. We conclude that network motifs provide a basis for understanding thresholds for cellular responses. Computational pathway modeling of these motifs and their combinations occurring in molecular signaling networks will be a key element in new risk assessment approaches based on in vitro cellular assays. Citation: Zhang Q, Bhattacharya S, Conolly RB, Clewell HJ III, Kaminski NE, Andersen ME. 2014. Molecular signaling network motifs provide a mechanistic basis for cellular threshold responses. Environ Health Perspect 122:1261–1270; http://dx.doi.org/10.1289/ehp.1408244 PMID:25117432
Occurrence probability of structured motifs in random sequences.
Robin, S; Daudin, J-J; Richard, H; Sagot, M-F; Schbath, S
2002-01-01
The problem of extracting from a set of nucleic acid sequences motifs which may have biological function is more and more important. In this paper, we are interested in particular motifs that may be implicated in the transcription process. These motifs, called structured motifs, are composed of two ordered parts separated by a variable distance and allowing for substitutions. In order to assess their statistical significance, we propose approximations of the probability of occurrences of such a structured motif in a given sequence. An application of our method to evaluate candidate promoters in E. coli and B. subtilis is presented. Simulations show the goodness of the approximations.
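As an illustration of the quantity discussed in this abstract, the following is a minimal sketch that estimates by brute-force simulation the probability that a structured motif occurs at least once in a random sequence. It is not the analytical approximation proposed by the authors. The function names, the uniform independent-letter background model, and the default parameters are all assumptions made for this example.

```python
import random


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def has_structured_motif(seq, box1, box2, dmin, dmax, max_subs=0):
    """Return True if box1 and box2 occur in this order, separated by
    dmin..dmax letters, each box matched with at most max_subs substitutions."""
    n1, n2 = len(box1), len(box2)
    for i in range(len(seq) - n1 + 1):
        if hamming(seq[i:i + n1], box1) > max_subs:
            continue
        for d in range(dmin, dmax + 1):
            j = i + n1 + d
            if j + n2 > len(seq):
                break  # larger spacers would run past the end of the sequence
            if hamming(seq[j:j + n2], box2) <= max_subs:
                return True
    return False


def estimate_occurrence_probability(box1, box2, dmin, dmax, seq_len,
                                    max_subs=0, trials=2000, seed=0,
                                    alphabet="ACGT"):
    """Monte Carlo estimate of P(at least one occurrence) in a random sequence.
    Assumption: letters are independent and uniformly distributed."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        seq = "".join(rng.choice(alphabet) for _ in range(seq_len))
        hits += has_structured_motif(seq, box1, box2, dmin, dmax, max_subs)
    return hits / trials
```

A call such as `estimate_occurrence_probability("TTGACA", "TATAAT", 15, 19, seq_len=300, max_subs=1)` would give a simulated reference value against which an analytical approximation could be compared; a Markov background would be more realistic than the uniform model assumed here.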
Tes, a specific Mena interacting partner, breaks the rules for EVH1 binding.
Boëda, Batiste; Briggs, David C; Higgins, Theresa; Garvalov, Boyan K; Fadden, Andrew J; McDonald, Neil Q; Way, Michael
2007-12-28
The intracellular targeting of Ena/VASP family members is achieved via the interaction of their EVH1 domain with FPPPP sequence motifs found in a variety of cytoskeletal proteins, including lamellipodin, vinculin, and zyxin. Here we show that the LIM3 domain of Tes, which lacks the FPPPP motif, binds to the EVH1 domain of Mena, but not to those of VASP or Evl. The structure of the LIM3:EVH1 complex reveals that Tes occludes the FPPPP-binding site and competes with FPPPP-containing proteins for EVH1 binding. Structure-based gain-of-function experiments define the molecular basis for the specificity of the Tes-Mena interaction. Consistent with in vitro observations, the LIM3 domain displaces Mena, but not VASP, from the leading edge and focal adhesions. It also regulates cell migration through a Mena-dependent mechanism. Our observations identify Tes as an atypical EVH1 binding partner and a regulator specific to a single Ena/VASP family member.
De Moura, Dref C.; Bryksa, Brian C.; Yada, Rickey Y.
2014-01-01
The plant-specific insert is an approximately 100-residue domain found exclusively within the C-terminal lobe of some plant aspartic proteases. Structurally, this domain is a member of the saposin-like protein family, and is involved in plant pathogen defense as well as vacuolar targeting of the parent protease molecule. Similar to other members of the saposin-like protein family, most notably saposins A and C, the recently resolved crystal structure of potato (Solanum tuberosum) plant-specific insert has been shown to exist in a substrate-bound open conformation in which the plant-specific insert oligomerizes to form homodimers. In addition to the open structure, a closed conformation also exists having the classic saposin fold of the saposin-like protein family as observed in the crystal structure of barley (Hordeum vulgare L.) plant-specific insert. In the present study, the mechanisms of tertiary and quaternary conformation changes of potato plant-specific insert were investigated in silico as a function of pH. Umbrella sampling and determination of the free energy change of dissociation of the plant-specific insert homodimer revealed that increasing the pH of the system to near physiological levels reduced the free energy barrier to dissociation. Furthermore, principal component analysis was used to characterize conformational changes at both acidic and neutral pH. The results indicated that the plant-specific insert may adopt a tertiary structure similar to the characteristic saposin fold and suggest a potential new structural motif among saposin-like proteins. To our knowledge, this acidified PSI structure presents the first example of an alternative saposin-fold motif for any member of the large and diverse SAPLIP family. PMID:25188221
Biological network motif detection and evaluation
2011-01-01
Background: Molecular-level biological data can be assembled into system-level data in the form of biological networks. Network motifs are defined as over-represented small connected subgraphs in networks, and they have been used in many biological applications. Since network motif discovery involves computationally challenging processes, previous algorithms have focused on computational efficiency. However, we believe that the biological quality of network motifs is also very important. Results: We define biological network motifs as biologically significant subgraphs, and distinguish traditional network motifs as structural network motifs in this paper. We develop five algorithms, namely EDGEGO-BNM, EDGEBETWEENNESS-BNM, NMF-BNM, NMFGO-BNM and VOLTAGE-BNM, for efficient detection of biological network motifs, and introduce several evaluation measures, including motifs included in complex, motifs included in functional module, and GO term clustering score. Experimental results show that EDGEGO-BNM and EDGEBETWEENNESS-BNM perform better than existing algorithms, and that all of our algorithms can also be applied to find structural network motifs. Conclusion: We provide new approaches to finding network motifs in biological networks. Our algorithms efficiently detect biological network motifs and further improve existing algorithms to find high quality structural network motifs, which would be impossible using existing algorithms. The performances of the algorithms are compared based on our new evaluation measures in biological contexts. We believe that our work provides guidelines for network motif research on biological networks. PMID:22784624
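To make the notion of a structural network motif concrete, the sketch below counts one classic three-node pattern, the feed-forward loop (a→b, b→c, a→c), in a directed edge list. It is not one of the five algorithms named in the abstract; the function name and the edge-list representation are assumptions for illustration. Deciding whether a pattern is over-represented would additionally require comparing this count against counts in randomized networks, which is omitted here.

```python
from itertools import permutations


def count_feed_forward_loops(edges):
    """Count ordered node triples (a, b, c) with edges a->b, b->c and a->c.

    `edges` is an iterable of (source, target) pairs of a directed graph.
    This exhaustive scan is cubic in the number of nodes and is meant only
    for small illustrative networks.
    """
    edge_set = set(edges)
    nodes = {n for e in edge_set for n in e}
    return sum(
        1
        for a, b, c in permutations(nodes, 3)
        if (a, b) in edge_set and (b, c) in edge_set and (a, c) in edge_set
    )
```

For a real interaction network, the same count would be repeated on degree-preserving randomizations of the graph to judge whether the observed number is unusually high.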
Identifying novel sequence variants of RNA 3D motifs
Zirbel, Craig L.; Roll, James; Sweeney, Blake A.; Petrov, Anton I.; Pirrung, Meg; Leontis, Neocles B.
2015-01-01
Predicting RNA 3D structure from sequence is a major challenge in biophysics. An important sub-goal is accurately identifying recurrent 3D motifs from RNA internal and hairpin loop sequences extracted from secondary structure (2D) diagrams. We have developed and validated new probabilistic models for 3D motif sequences based on hybrid Stochastic Context-Free Grammars and Markov Random Fields (SCFG/MRF). The SCFG/MRF models are constructed using atomic-resolution RNA 3D structures. To parameterize each model, we use all instances of each motif found in the RNA 3D Motif Atlas and annotations of pairwise nucleotide interactions generated by the FR3D software. Isostericity relations between non-Watson–Crick basepairs are used in scoring sequence variants. SCFG techniques model nested pairs and insertions, while MRF ideas handle crossing interactions and base triples. We use test sets of randomly-generated sequences to set acceptance and rejection thresholds for each motif group and thus control the false positive rate. Validation was carried out by comparing results for four motif groups to RMDetect. The software developed for sequence scoring (JAR3D) is structured to automatically incorporate new motifs as they accumulate in the RNA 3D Motif Atlas when new structures are solved and is available free for download. PMID:26130723
New structures of Fe3S for rare-earth-free permanent magnets
NASA Astrophysics Data System (ADS)
Yu, Shu; Zhao, Xin; Wu, Shunqing; Nguyen, Manh Cuong; Zhu, Zi-zhong; Wang, Cai-Zhuang; Ho, Kai-Ming
2018-02-01
We applied an adaptive genetic algorithm (AGA) to search for low-energy crystal structures of Fe3S. A number of structures with energies lower than that of the experimentally reported Pnma and I-4 structures have been obtained from our AGA searches. These low-energy structures can be classified as layer-motif and column-motif structures. In the column-motif structures, Fe atoms self-assemble into rods with a bcc type of underlying lattice, which are separated by the holes terminated by S atoms. In the layer-motif structures, the bulk Fe is broken into slabs of several layers passivated by S atoms. Magnetic property calculations showed that the column-motif structures exhibit reasonably high uniaxial magnetic anisotropy. In addition, we examined the effect of Co doping to Fe3S and found that magnetic anisotropy can be enhanced through Co doping.
An Efficient Scheme for Crystal Structure Prediction Based on Structural Motifs
Zhu, Zizhong; Wu, Ping; Wu, Shunqing; ...
2017-05-15
An efficient scheme based on structural motifs is proposed for the crystal structure prediction of materials. The key advantage of the present method is twofold: first, the degrees of freedom of the system are greatly reduced, since each structural motif, regardless of its size, can always be described by a set of parameters (R, θ, φ) with five degrees of freedom; second, the motifs could always appear in the predicted structures when the energies of the structures are relatively low. Both features make the present scheme a very efficient method for predicting desired materials. The method has been applied to the case of LiFePO4, an important cathode material for lithium-ion batteries. Numerous new structures of LiFePO4 have been found, compared to those currently available, demonstrating the reliability of the present methodology and illustrating the promise of the concept of structural motifs.
De novo discovery of structural motifs in RNA 3D structures through clustering.
Ge, Ping; Islam, Shahidul; Zhong, Cuncong; Zhang, Shaojie
2018-05-18
As functional components in three-dimensional (3D) conformation of an RNA, the RNA structural motifs provide an easy way to associate the molecular architectures with their biological mechanisms. In the past years, many computational tools have been developed to search motif instances by using the existing knowledge of well-studied families. Recently, with the rapidly increasing number of resolved RNA 3D structures, there is an urgent need to discover novel motifs with the newly presented information. In this work, we classify all the loops in non-redundant RNA 3D structures to detect plausible RNA structural motif families by using a clustering pipeline. Compared with other clustering approaches, our method has two benefits: first, the underlying alignment algorithm is tolerant to the variations in 3D structures. Second, sophisticated downstream analysis has been performed to ensure the clusters are valid and easily applied to further research. The final clustering results contain many interesting new variants of known motif families, such as GNAA tetraloop, kink-turn, sarcin-ricin and T-loop. We have also discovered potential novel functional motifs conserved in ribosomal RNA, sgRNA, SRP RNA, riboswitch and ribozyme.
Structural Elements Recognized by Abacavir-Induced T Cells.
Yerly, Daniel; Pompeu, Yuri Andreiw; Schutte, Ryan J; Eriksson, Klara K; Stryhn, Anette; Bracey, Austin W; Buus, Soren; Ostrov, David A
2017-07-07
Adverse drug reactions are one of the leading causes of morbidity and mortality in health care worldwide. Human leukocyte antigen (HLA) alleles have been strongly associated with drug hypersensitivities, and the causative drugs have been shown to stimulate specific T cells at the sites of autoimmune destruction. The structural elements recognized by drug-specific T cell receptors (TCRs) in vivo are poorly defined. Drug-stimulated T cells express TCRs specific for peptide/HLA complexes, but the characteristics of peptides (sequence, or endogenous or exogenous origin) presented in the context of small molecule drugs are not well studied. Using HLA-B*57:01 mediated hypersensitivity to abacavir as a model system, this study examines structural similarities of HLA presented peptides recognized by drug-specific TCRs. Using the crystal structure of HLA-B*57:01 complexed with abacavir and an immunogenic self peptide, VTTDIQVKV SPT5a 976-984, peptide side chains exhibiting flexibility and solvent exposure were identified as potential drug-specific T cell recognition motifs. Viral sequences with structural motifs similar to the immunogenic self peptide were identified. Abacavir-specific T cell clones were used to determine if virus peptides presented in the context of abacavir stimulate T cell responsiveness. An abacavir-specific T cell clone was stimulated by VTQQAQVRL, corresponding to HSV1/2 230-238, in the context of HLA-B*57:01. These data suggest the T cell polyclonal response to abacavir consists of multiple subsets, including T cells that recognize self peptide/HLA-B*57:01 complexes and crossreact with viral peptide/HLA-B*57:01 complexes due to similarity in TCR contact residues.
Adelman, K; Salmon, B; Baines, J D
2001-03-13
The product of the herpes simplex virus type 1 U(L)28 gene is essential for cleavage of concatemeric viral DNA into genome-length units and packaging of this DNA into viral procapsids. To address the role of U(L)28 in this process, purified U(L)28 protein was assayed for the ability to recognize conserved herpesvirus DNA packaging sequences. We report that DNA fragments containing the pac1 DNA packaging motif can be induced by heat treatment to adopt novel DNA conformations that migrate faster than the corresponding duplex in nondenaturing gels. Surprisingly, these novel DNA structures are high-affinity substrates for U(L)28 protein binding, whereas double-stranded DNA of identical sequence composition is not recognized by U(L)28 protein. We demonstrate that only one strand of the pac1 motif is responsible for the formation of novel DNA structures that are bound tightly and specifically by U(L)28 protein. To determine the relevance of the observed U(L)28 protein-pac1 interaction to the cleavage and packaging process, we have analyzed the binding affinity of U(L)28 protein for pac1 mutants previously shown to be deficient in cleavage and packaging in vivo. Each of the pac1 mutants exhibited a decrease in DNA binding by U(L)28 protein that correlated directly with the reported reduction in cleavage and packaging efficiency, thereby supporting a role for the U(L)28 protein-pac1 interaction in vivo. These data therefore suggest that the formation of novel DNA structures by the pac1 motif confers added specificity on recognition of DNA packaging sequences by the U(L)28-encoded component of the herpesvirus cleavage and packaging machinery.
Wienk, Hans; Slootweg, Jack C.; Speerstra, Sietske; Kaptein, Robert; Boelens, Rolf; Folkers, Gert E.
2013-01-01
To maintain the integrity of the genome, multiple DNA repair systems exist to repair damaged DNA. Recognition of altered DNA, including bulky adducts, pyrimidine dimers and interstrand crosslinks (ICL), partially depends on proteins containing helix-hairpin-helix (HhH) domains. To understand how ICL is specifically recognized by the Fanconi anemia proteins FANCM and FAAP24, we determined the structure of the HhH domain of FAAP24. Although it resembles other HhH domains, the FAAP24 domain contains a canonical hairpin motif followed by a distorted motif. The HhH domain can bind various DNA substrates; using nuclear magnetic resonance titration experiments, we demonstrate that the canonical HhH motif is required for double-stranded DNA (dsDNA) binding, whereas the unstructured N-terminus can interact with single-stranded DNA. Both DNA binding surfaces are used for binding to ICL-like single/double-strand junction-containing DNA substrates. A structural model for FAAP24 bound to dsDNA has been made based on homology with the translesion polymerase iota. Site-directed mutagenesis, sequence conservation and charge distribution support the dsDNA-binding model. Analogous to other HhH domain-containing proteins, we suggest that multiple FAAP24 regions together contribute to binding to the single/double-strand junction, which could contribute to specificity in ICL DNA recognition. PMID:23661679
DOE Office of Scientific and Technical Information (OSTI.GOV)
Parish, D.; Benach, J; Liu, G
2008-01-01
The structure of the 142-residue protein Q8ZP25 SALTY encoded in the genome of Salmonella typhimurium LT2 was determined independently by NMR and X-ray crystallography, and the structure of the 140-residue protein HYAE ECOLI encoded in the genome of Escherichia coli was determined by NMR. The two proteins belong to Pfam (Finn et al. 34:D247-D251, 2006) PF07449, which currently comprises 50 members, and belongs itself to the 'thioredoxin-like clan'. However, protein HYAE ECOLI and the other proteins of Pfam PF07449 do not contain the canonical Cys-X-X-Cys active site sequence motif of thioredoxin. Protein HYAE ECOLI was previously classified as a (NiFe) hydrogenase-1 specific chaperone interacting with the twin-arginine translocation (Tat) signal peptide. The structures presented here exhibit the expected thioredoxin-like fold and support the view that members of Pfam family PF07449 specifically interact with Tat signal peptides.
kpLogo: positional k-mer analysis reveals hidden specificity in biological sequences
2017-01-01
Motifs of only 1–4 letters can play important roles when present at key locations within macromolecules. Because existing motif-discovery tools typically miss these position-specific short motifs, we developed kpLogo, a probability-based logo tool for integrated detection and visualization of position-specific ultra-short motifs from a set of aligned sequences. kpLogo also overcomes the limitations of conventional motif-visualization tools in handling positional interdependencies and utilizing ranked or weighted sequences increasingly available from high-throughput assays. kpLogo can be found at http://kplogo.wi.mit.edu/. PMID:28460012
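The idea of position-specific short motifs can be sketched as follows: count every k-mer at every position of a set of aligned sequences, then score how surprising a count is under a simple background model. This is an illustrative sketch only and does not reproduce kpLogo's actual statistics or interface; the function names, the uniform background, and the binomial tail test are assumptions chosen for the example.

```python
from collections import Counter
from math import comb


def positional_kmer_counts(seqs, k):
    """Return a list where entry p is a Counter of the k-mers starting at
    position p across all sequences (sequences must have equal length)."""
    length = len(seqs[0])
    return [Counter(s[p:p + k] for s in seqs) for p in range(length - k + 1)]


def binomial_tail(n, x, p):
    """P(X >= x) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(x, n + 1))


def positional_enrichment(seqs, kmer, alphabet_size=4):
    """For each start position, return (position, count, p_value), where the
    p-value is the binomial tail under an assumed uniform background."""
    k = len(kmer)
    p0 = (1 / alphabet_size) ** k
    n = len(seqs)
    return [
        (pos, counts[kmer], binomial_tail(n, counts[kmer], p0))
        for pos, counts in enumerate(positional_kmer_counts(seqs, k))
    ]
```

Positions with very small p-values are candidates for position-specific motifs; with many positions and k-mers tested, a multiple-testing correction would be needed before drawing conclusions.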
Joseph, Prem Raj B.; Sawant, Kirti V.; Isley, Angela; Pedroza, Mesias; Garofalo, Roberto P.; Richardson, Ricardo M.; Rajarathnam, Krishna
2014-01-01
Chemokines mediate diverse functions from organogenesis to mobilizing leucocytes, and are unusual agonists for class-A GPCRs (G-protein-coupled receptors) because of their large size and multi-domain structure. The current model for receptor activation, which involves interactions between chemokine N-loop and receptor N-terminal residues (Site-I) and between chemokine N-terminal and receptor extracellular loop/transmembrane residues (Site-II), fails to describe differences in ligand/receptor selectivity and the activation of multiple signalling pathways. In the present study, we show in neutrophil-activating chemokine CXCL8 that the highly conserved GP (glycine-proline) motif located distal to both N-terminal and N-loop residues couples Site-I and Site-II interactions. Mutations in the GP motif caused various differences from native-like function to complete loss of activity that could not be correlated with the specific mutation, receptor affinity or subtype, or a specific signalling pathway. NMR studies indicated that the GP motif does not influence Site-I interactions, but molecular dynamics simulations suggested that this motif dictates substates of the CXCL8 conformational ensemble. We conclude that the GP motif enables diverse receptor functions by controlling cross-talk between Site-I and Site-II, and further propose that the repertoire of chemokine functions is best described by a conformational ensemble model in which a network of long-range coupled indirect interactions mediate receptor activity. PMID:24032673
Campion, S R; Ameen, A S; Lai, L; King, J M; Munzenmaier, T N
2001-08-15
This report describes the application of a simple computational tool, AAPAIR.TAB, for the systematic analysis of the cysteine-rich EGF, Sushi, and Laminin motif/sequence families at the two-amino acid level. Automated dipeptide frequency/bias analysis detects preferences in the distribution of amino acids in established protein families, by determining which "ordered dipeptides" occur most frequently in comprehensive motif-specific sequence data sets. Graphic display of the dipeptide frequency/bias data revealed family-specific preferences for certain dipeptides, but more importantly detected a shared preference for employment of the ordered dipeptides Gly-Tyr (GY) and Gly-Phe (GF) in all three protein families. The dipeptide Asn-Gly (NG) also exhibited high-frequency and bias in the EGF and Sushi motif families, whereas Asn-Thr (NT) was distinguished in the Laminin family. Evaluation of the distribution of dipeptides identified by frequency/bias analysis subsequently revealed the highly restricted localization of the G(F/Y) and N(G/T) sequence elements at two separate sites of extreme conservation in the consensus sequence of all three sequence families. The similar employment of the high-frequency/bias dipeptides in three distinct protein sequence families was further correlated with the concurrence of these shared molecular determinants at similar positions within the distinctive scaffolds of three structurally divergent, but similarly employed, motif modules.
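The ordered-dipeptide analysis described above can be sketched in a few lines. The sketch below is not AAPAIR.TAB itself, whose exact definition of bias is not given in the abstract. Here, as a labeled assumption, bias is taken to be the observed dipeptide frequency divided by the frequency expected from the single-residue composition, and the function names are hypothetical.

```python
from collections import Counter


def dipeptide_counts(sequences):
    """Count ordered dipeptides (adjacent residue pairs) over all sequences."""
    counts = Counter()
    for s in sequences:
        for a, b in zip(s, s[1:]):
            counts[a + b] += 1
    return counts


def dipeptide_bias(sequences):
    """Observed/expected ratio for each ordered dipeptide.

    Assumption: the expected frequency of 'XY' is freq(X) * freq(Y), using
    residue frequencies pooled over the whole data set.
    """
    pair_counts = dipeptide_counts(sequences)
    residue_counts = Counter("".join(sequences))
    total_pairs = sum(pair_counts.values())
    total_residues = sum(residue_counts.values())
    bias = {}
    for pair, n in pair_counts.items():
        expected = (residue_counts[pair[0]] / total_residues) * (
            residue_counts[pair[1]] / total_residues
        )
        bias[pair] = (n / total_pairs) / expected
    return bias
```

Applied to a motif-specific sequence set, sorting the returned dictionary by value would list the dipeptides used more often than composition alone predicts, which is the kind of signal the abstract reports for Gly-Tyr and Gly-Phe.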
FoldMiner and LOCK 2: protein structure comparison and motif discovery on the web.
Shapiro, Jessica; Brutlag, Douglas
2004-07-01
The FoldMiner web server (http://foldminer.stanford.edu/) provides remote access to methods for protein structure alignment and unsupervised motif discovery. FoldMiner is unique among such algorithms in that it improves both the motif definition and the sensitivity of a structural similarity search by combining the search and motif discovery methods and using information from each process to enhance the other. In a typical run, a query structure is aligned to all structures in one of several databases of single domain targets in order to identify its structural neighbors and to discover a motif that is the basis for the similarity among the query and statistically significant targets. This process is fully automated, but options for manual refinement of the results are available as well. The server uses the Chime plugin and customized controls to allow for visualization of the motif and of structural superpositions. In addition, we provide an interface to the LOCK 2 algorithm for rapid alignments of a query structure to smaller numbers of user-specified targets.
Structural basis for concerted recruitment and activation of IRF-3 by innate immune adaptor proteins
Zhao, Baoyu; Shu, Chang; Gao, Xinsheng; ...
2016-06-02
Type I IFNs are key cytokines mediating innate antiviral immunity. cGMP-AMP synthase, retinoic acid-inducible protein 1 (RIG-I)–like receptors, and Toll-like receptors recognize microbial double-stranded (ds)DNA, dsRNA, and LPS to induce the expression of type I IFNs. These signaling pathways converge at the recruitment and activation of the transcription factor IRF-3 (IFN regulatory factor 3). The adaptor proteins STING (stimulator of IFN genes), MAVS (mitochondrial antiviral signaling), and TRIF (TIR domain-containing adaptor inducing IFN-β) mediate the recruitment of IRF-3 through a conserved pLxIS motif. Here, we show that the pLxIS motif of phosphorylated STING, MAVS, and TRIF binds to IRF-3 in a similar manner, whereas residues upstream of the motif confer specificity. The structure of the IRF-3 phosphomimetic mutant S386/396E bound to the cAMP response element binding protein (CREB)-binding protein reveals that the pLxIS motif also mediates IRF-3 dimerization and activation. Moreover, rotavirus NSP1 (nonstructural protein 1) employs a pLxIS motif to target IRF-3 for degradation, but phosphorylation of NSP1 is not required for its activity. These results suggest a concerted mechanism for the recruitment and activation of IRF-3 that can be subverted by viral proteins to evade innate immune responses.
Informative priors based on transcription factor structural class improve de novo motif discovery.
Narlikar, Leelavati; Gordân, Raluca; Ohler, Uwe; Hartemink, Alexander J
2006-07-15
An important problem in molecular biology is to identify the locations at which a transcription factor (TF) binds to DNA, given a set of DNA sequences believed to be bound by that TF. In previous work, we showed that information in the DNA sequence of a binding site is sufficient to predict the structural class of the TF that binds it. In particular, this suggests that we can predict which locations in any DNA sequence are more likely to be bound by certain classes of TFs than others. Here, we argue that traditional methods for de novo motif finding can be significantly improved by adopting an informative prior probability that a TF binding site occurs at each sequence location. To demonstrate the utility of such an approach, we present priority, a powerful new de novo motif finding algorithm. Using data from TRANSFAC, we train three classifiers to recognize binding sites of basic leucine zipper, forkhead, and basic helix loop helix TFs. These classifiers are used to equip priority with three class-specific priors, in addition to a default prior to handle TFs of other classes. We apply priority and a number of popular motif finding programs to sets of yeast intergenic regions that are reported by ChIP-chip to be bound by particular TFs. priority identifies motifs the other methods fail to identify, and correctly predicts the structural class of the TF recognizing the identified binding sites. Supplementary material and code can be found at http://www.cs.duke.edu/~amink/.
DOE Office of Scientific and Technical Information (OSTI.GOV)
B Eckenroth; A Steere; N Chasteen
2011-12-31
Delivery of iron to cells requires binding of two iron-containing human transferrin (hTF) molecules to the specific homodimeric transferrin receptor (TFR) on the cell surface. Through receptor-mediated endocytosis involving lower pH, salt, and an unidentified chelator, iron is rapidly released from hTF within the endosome. The crystal structure of a monoferric N-lobe hTF/TFR complex (3.22-Å resolution) features two binding motifs in the N lobe and one in the C lobe of hTF. Binding of FeNhTF induces global and site-specific conformational changes within the TFR ectodomain. Specifically, movements at the TFR dimer interface appear to prime the TFR to undergo pH-induced movements that alter the hTF/TFR interaction. Iron release from each lobe then occurs by distinctly different mechanisms: Binding of His349 to the TFR (strengthened by protonation at low pH) controls iron release from the C lobe, whereas displacement of one N-lobe binding motif, in concert with the action of the dilysine trigger, elicits iron release from the N lobe. One binding motif in each lobe remains attached to the same α-helix in the TFR throughout the endocytic cycle. Collectively, the structure elucidates how the TFR accelerates iron release from the C lobe, slows it from the N lobe, and stabilizes binding of apohTF for return to the cell surface. Importantly, this structure provides new targets for mutagenesis studies to further understand and define this system.
Bui, Huyen T.; Karren, Mary A.; Bhar, Debjani
2012-01-01
To initiate mitochondrial fission, dynamin-related proteins (DRPs) must bind specific adaptors on the outer mitochondrial membrane. The structural features underlying this interaction are poorly understood. Using yeast as a model, we show that the Insert B domain of the Dnm1 guanosine triphosphatase (a DRP) contains a novel motif required for association with the mitochondrial adaptor Mdv1. Mutation of this conserved motif specifically disrupted Dnm1–Mdv1 interactions, blocking Dnm1 recruitment and mitochondrial fission. Suppressor mutations in Mdv1 that restored Dnm1–Mdv1 interactions and fission identified potential protein-binding interfaces on the Mdv1 β-propeller domain. These results define the first known function for Insert B in DRP–adaptor interactions. Based on the variability of Insert B sequences and adaptor proteins, we propose that Insert B domains and mitochondrial adaptors have coevolved to meet the unique requirements for mitochondrial fission of different organisms. PMID:23148233
Charge splitters and charge transport junctions based on guanine quadruplexes
NASA Astrophysics Data System (ADS)
Sha, Ruojie; Xiang, Limin; Liu, Chaoren; Balaeff, Alexander; Zhang, Yuqi; Zhang, Peng; Li, Yueqi; Beratan, David N.; Tao, Nongjian; Seeman, Nadrian C.
2018-04-01
Self-assembling circuit elements, such as current splitters or combiners at the molecular scale, require the design of building blocks with three or more terminals. A promising material for such building blocks is DNA, wherein multiple strands can self-assemble into multi-ended junctions, and nucleobase stacks can transport charge over long distances. However, nucleobase stacking is often disrupted at junction points, hindering electric charge transport between the two terminals of the junction. Here, we show that a guanine-quadruplex (G4) motif can be used as a connector element for a multi-ended DNA junction. By attaching specific terminal groups to the motif, we demonstrate that charges can enter the structure from one terminal at one end of a three-way G4 motif, and can exit from one of two terminals at the other end with minimal carrier transport attenuation. Moreover, we study four-way G4 junction structures by performing theoretical calculations to assist in the design and optimization of these connectors.
Lathrop, R H; Casale, M; Tobias, D J; Marsh, J L; Thompson, L M
1998-01-01
We describe a prototype system (Poly-X) for assisting an expert user in modeling protein repeats. Poly-X reduces the large number of degrees of freedom required to specify a protein motif in complete atomic detail. The result is a small number of parameters that are easily understood by, and under the direct control of, a domain expert. The system was applied to the polyglutamine (poly-Q) repeat in the first exon of huntingtin, the gene implicated in Huntington's disease. We present four poly-Q structural motifs: two poly-Q beta-sheet motifs (parallel and antiparallel) that constitute plausible alternatives to a similar previously published poly-Q beta-sheet motif, and two novel poly-Q helix motifs (alpha-helix and pi-helix). To our knowledge, helical forms of polyglutamine have not been proposed before. The motifs suggest that there may be several plausible aggregation structures for the intranuclear inclusion bodies which have been found in diseased neurons, and may help in the effort to understand the structural basis for Huntington's disease.
Zhang, Lu; Xu, Jinhao; Ma, Jinbiao
2016-07-25
RNA-binding proteins exert important biological functions by specifically recognizing RNA motifs. SELEX (systematic evolution of ligands by exponential enrichment), an in vitro selection method, can obtain consensus motifs with high affinity and specificity for many target molecules from DNA or RNA libraries. Here, we combined SELEX with next-generation sequencing to study protein-RNA interactions in vitro. A pool of RNAs with 20 bp random sequences was transcribed from a T7 promoter, and the target protein was inserted into a plasmid containing an SBP tag, which can be captured by streptavidin beads. Through only one cycle, the specific RNA motif could be obtained, which dramatically improved the selection efficiency. Using this method, we found that the human hnRNP A1 RRMs domain (UP1 domain) bound RNA motifs containing AGG and AG sequences. An EMSA experiment indicated that the hnRNP A1 RRMs could bind the obtained RNA motif. Taken together, this method provides a rapid and effective way to study the RNA-binding specificity of proteins.
Allergen cross reactions: a problem greater than ever thought?
Pfiffner, P; Truffer, R; Matsson, P; Rasi, C; Mari, A; Stadler, B M
2010-12-01
Cross reactions are an often observed phenomenon in patients with allergy. Sensitization against some allergens may cause reactions against other seemingly unrelated allergens. Today, cross reactions are being investigated on a per-case basis, analyzing blood serum specific IgE (sIgE) levels and clinical features of patients suffering from cross reactions. In this study, we evaluated the level of sIgE compared to patients' total IgE assuming epitope specificity is a consequence of sequence similarity. Our objective was to evaluate our recently published model of molecular sequence similarities underlying cross reactivity using serum-derived data from IgE determinations of standard laboratory tests. We calculated the probabilities of protein cross reactivity based on conserved sequence motifs and compared these in silico predictions to a database consisting of 5362 sera with sIgE determinations. Cumulating sIgE values of a patient resulted in a median of 25-30% total IgE. Comparing motif cross reactivity predictions to sIgE levels showed that on average three times fewer motifs than extracts were recognized in a given serum (correlation coefficient: 0.967). Extracts belonging to the same motif group co-reacted in a high percentage of sera (up to 80% for some motifs). Cumulated sIgE levels are exaggerated because of a high level of observed cross reactions. Thus, not only bioinformatic prediction of allergenic motifs, but also serological routine testing of allergic patients implies that the immune system may recognize only a small number of allergenic structures. © 2010 John Wiley & Sons A/S.
Song, Wen; Liu, Li; Wang, Jizong; Wu, Zhen; Zhang, Heqiao; Tang, Jiao; Lin, Guangzhong; Wang, Yichuan; Wen, Xing; Li, Wenyang; Han, Zhifu; Guo, Hongwei; Chai, Jijie
2016-06-01
Peptide-mediated cell-to-cell signaling has crucial roles in coordination and definition of cellular functions in plants. Peptide-receptor matching is important for understanding the mechanisms underlying peptide-mediated signaling. Here we report the structure-guided identification of root meristem growth factor (RGF) receptors important for plant development. An assay based on a signature ligand recognition motif (Arg-x-Arg) conserved in a subfamily of leucine-rich repeat receptor kinases (LRR-RKs) identified the functionally uncharacterized LRR-RK At4g26540 as a receptor of RGF1 (RGFR1). We further solved the crystal structure of RGF1 in complex with the LRR domain of RGFR1 at a resolution of 2.6 Å, which reveals that the Arg-x-Gly-Gly (RxGG) motif is responsible for specific recognition of the sulfate group of RGF1 by RGFR1. Based on the RxGG motif, we identified additional four RGFRs. Participation of the five RGFRs in RGF-induced signaling is supported by biochemical and genetic data. We also offer evidence showing that SERKs function as co-receptors for RGFs. Taken together, our study identifies RGF receptors and co-receptors that can link RGF signals with their downstream components and provides a proof of principle for structure-based matching of LRR-RKs with their peptide ligands.
Tlatli, Rym; Nozach, Hervé; Collet, Guillaume; Beau, Fabrice; Vera, Laura; Stura, Enrico; Dive, Vincent; Cuniasse, Philippe
2013-01-01
Artificial miniproteins that are able to target catalytic sites of matrix metalloproteinases (MMPs) were designed using a functional motif-grafting approach. The motif corresponded to the four N-terminal residues of TIMP-2, a broad-spectrum protein inhibitor of MMPs. Scaffolds that are able to reproduce the functional topology of this motif were obtained by exhaustive screening of the Protein Data Bank (PDB) using STAMPS software (search for three-dimensional atom motifs in protein structures). Ten artificial protein binders were produced. The designed proteins bind catalytic sites of MMPs with affinities ranging from 450 nM to 450 μM prior to optimization. The crystal structure of one artificial binder in complex with the catalytic domain of MMP-12 showed that the inter-molecular interactions established by the functional motif in the artificial binder corresponded to those found in the MMP-14-TIMP-2 complex, albeit with some differences in geometry. Molecular dynamics simulations of the ten binders in complex with MMP-14 suggested that these scaffolds may allow partial reproduction of native inter-molecular interactions, but differences in geometry and stability may contribute to the lower affinity of the artificial protein binders compared to the natural protein binder. Nevertheless, these results show that the in silico design method used provides sets of protein binders that target a specific binding site with a good rate of success. This approach may constitute the first step of an efficient hybrid computational/experimental approach to protein binder design. © 2012 The Authors Journal compilation © 2012 FEBS.
Manczyk, Noah; Yates, Bradley P; Veggiani, Gianluca; Ernst, Andreas; Sicheri, Frank; Sidhu, Sachdev S
2017-05-01
Ubiquitin interacting motifs (UIMs) are short α-helices found in a number of eukaryotic proteins. UIMs interact weakly but specifically with ubiquitin conjugated to other proteins, and in so doing, mediate specific cellular signals. Here we used phage display to generate ubiquitin variants (UbVs) targeting the N-terminal UIM of the yeast Vps27 protein. Selections yielded UbV.v27.1, which recognized the cognate UIM with high specificity relative to other yeast UIMs and bound with an affinity more than two orders of magnitude higher than that of ubiquitin. Structural and mutational studies of the UbV.v27.1-UIM complex revealed the molecular details for the enhanced affinity and specificity of UbV.v27.1, and underscored the importance of changes at the binding interface as well as at positions that do not contact the UIM. Our study highlights the power of the phage display approach for selecting UbVs with unprecedented affinity and high selectivity for particular α-helical UIM domains within proteomes, and it establishes a general approach for the development of inhibitors targeting interactions of this type. © 2017 The Protein Society.
Combinatorial Histone Acetylation Patterns Are Generated by Motif-Specific Reactions.
Blasi, Thomas; Feller, Christian; Feigelman, Justin; Hasenauer, Jan; Imhof, Axel; Theis, Fabian J; Becker, Peter B; Marr, Carsten
2016-01-27
Post-translational modifications (PTMs) are pivotal to cellular information processing, but how combinatorial PTM patterns ("motifs") are set remains elusive. We develop a computational framework, which we provide as open source code, to investigate the design principles generating the combinatorial acetylation patterns on histone H4 in Drosophila melanogaster. We find that models assuming purely unspecific or lysine site-specific acetylation rates were insufficient to explain the experimentally determined motif abundances. Rather, these abundances were best described by an ensemble of models with acetylation rates that were specific to motifs. The model ensemble converged upon four acetylation pathways; we validated three of these using independent data from a systematic enzyme depletion study. Our findings suggest that histone acetylation patterns originate through specific pathways involving motif-specific acetylation activity. Copyright © 2016 Elsevier Inc. All rights reserved.
Mining protein loops using a structural alphabet and statistical exceptionality
2010-01-01
Background Protein loops encompass 50% of protein residues in available three-dimensional structures. These regions are often involved in protein functions, e.g. binding sites and catalytic pockets. However, describing protein loops with conventional tools is not an easy task. Regular secondary structures, helices and strands, have been widely studied whereas loops, because they are highly variable in terms of sequence and structure, are difficult to analyze. Due to data sparsity, long loops have rarely been systematically studied. Results We developed a simple and accurate method that allows the description and analysis of the structures of short and long loops using structural motifs without restriction on loop length. This method is based on the structural alphabet HMM-SA. HMM-SA allows the simplification of a three-dimensional protein structure into a one-dimensional string of states, where each state is a four-residue prototype fragment, called a structural letter. The difficult task of the structural grouping of huge data sets is thus easily accomplished by handling structural letter strings as in conventional protein sequence analysis. We systematically extracted all seven-residue fragments in a bank of 93000 protein loops and grouped them according to the structural-letter sequence, named a structural word. This approach permits a systematic analysis of loops of all sizes since we consider the structural motifs of seven residues rather than complete loops. We focused the analysis on highly recurrent words of loops (observed more than 30 times). Our study reveals that 73% of loop-lengths are covered by only 3310 highly recurrent structural words (out of 28274 observed words). These structural words have low structural variability (mean RMSD of 0.85 Å). As expected, half of these motifs display a flanking-region preference but interestingly, two thirds are shared by short (less than 12 residues) and long loops.
Moreover, half of recurrent motifs exhibit a significant level of amino-acid conservation with at least four significant positions and 87% of long loops contain at least one such word. We complement our analysis with the detection of statistically over-represented patterns of structural letters as in conventional DNA sequence analysis. About 30% (930) of structural words are over-represented, and cover about 40% of loop lengths. Interestingly, these words exhibit lower structural variability and higher sequential specificity, suggesting structural or functional constraints. Conclusions We developed a method to systematically decompose and study protein loops using recurrent structural motifs. This method is based on the structural alphabet HMM-SA and not on structural alignment and geometrical parameters. We extracted meaningful structural motifs that are found in both short and long loops. To our knowledge, it is the first time that pattern mining helps to increase the signal-to-noise ratio in protein loops. This finding helps to better describe protein loops and might permit to decrease the complexity of long-loop analysis. Detailed results are available at http://www.mti.univ-paris-diderot.fr/publication/supplementary/2009/ACCLoop/. PMID:20132552
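The word-counting step described in this abstract can be sketched as follows. This is a minimal illustration and not the authors' pipeline: it assumes each loop is already encoded as a string of structural letters, that consecutive letters describe four-residue fragments shifted by one residue (so four letters span seven residues), and the function names and toy threshold are hypothetical. The statistical over-representation test is not reproduced here.

```python
from collections import Counter

WORD_LEN = 4  # four overlapping four-residue letters span seven residues


def count_words(loops, word_len=WORD_LEN):
    """Count every structural word (letter k-mer) across all encoded loops."""
    counts = Counter()
    for loop in loops:
        for i in range(len(loop) - word_len + 1):
            counts[loop[i:i + word_len]] += 1
    return counts


def recurrent_words(counts, min_count):
    """Keep words observed more than min_count times."""
    return {word: n for word, n in counts.items() if n > min_count}


def coverage(loops, words, word_len=WORD_LEN):
    """Fraction of letter positions covered by at least one selected word."""
    covered = 0
    total = 0
    for loop in loops:
        mask = [False] * len(loop)
        for i in range(len(loop) - word_len + 1):
            if loop[i:i + word_len] in words:
                for j in range(i, i + word_len):
                    mask[j] = True
        covered += sum(mask)
        total += len(loop)
    return covered / total if total else 0.0


if __name__ == "__main__":
    toy_loops = ["ABCDE", "ABCDX", "QABCD"]  # toy letter strings, not real data
    counts = count_words(toy_loops)
    frequent = recurrent_words(counts, min_count=2)
    print(frequent, coverage(toy_loops, frequent))
```

On real data the threshold would be the "more than 30 times" criterion from the abstract, and the coverage figure would correspond to the reported share of loop length covered by recurrent words.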
Mining protein loops using a structural alphabet and statistical exceptionality.
Regad, Leslie; Martin, Juliette; Nuel, Gregory; Camproux, Anne-Claude
2010-02-04
Protein loops encompass 50% of protein residues in available three-dimensional structures. These regions are often involved in protein functions, e.g. binding sites and catalytic pockets. However, describing protein loops with conventional tools is not an easy task. Regular secondary structures, helices and strands, have been widely studied whereas loops, because they are highly variable in terms of sequence and structure, are difficult to analyze. Due to data sparsity, long loops have rarely been systematically studied. We developed a simple and accurate method that allows the description and analysis of the structures of short and long loops using structural motifs without restriction on loop length. This method is based on the structural alphabet HMM-SA. HMM-SA allows the simplification of a three-dimensional protein structure into a one-dimensional string of states, where each state is a four-residue prototype fragment, called a structural letter. The difficult task of the structural grouping of huge data sets is thus easily accomplished by handling structural letter strings as in conventional protein sequence analysis. We systematically extracted all seven-residue fragments in a bank of 93000 protein loops and grouped them according to the structural-letter sequence, named a structural word. This approach permits a systematic analysis of loops of all sizes since we consider the structural motifs of seven residues rather than complete loops. We focused the analysis on highly recurrent words of loops (observed more than 30 times). Our study reveals that 73% of loop-lengths are covered by only 3310 highly recurrent structural words (out of 28274 observed words). These structural words have low structural variability (mean RMSD of 0.85 Å). As expected, half of these motifs display a flanking-region preference but interestingly, two thirds are shared by short (less than 12 residues) and long loops.
Moreover, half of recurrent motifs exhibit a significant level of amino-acid conservation with at least four significant positions and 87% of long loops contain at least one such word. We complement our analysis with the detection of statistically over-represented patterns of structural letters as in conventional DNA sequence analysis. About 30% (930) of structural words are over-represented, and cover about 40% of loop lengths. Interestingly, these words exhibit lower structural variability and higher sequential specificity, suggesting structural or functional constraints. We developed a method to systematically decompose and study protein loops using recurrent structural motifs. This method is based on the structural alphabet HMM-SA and not on structural alignment and geometrical parameters. We extracted meaningful structural motifs that are found in both short and long loops. To our knowledge, it is the first time that pattern mining helps to increase the signal-to-noise ratio in protein loops. This finding helps to better describe protein loops and might permit to decrease the complexity of long-loop analysis. Detailed results are available at http://www.mti.univ-paris-diderot.fr/publication/supplementary/2009/ACCLoop/.
Ngo, Tri Duc; Van Le, Binh; Subramani, Vinod Kumar; Thi Nguyen, Chi My; Lee, Hyun Sook; Cho, Yona; Kim, Kyeong Kyu; Hwang, Hye-Yeon
2015-05-22
Proteins in the haloalkanoic acid dehalogenase (HAD) superfamily, which is one of the largest enzyme families, are generally composed of a catalytic core domain and a cap domain. Although proteins in this family show broad substrate specificities, the mechanisms of their substrate recognition are not well understood. In this study, we identified a new substrate binding motif of HAD proteins from structural and functional analyses, and propose that this motif might be crucial for interacting with hydrophobic rings of substrates. The crystal structure of TON_0338, one of the 17 putative HAD proteins identified in a hyperthermophilic archaeon, Thermococcus onnurineus NA1, was determined as an apo-form at 2.0 Å resolution. In addition, we determined the crystal structure of TON_0338 in complex with Mg(2+) or N-cyclohexyl-2-aminoethanesulfonic acid (CHES) at 1.7 Å resolution. Examination of the apo-form and CHES-bound structures revealed that CHES is sandwiched between Trp58 and Trp61, suggesting that this Trp sandwich might function as a substrate recognition motif. In the phosphatase assay, TON_0338 was shown to have high activity for flavin mononucleotide (FMN), and the docking analysis suggested that the flavin of FMN may interact with Trp58 and Trp61 in a way similar to that observed in the crystal structure. Moreover, the replacement of these tryptophan residues significantly reduced the phosphatase activity for FMN. Our results suggest that WxxW may function as a substrate binding motif in HAD proteins, and expand the diversity of their substrate recognition mode. Copyright © 2015 Elsevier Inc. All rights reserved.
Systematic comparison of the response properties of protein and RNA mediated gene regulatory motifs.
Iyengar, Bharat Ravi; Pillai, Beena; Venkatesh, K V; Gadgil, Chetan J
2017-05-30
We present a framework enabling the dissection of the effects of motif structure (feedback or feedforward), the nature of the controller (RNA or protein), and the regulation mode (transcriptional, post-transcriptional or translational) on the response to a step change in the input. We have used a common model framework for gene expression where both motif structures have an activating input and repressing regulator, with the same set of parameters, to enable a comparison of the responses. We studied the global sensitivity of the system properties, such as steady-state gain, overshoot, peak time, and peak duration, to parameters. We find that, in all motifs, overshoot correlated negatively whereas peak duration varied concavely with peak time. Differences in the other system properties were found to be mainly dependent on the nature of the controller rather than the motif structure. Protein mediated motifs showed a higher degree of adaptation i.e. a tendency to return to baseline levels; in particular, feedforward motifs exhibited perfect adaptation. RNA mediated motifs had a mild regulatory effect; they also exhibited a lower peaking tendency and mean overshoot. Protein mediated feedforward motifs showed higher overshoot and lower peak time compared to the corresponding feedback motifs.
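The kind of comparison described here can be illustrated with a toy step-response simulation. The sketch below is not the authors' model: it assumes a two-variable system with all rate constants set to 1, a regulator r that represses output production through a 1/(1 + r) term, and forward-Euler integration; the function names are hypothetical. In the feedforward variant the input drives the regulator, in the feedback variant the output does.

```python
def simulate(motif, u_before=1.0, u_after=2.0, t_end=20.0, dt=0.001):
    """Euler simulation of the output y after a step in the input u at t = 0.

    Toy equations (all rate constants equal to 1, an assumption):
        dy/dt = u / (1 + r) - y
        dr/dt = u - r   (feedforward)   or   dr/dt = y - r   (feedback)
    """
    # Start from the steady state reached under the pre-step input.
    if motif == "feedforward":
        r = u_before
        y = u_before / (1.0 + r)
    elif motif == "feedback":
        # Steady state satisfies r = y and y * (1 + y) = u.
        y = (-1.0 + (1.0 + 4.0 * u_before) ** 0.5) / 2.0
        r = y
    else:
        raise ValueError("motif must be 'feedforward' or 'feedback'")

    times, ys = [0.0], [y]
    for n in range(1, int(round(t_end / dt)) + 1):
        drive = u_after if motif == "feedforward" else y
        dy = u_after / (1.0 + r) - y
        dr = drive - r
        y += dt * dy
        r += dt * dr
        times.append(n * dt)
        ys.append(y)
    return times, ys


def response_metrics(times, ys):
    """Final value, relative overshoot, and time of the peak."""
    final = ys[-1]
    peak = max(ys)
    return {
        "final": final,
        "overshoot": (peak - final) / final,
        "peak_time": times[ys.index(peak)],
    }


if __name__ == "__main__":
    for motif in ("feedforward", "feedback"):
        print(motif, response_metrics(*simulate(motif)))
```

Global sensitivity analysis, as in the abstract, would repeat such a simulation over many sampled parameter sets; this sketch fixes the parameters to keep the two motif structures directly comparable.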
Prediction of TF target sites based on atomistic models of protein-DNA complexes
Angarica, Vladimir Espinosa; Pérez, Abel González; Vasconcelos, Ana T; Collado-Vides, Julio; Contreras-Moreira, Bruno
2008-01-01
Background The specific recognition of genomic cis-regulatory elements by transcription factors (TFs) plays an essential role in the regulation of coordinated gene expression. Studying the mechanisms determining binding specificity in protein-DNA interactions is thus an important goal. Most current approaches for modeling TF specific recognition rely on the knowledge of large sets of cognate target sites and consider only the information contained in their primary sequence. Results Here we describe a structure-based methodology for predicting sequence motifs starting from the coordinates of a TF-DNA complex. Our algorithm combines information regarding the direct and indirect readout of DNA into an atomistic statistical model, which is used to estimate the interaction potential. We first measure the ability of our method to correctly estimate the binding specificities of eight prokaryotic and eukaryotic TFs that belong to different structural superfamilies. Secondly, the method is applied to two homology models, finding that sampling of interface side-chain rotamers remarkably improves the results. Thirdly, the algorithm is compared with a reference structural method based on contact counts, obtaining comparable predictions for the experimental complexes and more accurate sequence motifs for the homology models. Conclusion Our results demonstrate that atomic-detail structural information can be feasibly used to predict TF binding sites. The computational method presented here is universal and might be applied to other systems involving protein-DNA recognition. PMID:18922190
Hybrid DNA i-motif: Aminoethylprolyl-PNA (pC5) enhance the stability of DNA (dC5) i-motif structure.
Gade, Chandrasekhar Reddy; Sharma, Nagendra K
2017-12-15
This report describes the synthesis of a C-rich sequence, a cytosine pentamer, of aep-PNA and biophysical studies of its formation of a hybrid DNA:aep-PNA i-motif structure with the DNA cytosine pentamer (dC5) under acidic pH conditions. The CD/UV/NMR/ESI-Mass studies strongly support the formation of a stable hybrid DNA i-motif structure with aep-PNA even near acidic conditions. Hence, C-rich aep-PNA cytosine sequences could be considered potential DNA i-motif stabilizing agents under in vivo conditions. Copyright © 2017 Elsevier Ltd. All rights reserved.
Structural Elements Recognized by Abacavir-Induced T Cells
Yerly, Daniel; Pompeu, Yuri Andreiw; Schutte, Ryan J.; Eriksson, Klara. K.; Strhyn, Anette; Bracey, Austin. W.; Buus, Soren; Ostrov, David A.
2017-01-01
Adverse drug reactions are one of the leading causes of morbidity and mortality in health care worldwide. Human leukocyte antigen (HLA) alleles have been strongly associated with drug hypersensitivities, and the causative drugs have been shown to stimulate specific T cells at the sites of autoimmune destruction. The structural elements recognized by drug-specific T cell receptors (TCRs) in vivo are poorly defined. Drug-stimulated T cells express TCRs specific for peptide/HLA complexes, but the characteristics of peptides (sequence, or endogenous or exogenous origin) presented in the context of small molecule drugs are not well studied. Using HLA-B*57:01 mediated hypersensitivity to abacavir as a model system, this study examines structural similarities of HLA presented peptides recognized by drug-specific TCRs. Using the crystal structure of HLA-B*57:01 complexed with abacavir and an immunogenic self peptide, VTTDIQVKV SPT5a 976–984, peptide side chains exhibiting flexibility and solvent exposure were identified as potential drug-specific T cell recognition motifs. Viral sequences with structural motifs similar to the immunogenic self peptide were identified. Abacavir-specific T cell clones were used to determine if virus peptides presented in the context of abacavir stimulate T cell responsiveness. An abacavir-specific T cell clone was stimulated by VTQQAQVRL, corresponding to HSV1/2 230–238, in the context of HLA-B*57:01. These data suggest the T cell polyclonal response to abacavir consists of multiple subsets, including T cells that recognize self peptide/HLA-B*57:01 complexes and crossreact with viral peptide/HLA-B*57:01 complexes due to similarity in TCR contact residues. PMID:28686208
The amylase inhibitor montbretin A reveals a new glycosidase inhibition motif.
Williams, Leslie K; Zhang, Xiaohua; Caner, Sami; Tysoe, Christina; Nguyen, Nham T; Wicki, Jacqueline; Williams, David E; Coleman, John; McNeill, John H; Yuen, Violet; Andersen, Raymond J; Withers, Stephen G; Brayer, Gary D
2015-09-01
The complex plant flavonol glycoside montbretin A is a potent (Ki = 8 nM) and specific inhibitor of human pancreatic α-amylase with potential as a therapeutic for diabetes and obesity. Controlled degradation studies on montbretin A, coupled with inhibition analyses, identified an essential high-affinity core structure comprising the myricetin and caffeic acid moieties linked via a disaccharide. X-ray structural analyses of the montbretin A-human α-amylase complex confirmed the importance of this core structure and revealed a novel mode of glycosidase inhibition wherein internal π-stacking interactions between the myricetin and caffeic acid organize their ring hydroxyls for optimal hydrogen bonding to the α-amylase catalytic residues D197 and E233. This novel inhibitory motif can be reproduced in a greatly simplified analog, offering potential for new strategies for glycosidase inhibition and therapeutic development.
Wobble pairs of the HDV ribozyme play specific roles in stabilization of active site dynamics.
Sripathi, Kamali N; Banáš, Pavel; Réblová, Kamila; Šponer, Jiří; Otyepka, Michal; Walter, Nils G
2015-02-28
The hepatitis delta virus (HDV) is the only known human pathogen whose genome contains a catalytic RNA motif (ribozyme). The overall architecture of the HDV ribozyme is that of a double-nested pseudoknot, with two GU pairs flanking the active site. Although extensive studies have shown that mutation of either wobble results in decreased catalytic activity, little work has focused on linking these mutations to specific structural effects on catalytic fitness. Here we use molecular dynamics simulations based on an activated structure to probe the active site dynamics as a result of wobble pair mutations. In both wild-type and mutant ribozymes, the in-line fitness of the active site (as a measure of catalytic proficiency) strongly depends on the presence of a C75(N3H3+)N1(O5') hydrogen bond, which positions C75 as the general acid for the reaction. Our mutational analyses show that each GU wobble supports catalytically fit conformations in distinct ways; the reverse G25U20 wobble promotes high in-line fitness, high occupancy of the C75(N3H3+)G1(O5') general-acid hydrogen bond and stabilization of the G1U37 wobble, while the G1U37 wobble acts more locally by stabilizing high in-line fitness and the C75(N3H3+)G1(O5') hydrogen bond. We also find that stable type I A-minor and P1.1 hydrogen bonding above and below the active site, respectively, prevent local structural disorder from spreading and disrupting global conformation. Taken together, our results define specific, often redundant architectural roles for several structural motifs of the HDV ribozyme active site, expanding the known roles of these motifs within all HDV-like ribozymes and other structured RNAs.
Wobble Pairs of the HDV Ribozyme Play Specific Roles in Stabilization of Active Site Dynamics
Sripathi, Kamali N.; Banáš, Pavel; Reblova, Kamila; Šponer, Jiři; Otyepka, Michal
2015-01-01
The hepatitis delta virus (HDV) is the only known human pathogen whose genome contains a catalytic RNA motif (ribozyme). The overall architecture of the HDV ribozyme is that of a double-nested pseudoknot, with two GU pairs flanking the active site. Although extensive studies have shown that mutation of either wobble results in decreased catalytic activity, little work has focused on linking these mutations to specific structural effects on catalytic fitness. Here we use molecular dynamics simulations based on an activated structure to probe the active site dynamics as a result of wobble pair mutations. In both wild-type and mutant ribozymes, the in-line fitness of the active site (as a measure of catalytic proficiency) strongly depends on the presence of a C75(N3H3+)N1(O5′) hydrogen bond, which positions C75 as the general acid for the reaction. Our mutational analyses show that each GU wobble supports catalytically fit conformations in distinct ways; the reverse G25U20 wobble promotes high in-line fitness, high occupancy of the C75(N3H3+)G1(O5′) general-acid hydrogen bond and stabilization of the G1U37 wobble, while the G1U37 wobble acts more locally by stabilizing high in-line fitness and the C75(N3H3+)G1(O5′) hydrogen bond. We also find that stable type I A-minor and P1.1 hydrogen bonding above and below the active site, respectively, prevent local structural disorder from spreading and disrupting global conformation. Taken together, our results define specific, often redundant architectural roles for several structural motifs of the HDV ribozyme active site, expanding the known roles of these motifs within all HDV-like ribozymes and other structured RNAs. PMID:25631765
Wang, Lilin; Smith, Dan; Bot, Simona; Dellamary, Luis; Bloom, Amy; Bot, Adrian
2002-01-01
The adaptive immune response is triggered by recognition of T and B cell epitopes and is influenced by “danger” motifs that act via innate immune receptors. This study shows that motifs associated with noncoding RNA are essential features in the immune response reminiscent of viral infection, mediating rapid induction of proinflammatory chemokine expression, recruitment and activation of antigen-presenting cells, modulation of regulatory cytokines, subsequent differentiation of Th1 cells, isotype switching, and stimulation of cross-priming. The heterogeneity of RNA-associated motifs results in differential binding to cellular receptors, and specifically impacts the immune profile. Naturally occurring double-stranded RNA (dsRNA) triggered activation of dendritic cells and enhancement of specific immunity, similar to selected synthetic dsRNA motifs. Based on the ability of specific RNA motifs to block tolerance induction and effectively organize the immune defense during viral infection, we conclude that such RNA species are potent danger motifs. We also demonstrate the feasibility of using selected RNA motifs as adjuvants in the context of novel aerosol carriers for optimizing the immune response to subunit vaccines. In conclusion, RNA-associated motifs produced during viral infection bridge the early response with the late adaptive phase, regulating the activation and differentiation of antigen-specific B and T cells, in addition to a short-term impact on innate immunity. PMID:12393853
The Thiamin Pyrophosphate-Motif
NASA Technical Reports Server (NTRS)
Dominiak, Paulina M.; Ciszak, Ewa M.
2003-01-01
Using databases the authors have identified a common thiamin pyrophosphate (TPP)-motif in the family of functionally diverse TPP-dependent enzymes. This common motif consists of multimeric organization of subunits, two catalytic centers, common amino acid sequence, and specific contacts to provide a flip-flop, or alternate site, mechanism of action. Each catalytic center [PP:PYR] is formed at the interface of the PP-domain binding the magnesium ion, pyrophosphate and aminopyrimidine ring of TPP, and the PYR-domain binding the aminopyrimidine ring of that cofactor. A pair of these catalytic centers constitutes the catalytic core [PP:PYR]* within these enzymes. Analysis of the structural elements of this catalytic core reveals a novel definition of the common amino acid sequences, which are GX@&(G)@XXGQ, and GDGX25-30 within the PP-domain, and the E&(G)@XXG@ within the PYR-domain, where Q, corresponds to a hydrophobic amino acid. This TPP-motif provides a novel tool for annotation of TPP-dependent enzymes useful in advancing functional proteomics.
Ono, K; Ohtomo, T; Sato, S; Sugamata, Y; Suzuki, M; Hisamoto, N; Ninomiya-Tsuji, J; Tsuchiya, M; Matsumoto, K
2001-06-29
TAK1, a member of the MAPKKK family, is involved in the intracellular signaling pathways mediated by transforming growth factor beta, interleukin 1, and Wnt. TAK1 kinase activity is specifically activated by the TAK1-binding protein TAB1. The C-terminal 68-amino acid sequence of TAB1 (TAB1-C68) is sufficient for TAK1 interaction and activation. Analysis of various truncated versions of TAB1-C68 defined a C-terminal 30-amino acid sequence (TAB1-C30) necessary for TAK1 binding and activation. NMR studies revealed that the TAB1-C30 region has a unique alpha-helical structure. We identified a conserved sequence motif, PYVDXA/TXF, in the C-terminal domain of mammalian TAB1, Xenopus TAB1, and its Caenorhabditis elegans homolog TAP-1, suggesting that this motif constitutes a specific TAK1 docking site. Alanine substitution mutagenesis showed that TAB1 Phe-484, located in the conserved motif, is crucial for TAK1 binding and activation. The C. elegans homolog of TAB1, TAP-1, was able to interact with and activate the C. elegans homolog of TAK1, MOM-4. However, the site in TAP-1 corresponding to Phe-484 of TAB1 is an alanine residue (Ala-364), and changing this residue to Phe abrogates the ability of TAP-1 to interact with and activate MOM-4. These results suggest that the Phe or Ala residue within the conserved motif of the TAB1-related proteins is important for interaction with and activation of specific TAK1 MAPKKK family members in vivo.
Insights into Structural and Mechanistic Features of Viral IRES Elements
Martinez-Salas, Encarnacion; Francisco-Velilla, Rosario; Fernandez-Chamorro, Javier; Embarek, Azman M.
2018-01-01
Internal ribosome entry site (IRES) elements are cis-acting RNA regions that promote internal initiation of protein synthesis using cap-independent mechanisms. However, distinct types of IRES elements present in the genome of various RNA viruses perform the same function despite lacking conservation of sequence and secondary RNA structure. Likewise, IRES elements differ in host factor requirement to recruit the ribosomal subunits. In spite of this diversity, evolutionarily conserved motifs in each family of RNA viruses preserve sequences impacting on RNA structure and RNA–protein interactions important for IRES activity. Indeed, IRES elements adopting remarkably different structural organizations contain RNA structural motifs that play an essential role in recruiting ribosomes, initiation factors and/or RNA-binding proteins using different mechanisms. Therefore, given that a universal IRES motif remains elusive, it is critical to understand how diverse structural motifs deliver functions relevant for IRES activity. This will be useful for understanding the molecular mechanisms beyond cap-independent translation, as well as the evolutionary history of these regulatory elements. Moreover, it could improve the accuracy of predicting IRES-like motifs hidden in genome sequences. This review summarizes recent advances on the diversity and biological relevance of RNA structural motifs for viral IRES elements. PMID:29354113
Goudot, Christel; Etchebest, Catherine
2011-01-01
AP-1 proteins are transcription factors (TFs) that belong to the basic leucine zipper family, one of the largest families of TFs in eukaryotic cells. Despite high homology between their DNA binding domains, these proteins are able to recognize diverse DNA motifs. In yeasts, these motifs are referred to as YRE (Yap Response Element) and are either seven (YRE-Overlap) or eight (YRE-Adjacent) base pairs long. It has been proposed that the AP-1 DNA binding motif preference relies on a single change in the amino acid sequence of the yeast AP-1 TFs (an arginine in the YRE-O binding factors being replaced by a lysine in the YRE-A binding Yaps). We developed a computational approach to infer condition-specific transcriptional modules associated with the orthologous AP-1 proteins Yap1p, Cgap1p and Cap1p, in three yeast species: the model yeast Saccharomyces cerevisiae and two pathogenic species Candida glabrata and Candida albicans. Exploitation of these modules in terms of predictions of the protein/DNA regulatory interactions changed our vision of AP-1 protein evolution. Cis-regulatory motif analyses revealed the presence of a conserved adenine in 5′ position of the canonical YRE sites. While Yap1p, Cgap1p and Cap1p shared a remarkably low number of target genes, an impressive conservation was observed in the YRE sequences identified by Yap1p and Cap1p. In Candida glabrata, we found that Cgap1p, unlike Yap1p and Cap1p, recognizes YRE-O and YRE-A motifs. These findings were supported by structural data available for the transcription factor Pap1p (Schizosaccharomyces pombe). Thus, whereas arginine and lysine substitutions in Cgap1p and Yap1p proteins were reported as responsible for a specific YRE-O or YRE-A preference, our analyses rather suggest that the ancestral yeast AP-1 protein could recognize both YRE-O and YRE-A motifs and that the arginine/lysine exchange is not the only determinant of the specialization of modern Yaps for one motif or another. PMID:21695268
A new subfamily LIP of the major intrinsic proteins.
Khabudaev, Kirill Vladimirovich; Petrova, Darya Petrovna; Grachev, Mikhail Aleksandrovich; Likhoshway, Yelena Valentinovna
2014-03-04
Proteins of the major intrinsic protein (MIP) family, or aquaporins, have been detected in almost all organisms. These proteins are important in cells and organisms because they allow for passive transmembrane transport of water and other small, uncharged polar molecules. We compared the predicted amino acid sequences of 20 MIPs from several algae species of the phylum Heterokontophyta (Kingdom Chromista) with the sequences of MIPs from other organisms. Multiple sequence alignments revealed motifs that were homologous to functionally important NPA motifs and the so-called ar/R-selective filter of glyceroporins and aquaporins. The MIP sequences of the studied chromists fell into several clusters that belonged to different groups of MIPs from a wide variety of organisms from different Kingdoms. Two of these proteins belong to Plasma membrane intrinsic proteins (PIPs), four of them belong to GlpF-like intrinsic proteins (GIPs), and one of them belongs to a specific MIPE subfamily from green algae. Three proteins belong to the unclassified MIPs, two of which are of bacterial origin. Eight of the studied MIPs contain an NPM-motif in place of the second conserved NPA-motif typical of the majority of MIPs. The MIPs of heterokonts within all detected clusters can differ from other MIPs in the same cluster regarding the structure of the ar/R-selective filter and other generally conserved motifs. We proposed placing nine MIPs from heterokonts into a new group, which we have named the LIPs (large intrinsic proteins). The possible substrate specificities of the studied MIPs are discussed.
The evolution of function within the Nudix homology clan
Srouji, John R.; Xu, Anting; Park, Annsea; Kirsch, Jack F.
2017-01-01
The Nudix homology clan encompasses over 80,000 protein domains from all three domains of life, defined by homology to each other. Proteins with a domain from this clan fall into four general functional classes: pyrophosphohydrolases, isopentenyl diphosphate isomerases (IDIs), adenine/guanine mismatch‐specific adenine glycosylases (A/G‐specific adenine glycosylases), and nonenzymatic activities such as protein/protein interaction and transcriptional regulation. The largest group, pyrophosphohydrolases, encompasses more than 100 distinct hydrolase specificities. To understand the evolution of this vast number of activities, we assembled and analyzed experimental and structural data for 205 Nudix proteins collected from the literature. We corrected erroneous functions or provided more appropriate descriptions for 53 annotations described in the Gene Ontology Annotation database in this family, and propose 275 new experimentally‐based annotations. We manually constructed a structure‐guided sequence alignment of 78 Nudix proteins. Using the structural alignment as a seed, we then made an alignment of 347 “select” Nudix homology domains, curated from structurally determined, functionally characterized, or phylogenetically important Nudix domains. Based on our review of Nudix pyrophosphohydrolase structures and specificities, we further analyzed a loop region downstream of the Nudix hydrolase motif previously shown to contact the substrate molecule and possess known functional motifs. This loop region provides a potential structural basis for the functional radiation and evolution of substrate specificity within the hydrolase family. Finally, phylogenetic analyses of the 347 select protein domains and of the complete Nudix homology clan revealed general monophyly with regard to function and a few instances of probable homoplasy. Proteins 2017; 85:775–811. © 2016 Wiley Periodicals, Inc. PMID:27936487
Mueller, Benjamin K.; Subramaniam, Sabareesh; Senes, Alessandro
2014-01-01
Carbon hydrogen bonds between Cα–H donors and carbonyl acceptors are frequently observed between transmembrane helices (Cα–H···O=C). Networks of these interactions occur often at helix−helix interfaces mediated by GxxxG and similar patterns. Cα–H hydrogen bonds have been hypothesized to be important in membrane protein folding and association, but evidence that they are major determinants of helix association is still lacking. Here we present a comprehensive geometric analysis of homodimeric helices that demonstrates the existence of a single region in conformational space with high propensity for Cα–H···O=C hydrogen bond formation. This region corresponds to the most frequent motif for parallel dimers, GASright, whose best-known example is glycophorin A. The finding suggests a causal link between the high frequency of occurrence of GASright and its propensity for carbon hydrogen bond formation. Investigation of the sequence dependency of the motif determined that Gly residues are required at specific positions where only Gly can act as a donor with its “side chain” Hα. Gly also reduces the steric barrier for non-Gly amino acids at other positions to act as Cα donors, promoting the formation of cooperative hydrogen bonding networks. These findings offer a structural rationale for the occurrence of GxxxG patterns at the GASright interface. The analysis identified the conformational space and the sequence requirement of Cα–H···O=C mediated motifs; we took advantage of these results to develop a structural prediction method. The resulting program, CATM, predicts ab initio the known high-resolution structures of homodimeric GASright motifs at near-atomic level. PMID:24569864
A Feature-Based Approach to Modeling Protein–DNA Interactions
Segal, Eran
2008-01-01
Transcription factor (TF) binding to its DNA target site is a fundamental regulatory interaction. The most common model used to represent TF binding specificities is a position specific scoring matrix (PSSM), which assumes independence between binding positions. However, in many cases, this simplifying assumption does not hold. Here, we present feature motif models (FMMs), a novel probabilistic method for modeling TF–DNA interactions, based on log-linear models. Our approach uses sequence features to represent TF binding specificities, where each feature may span multiple positions. We develop the mathematical formulation of our model and devise an algorithm for learning its structural features from binding site data. We also developed a discriminative motif finder, which discovers de novo FMMs that are enriched in target sets of sequences compared to background sets. We evaluate our approach on synthetic data and on the widely used TF chromatin immunoprecipitation (ChIP) dataset of Harbison et al. We then apply our algorithm to high-throughput TF ChIP data from mouse and human, reveal sequence features that are present in the binding specificities of mouse and human TFs, and show that FMMs explain TF binding significantly better than PSSMs. Our FMM learning and motif finder software are available at http://genie.weizmann.ac.il/. PMID:18725950
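The independence assumption behind a PSSM, which the abstract above contrasts with feature motif models, can be illustrated with a minimal sketch. This is not the authors' FMM software: the function names, the pseudocount, the uniform background, and the toy binding sites are all assumptions made for illustration.

```python
import math

def build_pssm(sites, alphabet="ACGT", pseudocount=1.0):
    """Build a log-odds PSSM from equal-length aligned sites.

    Assumes a uniform background; both the pseudocount and the
    background are illustrative choices, not taken from the paper.
    """
    background = 1.0 / len(alphabet)
    pssm = []
    for pos in range(len(sites[0])):
        counts = {base: pseudocount for base in alphabet}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        pssm.append({b: math.log2((counts[b] / total) / background)
                     for b in alphabet})
    return pssm

def score(pssm, seq):
    # Independence assumption: the score of a site is simply the sum
    # of per-position scores, so no term can couple two positions.
    return sum(column[base] for column, base in zip(pssm, seq))

# Hypothetical toy binding sites (not from any dataset in the paper).
sites = ["TGACTCA", "TGAGTCA", "TGACTCA", "TTACTAA"]
pssm = build_pssm(sites)
```

Because the score is a plain sum over columns, a PSSM cannot reward or penalize a particular pair of bases occurring together; features spanning multiple positions, as described in the abstract, are what lift that restriction.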
Gorelik, Maryna; Davidson, Alan R
2012-03-16
The yeast Nbp2p SH3 and Bem1p SH3b domains bind certain target peptides with similar high affinities, yet display vastly different affinities for other targets. To investigate this unusual behavior, we have solved the structure of the Nbp2p SH3-Ste20 peptide complex and compared it with the previously determined structure of the Bem1p SH3b bound to the same peptide. Although the Ste20 peptide interacts with both domains in a structurally similar manner, extensive in vitro studies with domain and peptide mutants revealed large variations in interaction strength across the binding interface of the two complexes. Whereas the Nbp2p SH3 made stronger contacts with the peptide core RXXPXXP motif, the Bem1p SH3b domain made stronger contacts with residues flanking the core motif. Remarkably, this modulation of local binding energetics can explain the distinct and highly nuanced binding specificities of these two domains.
Mohtar, M Aiman; Hernychova, Lenka; O'Neill, J Robert; Lawrence, Melanie L; Murray, Euan; Vojtesek, Borek; Hupp, Ted R
2018-04-01
AGR2 is an oncogenic endoplasmic reticulum (ER)-resident protein disulfide isomerase. AGR2 protein has a relatively unique property for a chaperone in that it can bind sequence-specifically to a specific peptide motif (TTIYY). A synthetic TTIYY-containing peptide column was used to affinity-purify AGR2 from crude lysates highlighting peptide selectivity in complex mixtures. Hydrogen-deuterium exchange mass spectrometry localized the dominant region in AGR2 that interacts with the TTIYY peptide to within a structural loop from amino acids 131-135 (VDPSL). A peptide binding site consensus of Tx[IL][YF][YF] was developed for AGR2 by measuring its activity against a mutant peptide library. Screening the human proteome for proteins harboring this motif revealed an enrichment in transmembrane proteins and we focused on validating EpCAM as a potential AGR2-interacting protein. AGR2 and EpCAM proteins formed a dose-dependent protein-protein interaction in vitro. Proximity ligation assays demonstrated that endogenous AGR2 and EpCAM protein associate in cells. Introducing a single alanine mutation in EpCAM at Tyr251 attenuated its binding to AGR2 in vitro and in cells. Hydrogen-deuterium exchange mass spectrometry was used to identify a stable binding site for AGR2 on EpCAM, adjacent to the TLIYY motif and surrounding EpCAM's detergent binding site. These data define a dominant site on AGR2 that mediates its specific peptide-binding function. EpCAM forms a model client protein for AGR2 to study how an ER-resident chaperone can dock specifically to a peptide motif and regulate the trafficking of a protein destined for the secretory pathway. © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.
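A consensus such as Tx[IL][YF][YF] maps directly onto a regular expression, which is one simple way a proteome screen like the one described above could be sketched. The abstract does not say how the authors performed their screen, so the code below is only an assumed illustration; the function name and the toy sequence are hypothetical and the sequence is not EpCAM.

```python
import re

# Tx[IL][YF][YF] written as a regex: "x" (any residue) becomes ".".
# The lookahead lets overlapping matches be reported as well.
AGR2_CONSENSUS = re.compile(r"(?=(T.[IL][YF][YF]))")

def find_motif(sequence):
    """Return (1-based start position, matched peptide) pairs."""
    return [(m.start() + 1, m.group(1))
            for m in AGR2_CONSENSUS.finditer(sequence)]

# Hypothetical toy sequence containing both TTIYY and TLIYY.
toy_sequence = "MKATTIYYQPLTLIYYAG"
hits = find_motif(toy_sequence)
print(hits)  # [(4, 'TTIYY'), (12, 'TLIYY')]
```

A pattern this short will match many proteins by chance, which is consistent with the abstract's emphasis on experimental validation of candidates.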
Lozano, José Manuel; Lesmes, Liliana P; Carreño, Luisa F; Gallego, Gina M; Patarroyo, Manuel Elkin
2010-12-06
Synthetic vaccines constitute the most promising tools for controlling and preventing infectious diseases. When synthetic immunogens are designed from the pathogen native sequences, these are normally poorly immunogenic and do not induce protection, as demonstrated in our research. After attempting many synthetic strategies for improving the immunogenicity properties of these sequences, the approach consisting of identifying high binding motifs present in those, and then performing specific changes on amino-acids belonging to such motifs, has proven to be a workable strategy. In addition, other strategies consisting of chemically introducing non-natural constraints to the backbone topology of the molecule and modifying the α-carbon asymmetry are becoming valuable tools to be considered in this pursuit. Non-natural structural constraints to the peptide backbone can be achieved by introducing peptide bond isosteres such as reduced amides, partially retro or retro-inverso modifications or even including urea motifs. The second can be obtained by strategically replacing L-amino-acids with their enantiomeric forms for obtaining both structurally site-directed designed immunogens as potential vaccine candidates and their Ig structural molecular images, both having immuno-therapeutic effects for preventing and controlling malaria.
ELM: the status of the 2010 eukaryotic linear motif resource
Gould, Cathryn M.; Diella, Francesca; Via, Allegra; Puntervoll, Pål; Gemünd, Christine; Chabanis-Davidson, Sophie; Michael, Sushama; Sayadi, Ahmed; Bryne, Jan Christian; Chica, Claudia; Seiler, Markus; Davey, Norman E.; Haslam, Niall; Weatheritt, Robert J.; Budd, Aidan; Hughes, Tim; Paś, Jakub; Rychlewski, Leszek; Travé, Gilles; Aasland, Rein; Helmer-Citterich, Manuela; Linding, Rune; Gibson, Toby J.
2010-01-01
Linear motifs are short segments of multidomain proteins that provide regulatory functions independently of protein tertiary structure. Much of intracellular signalling passes through protein modifications at linear motifs. Many thousands of linear motif instances, most notably phosphorylation sites, have now been reported. Although clearly very abundant, linear motifs are difficult to predict de novo in protein sequences due to the difficulty of obtaining robust statistical assessments. The ELM resource at http://elm.eu.org/ provides an expanding knowledge base, currently covering 146 known motifs, with annotation that includes >1300 experimentally reported instances. ELM is also an exploratory tool for suggesting new candidates of known linear motifs in proteins of interest. Information about protein domains, protein structure and native disorder, cellular and taxonomic contexts is used to reduce or deprecate false positive matches. Results are graphically displayed in a ‘Bar Code’ format, which also displays known instances from homologous proteins through a novel ‘Instance Mapper’ protocol based on PHI-BLAST. ELM server output provides links to the ELM annotation as well as to a number of remote resources. Using the links, researchers can explore the motifs, proteins, complex structures and associated literature to evaluate whether candidate motifs might be worth experimental investigation. PMID:19920119
DLocalMotif: a discriminative approach for discovering local motifs in protein sequences.
Mehdi, Ahmed M; Sehgal, Muhammad Shoaib B; Kobe, Bostjan; Bailey, Timothy L; Bodén, Mikael
2013-01-01
Local motifs are patterns of DNA or protein sequences that occur within a sequence interval relative to a biologically defined anchor or landmark. Current protein motif discovery methods do not adequately consider such constraints to identify biologically significant motifs that are only weakly over-represented but spatially confined. Using negatives, i.e. sequences known to not contain a local motif, can further increase the specificity of their discovery. This article introduces the method DLocalMotif that makes use of positional information and negative data for local motif discovery in protein sequences. DLocalMotif combines three scoring functions, measuring degrees of motif over-representation, entropy and spatial confinement, specifically designed to discriminatively exploit the availability of negative data. The method is shown to outperform current methods that use only a subset of these motif characteristics. We apply the method to several biological datasets. The analysis of peroxisomal targeting signals uncovers several novel motifs that occur immediately upstream of the dominant peroxisomal targeting signal-1 signal. The analysis of proline-tyrosine nuclear localization signals uncovers multiple novel motifs that overlap with C2H2 zinc finger domains. We also evaluate the method on classical nuclear localization signals and endoplasmic reticulum retention signals and find that DLocalMotif successfully recovers biologically relevant sequence properties. http://bioinf.scmb.uq.edu.au/dlocalmotif/
Computational study of stability of an H-H-type pseudoknot motif.
Wang, Jun; Zhao, Yunjie; Wang, Jian; Xiao, Yi
2015-12-01
Motifs in RNA tertiary structures are important to their structural organizations and biological functions. Here we consider an H-H-type pseudoknot (HHpk) motif that consists of two hairpins connected by a junction loop and with kissing interactions between the two hairpin loops. Such a tertiary structural motif is recurrently found in RNA tertiary structures, but is difficult to predict computationally. So it is important to understand the mechanism of its formation and stability. Here we investigate the stability of the HHpk tertiary structure by using an all-atom molecular dynamics simulation. The results indicate that the HHpk tertiary structure is stable. However, it is found that this stability is not due to the helix-helix packing, as is usually expected, but is maintained by the combined action of the kissing hairpin loops and junctions, although the former plays the main role. Stable HHpk motifs may form structural platforms for the molecules to realize their biological functions. These results are useful for understanding the construction principle of RNA tertiary structures and structure prediction.
Bhagavat, Raghu; Srinivasan, Narayanaswamy; Chandra, Nagasuma
2017-09-01
Nucleoside triphosphate (NTP) ligands are of high biological importance and are essential for all life forms. A pre-requisite for them to participate in diverse biochemical processes is their recognition by diverse proteins. It is thus of great interest to understand the basis for such recognition in different proteins. Towards this, we have used a structural bioinformatics approach and analyzed the structures of 4677 NTP complexes available in the Protein Data Bank (PDB). Binding sites were extracted and compared exhaustively using PocketMatch, a sensitive in-house site comparison algorithm, which resulted in grouping the entire dataset into 27 site-types. Each of these site-types represents a structural motif comprised of two or more residue conservations, derived using another in-house tool for superposing binding sites, PocketAlign. The 27 site-types could be grouped further into 9 super-types by considering partial similarities in the sites, which indicated that the individual site-types comprise different combinations of one or more site features. A scan across PDB using the 27 structural motifs determined the motifs to be specific to NTP binding sites, and a computational alanine mutagenesis indicated that residues identified to be highly conserved in the motifs also contribute most to binding. Alternate orientations of the ligand in several site-types were observed and rationalized, indicating the possibility of some residues serving as anchors for NTP recognition. The presence of multiple site-types and the grouping of multiple folds into each site-type is strongly suggestive of convergent evolution. Knowledge of determinants obtained from this study will be useful for detecting function in unknown proteins. Proteins 2017; 85:1699-1712. © 2017 Wiley Periodicals, Inc.
Comparative genomics of pyridoxal 5′-phosphate-dependent transcription factor regulons in Bacteria
Suvorova, Inna A.
2016-01-01
The MocR-subfamily transcription factors (MocR-TFs) characterized by the GntR-family DNA-binding domain and aminotransferase-like sensory domain are broadly distributed among certain lineages of Bacteria. Characterized MocR-TFs bind pyridoxal 5′-phosphate (PLP) and control transcription of genes involved in PLP, gamma aminobutyric acid (GABA) and taurine metabolism via binding specific DNA operator sites. To identify putative target genes and DNA binding motifs of MocR-TFs, we performed comparative genomics analysis of over 250 bacterial genomes. The reconstructed regulons for 825 MocR-TFs comprise structural genes from over 200 protein families involved in diverse biological processes. Using the genome context and metabolic subsystem analysis we tentatively assigned functional roles for 38 out of 86 orthologous groups of studied regulators. Most of these MocR-TF regulons are involved in PLP metabolism, as well as utilization of GABA, taurine and ectoine. The remaining studied MocR-TF regulators presumably control genes encoding enzymes involved in reduction/oxidation processes, various transporters and PLP-dependent enzymes, for example aminotransferases. Predicted DNA binding motifs of MocR-TFs are generally similar in each orthologous group and are characterized by two to four repeated sequences. Identified motifs were classified according to their structures. Motifs with direct and/or inverted repeat symmetry constitute the majority of inferred DNA motifs, suggesting preferable TF dimerization in head-to-tail or head-to-head configuration. The obtained genomic collection of in silico reconstructed MocR-TF motifs and regulons in Bacteria provides a basis for future experimental characterization of molecular mechanisms for various regulators in this family. PMID:28348826
NASA Astrophysics Data System (ADS)
Zhang, Liyuan; Fan, Denggui; Wang, Qingyun
2018-06-01
Studies on the structural-functional connectomes of the human brain have demonstrated the existence of synchronous firings in a specific brain network motif. In particular, synchronization of high-frequency oscillations (HFOs) has been observed in the experimental data sets of temporal lobe epilepsy (TLE). In addition, clinical and experimental evidence has accumulated demonstrating the effect of electrical stimulation on TLE, which, however, remains largely unexplored. In this work, we first employ our previously proposed dentate gyrus (DG)-CA3 network model to investigate the influence of an external electrical stimulus on the HFO transitions. The results indicate that the reinforcing stimulus can induce the HFO transitions of the DG-CA3 system from the gamma band to the fast ripples band. Along with that, the consistent oscillations of neurons within DG-CA3 can also be enhanced as the stimulus increases. Then, we expand into a simple motif of three coupled DG-CA3 systems in both the feedforward inhibition and feedback inhibition connections, to investigate the synchronous evolutions of HFOs by regulating both the stimulation strength and inhibitory function. It is shown that the comprehensive effects, which lead to band transition, are independent of the motif configurations. The enhanced external electrical stimulus weakens the synchronism and correlation of connected motifs. In contrast, we demonstrate that the increased inhibitory coupling could facilitate correlation to some extent. Overall, our work highlights the possible origin of synchronous HFOs of hippocampal motifs governed by external inputs and inhibitory connection, which might contribute to a better understanding of the interplay between synchronization dynamics and epileptic structure in the human brain.
Shi, Wei-Wei; Tang, Yun-Sang; Sze, See-Yuen; Zhu, Zhen-Ning; Wong, Kam-Bo; Shaw, Pang-Chui
2016-10-13
Ricin is a type 2 ribosome-inactivating protein (RIP), containing a catalytic A chain and a lectin-like B chain. It inhibits protein synthesis by depurinating the N-glycosidic bond at α-sarcin/ricin loop (SRL) of the 28S rRNA, which thereby prevents the binding of elongation factors to the GTPase activation center of the ribosome. Here, we present the 1.6 Å crystal structure of Ricin A chain (RTA) complexed to the C-terminal peptide of the ribosomal stalk protein P2, which plays a crucial role in specific recognition of elongation factors and recruitment of eukaryote-specific RIPs to the ribosomes. Our structure reveals that the C-terminal GFGLFD motif of P2 peptide is inserted into a hydrophobic pocket of RTA, while the interaction assays demonstrate the structurally untraced SDDDM motif of P2 peptide contributes to the interaction with RTA. This interaction mode of RTA and P protein is in contrast to that with trichosanthin (TCS), Shiga-toxin (Stx) and the active form of maize RIP (MOD), implying the flexibility of the P2 peptide-RIP interaction, for the latter to gain access to ribosome.
Kim, Hyun-Jun; Kwon, Hye-Rim; Bae, Chang-Dae; Park, Joobae; Hong, Kyung U
2010-05-15
During mitosis, regulation of protein structures and functions by phosphorylation plays critical roles in orchestrating a series of complex events essential for the cell division process. Tumor-associated microtubule-associated protein (TMAP), also known as cytoskeleton-associated protein 2 (CKAP2), is a novel player in spindle assembly and chromosome segregation. We have previously reported that TMAP is phosphorylated at multiple residues specifically during mitosis. However, the mechanisms and functional importance of phosphorylation at most of the sites identified are currently unknown. Here, we report that TMAP is a novel substrate of the Aurora B kinase. Ser627 of TMAP was specifically phosphorylated by Aurora B both in vitro and in vivo. Ser627 and neighboring conserved residues were strictly required for efficient phosphorylation of TMAP by Aurora B, as even minor amino acid substitutions of the phosphorylation motif significantly diminished the efficiency of the substrate phosphorylation. Nearly all mutations at the phosphorylation motif had dramatic effects on the subcellular localization of TMAP. Instead of being localized to the chromosome region during late mitosis, the mutants remained associated with microtubules and centrosomes throughout mitosis. However, the changes in the subcellular localization of these mutants could not be completely explained by the phosphorylation status on Ser627. Our findings suggest that the motif surrounding Ser627 (RRSRRL, residues 625-630) is a critical part of a functionally important sequence motif which not only governs the kinase-substrate recognition, but also regulates the subcellular localization of TMAP during mitosis.
A Novel Protein Interaction between Nucleotide Binding Domain of Hsp70 and p53 Motif
Elengoe, Asita; Naser, Mohammed Abu; Hamdan, Salehhuddin
2015-01-01
Currently, protein interaction of Homo sapiens nucleotide binding domain (NBD) of heat shock 70 kDa protein (PDB: 1HJO) with p53 motif remains to be elucidated. The NBD-p53 motif complex enhances the p53 stabilization, thereby increasing the tumor suppression activity in cancer treatment. Therefore, we identified the interaction between NBD and p53 using STRING version 9.1 program. Then, we modeled the three-dimensional structure of p53 motif through homology modeling and determined the binding affinity and stability of NBD-p53 motif complex structure via molecular docking and dynamics (MD) simulation. Human DNA binding domain of p53 motif (SCMGGMNR) retrieved from UniProt (UniProtKB: P04637) was docked with the NBD protein, using the Autodock version 4.2 program. The binding energy and intermolecular energy for the NBD-p53 motif complex were −0.44 Kcal/mol and −9.90 Kcal/mol, respectively. Moreover, RMSD, RMSF, hydrogen bonds, salt bridge, and secondary structure analyses revealed that the NBD protein had a strong bond with p53 motif and the protein-ligand complex was stable. Thus, the current data would be highly encouraging for designing Hsp70 structure based drug in cancer therapy. PMID:26098630
Topological characteristics of helical repeat proteins.
Groves, M R; Barford, D
1999-06-01
The recent elucidation of protein structures based upon repeating amino acid motifs, including the armadillo motif, the HEAT motif and tetratricopeptide repeats, reveals that they belong to the class of helical repeat proteins. These proteins share the common property of being assembled from tandem repeats of an alpha-helical structural unit, creating extended superhelical structures that are ideally suited to create a protein recognition interface.
Havrila, Marek; Réblová, Kamila; Zirbel, Craig L.; Leontis, Neocles B.; Šponer, Jiří
2013-01-01
The Sarcin-Ricin RNA motif (SR motif) is one of the most prominent recurrent RNA building blocks that occurs in many different RNA contexts and folds autonomously, i.e., in a context-independent manner. In this study, we combined bioinformatics analysis with explicit-solvent molecular dynamics (MD) simulations to better understand the relation between the RNA sequence and the evolutionary patterns of SR motif. SHAPE probing experiment was also performed to confirm fidelity of MD simulations. We identified 57 instances of the SR motif in a non-redundant subset of the RNA X-ray structure database and analyzed their basepairing, base-phosphate, and backbone-backbone interactions. We extracted sequences aligned to these instances from large ribosomal RNA alignments to determine frequency of occurrence for different sequence variants. We then used a simple scoring scheme based on isostericity to suggest 10 sequence variants with highly variable expected degree of compatibility with the SR motif 3D structure. We carried out MD simulations of SR motifs with these base substitutions. Non isosteric base substitutions led to unstable structures, but so did isosteric substitutions which were unable to make key base-phosphate interactions. MD technique explains why some potentially isosteric SR motifs are not realized during evolution. We also found that inability to form stable cWW geometry is an important factor in case of the first base pair of the flexible region of the SR motif. Comparison of structural, bioinformatics, SHAPE probing and MD simulation data reveals that explicit solvent MD simulations neatly reflect viability of different sequence variants of the SR motif. Thus, MD simulations can efficiently complement bioinformatics tools in studies of conservation patterns of RNA motifs and provide atomistic insight into the role of their different signature interactions. PMID:24144333
BEAM web server: a tool for structural RNA motif discovery.
Pietrosanto, Marco; Adinolfi, Marta; Casula, Riccardo; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela
2018-03-15
RNA structural motif finding is a relevant problem that becomes computationally hard when working on high-throughput data (e.g. eCLIP, PAR-CLIP), often represented by thousands of RNA molecules. Currently, the BEAM server is the only web tool capable of handling tens of thousands of RNAs as input, with a motif discovery procedure that is limited only by current secondary structure prediction accuracies. The recently developed method BEAM (BEAr Motifs finder) can analyze tens of thousands of RNA molecules and identify RNA secondary structure motifs associated with a measure of their statistical significance. BEAM is extremely fast thanks to the BEAR encoding, which transforms each RNA secondary structure into a string of characters. BEAM also exploits the evolutionary knowledge contained in a substitution matrix of secondary structure elements, extracted from the RFAM database of families of homologous RNAs. The BEAM web server has been designed to streamline data pre-processing by automatically handling folding and encoding of RNA sequences, giving users a choice of folding program. The server provides an intuitive and informative results page with the list of secondary structure motifs identified, the logo of each motif, its significance, graphic representation and information about its position in the RNA molecules sharing it. The web server is freely available at http://beam.uniroma2.it/ and is implemented in NodeJS and Python with all major browsers supported.
Specificity determinants for the abscisic acid response element.
Sarkar, Aditya Kumar; Lahiri, Ansuman
2013-01-01
Abscisic acid (ABA) response elements (ABREs) are a group of cis-acting DNA elements that have been identified from promoter analysis of many ABA-regulated genes in plants. We are interested in understanding the mechanism of binding specificity between ABREs and a class of bZIP transcription factors known as ABRE binding factors (ABFs). In this work, we have modeled the homodimeric structure of the bZIP domain of ABRE binding factor 1 from Arabidopsis thaliana (AtABF1) and studied its interaction with ACGT core motif-containing ABRE sequences. We have also examined the variation in the stability of the protein-DNA complex upon mutating ABRE sequences using the protein design algorithm FoldX. The high throughput free energy calculations successfully predicted the ability of ABF1 to bind to alternative core motifs like GCGT or AAGT and also rationalized the role of the flanking sequences in determining the specificity of the protein-DNA interaction.
Rigden, Daniel J.; Woodhead, Duncan D.; Wong, Prudence W. H.; Galperin, Michael Y.
2011-01-01
Binding of calcium ions (Ca2+) to proteins can have profound effects on their structure and function. Common roles of calcium binding include structure stabilization and regulation of activity. It is known that diverse families – EF-hands being one of at least twelve – use a Dx[DN]xDG linear motif to bind calcium in near-identical fashion. Here, four novel structural contexts for the motif are described. Existing experimental data for one of them, a thermophilic archaeal subtilisin, demonstrate for the first time a role for Dx[DN]xDG-bound calcium in protein folding. An integrin-like embedding of the motif in the blade of a β-propeller fold – here named the calcium blade – is discovered in structures of bacterial and fungal proteins. Furthermore, sensitive database searches suggest a common origin for the calcium blade in β-propeller structures of different sizes and a pan-kingdom distribution of these proteins. Factors favouring the multiple convergent evolution of the motif appear to include its general Asp-richness, the regular spacing of the Asp residues and the fact that change of Asp into Gly and vice versa can occur through a single nucleotide change. Among the known structural contexts for the Dx[DN]xDG motif, only the calcium blade and the EF-hand are currently found intracellularly in large numbers, perhaps because the higher extracellular concentration of Ca2+ allows for easier fixing of newly evolved motifs that have acquired useful functions. The analysis presented here will inform ongoing efforts toward prediction of similar calcium-binding motifs from sequence information alone. PMID:21720552
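As an illustration of how a linear motif such as Dx[DN]xDG can be located in sequence data, the following minimal Python sketch translates the pattern into a regular expression. The function name and the toy sequence are hypothetical and are not taken from the study. A sequence match alone does not establish calcium binding, which depends on the structural context.

```python
import re

# Dx[DN]xDG written as a regular expression:
# 'x' (any residue) becomes '.', and '[DN]' means Asp or Asn.
# The lookahead wrapper lets overlapping occurrences be reported too.
DXDNXDG = re.compile(r"(?=(D.[DN].DG))")


def find_calcium_motifs(sequence):
    """Return (1-based start, matched segment) for every Dx[DN]xDG hit."""
    return [(m.start() + 1, m.group(1))
            for m in DXDNXDG.finditer(sequence.upper())]


# Hypothetical toy sequence, made up for this example only.
toy = "MKTADKDGDGTISAAELDADNDGRL"
for start, segment in find_calcium_motifs(toy):
    print(start, segment)
```

Reporting 1-based positions follows the usual residue-numbering convention; candidate hits would still need to be checked against a structure or a curated annotation.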
Gonadotropin-Releasing Hormone (GnRH) Receptor Structure and GnRH Binding
Flanagan, Colleen A.; Manilall, Ashmeetha
2017-01-01
Gonadotropin-releasing hormone (GnRH) regulates reproduction. The human GnRH receptor lacks a cytoplasmic carboxy-terminal tail but has amino acid sequence motifs characteristic of rhodopsin-like, class A, G protein-coupled receptors (GPCRs). This review will consider how recent descriptions of X-ray crystallographic structures of GPCRs in inactive and active conformations may contribute to understanding GnRH receptor structure, mechanism of activation and ligand binding. The structures confirmed that ligands bind to variable extracellular surfaces, whereas the seven membrane-spanning α-helices convey the activation signal to the cytoplasmic receptor surface, which binds and activates heterotrimeric G proteins. Forty non-covalent interactions that bridge topologically equivalent residues in different transmembrane (TM) helices are conserved in class A GPCR structures, regardless of activation state. Conformation-independent interhelical contacts account for a conserved receptor protein structure and their importance in the GnRH receptor structure is supported by decreased expression of receptors with mutations of residues in the network. Many of the GnRH receptor mutations associated with congenital hypogonadotropic hypogonadism, including the Glu2.53(90) Lys mutation, involve amino acids that constitute the conserved network. Half of the ~250 intramolecular interactions in GPCRs differ between inactive and active structures. Conformation-specific interhelical contacts depend on amino acids changing partners during activation. Conserved inactive conformation-specific contacts prevent receptor activation by stabilizing proximity of TM helices 3 and 6 and a closed G protein-binding site. 
Mutations of GnRH receptor residues involved in these interactions, such as Arg3.50(139) of the DRY/S motif or Tyr7.53(323) of the N/DPxxY motif, increase or decrease receptor expression and efficiency of receptor coupling to G protein signaling, consistent with the native residues stabilizing the inactive GnRH receptor structure. Active conformation-specific interhelical contacts stabilize an open G protein-binding site. Progress in defining the GnRH-binding site has recently slowed, with evidence that Tyr6.58(290) contacts Tyr5 of GnRH, whereas other residues affect recognition of Trp3 and Gly10NH2. The surprisingly consistent observations that GnRH receptor mutations that disrupt GnRH binding have less effect on “conformationally constrained” GnRH peptides may now be explained by crystal structures of agonist-bound peptide receptors. Analysis of GPCR structures provides insight into GnRH receptor function. PMID:29123501
Transient α-helices in the disordered RPEL motifs of the serum response factor coactivator MKL1
NASA Astrophysics Data System (ADS)
Mizuguchi, Mineyuki; Fuju, Takahiro; Obita, Takayuki; Ishikawa, Mitsuru; Tsuda, Masaaki; Tabuchi, Akiko
2014-06-01
The megakaryoblastic leukemia 1 (MKL1) protein functions as a transcriptional coactivator of the serum response factor. MKL1 has three RPEL motifs (RPEL1, RPEL2, and RPEL3) in its N-terminal region. MKL1 binds to monomeric G-actin through RPEL motifs, and the dissociation of MKL1 from G-actin promotes the translocation of MKL1 to the nucleus. Although structural data are available for RPEL motifs of MKL1 in complex with G-actin, the structural characteristics of RPEL motifs in the free state have been poorly defined. Here we characterized the structures of free RPEL motifs using NMR and CD spectroscopy. NMR and CD measurements showed that free RPEL motifs are largely unstructured in solution. However, NMR analysis identified transient α-helices in the regions where helices α1 and α2 are induced upon binding to G-actin. Proline mutagenesis showed that the transient α-helices are locally formed without helix-helix interactions. The helix content is higher in the order of RPEL1, RPEL2, and RPEL3. The amount of preformed structure may correlate with the binding affinity between the intrinsically disordered protein and its target molecule.
New structures of Fe3S for rare-earth-free permanent magnets
Yu, Shu; Zhao, Xin; Wu, Shunqing; ...
2018-02-25
We applied adaptive genetic algorithm (AGA) to search for low-energy crystal structures of Fe3S. A number of structures with energies lower than that of the experimentally reported Pnma and I-4 structures have been obtained from our AGA searches. These low-energy structures can be classified as layer-motif and column-motif structures. In the column-motif structures, Fe atoms self-assemble into rods with bcc type of underlying lattice, which are separated by the holes terminated by S atoms. In the layer-motif structures, the bulk Fe is broken into slabs of several layers passivated by S atoms. Magnetic properties calculations showed that the column-motif structures exhibit reasonably high uniaxial magnetic anisotropy. In addition, we examined the effect of Co doping to Fe3S and found magnetic anisotropy can be enhanced through Co doping.
Structural basis for catalytic activation by the human ZNF451 SUMO E3 ligase
Cappadocia, Laurent; Pichler, Andrea; Lima, Christopher D.
2015-11-02
E3 protein ligases enhance transfer of ubiquitin-like (Ubl) proteins from E2 conjugating enzymes to substrates by stabilizing the thioester-charged E2~Ubl in a closed configuration optimally aligned for nucleophilic attack. In this paper, we report biochemical and structural data that define the N-terminal domain of the Homo sapiens ZNF451 as the catalytic module for SUMO E3 ligase activity. The ZNF451 catalytic module contains tandem SUMO-interaction motifs (SIMs) bridged by a Pro-Leu-Arg-Pro (PLRP) motif. The first SIM and PLRP motif engage thioester-charged E2~SUMO while the next SIM binds a second molecule of SUMO bound to the back side of E2. We show that ZNF451 is SUMO2 specific and that SUMO modification of ZNF451 may contribute to activity by providing a second molecule of SUMO that interacts with E2. Finally, our results are consistent with ZNF451 functioning as a bona fide SUMO E3 ligase.
Quantifying domain-ligand affinities and specificities by high-throughput holdup assay
Vincentelli, Renaud; Luck, Katja; Poirson, Juline; Polanowska, Jolanta; Abdat, Julie; Blémont, Marilyne; Turchetto, Jeremy; Iv, François; Ricquier, Kevin; Straub, Marie-Laure; Forster, Anne; Cassonnet, Patricia; Borg, Jean-Paul; Jacob, Yves; Masson, Murielle; Nominé, Yves; Reboul, Jérôme; Wolff, Nicolas; Charbonnier, Sebastian; Travé, Gilles
2015-01-01
Many protein interactions are mediated by small linear motifs interacting specifically with defined families of globular domains. Quantifying the specificity of a motif requires measuring and comparing its binding affinities to all its putative target domains. To this aim, we developed the high-throughput holdup assay, a chromatographic approach that can measure up to a thousand domain-motif equilibrium binding affinities per day. Extracts of overexpressed domains are incubated with peptide-coated resins and subjected to filtration. Binding affinities are deduced from microfluidic capillary electrophoresis of flow-throughs. After benchmarking the approach on 210 PDZ-peptide pairs with known affinities, we determined the affinities of two viral PDZ-binding motifs derived from Human Papillomavirus E6 oncoproteins for 209 PDZ domains covering 79% of the human PDZome. We obtained exquisite sequence-dependent binding profiles, describing quantitatively the PDZome recognition specificity of each motif. This approach, applicable to many categories of domain-ligand interactions, has a wide potential for quantifying the specificities of interactomes. PMID:26053890
Papanikolopoulou, Katerina; van Raaij, Mark J; Mitraki, Anna
2008-01-01
Stable, artificial fibrous proteins that can be functionalized open new avenues in fields such as bionanomaterials design and fiber engineering. An important source of inspiration for the creation of such proteins are natural fibrous proteins such as collagen, elastin, insect silks, and fibers from phages and viruses. The fibrous parts of this last class of proteins usually adopt trimeric, beta-stranded structural folds and are appended to globular, receptor-binding domains. It has been recently shown that the globular domains are essential for correct folding and trimerization and can be successfully substituted by a very small (27-amino acid) trimerization motif from phage T4 fibritin. The hybrid proteins are correctly folded nanorods that can withstand extreme conditions. When the fibrous part derives from the adenovirus fiber shaft, different tissue-targeting specificities can be engineered into the hybrid proteins, which therefore can be used as gene therapy vectors. The integration of such stable nanorods in devices is also a big challenge in the field of biomechanical design. The fibritin foldon domain is a versatile trimerization motif and can be combined with a variety of fibrous motifs, such as coiled-coil, collagenous, and triple beta-stranded motifs, provided the appropriate linkers are used. The combination of different motifs within the same fibrous molecule to create stable rods with multiple functions can even be envisioned. We provide a comprehensive overview of the experimental procedures used for designing, creating, and characterizing hybrid fibrous nanorods using the fibritin trimerization motif.
Ahnert, S E; Fink, T M A
2016-07-01
Network motifs have been studied extensively over the past decade, and certain motifs, such as the feed-forward loop, play an important role in regulatory networks. Recent studies have used Boolean network motifs to explore the link between form and function in gene regulatory networks and have found that the structure of a motif does not strongly determine its function, if this is defined in terms of the gene expression patterns the motif can produce. Here, we offer a different, higher-level definition of the 'function' of a motif, in terms of two fundamental properties of its dynamical state space as a Boolean network. One is the basin entropy, which is a complexity measure of the dynamics of Boolean networks. The other is the diversity of cyclic attractor lengths that a given motif can produce. Using these two measures, we examine all 104 topologically distinct three-node motifs and show that the structural properties of a motif, such as the presence of feedback loops and feed-forward loops, predict fundamental characteristics of its dynamical state space, which in turn determine aspects of its functional versatility. We also show that these higher-level properties have a direct bearing on real regulatory networks, as both basin entropy and cycle length diversity show a close correspondence with the prevalence, in neural and genetic regulatory networks, of the 13 connected motifs without self-interactions that have been studied extensively in the literature.
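The two state-space measures named in this abstract can be sketched for a small Boolean network. This is an illustration only: the entropy formula (Shannon entropy of the relative basin sizes, in bits), the synchronous update scheme, and the example rule set are assumptions, not details taken from the abstract.

```python
from itertools import product
from math import log2


def attractors_and_basins(update, n):
    """Map each attractor (a frozenset of states) to the size of its basin.

    `update` takes a tuple of n bits and returns the next state; all nodes
    are updated synchronously (an assumption of this sketch).
    """
    attractor_of = {}  # state -> attractor eventually reached from it
    basin_size = {}
    for start in product((0, 1), repeat=n):
        path, position = [], {}
        state = start
        # Walk forward until we loop back onto this path or hit known ground.
        while state not in position and state not in attractor_of:
            position[state] = len(path)
            path.append(state)
            state = update(state)
        if state in attractor_of:
            attractor = attractor_of[state]
        else:
            attractor = frozenset(path[position[state]:])
        for visited in path:
            attractor_of[visited] = attractor
        basin_size[attractor] = basin_size.get(attractor, 0) + len(path)
    return basin_size


def basin_entropy(basin_size):
    """Shannon entropy (bits) of the distribution of basin sizes."""
    total = sum(basin_size.values())
    return -sum((b / total) * log2(b / total) for b in basin_size.values())


def negative_feedback_ring(state):
    """Hypothetical three-node motif: x <- NOT z, y <- x, z <- y."""
    x, y, z = state
    return (1 - z, x, y)


basins = attractors_and_basins(negative_feedback_ring, 3)
print("cycle lengths:", sorted(len(a) for a in basins))
print("basin entropy (bits):", round(basin_entropy(basins), 4))
```

Enumerating all 2^n states is only practical for small motifs such as the three-node case discussed here; cycle-length diversity can be read off as the set of distinct attractor lengths.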
Gibbs motif sampling: detection of bacterial outer membrane protein repeats.
Neuwald, A. F.; Liu, J. S.; Lawrence, C. E.
1995-01-01
The detection and alignment of locally conserved regions (motifs) in multiple sequences can provide insight into protein structure, function, and evolution. A new Gibbs sampling algorithm is described that detects motif-encoding regions in sequences and optimally partitions them into distinct motif models; this is illustrated using a set of immunoglobulin fold proteins. When applied to sequences sharing a single motif, the sampler can be used to classify motif regions into related submodels, as is illustrated using helix-turn-helix DNA-binding proteins. Other statistically based procedures are described for searching a database for sequences matching motifs found by the sampler. When applied to a set of 32 very distantly related bacterial integral outer membrane proteins, the sampler revealed that they share a subtle, repetitive motif. Although BLAST (Altschul SF et al., 1990, J Mol Biol 215:403-410) fails to detect significant pairwise similarity between any of the sequences, the repeats present in these outer membrane proteins, taken as a whole, are highly significant (based on a generally applicable statistical test for motifs described here). Analysis of bacterial porins with known trimeric beta-barrel structure and related proteins reveals a similar repetitive motif corresponding to alternating membrane-spanning beta-strands. These beta-strands occur on the membrane interface (as opposed to the trimeric interface) of the beta-barrel. The broad conservation and structural location of these repeats suggests that they play important functional roles. PMID:8520488
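The core idea of Gibbs sampling for motif detection can be sketched with a basic site sampler that assumes exactly one motif occurrence per sequence and a uniform background. This is a simplified illustration, not the algorithm described in the paper, which additionally partitions motif regions into distinct models. All names and default values here are hypothetical.

```python
import random
from collections import Counter


def gibbs_site_sampler(seqs, width, iters=200, pseudo=1.0, seed=0):
    """Return one sampled motif start (0-based) per sequence.

    Simplifying assumptions: one site per sequence, fixed motif width,
    uniform background, and a flat pseudocount on every residue.
    """
    rng = random.Random(seed)
    alphabet_size = len(set("".join(seqs)))
    starts = [rng.randrange(len(s) - width + 1) for s in seqs]
    for _ in range(iters):
        for i, held_out in enumerate(seqs):
            # Column-wise residue counts from every site except the held-out one.
            columns = [Counter() for _ in range(width)]
            for j, (seq, start) in enumerate(zip(seqs, starts)):
                if j != i:
                    for k in range(width):
                        columns[k][seq[start + k]] += 1
            denom = (len(seqs) - 1) + pseudo * alphabet_size
            # Weight each candidate window by its probability under the model.
            weights = []
            for start in range(len(held_out) - width + 1):
                w = 1.0
                for k in range(width):
                    w *= (columns[k][held_out[start + k]] + pseudo) / denom
                weights.append(w)
            # Sample the new site in proportion to the weights.
            starts[i] = rng.choices(range(len(weights)), weights=weights)[0]
    return starts


# Hypothetical toy input with the made-up word "WKDGT" planted in each sequence.
toy = ["AAGLWKDGTPLS", "WKDGTSSAGLLA", "PLSAAWKDGTGA", "GAALLSWKDGTP"]
print(gibbs_site_sampler(toy, width=5))
```

Because the sampler is stochastic, different seeds can settle on different alignments; in practice several restarts are compared using a score such as the information content of the aligned sites.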
The Methionine-aromatic Motif Plays a Unique Role in Stabilizing Protein Structure
Valley, Christopher C.; Cembran, Alessandro; Perlmutter, Jason D.; Lewis, Andrew K.; Labello, Nicholas P.; Gao, Jiali; Sachs, Jonathan N.
2012-01-01
Of the 20 amino acids, the precise function of methionine (Met) remains among the least well understood. To establish a determining characteristic of methionine that fundamentally differentiates it from purely hydrophobic residues, we have used in vitro cellular experiments, molecular simulations, quantum calculations, and a bioinformatics screen of the Protein Data Bank. We show that approximately one-third of all known protein structures contain an energetically stabilizing Met-aromatic motif and, remarkably, that greater than 10,000 structures contain this motif more than 10 times. Critically, we show that as compared with a purely hydrophobic interaction, the Met-aromatic motif yields an additional stabilization of 1–1.5 kcal/mol. To highlight its importance and to dissect the energetic underpinnings of this motif, we have studied two clinically relevant TNF ligand-receptor complexes, namely TRAIL-DR5 and LTα-TNFR1. In both cases, we show that the motif is necessary for high affinity ligand binding as well as function. Additionally, we highlight previously overlooked instances of the motif in several disease-related Met mutations. Our results strongly suggest that the Met-aromatic motif should be exploited in the rational design of therapeutics targeting a range of proteins. PMID:22859300
Chang, Chungyu; Amer, Brendan R; Osipiuk, Jerzy; McConnell, Scott A; Huang, I-Hsiu; Hsieh, Van; Fu, Janine; Nguyen, Hong H; Muroski, John; Flores, Erika; Ogorzalek Loo, Rachel R; Loo, Joseph A; Putkey, John A; Joachimiak, Andrzej; Das, Asis; Clubb, Robert T; Ton-That, Hung
2018-06-12
Covalently cross-linked pilus polymers displayed on the cell surface of Gram-positive bacteria are assembled by class C sortase enzymes. These pilus-specific transpeptidases located on the bacterial membrane catalyze a two-step protein ligation reaction, first cleaving the LPXTG motif of one pilin protomer to form an acyl-enzyme intermediate and then joining the terminal Thr to the nucleophilic Lys residue residing within the pilin motif of another pilin protomer. To date, the determinants of class C enzymes that uniquely enable them to construct pili remain unknown. Here, informed by high-resolution crystal structures of corynebacterial pilus-specific sortase (SrtA) and utilizing a structural variant of the enzyme (SrtA2M), whose catalytic pocket has been unmasked by activating mutations, we successfully reconstituted in vitro polymerization of the cognate major pilin (SpaA). Mass spectrometry, electron microscopy, and biochemical experiments authenticated that SrtA2M synthesizes pilus fibers with correct Lys-Thr isopeptide bonds linking individual pilins via a thioacyl intermediate. Structural modeling of the SpaA-SrtA-SpaA polymerization intermediate depicts SrtA2M sandwiched between the N- and C-terminal domains of SpaA harboring the reactive pilin and LPXTG motifs, respectively. Remarkably, the model uncovered a conserved TP(Y/L)XIN(S/T)H signature sequence following the catalytic Cys, in which the alanine substitutions abrogated cross-linking activity but not cleavage of LPXTG. These insights and our evidence that SrtA2M can terminate pilus polymerization by joining the terminal pilin SpaB to SpaA and catalyze ligation of isolated SpaA domains in vitro provide a facile and versatile platform for protein engineering and bio-conjugation that has major implications for biotechnology.
Alvadia, Carolina M; Sommer, Theis; Bjerregaard-Andersen, Kaare; Damkier, Helle Hasager; Montrasio, Michele; Aalkjaer, Christian; Morth, J Preben
2017-09-21
The sodium-driven chloride/bicarbonate exchanger (NDCBE) is essential for maintaining homeostatic pH in neurons. The crystal structure at 2.8 Å resolution of the regulatory N-terminal domain of human NDCBE represents the first crystal structure of an electroneutral sodium-bicarbonate cotransporter. The crystal structure forms an equivalent dimeric interface as observed for the cytoplasmic domain of Band 3, and thus establishes that the consensus motif VTVLP is the key minimal dimerization motif. The VTVLP motif is highly conserved and likely to be the physiologically relevant interface for all other members of the SLC4 family. A novel conserved Zn2+-binding motif present in the N-terminal domain of NDCBE is identified and characterized in vitro. Cellular studies confirm the Zn2+-dependent transport of two electroneutral bicarbonate transporters, NCBE and NBCn1. The Zn2+ site is mapped to a cluster of histidines close to the conserved ETARWLKFEE motif and likely plays a role in the regulation of this important motif. The combined structural and bioinformatics analysis provides a model that predicts with additional confidence the physiologically relevant interface between the cytoplasmic domain and the transmembrane domain.
Verma, Anjali; Rajagopalan, Pavithra; Lotke, Rishikesh; Varghese, Rebu; Selvam, Deepak; Kundu, Tapas K.
2016-01-01
ABSTRACT Of the various genetic subtypes of human immunodeficiency virus types 1 and 2 (HIV-1 and HIV-2) and simian immunodeficiency virus (SIV), only in subtype C of HIV-1 is a genetically variant NF-κB binding site found at the core of the viral promoter in association with a subtype-specific Sp1III motif. How the subtype-associated variations in the core transcription factor binding sites (TFBS) influence gene expression from the viral promoter has not been examined previously. Using panels of infectious viral molecular clones, we demonstrate that subtype-specific NF-κB and Sp1III motifs have evolved for optimal gene expression, and neither of the motifs can be replaced by a corresponding TFBS variant. The variant NF-κB motif binds NF-κB with an affinity 2-fold higher than that of the generic NF-κB site. Importantly, in the context of an infectious virus, the subtype-specific Sp1III motif demonstrates a profound loss of function in association with the generic NF-κB motif. An additional substitution of the Sp1III motif fully restores viral replication, suggesting that the subtype C-specific Sp1III has evolved to function with the variant, but not generic, NF-κB motif. A change of only two base pairs in the central NF-κB motif completely suppresses viral transcription from the provirus and converts the promoter into heterochromatin refractory to tumor necrosis factor alpha (TNF-α) induction. The present work represents the first demonstration of functional incompatibility between an otherwise functional NF-κB motif and a unique Sp1 site in the context of an HIV-1 promoter. Our work provides important leads as to the evolution of the HIV-1 subtype C viral promoter with relevance for gene expression regulation and viral latency. IMPORTANCE Subtype-specific genetic variations provide a powerful tool to examine how these variations offer a replication advantage to specific viral subtypes, if any. 
Only in subtype C of HIV-1 are two genetically distinct transcription factor binding sites positioned at the most critical location of the viral promoter. Since a single promoter regulates viral gene expression, the promoter variations can play a critical role in determining the replication fitness of the viral strains. Our work for the first time provides a scientific explanation for the presence of a unique NF-κB binding motif in subtype C, a major HIV-1 genetic family responsible for half of the global HIV-1 infections. The results offer compelling evidence that the subtype C viral promoter not only is stronger but also is endowed with a qualitative gain-of-function advantage. The genetically variant NF-κB and the Sp1III motifs may respond differently to specific cell signal pathways, and these mechanisms must be examined. PMID:27194770
An experimental test of a fundamental food web motif.
Rip, Jason M K; McCann, Kevin S; Lynn, Denis H; Fawcett, Sonia
2010-06-07
Large-scale changes to the world's ecosystem are resulting in the deterioration of biostructure, the complex web of species interactions that make up ecological communities. A difficult, yet crucial task is to identify food web structures, or food web motifs, that are the building blocks of this baroque network of interactions. Once identified, these food web motifs can then be examined through experiments and theory to provide mechanistic explanations for how structure governs ecosystem stability. Here, we synthesize recent ecological research to show that generalist consumers coupling resources with different interaction strengths is one such motif. This motif amazingly occurs across an enormous range of spatial scales, and so acts to distribute coupled weak and strong interactions throughout food webs. We then perform an experiment that illustrates the importance of this motif to ecological stability. We find that weak interactions coupled to strong interactions by generalist consumers dampen strong interaction strengths and increase community stability. This study takes a critical step by isolating a common food web motif and, through clear experimental manipulation, identifying the fundamental stabilizing consequences of this structure for ecological communities.
Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics.
Li, Sanshu; Breaker, Ronald R
2017-10-13
With the development of rapid and inexpensive DNA sequencing, the genome sequences of more than 100 fungal species have been made available. This dataset provides an excellent resource for comparative genomics analyses, which can be used to discover genetic elements, including noncoding RNAs (ncRNAs). Bioinformatics tools similar to those used to uncover novel ncRNAs in bacteria, likewise, should be useful for searching fungal genomic sequences, and the relative ease of genetic experiments with some model fungal species could facilitate experimental validation studies. We have adapted a bioinformatics pipeline for discovering bacterial ncRNAs to systematically analyze many fungal genomes. This comparative genomics pipeline integrates information on conserved RNA sequence and structural features with alternative splicing information to reveal fungal RNA motifs that are candidate regulatory domains, or that might have other possible functions. A total of 15 prominent classes of structured ncRNA candidates were identified, including variant HDV self-cleaving ribozyme representatives, atypical snoRNA candidates, and possible structured antisense RNA motifs. Candidate regulatory motifs were also found associated with genes for ribosomal proteins, S-adenosylmethionine decarboxylase (SDC), amidase, and HexA protein involved in Woronin body formation. We experimentally confirm that the variant HDV ribozymes undergo rapid self-cleavage, and we demonstrate that the SDC RNA motif reduces the expression of SAM decarboxylase by translational repression. Furthermore, we provide evidence that several other motifs discovered in this study are likely to be functional ncRNA elements. Systematic screening of fungal genomes using a computational discovery pipeline has revealed the existence of a variety of novel structured ncRNAs. 
Genome contexts and similarities to known ncRNA motifs provide strong evidence for the biological and biochemical functions of some newly found ncRNA motifs. Although initial examinations of several motifs provide evidence for their likely functions, other motifs will require more in-depth analysis to reveal their functions.
Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities
Narasimhan, Kamesh; Lambert, Samuel A; Yang, Ally WH; Riddell, Jeremy; Mnaimneh, Sanie; Zheng, Hong; Albu, Mihai; Najafabadi, Hamed S; Reece-Hoyes, John S; Fuxman Bass, Juan I; Walhout, Albertha JM; Weirauch, Matthew T; Hughes, Timothy R
2015-01-01
Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs. DOI: http://dx.doi.org/10.7554/eLife.06967.001 PMID:25905672
Wu, Tzu-Hui; Chen, Chun-Chi; Cheng, Ya-Shan; Ko, Tzu-Ping; Lin, Cheng-Yen; Lai, Hui-Lin; Huang, Ting-Yung; Liu, Je-Ruei; Guo, Rey-Ting
2014-04-10
Escherichia coli phytase (EcAppA), which hydrolyzes phytate, has been widely applied in the feed industry, but the need to improve the enzyme activity and thermostability remains. Here, we conduct rational design with two strategies to enhance the EcAppA performance. First, residues near the substrate binding pocket of EcAppA were modified according to the consensus sequence of two highly active Citrobacter phytases. One out of the eleven mutants, V89T, exhibited 17.5% increase in catalytic activity, which might be a result of stabilized protein folding. Second, the EcAppA glycosylation pattern was modified in accordance with the Citrobacter phytases. An N-glycosylation motif near the substrate binding site was disrupted to remove spatial hindrance for phytate entry and product departure. The de-glycosylated mutants showed 9.6% increase in specific activity. On the other hand, the EcAppA mutants that adopt N-glycosylation motifs from CbAppA showed improved thermostability: three mutants, each carrying a single N-glycosylation motif, exhibited 5.6-9.5% residual activity after treatment at 80°C (1.8% for wild type). Furthermore, the mutant carrying all three glycosylation motifs exhibited 27% residual activity. In conclusion, a successful rational design was performed to obtain several useful EcAppA mutants with better properties for further applications. Copyright © 2014 Elsevier B.V. All rights reserved.
Mining for class-specific motifs in protein sequence classification
2013-01-01
Background In protein sequence classification, identification of the sequence motifs or n-grams that can precisely discriminate between classes is a more interesting scientific question than the classification itself. A number of classification methods aim at accurate classification but fail to explain which sequence features indeed contribute to the accuracy. We hypothesize that sequences in lower denominations (n-grams) can be used to explore the sequence landscape and to identify class-specific motifs that discriminate between classes during classification. Discriminative n-grams are short peptide sequences that are highly frequent in one class but are either minimally present or absent in other classes. In this study, we present a new substitution-based scoring function for identifying discriminative n-grams that are highly specific to a class. Results We present a scoring function based on discriminative n-grams that can effectively discriminate between classes. The scoring function, initially, harvests the entire set of 4- to 8-grams from the protein sequences of different classes in the dataset. Similar n-grams of the same size are combined to form new n-grams, where the similarity is defined by positive amino acid substitution scores in the BLOSUM62 matrix. Substitution has resulted in a large increase in the number of discriminatory n-grams harvested. Due to the unbalanced nature of the dataset, the frequencies of the n-grams are normalized using a dampening factor, which gives more weightage to the n-grams that appear in fewer classes and vice-versa. After the n-grams are normalized, the scoring function identifies discriminative 4- to 8-grams for each class that are frequent enough to be above a selection threshold. By mapping these discriminative n-grams back to the protein sequences, we obtained contiguous n-grams that represent short class-specific motifs in protein sequences. 
Our method fared well compared to an existing motif finding method known as Wordspy. We have validated our enriched set of class-specific motifs against the functionally important motifs obtained from the NLSdb, Prosite and ELM databases. We demonstrate that this method is very generic and thus can be widely applied to detect class-specific motifs in many protein sequence classification tasks. Conclusion The proposed scoring function and methodology are able to identify class-specific motifs using discriminative n-grams derived from the protein sequences. The implementation of amino acid substitution scores for similarity detection, and the dampening factor to normalize the unbalanced datasets, have a significant effect on the performance of the scoring function. Our multipronged validation tests demonstrate that this method can detect class-specific motifs from a wide variety of protein sequence classes with a potential application to detecting proteome-specific motifs of different organisms. PMID:23496846
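The harvesting and dampening steps described in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' scoring function: the function names and toy sequences are hypothetical, the dampening factor is assumed to take an inverse-class-frequency form (log of total classes over classes containing the n-gram), and the BLOSUM62-based merging of similar n-grams is left out.

```python
import math
from collections import Counter

def harvest_ngrams(seq, n_min=4, n_max=8):
    """Count every contiguous n-gram of length n_min..n_max in one sequence."""
    counts = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(seq) - n + 1):
            counts[seq[i:i + n]] += 1
    return counts

def discriminative_scores(classes):
    """classes: dict mapping class name -> list of protein sequences.

    Returns dict mapping class name -> {ngram: score}, where the score is
    the n-gram count in that class times an assumed dampening factor
    log(number of classes / number of classes containing the n-gram).
    """
    per_class = {}
    for name, seqs in classes.items():
        total = Counter()
        for s in seqs:
            total.update(harvest_ngrams(s))
        per_class[name] = total

    # In how many classes does each n-gram occur at least once?
    class_count = Counter()
    for total in per_class.values():
        class_count.update(total.keys())

    n_classes = len(per_class)
    return {
        name: {g: c * math.log(n_classes / class_count[g]) for g, c in total.items()}
        for name, total in per_class.items()
    }

# Toy data (hypothetical): "KKKR" occurs only in one class, "GGGG" in both.
toy = {
    "nuclear": ["PKKKRKVGGGG", "GPKKKRKVG"],
    "other": ["AAAAGGGGLL", "LLGGAAAAPK"],
}
scores = discriminative_scores(toy)
```

An n-gram shared by every class receives a score of zero under this assumed weighting, so only class-restricted n-grams can pass a selection threshold.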
Paranemic Crossover DNA: There and Back Again.
Wang, Xing; Chandrasekaran, Arun Richard; Shen, Zhiyong; Ohayon, Yoel P; Wang, Tong; Kizer, Megan E; Sha, Ruojie; Mao, Chengde; Yan, Hao; Zhang, Xiaoping; Liao, Shiping; Ding, Baoquan; Chakraborty, Banani; Jonoska, Natasha; Niu, Dong; Gu, Hongzhou; Chao, Jie; Gao, Xiang; Li, Yuhang; Ciengshin, Tanashaya; Seeman, Nadrian C
2018-06-18
Over the past 35 years, DNA has been used to produce various nanometer-scale constructs, nanomechanical devices, and walkers. Construction of complex DNA nanostructures relies on the creation of rigid DNA motifs. Paranemic crossover (PX) DNA is one such motif that has played many roles in DNA nanotechnology. Specifically, PX cohesion has been used to connect topologically closed molecules, to assemble a three-dimensional object, and to create two-dimensional DNA crystals. Additionally, a sequence-dependent nanodevice based on conformational change between PX and its topoisomer, JX2, has been used in robust nanoscale assembly lines, as a key component in a DNA transducer, and to dictate polymer assembly. Furthermore, the PX motif has recently found a new role directly in basic biology, by possibly serving as the molecular structure for double-stranded DNA homology recognition, a prominent feature of molecular biology and essential for many crucial biological processes. This review discusses the many attributes and usages of PX-DNA (its design, characteristics, applications, and potential biological relevance) and aims to accelerate the understanding of the PX-DNA motif in its many roles and manifestations.
Bickford, Justin S; Nick, Harry S
2013-12-01
Isoprenoid lipid carriers are essential in protein glycosylation and bacterial cell envelope biosynthesis. The enzymes involved in their metabolism (synthases, kinases and phosphatases) are therefore critical to cell viability. In this review, we focus on two broad groups of isoprenoid pyrophosphate phosphatases. One group, containing phosphatidic acid phosphatase motifs, includes the eukaryotic dolichyl pyrophosphate phosphatases and proposed recycling bacterial undecaprenol pyrophosphate phosphatases, PgpB, YbjB and YeiU/LpxT. The second group comprises the bacterial undecaprenol pyrophosphate phosphatase, BacA/UppP, responsible for initial formation of undecaprenyl phosphate, which we predict contains a tyrosine phosphate phosphatase motif resembling that of the tumour suppressor, phosphatase and tensin homologue (PTEN). Based on protein sequence alignments across species and 2D structure predictions, we propose catalytic and lipid recognition motifs unique to BacA/UppP enzymes. The verification of our proposed active-site residues would provide new strategies for the development of substrate-specific inhibitors which mimic both the lipid and pyrophosphate moieties, leading to the development of novel antimicrobial agents.
RNAfbinv: an interactive Java application for fragment-based design of RNA sequences.
Weinbrand, Lina; Avihoo, Assaf; Barash, Danny
2013-11-15
In RNA design problems, it is plausible to assume that the user would be interested in preserving a particular RNA secondary structure motif, or fragment, for biological reasons. The preservation could be in structure or sequence, or both. Thus, the inverse RNA folding problem could benefit from considering fragment constraints. We have developed a new interactive Java application called RNA fragment-based inverse that allows users to insert an RNA secondary structure in dot-bracket notation. It then performs sequence design that conforms to the shape of the input secondary structure, the specified thermodynamic stability, the specified mutational robustness and the user-selected fragment after shape decomposition. In this shape-based design approach, specific RNA structural motifs with known biological functions are strictly enforced, while others can possess more flexibility in their structure in favor of preserving physical attributes and additional constraints. RNAfbinv is freely available for download on the web at http://www.cs.bgu.ac.il/~RNAexinv/RNAfbinv. The site contains a help file with an explanation regarding the exact use.
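Since the abstract above relies on dot-bracket notation as the input format, a short sketch of how such a string maps to base pairs may help. This is not RNAfbinv code: the function names are hypothetical, and treating A-U, G-C and G-U as the allowed pairs is an assumption made for the illustration.

```python
def parse_dot_bracket(structure):
    """Return the sorted list of 0-based (i, j) base pairs in a dot-bracket string."""
    stack, pairs = [], []
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError(f"unmatched ')' at position {i}")
            pairs.append((stack.pop(), i))
        elif ch != ".":
            raise ValueError(f"unexpected character {ch!r} at position {i}")
    if stack:
        raise ValueError(f"unmatched '(' at position {stack[-1]}")
    return sorted(pairs)

# Pairs assumed to be allowed here: Watson-Crick plus the G-U wobble.
ALLOWED_PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}

def is_compatible(seq, structure):
    """True if every paired position in the structure holds an allowed base pair."""
    if len(seq) != len(structure):
        return False
    return all((seq[i], seq[j]) in ALLOWED_PAIRS for i, j in parse_dot_bracket(structure))
```

A sequence designer would typically use a compatibility check of this kind only as a first filter, before evaluating thermodynamic stability or mutational robustness with a folding package.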
Analysis of Protein-RNA and Protein-Peptide Interactions in Equine Infectious Anemia
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Jae-Hyung
2007-01-01
Macromolecular interactions are essential for virtually all cellular functions including signal transduction processes, metabolic processes, regulation of gene expression and immune responses. This dissertation focuses on the characterization of two important macromolecular interactions involved in the relationship between Equine Infectious Anemia Virus (EIAV) and its host cell in horse: (1) the interaction between the EIAV Rev protein and its binding site, the Rev-responsive element (RRE) and (2) interactions between equine MHC class I molecules and epitope peptides derived from EIAV proteins. EIAV, one of the most divergent members of the lentivirus family, has a single-stranded RNA genome and carries several regulatory and structural proteins within its viral particle. Rev is an essential EIAV-encoded regulatory protein that interacts with the viral RRE, a specific binding site in the viral mRNA. Using a combination of experimental and computational methods, the interactions between EIAV Rev and RRE were characterized in detail. EIAV Rev was shown to have a bipartite RNA binding domain containing two arginine-rich motifs (ARMs). The RRE secondary structure was determined and specific structural motifs that act as cis-regulatory elements for EIAV Rev-RRE interaction were identified. Interestingly, a structural motif located in the high affinity Rev binding site is well conserved in several diverse lentiviral genomes, including HIV-1. Macromolecular interactions involved in the immune response of the horse to EIAV infection were investigated by analyzing complexes between MHC class I proteins and epitope peptides derived from EIAV Rev, Env and Gag proteins. Computational modeling results provided a mechanistic explanation for the experimental finding that a single amino acid change in the peptide binding domain of the equine MHC class I molecule differentially affects the recognition of specific epitopes by EIAV-specific CTL. 
Together, the findings in this dissertation provide novel insights into the strategy used by EIAV to replicate itself, and provide new details about how the host cell responds to and defends against EIAV upon the infection. Moreover, they have contributed to the understanding of the macromolecular recognition events that regulate these processes.
Kim, Inae; Kwak, Hoyun; Lee, Hee Kyu; Hyun, Soonsil; Jeong, Sunjoo
2012-01-01
RNA-binding proteins regulate multiple steps of RNA metabolism through both dynamic and combined binding. In addition to its crucial roles in cell adhesion and Wnt-activated transcription in cancer cells, β-catenin regulates RNA alternative splicing and stability possibly by binding to target RNA in cells. An RNA aptamer was selected for specific binding to β-catenin to address RNA recognition by β-catenin more specifically. Here, we characterized the structural properties of the RNA aptamer as a model and identified a β-catenin RNA motif. A similar RNA motif was found in a cellular RNA, the 3′-untranslated region (3′-UTR) of cyclooxygenase-2 (COX-2) mRNA. More significantly, the C-terminal domain of β-catenin interacted with HuR and the Armadillo repeat domain associated with RNA to form the RNA–β-catenin–HuR complex in vitro and in cells. Furthermore, the tertiary RNA–protein complex was predominantly found in the cytoplasm of colon cancer cells; thus, it might be related to COX-2 protein level and cancer progression. Taken together, the β-catenin RNA aptamer was valuable for deducing the cellular RNA aptamer and identifying novel and oncogenic RNA–protein networks in colon cancer cells. PMID:22544606
Identification of the sequence motif of glycoside hydrolase 13 family members
Kumar, Vikash
2011-01-01
A bioinformatics analysis of the sequences of glycoside hydrolase (GH) 13 family enzymes, such as α-amylase, cyclodextrin glycosyltransferase (CGTase), branching enzyme and cyclomaltodextrinase, was carried out using hidden Markov model (HMM) profiles in order to find the sequence motifs that govern the reaction specificities of these enzymes. This analysis suggests the existence of such sequence motifs, with residues of these motifs constituting the −1 to +3 catalytic subsites of the enzyme. Hence, by introducing mutations in the residues of these four subsites, one can change the reaction specificities of the enzymes. In general, it has been observed that the α-amylase sequence motifs have lower sequence conservation than the rest of the motifs of the GH13 family members. PMID:21544166
Structure-function characterization and optimization of a plant-derived antibacterial peptide.
Suarez, Mougli; Haenni, Marisa; Canarelli, Stéphane; Fisch, Florian; Chodanowski, Pierre; Servis, Catherine; Michielin, Olivier; Freitag, Ruth; Moreillon, Philippe; Mermod, Nicolas
2005-09-01
Crushed seeds of the Moringa oleifera tree have been used traditionally as natural flocculants to clarify drinking water. We previously showed that one of the seed peptides mediates both the sedimentation of suspended particles such as bacterial cells and a direct bactericidal activity, raising the possibility that the two activities might be related. In this study, the conformational modeling of the peptide was coupled to a functional analysis of synthetic derivatives. This indicated that partly overlapping structural determinants mediate the sedimentation and antibacterial activities. Sedimentation requires a positively charged, glutamine-rich portion of the peptide that aggregates bacterial cells. The bactericidal activity was localized to a sequence prone to form a helix-loop-helix structural motif. Amino acid substitution showed that the bactericidal activity requires hydrophobic proline residues within the protruding loop. Vital dye staining indicated that treatment with peptides containing this motif results in bacterial membrane damage. Assembly of multiple copies of this structural motif into a branched peptide enhanced antibacterial activity, since low concentrations effectively kill bacteria such as Pseudomonas aeruginosa and Streptococcus pyogenes without displaying a toxic effect on human red blood cells. This study thus identifies a synthetic peptide with potent antibacterial activity against specific human pathogens. It also suggests partly distinct molecular mechanisms for each activity. Sedimentation may result from coupled flocculation and coagulation effects, while the bactericidal activity would require bacterial membrane destabilization by a hydrophobic loop.
Structure-Function Characterization and Optimization of a Plant-Derived Antibacterial Peptide
Suarez, Mougli; Haenni, Marisa; Canarelli, Stéphane; Fisch, Florian; Chodanowski, Pierre; Servis, Catherine; Michielin, Olivier; Freitag, Ruth; Moreillon, Philippe; Mermod, Nicolas
2005-01-01
Crushed seeds of the Moringa oleifera tree have been used traditionally as natural flocculants to clarify drinking water. We previously showed that one of the seed peptides mediates both the sedimentation of suspended particles such as bacterial cells and a direct bactericidal activity, raising the possibility that the two activities might be related. In this study, the conformational modeling of the peptide was coupled to a functional analysis of synthetic derivatives. This indicated that partly overlapping structural determinants mediate the sedimentation and antibacterial activities. Sedimentation requires a positively charged, glutamine-rich portion of the peptide that aggregates bacterial cells. The bactericidal activity was localized to a sequence prone to form a helix-loop-helix structural motif. Amino acid substitution showed that the bactericidal activity requires hydrophobic proline residues within the protruding loop. Vital dye staining indicated that treatment with peptides containing this motif results in bacterial membrane damage. Assembly of multiple copies of this structural motif into a branched peptide enhanced antibacterial activity, since low concentrations effectively kill bacteria such as Pseudomonas aeruginosa and Streptococcus pyogenes without displaying a toxic effect on human red blood cells. This study thus identifies a synthetic peptide with potent antibacterial activity against specific human pathogens. It also suggests partly distinct molecular mechanisms for each activity. Sedimentation may result from coupled flocculation and coagulation effects, while the bactericidal activity would require bacterial membrane destabilization by a hydrophobic loop. PMID:16127062
Sun, Eric I; Leyn, Semen A; Kazanov, Marat D; Saier, Milton H; Novichkov, Pavel S; Rodionov, Dmitry A
2013-09-02
In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels. An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. 
The obtained genome-wide collection of reference RNA motif regulons is available in the RegPrecise database (http://regprecise.lbl.gov/).
Mechanisms of Zero-Lag Synchronization in Cortical Motifs
Gollo, Leonardo L.; Mirasso, Claudio; Sporns, Olaf; Breakspear, Michael
2014-01-01
Zero-lag synchronization between distant cortical areas has been observed in a diversity of experimental data sets and between many different regions of the brain. Several computational mechanisms have been proposed to account for such isochronous synchronization in the presence of long conduction delays: of these, the phenomenon of “dynamical relaying” – a mechanism that relies on a specific network motif – has proven to be the most robust with respect to parameter mismatch and system noise. Surprisingly, despite a contrary belief in the community, the common driving motif is an unreliable means of establishing zero-lag synchrony. Although dynamical relaying has been validated in empirical and computational studies, the deeper dynamical mechanisms and comparison to dynamics on other motifs are lacking. By systematically comparing synchronization on a variety of small motifs, we establish that the presence of a single reciprocally connected pair – a “resonance pair” – plays a crucial role in disambiguating those motifs that foster zero-lag synchrony in the presence of conduction delays (such as dynamical relaying) from those that do not (such as the common driving triad). Remarkably, minor structural changes to the common driving motif that incorporate a reciprocal pair recover robust zero-lag synchrony. The findings are observed in computational models of spiking neurons, populations of spiking neurons and neural mass models, and arise whether the oscillatory systems are periodic, chaotic, noise-free or driven by stochastic inputs. The influence of the resonance pair is also robust to parameter mismatch and asymmetrical time delays amongst the elements of the motif. We call this manner of facilitating zero-lag synchrony resonance-induced synchronization, outline the conditions for its occurrence, and propose that it may be a general mechanism to promote zero-lag synchrony in the brain. PMID:24763382
Nomura, Yusuke; Tanaka, Yoichiro; Fukunaga, Jun-ichi; Fujiwara, Kazuya; Chiba, Manabu; Iibuchi, Hiroaki; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Kozu, Tomoko; Sakamoto, Taiichi
2013-12-01
AML1/RUNX1 is an essential transcription factor involved in the differentiation of hematopoietic cells. AML1 binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. In a previous study, we obtained RNA aptamers against the AML1 Runt domain by systematic evolution of ligands by exponential enrichment and revealed that RNA aptamers exhibit higher affinity for the Runt domain than that for RDE and possess the 5'-GCGMGNN-3' and 5'-N'N'CCAC-3' conserved motif (M: A or C; N and N' form Watson-Crick base pairs) that is important for Runt domain binding. In this study, to understand the structural basis of recognition of the Runt domain by the aptamer motif, the solution structure of a 22-mer RNA was determined using nuclear magnetic resonance. The motif contains the AH(+)-C mismatch and base triple and adopts an unusual backbone structure. Structural analysis of the aptamer motif indicated that the aptamer binds to the Runt domain by mimicking the RDE sequence and structure. Our data should enhance the understanding of the structural basis of DNA mimicry by RNA molecules.
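The conserved element 5'-GCGMGNN-3' quoted above uses IUPAC degenerate codes, and scanning an RNA sequence for such a pattern can be sketched as below. This is only an illustration: the function names and test sequences are hypothetical, only the codes needed here are tabulated, and the requirement that N and N' form Watson-Crick pairs with the second element is deliberately not checked.

```python
import re

# Subset of IUPAC nucleotide codes, written for RNA (U instead of T).
IUPAC_RNA = {
    "A": "A", "C": "C", "G": "G", "U": "U",
    "M": "[AC]",      # M: A or C, as defined in the abstract
    "N": "[ACGU]",    # N: any nucleotide
}

def motif_to_regex(motif):
    """Compile an IUPAC motif into a regex that also reports overlapping hits."""
    body = "".join(IUPAC_RNA[code] for code in motif)
    # The zero-width lookahead lets finditer return overlapping matches.
    return re.compile("(?=(" + body + "))")

def find_motif(seq, motif):
    """Return a list of (start, matched_substring) for every occurrence."""
    return [(m.start(), m.group(1)) for m in motif_to_regex(motif).finditer(seq)]

hits = find_motif("AAGCGAGUUCC", "GCGMGNN")
```

A fuller check would pair each hit with a downstream N'N'CCAC element and confirm that the N' bases are complementary to the N bases, which this sketch leaves to the reader.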
Kandimalla, Ekambar R; Bhagat, Lakshmi; Zhu, Fu-Gang; Yu, Dong; Cong, Yan-Ping; Wang, Daqing; Tang, Jimmy X; Tang, Jin-Yan; Knetter, Cathrine F; Lien, Egil; Agrawal, Sudhir
2003-11-25
Bacterial and synthetic DNAs containing CpG dinucleotides in specific sequence contexts activate the vertebrate immune system through Toll-like receptor 9 (TLR9). In the present study, we used a synthetic nucleoside with a bicyclic heterobase [1-(2'-deoxy-beta-d-ribofuranosyl)-2-oxo-7-deaza-8-methyl-purine; R] to replace the C in CpG, resulting in an RpG dinucleotide. The RpG dinucleotide was incorporated in mouse- and human-specific motifs in oligodeoxynucleotides (oligos) and 3'-3'-linked oligos, referred to as immunomers. Oligos containing the RpG motif induced cytokine secretion in mouse spleen-cell cultures. Immunomers containing RpG dinucleotides showed activity in transfected-HEK293 cells stably expressing mouse TLR9, suggesting direct involvement of TLR9 in the recognition of the RpG motif. In J774 macrophages, RpG motifs activated NF-kappa B and mitogen-activated protein kinase pathways. Immunomers containing the RpG dinucleotide induced high levels of IL-12 and IFN-gamma, but lower IL-6, in a time- and concentration-dependent fashion in mouse spleen-cell cultures costimulated with IL-2. Importantly, immunomers containing GTRGTT and GARGTT motifs were recognized to a similar extent by both mouse and human immune systems. Additionally, both mouse- and human-specific RpG immunomers potently stimulated proliferation of peripheral blood mononuclear cells obtained from diverse vertebrate species, including monkey, pig, horse, sheep, goat, rat, and chicken. An immunomer containing the GTRGTT motif prevented conalbumin-induced and ragweed allergen-induced allergic inflammation in mice. We show that a synthetic bicyclic nucleotide is recognized in the C position of a CpG dinucleotide by immune cells from diverse vertebrate species without bias for flanking sequences, suggesting a divergent nucleotide motif recognition pattern of TLR9.
Pisanti, Nadia; Soldano, Henry; Carpentier, Mathilde; Pothier, Joel
2009-12-01
The geometrical configurations of atoms in protein structures can be viewed as approximate relations among them. Then, finding similar common substructures within a set of protein structures belongs to a new class of problems that generalizes that of finding repeated motifs. The novelty lies in the addition of constraints on the motifs in terms of relations that must hold between pairs of positions of the motifs. We will hence denote them as relational motifs. For this class of problems, we present an algorithm that is a suitable extension of the KMR paradigm and, in particular, of the KMRC as it uses a degenerate alphabet. Our algorithm contains several improvements that become especially useful when, as is required for relational motifs, the inference is made by partially overlapping shorter motifs, rather than concatenating them. The efficiency, correctness and completeness of the algorithm are ensured by several non-trivial properties that are proven in this paper. The algorithm has been applied in the important field of protein common 3D substructure searching. The methods implemented have been tested on several examples of protein families such as serine proteases, globins and cytochromes P450. The detected motifs have been compared to those found by multiple structural alignment methods.
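For readers unfamiliar with the KMR paradigm mentioned above, the basic idea (labelling all substrings of a given length by combining the labels of two shorter, possibly overlapping substrings) can be sketched on an ordinary string. This is a simplified illustration with a hypothetical function name; it uses an exact alphabet and handles neither the degenerate alphabet of KMRC nor the relational constraints introduced in the paper.

```python
def kmr_repeats(text, k):
    """Group start positions of repeated length-k substrings, KMR style.

    Labels for length-m substrings are combined in pairs to obtain labels
    for longer substrings. When k is not a power of two, the two halves
    are allowed to overlap, which is the 'partially overlapping' case.
    """
    # Length-1 labels: one integer per distinct symbol.
    ranks = {c: r for r, c in enumerate(sorted(set(text)))}
    labels = [ranks[c] for c in text]
    m = 1
    while m < k:
        step = min(m, k - m)          # shift between the two combined substrings
        new_len = m + step
        pairs = [(labels[i], labels[i + step])
                 for i in range(len(text) - new_len + 1)]
        ranks = {p: r for r, p in enumerate(sorted(set(pairs)))}
        labels = [ranks[p] for p in pairs]
        m = new_len

    # Positions sharing a label start identical substrings of length k.
    groups = {}
    for i, lab in enumerate(labels):
        groups.setdefault(lab, []).append(i)
    return [positions for positions in groups.values() if len(positions) > 1]

repeats = kmr_repeats("ABCABCAB", 3)
```

Two length-(m + step) substrings are equal exactly when both of their length-m components carry equal labels, which is why the pair of labels can serve as the new label.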
Fauteux, François; Strömvik, Martina V
2009-01-01
Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs. 
The majority of discovered motifs match experimentally characterized cis-regulatory elements. These results provide a good starting point for further experimental analysis of plant seed-specific promoters and our methodology can be used to unravel more transcriptional regulatory mechanisms in plants and other eukaryotes. PMID:19843335
Brendolise, Cyril; Espley, Richard V; Lin-Wang, Kui; Laing, William; Peng, Yongyan; McGhie, Tony; Dejnoprat, Supinya; Tomes, Sumathi; Hellens, Roger P; Allan, Andrew C
2017-01-01
In apple, the MYB transcription factor MYB10 controls the accumulation of anthocyanins. MYB10 is able to auto-activate its expression by binding its own promoter at a specific motif, the R1 motif. In some apple accessions a natural mutation, termed R6, has more copies of this motif within the MYB10 promoter, resulting in stronger auto-activation and elevated anthocyanins. Here we show that other anthocyanin-related MYBs selected from apple, pear, strawberry, petunia, kiwifruit and Arabidopsis are able to activate promoters containing the R6 motif. To examine the specificity of this motif, members of the R2R3 MYB family were screened against a promoter harboring the R6 mutation. Only MYBs from subgroups 5 and 6 activate expression by binding the R6 motif, with these MYBs sharing conserved residues in their R2R3 DNA binding domains. Insertion of the apple R6 motif into orthologous promoters of MYB10 in pear (PcMYB10) and Arabidopsis (AtMYB75) elevated anthocyanin levels. Introduction of the R6 motif into the promoter region of an anthocyanin biosynthetic enzyme, F3'5'H of kiwifruit, imparts regulation by MYB10. This results in elevated levels of delphinidin in both tobacco and kiwifruit. Finally, an R6 motif inserted into the promoter of the vitamin C biosynthesis gene GDP-L-Gal phosphorylase increases vitamin C content in a MYB10-dependent manner. This motif therefore provides a tool to re-engineer novel MYB-regulated responses in plants.
Identification of helix capping and β-turn motifs from NMR chemical shifts
Shen, Yang; Bax, Ad
2012-01-01
We present an empirical method for identification of distinct structural motifs in proteins on the basis of experimentally determined backbone and 13Cβ chemical shifts. Elements identified include the N-terminal and C-terminal helix capping motifs and five types of β-turns: I, II, I′, II′ and VIII. Using a database of proteins of known structure, the NMR chemical shifts, together with the PDB-extracted amino acid preference of the helix capping and β-turn motifs are used as input data for training an artificial neural network algorithm, which outputs the statistical probability of finding each motif at any given position in the protein. The trained neural networks, contained in the MICS (motif identification from chemical shifts) program, also provide a confidence level for each of their predictions, and values ranging from ca 0.7–0.9 for the Matthews correlation coefficient of its predictions far exceed that attainable by sequence analysis. MICS is anticipated to be useful both in the conventional NMR structure determination process and for enhancing on-going efforts to determine protein structures solely on the basis of chemical shift information, where it can aid in identifying protein database fragments suitable for use in building such structures. PMID:22314702
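The Matthews correlation coefficient quoted above can be computed from a binary confusion matrix. The following is a minimal Python sketch of the standard formula, not code from MICS; the function name and the convention of returning 0.0 for a zero denominator are illustrative assumptions.

```python
import math


def matthews_corrcoef(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient for a binary classifier.

    tp, tn, fp, fn are counts of true positives, true negatives,
    false positives and false negatives.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Assumed convention: report 0.0 when the coefficient is undefined.
        return 0.0
    return (tp * tn - fp * fn) / denom
```

A value of 1 corresponds to perfect prediction, 0 to chance-level prediction, and -1 to complete disagreement, so the reported range of about 0.7 to 0.9 indicates strong agreement with the reference assignments.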
del Val, Coral; White, Stephen H.
2014-01-01
We combined systematic bioinformatics analyses and molecular dynamics simulations to assess the conservation patterns of Ser and Thr motifs in membrane proteins, and the effect of such motifs on the structure and dynamics of α-helical transmembrane (TM) segments. We find that Ser/Thr motifs are often present in β-barrel TM proteins. At least one Ser/Thr motif is present in almost half of the sequences of α-helical proteins analyzed here. The extensive bioinformatics analyses and inspection of protein structures led to the identification of molecular transporters with noticeable numbers of Ser/Thr motifs within the TM region. Given the energetic penalty for burying multiple Ser/Thr groups in the membrane hydrophobic core, the observation of transporters with multiple membrane-embedded Ser/Thr is intriguing and raises the question of how the presence of multiple Ser/Thr affects protein local structure and dynamics. Molecular dynamics simulations of four different Ser-containing model TM peptides indicate that backbone hydrogen bonding of membrane-buried Ser/Thr hydroxyl groups can significantly change the local structure and dynamics of the helix. Ser groups located close to the membrane interface can hydrogen bond to solvent water instead of protein backbone, leading to an enhanced local solvation of the peptide. PMID:22836667
Nonin, S; Phan, A T; Leroy, J L
1997-09-15
Repetitive cytosine-rich DNA sequences have been identified in telomeres and centromeres of eukaryotic chromosomes. These sequences play a role in maintaining chromosome stability during replication and may be involved in chromosome pairing during meiosis. The C-rich repeats can fold into an 'i-motif' structure, in which two parallel-stranded duplexes with hemiprotonated C.C+ pairs are intercalated. Previous NMR studies of naturally occurring repeats have produced poor NMR spectra. This led us to investigate oligonucleotides, based on natural sequences, to produce higher quality spectra and thus provide further information as to the structure and possible biological function of the i-motif. NMR spectroscopy has shown that d(5mCCTTTACC) forms an i-motif dimer of symmetry-related and intercalated folded strands. The high-definition structure is computed on the basis of the build-up rates of 29 intraresidue and 35 interresidue nuclear Overhauser effect (NOE) connectivities. The i-motif core includes intercalated interstrand C.C+ pairs stacked in the order 2*.8/1.7*/1*.7/2.8* (where one strand is distinguished by an asterisk and the numbers relate to the base positions within the repeat). The TTTA sequences form two loops which span the two wide grooves on opposite sides of the i-motif core; the i-motif core is extended at both ends by the stacking of A6 onto C2.C8+. The lifetimes of pairs C2.C8+ and 5mC1.C7+ are 1 ms and 1 s, respectively, at 15 degrees C. Anomalous exchange properties of the T3 imino proton indicate hydrogen bonding to A6 N7 via a water bridge. The d(5mCCTTTTCC) deoxyoligonucleotide, in which position 6 is occupied by a thymidine instead of an adenine, also forms a symmetric i-motif dimer. However, in this structure the two TTTT loops are located on the same side of the i-motif core and the C.C+ pairs are formed by equivalent cytidines stacked in the order 8*.8/1.1*/7*.7/2.2*. 
Oligodeoxynucleotides containing two C-rich repeats can fold and dimerize into an i-motif. The change of folding topology resulting from the substitution of a single nucleoside emphasizes the influence of the loop residues on the i-motif structure formed by two folded strands.
Jauch, Ralf; Ng, Calista K L; Narasimhan, Kamesh; Kolatkar, Prasanna R
2012-04-01
It has recently been proposed that the sequence preferences of DNA-binding TFs (transcription factors) can be well described by models that include the positional interdependence of the nucleotides of the target sites. Such binding models allow for multiple motifs to be invoked, such as principal and secondary motifs differing at two or more nucleotide positions. However, the structural mechanisms underlying the accommodation of such variant motifs by TFs remain elusive. In the present study we examine the crystal structure of the HMG (high-mobility group) domain of Sox4 [Sry (sex-determining region on the Y chromosome)-related HMG box 4] bound to DNA. By comparing this structure with previously solved structures of Sox17 and Sox2, we observed subtle conformational differences at the DNA-binding interface. Furthermore, using quantitative electrophoretic mobility-shift assays we validated the positional interdependence of two nucleotides and the presence of a secondary Sox motif in the affinity landscape of Sox4. These results suggest that a concerted rearrangement of two interface amino acids enables Sox4 to accommodate primary and secondary motifs. The structural adaptations lead to altered dinucleotide preferences that mutually reinforce each other. These analyses underline the complexity of the DNA recognition by TFs and provide an experimental validation for the conceptual framework of positional interdependence and secondary binding motifs.
Maximum likelihood density modification by pattern recognition of structural motifs
Terwilliger, Thomas C.
2004-04-13
An electron density for a crystallographic structure having protein regions and solvent regions is improved by maximizing the log likelihood of a set of structure factors $\{F_h\}$ using a local log-likelihood function of the form $\ln\left[\,p(\rho(x)\mid \mathrm{PROT})\,p_{\mathrm{PROT}}(x) + p(\rho(x)\mid \mathrm{SOLV})\,p_{\mathrm{SOLV}}(x) + p(\rho(x)\mid \mathrm{H})\,p_{\mathrm{H}}(x)\,\right]$, where $p_{\mathrm{PROT}}(x)$ is the probability that $x$ is in the protein region, $p(\rho(x)\mid \mathrm{PROT})$ is the conditional probability for $\rho(x)$ given that $x$ is in the protein region, and $p_{\mathrm{SOLV}}(x)$ and $p(\rho(x)\mid \mathrm{SOLV})$ are the corresponding quantities for the solvent region; $p_{\mathrm{H}}(x)$ refers to the probability that there is a structural motif at a known location, with a known orientation, in the vicinity of the point $x$; and $p(\rho(x)\mid \mathrm{H})$ is the probability distribution for electron density at this point given that the structural motif actually is present. One appropriate structural motif is a helical structure within the crystallographic structure.
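The local log-likelihood described above is the logarithm of a three-component mixture (protein, solvent, structural motif). The sketch below evaluates it at a single point in Python. It assumes a natural logarithm, and the function and argument names are hypothetical, chosen only to mirror the symbols in the abstract.

```python
import math


def local_log_likelihood(p_rho_prot: float, p_prot: float,
                         p_rho_solv: float, p_solv: float,
                         p_rho_h: float = 0.0, p_h: float = 0.0) -> float:
    """Log of the mixture density at one grid point x.

    p_rho_prot, p_rho_solv, p_rho_h: conditional densities of rho(x) given
        that x lies in protein, solvent, or a structural motif (e.g. a helix).
    p_prot, p_solv, p_h: prior probabilities that x lies in each region.
    """
    mixture = p_rho_prot * p_prot + p_rho_solv * p_solv + p_rho_h * p_h
    if mixture <= 0.0:
        raise ValueError("mixture density must be positive")
    return math.log(mixture)  # natural log is an assumption of this sketch
```

For example, `local_log_likelihood(0.5, 0.6, 0.1, 0.3, 0.9, 0.1)` evaluates the logarithm of 0.30 + 0.03 + 0.09 = 0.42.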
Dimasi, Nazzareno
2007-01-01
The Grb2-like adaptor protein GADS is essential for tyrosine kinase-dependent signaling in T lymphocytes. Following T cell receptor ligation, GADS interacts through its C-terminal SH3 domain with the adaptors SLP-76 and LAT, to form a multiprotein signaling complex that is crucial for T cell activation. To understand the structural basis for the selective recognition of GADS by SLP-76, herein is reported the crystal structure at 1.54 Angstrom of the C-terminal SH3 domain of GADS bound to the SLP-76 motif 233-PSIDRSTKP-241, which represents the minimal binding site. In addition to the unique structural features adopted by the bound SLP-76 peptide, the complex structure reveals a unique SH3-SH3 interaction. This homophilic interaction, which is observed in the presence of the SLP-76 peptide and is present in solution, extends our understanding of the molecular mechanisms that could be employed by modular proteins to increase their signal transduction specificity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bharat, Somireddy Venkata; Shekhtman, Alexander; Pande, Jayanti, E-mail: jpande@albany.edu
2014-01-03
Highlights: •We present NMR analysis of V41M, a cataract-causing mutant of human γS-crystallin. •Mutation alters strand–strand interactions throughout the N-terminal domain. •Mutation directly affects Trp46 due to key Met41-S–Trp46-pi interactions. •We identify the basis of the surface hydrophobicity increase and residues involved. -- Abstract: The major crystallins expressed in the human lens are γS-, γC- and γD-crystallins. Several mutations in γS-crystallin are associated with hereditary cataracts, one of which involves the substitution of a highly conserved Valine at position 41 to Methionine. According to a recent report, the mutant protein, V41M, shows lower stability and increased surface hydrophobicity compared to the wild-type, and a propensity for self-aggregation. Here we address the structural differences between the two proteins, with residue-level specificity using NMR spectroscopy. Based on the structural model of the mutant protein, our results clearly show that the mutation creates a major local perturbation almost at the junction of the first and second “Greek-key” motifs in the N-terminal domain. A larger section of the second motif (residues 44–86) appears to be mainly affected. Based on the sizeable chemical shift of the imino proton of the indole side-chain of Trp46 in V41M, we suggest that the sulphur atom of Met41 is involved in an S–π interaction with Trp46. This interaction would bring the last β-strand of the first “Greek-key” motif closer to the first β-strand of the second motif. This appears to lead to a domino effect, towards both the N- and C-terminal ends, even as it decays off substantially beyond the domain interface. During this process discrete hydrophobic surface patches are created, as revealed by ANS-binding. 
Such changes would not affect the secondary structure or cause a major change in the tertiary structure, but can lead to self-aggregation or aberrant binding interactions of the mutant protein in vivo, and lead to lens opacity or cataract.
Erceg, Jelena; Saunders, Timothy E.; Girardot, Charles; Devos, Damien P.; Hufnagel, Lars; Furlong, Eileen E. M.
2014-01-01
Deciphering the specific contribution of individual motifs within cis-regulatory modules (CRMs) is crucial to understanding how gene expression is regulated and how this process is affected by sequence variation. But despite vast improvements in the ability to identify where transcription factors (TFs) bind throughout the genome, we are limited in our ability to relate information on motif occupancy to function from sequence alone. Here, we engineered 63 synthetic CRMs to systematically assess the relationship between variation in the content and spacing of motifs within CRMs to CRM activity during development using Drosophila transgenic embryos. In over half the cases, very simple elements containing only one or two types of TF binding motifs were capable of driving specific spatio-temporal patterns during development. Different motif organizations provide different degrees of robustness to enhancer activity, ranging from binary on-off responses to more subtle effects including embryo-to-embryo and within-embryo variation. By quantifying the effects of subtle changes in motif organization, we were able to model biophysical rules that explain CRM behavior and may contribute to the spatial positioning of CRM activity in vivo. For the same enhancer, the effects of small differences in motif positions varied in developmentally related tissues, suggesting that gene expression may be more susceptible to sequence variation in one tissue compared to another. This result has important implications for human eQTL studies in which many associated mutations are found in cis-regulatory regions, though the mechanism for how they affect tissue-specific gene expression is often not understood. PMID:24391522
2012-01-01
Background Discovery of functionally significant short, statistically overrepresented subsequence patterns (motifs) in a set of sequences is a challenging problem in bioinformatics. Oftentimes, not all sequences in the set contain a motif. These non-motif-containing sequences complicate the algorithmic discovery of motifs. Filtering the non-motif-containing sequences from the larger set of sequences while simultaneously determining the identity of the motif is, therefore, desirable and a non-trivial problem in motif discovery research. Results We describe MotifCatcher, a framework that extends the sensitivity of existing motif-finding tools by employing random sampling to effectively remove non-motif-containing sequences from the motif search. We developed two implementations of our algorithm, each built around a commonly used motif-finding tool, and applied our algorithm to three diverse chromatin immunoprecipitation (ChIP) data sets. In each case, the motif finder with the MotifCatcher extension demonstrated improved sensitivity over the motif finder alone. Our approach organizes candidate functionally significant discovered motifs into a tree, which allowed us to make additional insights. In all cases, we were able to support our findings with experimental work from the literature. Conclusions Our framework demonstrates that additional processing at the sequence entry level can significantly improve the performance of existing motif-finding tools. For each biological data set tested, we were able to propose novel biological hypotheses supported by experimental work from the literature. Specifically, in Escherichia coli, we suggested binding site motifs for 6 non-traditional LexA protein binding sites; in Saccharomyces cerevisiae, we hypothesize 2 disparate mechanisms for novel binding sites of the Cse4p protein; and in Halobacterium sp. NRC-1, we discovered subtle differences in a general transcription factor (GTF) binding site motif across several data sets. 
We suggest that small differences in our discovered motif could confer specificity for one or more homologous GTF proteins. We offer a free implementation of the MotifCatcher software package at http://www.bme.ucdavis.edu/facciotti/resources_data/software/. PMID:23181585
Borthakur, Susmita; Lee, HyeongJu; Kim, SoonJeung; Wang, Bing-Cheng; Buck, Matthias
2014-01-01
The sterile α motif (SAM) domain of the ephrin receptor tyrosine kinase, EphA2, undergoes tyrosine phosphorylation, but the effect of phosphorylation on the structure and interactions of the receptor is unknown. Studies to address these questions have been hindered by the difficulty of obtaining site-specifically phosphorylated proteins in adequate amounts. Here, we describe the use of chemically synthesized and specifically modified domain-length peptides to study the behavior of phosphorylated EphA2 SAM domains. We show that tyrosine phosphorylation of any of the three tyrosines, Tyr921, Tyr930, and Tyr960, has a surprisingly small effect on the EphA2 SAM structure and stability. However, phosphorylation at Tyr921 and Tyr930 enables differential binding to the Src homology 2 domain of the adaptor protein Grb7, which we propose will lead to distinct functional outcomes. Setting up different signaling platforms defined by selective interactions with adaptor proteins thus adds another level of regulation to EphA2 signaling. PMID:24825902
Discovery of phosphorylation motif mixtures in phosphoproteomics data
Ritz, Anna; Shakhnarovich, Gregory; Salomon, Arthur R.; Raphael, Benjamin J.
2009-01-01
Motivation: Modification of proteins via phosphorylation is a primary mechanism for signal transduction in cells. Phosphorylation sites on proteins are determined in part through particular patterns, or motifs, present in the amino acid sequence. Results: We describe an algorithm that simultaneously discovers multiple motifs in a set of peptides that were phosphorylated by several different kinases. Such sets of peptides are routinely produced in proteomics experiments. Our motif-finding algorithm uses the principle of minimum description length to determine a mixture of sequence motifs that distinguish a foreground set of phosphopeptides from a background set of unphosphorylated peptides. We show that our algorithm outperforms existing motif-finding algorithms on synthetic datasets consisting of mixtures of known phosphorylation sites. We also derive a motif specificity score that quantifies whether or not the phosphoproteins containing an instance of a motif have a significant number of known interactions. Application of our motif-finding algorithm to recently published human and mouse proteomic studies recovers several known phosphorylation motifs and reveals a number of novel motifs that are enriched for interactions with a particular kinase or phosphatase. Our tools provide a new approach for uncovering the sequence specificities of uncharacterized kinases or phosphatases. Availability: Software is available at http:/cs.brown.edu/people/braphael/software.html. Contact: aritz@cs.brown.edu; braphael@cs.brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:18996944
Identification of cancer-specific motifs in mimotope profiles of serum antibody repertoire.
Gerasimov, Ekaterina; Zelikovsky, Alex; Măndoiu, Ion; Ionov, Yurij
2017-06-07
For fighting cancer, earlier detection is crucial. Circulating auto-antibodies produced by the patient's own immune system after exposure to cancer proteins are promising bio-markers for the early detection of cancer. Since an antibody recognizes not the whole antigen but 4-7 critical amino acids within the antigenic determinant (epitope), the whole proteome can be represented by a random peptide phage display library. This opens the possibility to develop an early cancer detection test based on a set of peptide sequences identified by comparing cancer patients' and healthy donors' global peptide profiles of antibody specificities. Due to the enormously large number of peptide sequences contained in global peptide profiles generated by next generation sequencing, a large number of cancer and control sera is required to identify cancer-specific peptides with a high degree of statistical significance. To decrease the number of peptides in profiles generated by next generation sequencing without losing cancer-specific sequences, we generated the profiles from a phage library enriched by panning on a pool of cancer sera. To further decrease the complexity of the profiles, we used computational methods to transform the list of peptides constituting the mimotope profiles into a list of motifs formed by similar peptide sequences. We have shown that the amino-acid order is meaningful in mimotope motifs, since they contain significantly more peptides than motifs among peptides in which the amino acids are randomly permuted. Also, the single-sample motifs differ significantly from motifs in peptides drawn from multiple samples. Finally, multiple cancer-specific motifs have been identified.
Direct Sequence Detection of Structured H5 Influenza Viral RNA
Kerby, Matthew B.; Freeman, Sarah; Prachanronarong, Kristina; Artenstein, Andrew W.; Opal, Steven M.; Tripathi, Anubhav
2008-01-01
We describe the development of sequence-specific molecular beacons (dual-labeled DNA probes) for identification of the H5 influenza subtype, cleavage motif, and receptor specificity when hybridized directly with in vitro transcribed viral RNA (vRNA). The cloned hemagglutinin segment from a highly pathogenic H5N1 strain, A/Hanoi/30408/2005(H5N1), isolated from humans was used as template for in vitro transcription of sense-strand vRNA. The hybridization behavior of vRNA and a conserved subtype probe was characterized experimentally by varying conditions of time, temperature, and Mg2+ to optimize detection. Comparison of the hybridization rates of probe to DNA and RNA targets indicates that conformational switching of influenza RNA structure is a rate-limiting step and that the secondary structure of vRNA dominates the binding kinetics. The sensitivity and specificity of probe recognition of other H5 strains was calculated from sequence matches to the National Center for Biotechnology Information influenza database. The hybridization specificity of the subtype probes was experimentally verified with point mutations within the probe loop at five locations corresponding to the other human H5 strains. The abundance frequencies of the hemagglutinin cleavage motif and sialic acid recognition sequences were experimentally tested for H5 in all host viral species. Although the detection assay must be coupled with isothermal amplification on the chip, the new probes form the basis of a portable point-of-care diagnostic device for influenza subtyping. PMID:18403607
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Zhongchuan; Xie, Tian; Key Laboratory of Environmental Microbiology of Sichuan Province, Chengdu 610041, People’s Republic of
2016-03-24
The crystal structure of CotA complexed with 2,2-azinobis-(3-ethylbenzothiazoline-6-sulfonate) in a hole motif has been solved; this novel binding site could be a potential structure-based target for protein engineering of CotA laccase. The CotA laccase from Bacillus subtilis is an abundant component of the spore outer coat and has been characterized as a typical laccase. The crystal structure of CotA complexed with 2,2-azinobis-(3-ethylbenzothiazoline-6-sulfonate) (ABTS) in a hole motif has been solved. The novel binding site was about 26 Å away from the T1 binding pocket. Comparison with known structures of other laccases revealed that the hole is a specific feature of CotA. The key residues Arg476 and Ser360 were directly bound to ABTS. Site-directed mutagenesis studies revealed that the residues Arg146, Arg429 and Arg476, which are located at the bottom of the novel binding site, are essential for the oxidation of ABTS and syringaldazine. In particular, a Thr480Phe variant was identified to be almost 3.5 times more specific for ABTS than for syringaldazine compared with the wild type. These results suggest this novel binding site for ABTS could be a potential target for protein engineering of CotA laccases.
Secbase: database module to retrieve secondary structure elements with ligand binding motifs.
Koch, Oliver; Cole, Jason; Block, Peter; Klebe, Gerhard
2009-10-01
Secbase is presented as a novel extension module of Relibase. It integrates the information about secondary structure elements into the retrieval facilities of Relibase. The data are accessible via the extended Relibase user interface, and integrated retrieval queries can be addressed using an extended version of Reliscript. The primary information about alpha-helices and beta-sheets is used as provided by the PDB. Furthermore, a uniform classification of all turn families, based on recent clustering methods, and a new helix assignment that is based on this turn classification has been included. Algorithms to analyze the geometric features of helices and beta-strands were also implemented. To demonstrate the performance of the Secbase implementation, some application examples are given. They provide new insights into the involvement of secondary structure elements in ligand binding. A survey of water molecules detected next to the N-terminus of helices is analyzed to show their involvement in ligand binding. Additionally, the parallel oriented NH groups at the alpha-helix N-termini provide special binding motifs to bind particular ligand functional groups with two adjacent oxygen atoms, e.g., as found in negatively charged carboxylate or phosphate groups, respectively. The present study also shows that the specific structure of the first turn of alpha-helices provides a suitable explanation for stabilizing charged structures. The magnitude of the overall helix macrodipole seems to have no or only a minor influence on binding. Furthermore, an overview of the involvement of secondary structure elements with the recognition of some important endogenous ligands such as cofactors shows some distinct preference for particular binding motifs and amino acids.
Morales, Lucia; Mateos-Gomez, Pedro A.; Capiscol, Carmen; del Palacio, Lorena; Sola, Isabel
2013-01-01
Preferential RNA packaging in coronaviruses involves the recognition of viral genomic RNA, a crucial process for viral particle morphogenesis mediated by RNA-specific sequences, known as packaging signals. An essential packaging signal component of transmissible gastroenteritis coronavirus (TGEV) has been further delimited to the first 598 nucleotides (nt) from the 5′ end of its RNA genome, by using recombinant viruses transcribing subgenomic mRNA that included potential packaging signals. The integrity of the entire sequence domain was necessary because deletion of any of the five structural motifs defined within this region abrogated specific packaging of this viral RNA. One of these RNA motifs was the stem-loop SL5, a highly conserved motif in coronaviruses located at nucleotide positions 106 to 136. Partial deletion or point mutations within this motif also abrogated packaging. Using TGEV-derived defective minigenomes replicated in trans by a helper virus, we have shown that TGEV RNA packaging is a replication-independent process. Furthermore, the last 494 nt of the genomic 3′ end were not essential for packaging, although this region increased packaging efficiency. TGEV RNA sequences identified as necessary for viral genome packaging were not sufficient to direct packaging of a heterologous sequence derived from the green fluorescent protein gene. These results indicated that TGEV genome packaging is a complex process involving many factors in addition to the identified RNA packaging signal. The identification of well-defined RNA motifs within the TGEV RNA genome that are essential for packaging will be useful for designing packaging-deficient biosafe coronavirus-derived vectors and providing new targets for antiviral therapies. PMID:23966403
NASA Astrophysics Data System (ADS)
Bai, Lina; Xie, Ting; Hu, Qingqing; Deng, Changyan; Zheng, Rong; Chen, Wanping
2015-10-01
Ferritins are highly conserved proteins that are widely distributed in various species from archaea to humans. The ubiquitous characteristic of these proteins reflects the pivotal contribution of ferritins to the safe storage and timely delivery of iron to achieve iron homeostasis. This study investigated the ferritin genes in 248 genomes from various species, including viruses, archaea, bacteria, and eukarya. The distribution comparison suggests that mammals and eudicots possess abundant ferritin genes, whereas fungi contain very few ferritin genes. Archaea and bacteria show considerable numbers of ferritin genes. Generally, prokaryotes possess three types of ferritin (the typical ferritin, bacterioferritin, and DNA-binding protein from starved cell), whereas eukaryotes have various subunit types of ferritin, thereby indicating the individuation of the ferritin family during evolution. The characteristic motif analysis of ferritins suggested that all key residues specifying the unique structural motifs of ferritin are highly conserved across three domains of life. Meanwhile, the characteristic motifs were also distinguishable between ferritin groups, especially phytoferritins, which show a plant-specific motif. The phylogenetic analyses show that ferritins within the same subfamily or subunits are generally clustered together. The phylogenetic relationships among ferritin members suggest that both gene duplication and horizontal transfer contribute to the wide variety of ferritins, and their possible evolutionary scenario was also proposed. The results contribute to a better understanding of the distribution, characteristic motif, and evolutionary relationship of the ferritin family.
Recognition of p63 by the E3 ligase ITCH: Effect of an ectodermal dysplasia mutant.
Bellomaria, A; Barbato, Gaetano; Melino, G; Paci, M; Melino, Sonia
2010-09-15
The E3 ubiquitin ligase Itch mediates the degradation of the p63 protein. Itch contains four WW domains which are pivotal for the substrate recognition process. Indeed, this domain is implicated in several signalling complexes crucially involved in human diseases including Muscular Dystrophy, Alzheimer's Disease and Huntington Disease. WW domains are highly compact protein-protein binding modules that interact with short proline-rich sequences. The four WW domains present in Itch belong to the Group I type, which binds polypeptides with a PY motif characterized by a PPxY consensus sequence, where x can be any residue. Accordingly, the Itch-p63 interaction results from a direct binding of the Itch-WW2 domain with the PY motif of p63. Here, we report a structural analysis of the Itch-p63 interaction by fluorescence, CD and NMR spectroscopy. Specifically, we studied the in vitro interaction between the Itch-WW2 domain and p63(534-551), an 18-mer peptide encompassing a fragment of the p63 protein including the PY motif. In addition, we evaluated the conformation and the interaction with Itch-WW2 of a site-specific mutant of p63, I549T, that has been reported in both Hay-Wells syndrome and Rapp-Hodgkin syndrome. Based on our results, we propose an extended PPxY motif for Itch recognition (P-P-P-Y-x(4)-[ST]-[ILV]), which includes residues C-terminal to the PPxY motif.
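The extended recognition motif quoted in this abstract is written in a PROSITE-like notation, which can be turned into a regular expression for scanning sequences. The following is only a minimal sketch under stated assumptions: the helper names `prosite_to_regex` and `find_motif` are hypothetical, the converter handles only the constructs that appear in this one pattern, and the test peptides are invented for illustration and are not the p63 sequence.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate a simple PROSITE-style pattern into a Python regex.

    Supports only literal residues, the lowercase 'x' wildcard,
    '[...]' alternatives and '(n)' repeat counts.
    """
    parts = []
    for element in pattern.split("-"):
        m = re.fullmatch(r"(\[[A-Z]+\]|[A-Za-z])(?:\((\d+)\))?", element)
        if m is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        residue, count = m.groups()
        if residue == "x":  # 'x' means any residue
            residue = "."
        parts.append(residue + (f"{{{count}}}" if count else ""))
    return "".join(parts)


def find_motif(sequence: str, regex: str):
    """Return (1-based start, matched text) for each non-overlapping match."""
    return [(m.start() + 1, m.group()) for m in re.finditer(regex, sequence)]


# Pattern as quoted in the abstract.
EXTENDED_PY = prosite_to_regex("P-P-P-Y-x(4)-[ST]-[ILV]")

if __name__ == "__main__":
    print(EXTENDED_PY)
    # Invented peptides: the second differs only at the final motif position.
    print(find_motif("AAPPPYAAAASIKK", EXTENDED_PY))
    print(find_motif("AAPPPYAAAASTKK", EXTENDED_PY))
```

In this toy case a single substitution at the last motif position (Ile to Thr) is enough to remove the pattern match, which is the kind of effect a pattern scan can flag; it says nothing about binding affinity.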
Substrate sequence selectivity of APOBEC3A implicates intra-DNA interactions.
Silvas, Tania V; Hou, Shurong; Myint, Wazo; Nalivaika, Ellen; Somasundaran, Mohan; Kelch, Brian A; Matsuo, Hiroshi; Kurt Yilmaz, Nese; Schiffer, Celia A
2018-05-14
The APOBEC3 (A3) family of human cytidine deaminases is renowned for providing a first line of defense against many exogenous and endogenous retroviruses. However, the ability of these proteins to deaminate deoxycytidines in ssDNA makes A3s a double-edged sword. When overexpressed, A3s can mutate endogenous genomic DNA, resulting in a variety of cancers. Although the sequence context for mutating DNA varies among A3s, the mechanism for substrate sequence specificity is not well understood. To characterize the substrate specificity of A3A, a systematic approach was used to quantify the affinity for substrate as a function of sequence context, length, secondary structure, and solution pH. We identified the A3A ssDNA binding motif as (T/C)TC(A/G), which correlated with enzymatic activity. We also validated that A3A binds RNA in a sequence-specific manner. A3A bound more tightly to the substrate binding motif within a hairpin loop than to a linear oligonucleotide, suggesting A3A affinity is modulated by substrate structure. Based on these findings and previously published A3A-ssDNA co-crystal structures, we propose a new model with intra-DNA interactions for the molecular mechanism underlying A3A sequence preference. Overall, the sequence and structural preferences identified for A3A lead to a new paradigm for identifying A3A's involvement in mutation of endogenous or exogenous DNA.
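The binding motif reported in this abstract, (T/C)TC(A/G), maps directly onto a character-class regular expression. The sketch below shows one way to list every occurrence in a DNA string, including overlapping ones; the function name `scan_a3a_motif` and the example sequence are assumptions made for illustration, and a sequence match alone does not capture the structural (hairpin) preference described above.

```python
import re

# (T/C)TC(A/G) written as a regex. The lookahead makes the scan report
# overlapping occurrences instead of skipping past each match.
A3A_MOTIF = re.compile(r"(?=([TC]TC[AG]))")


def scan_a3a_motif(sequence: str):
    """Return (1-based start, matched tetranucleotide) for every occurrence."""
    sequence = sequence.upper()
    return [(m.start() + 1, m.group(1)) for m in A3A_MOTIF.finditer(sequence)]


if __name__ == "__main__":
    # Invented sequence, used only to exercise the scan.
    print(scan_a3a_motif("GATTCACCTCGTTCT"))
```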
Mechanism for CARMIL Protein Inhibition of Heterodimeric Actin-capping Protein*
Kim, Taekyung; Ravilious, Geoffrey E.; Sept, David; Cooper, John A.
2012-01-01
Capping protein (CP) controls the polymerization of actin filaments by capping their barbed ends. In lamellipodia, CP dissociates from the actin cytoskeleton rapidly, suggesting the possible existence of an uncapping factor, for which the protein CARMIL (capping protein, Arp2/3 and myosin-I linker) is a candidate. CARMIL binds to CP via two motifs. One, the CP interaction (CPI) motif, is found in a number of unrelated proteins; the other motif is unique to CARMILs, the CARMIL-specific interaction motif. A 115-aa fragment of CARMIL with both motifs, termed the CP-binding region (CBR), binds to CP with high affinity, inhibits capping, and causes uncapping. We wanted to understand the structural basis for this function. We used a collection of mutants affecting the actin-binding surface of CP to test the possibility of a steric-blocking model, which remained open because a region of CBR was not resolved in the CBR/CP co-crystal structure. The CP actin-binding mutants bound CBR normally. In addition, a CBR mutant with all residues of the unresolved region changed showed nearly normal binding to CP. Having ruled out a steric-blocking model, we tested an allosteric model with molecular dynamics. We found that CBR binding induces changes in the conformation of the actin-binding surface of CP. In addition, ∼30-aa truncations on the actin-binding surface of CP decreased the affinity of CBR for CP. Thus, CARMIL promotes uncapping by binding to a freely accessible site on CP bound to a filament barbed end and inducing a change in the conformation of the actin-binding surface of CP. PMID:22411988
Brendolise, Cyril; Espley, Richard V.; Lin-Wang, Kui; Laing, William; Peng, Yongyan; McGhie, Tony; Dejnoprat, Supinya; Tomes, Sumathi; Hellens, Roger P.; Allan, Andrew C.
2017-01-01
In apple, the MYB transcription factor MYB10 controls the accumulation of anthocyanins. MYB10 is able to auto-activate its expression by binding its own promoter at a specific motif, the R1 motif. In some apple accessions a natural mutation, termed R6, has more copies of this motif within the MYB10 promoter, resulting in stronger auto-activation and elevated anthocyanins. Here we show that other anthocyanin-related MYBs selected from apple, pear, strawberry, petunia, kiwifruit and Arabidopsis are able to activate promoters containing the R6 motif. To examine the specificity of this motif, members of the R2R3 MYB family were screened against a promoter harboring the R6 mutation. Only MYBs from subgroups 5 and 6 activate expression by binding the R6 motif, with these MYBs sharing conserved residues in their R2R3 DNA binding domains. Insertion of the apple R6 motif into orthologous promoters of MYB10 in pear (PcMYB10) and Arabidopsis (AtMYB75) elevated anthocyanin levels. Introduction of the R6 motif into the promoter region of an anthocyanin biosynthetic enzyme, F3′5′H of kiwifruit, imparts regulation by MYB10. This results in elevated levels of delphinidin in both tobacco and kiwifruit. Finally, an R6 motif inserted into the promoter of the vitamin C biosynthesis gene GDP-L-Gal phosphorylase increases vitamin C content in a MYB10-dependent manner. This motif therefore provides a tool to re-engineer novel MYB-regulated responses in plants. PMID:29163590
A naturally occurring, noncanonical GTP aptamer made of simple tandem repeats
Curtis, Edward A; Liu, David R
2014-01-01
Recently, we used in vitro selection to identify a new class of naturally occurring GTP aptamer called the G motif. Here we report the discovery and characterization of a second class of naturally occurring GTP aptamer, the “CA motif.” The primary sequence of this aptamer is unusual in that it consists entirely of tandem repeats of CA-rich motifs as short as three nucleotides. Several active variants of the CA motif aptamer lack the ability to form consecutive Watson-Crick base pairs in any register, while others consist of repeats containing only cytidine and adenosine residues, indicating that noncanonical interactions play important roles in its structure. The circular dichroism spectrum of the CA motif aptamer is distinct from that of A-form RNA and other major classes of nucleic acid structures. Bioinformatic searches indicate that the CA motif is absent from most archaeal and bacterial genomes, but occurs in at least 70 percent of approximately 400 eukaryotic genomes examined. These searches also uncovered several phylogenetically conserved examples of the CA motif in rodent (mouse and rat) genomes. Together, these results reveal the existence of a second class of naturally occurring GTP aptamer whose sequence requirements, like that of the G motif, are not consistent with those of a canonical secondary structure. They also indicate a new and unexpected potential biochemical activity of certain naturally occurring tandem repeats. PMID:24824832
Wang, Jichao; Zhang, Tongchuan; Liu, Ruicun; Song, Meilin; Wang, Juncheng; Hong, Jiong; Chen, Quan; Liu, Haiyan
2017-02-01
An interesting way of generating novel artificial proteins is to combine sequence motifs from natural proteins, mimicking the evolutionary path suggested by natural proteins comprising recurring motifs. We analyzed the βα and αβ modules of TIM barrel proteins by structure alignment-based sequence clustering. A number of preferred motifs were identified. A chimeric TIM was designed by using recurring elements as mutually compatible interfaces. The foldability of the designed TIM protein was then significantly improved by six rounds of directed evolution. The melting temperature has been improved by more than 20°C. A variety of characteristics suggested that the resulting protein is well-folded. Our analysis provided a library of peptide motifs that is potentially useful for different protein engineering studies. The protein engineering strategy of using recurring motifs as interfaces to connect partial natural proteins may be applied to other protein folds. Copyright © 2016 Elsevier B.V. All rights reserved.
Willwand, Kurt; Moroianu, Adela; Hörlein, Rita; Stremmel, Wolfgang; Rommelaere, Jean
2002-07-01
The linear single-stranded DNA genome of minute virus of mice (MVM) is replicated via a double-stranded replicative form (RF) intermediate DNA. Amplification of viral RF DNA requires the structural transition of the right-end palindrome from a linear duplex into a double-hairpin structure, which serves for the repriming of unidirectional DNA synthesis. This conformational transition was found previously to be induced by the MVM nonstructural protein NS1. Elimination of the cognate NS1-binding sites, [ACCA](2), from the central region of the right-end palindrome next to the axis of symmetry was shown to markedly reduce the efficiency of hairpin-primed DNA replication, as measured in a reconstituted in vitro replication system. Thus, [ACCA](2) sequence motifs are essential as NS1-binding elements in the context of the structural transition of the right-end MVM palindrome.
Li, Tong; Johansson, Ingegerd; Hay, Donald I.; Strömberg, Nicklas
1999-01-01
Oral strains of Actinomyces spp. express type 1 fimbriae, which are composed of major FimP subunits, and bind preferentially to salivary acidic proline-rich proteins (APRPs) or to statherin. We have mapped genetic differences in the fimP subunit genes and the peptide recognition motifs within the host proteins associated with these differential binding specificities. The fimP genes were amplified by PCR from Actinomyces viscosus ATCC 19246, with preferential binding to statherin, and from Actinomyces naeslundii LY7, P-1-K, and B-1-K, with preferential binding to APRPs. The fimP gene from the statherin-binding strain 19246 is novel and has about 80% nucleotide and amino acid sequence identity to the highly conserved fimP genes of the APRP-binding strains (about 98 to 99% sequence identity). The novel FimP protein contains an amino-terminal signal peptide, randomly distributed single-amino-acid substitutions, and structurally different segments and ends with a cell wall-anchoring and a membrane-spanning region. When agarose beads with CNBr-linked host determinant-specific decapeptides were used, A. viscosus 19246 bound to the Thr42Phe43 terminus of statherin and A. naeslundii LY7 bound to the Pro149Gln150 termini of APRPs. Furthermore, while the APRP-binding A. naeslundii strains originate from the human mouth, A. viscosus strains isolated from the oral cavity of rat and hamster hosts showed preferential binding to statherin and contained the novel fimP gene. Thus, A. viscosus and A. naeslundii display structurally variant fimP genes whose protein products are likely to interact with different peptide motifs and to determine animal host tropism. PMID:10225854
Di, Chao; Yuan, Jiapei; Wu, Yue; Li, Jingrui; Lin, Huixin; Hu, Long; Zhang, Ting; Qi, Yijun; Gerstein, Mark B; Guo, Yan; Lu, Zhi John
2014-12-01
Recently, in addition to poly(A)+ long non-coding RNAs (lncRNAs), many lncRNAs without poly(A) tails have been characterized in mammals. However, the non-polyA lncRNAs and their conserved motifs, especially those associated with environmental stresses, have not been fully investigated in plant genomes. We performed poly(A)- RNA-seq for seedlings of Arabidopsis thaliana under four stress conditions, and predicted lncRNA transcripts. We classified the lncRNAs into three confidence levels according to their expression patterns, epigenetic signatures and RNA secondary structures. Then, we further classified the lncRNAs into poly(A)+ and poly(A)- transcripts. Compared with poly(A)+ lncRNAs and coding genes, we found that poly(A)- lncRNAs tend to have shorter transcripts and lower expression levels, and they show significant expression specificity in response to stresses. In addition, their differential expression is significantly enriched in the drought condition and depleted in the heat condition. Overall, we identified 245 poly(A)+ and 58 poly(A)- lncRNAs that are differentially expressed under various stress stimuli. The differential expression was validated by qRT-PCR, and the signaling pathways involved were supported by specific binding of transcription factors (TFs), phytochrome-interacting factor 4 (PIF4) and PIF5. Moreover, we found many conserved sequence and structural motifs of lncRNAs from different functional groups (e.g. a UUC motif responding to salt and an AU-rich stem-loop responding to cold), indicating that the conserved elements might be responsible for the stress-responsive functions of lncRNAs. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Bonsor, Daniel A.; Pham, Kieu T.; Beadenkopf, Robert; Diederichs, Kay; Haas, Rainer; Beckett, Dorothy; Fischer, Wolfgang; Sundberg, Eric J.
2015-01-01
Arginine-aspartate-glycine (RGD) motifs are recognized by integrins to bridge cells to one another and the extracellular matrix. RGD motifs typically reside in exposed loop conformations. X-ray crystal structures of the Helicobacter pylori protein CagL revealed that RGD motifs can also exist in helical regions of proteins. Interactions between CagL and host gastric epithelial cell via integrins are required for the translocation of the bacterial oncoprotein CagA. Here, we have investigated the molecular basis of the CagL-host cell interactions using structural, biophysical, and functional analyses. We solved an x-ray crystal structure of CagL that revealed conformational changes induced by low pH not present in previous structures. Using analytical ultracentrifugation, we found that pH-induced conformational changes in CagL occur in solution and not just in the crystalline environment. By designing numerous CagL mutants based on all available crystal structures, we probed the functional roles of CagL conformational changes on cell surface integrin engagement. Together, our data indicate that the helical RGD motif in CagL is buried by a neighboring helix at low pH to inhibit CagL binding to integrin, whereas at neutral pH the neighboring helix is displaced to allow integrin access to the CagL RGD motif. This novel molecular mechanism of regulating integrin-RGD motif interactions by changes in the chemical environment provides new insight to H. pylori-mediated oncogenesis. PMID:25837254
Mapping Hfq-RNA interaction surfaces using tryptophan fluorescence quenching
Robinson, Kirsten E.; Orans, Jillian; Kovach, Alexander R.; Link, Todd M.; Brennan, Richard G.
2014-01-01
Hfq is a posttranscriptional riboregulator and RNA chaperone that binds small RNAs and target mRNAs to effect their annealing and message-specific regulation in response to environmental stressors. Structures of Hfq-RNA complexes indicate that U-rich sequences prefer the proximal face and A-rich sequences the distal face; however, the Hfq-binding sites of most RNAs are unknown. Here, we present an Hfq-RNA mapping approach that uses single tryptophan-substituted Hfq proteins, all of which retain the wild-type Hfq structure, and tryptophan fluorescence quenching (TFQ) by proximal RNA binding. TFQ properly identified the respective distal and proximal binding of A15 and U6 RNA to Gram-negative Escherichia coli (Ec) Hfq and the distal face binding of (AA)3A, (AU)3A and (AC)3A to Gram-positive Staphylococcus aureus (Sa) Hfq. The inability of (GU)3G to bind the distal face of Sa Hfq reveals the (R-L)n binding motif is a more restrictive (A-L)n binding motif. Remarkably Hfq from Gram-positive Listeria monocytogenes (Lm) binds (GU)3G on its proximal face. TFQ experiments also revealed the Ec Hfq (A-R-N)n distal face-binding motif should be redefined as an (A-A-N)n binding motif. TFQ data also demonstrated that the 5′-untranslated region of hfq mRNA binds both the proximal and distal faces of Ec Hfq and the unstructured C-terminus. PMID:24288369
Soumya, Neelagiri; Kumar, I Sravan; Shivaprasad, S; Gorakh, Landage Nitin; Dinesh, Neeradi; Swamy, Kayala Kambagiri; Singh, Sushma
2015-04-01
An adenosine monophosphate-forming acetyl CoA synthetase (AceCS), the key enzyme involved in the conversion of acetate to acetyl CoA, has been identified from Leishmania donovani for the first time. Sequence analysis of L. donovani AceCS (LdAceCS) revealed the presence of a 'PX4GK' motif that is highly conserved across organisms, from those with high sequence identity (96%) to those with low sequence identity (38%). A ∼77 kDa heterologous protein with a C-terminal 6X His-tag was expressed in Escherichia coli. Expression of LdAceCS in promastigotes was confirmed by western blot and RT-PCR analysis. Immunolocalization studies revealed that it is a cytosolic protein. We also report the kinetic characterization of recombinant LdAceCS with acetate, adenosine 5'-triphosphate, coenzyme A and propionate as substrates. Site-directed mutagenesis of residues in the conserved PX4GK motif of LdAceCS was performed to gain insight into its potential role in substrate binding and catalysis and in maintaining the structural integrity of the protein. The P646A, G651A and K652R mutants exhibited more than 90% loss in activity, signifying the indispensable role of these residues in enzyme activity. Substitution of other residues in this motif resulted in altered substrate specificity and catalysis. However, none of the substitutions except G651A affected the secondary structure of the protein. Copyright © 2015 Elsevier B.V. All rights reserved.
Liseron-Monfils, Christophe; Lewis, Tim; Ashlock, Daniel; McNicholas, Paul D; Fauteux, François; Strömvik, Martina; Raizada, Manish N
2013-03-15
The discovery of genetic networks and cis-acting DNA motifs underlying their regulation is a major objective of transcriptome studies. The recent release of the maize genome (Zea mays L.) has facilitated in silico searches for regulatory motifs. Several algorithms exist to predict cis-acting elements, but none have been adapted for maize. A benchmark data set was used to evaluate the accuracy of three motif discovery programs: BioProspector, Weeder and MEME. Analysis showed that each motif discovery tool had limited accuracy and appeared to retrieve a distinct set of motifs. Therefore, using the benchmark, statistical filters were optimized to reduce the false discovery ratio, and then remaining motifs from all programs were combined to improve motif prediction. These principles were integrated into a user-friendly pipeline for motif discovery in maize called Promzea, available at http://www.promzea.org and on the Discovery Environment of the iPlant Collaborative website. Promzea was subsequently expanded to include rice and Arabidopsis. Within Promzea, a user enters cDNA sequences or gene IDs; corresponding upstream sequences are retrieved from the maize genome. Predicted motifs are filtered, combined and ranked. Promzea searches the chosen plant genome for genes containing each candidate motif, providing the user with the gene list and corresponding gene annotations. Promzea was validated in silico using a benchmark data set: the Promzea pipeline showed a 22% increase in nucleotide sensitivity compared to the best standalone program tool, Weeder, with equivalent nucleotide specificity. Promzea was also validated by its ability to retrieve the experimentally defined binding sites of transcription factors that regulate the maize anthocyanin and phlobaphene biosynthetic pathways. 
Promzea predicted additional promoter motifs, and genome-wide motif searches by Promzea identified 127 non-anthocyanin/phlobaphene genes that each contained all five predicted promoter motifs in their promoters, perhaps uncovering a broader co-regulated gene network. Promzea was also tested against tissue-specific microarray data from maize. An online tool customized for promoter motif discovery in plants has been generated called Promzea. Promzea was validated in silico by its ability to retrieve benchmark motifs and experimentally defined motifs and was tested using tissue-specific microarray data. Promzea predicted broader networks of gene regulation associated with the historic anthocyanin and phlobaphene biosynthetic pathways. Promzea is a new bioinformatics tool for understanding transcriptional gene regulation in maize and has been expanded to include rice and Arabidopsis.
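The benchmark comparison above is reported in terms of nucleotide sensitivity and nucleotide specificity. The abstract does not spell out the formulas, so the sketch below assumes the common nucleotide-level definitions (sensitivity = TP / (TP + FN), specificity = TN / (TN + FP), counted per position); the function name `nucleotide_metrics` and the coordinates are hypothetical and are not taken from Promzea.

```python
def nucleotide_metrics(seq_length, known_sites, predicted_sites):
    """Compute per-nucleotide sensitivity and specificity.

    known_sites and predicted_sites are iterables of (start, end) pairs,
    1-based and inclusive, on a sequence of seq_length nucleotides.
    """
    def covered(sites):
        positions = set()
        for start, end in sites:
            positions.update(range(start, end + 1))
        return positions

    known = covered(known_sites)
    predicted = covered(predicted_sites)

    tp = len(known & predicted)       # known positions that were predicted
    fn = len(known - predicted)       # known positions that were missed
    fp = len(predicted - known)       # predicted positions outside known sites
    tn = seq_length - tp - fn - fp    # everything else

    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return sensitivity, specificity


if __name__ == "__main__":
    # Toy example: a 20-nt promoter, one known site and one shifted prediction.
    print(nucleotide_metrics(20, [(5, 10)], [(8, 13)]))
```

Counting per nucleotide rather than per site rewards partially overlapping predictions, which is one reason combining the outputs of several motif finders can raise sensitivity without necessarily changing specificity.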
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jacewicz, Agata; Schwer, Beate; Smith, Paul; ...
2014-10-10
Yeast Prp28 is a DEAD-box pre-mRNA splicing factor implicated in displacing U1 snRNP from the 5' splice site. Here we report that the 588-aa Prp28 protein consists of a trypsin-sensitive 126-aa N-terminal segment (of which aa 1–89 are dispensable for Prp28 function in vivo) fused to a trypsin-resistant C-terminal catalytic domain. Purified recombinant Prp28 and Prp28-(127–588) have an intrinsic RNA-dependent ATPase activity, albeit with a low turnover number. The crystal structure of Prp28-(127–588) comprises two RecA-like domains splayed widely apart. AMPPNP•Mg2+ is engaged by the proximal domain, with proper and specific contacts from Phe194 and Gln201 (Q motif) to the adenine nucleobase. The triphosphate moiety of AMPPNP•Mg2+ is not poised for catalysis in the open domain conformation. Guided by the Prp28•AMPPNP structure, and that of the Drosophila Vasa•AMPPNP•Mg2+•RNA complex, we targeted 20 positions in Prp28 for alanine scanning. ATP-site components Asp341 and Glu342 (motif II) and Arg527 and Arg530 (motif VI) and RNA-site constituent Arg476 (motif Va) are essential for Prp28 activity in vivo. Synthetic lethality of double-alanine mutations highlighted functionally redundant contacts in the ATP-binding (Phe194-Gln201, Gln201-Asp502) and RNA-binding (Arg264-Arg320) sites. As a result, overexpression of defective ATP-site mutants, but not defective RNA-site mutants, elicited severe dominant-negative growth defects.
Conserved and divergent features of the structure and function of La and La-related proteins (LARPs)
Bayfield, Mark A.; Yang, Ruiqing; Maraia, Richard J.
2010-01-01
Genuine La proteins contain two RNA binding motifs, a La motif (LAM) followed by a RNA recognition motif (RRM), arranged in a unique way to bind RNA. These proteins interact with an extensive variety of cellular RNAs and exhibit activities in two broad categories: i) to promote the metabolism of nascent pol III transcripts, including precursor-tRNAs, by binding to their common, UUU-3’OH containing ends, and ii) to modulate the translation of certain mRNAs involving an unknown binding mechanism. Characterization of several La-RNA crystal structures as well as biochemical studies reveal insight into their unique two-motif domain architecture and how the LAM recognizes UUU-3’OH while the RRM binds other parts of a pre-tRNA. Recent studies of members of distinct families of conserved La-related proteins (LARPs) indicate that some of these harbor activity related to genuine La proteins, suggesting that their UUU-3’OH binding mode has been appropriated for the assembly and regulation of a specific snRNP (e.g., 7SK snRNA assembly by hLARP7/PIP7S). Analyses of other LARP family members (i.e., hLARP4, hLARP6) suggest more diverged RNA binding modes and specialization for cytoplasmic mRNA-related functions. Thus it appears that while genuine La proteins exhibit broad general involvement in both snRNA-related and mRNA-related functions, different LARP families may have evolved specialized activities in either snRNA or mRNA related functions. In this review, we summarize recent progress that has led to greater understanding of the structure and function of La proteins and their roles in tRNA processing and RNP assembly dynamics, as well as progress on the different LARPs. PMID:20138158
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ivanova, Marina E.; Fletcher, Georgina C.; O’Reilly, Nicola
2015-03-01
This study characterizes the interaction between the carboxy-terminal (ERLI) motif of the essential polarity protein Crb and the Pals1/Stardust PDZ-domain protein. Structures of human Pals1 PDZ with and without a Crb peptide are described, explaining the highly conserved nature of the ERLI motif and revealing a sterically blocked peptide-binding groove in the absence of ligand. Many components of epithelial polarity protein complexes possess PDZ domains that are required for protein interaction and recruitment to the apical plasma membrane. Apical localization of the Crumbs (Crb) transmembrane protein requires a PDZ-mediated interaction with Pals1 (protein-associated with Lin7, Stardust, MPP5), a member of the p55 family of membrane-associated guanylate kinases (MAGUKs). This study describes the molecular interaction between the Crb carboxy-terminal motif (ERLI), which is required for Drosophila cell polarity, and the Pals1 PDZ domain using crystallography and fluorescence polarization. Only the last four Crb residues contribute to Pals1 PDZ-domain binding affinity, with specificity contributed by conserved charged interactions. Comparison of the Crb-bound Pals1 PDZ structure with an apo Pals1 structure reveals a key Phe side chain that gates access to the PDZ peptide-binding groove. Removal of this side chain enhances the binding affinity by more than fivefold, suggesting that access of Crb to Pals1 may be regulated by intradomain contacts or by protein–protein interaction.
Structural and biochemical analysis of Bcl-2 interaction with the hepatitis B virus protein HBx.
Jiang, Tianyu; Liu, Minhao; Wu, Jianping; Shi, Yigong
2016-02-23
HBx is a hepatitis B virus protein that is required for viral infectivity and replication. Anti-apoptotic Bcl-2 family members are thought to be among the important host targets of HBx. However, the structure and function of HBx are poorly understood and the molecular mechanism of HBx-induced carcinogenesis remains unknown. In this study, we report biochemical and structural characterization of HBx. The recombinant HBx protein contains metal ions, in particular iron and zinc. A BH3-like motif in HBx (residues 110-135) binds Bcl-2 with a dissociation constant of ∼193 μM, an affinity drastically lower than that of a canonical BH3 motif from Bim or Bad. Structural analysis reveals that, similar to other BH3 motifs, the BH3-like motif of HBx adopts an amphipathic α-helix and binds the conserved BH3-binding groove on Bcl-2. Unlike the helical Bim or Bad BH3 motif, the C-terminal portion of the bound HBx BH3-like motif has an extended conformation and makes considerably fewer interactions with Bcl-2. These observations suggest that HBx may modulate Bcl-2 function in a way that is different from that of the classical BH3-only proteins.
Combined sequence and structure analysis of the fungal laccase family.
Kumar, S V Suresh; Phale, Prashant S; Durani, S; Wangikar, Pramod P
2003-08-20
Plant and fungal laccases belong to the family of multi-copper oxidases and show much broader substrate specificity than other members of the family. Laccases have consequently been of interest for potential industrial applications. We have analyzed the essential sequence features of fungal laccases based on multiple sequence alignments of more than 100 laccases. This has resulted in identification of a set of four ungapped sequence regions, L1-L4, as the overall signature sequences that can be used to identify the laccases, distinguishing them within the broader class of multi-copper oxidases. The 12 amino acid residues in the enzymes serving as the copper ligands are housed within these four identified conserved regions, of which L2 and L4 conform to the earlier reported copper signature sequences of multi-copper oxidases while L1 and L3 are distinctive to the laccases. The mapping of regions L1-L4 onto the three-dimensional structure of the Coprinus cinereus laccase indicates that many of the non-copper-ligating residues of the conserved regions could be critical in maintaining a specific, more or less C-2 symmetric, protein conformational motif characterizing the active site apparatus of the enzymes. The observed intraprotein homologies between L1 and L3 and between L2 and L4 at both the structure and the sequence levels suggest that the quasi C-2 symmetric active site conformational motif may have arisen from a structural duplication event that neither the sequence homology analysis nor the structure homology analysis alone would have unraveled. Although the sequence and structure homology is not detectable in the rest of the protein, the relative orientation of region L1 with L2 is similar to that of L3 with L4.
The structure duplication of first-shell and second-shell residues has become cryptic because the intraprotein sequence homology noticeable for a given laccase becomes significant only after comparing the conservation pattern in several fungal laccases. The identified motifs, L1-L4, can be useful in searching the newly sequenced genomes for putative laccase enzymes. Copyright 2003 Wiley Periodicals, Inc. Biotechnol Bioeng 83: 386-394, 2003.
Sohn, Woon Yong; Habka, Sana; Gloaguen, Eric; Mons, Michel
2017-07-14
The presence in crystallized proteins of a local anchoring between the side chain of a His residue, located in the central position of a γ- or β-turn, and its local main chain environment, was assessed by the comparison of protein structures with relevant isolated model peptides. Gas phase laser spectroscopy, combined with relevant quantum chemistry methods, was used to characterize the γ- and β-turn structures in these model peptides. A conformer-selective NH stretch infrared study provided evidence for the formation in vacuo of two types of short-range H-bonded motifs, labelled ε-6 δ and δ- δ 7/π H , bridging the His side chain (in its gauche+ rotamer) to the neighbouring NH(i) and CO(i) sites of the backbone; each side chain-backbone motif was found to be specific of the tautomer (ε or δ) adopted by the His side chain in its neutral form. A close comparison between β- and γ-turns, selected from the Protein Data Bank, and the gas phase models demonstrated that a significant proportion of the gauche+ His rotamer distribution of proteins was well described by the corresponding gas phase H-bonded structures. This is consistent with the persistence of local 6 δ and δ 7/π H intramolecular interactions in proteins, emphasizing the relevance of gas phase data to secondary structures that are poorly accessible to solvents, e.g., in the case of a specific compact topology (Xxx-His β-turns). Deviations from the gas phase structures were also observed, mainly in His-Xxx β-turns, and assigned to solvent accessible turn structures. They were well accounted for by theoretical models of microhydrated turns, in which a few solvent molecules take over the gas phase motifs, constituting a water-mediated local anchoring of the His side chain to the backbone. 
Finally, the present gas phase benchmark models also pinpointed weaknesses in the protein structure determination by X-ray diffraction analysis; in particular, besides the lack of tautomer information, inaccuracies in the description of imidazole ring flip rotamerism were identified.
Measuring Symmetry, Asymmetry and Randomness in Neural Network Connectivity
Esposito, Umberto; Giugliano, Michele; van Rossum, Mark; Vasilaki, Eleni
2014-01-01
Cognitive functions are stored in the connectome, the wiring diagram of the brain, which exhibits non-random features, so-called motifs. In this work, we focus on bidirectional, symmetric motifs, i.e. two neurons that project to each other via connections of equal strength, and unidirectional, non-symmetric motifs, i.e. within a pair of neurons only one neuron projects to the other. We hypothesise that such motifs have been shaped via activity dependent synaptic plasticity processes. As a consequence, learning moves the distribution of the synaptic connections away from randomness. Our aim is to provide a global, macroscopic, single parameter characterisation of the statistical occurrence of bidirectional and unidirectional motifs. To this end we define a symmetry measure that does not require any a priori thresholding of the weights or knowledge of their maximal value. We calculate its mean and variance for random uniform or Gaussian distributions, which allows us to introduce a confidence measure of how significantly symmetric or asymmetric a specific configuration is, i.e. how likely it is that the configuration is the result of chance. We demonstrate the discriminatory power of our symmetry measure by inspecting the eigenvalues of different types of connectivity matrices. We show that a Gaussian weight distribution biases the connectivity motifs to more symmetric configurations than a uniform distribution and that introducing a random synaptic pruning, mimicking developmental regulation in synaptogenesis, biases the connectivity motifs to more asymmetric configurations, regardless of the distribution. We expect that our work will benefit the computational modelling community, by providing a systematic way to characterise symmetry and asymmetry in network structures. Further, our symmetry measure will be of use to electrophysiologists that investigate symmetry of network connectivity. PMID:25006663
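As a rough illustration of a threshold-free symmetry measure of the kind described in this abstract, the sketch below scores each neuron pair by the normalised difference of its two reciprocal weights and averages over connected pairs. The function name `symmetry_index` and the exact formula are assumptions made for illustration; they are not taken from the paper, whose definition may differ.

```python
from itertools import combinations

def symmetry_index(w):
    """Mean pairwise symmetry of a non-negative weight matrix w (list of lists).

    Each pair (i, j) contributes 1 - |w_ij - w_ji| / (w_ij + w_ji):
    1 for an equal-strength bidirectional link, 0 for a purely unidirectional one.
    Pairs with no connection in either direction are skipped.
    (Illustrative formula, assumed here; not the published measure.)
    """
    scores = []
    for i, j in combinations(range(len(w)), 2):
        total = w[i][j] + w[j][i]
        if total > 0:
            scores.append(1 - abs(w[i][j] - w[j][i]) / total)
    if not scores:
        raise ValueError("no connected pairs")
    return sum(scores) / len(scores)

bidirectional = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
unidirectional = [[0, 1, 1], [0, 0, 1], [0, 0, 0]]
print(symmetry_index(bidirectional))   # 1.0
print(symmetry_index(unidirectional))  # 0.0
```

Because each term is normalised by the pair's own total weight, no global threshold or maximal weight is needed, which is the property the abstract emphasises.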
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chojnowski, Grzegorz; Waleń, Tomasz
2015-03-01
Brickworx is a computer program that builds crystal structure models of nucleic acid molecules using recurrent motifs including double-stranded helices. In a first step, the program searches for electron-density peaks that may correspond to phosphate groups; it may also take into account phosphate-group positions provided by the user. Subsequently, comparing the three-dimensional patterns of the P atoms with a database of nucleic acid fragments, it finds the matching positions of the double-stranded helical motifs (A-RNA or B-DNA) in the unit cell. If the target structure is RNA, the helical fragments are further extended with recurrent RNA motifs from a fragment library that contains single-stranded segments. Finally, the matched motifs are merged and refined in real space to find the most likely conformations, including a fit of the sequence to the electron-density map. The Brickworx program is available for download and as a web server at http://iimcb.genesilico.pl/brickworx.
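The idea of comparing three-dimensional patterns of P atoms independently of their orientation in the unit cell can be sketched with a sorted list of pairwise distances, which does not change under rotation or translation. This is only a crude pre-filter written for illustration; the function names are hypothetical and nothing here is claimed to be how Brickworx itself performs the matching.

```python
from itertools import combinations
from math import dist

def distance_signature(points):
    """Sorted pairwise distances; unchanged by rotation and translation."""
    return sorted(dist(p, q) for p, q in combinations(points, 2))

def signature_rmsd(a, b):
    """Root-mean-square difference between two equally sized signatures."""
    sa, sb = distance_signature(a), distance_signature(b)
    if len(sa) != len(sb):
        raise ValueError("patterns must contain the same number of points")
    return (sum((x - y) ** 2 for x, y in zip(sa, sb)) / len(sa)) ** 0.5

# Made-up coordinates: a pattern and a rotated, shifted copy of it.
pattern = [(0, 0, 0), (1, 0, 0), (0, 2, 0), (0, 0, 3)]
moved = [(-y + 5, x - 1, z + 2) for x, y, z in pattern]
print(signature_rmsd(pattern, moved) < 1e-9)  # True
```

A distance signature cannot distinguish mirror images, so a real search would still need an explicit superposition step for the candidates that pass this filter.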
Development of a Nanotechnology Platform for Prostate Cancer Gene Therapy
2011-07-01
...condense pDNA into nano-size particles (nanocarriers), b) a PC-3 specific targeting motif (TM) to target prostate cancer cells, c) an endosome disrupting motif (EDM) to disrupt...
Structural Elements Regulating AAA+ Protein Quality Control Machines.
Chang, Chiung-Wen; Lee, Sukyeong; Tsai, Francis T F
2017-01-01
Members of the ATPases Associated with various cellular Activities (AAA+) superfamily participate in essential and diverse cellular pathways in all kingdoms of life by harnessing the energy of ATP binding and hydrolysis to drive their biological functions. Although most AAA+ proteins share a ring-shaped architecture, AAA+ proteins have evolved distinct structural elements that are fine-tuned to their specific functions. A central question in the field is how ATP binding and hydrolysis are coupled to substrate translocation through the central channel of ring-forming AAA+ proteins. In this mini-review, we will discuss structural elements present in AAA+ proteins involved in protein quality control, drawing similarities to their known role in substrate interaction by AAA+ proteins involved in DNA translocation. Elements to be discussed include the pore loop-1, the Inter-Subunit Signaling (ISS) motif, and the Pre-Sensor I insert (PS-I) motif. Lastly, we will summarize our current understanding of the inter-relationship of those structural elements and propose a model of how ATP binding and hydrolysis might be coupled to polypeptide translocation in protein quality control machines.
MotifMark: Finding regulatory motifs in DNA sequences.
Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L; Wang, May D
2017-07-01
The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity between proteins and DNA motifs. Despite their success, these technologies have their own limitations and fall short in precise characterization of motifs, and as a result, require further downstream analysis to extract useful and interpretable information from a haystack of noisy and inaccurate data. Here we propose MotifMark, a new algorithm based on graph theory and machine learning, that can find binding sites on candidate probes and rank their specificity in regard to the underlying transcription factor. We developed a pipeline to analyze experimental data derived from compact universal protein binding microarrays and benchmarked it against two of the most accurate motif search methods. Our results indicate that MotifMark can be a viable alternative technique for prediction of motifs from protein binding microarrays and possibly other related high-throughput techniques.
Structural and Functional Investigations of the N-Terminal Ubiquitin Binding Region of Usp25.
Yang, Yuanyuan; Shi, Li; Ding, Yiluan; Shi, Yanhong; Hu, Hong-Yu; Wen, Yi; Zhang, Naixia
2017-05-23
Ubiquitin-specific protease 25 (Usp25) is a deubiquitinase that is involved in multiple biological processes. The N-terminal ubiquitin-binding region (UBR) of Usp25 contains one ubiquitin-associated domain, one small ubiquitin-like modifier (SUMO)-interacting motif and two ubiquitin-interacting motifs. Previous studies suggest that the covalent sumoylation in the UBR of Usp25 impairs its enzymatic activity. Here, we raise the hypothesis that non-covalent binding of SUMO, a prerequisite for efficient sumoylation, will impair Usp25's catalytic activity as well. To test our hypothesis and elucidate the underlying molecular mechanism, we investigated the structure and function of the Usp25 N-terminal UBR. The solution structure of Usp25 1-146 is obtained, and the key residues responsible for recognition of ubiquitin and SUMO2 are identified. Our data suggest inhibition of Usp25's catalytic activity upon the non-covalent binding of SUMO2 to the Usp25 SUMO-interacting motif. We also find that SUMO2 can competitively block the interaction between the Usp25 UBR and its ubiquitin substrates. Based on our findings, we have proposed a working model to depict the regulatory role of the Usp25 UBR in the functional display of the enzyme. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Badrinarayan, Preethi; Sastry, G. Narahari
2014-01-01
The present study examines the conformational transitions occurring among the major structural motifs of Aurora kinase (AK) concomitant with the DFG-flip and deciphers the role of non-covalent interactions in rendering specificity. Multiple sequence alignment, docking and structural analysis of a repertoire of 56 crystal structures of AK from the Protein Data Bank (PDB) have been carried out. The crystal structures were systematically categorized based on the conformational disposition of the DFG-loop [in (DI) 42, out (DO) 5 and out-up (DOU) 9], G-loop [extended (GE) 53 and folded (GF) 3] and αC-helix [in (CI) 42 and out (CO) 14]. The overlapping subsets on categorization show the inter-dependency among structural motifs. Therefore, the four distinct possibilities a) 2W1C (DI, CI, GE) b) 3E5A (DI, CI, GF) c) 3DJ6 (DI, CO, GF) d) 3UNZ (DOU, CO, GF) along with their co-crystals and apo-forms were subjected to molecular dynamics simulations of 40 ns each to evaluate the variations of individual residues and their impact on forming interactions. The non-covalent interactions formed by the 157 AK co-crystals with different regions of the binding site were initially studied with the docked complexes and structure interaction fingerprints. The frequency of the most prominent interactions was gauged in the AK inhibitors from PDB and the four representative conformations during 40 ns. Based on this study, seven major non-covalent interactions and their complementary sites in AK capable of rendering specificity have been prioritized for the design of different classes of inhibitors. PMID:25485544
Aravind, Penmatsa; Wistow, Graeme; Sharma, Yogendra; Sankaranarayanan, Rajan
2008-01-01
βγ-Crystallins belong to a superfamily of proteins in prokaryotes and eukaryotes that are based on duplications of a characteristic, highly conserved Greek Key motif. Most members of the superfamily in vertebrates are structural proteins of the eye lens that contain four motifs arranged as two structural domains. Absent in melanoma-1 (AIM1), an unusual member of the superfamily whose expression is associated with suppression of malignancy in melanoma, contains 12 βγ-crystallin motifs in six domains. Some of these motifs diverge considerably from the canonical motif sequence. AIM1g1, the first βγ-crystallin domain of AIM1, is the most variant of βγ-crystallin domains currently known. In order to understand the limits of sequence variation on the structure, we report the crystal structure of AIM1g1 at 1.9 Å resolution. In spite of having changes in key residues, the domain retains the overall βγ-crystallin fold. The domain also contains an unusual extended surface loop that significantly alters the shape of the domain and its charge profile. This structure illustrates the resilience of the βγ fold to considerable sequence changes and its remarkable ability to adapt for novel functions. PMID:18582473
A Motif in the Clathrin Heavy Chain Required for the Hsc70/Auxilin Uncoating Reaction
Rapoport, Iris; Boll, Werner; Yu, Anan; Böcking, Till
2008-01-01
The 70-kDa heat-shock cognate protein (Hsc70) chaperone is an ATP-dependent “disassembly enzyme” for many subcellular structures, including clathrin-coated vesicles where it functions as an uncoating ATPase. Hsc70 and its cochaperone auxilin together catalyze coat disassembly. Like other members of the Hsp70 chaperone family, it is thought that ATP-bound Hsc70 recognizes the clathrin triskelion through an unfolded exposed hydrophobic segment. The best candidate is the unstructured C terminus (residues 1631–1675) of the heavy chain at the foot of the tripod below the hub, containing the sequence motif QLMLT, closely related to the sequence bound preferentially by the substrate groove of Hsc70 (Fotin et al., 2004b). To test this hypothesis, we generated in insect cells recombinant mammalian triskelions that in vitro form clathrin cages and clathrin/AP-2 coats exactly like those assembled from native clathrin. We show that coats assembled from recombinant clathrin are good substrates for ATP- and auxilin-dependent, Hsc70-catalyzed uncoating. Finally, we show that this uncoating reaction proceeds normally when the coats contain recombinant heavy chains truncated C-terminal to the QLMLT motif, but very inefficiently when the motif is absent. Thus, the QLMLT motif is required for Hsc70-facilitated uncoating, consistent with the proposal that this sequence is a specific target of the chaperone. PMID:17978091
Alfassy, Omri S.; Cohen, Itamar; Reiss, Yuval; Tirosh, Boaz; Ravid, Tommer
2013-01-01
Protein elimination by the ubiquitin-proteasome system requires the presence of a cis-acting degradation signal. Efforts to discern degradation signals of misfolded proteasome substrates thus far revealed a general mechanism whereby the exposure of cryptic hydrophobic motifs provides a degradation determinant. We have previously characterized such a determinant, employing the yeast kinetochore protein Ndc10 as a model substrate. Ndc10 is essentially a stable protein that is rapidly degraded upon exposure of a hydrophobic motif located at the C-terminal region. The degradation motif comprises two distinct and essential elements: DegA, encompassing two amphipathic helices, and DegB, a hydrophobic sequence within the loosely structured C-terminal tail of Ndc10. Here we show that the hydrophobic nature of DegB is irrelevant for the ubiquitylation of substrates containing the Ndc10 degradation motif, but is essential for proteasomal degradation. Mutant DegB, in which the hydrophobic sequence was disrupted, acted as a dominant degradation inhibitory element when expressed at the C-terminal regions of ubiquitin-dependent and -independent substrates of the 26S proteasome. This mutant stabilized substrates in both yeast and mammalian cells, indicative of a modular recognition moiety. The dominant function of the mutant DegB provides a powerful experimental tool for evaluating the physiological implications of stabilization of specific proteasome substrates in intact cells and for studying the associated pathological effects. PMID:23519465
Crystal structure of bacterial cell-surface alginate-binding protein with an M75 peptidase motif
DOE Office of Scientific and Technical Information (OSTI.GOV)
Maruyama, Yukie; Ochiai, Akihito; Mikami, Bunzo
Research highlights: Bacterial alginate-binding Algp7 is similar to component EfeO of Fe²⁺ transporter. We determined the crystal structure of Algp7 with a metal-binding motif. Algp7 consists of two helical bundles formed through duplication of a single bundle. A deep cleft involved in alginate binding locates around the metal-binding site. Algp7 may function as a Fe²⁺-chelated alginate-binding protein. Abstract: A gram-negative Sphingomonas sp. A1 directly incorporates alginate polysaccharide into the cytoplasm via the cell-surface pit and ABC transporter. A cell-surface alginate-binding protein, Algp7, functions as a concentrator of the polysaccharide in the pit. Based on the primary structure and genetic organization in the bacterial genome, Algp7 was found to be homologous to an M75 peptidase motif-containing EfeO, a component of a ferrous ion transporter. Despite the presence of an M75 peptidase motif with high similarity, the Algp7 protein purified from recombinant Escherichia coli cells was inert on insulin B chain and N-benzoyl-Phe-Val-Arg-p-nitroanilide, both of which are substrates for a typical M75 peptidase, imelysin, from Pseudomonas aeruginosa. The X-ray crystallographic structure of Algp7 was determined at 2.10 Å resolution by single-wavelength anomalous diffraction. Although a metal-binding motif, HxxE, conserved in zinc ion-dependent M75 peptidases is also found in Algp7, the crystal structure of Algp7 contains no metal even at the motif. The protein consists of two structurally similar up-and-down helical bundles as the basic scaffold. A deep cleft between the bundles is sufficiently large to accommodate macromolecules such as alginate polysaccharide. This is the first structural report on a bacterial cell-surface alginate-binding protein with an M75 peptidase motif.
Motif formation and industry specific topologies in the Japanese business firm network
NASA Astrophysics Data System (ADS)
Maluck, Julian; Donner, Reik V.; Takayasu, Hideki; Takayasu, Misako
2017-05-01
Motifs and roles are basic quantities for the characterization of interactions among 3-node subsets in complex networks. In this work, we investigate how the distribution of 3-node motifs can be influenced by modifying the rules of an evolving network model while keeping the statistics of simpler network characteristics, such as the link density and the degree distribution, invariant. We exemplify this problem for the special case of the Japanese Business Firm Network, where a well-studied and relatively simple yet realistic evolving network model is available, and compare the resulting motif distribution in the real-world and simulated networks. To better approximate the motif distribution of the real-world network in the model, we introduce both subgraph dependent and global additional rules. We find that a specific rule that allows only for the merging process between nodes with similar link directionality patterns reduces the observed excess of densely connected motifs with bidirectional links. Our study improves the mechanistic understanding of motif formation in evolving network models to better describe the characteristic features of real-world networks with a scale-free topology.
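To make the notion of a 3-node motif distribution concrete, the sketch below counts isomorphism classes of connected three-node subgraphs in a small directed network by brute force. The function names and the toy edge list are assumptions for illustration only; the study's own counting procedure and network model are not reproduced here, and this approach is far too slow for a network of realistic size.

```python
from collections import Counter
from itertools import combinations, permutations

def triad_code(edges, nodes):
    """Canonical label of the directed subgraph induced on three nodes.

    The label is the smallest 6-bit edge pattern over all relabelings of the
    three nodes, so isomorphic subgraphs receive the same code.
    """
    best = None
    for p in permutations(nodes):
        bits = tuple((p[a], p[b]) in edges
                     for a in range(3) for b in range(3) if a != b)
        if best is None or bits < best:
            best = bits
    return best

def motif_counts(edges):
    """Count isomorphism classes of weakly connected 3-node subgraphs."""
    edges = set(edges)
    nodes = sorted({n for e in edges for n in e})
    counts = Counter()
    for trio in combinations(nodes, 3):
        linked_pairs = sum((u, v) in edges or (v, u) in edges
                           for u, v in combinations(trio, 2))
        if linked_pairs >= 2:  # three nodes are connected iff two pairs are linked
            counts[triad_code(edges, trio)] += 1
    return counts

# Toy network (hypothetical): two feed-forward loops with different labels.
ffl_pair = [("a", "b"), ("b", "c"), ("a", "c"),
            ("r", "p"), ("p", "q"), ("r", "q")]
print(list(motif_counts(ffl_pair).values()))  # [2]
```

Comparing such counts between an observed network and simulated networks with the same link density and degree distribution is the kind of comparison the abstract describes.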
NASA Astrophysics Data System (ADS)
Nawaz, Meh Sameen; Vik, Erik Sebastian; Ronander, Mia Elise; Solvoll, Anne Marthe; Blicher, Pernille; Bjørås, Magnar; Alseth, Ingrun; Dalhus, Bjørn
2016-04-01
Endonuclease V (EndoV) is an enzyme with specificity for deaminated adenosine (inosine) in nucleic acids. EndoV from Escherichia coli (EcEndoV) acts both on inosines in DNA and RNA, whereas the human homolog cleaves only at inosines in RNA. Inosines in DNA are mutagenic and the role of EndoV in DNA repair is well established. In contrast, the biological function of EndoV in RNA processing is largely unexplored. Here we have characterized a second mammalian EndoV homolog, mouse EndoV (mEndoV), and show that mEndoV shares the same RNA selectivity as human EndoV (hEndoV). Mouse EndoV cleaves the same inosine-containing substrates as hEndoV, but with reduced efficiencies. The crystal structure of mEndoV reveals a conformation different from the hEndoV and prokaryotic EndoV structures, particularly for the conserved tyrosine in the wedge motif, suggesting that this strand separating element has some flexibility. Molecular dynamics simulations of mouse and human EndoV reveal alternative conformations for the invariant tyrosine. The configuration of the active site, on the other hand, is very similar between the prokaryotic and mammalian versions of EndoV.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bahl, C.; Morisseau, C; Bomberger, J
Cystic fibrosis transmembrane conductance regulator (CFTR) inhibitory factor (Cif) is a virulence factor secreted by Pseudomonas aeruginosa that reduces the quantity of CFTR in the apical membrane of human airway epithelial cells. Initial sequence analysis suggested that Cif is an epoxide hydrolase (EH), but its sequence violates two strictly conserved EH motifs and also is compatible with other α/β hydrolase family members with diverse substrate specificities. To investigate the mechanistic basis of Cif activity, we have determined its structure at 1.8-Å resolution by X-ray crystallography. The catalytic triad consists of residues Asp129, His297, and Glu153, which are conserved across the family of EHs. At other positions, sequence deviations from canonical EH active-site motifs are stereochemically conservative. Furthermore, detailed enzymatic analysis confirms that Cif catalyzes the hydrolysis of epoxide compounds, with specific activity against both epibromohydrin and cis-stilbene oxide, but with a relatively narrow range of substrate selectivity. Although closely related to two other classes of α/β hydrolase in both sequence and structure, Cif does not exhibit activity as either a haloacetate dehalogenase or a haloalkane dehalogenase. A reassessment of the structural and functional consequences of the H269A mutation suggests that Cif's effect on host-cell CFTR expression requires the hydrolysis of an extended endogenous epoxide substrate.
Spontaneous cortical activity alternates between motifs defined by regional axonal projections
Mohajerani, Majid H.; Chan, Allen W.; Mohsenvand, Mostafa; LeDue, Jeffrey; Liu, Rui; McVea, David A.; Boyd, Jamie D.; Wang, Yu Tian; Reimers, Mark; Murphy, Timothy H.
2014-01-01
In lightly anaesthetized or awake adult mice using millisecond timescale voltage sensitive dye imaging, we show that a palette of sensory-evoked and hemisphere-wide activity motifs are represented in spontaneous activity. These motifs can reflect multiple modes of sensory processing including vision, audition, and touch. Similar cortical networks were found with direct cortical activation using channelrhodopsin-2. Regional analysis of activity spread indicated modality specific sources such as primary sensory areas, and a common posterior-medial cortical sink where sensory activity was extinguished within the parietal association area, and a secondary anterior medial sink within the cingulate/secondary motor cortices for visual stimuli. Correlation analysis between functional circuits and intracortical axonal projections indicated a common framework corresponding to long-range mono-synaptic connections between cortical regions. Maps of intracortical mono-synaptic structural connections predicted hemisphere-wide patterns of spontaneous and sensory-evoked depolarization. We suggest that an intracortical monosynaptic connectome shapes the ebb and flow of spontaneous cortical activity. PMID:23974708
Huang, Ying; Bayfield, Mark A; Intine, Robert V; Maraia, Richard J
2006-07-01
By sequence-specific binding to 3' UUU-OH, the La protein shields precursor (pre)-RNAs from 3' end digestion and is required to protect defective pre-transfer RNAs from decay. Although La is comprised of a La motif and an RNA-recognition motif (RRM), a recent structure indicates that the RRM beta-sheet surface is not involved in UUU-OH recognition, raising questions as to its function. Progressively defective suppressor tRNAs in Schizosaccharomyces pombe reveal differential sensitivities to La and Rrp6p, a 3' exonuclease component of pre-tRNA decay. 3' end protection is compromised by mutations to the La motif but not the RRM surface. The most defective pre-tRNAs require a second activity of La, in addition to 3' protection, that requires an intact RRM surface. The two activities of La in tRNA maturation map to its two conserved RNA-binding surfaces and suggest a modular model that has implications for its other ligands.
Exploration of tetrahedral structures in silicate cathodes using a motif-network scheme
Zhao, Xin; Wu, Shunqing; Lv, Xiaobao; Nguyen, Manh Cuong; Wang, Cai-Zhuang; Lin, Zijing; Zhu, Zi-Zhong; Ho, Kai-Ming
2015-01-01
Using a motif-network search scheme, we studied the tetrahedral structures of the dilithium/disodium transition metal orthosilicates A2MSiO4 with A = Li or Na and M = Mn, Fe or Co. In addition to finding all previously reported structures, we discovered many other different tetrahedral-network-based crystal structures which are highly degenerate in energy. These structures can be classified into structures with 1D, 2D and 3D M-Si-O frameworks. A clear trend of the structural preference in different systems was revealed and possible indicators that affect the structure stabilities were introduced. For the case of Na systems which have been much less investigated in the literature relative to the Li systems, we predicted their ground state structures and found evidence for the existence of new structural motifs. PMID:26497381
A computational proposal for designing structured RNA pools for in vitro selection of RNAs.
Kim, Namhee; Gan, Hin Hark; Schlick, Tamar
2007-04-01
Although in vitro selection technology is a versatile experimental tool for discovering novel synthetic RNA molecules, finding complex RNA molecules is difficult because most RNAs identified from random sequence pools are simple motifs, consistent with recent computational analysis of such sequence pools. Thus, enriching in vitro selection pools with complex structures could increase the probability of discovering novel RNAs. Here we develop an approach for engineering sequence pools that links RNA sequence space regions with corresponding structural distributions via a "mixing matrix" approach combined with a graph theory analysis. We define five classes of mixing matrices motivated by covariance mutations in RNA; these constructs define nucleotide transition rates and are applied to chosen starting sequences to yield specific nonrandom pools. We examine the coverage of sequence space as a function of the mixing matrix and starting sequence via clustering analysis. We show that, in contrast to random sequences, which are associated only with a local region of sequence space, our designed pools, including a structured pool for GTP aptamers, can target specific motifs. It follows that experimental synthesis of designed pools can benefit from using optimized starting sequences, mixing matrices, and pool fractions associated with each of our constructed pools as a guide. Automation of our approach could provide practical tools for pool design applications for in vitro selection of RNAs and related problems.
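The role of a mixing matrix as a set of nucleotide transition rates applied to a starting sequence can be sketched as follows. The function `make_pool`, the 0.7/0.1 matrix and the starting sequence `GGAUCC` are hypothetical values chosen for illustration; they are not the five matrix classes or the starting sequences defined in the paper, and treating each position independently is an assumption of this sketch.

```python
import random

NUCLEOTIDES = "ACGU"

def make_pool(start, mixing, size, seed=0):
    """Sample `size` sequences by mutating each position of `start` independently.

    mixing[x][y] is the probability that nucleotide x in the starting
    sequence is replaced by nucleotide y; every row must sum to 1.
    """
    for x in NUCLEOTIDES:
        if abs(sum(mixing[x].values()) - 1.0) > 1e-9:
            raise ValueError(f"row {x} does not sum to 1")
    rng = random.Random(seed)
    pool = []
    for _ in range(size):
        seq = "".join(
            rng.choices(NUCLEOTIDES,
                        weights=[mixing[x][y] for y in NUCLEOTIDES])[0]
            for x in start
        )
        pool.append(seq)
    return pool

# Hypothetical matrix: keep each nucleotide with probability 0.7,
# otherwise switch to each of the other three with probability 0.1.
mixing = {x: {y: 0.7 if x == y else 0.1 for y in NUCLEOTIDES}
          for x in NUCLEOTIDES}
pool = make_pool("GGAUCC", mixing, size=5)
print(len(pool), len(pool[0]))  # 5 6
```

An identity matrix reproduces the starting sequence exactly, while a uniform matrix (all entries 0.25) gives a fully random pool; designed pools lie between these two extremes.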
Crystal Structure Predictions Using Adaptive Genetic Algorithm and Motif Search methods
NASA Astrophysics Data System (ADS)
Ho, K. M.; Wang, C. Z.; Zhao, X.; Wu, S.; Lyu, X.; Zhu, Z.; Nguyen, M. C.; Umemoto, K.; Wentzcovitch, R. M. M.
2017-12-01
Material informatics is a new initiative which has attracted a lot of attention in recent scientific research. The basic strategy is to construct comprehensive data sets and use machine learning to solve a wide variety of problems in material design and discovery. In pursuit of this goal, a key element is the quality and completeness of the databases used. Recent advances in the development of crystal structure prediction algorithms have made computational prediction a complementary and more efficient approach to exploring the structure/phase space in materials. In this talk, we discuss the importance of the structural motifs and motif-networks in crystal structure predictions. Correspondingly, powerful methods are developed to improve the sampling of the low-energy structure landscape.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kennedy, Zachary C.; Cardenas, Allan Jay P.; Corbey, Jordan F.
2016-01-01
Glutardiamidoxime, a structural motif on sorbents used in uranium extraction from seawater, was discovered to cyclize in situ at room temperature to 2,6-diimino-piperidin-1-ol in the presence of uranyl nitrate. The new diimino motif was also generated when exposed to competing transition metals Cu(II) and Ni(II). Multinuclear μ-O bridged U(VI), Cu(II), and Ni(II) complexes featuring bound diimino ligands were isolated. A Cu(II) complex with the historically relevant cyclic imide dioxime motif is also reported for structural comparison to the reported diimino complexes.
Ballano, Gema; Zanuy, David; Jiménez, Ana I.; Cativiela, Carlos; Nussinov, Ruth; Alemán, Carlos
2009-01-01
Here we study conformational stabilization induced in a β-helical nanostructure by position-specific mutations. The nanostructure is constructed through the self-assembly of the β-helical building block excised from E. coli galactoside acetyltransferase (PDB code 1krr, chain A; residues 131-165). The mutations involve substitutions by cyclic, conformationally constrained amino acids. Specifically, a complete structural analysis of the Pro-Xaa-Val sequence [with Xaa being Gly, Ac3c (1-aminocyclopropane-1-carboxylic acid) and Ac5c (1-aminocyclopentane-1-carboxylic acid)], corresponding to the 148-150 loop region in the wild-type (Gly) and mutated (Ac3c and Ac5c) 1krr, has been performed using Molecular Dynamics simulations and X-ray crystallography. Simulations have been performed for the wild-type and mutants of three different systems, namely the building block, the nanoconstruct and the isolated Pro-Xaa-Val tripeptide. Furthermore, the crystalline structures of five peptides of Pro-Xaa-Val or Xaa-Val sequences have been solved by X-ray diffraction analysis and compared with theoretical predictions. Both the theoretical and crystallographic studies indicate that the Pro-Acnc-Val sequences exhibit a high propensity to adopt turn-like conformations, and this propensity is little affected by the chemical environment. Overall, the results indicate that replacement of Gly149 by Ac3c or Ac5c significantly reduces the conformational flexibility of the target site, enhancing the structural specificity of the building block and the nanoconstruct derived from the 1krr β-helical motif. PMID:18811190
Cui, Yunxi; Kong, Deming; Ghimire, Chiran; Xu, Cuixia; Mao, Hanbin
2016-04-19
G-Quadruplex and i-motif are tetraplex structures that may form in opposite strands at the same location of a duplex DNA. Recent discoveries have indicated that the two tetraplex structures can have conflicting biological activities, which poses a challenge for cells to coordinate. Here, by performing innovative population analysis on mechanical unfolding profiles of tetraplex structures in double-stranded DNA, we found that formations of G-quadruplex and i-motif in the two complementary strands are mutually exclusive in a variety of DNA templates, which include human telomere and promoter fragments of hINS and hTERT genes. To explain this behavior, we placed G-quadruplex- and i-motif-hosting sequences in an offset fashion in the two complementary telomeric DNA strands. We found simultaneous formation of the G-quadruplex and i-motif in opposite strands, suggesting that mutual exclusivity between the two tetraplexes is controlled by steric hindrance. This conclusion was corroborated in the BCL-2 promoter sequence, in which simultaneous formation of two tetraplexes was observed due to possible offset arrangements between G-quadruplex and i-motif in opposite strands. The mutual exclusivity revealed here sets a molecular basis for cells to efficiently coordinate opposite biological activities of G-quadruplex and i-motif at the same dsDNA location.
Jaspard, Emmanuel
2006-01-01
Background: There are three isoforms of glutamate dehydrogenase. The isoform EC 1.4.1.4 (GDH4) catalyses glutamate synthesis from 2-oxoglutarate and ammonium, using NAD(P)H. Ammonium assimilation is critical for plant growth. Although GDH4 from animals and prokaryotes are well characterized, there are few data concerning plant GDH4, even from those whose genomes are well annotated. Results: A large set of the three GDH isoforms was built, resulting in 116 non-redundant full polypeptide sequences. A computational analysis was made to gain more information concerning the structure-function relationship of GDH4 from plants (Eukaryota, Viridiplantae). The tested plant GDH4 sequences were the two known to date, those of Chlorella sorokiniana. This analysis revealed several structural features specific to plant GDH4: (i) the lack of a structure called "antenna"; (ii) the NAD(P)-binding motif GAGNVA; and (iii) a second putative coenzyme-binding motif GVLTGKG together with four residues involved in the binding of the reduced form of NADP. Conclusion: A number of structural features specific to plant GDH4 have been found. The results reinforce the probable key role of GDH4 in ammonium assimilation by plants. Reviewers: This article was reviewed by Tina Bakolitsa (nominated by Eugene Koonin), Martin Jambon (nominated by Laura Landweber), Sandor Pangor and Franck Eisenhaber. PMID:17173671
Roux-Rouquie, M; Marilley, M
2000-09-15
We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, bring circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X. laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed.
Analysis of secondary structural elements in human microRNA hairpin precursors.
Liu, Biao; Childs-Disney, Jessica L; Znosko, Brent M; Wang, Dan; Fallahi, Mohammad; Gallo, Steven M; Disney, Matthew D
2016-03-01
MicroRNAs (miRNAs) regulate gene expression by targeting complementary mRNAs for destruction or translational repression. Aberrant expression of miRNAs has been associated with various diseases including cancer, thus making them interesting therapeutic targets. The composite of secondary structural elements that comprise miRNAs could aid the design of small molecules that modulate their function. We analyzed the secondary structural elements, or motifs, present in all human miRNA hairpin precursors and compared them to highly expressed human RNAs with known structures and other RNAs from various organisms. Amongst human miRNAs, there are 3808 unique motifs, many residing in processing sites. Further, we identified motifs in miRNAs that are not present in other highly expressed human RNAs, making them desirable targets for small molecules. MiRNA motifs were incorporated into a searchable database that is freely available. We also analyzed the most frequently occurring bulges and internal loops for each RNA class and found that the smallest loops possible prevail. However, the distribution of loops and the preferred closing base pairs were unique to each class. Collectively, we have completed a broad survey of motifs found in human miRNA precursors, highly expressed human RNAs, and RNAs from other organisms. Interestingly, unique motifs were identified in human miRNA processing sites, binding to which could inhibit miRNA maturation and hence function.
Pereira, Maria G; Benevides, Norma M B; Melo, Marcia R S; Valente, Ana Paula; Melo, Fábio R; Mourão, Paulo A S
2005-09-05
Marine red algae are an abundant source of sulfated galactans with potent anticoagulant activity. However, the specific structural motifs that confer biological activity remain to be elucidated. We have now isolated and purified a sulfated galactan from the marine red alga Gelidium crinale. The structure of this polysaccharide was determined using NMR spectroscopy. It is composed of the repeating structure -4-alpha-Galp-(1-->3)-beta-Galp1--> but with a variable sulfation pattern. Clearly 15% of the total alpha-units are 2,3-di-sulfated and another 55% are 2-sulfated. No evidence for the occurrence of 3,6-anhydro alpha-galactose units was observed in the NMR spectra. We also compared the anticoagulant activity of this sulfated galactan with a polysaccharide from the species Botryocladia occidentalis, with a similar saccharide chain but with higher amounts of 2,3-di-sulfated alpha-units. The sulfated galactan from G. crinale has a lower anticoagulant activity on a clotting assay when compared with the polysaccharide from B. occidentalis. When tested in assays using specific proteases and coagulation inhibitors, these two galactans showed significant differences in their activity. They do not differ in thrombin inhibition mediated by antithrombin, but in assays where heparin cofactor II replaces antithrombin, the sulfated galactan from G. crinale requires a significantly higher concentration to achieve the same inhibitory effect as the polysaccharide from B. occidentalis. In contrast, when factor Xa instead of thrombin is used as the target protease, the sulfated galactan from G. crinale is a more potent anticoagulant. These observations suggest that the proportion and/or the distribution of 2,3-di-sulfated alpha-units along the galactan chain may be a critical structural motif to promote the interaction of the protease with specific protease and coagulation inhibitors.
Exploration of tetrahedral structures in silicate cathodes using a motif-network scheme
Zhao, Xin; Wu, Shunqing; Lv, Xiaobao; ...
2015-10-26
Using a motif-network search scheme, we studied the tetrahedral structures of the dilithium/disodium transition metal orthosilicates A2MSiO4 with A = Li or Na and M = Mn, Fe or Co. In addition to finding all previously reported structures, we discovered many other different tetrahedral-network-based crystal structures which are highly degenerate in energy. In addition, these structures can be classified into structures with 1D, 2D and 3D M-Si-O frameworks. A clear trend of the structural preference in different systems was revealed and possible indicators that affect the structure stabilities were introduced. For the case of Na systems which have been much less investigated in the literature relative to the Li systems, we predicted their ground state structures and found evidence for the existence of new structural motifs.
Holdsworth, Gill; Slocombe, Patrick; Doyle, Carl; Sweeney, Bernadette; Veverka, Vaclav; Le Riche, Kelly; Franklin, Richard J.; Compson, Joanne; Brookings, Daniel; Turner, James; Kennedy, Jeffery; Garlish, Rachael; Shi, Jiye; Newnham, Laura; McMillan, David; Muzylak, Mariusz; Carr, Mark D.; Henry, Alistair J.; Ceska, Thomas; Robinson, Martyn K.
2012-01-01
LRP5 and LRP6 are proteins predicted to contain four six-bladed β-propeller domains and both bind the bone-specific Wnt signaling antagonist sclerostin. Here, we report the crystal structure of the amino-terminal region of LRP6 and using NMR show that the ability of sclerostin to bind to this molecule is mediated by the central core of sclerostin and does not involve the amino- and carboxyl-terminal flexible arm regions. We show that this structured core region interacts with LRP5 and LRP6 via an NXI motif (found in the sequence PNAIG) within a flexible loop region (loop 2) within the central core region. This sequence is related closely to a previously identified motif in laminin that mediates its interaction with the β-propeller domain of nidogen. However, the NXI motif is not involved in the interaction of sclerostin with LRP4 (another β-propeller containing protein in the LRP family). A peptide derived from the loop 2 region of sclerostin blocked the interaction of sclerostin with LRP5/6 and also inhibited Wnt1 but not Wnt3A or Wnt9B signaling. This suggests that these Wnts interact with LRP6 in different ways. PMID:22696217
2015-01-01
Many pathogenic bacteria utilize the type III secretion system (T3SS) to translocate effector proteins directly into host cells, facilitating colonization. In enterohemorrhagic Escherichia coli (EHEC), a subset of T3SS effectors is essential for suppression of the inflammatory response in hosts, including humans. Identified as a zinc protease that cleaves NF-κB transcription factors, NleC is one such effector. Here, we investigate NleC substrate specificity, showing that four residues around the cleavage site in the DNA-binding loop of the NF-κB subunit RelA strongly influence the cleavage rate. Class I NF-κB subunit p50 is cleaved at a reduced rate consistent with conservation of only three of these four residues. However, peptides containing 10 residues on each side of the scissile bond were not efficiently cleaved by NleC, indicating that elements distal from the cleavage site are also important for substrate recognition. We present the crystal structure of NleC and show that it mimics DNA structurally and electrostatically. Consistent with this model, mutation of phosphate-mimicking residues in NleC reduces the level of RelA cleavage. We propose that global recognition of NF-κB subunits by DNA mimicry combined with a high sequence selectivity for the cleavage site results in exquisite NleC substrate specificity. The structure also shows that despite undetectable similarity of its sequence to those of other Zn2+ proteases beyond its conserved HExxH Zn2+-binding motif, NleC is a member of the Zincin protease superfamily, albeit divergent from its structural homologues. In particular, NleC displays a modified Ψ-loop motif that may be important for folding and refolding requirements implicit in T3SS translocation. PMID:25040221
Brown, Simon H J; Mitchell, Todd W; Oakley, Aaron J; Pham, Huong T; Blanksby, Stephen J
2012-09-01
Since the 1950s, X-ray crystallography has been the mainstay of structural biology, providing detailed atomic-level structures that continue to revolutionize our understanding of protein function. From recent advances in this discipline, a picture has emerged of intimate and specific interactions between lipids and proteins that has driven renewed interest in the structure of lipids themselves and raised intriguing questions as to the specificity and stoichiometry in lipid-protein complexes. Herein we demonstrate some of the limitations of crystallography in resolving critical structural features of ligated lipids and thus determining how these motifs impact protein binding. As a consequence, mass spectrometry must play an important and complementary role in unraveling the complexities of lipid-protein interactions. We evaluate recent advances and highlight ongoing challenges towards the twin goals of (1) complete structure elucidation of low, abundant, and structurally diverse lipids by mass spectrometry alone, and (2) assignment of stoichiometry and specificity of lipid interactions within protein complexes.
Experience-Dependent Rewiring of Specific Inhibitory Connections in Adult Neocortex
Kätzel, Dennis; Miesenböck, Gero
2014-01-01
Although neocortical connectivity is remarkably stereotyped, the abundance of some wiring motifs varies greatly between cortical areas. To examine if regional wiring differences represent functional adaptations, we have used optogenetic raster stimulation to map the laminar distribution of GABAergic interneurons providing inhibition to pyramidal cells in layer 2/3 (L2/3) of adult mouse barrel cortex during sensory deprivation and recovery. Whisker trimming caused large, motif-specific changes in inhibitory synaptic connectivity: ascending inhibition from deep layers 4 and 5 was attenuated to 20%–45% of baseline, whereas inhibition from superficial layers remained stable (L2/3) or increased moderately (L1). The principal mechanism of deprivation-induced plasticity was motif-specific changes in inhibitory-to-excitatory connection probabilities; the strengths of extant connections were left unaltered. Whisker regrowth restored the original balance of inhibition from deep and superficial layers. Targeted, reversible modifications of specific inhibitory wiring motifs thus contribute to the adaptive remodeling of cortical circuits. PMID:24586113
Structural basis for the binding of tryptophan-based motifs by δ-COP
Suckling, Richard J.; Poon, Pak Phi; Travis, Sophie M.; Majoul, Irina V.; Hughson, Frederick M.; Evans, Philip R.; Duden, Rainer; Owen, David J.
2015-01-01
Coatomer consists of two subcomplexes: the membrane-targeting, ADP ribosylation factor 1 (Arf1):GTP-binding βγδζ-COP F-subcomplex, which is related to the adaptor protein (AP) clathrin adaptors, and the cargo-binding αβ’ε-COP B-subcomplex. We present the structure of the C-terminal μ-homology domain of the yeast δ-COP subunit in complex with the WxW motif from its binding partner, the endoplasmic reticulum-localized Dsl1 tether. The motif binds at a site distinct from that used by the homologous AP μ subunits to bind YxxΦ cargo motifs with its two tryptophan residues sitting in compatible pockets. We also show that the Saccharomyces cerevisiae Arf GTPase-activating protein (GAP) homolog Gcs1p uses a related WxxF motif at its extreme C terminus to bind to δ-COP at the same site in the same way. Mutations designed on the basis of the structure in conjunction with isothermal titration calorimetry confirm the mode of binding and show that mammalian δ-COP binds related tryptophan-based motifs such as that from ArfGAP1 in a similar manner. We conclude that δ-COP subunits bind Wxn(1–6)[WF] motifs within unstructured regions of proteins that influence the lifecycle of COPI-coated vesicles; this conclusion is supported by the observation that, in the context of a sensitizing domain deletion in Dsl1p, mutating the tryptophan-based motif-binding site in yeast causes defects in both growth and carboxypeptidase Y trafficking/processing. PMID:26578768
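The Wxn(1–6)[WF] consensus described in the abstract above (a tryptophan, one to six residues of any type, then a tryptophan or phenylalanine) maps naturally onto a regular expression. The following is a minimal sketch under that reading; the function name `find_tryptophan_motifs` and the test peptides are hypothetical, and a sequence match says nothing about whether the site lies in an unstructured region or actually binds δ-COP.

```python
import re

# W, then 1-6 residues of any type, then W or F. The lookahead lets
# matches that start at different positions overlap one another.
WXN_PATTERN = re.compile(r"(?=(W[A-Z]{1,6}[WF]))")


def find_tryptophan_motifs(sequence):
    """Return (start_index, matched_segment) for each candidate motif.

    For every start position the longest matching segment is reported,
    because the spacer quantifier is greedy.
    """
    return [(m.start(), m.group(1)) for m in WXN_PATTERN.finditer(sequence.upper())]


if __name__ == "__main__":
    # Illustrative peptides only, not sequences from the study.
    for peptide in ("AAWAWAA", "GGWAAFGG", "AAWFAA"):
        print(peptide, find_tryptophan_motifs(peptide))
```

Both motif forms named in the abstract are covered by the same pattern: WxW has a one-residue spacer and WxxF a two-residue spacer.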
Tritschler, Felix; Eulalio, Ana; Helms, Sigrun; Schmidt, Steffen; Coles, Murray; Weichenrieder, Oliver; Izaurralde, Elisa; Truffault, Vincent
2008-01-01
Trailer Hitch (Tral or LSm15) and enhancer of decapping-3 (EDC3 or LSm16) are conserved eukaryotic members of the (L)Sm (Sm and Like-Sm) protein family. They have a similar domain organization, characterized by an N-terminal LSm domain and a central FDF motif; however, in Tral, the FDF motif is flanked by regions rich in charged residues, whereas in EDC3 the FDF motif is followed by a YjeF_N domain. We show that in Drosophila cells, Tral and EDC3 specifically interact with the decapping activator DCP1 and the DEAD-box helicase Me31B. Nevertheless, only Tral associates with the translational repressor CUP, whereas EDC3 associates with the decapping enzyme DCP2. Like EDC3, Tral interacts with DCP1 and localizes to mRNA processing bodies (P bodies) via the LSm domain. This domain remains monomeric in solution and adopts a divergent Sm fold that lacks the characteristic N-terminal α-helix, as determined by nuclear magnetic resonance analyses. Mutational analysis revealed that the structural integrity of the LSm domain is required for Tral both to interact with DCP1 and CUP and to localize to P-bodies. Furthermore, both Tral and EDC3 interact with the C-terminal RecA-like domain of Me31B through their FDF motifs. Together with previous studies, our results show that Tral and EDC3 are structurally related and use a similar mode to associate with common partners in distinct protein complexes. PMID:18765641
Tritschler, Felix; Eulalio, Ana; Helms, Sigrun; Schmidt, Steffen; Coles, Murray; Weichenrieder, Oliver; Izaurralde, Elisa; Truffault, Vincent
2008-11-01
Trailer Hitch (Tral or LSm15) and enhancer of decapping-3 (EDC3 or LSm16) are conserved eukaryotic members of the (L)Sm (Sm and Like-Sm) protein family. They have a similar domain organization, characterized by an N-terminal LSm domain and a central FDF motif; however, in Tral, the FDF motif is flanked by regions rich in charged residues, whereas in EDC3 the FDF motif is followed by a YjeF_N domain. We show that in Drosophila cells, Tral and EDC3 specifically interact with the decapping activator DCP1 and the DEAD-box helicase Me31B. Nevertheless, only Tral associates with the translational repressor CUP, whereas EDC3 associates with the decapping enzyme DCP2. Like EDC3, Tral interacts with DCP1 and localizes to mRNA processing bodies (P bodies) via the LSm domain. This domain remains monomeric in solution and adopts a divergent Sm fold that lacks the characteristic N-terminal alpha-helix, as determined by nuclear magnetic resonance analyses. Mutational analysis revealed that the structural integrity of the LSm domain is required for Tral both to interact with DCP1 and CUP and to localize to P-bodies. Furthermore, both Tral and EDC3 interact with the C-terminal RecA-like domain of Me31B through their FDF motifs. Together with previous studies, our results show that Tral and EDC3 are structurally related and use a similar mode to associate with common partners in distinct protein complexes.
Distance-dependent duplex DNA destabilization proximal to G-quadruplex/i-motif sequences
König, Sebastian L. B.; Huppert, Julian L.; Sigel, Roland K. O.; Evans, Amanda C.
2013-01-01
G-quadruplexes and i-motifs are complementary examples of non-canonical nucleic acid substructure conformations. G-quadruplex thermodynamic stability has been extensively studied for a variety of base sequences, but the degree of duplex destabilization that adjacent quadruplex structure formation can cause has yet to be fully addressed. Stable in vivo formation of these alternative nucleic acid structures is likely to be highly dependent on whether sufficient spacing exists between neighbouring duplex- and quadruplex-/i-motif-forming regions to accommodate quadruplexes or i-motifs without disrupting duplex stability. Prediction of putative G-quadruplex-forming regions is likely to be assisted by further understanding of what distance (number of base pairs) is required for duplexes to remain stable as quadruplexes or i-motifs form. Using oligonucleotide constructs derived from precedented G-quadruplexes and i-motif-forming bcl-2 P1 promoter region, initial biophysical stability studies indicate that the formation of G-quadruplex and i-motif conformations does destabilize proximal duplex regions. The undermining effect that quadruplex formation can have on duplex stability is mitigated with increased distance from the duplex region: a spacing of five base pairs or more is sufficient to maintain duplex stability proximal to predicted quadruplex/i-motif-forming regions. PMID:23771141
DOE Office of Scientific and Technical Information (OSTI.GOV)
Khare, B.; Krishnan, V.; Rajashankar, K.R.
2011-10-21
The assembly of pili on the cell wall of Gram-positive bacteria requires transpeptidase enzymes called sortases. In Streptococcus agalactiae, the PI-1 pilus island of strain 2603V/R encodes two pilus-specific sortases (SrtC1 and SrtC2) and three pilins (GBS80, GBS52 and GBS104). Although either pilus-specific sortase is sufficient for the polymerization of the major pilin, GBS80, incorporation of the minor pilins GBS52 and GBS104 into the pilus structure requires SrtC1 and SrtC2, respectively. The S. agalactiae housekeeping sortase, SrtA, whose gene is present at a different location and does not catalyze pilus polymerization, was shown to be involved in cell wall anchoring of pilus polymers. To understand the structural basis of sortases involved in such diverse functions, we determined the crystal structures of S. agalactiae SrtC1 and SrtA. Both enzymes are made of an eight-stranded beta-barrel core with variations in their active site architecture. SrtA exhibits a catalytic triad arrangement similar to that in Streptococcus pyogenes SrtA but different from that in Staphylococcus aureus SrtA. In contrast, the SrtC1 enzyme contains an N-terminal helical domain and a 'lid' in its putative active site, which is similar to that seen in Streptococcus pneumoniae pilus-specific sortases, although with subtle differences in positioning and composition. To understand the effect of such differences on substrate recognition, we have also determined the crystal structure of a SrtC1 mutant, in which the conserved DP(W/F/Y) motif was replaced with the sorting signal motif of GBS80, IPNTG. By comparing the structures of wild-type SrtA and SrtC1 and the 'lid' mutant of SrtC1, we propose that structural elements within the active site and the lid may be important for defining the role of specific sortase in pili biogenesis.
Biomaterials Made from Coiled-Coil Peptides.
Conticello, Vincent; Hughes, Spencer; Modlin, Charles
The development of biomaterials designed for specific applications is an important objective in personalized medicine. While the breadth and prominence of biomaterials have increased exponentially over the past decades, critical challenges remain to be addressed, particularly in the development of biomaterials that exhibit highly specific functions. These functional properties are often encoded within the molecular structure of the component molecules. Proteins, as a consequence of their structural specificity, represent useful substrates for the construction of functional biomaterials through rational design. This chapter provides an in-depth survey of biomaterials constructed from coiled-coils, one of the best-understood protein structural motifs. We discuss the utility of this structurally diverse and functionally tunable class of proteins for the creation of novel biomaterials. This discussion illustrates the progress that has been made in the development of coiled-coil biomaterials by showcasing studies that bridge the gap between the academic science and potential technological impact.
Santamaría-Hernando, Saray; Krell, Tino; Ramos-González, María-Isabel
2012-01-01
Proteins of the animal heme peroxidase (ANP) superfamily differ greatly in size since they have either one or two catalytic domains that match profile PS50292. The orf PP_2561 of Pseudomonas putida KT2440 that we have called PepA encodes a two-domain ANP. The alignment of these domains with those of PepA homologues revealed a variable number of insertions with the consensus G-x-D-G-x-x-[GN]-[TN]-x-D-D. This motif has also been detected in the structure of pseudopilin (pdb 3G20), where it was found to be involved in Ca2+ coordination although a sequence analysis did not reveal the presence of any known calcium binding motifs in this protein. Isothermal titration calorimetry revealed that a peptide containing this consensus motif specifically bound calcium ions with affinities ranging from 33 to 79 µM depending on the pH. Microcalorimetric titrations of the purified N-terminal ANP-like domain of PepA revealed Ca2+ binding with a K_D of 12 µM and stoichiometry of 1.25 calcium ions per protein monomer. This domain exhibited peroxidase activity after its reconstitution with heme. These data led to the definition of a novel calcium binding motif that we have termed PERCAL and which was abundantly present in animal peroxidase-like domains of bacterial proteins. Bacterial heme peroxidases thus possess two different types of calcium binding motifs, namely PERCAL and the related hemolysin type calcium binding motif, with the latter being located outside the catalytic domains and in their C-terminal end. A phylogenetic tree of ANP-like catalytic domains of bacterial proteins with PERCAL motifs, including single domain peroxidases, was divided into two major clusters, representing domains with and without PERCAL motif containing insertions. We have verified that the recently reported classification of bacterial heme peroxidases in two families (cd09819 and cd09821) is unrelated to these insertions. Sequences matching PERCAL were detected in all kingdoms of life.
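The consensus G-x-D-G-x-x-[GN]-[TN]-x-D-D quoted in the abstract above is written in a PROSITE-like notation, which can be translated into a regular expression to scan protein sequences. The sketch below handles only the small subset of that notation used here (single residues, `x` for any residue, and bracketed alternatives); the helper names and the test sequence are hypothetical, and a sequence hit alone would not establish calcium binding.

```python
import re

PERCAL_CONSENSUS = "G-x-D-G-x-x-[GN]-[TN]-x-D-D"


def consensus_to_regex(consensus):
    """Translate a simple hyphen-separated consensus into a regex string.

    Supported elements: a single residue letter, 'x' (any residue) and
    bracketed alternatives such as [GN]. Repeats like x(2) are not handled.
    """
    parts = []
    for element in consensus.split("-"):
        parts.append("." if element == "x" else element)
    return "".join(parts)


def find_consensus(sequence, consensus=PERCAL_CONSENSUS):
    """Return (start_index, matched_segment) for non-overlapping hits."""
    pattern = re.compile(consensus_to_regex(consensus))
    return [(m.start(), m.group(0)) for m in pattern.finditer(sequence)]


if __name__ == "__main__":
    # Made-up sequence containing one segment that fits the consensus.
    print(consensus_to_regex(PERCAL_CONSENSUS))
    print(find_consensus("MKGADGSLGTADDK"))
```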
HLA-G peptide preferences change in transformed cells: impact on the binding motif.
Celik, Alexander A; Simper, Gwendolin S; Hiemisch, Wiebke; Blasczyk, Rainer; Bade-Döding, Christina
2018-03-30
HLA-G is known for its strictly restricted tissue distribution. HLA-G expression could be detected in immune privileged organs and many tumor entities such as leukemia, multiple myeloma, and non-Hodgkin and Hodgkin's lymphoma. This functional variability from mediation of immune tolerance to facilitation of tumor immune evasion strategies might translate to a differential NK cell inhibition between immune-privileged organs and tumor cells. The biophysical invariability of the HLA-G heavy chain and its contrary diversity in immunity implicates a strong influence of the bound peptides on the pHLA-G structure. The aim was to determine if HLA-G displays a tissue-specific peptide repertoire. Therefore, using soluble sHLA-G technology, we analyzed the K562 and HDLM-2 peptide repertoires. Although both cell lines possess a comparable proteome and recruit HLA-G-restricted peptides through the same peptide-loading pathway, the peptide features appear to be cell specific. HDLM-2 derived HLA-G peptides are anchored by an Arg at p1 and K562-derived peptides are anchored by a Lys. At p2, no anchor motif could be determined while peptides were anchored at pΩ with a Leu and showed an auxiliary anchor motif Pro at p3. To appreciate if the peptide anchor alterations are due to a cell-specific differential peptidome, we performed analysis of peptide availability within the different cell types. Yet, the comparison of the cell-specific proteome and HLA-G-restricted ligandome clearly demonstrates a tissue-specific peptide selection by HLA-G molecules. This exclusive and unexpected observation suggests an exquisite immune function of HLA-G.
The pH-dependent tertiary structure of a designed helix-loop-helix dimer.
Dolphin, G T; Baltzer, L
1997-01-01
De novo designed helix-loop-helix motifs can fold into well-defined tertiary structures if residues or groups of residues are incorporated at the helix-helix boundary to form helix-recognition sites that restrict the conformational degrees of freedom of the helical segments. Understanding the relationship between structure and function of conformational constraints therefore forms the basis for the engineering of non-natural proteins. This paper describes the design of an interhelical HisH+-Asp- hydrogen-bonded ion pair and the conformational stability of the folded helix-loop-helix motif. GTD-C, a polypeptide with 43 amino acid residues, has been designed to fold into a hairpin helix-loop-helix motif that can dimerise to form a four-helix bundle. The folded motif is in slow conformational exchange on the NMR timescale and has a well-dispersed 1H NMR spectrum, a narrow temperature interval for thermal denaturation and a near-UV CD spectrum with some fine structure. The conformational stability is pH dependent with an optimum that corresponds to the pH for maximum formation of a hydrogen-bonded ion pair between HisH17+ in helix I and Asp27- in helix II. The formation of an interhelical salt bridge is strongly suggested by the pH dependence of a number of spectroscopic probes to generate a well-defined tertiary structure in a designed helix-loop-helix motif. The thermodynamic stability of the folded motif is not increased by the formation of the salt bridge, but neighbouring conformations are destabilised. The use of this novel design principle in combination with hydrophobic interactions that provide sufficient binding energy in the folded structure should be of general use in de novo design of native-like proteins.
Shelar, Ashish; Bansal, Manju
2014-12-01
α-Helices are amongst the most common secondary structural elements seen in membrane proteins and are packed in the form of helix bundles. These α-helices encounter varying external environments (hydrophobic, hydrophilic) that may influence the sequence preferences at their N and C-termini. The role of the external environment in stabilization of the helix termini in membrane proteins is still unknown. Here we analyze α-helices in a high-resolution dataset of integral α-helical membrane proteins and establish that their sequence and conformational preferences differ from those in globular proteins. We specifically examine these preferences at the N and C-termini in helices initiating/terminating inside the membrane core as well as in linkers connecting these transmembrane helices. We find that the sequence preferences and structural motifs at capping (Ncap and Ccap) and near-helical (N' and C') positions are influenced by a combination of features including the membrane environment and the innate helix initiation and termination property of residues forming structural motifs. We also find that a large number of helix termini which do not form any particular capping motif are stabilized by formation of hydrogen bonds and hydrophobic interactions contributed from the neighboring helices in the membrane protein. We further validate the sequence preferences obtained from our analysis with data from an ultradeep sequencing study that identifies evolutionarily conserved amino acids in the rat neurotensin receptor. The results from our analysis provide insights for the secondary structure prediction, modeling and design of membrane proteins.
Zolotarov, Yevgen; Strömvik, Martina
2015-01-01
Plants accumulate dehydrins in response to osmotic stresses. Dehydrins are divided into five different classes, which are thought to be regulated in different manners. To better understand differences in transcriptional regulation of the five dehydrin classes, de novo motif discovery was performed on 350 dehydrin promoter sequences from a total of 51 plant genomes. Overrepresented motifs were identified in the promoters of five dehydrin classes. The Kn dehydrin promoters contain motifs linked with meristem specific expression, as well as motifs linked with cold/dehydration and abscisic acid response. KS dehydrin promoters contain a motif with a GATA core. SKn and YnSKn dehydrin promoters contain motifs that match elements connected with cold/dehydration, abscisic acid and light response. YnKn dehydrin promoters contain motifs that match abscisic acid and light response elements, but not cold/dehydration response elements. Conserved promoter motifs are present in the dehydrin classes and across different plant lineages, indicating that dehydrin gene regulation is likely also conserved.
Automated Recognition of RNA Structure Motifs by Their SHAPE Data Signatures.
Radecki, Pierce; Ledda, Mirko; Aviran, Sharon
2018-06-14
High-throughput structure profiling (SP) experiments that provide information at nucleotide resolution are revolutionizing our ability to study RNA structures. Of particular interest are RNA elements whose underlying structures are necessary for their biological functions. We previously introduced patteRNA, an algorithm for rapidly mining SP data for patterns characteristic of such motifs. This work provided a proof-of-concept for the detection of motifs and the capability of distinguishing structures displaying pronounced conformational changes. Here, we describe several improvements and automation routines to patteRNA. We then consider more elaborate biological situations starting with the comparison or integration of results from searches for distinct motifs and across datasets. To facilitate such analyses, we characterize patteRNA’s outputs and describe a normalization framework that regularizes results. We then demonstrate that our algorithm successfully discerns between highly similar structural variants of the human immunodeficiency virus type 1 (HIV-1) Rev response element (RRE) and readily identifies its exact location in whole-genome structure profiles of HIV-1. This work highlights the breadth of information that can be gleaned from SP data and broadens the utility of data-driven methods as tools for the detection of novel RNA elements.
MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.
Ozaki, Haruka; Iwasaki, Wataru
2016-08-01
As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-Seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions.
Subramanian, Sundar Raman; Singam, Ettayapuram Ramaprasad Azhagiya; Berinski, Michael; Subramanian, Venkatesan; Wade, Rebecca C
2016-08-25
Sequence-specific cleavage of collagen by mammalian collagenase plays a pivotal role in cell function. Collagenases are matrix metalloproteinases that cleave the peptide bond at a specific position on fibrillar collagen. The collagenase Hemopexin-like (HPX) domain has been proposed to be responsible for substrate recognition, but the mechanism by which collagenases identify the cleavage site on fibrillar collagen is not clearly understood. In this study, Brownian dynamics simulations coupled with atomic-detail and coarse-grained molecular dynamics simulations were performed to dock matrix metalloproteinase-1 (MMP-1) on a collagen IIIα1 triple helical peptide. We find that the HPX domain recognizes the collagen triple helix at a conserved R-X11-R motif C-terminal to the cleavage site to which the HPX domain of collagen is guided electrostatically. The binding of the HPX domain between the two arginine residues is energetically stabilized by hydrophobic contacts with collagen. From the simulations and analysis of the sequences and structural flexibility of collagen and collagenase, a mechanistic scheme by which MMP-1 can recognize and bind collagen for proteolysis is proposed.
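The R-X11-R pattern named in this abstract (two arginines separated by any eleven residues) can be located in a one-letter amino acid string with a regular expression. The sketch below is only an illustration of the pattern notation and is not part of the simulations described; the function name `find_r_x11_r` and the toy sequence are assumptions made up for the example.

```python
import re

# Lookahead group so that overlapping R-X11-R occurrences are all reported.
R_X11_R = re.compile(r"(?=(R.{11}R))")


def find_r_x11_r(sequence):
    """Return (0-based start, matched 13-residue segment) for each R-X11-R hit."""
    return [(m.start(), m.group(1)) for m in R_X11_R.finditer(sequence)]


# Hypothetical toy sequence (not a real collagen sequence):
# an Arg, eleven arbitrary residues, then a second Arg.
toy = "GPPG" + "R" + "A" * 11 + "R" + "GPP"
for start, segment in find_r_x11_r(toy):
    print(start, segment)
```

The lookahead form is chosen because a plain `R.{11}R` pattern would skip a second occurrence that begins inside the first match.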
Chen, Yi-Ju; Lu, Cheng-Tsung; Huang, Kai-Yao; Wu, Hsin-Yi; Chen, Yu-Ju; Lee, Tzong-Yi
2015-01-01
S-glutathionylation, the covalent attachment of glutathione (GSH) to the sulfur atom of cysteine, is a selective and reversible protein post-translational modification (PTM) that regulates protein activity, localization, and stability. Despite its implication in the regulation of protein functions and cell signaling, the substrate specificity of cysteine S-glutathionylation remains unknown. Based on a total of 1783 experimentally identified S-glutathionylation sites from mouse macrophages, this work presents an informatics investigation of S-glutathionylation sites, including structural factors such as the flanking amino acid composition and the accessible surface area (ASA). TwoSampleLogo analysis indicates that positively charged amino acids flanking the S-glutathionylated cysteine may influence the formation of S-glutathionylation in a closed three-dimensional environment. A statistical method is further applied to iteratively detect conserved substrate motifs with statistical significance. A support vector machine (SVM) is then applied to generate a predictive model that considers the substrate motifs. According to five-fold cross-validation, the SVMs trained with substrate motifs achieve enhanced sensitivity, specificity, and accuracy, and perform promisingly on an independent test set. The effectiveness of the proposed method is demonstrated by the correct identification of previously reported S-glutathionylation sites of mouse thioredoxin (TXN) and human protein tyrosine phosphatase 1b (PTP1B). Finally, the constructed models are adopted to implement an effective web-based tool, named GSHSite (http://csb.cse.yzu.edu.tw/GSHSite/), for identifying uncharacterized GSH substrate sites on protein sequences. PMID:25849935
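The analysis of flanking amino acids described above starts from fixed-width sequence windows centred on each cysteine. The following is a minimal sketch of that preprocessing step only; it is not the GSHSite implementation. The window half-width `flank`, the padding character, and the choice to count only Lys and Arg as positively charged are all assumptions made for illustration.

```python
POSITIVE = set("KR")  # assumption: Lys and Arg only; His is left out here


def cysteine_windows(sequence, flank=10, pad="-"):
    """Return (1-based position, window) for every cysteine in the sequence.

    Each window has length 2*flank + 1 with the cysteine at its centre;
    positions beyond the sequence ends are filled with the pad character.
    """
    padded = pad * flank + sequence + pad * flank
    return [
        (i + 1, padded[i:i + 2 * flank + 1])
        for i, aa in enumerate(sequence)
        if aa == "C"
    ]


def positive_fraction(window, pad="-"):
    """Fraction of flanking (non-pad, non-central) residues that are K or R."""
    residues = [aa for aa in window if aa != pad]
    flanking = len(residues) - 1  # exclude the central cysteine
    if flanking <= 0:
        return 0.0
    return sum(aa in POSITIVE for aa in residues) / flanking


# Hypothetical toy peptide, not taken from the study's data set.
for position, window in cysteine_windows("MKRCAAKL", flank=3):
    print(position, window, round(positive_fraction(window), 2))
```

Windows of this kind could then be encoded as features for a classifier; the encoding and the SVM training are deliberately left out of the sketch.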
Yan, Shuo; Wang, Zhongni; Liu, Yuan; Li, Wei; Wu, Feng; Lin, Xuelei; Meng, Zheng
2015-07-01
Late stage pollen-specific promoters are important tools in crop molecular breeding. Several such promoters, and their functional motifs, have been well characterized in dicotyledonous plants such as tomato and tobacco. However, knowledge about the functional architecture of such promoters is limited in the monocotyledonous plant rice. Here, pollen-late-stage-promoter 1 (PLP1) and pollen-late-stage-promoter 2 (PLP2) were characterized using a stable transformation system in rice. Histochemical staining showed that the two promoters exclusively drive GUS expression in late-stage pollen grains in rice. 5' deletion analysis revealed that four regions, including the -1159 to -720 and the -352 to -156 regions of PLP1 and the -740 to -557 and the -557 to -339 regions of PLP2, are important in maintaining the activity and specificity of these promoters. Motif mutation analysis indicated that 'AGAAA' and 'CAAT' motifs in the -740 to -557 region of PLP2 act as enhancers in the promoter. Gain of function experiments indicated that the novel TA-rich motifs 'TACATAA' and 'TATTCAT' in the core region of the PLP1 and PLP2 promoters are necessary, but not sufficient, for pollen-specific expression in rice. Our results provide evidence that the enhancer motif 'AGAAA' is conserved in the pollen-specific promoters of both monocots and eudicots, but that some functional architecture characteristics are different.
Bruce, A. Gregory; Horst, Jeremy A.; Rose, Timothy M.
2016-01-01
The envelope-associated glycoprotein B (gB) is highly conserved within the Herpesviridae and plays a critical role in viral entry. We analyzed the evolutionary conservation of sequence and structural motifs within the Kaposi’s sarcoma-associated herpesvirus (KSHV) gB and homologs of Old World primate rhadinoviruses belonging to the distinct RV1 and RV2 rhadinovirus lineages. In addition to gB homologs of rhadinoviruses infecting the pig-tailed and rhesus macaques, we cloned and sequenced gB homologs of RV1 and RV2 rhadinoviruses infecting chimpanzees. A structural model of the KSHV gB was determined, and functional motifs and sequence variants were mapped to the model structure. Conserved domains and motifs were identified, including an “RGD” motif that plays a critical role in KSHV binding and entry through the cellular integrin αVβ3. The RGD motif was only detected in RV1 rhadinoviruses suggesting an important difference in cell tropism between the two rhadinovirus lineages. PMID:27070755
Identification of a new protein in the centrosome-like "atractophore" of Trichomonas vaginalis.
Bricheux, Geneviève; Coffe, Gérard; Brugerolle, Guy
2007-06-01
The human parasite Trichomonas vaginalis has specific structural bodies, atractophores, associated at one end to the kinetosomes and at the other to the spindle during division. A monoclonal antibody specific for a component of this structure was obtained. It recognizes a protein with a predicted molecular mass of 477 kDa. Sequence analysis of this protein shows that P477 belongs to the family of large coiled-coil proteins, sharing a highly versatile protein folding motif adaptable to many biological functions. P477 might act as an anchor to localize cellular activities and components to the Golgi centrosomal region. It may represent a new class of structural proteins, since similar proteins were found in many protozoans.
Structural basis of GSK-3 inhibition by N-terminal phosphorylation and by the Wnt receptor LRP6.
Stamos, Jennifer L; Chu, Matthew Ling-Hon; Enos, Michael D; Shah, Niket; Weis, William I
2014-03-18
Glycogen synthase kinase-3 (GSK-3) is a key regulator of many cellular signaling pathways. Unlike most kinases, GSK-3 is controlled by inhibition rather than by specific activation. In the insulin and several other signaling pathways, phosphorylation of a serine present in a conserved sequence near the amino terminus of GSK-3 generates an auto-inhibitory peptide. In contrast, Wnt/β-catenin signal transduction requires phosphorylation of Ser/Pro rich sequences present in the Wnt co-receptors LRP5/6, and these motifs inhibit GSK-3 activity. We present crystal structures of GSK-3 bound to its phosphorylated N-terminus and to two of the phosphorylated LRP6 motifs. A conserved loop unique to GSK-3 undergoes a dramatic conformational change that clamps the bound pseudo-substrate peptides, and reveals the mechanism of primed substrate recognition. The structures rationalize target sequence preferences and suggest avenues for the design of inhibitors selective for a subset of pathways regulated by GSK-3. DOI: http://dx.doi.org/10.7554/eLife.01998.001.
Hatakeyama, Tomomitsu; Ishimine, Tomohiro; Baba, Tomohiro; Kimura, Masanari; Unno, Hideaki; Goda, Shuichiro
2013-07-01
CEL-I is a Gal/GalNAc-specific C-type lectin isolated from the sea cucumber Cucumaria echinata. This lectin is composed of two carbohydrate-recognition domains (CRDs) with the carbohydrate-recognition motif QPD (Gln-Pro-Asp), which is generally known to exist in galactose-specific C-type CRDs. In the present study, a mutant CEL-I with EPN (Glu-Pro-Asn) motif, which is thought to be responsible for the carbohydrate-recognition of mannose-specific C-type CRDs, was produced in Escherichia coli, and its effects on the carbohydrate-binding specificity were examined using polyamidoamine dendrimer (PD) conjugated with carbohydrates. Although wild-type CEL-I effectively formed complexes with N-acetylgalactosamine (GalNAc)-PD but not with mannose-PD, the mutant CEL-I showed relatively weak but definite affinity for mannose-PD. These results indicated that the QPD and EPN motifs play a significant role in the carbohydrate-recognition mechanism of CEL-I, especially in the discrimination of galactose and mannose. Additional mutations in the recombinant CEL-I binding site may further increase its specificity for mannose, and should provide insights into designing novel carbohydrate-recognition proteins.
Noborn, Fredrik; Gomez Toledo, Alejandro; Green, Anders; Nasir, Waqas; Sihlbom, Carina; Nilsson, Jonas; Larson, Göran
2016-10-03
Heparan sulfate (HS) and chondroitin sulfate (CS) are complex polysaccharides that regulate important biological pathways in virtually all metazoan organisms. The polysaccharides often display opposite effects on cell functions with HS and CS structural motifs presenting unique binding sites for specific ligands. Still, the mechanisms by which glycan biosynthesis generates complex HS and CS polysaccharides required for the regulation of mammalian physiology remain elusive. Here we present a glycoproteomic approach that identifies and differentiates between HS and CS attachment sites and provides identity to the core proteins. Glycopeptides were prepared from perlecan, a complex proteoglycan known to be substituted with both HS and CS chains, further digested with heparinase or chondroitinase ABC to reduce the HS and CS chain lengths respectively, and thereafter analyzed by nLC-MS/MS. This protocol enabled the identification of three consensus HS sites and one hybrid site, carrying either a HS or a CS chain. Inspection of the amino acid sequence at the hybrid attachment locus indicates that certain peptide motifs may encode for the chain type selection process. This analytical approach will become useful when addressing fundamental questions in basic biology specifically in elucidating the functional roles of site-specific glycosylations of proteoglycans.
Roux-Rouquie, Magali; Marilley, Monique
2000-01-01
We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, provide circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X. laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed. PMID:10982860
Detecting DNA regulatory motifs by incorporating positional trends in information content
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kechris, Katherina J.; van Zwet, Erik; Bickel, Peter J.
2004-05-04
On the basis of the observation that conserved positions in transcription factor binding sites are often clustered together, we propose a simple extension to the model-based motif discovery methods. We assign position-specific prior distributions to the frequency parameters of the model, penalizing deviations from a specified conservation profile. Examples with both simulated and real data show that this extension helps discover motifs as the data become noisier or when there is a competing false motif.
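The "conservation profile" mentioned above refers to how information content varies along the positions of a binding site. As an illustration of that quantity only (not of the authors' prior or their motif discovery model), the sketch below computes the standard per-position information content in bits for a set of aligned DNA sites, assuming a uniform background and no small-sample correction. The function name and the toy sites are hypothetical.

```python
import math
from collections import Counter


def information_content(sites, alphabet="ACGT"):
    """Per-position information content (bits) of equal-length aligned sites.

    Uses IC_i = log2(|alphabet|) - H_i, where H_i is the Shannon entropy of
    the observed base frequencies in column i (uniform background assumed).
    """
    max_bits = math.log2(len(alphabet))
    profile = []
    for i in range(len(sites[0])):
        counts = Counter(site[i] for site in sites)
        total = sum(counts[base] for base in alphabet)
        entropy = 0.0
        for base in alphabet:
            p = counts[base] / total
            if p > 0:
                entropy -= p * math.log2(p)
        profile.append(max_bits - entropy)
    return profile


# Made-up aligned sites: two fully conserved columns followed by
# a partly conserved one and an unconserved one.
toy_sites = ["ACGT", "ACGA", "ACCC", "ACTG"]
print([round(bits, 2) for bits in information_content(toy_sites)])
```

In this toy alignment the conserved columns sit next to each other, which is the kind of clustering of conserved positions that the abstract takes as its starting observation.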
Ma, Jun; Liu, Fang; Wang, Qinglian; Wang, Kunbo; Jones, Don C.; Zhang, Baohong
2016-01-01
TCP proteins are plant-specific transcription factors implicated in a variety of physiological functions during plant growth and development. In the current study, we performed for the first time a comprehensive analysis of the TCP gene family in a diploid cotton species, Gossypium arboreum, including phylogenetic analysis, chromosome location, gene duplication status, gene structure and conserved motif analysis, as well as expression profiles in fiber at different developmental stages. Our results showed that G. arboreum contains 36 TCP genes, distributed across all thirteen chromosomes. GaTCPs within the same subclade of the phylogenetic tree shared similar exon/intron organization and motif composition. In addition, both segmental duplication and whole-genome duplication contributed significantly to the expansion of GaTCPs. Many of these TCP transcription factor genes are specifically expressed in cotton fiber during different developmental stages, including cotton fiber initiation and early development. This suggests that TCP genes may play important roles in cotton fiber development. PMID:26857372
Gene Isolation Using Degenerate Primers Targeting Protein Motif: A Laboratory Exercise
ERIC Educational Resources Information Center
Yeo, Brandon Pei Hui; Foong, Lian Chee; Tam, Sheh May; Lee, Vivian; Hwang, Siaw San
2018-01-01
Structures and functions of protein motifs are widely included in many biology-based course syllabi. However, little emphasis is placed to link this knowledge to applications in biotechnology to enhance the learning experience. Here, the conserved motifs of nucleotide binding site-leucine rich repeats (NBS-LRR) proteins, successfully used for the…
Nguyen, Thi Quynh Ngoc; Lim, Kah Wai; Phan, Anh Tuân
2017-09-20
Small-molecule ligands targeting nucleic acids have been explored as potential therapeutic agents. Duplex groove-binding ligands have been shown to recognize DNA in a sequence-specific manner. On the other hand, quadruplex-binding ligands exhibit high selectivity between quadruplex and duplex, but show limited discrimination between different quadruplex structures. Here we propose a dual-specific approach through the simultaneous application of duplex- and quadruplex-binders. We demonstrated that a quadruplex-specific ligand and a duplex-specific ligand can simultaneously interact at two separate binding sites of a quadruplex-duplex hybrid harbouring both quadruplex and duplex structural elements. Such a dual-specific targeting strategy would combine the sequence specificity of duplex-binders and the strong binding affinity of quadruplex-binders, potentially allowing the specific targeting of unique quadruplex structures. Future research can be directed towards the development of conjugated compounds targeting specific genomic quadruplex-duplex sites, for which the linker would be highly context-dependent in terms of length and flexibility, as well as the attachment points onto both ligands.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stepanyuk, Galina A.; Serrano, Pedro; Peralta, Eigen
RNA-binding protein 39 (RBM39) is a splicing factor and a transcriptional co-activator of estrogen receptors and Jun/AP-1, and its function has been associated with malignant progression in a number of cancers. The C-terminal RRM domain of RBM39 belongs to the U2AF homology motif family (UHM), which mediates protein–protein interactions through a short tryptophan-containing peptide known as the UHM-ligand motif (ULM). Here, crystal and solution NMR structures of the RBM39-UHM domain, and the crystal structure of its complex with U2AF65-ULM, are reported. The RBM39–U2AF65 interaction was confirmed by co-immunoprecipitation from human cell extracts, by isothermal titration calorimetry and by NMR chemical shift perturbation experiments with the purified proteins. When compared with related complexes, such as U2AF35–U2AF65 and RBM39–SF3b155, the RBM39-UHM–U2AF65-ULM complex reveals both common and discriminating recognition elements in the UHM–ULM binding interface, providing a rationale for the known specificity of UHM–ULM interactions. This study therefore establishes a structural basis for specific UHM–ULM interactions by splicing factors such as U2AF35, U2AF65, RBM39 and SF3b155, and a platform for continued studies of intermolecular interactions governing disease-related alternative splicing in eukaryotic cells.
Atomistic molecular dynamics simulations of bioactive engrailed 1 interference peptides (EN1-iPeps).
Gandhi, Neha S; Blancafort, Pilar; Mancera, Ricardo L
2018-04-27
The neural-specific transcription factor Engrailed 1 (EN1) is overexpressed in basal-like breast tumours. Synthetic interference peptides (EN1-iPeps), comprising a cell-penetrating peptide/nuclear localisation sequence and the Engrailed 1-specific sequence from the N-terminus, have been engineered to produce a strong apoptotic response in tumour cells overexpressing EN1, with no toxicity to normal or non Engrailed 1-expressing cells. Here scaled molecular dynamics simulations were used to study the conformational dynamics of these interference peptides in aqueous solution to characterise their structure and dynamics. Transitions from disordered to α-helical conformation, stabilised by hydrogen bonds and proline-aromatic interactions, were observed throughout the simulations. The backbone of the wild-type peptide folds to a similar conformation as that found in ternary complexes of anterior Hox proteins with conserved hexapeptide motifs important for recognition of pre-B-cell leukemia Homeobox 1, indicating that the motif may possess an intrinsic preference for helical structure. The predicted NMR chemical shifts of these peptides are consistent with the Hox hexapeptides in solution and Engrailed 2 NMR data. These findings highlight the importance of aromatic residues in determining the structure of Engrailed 1 interference peptides, shedding light on the rational design strategy of molecules that could be adopted to inhibit other transcription factors overexpressed in other cancer types, potentially including other transcription factor families that require highly conserved and cooperative protein-protein partnerships for biological activity.
Gorochowski, Thomas E; Grierson, Claire S; di Bernardo, Mario
2018-03-01
Network motifs are significantly overrepresented subgraphs that have been proposed as building blocks for natural and engineered networks. Detailed functional analysis has been performed for many types of motif in isolation, but less is known about how motifs work together to perform complex tasks. To address this issue, we measure the aggregation of network motifs via methods that extract precisely how these structures are connected. Applying this approach to a broad spectrum of networked systems and focusing on the widespread feed-forward loop motif, we uncover striking differences in motif organization. The types of connection are often highly constrained, differ between domains, and clearly capture architectural principles. We show how this information can be used to effectively predict functionally important nodes in the metabolic network of Escherichia coli. Our findings have implications for understanding how networked systems are constructed from motif parts and elucidate constraints that guide their evolution. PMID:29670941
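The feed-forward loop named in this abstract is a three-node pattern in a directed network: X regulates Y, Y regulates Z, and X also regulates Z directly. The sketch below enumerates such triples from an edge list. It is a plain illustration of the motif definition, not the authors' aggregation analysis; the function name and the toy network are assumptions, and the search does not require the triple to be free of additional edges.

```python
def feed_forward_loops(edges):
    """Return sorted (x, y, z) triples with edges x->y, y->z and x->z."""
    successors = {}
    for source, target in edges:
        if source != target:  # ignore self-loops
            successors.setdefault(source, set()).add(target)

    found = []
    for x, targets in successors.items():
        for y in targets:
            for z in successors.get(y, ()):
                # z must differ from x and be a direct target of x as well
                if z != x and z in targets:
                    found.append((x, y, z))
    return sorted(found)


# Hypothetical toy network with one feed-forward loop (a -> b -> c, a -> c).
toy_edges = [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d")]
print(feed_forward_loops(toy_edges))
```

Once the triples are listed, one simple way to look at how motifs aggregate is to count how many triples share each node, although the measures used in the study may differ.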
The role of collagen charge clusters in the modulation of matrix metalloproteinase activity.
Lauer, Janelle L; Bhowmick, Manishabrata; Tokmina-Roszyk, Dorota; Lin, Yan; Van Doren, Steven R; Fields, Gregg B
2014-01-24
Members of the matrix metalloproteinase (MMP) family selectively cleave collagens in vivo. Several substrate structural features that direct MMP collagenolysis have been identified. The present study evaluated the role of charged residue clusters in the regulation of MMP collagenolysis. A series of 10 triple-helical peptide (THP) substrates were constructed in which either Lys-Gly-Asp or Gly-Asp-Lys motifs replaced Gly-Pro-Hyp (where Hyp is 4-hydroxy-L-proline) repeats. The stabilities of THPs containing the two different motifs were analyzed, and kinetic parameters for substrate hydrolysis by six MMPs were determined. A general trend for virtually all enzymes was that, as Gly-Asp-Lys motifs were moved from the extreme N and C termini to the interior next to the cleavage site sequence, kcat/Km values increased. Additionally, all Gly-Asp-Lys THPs were as good or better substrates than the parent THP in which Gly-Asp-Lys was not present. In turn, the Lys-Gly-Asp THPs were also always better substrates than the parent THP, but the magnitude of the difference was considerably less compared with the Gly-Asp-Lys series. Of the MMPs tested, MMP-2 and MMP-9 most greatly favored the presence of charged residues with preference for the Gly-Asp-Lys series. Lys-Gly-(Asp/Glu) motifs are more commonly found near potential MMP cleavage sites than Gly-(Asp/Glu)-Lys motifs. As Lys-Gly-Asp is not as favored by MMPs as Gly-Asp-Lys, the Lys-Gly-Asp motif appears advantageous over the Gly-Asp-Lys motif by preventing unwanted MMP hydrolysis. More specifically, the lack of Gly-Asp-Lys clusters may diminish potential MMP-2 and MMP-9 collagenolytic activity. The present study indicates that MMPs have interactions spanning the P23-P23' subsites of collagenous substrates.
Interleukin-11 binds specific EF-hand proteins via their conserved structural motifs.
Kazakov, Alexei S; Sokolov, Andrei S; Vologzhannikova, Alisa A; Permyakova, Maria E; Khorn, Polina A; Ismailov, Ramis G; Denessiouk, Konstantin A; Denesyuk, Alexander I; Rastrygina, Victoria A; Baksheeva, Viktoriia E; Zernii, Evgeni Yu; Zinchenko, Dmitry V; Glazatov, Vladimir V; Uversky, Vladimir N; Mirzabekov, Tajib A; Permyakov, Eugene A; Permyakov, Sergei E
2017-01-01
Interleukin-11 (IL-11) is a hematopoietic cytokine engaged in numerous biological processes and validated as a target for treatment of various cancers. IL-11 contains intrinsically disordered regions that might recognize multiple targets. Recently we found that aside from IL-11RA and gp130 receptors, IL-11 interacts with calcium sensor protein S100P. Strict calcium dependence of this interaction suggests a possibility of IL-11 interaction with other calcium sensor proteins. Here we probed specificity of IL-11 to calcium-binding proteins of various types: calcium sensors of the EF-hand family (calmodulin, S100B and neuronal calcium sensors: recoverin, NCS-1, GCAP-1, GCAP-2), calcium buffers of the EF-hand family (S100G, oncomodulin), and a non-EF-hand calcium buffer (α-lactalbumin). A specific subset of the calcium sensor proteins (calmodulin, S100B, NCS-1, GCAP-1/2) exhibits metal-dependent binding of IL-11 with dissociation constants of 1-19 μM. These proteins share several amino acid residues belonging to conserved structural motifs of the EF-hand proteins, 'black' and 'gray' clusters. Replacements of the respective S100P residues by alanine drastically decrease its affinity to IL-11, suggesting their involvement in the association process. Secondary structure and accessibility of the hinge region of the EF-hand proteins studied are predicted to control specificity and selectivity of their binding to IL-11. The IL-11 interaction with the EF-hand proteins is expected to occur under numerous pathological conditions, accompanied by disintegration of plasma membrane and efflux of cellular components into the extracellular milieu.
Janky, Rekin's; van Helden, Jacques
2008-01-23
The detection of conserved motifs in promoters of orthologous genes (phylogenetic footprints) has become a common strategy to predict cis-acting regulatory elements. Several software tools are routinely used to raise hypotheses about regulation. However, these tools are generally used as black boxes, with default parameters. A systematic evaluation of optimal parameters for a footprint discovery strategy can bring a sizeable improvement to the predictions. We evaluate the performances of a footprint discovery approach based on the detection of over-represented spaced motifs. This method is particularly suitable for (but not restricted to) Bacteria, since such motifs are typically bound by factors containing a Helix-Turn-Helix domain. We evaluated footprint discovery in 368 Escherichia coli K12 genes with annotated sites, under 40 different combinations of parameters (taxonomical level, background model, organism-specific filtering, operon inference). Motifs are assessed both at the levels of correctness and significance. We further report a detailed analysis of 181 bacterial orthologs of the LexA repressor. Distinct motifs are detected at various taxonomical levels, including the 7 previously characterized taxon-specific motifs. In addition, we highlight a significantly stronger conservation of half-motifs in Actinobacteria, relative to Firmicutes, suggesting an intermediate state in specificity switching between the two Gram-positive phyla, and thereby revealing the on-going evolution of LexA auto-regulation. The footprint discovery method proposed here shows excellent results with E. coli and can readily be extended to predict cis-acting regulatory signals and propose testable hypotheses in bacterial genomes for which nothing is known about regulation.
The T-cell receptor beta chain CDR3 region of BV8S1/BJ1S5 transcripts in type 1 diabetes.
Naserke, H E; Durinovic-Bellò, I; Seidel, D; Ziegler, A G
1996-01-01
We recently described the T-cell receptor (TCR) beta chain CDR3 motif S-SDRLG-NQPQH (BV8S1-BJ1S5) in an islet-specific T-cell clone (K2.12) from a type 1 diabetic patient (AS). A similar motif (RLGNQ) was also reported in a T-cell clone of non-obese diabetic (NOD) mice by others. In order to determine the frequency of our motif in selected and unselected T-cell populations, we cloned and sequenced the CDR3 region of BV8S1-BJ1S5 transcripts. These transcripts were derived from unstimulated peripheral blood T lymphocytes from two type 1 diabetic patients (AS and FS) and their non-diabetic sibling (WS), as well as from an islet-specific T-cell line of one of the patients. In addition, we compared the structure and composition of the CDR3 region in BV8S1-BJ1S5 transcripts from peripheral blood T cells between the patients and their non-diabetic sibling (>50 sequences each). We found that 30% of the islet-specific T-cell line cDNA clones expressed the entire sequence-motif, whereas it was absent in the clones of unstimulated peripheral blood T cells from both patients and their non-diabetic sibling. The average length of the CDR3 region was shorter in the patients (mean AS 9.9, FS 9.9, versus WS 10.7, P = 0.0037) and the number of inserted nucleotides in N nucleotide addition at the DJ-junction lower (mean AS 3.5, FS 3.2, versus WS 5.2, P < 10^-4) as compared with their non-diabetic sibling. Moreover, the pattern of amino acid usage in the CDR3 region was dissimilar at positions 5 and 6, where polar amino acids predominated in both diabetic siblings. In contrast, basic amino acids are preferentially used at position 5 in the clones of the non-diabetic sibling. These data provide information on the general structure of the TCR(BV8S1-BJ1S5) CDR3 region in type 1 diabetes and may indicate differences in the amino and nucleic acid composition of the TCR beta chain CDR3 region between two type 1 diabetic patients and their non-diabetic sibling.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fedarovich, Alena; Nicholas, Robert A.; Davies, Christopher
PBPA from Mycobacterium tuberculosis is a class B-like penicillin-binding protein (PBP) that is not essential for cell growth in M. tuberculosis, but is important for proper cell division in Mycobacterium smegmatis. We have determined the crystal structure of PBPA at 2.05 Å resolution, the first published structure of a PBP from this important pathogen. Compared to other PBPs, PBPA has a relatively small N-terminal domain, and conservation of a cluster of charged residues within this domain suggests that PBPA is more related to class B PBPs than previously inferred from sequence analysis. The C-terminal domain is a typical transpeptidase fold and contains the three conserved active-site motifs characteristic of penicillin-interacting enzymes. While the arrangement of the SxxK and KTG motifs is similar to that observed in other PBPs, the SxN motif is markedly displaced away from the active site, such that its serine (Ser281) is not involved in hydrogen bonding with residues of the other two motifs. A disulfide bridge between Cys282 (the 'x' of the SxN motif) and Cys266, which resides on an adjacent loop, may be responsible for this unusual conformation. Another interesting feature of the structure is a relatively long connection between β5 and α11, which restricts the space available in the active site of PBPA and suggests that conformational changes would be required to accommodate peptide substrate or β-lactam antibiotics during acylation. Finally, the structure shows that one of the two threonines postulated to be targets for phosphorylation is inaccessible (Thr362), whereas the other (Thr437) is well placed on a surface loop near the active site.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Haiqing; Han, Jinkyu; McBean, Coray; ...
2017-01-03
Understanding the key parameters necessary for generating uniform Er,Yb co-activated NaYF4 possessing various selected phases (i.e. cubic or hexagonal) represents an important chemical strategy towards tailoring optical behavior in these systems. In this paper, we report on a straightforward hydrothermal synthesis in which the separate effects of reaction temperature, reaction time, and precursor stoichiometry in the absence of any surfactant were independently investigated. Interestingly, the presence and the concentration of NH4OH appear to be the most critical determinants of the phase and morphology. For example, with NH4OH as an additive, we have observed the formation of novel hierarchical nanowire bundles which possess overall lengths of ~5 μm and widths of ~1.5 μm but are composed of constituent component sub-units of long, ultrathin (~5 nm) nanowires. These motifs have yet to be reported as distinctive morphological manifestations of fluoride materials. The optical properties of as-generated structures have also been carefully analyzed. Specifically, we have observed tunable, structure-dependent energy transfer behavior associated with the formation of a unique class of NaYF4–CdSe quantum dot (QD) heterostructures, incorporating zero-dimensional (0D), one-dimensional (1D), and three-dimensional (3D) NaYF4 structures. Our results have demonstrated the key roles of the intrinsic morphology-specific physical surface area and porosity as factors in governing the resulting opto-electronic behavior. Finally and specifically, the trend in energy transfer efficiency correlates well with the corresponding QD loading within these heterostructures, thereby implying that the efficiency of FRET appears to be directly affected by the amount of QDs immobilized onto the external surfaces of the underlying fluoride host materials.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Xing, Li; Shieh, Huey S.; Selness, Shaun R.
2009-07-24
PH-797804 is a diarylpyridinone inhibitor of p38α mitogen-activated protein (MAP) kinase derived from a racemic mixture as the more potent atropisomer (aS), first proposed by molecular modeling and subsequently confirmed by experiments. On the basis of structural comparison with a different biaryl pyrazole template and supported by dozens of high-resolution crystal structures of p38α inhibitor complexes, PH-797804 is predicted to possess a high level of specificity across the broad human kinase genome. We used a structural bioinformatics approach to identify two selectivity elements encoded by the TXXXG sequence motif on the p38α kinase hinge: (i) Thr106 that serves as the gatekeeper to the buried hydrophobic pocket occupied by 2,4-difluorophenyl of PH-797804 and (ii) the bidentate hydrogen bonds formed by the pyridinone moiety with the kinase hinge requiring an induced 180° rotation of the Met109-Gly110 peptide bond. The peptide flip occurs in p38α kinase due to the critical glycine residue marked by its conformational flexibility. Kinome-wide sequence mining revealed rare presentation of the selectivity motif. Corroboratively, PH-797804 exhibited exceptionally high specificity against MAP kinases and the related kinases. No cross-reactivity was observed in large panels of kinase screens (selectivity ratio of >500-fold). In cellular assays, PH-797804 demonstrated superior potency and selectivity consistent with the biochemical measurements. PH-797804 has met safety criteria in human phase I studies and is under clinical development for several inflammatory conditions. Understanding the rationale for selectivity at the molecular level helps elucidate the biological function and design of specific p38α kinase inhibitors.
Apetri, Adrian; Crespo, Rosa; Juraszek, Jarek; Pascual, Gabriel; Janson, Roosmarijn; Zhu, Xueyong; Zhang, Heng; Keogh, Elissa; Holland, Trevin; Wadia, Jay; Verveen, Hanneke; Siregar, Berdien; Mrosek, Michael; Taggenbrock, Renske; Ameijde, Jeroenvan; Inganäs, Hanna; van Winsen, Margot; Koldijk, Martin H; Zuijdgeest, David; Borgers, Marianne; Dockx, Koen; Stoop, Esther J M; Yu, Wenli; Brinkman-van der Linden, Els C; Ummenthum, Kimberley; van Kolen, Kristof; Mercken, Marc; Steinbacher, Stefan; de Marco, Donata; Hoozemans, Jeroen J; Wilson, Ian A; Koudstaal, Wouter; Goudsmit, Jaap
2018-05-31
Misfolding and aggregation of tau protein are closely associated with the onset and progression of Alzheimer's Disease (AD). By interrogating IgG+ memory B cells from asymptomatic donors with tau peptides, we have identified two somatically mutated VH5-51/VL4-1 antibodies. One of these, CBTAU-27.1, binds to the aggregation motif in the R3 repeat domain and blocks the aggregation of tau into paired helical filaments (PHFs) by sequestering monomeric tau. The other, CBTAU-28.1, binds to the N-terminal insert region and inhibits the spreading of tau seeds and mediates the uptake of tau aggregates into microglia by binding PHFs. Crystal structures revealed that the combination of VH5-51 and VL4-1 recognizes a common Pro-Xn-Lys motif driven by germline-encoded hotspot interactions while the specificity and thereby functionality of the antibodies are defined by the CDR3 regions. Affinity improvement led to improvement in functionality, identifying their epitopes as new targets for therapy and prevention of AD.
Zhang, Xue-Song; Tegtmeyer, Nicole; Traube, Leah; Jindal, Shawn; Perez-Perez, Guillermo; Sticht, Heinrich; Backert, Steffen; Blaser, Martin J
2015-02-01
Helicobacter pylori persistently colonizes the human stomach, with mixed roles in human health. The CagA protein, a key host-interaction factor, is translocated by a type IV secretion system into host epithelial cells, where its EPIYA tyrosine phosphorylation motifs (TPMs) are recognized by host cell kinases, leading to multiple host cell signaling cascades. The CagA TPMs have been described as type A, B, C or D, each with a specific conserved amino acid sequence surrounding EPIYA. Database searching revealed strong non-random distribution of the B-motifs (including EPIYA and EPIYT) in Western H. pylori isolates. In silico analysis of Western H. pylori CagA sequences provided evidence that the EPIYT B-TPMs are significantly less associated with gastric cancer than the EPIYA B-TPMs. By generating and using a phosphorylated CagA B-TPM-specific antibody, we demonstrated the phosphorylated state of the CagA B-TPM EPIYT during H. pylori co-culture with host cells. We also showed that within host cells, CagA interaction with phosphoinositol 3-kinase (PI3-kinase) was B-TPM tyrosine-phosphorylation-dependent, and the recombinant CagA with EPIYT B-TPM had higher affinity to PI3-kinase and enhanced induction of AKT than the isogenic CagA with EPIYA B-TPM. Structural modeling of the CagA B-TPM motif bound to PI3-kinase indicated that the threonine residue at the pY+1 position forms a side-chain hydrogen bond to N-417 of PI3-kinase, which cannot be formed by alanine. During co-culture with AGS cells, an H. pylori strain with a CagA EPIYT B-TPM had significantly attenuated induction of interleukin-8 and hummingbird phenotype, compared to the isogenic strain with B-TPM EPIYA. These results suggest that the A/T polymorphisms could regulate CagA activity through interfering with host signaling pathways related to carcinogenesis, thus influencing cancer risk.
Mariani, Luca; Weinand, Kathryn; Vedenko, Anastasia; Barrera, Luis A; Bulyk, Martha L
2017-09-27
Transcription factors (TFs) control cellular processes by binding specific DNA motifs to modulate gene expression. Motif enrichment analysis of regulatory regions can identify direct and indirect TF binding sites. Here, we created a glossary of 108 non-redundant TF-8mer "modules" of shared specificity for 671 metazoan TFs from publicly available and new universal protein binding microarray data. Analysis of 239 ENCODE TF chromatin immunoprecipitation sequencing datasets and associated RNA sequencing profiles suggests the 8mer modules are more precise than position weight matrices in identifying indirect binding motifs and their associated tethering TFs. We also developed GENRE (genomically equivalent negative regions), a tunable tool for construction of matched genomic background sequences for analysis of regulatory regions. GENRE outperformed four state-of-the-art approaches to background sequence construction. We used our TF-8mer glossary and GENRE in the analysis of the indirect binding motifs for the co-occurrence of tethering factors, suggesting novel TF-TF interactions. We anticipate that these tools will aid in elucidating tissue-specific gene-regulatory programs. Copyright © 2017 Elsevier Inc. All rights reserved.
Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.
Chemes, Lucía Beatriz; de Prat-Gay, Gonzalo; Sánchez, Ignacio Enrique
2015-06-01
Pathogen linear motif mimics are highly evolvable elements that facilitate rewiring of host protein interaction networks. Host linear motifs and pathogen mimics differ in sequence, leading to thermodynamic and structural differences in the resulting protein-protein interactions. Moreover, the functional output of a mimic depends on the motif and domain repertoire of the pathogen protein. Regulatory evolution mediated by linear motifs can be understood by measuring evolutionary rates, quantifying positive and negative selection and performing phylogenetic reconstructions of linear motif natural history. Convergent evolution of linear motif mimics is widespread among unrelated proteins from viral, prokaryotic and eukaryotic pathogens and can also take place within individual protein phylogenies. Statistics, biochemistry and laboratory models of infection link pathogen linear motifs to phenotypic traits such as tropism, virulence and oncogenicity. In vitro evolution experiments and analysis of natural sequences suggest that changes in linear motif composition underlie pathogen adaptation to a changing environment. Copyright © 2015 Elsevier Ltd. All rights reserved.
Jiang, Peng; Singh, Mona; Coller, Hilary A
2013-01-01
Transcript degradation is a widespread and important mechanism for regulating protein abundance. Two major regulators of transcript degradation are RNA Binding Proteins (RBPs) and microRNAs (miRNAs). We computationally explored whether RBPs and miRNAs cooperate to promote transcript decay. We defined five RBP motifs based on the evolutionary conservation of their recognition sites in 3'UTRs as the binding motifs for Pumilio (PUM), U1A, Fox-1, Nova, and UAUUUAU. Recognition sites for some of these RBPs tended to localize at the end of long 3'UTRs. A specific group of miRNA recognition sites were enriched within 50 nts from the RBP recognition sites for PUM and UAUUUAU. The presence of both a PUM recognition site and a recognition site for preferentially co-occurring miRNAs was associated with faster decay of the associated transcripts. For PUM and its co-occurring miRNAs, binding of the RBP to its recognition sites was predicted to release nearby miRNA recognition sites from RNA secondary structures. The mammalian miRNAs that preferentially co-occur with PUM binding sites have recognition seeds that are reverse complements to the PUM recognition motif. Their binding sites have the potential to form hairpin secondary structures with proximal PUM binding sites that would normally limit RISC accessibility, but would be more accessible to miRNAs in response to the binding of PUM. In sum, our computational analyses suggest that a specific set of RBPs and miRNAs work together to affect transcript decay, with the rescue of miRNA recognition sites via RBP binding as one possible mechanism of cooperativity.
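The hairpin argument in the abstract above rests on reverse complementarity between two RNA segments. A minimal sketch of that check follows; the helper names and the example strings are illustrative assumptions and are not taken from the paper, and perfect Watson-Crick pairing is assumed (no G-U wobble pairs).

```python
# Sketch only: tests whether two RNA segments are perfect reverse complements,
# i.e. could form a fully paired hairpin stem. G-U wobble pairs are ignored.
COMPLEMENT = str.maketrans("ACGU", "UGCA")

def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA string (A/C/G/U alphabet)."""
    return rna.upper().translate(COMPLEMENT)[::-1]

def can_form_stem(segment_a: str, segment_b: str) -> bool:
    """True if segment_b is the exact reverse complement of segment_a."""
    return reverse_complement(segment_a) == segment_b.upper()

# Illustrative strings (hypothetical, chosen only to exercise the functions).
rbp_like_site = "UGUAAAUA"
partner_site = reverse_complement(rbp_like_site)
print(partner_site, can_form_stem(rbp_like_site, partner_site))
```

A real analysis would additionally need a folding model to judge whether such a stem is thermodynamically favorable in its sequence context.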
Genome-Wide Prediction and Validation of Peptides That Bind Human Prosurvival Bcl-2 Proteins
DeBartolo, Joe; Taipale, Mikko; Keating, Amy E.
2014-01-01
Programmed cell death is regulated by interactions between pro-apoptotic and prosurvival members of the Bcl-2 family. Pro-apoptotic family members contain a weakly conserved BH3 motif that can adopt an alpha-helical structure and bind to a groove on prosurvival partners Bcl-xL, Bcl-w, Bcl-2, Mcl-1 and Bfl-1. Peptides corresponding to roughly 13 reported BH3 motifs have been verified to bind in this manner. Due to their short lengths and low sequence conservation, BH3 motifs are not detected using standard sequence-based bioinformatics approaches. Thus, it is possible that many additional proteins harbor BH3-like sequences that can mediate interactions with the Bcl-2 family. In this work, we used structure-based and data-based Bcl-2 interaction models to find new BH3-like peptides in the human proteome. We used peptide SPOT arrays to test candidate peptides for interaction with one or more of the prosurvival proteins Bcl-xL, Bcl-w, Bcl-2, Mcl-1 and Bfl-1. For the 36 most promising array candidates, we quantified binding to all five human receptors using direct and competition binding assays in solution. All 36 peptides showed evidence of interaction with at least one prosurvival protein, and 22 peptides bound at least one prosurvival protein with a dissociation constant between 1 and 500 nM; many peptides had specificity profiles not previously observed. We also screened the full-length parent proteins of a subset of array-tested peptides for binding to Bcl-xL and Mcl-1. Finally, we used the peptide binding data, in conjunction with previously reported interactions, to assess the affinity and specificity prediction performance of different models. PMID:24967846
Warfield, Linda; Tuttle, Lisa M; Pacheco, Derek; Klevit, Rachel E; Hahn, Steven
2014-08-26
Although many transcription activators contact the same set of coactivator complexes, the mechanism and specificity of these interactions have been unclear. For example, do intrinsically disordered transcription activation domains (ADs) use sequence-specific motifs, or do ADs of seemingly different sequence have common properties that encode activation function? We find that the central activation domain (cAD) of the yeast activator Gcn4 functions through a short, conserved sequence-specific motif. Optimizing the residues surrounding this short motif by inserting additional hydrophobic residues creates very powerful ADs that bind the Mediator subunit Gal11/Med15 with high affinity via a "fuzzy" protein interface. In contrast to Gcn4, the activity of these synthetic ADs is not strongly dependent on any one residue of the AD, and this redundancy is similar to that of some natural ADs in which few if any sequence-specific residues have been identified. The additional hydrophobic residues in the synthetic ADs likely allow multiple faces of the AD helix to interact with the Gal11 activator-binding domain, effectively forming a fuzzier interface than that of the wild-type cAD.
Kawano, Yasuhiro; Neeley, Shane; Adachi, Kei; Nakai, Hiroyuki
2013-01-01
Overlapping open reading frames (ORFs) in viral genomes undergo co-evolution; however, how individual amino acids coded by overlapping ORFs are structurally, functionally, and co-evolutionarily constrained remains difficult to address by conventional homologous sequence alignment approaches. We report here a new experimental and computational evolution-based methodology to address this question and report its preliminary application to elucidating a mode of co-evolution of the frame-shifted overlapping ORFs in the adeno-associated virus (AAV) serotype 2 viral genome. These ORFs encode both capsid VP protein and non-structural assembly-activating protein (AAP). To show proof of principle of the new method, we focused on the evolutionarily conserved QVKEVTQ and KSKRSRR motifs, a pair of overlapping heptapeptides in VP and AAP, respectively. In the new method, we first identified a large number of capsid-forming VP3 mutants and functionally competent AAP mutants of these motifs from mutant libraries by experimental directed evolution under no co-evolutionary constraints. We used Illumina sequencing to obtain a large dataset and then statistically assessed the viability of VP and AAP heptapeptide mutants. The obtained heptapeptide information was then integrated into an evolutionary algorithm, with which VP and AAP were co-evolved from random or native nucleotide sequences in silico. As a result, we demonstrate that these two heptapeptide motifs could exhibit high degeneracy if coded by separate nucleotide sequences, and elucidate how overlap-evoked co-evolutionary constraints play a role in making the VP and AAP heptapeptide sequences into the present shape. 
Specifically, we demonstrate that two valine (V) residues and β-strand propensity in QVKEVTQ are structurally important, the strongly negative and hydrophilic nature of KSKRSRR is functionally important, and overlap-evoked co-evolution imposes strong constraints on serine (S) residues in KSKRSRR, despite high degeneracy of the motifs in the absence of co-evolutionary constraints.
Toffano-Nioche, Claire; Gautheret, Daniel; Leclerc, Fabrice
2015-01-01
A structural and functional classification of H/ACA and H/ACA-like motifs is obtained from the analysis of the H/ACA guide RNAs which have been identified previously in the genomes of Euryarchaea (Pyrococcus) and Crenarchaea (Pyrobaculum). A unified structure/function model is proposed based on the common structural determinants shared by H/ACA and H/ACA-like motifs in both Euryarchaea and Crenarchaea. Using a computational approach, structural and energetic rules for the guide:target RNA-RNA interactions are derived from structural and functional data on the H/ACA RNP particles. H/ACA(-like) motifs found in Pyrococcus are evaluated through the classification and their biological relevance is discussed. Extra-ribosomal targets found in both Pyrococcus and Pyrobaculum might support the hypothesis of a gene regulation mediated by H/ACA(-like) guide RNAs in archaea. PMID:26240384
Denesyuk, Alexander; Denessiouk, Konstantin; Johnson, Mark S
2018-02-01
An integrin-like β-propeller domain contains seven repeats of a four-stranded antiparallel β-sheet motif (blades). Previously we described a 3D structural motif within each blade of the integrin-type β-propeller. Here, we show unique structural links that join different blades of the β-propeller structure, which together with the structural motif for a single blade are repeated in a β-propeller to provide the functional top face of the barrel, found to be involved in protein-protein interactions and substrate recognition. We compare functional top face diagrams of the integrin-type β-propeller domain and two non-integrin type β-propeller domains of virginiamycin B lyase and WD Repeat-Containing Protein 5. Copyright © 2017 Elsevier Inc. All rights reserved.
Karnik, Rahul; Beer, Michael A.
2015-01-01
The generation of genomic binding or accessibility data from massively parallel sequencing technologies such as ChIP-seq and DNase-seq continues to accelerate. Yet state-of-the-art computational approaches for the identification of DNA binding motifs often yield motifs of weak predictive power. Here we present a novel computational algorithm called MotifSpec, designed to find predictive motifs, in contrast to over-represented sequence elements. The key distinguishing feature of this algorithm is that it uses a dynamic search space and a learned threshold to find discriminative motifs in combination with the modeling of motifs using a full PWM (position weight matrix) rather than k-mer words or regular expressions. We demonstrate that our approach finds motifs corresponding to known binding specificities in several mammalian ChIP-seq datasets, and that our PWMs classify the ChIP-seq signals with accuracy comparable to, or marginally better than motifs from the best existing algorithms. In other datasets, our algorithm identifies novel motifs where other methods fail. Finally, we apply this algorithm to detect motifs from expression datasets in C. elegans using a dynamic expression similarity metric rather than fixed expression clusters, and find novel predictive motifs. PMID:26465884
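For readers unfamiliar with the PWM representation mentioned in the abstract above, the following is a minimal sketch of building a log-odds position weight matrix from aligned sites and scanning a sequence with it. It is not the MotifSpec algorithm (no dynamic search space or learned threshold); the function names, the uniform background, the pseudocount, and the toy sites are all assumptions made for illustration.

```python
import math

BASES = "ACGT"

def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Build a log2-odds PWM (one dict per column) from equal-length sites."""
    width = len(sites[0])
    pwm = []
    for pos in range(width):
        counts = {base: pseudocount for base in BASES}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        pwm.append({b: math.log2(counts[b] / total / background) for b in BASES})
    return pwm

def score_window(pwm, window):
    """Sum the column scores for one window of the same width as the PWM."""
    return sum(column[base] for column, base in zip(pwm, window))

def best_hit(pwm, sequence):
    """Return (score, 0-based start) of the highest-scoring window."""
    width = len(pwm)
    return max((score_window(pwm, sequence[i:i + width]), i)
               for i in range(len(sequence) - width + 1))

# Hypothetical aligned binding sites and a toy sequence to scan.
sites = ["TGACTCA", "TGAGTCA", "TGACTCA", "TGACTAA"]
pwm = build_pwm(sites)
score, start = best_hit(pwm, "CCGATGACTCAGGT")
print(start, round(score, 2))
```

A discriminative method would go further and choose the matrix and score threshold that best separate bound from unbound sequences, rather than deriving the matrix from over-represented sites alone.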
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rha, Geun Bae; Wu, Guangteng; Shoelson, Steven E.
2010-04-15
Hepatocyte nuclear factor 4α (HNF4α) is a novel nuclear receptor that participates in a hierarchical network of transcription factors regulating the development and physiology of such vital organs as the liver, pancreas, and kidney. Among the various transcriptional coregulators with which HNF4α interacts, peroxisome proliferator-activated receptor γ (PPARγ) coactivator 1α (PGC-1α) represents a novel coactivator whose activation is unusually robust and whose binding mode appears to be distinct from that of canonical coactivators such as NCoA/SRC/p160 family members. To elucidate the potentially unique molecular mechanism of PGC-1α recruitment, we have determined the crystal structure of HNF4α in complex with a fragment of PGC-1α containing all three of its LXXLL motifs. Despite the presence of all three LXXLL motifs available for interactions, only one is bound at the canonical binding site, with no additional contacts observed between the two proteins. However, a close inspection of the electron density map indicates that the bound LXXLL motif is not a selected one but an averaged structure of more than one LXXLL motif. Further biochemical and functional studies show that the individual LXXLL motifs can bind but drive only minimal transactivation. Only when more than one LXXLL motif is involved can significant transcriptional activity be measured, and full activation requires all three LXXLL motifs. These findings led us to propose a model wherein each LXXLL motif has an additive effect, and the multiple binding modes by HNF4α toward the LXXLL motifs of PGC-1α could account for the apparent robust activation by providing a flexible mechanism for combinatorial recruitment of additional coactivators and mediators.
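Short linear motifs such as LXXLL (X being any residue) can be located in a protein sequence with a simple pattern scan. The sketch below is one way to do that; the function name and the toy sequence are hypothetical and do not represent PGC-1α. A lookahead is used so that overlapping occurrences are also reported.

```python
import re

# Zero-width lookahead so overlapping LXXLL occurrences are all found.
LXXLL_PATTERN = re.compile(r"(?=(L..LL))")

def find_lxxll(sequence: str):
    """Return (1-based start position, matched pentapeptide) for each LXXLL."""
    return [(m.start() + 1, m.group(1))
            for m in LXXLL_PATTERN.finditer(sequence.upper())]

# Hypothetical toy sequence containing two LXXLL occurrences.
toy_sequence = "EELTSLLQQLLTE"
hits = find_lxxll(toy_sequence)
print(hits)
```

A sequence match alone says nothing about whether a given motif is bound or functional; as the abstract notes, that required structural and biochemical evidence.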
Analysis of the Isolated SecA DEAD Motor Suggests a Mechanism for Chemical-Mechanical Coupling
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nithianantham, Stanley; Shilton, Brian H
2011-09-28
The preprotein cross-linking domain and C-terminal domains of Escherichia coli SecA were removed to create a minimal DEAD motor, SecA-DM. SecA-DM hydrolyzes ATP and has the same affinity for ADP as full-length SecA. The crystal structure of SecA-DM in complex with ADP was solved and shows the DEAD motor in a closed conformation. Comparison with the structure of the E. coli DEAD motor in an open conformation (Protein Data Bank ID 2FSI) indicates main-chain conformational changes in two critical sequences corresponding to Motif III and Motif V of the DEAD helicase family. The structures that the Motif III and Motif V sequences adopt in the DEAD motor open conformation are incompatible with the closed conformation. Therefore, when the DEAD motor makes the transition from open to closed, Motif III and Motif V are forced to change their conformations, which likely functions to regulate passage through the transition state for ATP hydrolysis. The transition state for ATP hydrolysis for the SecA DEAD motor was modeled based on the conformation of the Vasa helicase in complex with adenylyl imidodiphosphate and RNA (Protein Data Bank ID 2DB3). A mechanism for chemical-mechanical coupling emerges, where passage through the transition state for ATP hydrolysis is hindered by the conformational changes required in Motif III and Motif V, and may be promoted by binding interactions with the preprotein substrate and/or other translocase domains and subunits.
Disney, Matthew D.; Liu, Biao; Yang, Wang-Yong; Sellier, Chantal; Tran, Tuan; Charlet-Berguerand, Nicolas; Childs-Disney, Jessica L.
2012-01-01
The development of small molecule chemical probes or therapeutics that target RNA remains a significant challenge despite the great interest in such compounds. The most significant barrier to compound development is a lack of knowledge of the chemical and RNA motif spaces that interact specifically. Herein, we describe a bioactive small molecule probe that targets expanded r(CGG) repeats, or r(CGG)exp, that causes Fragile X-associated Tremor Ataxia Syndrome (FXTAS). The compound was identified by using information on the chemotypes and RNA motifs that interact. Specifically, 9-hydroxy-5,11-dimethyl-2-(2-(piperidin-1-yl)ethyl)-6H-pyrido[4,3-b]carbazol-2-ium binds the 5′CGG/3′GGC motifs in r(CGG)exp and disrupts a toxic r(CGG)exp-protein complex in vitro. Structure-activity relationship (SAR) studies determined that the alkylated pyridyl and phenolic side chains are important chemotypes that drive molecular recognition of r(CGG)exp. Importantly, the compound is efficacious in FXTAS model cellular systems as evidenced by its ability to improve FXTAS-associated pre-mRNA splicing defects and to reduce the size and number of r(CGG)exp-protein aggregates. This approach may establish a general strategy to identify lead ligands that target RNA while also providing a chemical probe to dissect the varied mechanisms by which r(CGG)exp promotes toxicity. PMID:22948243
Disney, Matthew D; Liu, Biao; Yang, Wang-Yong; Sellier, Chantal; Tran, Tuan; Charlet-Berguerand, Nicolas; Childs-Disney, Jessica L
2012-10-19
The development of small molecule chemical probes or therapeutics that target RNA remains a significant challenge despite the great interest in such compounds. The most significant barrier to compound development is defining which chemical and RNA motif spaces interact specifically. Herein, we describe a bioactive small molecule probe that targets expanded r(CGG) repeats, or r(CGG)(exp), that causes Fragile X-associated Tremor Ataxia Syndrome (FXTAS). The compound was identified by using information on the chemotypes and RNA motifs that interact. Specifically, 9-hydroxy-5,11-dimethyl-2-(2-(piperidin-1-yl)ethyl)-6H-pyrido[4,3-b]carbazol-2-ium binds the 5'CGG/3'GGC motifs in r(CGG)(exp) and disrupts a toxic r(CGG)(exp)-protein complex in vitro. Structure-activity relationship studies determined that the alkylated pyridyl and phenolic side chains are important chemotypes that drive molecular recognition of r(CGG)(exp). Importantly, the compound is efficacious in FXTAS model cellular systems as evidenced by its ability to improve FXTAS-associated pre-mRNA splicing defects and to reduce the size and number of r(CGG)(exp)-containing nuclear foci. This approach may establish a general strategy to identify lead ligands that target RNA while also providing a chemical probe to dissect the varied mechanisms by which r(CGG)(exp) promotes toxicity.
Nissan, Gal; Manulis-Sasson, Shulamit; Chalupowicz, Laura; Teper, Doron; Yeheskel, Adva; Pasmanik-Chor, Metsada; Sessa, Guido; Barash, Isaac
2012-02-01
The type III effector HsvG of the gall-forming Pantoea agglomerans pv. gypsophilae is a DNA-binding protein that is imported to the host nucleus and involved in host specificity. The DNA-binding region of HsvG was delineated to 266 amino acids located within a secondary structure region near the N-terminus of the protein but did not display any homology to canonical DNA-binding motifs. A binding site selection procedure was used to isolate a target gene of HsvG, named HSVGT, in Gypsophila paniculata. HSVGT is a predicted acidic protein of the DnaJ family with 244 amino acids. It harbors characteristic conserved motifs of a eukaryotic transcription factor, including a bipartite nuclear localization signal, zinc finger, and leucine zipper DNA-binding motifs. Quantitative real-time polymerase chain reaction analysis demonstrated that HSVGT transcription is specifically induced in planta within 2 h after inoculation with the wild-type P. agglomerans pv. gypsophilae compared with the hsvG mutant. Induction of HSVGT reached a peak of sixfold at 4 h after inoculation and progressively declined thereafter. Gel-shift assay demonstrated that HsvG binds to the HSVGT promoter, indicating that HSVGT is a direct target of HsvG. Our results support the hypothesis that HsvG functions as a transcription factor in gypsophila.
Structural interactions between retroviral Gag proteins examined by cysteine cross-linking.
Hansen, M S; Barklis, E
1995-01-01
We have examined structural interactions between Gag proteins within Moloney murine leukemia virus (M-MuLV) particles by making use of the cysteine-specific cross-linking agents iodine and bis-maleimido hexane. Virion-associated wild-type M-MuLV Pr65Gag proteins in immature particles were intermolecularly cross-linked at cysteines to form Pr65Gag oligomers, from dimers to pentamers or hexamers. Following a systematic approach of cysteine-to-serine mutagenesis, we have shown that cross-linking of Pr65Gag occurred at cysteines of the nucleocapsid (NC) Cys-His motif, suggesting that the Cys-His motifs within virus particles are packed in close proximity. The M-MuLV Pr65Gag protein did not cross-link to the human immunodeficiency virus Pr55Gag protein when the two molecules were coexpressed, indicating either that they did not coassemble or that heterologous Gag proteins were not in close enough proximity to be cross-linked. Using an assembly-competent, protease-minus, cysteine-minus Pr65Gag protein as a template, novel cysteine residues were generated in the M-MuLV capsid domain major homology region (MHR). Cross-linking of proteins containing MHR cysteines showed above-background levels of Gag-Gag dimers but also identified a novel cellular factor, present in virions, that cross-linked to MHR residues. Although the NC cysteine mutation was compatible with M-MuLV particle assembly, deletions of the NC domain were not tolerated. These results suggest that the Cys-His motif is held in close proximity within immature M-MuLV particles by interactions between CA domains and/or non-Cys-His motif domains of the NC. PMID:7815493
Building a stable RNA U-turn with a protonated cytidine
Gottstein-Schmidtke, Sina R.; Duchardt-Ferner, Elke; Groher, Florian; Weigand, Julia E.; Gottstein, Daniel; Suess, Beatrix; Wöhnert, Jens
2014-01-01
The U-turn is a classical three-dimensional RNA folding motif first identified in the anticodon and T-loops of tRNAs. It also occurs frequently as a building block in other functional RNA structures in many different sequence and structural contexts. U-turns induce sharp changes in the direction of the RNA backbone and often conform to the 3-nt consensus sequence 5′-UNR-3′ (N = any nucleotide, R = purine). The canonical U-turn motif is stabilized by a hydrogen bond between the N3 imino group of the U residue and the 3′ phosphate group of the R residue as well as a hydrogen bond between the 2′-hydroxyl group of the uridine and the N7 nitrogen of the R residue. Here, we demonstrate that a protonated cytidine can functionally and structurally replace the uridine at the first position of the canonical U-turn motif in the apical loop of the neomycin riboswitch. Using NMR spectroscopy, we directly show that the N3 imino group of the protonated cytidine forms a hydrogen bond with the backbone phosphate 3′ from the third nucleotide of the U-turn analogously to the imino group of the uridine in the canonical motif. In addition, we compare the stability of the hydrogen bonds in the mutant U-turn motif to the wild type and describe the NMR signature of the C+-phosphate interaction. Our results have implications for the prediction of RNA structural motifs and suggest simple approaches for the experimental identification of hydrogen bonds between protonated C-imino groups and the phosphate backbone. PMID:24951555
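The 3-nt consensus 5′-UNR-3′ quoted in this abstract can be turned into a simple sequence scan. The following is a minimal sketch, not the authors' method: the function name `find_unr_sites` and the example sequence are hypothetical, and a sequence match only flags a candidate site, since an actual U-turn depends on the hydrogen bonds and backbone geometry described above.

```python
import re

# 5'-UNR-3': U, then any nucleotide (N), then a purine (R = A or G).
# A lookahead is used so that overlapping candidates are all reported.
UNR = re.compile(r"(?=(U[ACGU][AG]))")


def find_unr_sites(rna):
    """Return (0-based start, triplet) for every UNR match in an RNA string."""
    rna = rna.upper().replace("T", "U")  # tolerate DNA-style input
    return [(m.start(), m.group(1)) for m in UNR.finditer(rna)]


if __name__ == "__main__":
    # Hypothetical sequence, for illustration only
    print(find_unr_sites("GGCUUAGCUGAACC"))
```

A scan like this would also miss the protonated-cytidine variant reported here, which is one reason sequence-only prediction of this motif is incomplete.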
Combinatorics of feedback in cellular uptake and metabolism of small molecules.
Krishna, Sandeep; Semsey, Szabolcs; Sneppen, Kim
2007-12-26
We analyze the connection between structure and function for regulatory motifs associated with cellular uptake and usage of small molecules. Based on the boolean logic of the feedback we suggest four classes: the socialist, consumer, fashion, and collector motifs. We find that the socialist motif is good for homeostasis of a useful but potentially poisonous molecule, whereas the consumer motif is optimal for nutrition molecules. Accordingly, examples of these motifs are found in, respectively, the iron homeostasis system in various organisms and in the uptake of sugar molecules in bacteria. The remaining two motifs have no obvious analogs in small molecule regulation, but we illustrate their behavior using analogies to fashion and obesity. These extreme motifs could inspire construction of synthetic systems that exhibit bistable, history-dependent states, and homeostasis of flux (rather than concentration).
Crystal Structure of a UDP-glucose-specific Glycosyltransferase from a Mycobacterium Species
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fulton, Zara; McAlister, Adrian; Wilce, Matthew C.J.
2008-10-24
Glycosyltransferases (GTs) are a large and ubiquitous family of enzymes that specifically transfer sugar moieties to a range of substrates. Mycobacterium tuberculosis contains a large number of GTs, many of which are implicated in cell wall synthesis, yet the majority of these GTs remain poorly characterized. Here, we report the high resolution crystal structures of an essential GT (MAP2569c) from Mycobacterium avium subsp. paratuberculosis (a close homologue of Rv1208 from M. tuberculosis) in its apo- and ligand-bound forms. The structure adopted the GT-A fold and possessed the characteristic DXD motif that coordinated an Mn2+ ion. Atypical of most GTs characterized to date, MAP2569c exhibited specificity toward the donor substrate, UDP-glucose. The structure of this ligated complex revealed an induced fit binding mechanism and provided a basis for this unique specificity. Collectively, the structural features suggested that MAP2569c may adopt a 'retaining' enzymatic mechanism, which has implications for the classification of other GTs in this large superfamily.
Facile ionothermal synthesis of microporous and mesoporous carbons from task specific ionic liquids.
Lee, Je Seung; Wang, Xiqing; Luo, Huimin; Baker, Gary A; Dai, Sheng
2009-04-08
An expedient, template-free, high-yield, and solventless route to nitrogen-rich micro- and mesoporous carbons is reported based on direct, atmospheric-pressure carbonization of task-specific ionic liquids bearing one or more nitrile side chains. The resulting textural properties (pore regime, surface area) are highly dependent upon the structural motifs of the ions comprising the corresponding parent ionic liquid, and uniform carbon films are routinely deposited with this novel methodology, highlighting exciting new opportunities in the development of advanced functional carbon composites.
Self-assembly of multi-stranded RNA motifs into lattices and tubular structures
Stewart, Jaimie Marie; Subramanian, Hari K. K.; Franco, Elisa
2017-02-16
Rational design of nucleic acid molecules yields self-assembling scaffolds with increasing complexity, size and functionality. It is an open question whether design methods tailored to build DNA nanostructures can be adapted to build RNA nanostructures with comparable features. We demonstrate the formation of RNA lattices and tubular assemblies from double crossover (DX) tiles, a canonical motif in DNA nanotechnology. Tubular structures can exceed 1 μm in length, suggesting that this DX motif can produce very robust lattices. Some of these tubes spontaneously form with left-handed chirality. We obtain assemblies by using two methods: a protocol where gel-extracted RNA strands are slowly annealed, and a one-pot transcription and anneal procedure. We then identify the tile nick position as a structural requirement for lattice formation. These results demonstrate that stable RNA structures can be obtained with design tools imported from DNA nanotechnology. These large assemblies could be potentially integrated with a variety of functional RNA motifs for drug or nanoparticle delivery, or for colocalization of cellular components.
Self-assembly of multi-stranded RNA motifs into lattices and tubular structures
Stewart, Jaimie Marie; Subramanian, Hari K. K.
2017-01-01
Rational design of nucleic acid molecules yields self-assembling scaffolds with increasing complexity, size and functionality. It is an open question whether design methods tailored to build DNA nanostructures can be adapted to build RNA nanostructures with comparable features. Here we demonstrate the formation of RNA lattices and tubular assemblies from double crossover (DX) tiles, a canonical motif in DNA nanotechnology. Tubular structures can exceed 1 μm in length, suggesting that this DX motif can produce very robust lattices. Some of these tubes spontaneously form with left-handed chirality. We obtain assemblies by using two methods: a protocol where gel-extracted RNA strands are slowly annealed, and a one-pot transcription and anneal procedure. We identify the tile nick position as a structural requirement for lattice formation. Our results demonstrate that stable RNA structures can be obtained with design tools imported from DNA nanotechnology. These large assemblies could be potentially integrated with a variety of functional RNA motifs for drug or nanoparticle delivery, or for colocalization of cellular components. PMID:28204562
2015-01-01
In a companion paper (DOI: 10.021/ja410934b) we demonstrate that the C-rich strand of the cis-regulatory element in the BCL2 promoter element is highly dynamic in nature and can form either an i-motif or a flexible hairpin. Under physiological conditions these two secondary DNA structures are found in an equilibrium mixture, which can be shifted by the addition of small molecules that trap out either the i-motif (IMC-48) or the flexible hairpin (IMC-76). In cellular experiments we demonstrate that the addition of these molecules has opposite effects on BCL2 gene expression and furthermore that these effects are antagonistic. In this contribution we have identified a transcriptional factor that recognizes and binds to the BCL2 i-motif to activate transcription. The molecular basis for the recognition of the i-motif by hnRNP LL is determined, and we demonstrate that the protein unfolds the i-motif structure to form a stable single-stranded complex. In subsequent experiments we show that IMC-48 and IMC-76 have opposite, antagonistic effects on the formation of the hnRNP LL–i-motif complex as well as on the transcription factor occupancy at the BCL2 promoter. For the first time we propose that the i-motif acts as a molecular switch that controls gene expression and that small molecules that target the dynamic equilibrium of the i-motif and the flexible hairpin can differentially modulate gene expression. PMID:24559432
The Crystal Structure of GXGD Membrane Protease FlaK
DOE Office of Scientific and Technical Information (OSTI.GOV)
J Hu; Y Xue; S Lee
2011-12-31
The GXGD proteases are polytopic membrane proteins with catalytic activities against membrane-spanning substrates that require a pair of aspartyl residues. Representative members of the family include preflagellin peptidase, type 4 prepilin peptidase, presenilin and signal peptide peptidase. Many GXGD proteases are important in medicine. For example, type 4 prepilin peptidase may contribute to bacterial pathogenesis, and mutations in presenilin are associated with Alzheimer's disease. As yet, there is no atomic-resolution structure in this protease family. Here we report the crystal structure of FlaK, a preflagellin peptidase from Methanococcus maripaludis, solved at 3.6 Å resolution. The structure contains six transmembrane helices. The GXGD motif and a short transmembrane helix, helix 4, are positioned at the centre, surrounded by other transmembrane helices. The crystal structure indicates that the protease must undergo conformational changes to bring the GXGD motif and a second essential aspartyl residue from transmembrane helix 1 into close proximity for catalysis. A comparison of the crystal structure with models of presenilin derived from biochemical analysis reveals three common transmembrane segments that are similarly arranged around the active site. This observation reinforces the idea that the prokaryotic and human proteases are evolutionarily related. The crystal structure presented here provides a framework for understanding the mechanism of the GXGD proteases, and may facilitate the rational design of inhibitors that target specific members of the family.
The crystal structure of GXGD membrane protease FlaK
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hu, Jian; Xue, Yi; Lee, Sangwon
2011-09-20
The GXGD proteases are polytopic membrane proteins with catalytic activities against membrane-spanning substrates that require a pair of aspartyl residues. Representative members of the family include preflagellin peptidase, type 4 prepilin peptidase, presenilin and signal peptide peptidase. Many GXGD proteases are important in medicine. For example, type 4 prepilin peptidase may contribute to bacterial pathogenesis, and mutations in presenilin are associated with Alzheimer's disease. As yet, there is no atomic-resolution structure in this protease family. Here we report the crystal structure of FlaK, a preflagellin peptidase from Methanococcus maripaludis, solved at 3.6 Å resolution. The structure contains six transmembrane helices. The GXGD motif and a short transmembrane helix, helix 4, are positioned at the centre, surrounded by other transmembrane helices. The crystal structure indicates that the protease must undergo conformational changes to bring the GXGD motif and a second essential aspartyl residue from transmembrane helix 1 into close proximity for catalysis. A comparison of the crystal structure with models of presenilin derived from biochemical analysis reveals three common transmembrane segments that are similarly arranged around the active site. This observation reinforces the idea that the prokaryotic and human proteases are evolutionarily related. The crystal structure presented here provides a framework for understanding the mechanism of the GXGD proteases, and may facilitate the rational design of inhibitors that target specific members of the family.
Kao, Hui-Ju; Weng, Shun-Long; Huang, Kai-Yao; Kaunang, Fergie Joanda; Hsu, Justin Bo-Kai; Huang, Chien-Hsun; Lee, Tzong-Yi
2017-12-21
Carbonylation, which takes place through oxidation of specific residues by reactive oxygen species (ROS), is an irreversible oxidative modification of proteins. It has been reported that carbonylation is related to a number of metabolic or aging diseases including diabetes, chronic lung disease, Parkinson's disease, and Alzheimer's disease. Due to the lack of computational methods dedicated to exploring motif signatures of protein carbonylation sites, we were motivated to exploit an iterative statistical method to characterize and identify carbonylated sites with motif signatures. By manually curating experimental data from research articles, we obtained 332, 144, 135, and 140 verified substrate sites for K (lysine), R (arginine), T (threonine), and P (proline) residues, respectively, from 241 carbonylated proteins. In order to examine the informative attributes for classifying between carbonylated and non-carbonylated sites, multifarious features including composition of twenty amino acids (AAC), composition of amino acid pairs (AAPC), position-specific scoring matrix (PSSM), and positional weighted matrix (PWM) were investigated in this study. Additionally, in an attempt to explore the motif signatures of carbonylation sites, an iterative statistical method was adopted to detect statistically significant dependencies of amino acid compositions between specific positions around substrate sites. A profile hidden Markov model (HMM) was then used to train a predictive model from each motif signature. Moreover, a support vector machine (SVM) was used to construct an integrative model by combining the bit scores obtained from the profile HMMs. The combined model provided enhanced performance, with balanced sensitivity and specificity, in cross-validation and independent testing.
This study provides a new scheme for exploring potential motif signatures at substrate sites of protein carbonylation. The usefulness of the revealed motifs in the identification of carbonylated sites is demonstrated by their effective performance in cross-validation and independent testing. Finally, these substrate motifs were adopted to build an available online resource (MDD-Carb, http://csb.cse.yzu.edu.tw/MDDCarb/ ) and are also anticipated to facilitate the study of large-scale carbonylated proteomes.
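Two of the feature types named in this abstract, amino acid composition (AAC) and amino acid pair composition (AAPC), are simple enough to sketch. The code below is an illustration under stated assumptions and not the MDD-Carb implementation: the function names `aac` and `aapc` are hypothetical, pairs are taken to be adjacent residues (the abstract does not specify this), and the example window is invented.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def aac(window):
    """Fraction of each of the 20 amino acids in a sequence window."""
    counts = Counter(c for c in window if c in AMINO_ACIDS)
    total = sum(counts.values())
    return {a: (counts[a] / total if total else 0.0) for a in AMINO_ACIDS}


def aapc(window):
    """Fraction of each of the 400 adjacent residue pairs (assumed definition)."""
    pairs = [window[i:i + 2] for i in range(len(window) - 1)]
    # Skip pairs containing gaps or non-standard symbols
    pairs = [p for p in pairs if p[0] in AMINO_ACIDS and p[1] in AMINO_ACIDS]
    counts = Counter(pairs)
    total = len(pairs)
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


if __name__ == "__main__":
    window = "AKKLA"  # hypothetical window around a candidate site
    print(aac(window)["K"], aapc(window)["KK"])
```

Each window yields a 20-value and a 400-value vector, which is the kind of fixed-length input a classifier such as an SVM expects.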
Sztuba-Solinska, Joanna; Teramoto, Tadahisa; Rausch, Jason W.; Shapiro, Bruce A.; Padmanabhan, Radhakrishnan; Le Grice, Stuart F. J.
2013-01-01
The Dengue virus (DENV) genome contains multiple cis-acting elements required for translation and replication. Previous studies indicated that a 719-nt subgenomic minigenome (DENV-MINI) is an efficient template for translation and (−) strand RNA synthesis in vitro. We performed a detailed structural analysis of DENV-MINI RNA, combining chemical acylation techniques, Pb2+ ion-induced hydrolysis and site-directed mutagenesis. Our results highlight protein-independent 5′–3′ terminal interactions involving hybridization between recognized cis-acting motifs. Probing analyses identified tandem dumbbell structures (DBs) within the 3′ terminus spaced by single-stranded regions, internal loops and hairpins with embedded GNRA-like motifs. Analysis of conserved motifs and top loops (TLs) of these dumbbells, and their proposed interactions with downstream pseudoknot (PK) regions, predicted an H-type pseudoknot involving TL1 of the 5′ DB and the complementary region, PK2. As disrupting the TL1/PK2 interaction, via ‘flipping’ mutations of PK2, previously attenuated DENV replication, this pseudoknot may participate in regulation of RNA synthesis. Computer modeling implied that this motif might function as autonomous structural/regulatory element. In addition, our studies targeting elements of the 3′ DB and its complementary region PK1 indicated that communication between 5′–3′ terminal regions strongly depends on structure and sequence composition of the 5′ cyclization region. PMID:23531545
İnce, İkbal Agah; Pijlman, Gorben P; Vlak, Just M; van Oers, Monique M
2017-11-01
Previously, we observed that the transcripts of Invertebrate iridescent virus 6 (IIV6) are not polyadenylated, in line with the absence of canonical poly(A) motifs (AATAAA) downstream of the open reading frames (ORFs) in the genome. Here, we determined the 3' ends of the transcripts of fifty-four IIV6 virion protein genes in infected Drosophila Schneider 2 (S2) cells. By using ligation-based amplification of cDNA ends (LACE) it was shown that the IIV6 mRNAs often ended with a CAUUA motif. In silico analysis showed that the 3'-untranslated regions of IIV6 genes have the ability to form hairpin structures (22-56 nt in length) and that for about half of all IIV6 genes these 3' sequences contained complementary TAATG and CATTA motifs. We also show that a hairpin in the 3' flanking region with conserved sequence motifs is a conserved feature in invertebrate-infecting iridoviruses (genus Iridovirus and Chloriridovirus). Copyright © 2017 Elsevier Inc. All rights reserved.
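The TAATG and CATTA motifs mentioned in this abstract are reverse complements of each other, which is what allows them to base-pair within a hairpin stem. A minimal sketch of that check follows; the helper names `reverse_complement` and `can_pair` are hypothetical and the code only tests Watson-Crick complementarity, not whether a stable hairpin actually forms.

```python
# Watson-Crick complement table for DNA
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def can_pair(a, b):
    """True if b is the exact reverse complement of a."""
    return reverse_complement(a) == b.upper()


if __name__ == "__main__":
    print(reverse_complement("TAATG"))
    print(can_pair("TAATG", "CATTA"))
```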
Mitrea, Diana M; Cika, Jaclyn A; Guy, Clifford S; Ban, David; Banerjee, Priya R; Stanley, Christopher B; Nourse, Amanda; Deniz, Ashok A; Kriwacki, Richard W
2016-02-02
The nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identify multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus.
Organic Carbamates in Drug Design and Medicinal Chemistry
2016-01-01
The carbamate group is a key structural motif in many approved drugs and prodrugs. There is an increasing use of carbamates in medicinal chemistry and many derivatives are specifically designed to make drug–target interactions through their carbamate moiety. In this Perspective, we present properties and stabilities of carbamates, reagents and chemical methodologies for the synthesis of carbamates, and recent applications of carbamates in drug design and medicinal chemistry. PMID:25565044
Organic carbamates in drug design and medicinal chemistry.
Ghosh, Arun K; Brindisi, Margherita
2015-04-09
The carbamate group is a key structural motif in many approved drugs and prodrugs. There is an increasing use of carbamates in medicinal chemistry and many derivatives are specifically designed to make drug-target interactions through their carbamate moiety. In this Perspective, we present properties and stabilities of carbamates, reagents and chemical methodologies for the synthesis of carbamates, and recent applications of carbamates in drug design and medicinal chemistry.
Foulk, Michael S.; Urban, John M.; Casella, Cinzia; Gerbi, Susan A.
2015-01-01
Nascent strand sequencing (NS-seq) is used to discover DNA replication origins genome-wide, allowing identification of features for their specification. NS-seq depends on the ability of lambda exonuclease (λ-exo) to efficiently digest parental DNA while leaving RNA-primer protected nascent strands intact. We used genomics and biochemical approaches to determine if λ-exo digests all parental DNA sequences equally. We report that λ-exo does not efficiently digest G-quadruplex (G4) structures in a plasmid. Moreover, λ-exo digestion of nonreplicating genomic DNA (LexoG0) enriches GC-rich DNA and G4 motifs genome-wide. We used LexoG0 data to control for nascent strand–independent λ-exo biases in NS-seq and validated this approach at the rDNA locus. The λ-exo–controlled NS-seq peaks are not GC-rich, and only 35.5% overlap with 6.8% of all G4s, suggesting that G4s are not general determinants for origin specification but may play a role for a subset. Interestingly, we observed a periodic spacing of G4 motifs and nucleosomes around the peak summits, suggesting that G4s may position nucleosomes at this subset of origins. Finally, we demonstrate that use of Na+ instead of K+ in the λ-exo digestion buffer reduced the effect of G4s on λ-exo digestion and discuss ways to increase both the sensitivity and specificity of NS-seq. PMID:25695952
Foulk, Michael S; Urban, John M; Casella, Cinzia; Gerbi, Susan A
2015-05-01
Nascent strand sequencing (NS-seq) is used to discover DNA replication origins genome-wide, allowing identification of features for their specification. NS-seq depends on the ability of lambda exonuclease (λ-exo) to efficiently digest parental DNA while leaving RNA-primer protected nascent strands intact. We used genomics and biochemical approaches to determine if λ-exo digests all parental DNA sequences equally. We report that λ-exo does not efficiently digest G-quadruplex (G4) structures in a plasmid. Moreover, λ-exo digestion of nonreplicating genomic DNA (LexoG0) enriches GC-rich DNA and G4 motifs genome-wide. We used LexoG0 data to control for nascent strand-independent λ-exo biases in NS-seq and validated this approach at the rDNA locus. The λ-exo-controlled NS-seq peaks are not GC-rich, and only 35.5% overlap with 6.8% of all G4s, suggesting that G4s are not general determinants for origin specification but may play a role for a subset. Interestingly, we observed a periodic spacing of G4 motifs and nucleosomes around the peak summits, suggesting that G4s may position nucleosomes at this subset of origins. Finally, we demonstrate that use of Na(+) instead of K(+) in the λ-exo digestion buffer reduced the effect of G4s on λ-exo digestion and discuss ways to increase both the sensitivity and specificity of NS-seq. © 2015 Foulk et al.; Published by Cold Spring Harbor Laboratory Press.
Huntley, Stuart; Baggott, Daniel M.; Hamilton, Aaron T.; Tran-Gyamfi, Mary; Yang, Shan; Kim, Joomyeong; Gordon, Laurie; Branscomb, Elbert; Stubbs, Lisa
2006-01-01
Krüppel-type zinc finger (ZNF) motifs are prevalent components of transcription factor proteins in all eukaryotes. KRAB-ZNF proteins, in which a potent repressor domain is attached to a tandem array of DNA-binding zinc-finger motifs, are specific to tetrapod vertebrates and represent the largest class of ZNF proteins in mammals. To define the full repertoire of human KRAB-ZNF proteins, we searched the genome sequence for key motifs and then constructed and manually curated gene models incorporating those sequences. The resulting gene catalog contains 423 KRAB-ZNF protein-coding loci, yielding alternative transcripts that altogether predict at least 742 structurally distinct proteins. Active rounds of segmental duplication, involving single genes or larger regions and including both tandem and distributed duplication events, have driven the expansion of this mammalian gene family. Comparisons between the human genes and ZNF loci mined from the draft mouse, dog, and chimpanzee genomes not only identified 103 KRAB-ZNF genes that are conserved in mammals but also highlighted a substantial level of lineage-specific change; at least 136 KRAB-ZNF coding genes are primate specific, including many recent duplicates. KRAB-ZNF genes are widely expressed and clustered genes are typically not coregulated, indicating that paralogs have evolved to fill roles in many different biological processes. To facilitate further study, we have developed a Web-based public resource with access to gene models, sequences, and other data, including visualization tools to provide genomic context and interaction with other public data sets. PMID:16606702
Pan, Xiufang; Sittaramane, Vinoth; Gurung, Suman; Chandrasekhar, Anand
2014-02-01
Van gogh-like 2 (Vangl2), a core component of the Wnt/planar cell polarity (PCP) signaling pathway, is a four-pass transmembrane protein with N-terminal and C-terminal domains located in the cytosol, and is structurally conserved from flies to mammals. In vertebrates, Vangl2 plays an essential role in convergence and extension (CE) movements during gastrulation and in facial branchiomotor (FBM) neuron migration in the hindbrain. However, the roles of specific Vangl2 domains, of membrane association, and of specific extracellular and intracellular motifs have not been examined, especially in the context of FBM neuron migration. Through heat shock-inducible expression of various Vangl2 transgenes, we found that membrane associated functions of the N-terminal and C-terminal domains of Vangl2 are involved in regulating FBM neuron migration. Importantly, through temperature shift experiments, we found that the critical period for Vangl2 function coincides with the initial stages of FBM neuron migration out of rhombomere 4. Intriguingly, we have also uncovered a putative nuclear localization motif in the C-terminal domain that may play a role in regulating CE movements. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Predicted taxonomic patterns in pheromone production by longhorned beetles
NASA Astrophysics Data System (ADS)
Ray, Ann M.; Lacey, Emerson S.; Hanks, Lawrence M.
2006-11-01
Males of five species of three tribes in the longhorned beetle subfamily Cerambycinae produce volatile pheromones that share a structural motif (hydroxyl or carbonyl groups at carbons two and three in straight-chains of six, eight, or ten carbons). Pheromone gland pores are present on the prothoraces of males, but are absent in females, suggesting that male-specific gland pores could provide a convenient morphological indication that a species uses volatile pheromones. In this article, we assess the taxonomic distribution of gland pores within the Cerambycinae by examining males and females of 65 species in 24 tribes using scanning electron microscopy. Gland pores were present in males and absent in females of 49 species, but absent in both sexes of the remaining 16 species. Pores were confined to indentations in the cuticle. Among the species that had male-specific gland pores were four species already known to produce volatile compounds consistent with the structural motif. These findings support the initial assumption that gland pores are associated with the production of pheromones by males. There were apparently no taxonomic patterns in the presence of gland pores. These findings suggest that volatile pheromones play an important role in reproduction for many species of the Cerambycinae, and that the trait is evolutionarily labile.
Visualizing frequent patterns in large multivariate time series
NASA Astrophysics Data System (ADS)
Hao, M.; Marwah, M.; Janetzko, H.; Sharma, R.; Keim, D. A.; Dayal, U.; Patnaik, D.; Ramakrishnan, N.
2011-01-01
The detection of previously unknown, frequently occurring patterns in time series, often called motifs, has been recognized as an important task. However, it is difficult to discover and visualize these motifs as their numbers increase, especially in large multivariate time series. To find frequent motifs, we use several temporal data mining and event encoding techniques to cluster and convert a multivariate time series to a sequence of events. Then we quantify the efficiency of the discovered motifs by linking them with a performance metric. To visualize frequent patterns in a large time series with potentially hundreds of nested motifs on a single display, we introduce three novel visual analytics methods: (1) motif layout, using colored rectangles for visualizing the occurrences and hierarchical relationships of motifs in a multivariate time series, (2) motif distortion, for enlarging or shrinking motifs as appropriate for easy analysis and (3) motif merging, to combine a number of identical adjacent motif instances without cluttering the display. Analysts can interactively optimize the degree of distortion and merging to get the best possible view. A specific motif (e.g., the most efficient or least efficient motif) can be quickly detected from a large time series for further investigation. We have applied these methods to two real-world data sets: data center cooling and oil well production. The results provide important new insights into the recurring patterns.
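The encode-then-mine idea in this abstract can be illustrated with a minimal sketch. It is not the authors' method: it assumes a single univariate series, two hypothetical thresholds (`low`, `high`) in place of the clustering step, and plain n-gram counting in place of their temporal data mining. All function names are illustrative.

```python
from collections import Counter


def encode_events(series, low, high):
    """Map each value to an event symbol: 'L' (below low), 'H' (above high), else 'M'.

    The two thresholds are hypothetical stand-ins for a clustering step.
    """
    return "".join("L" if x < low else "H" if x > high else "M" for x in series)


def frequent_motifs(events, length, min_count=2):
    """Count every contiguous window of `length` events and keep the frequent ones.

    Returns (motif, count) pairs sorted by descending count, then alphabetically.
    """
    counts = Counter(events[i:i + length] for i in range(len(events) - length + 1))
    frequent = [(motif, c) for motif, c in counts.items() if c >= min_count]
    return sorted(frequent, key=lambda mc: (-mc[1], mc[0]))


# Toy series (invented for illustration) with an obvious repeating pattern.
events = encode_events([1, 5, 9, 1, 5, 9, 1, 5], low=3, high=7)
print(events)                      # LMHLMHLM
print(frequent_motifs(events, 3))
```

A multivariate series would need one symbol per joint state (for example, a cluster label per time step) before the same counting applies.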
MotifNet: a web-server for network motif analysis.
Smoly, Ilan Y; Lerman, Eugene; Ziv-Ukelson, Michal; Yeger-Lotem, Esti
2017-06-15
Network motifs are small topological patterns that recur in a network significantly more often than expected by chance. Their identification emerged as a powerful approach for uncovering the design principles underlying complex networks. However, available tools for network motif analysis typically require download and execution of computationally intensive software on a local computer. We present MotifNet, the first open-access web-server for network motif analysis. MotifNet allows researchers to analyze integrated networks, where nodes and edges may be labeled, and to search for motifs of up to eight nodes. The output motifs are presented graphically and the user can interactively filter them by their significance, number of instances, node and edge labels, and node identities, and view their instances. MotifNet also allows the user to distinguish between motifs that are centered on specific nodes and motifs that recur in distinct parts of the network. MotifNet is freely available at http://netbio.bgu.ac.il/motifnet. The website was implemented using ReactJs and supports all major browsers. The server interface was implemented in Python with data stored on a MySQL database. Contact: estiyl@bgu.ac.il or michaluz@cs.bgu.ac.il. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved.
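The definition of a network motif given above (a pattern recurring more often than expected by chance) can be made concrete with a small sketch. This is not MotifNet's algorithm: it counts only one three-node pattern, the feed-forward loop, and compares against a deliberately simple null model (random directed graphs with the same nodes and the same number of edges, no self-loops). Degree-preserving randomization is a common alternative. All names are illustrative assumptions.

```python
import random
from itertools import permutations


def count_feed_forward_loops(edges):
    """Count ordered triples (a, b, c) with edges a->b, b->c and a->c."""
    edge_set = set(edges)
    nodes = {n for edge in edge_set for n in edge}
    return sum(
        1
        for a, b, c in permutations(nodes, 3)
        if (a, b) in edge_set and (b, c) in edge_set and (a, c) in edge_set
    )


def random_graph_like(edges, rng):
    """Sample a random directed graph on the same nodes with the same edge count.

    Assumes the input has no self-loops and that node labels are sortable.
    """
    edge_set = set(edges)
    nodes = sorted({n for edge in edge_set for n in edge})
    candidates = [(u, v) for u in nodes for v in nodes if u != v]
    return rng.sample(candidates, len(edge_set))


def motif_z_score(edges, n_random=200, seed=0):
    """Return (observed count, mean random count, z-score) for feed-forward loops."""
    rng = random.Random(seed)
    observed = count_feed_forward_loops(edges)
    counts = [count_feed_forward_loops(random_graph_like(edges, rng))
              for _ in range(n_random)]
    mean = sum(counts) / len(counts)
    std = (sum((c - mean) ** 2 for c in counts) / len(counts)) ** 0.5
    if std == 0:
        z = 0.0 if observed == mean else float("inf")
    else:
        z = (observed - mean) / std
    return observed, mean, z


# Toy network (invented): two feed-forward loops, a->b->c with a->c, and b->c->d with b->d.
toy = [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d"), ("b", "d")]
print(motif_z_score(toy))
```

The brute-force triple enumeration is cubic in the number of nodes, so this sketch only suits small networks.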
Identity and functions of CxxC-derived motifs.
Fomenko, Dmitri E; Gladyshev, Vadim N
2003-09-30
Two cysteines separated by two other residues (the CxxC motif) are employed by many redox proteins for formation, isomerization, and reduction of disulfide bonds and for other redox functions. The place of the C-terminal cysteine in this motif may be occupied by serine (the CxxS motif), modifying the functional repertoire of redox proteins. Here we found that the CxxC motif may also give rise to a motif, in which the C-terminal cysteine is replaced with threonine (the CxxT motif). Moreover, in contrast to a view that the N-terminal cysteine in the CxxC motif always serves as a nucleophilic attacking group, this residue could also be replaced with threonine (the TxxC motif), serine (the SxxC motif), or other residues. In each of these CxxC-derived motifs, the presence of a downstream alpha-helix was strongly favored. A search for conserved CxxC-derived motif/helix patterns in four complete genomes representing bacteria, archaea, and eukaryotes identified known redox proteins and suggested possible redox functions for several additional proteins. Catalytic sites in peroxiredoxins were major representatives of the TxxC motif, whereas those in glutathione peroxidases represented the CxxT motif. Structural assessments indicated that threonines in these enzymes could stabilize catalytic thiolates, suggesting revisions to previously proposed catalytic triads. Each of the CxxC-derived motifs was also observed in natural selenium-containing proteins, in which selenocysteine was present in place of a catalytic cysteine.
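The five motif spellings named in this abstract (CxxC, CxxS, CxxT, TxxC, SxxC) translate directly into sequence patterns. The following is a minimal sketch of such a scan, not the authors' pipeline: it looks only at the primary sequence and does not test for the downstream alpha-helix that the study required. The helper name and the example sequence are invented for illustration.

```python
import re

# One-letter amino acid patterns; '.' stands for any residue (the "x" positions).
CXXC_DERIVED = {
    "CxxC": "C..C",
    "CxxS": "C..S",
    "CxxT": "C..T",
    "TxxC": "T..C",
    "SxxC": "S..C",
}


def find_cxxc_motifs(sequence):
    """Return sorted (start, motif_name, matched_text) tuples with 0-based starts.

    A lookahead is used so that overlapping occurrences are all reported.
    """
    hits = []
    for name, pattern in CXXC_DERIVED.items():
        for match in re.finditer(f"(?=({pattern}))", sequence):
            hits.append((match.start(), name, match.group(1)))
    return sorted(hits)


# Hypothetical peptide containing one CxxC and one TxxC occurrence.
print(find_cxxc_motifs("MACGPCKAATPKCS"))
# [(2, 'CxxC', 'CGPC'), (9, 'TxxC', 'TPKC')]
```

Because four-residue patterns occur frequently by chance, hits from a scan like this are candidates only; the abstract's use of a motif/helix pattern is one way to narrow them.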
Rajan, Rakhi; Taneja, Bhupesh; Mondragón, Alfonso
2010-01-01
Topoisomerase V is an archaeal type I topoisomerase that is unique among topoisomerases due to the presence of both topoisomerase and DNA repair activities in the same protein. It is organized as an N-terminal topoisomerase domain followed by 24 tandem helix-hairpin-helix (HhH) motifs. Structural studies have shown that the active site is buried by the HhH motifs. Here we show that the N-terminal domain can relax DNA in the absence of any HhH motifs and that the HhH motifs are required for stable protein-DNA complex formation. Crystal structures of various topoisomerase V fragments show changes in the relative orientation of the domains mediated by a long bent linker helix, and these movements are essential for the DNA to enter the active site. Phosphate ions bound to the protein near the active site helped model DNA in the topoisomerase domain and show how topoisomerase V may interact with DNA. PMID:20637419
Non-B DB: a database of predicted non-B DNA-forming motifs in mammalian genomes.
Cer, Regina Z; Bruce, Kevin H; Mudunuri, Uma S; Yi, Ming; Volfovsky, Natalia; Luke, Brian T; Bacolla, Albino; Collins, Jack R; Stephens, Robert M
2011-01-01
Although the capability of DNA to form a variety of non-canonical (non-B) structures has long been recognized, the overall significance of these alternate conformations in biology has only recently become accepted en masse. In order to provide access to genome-wide locations of these classes of predicted structures, we have developed non-B DB, a database integrating annotations and analysis of non-B DNA-forming sequence motifs. The database provides the most complete list of alternative DNA structure predictions available, including Z-DNA motifs, quadruplex-forming motifs, inverted repeats, mirror repeats and direct repeats and their associated subsets of cruciforms, triplex and slipped structures, respectively. The database also contains motifs predicted to form static DNA bends, short tandem repeats and homo(purine•pyrimidine) tracts that have been associated with disease. The database has been built using the latest releases of the human, chimp, dog, macaque and mouse genomes, so that the results can be compared directly with other data sources. In order to make the data interpretable in a genomic context, features such as genes, single-nucleotide polymorphisms and repetitive elements (SINE, LINE, etc.) have also been incorporated. The database is accessed through query pages that produce results with links to the UCSC browser and a GBrowse-based genomic viewer. It is freely accessible at http://nonb.abcc.ncifcrf.gov.
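As an illustration of how a sequence-based prediction of one of these classes can work, the sketch below scans for quadruplex-forming motifs using a widely used pattern: four runs of at least three guanines separated by loops of 1 to 7 nucleotides. This pattern is an assumption made for the example; the abstract does not state the criteria the database applies. The sketch scans one strand only, and the function name and test sequence are invented.

```python
import re

# Four G-runs (>= 3 G each) separated by three loops of 1-7 nucleotides.
# Assumed pattern for illustration; not taken from the database description.
G4_PATTERN = re.compile(r"G{3,}(?:[ACGT]{1,7}?G{3,}){3}")


def find_g4_motifs(dna):
    """Return non-overlapping (start, end, sequence) hits on the given strand.

    Coordinates are 0-based with an exclusive end; input case is ignored.
    """
    dna = dna.upper()
    return [(m.start(), m.end(), m.group(0)) for m in G4_PATTERN.finditer(dna)]


print(find_g4_motifs("TTGGGAGGGTGGGAGGGTT"))
# [(2, 17, 'GGGAGGGTGGGAGGG')]
```

To cover the complementary strand, the same function can be applied to the reverse complement, or an equivalent pattern with C-runs can be added.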
Vendra, Venkata Pulla Rao; Agarwal, Garima; Chandani, Sushil; Talla, Venu; Srinivasan, Narayanaswamy; Balasubramanian, Dorairajan
2013-01-01
Background: We highlight an unrecognized physiological role for the Greek key motif, an evolutionarily conserved super-secondary structural topology of the βγ-crystallins. These proteins constitute the bulk of the human eye lens, packed at very high concentrations in a compact, globular, short-range order, generating transparency. Congenital cataract (affecting 400,000 newborns yearly worldwide), associated with 54 mutations in βγ-crystallins, occurs in two major phenotypes: nuclear cataract, which blocks the central visual axis, hampering the development of the growing eye and demanding earliest intervention, and the milder peripheral progressive cataract, where surgery can wait. In order to understand this phenotypic dichotomy at the molecular level, we have studied the structural and aggregation features of representative mutations. Methods: Wild type and several representative mutant proteins were cloned, expressed and purified, and their secondary and tertiary structural details, as well as structural stability, were compared in solution using spectroscopy. Their tendencies to aggregate in vitro and in cellulo were also compared. In addition, we analyzed their structural differences by molecular modeling in silico. Results: Based on their properties, mutants are seen to fall into two classes. Mutants A36P, L45PL54P, R140X, and G165fs display lowered solubility and structural stability, expose several buried residues to the surface, aggregate in vitro and in cellulo, and disturb/distort the Greek key motif; they are associated with nuclear cataract. In contrast, mutants P24T and R77S, associated with peripheral cataract, behave quite similarly to the wild type molecule and do not affect the Greek key topology.
Conclusion: When a mutation distorts even one of the four Greek key motifs, the protein readily self-aggregates and precipitates, consistent with the phenotype of nuclear cataract, while mutations not affecting the motif display ‘native state aggregation’, leading to peripheral cataract, thus offering a protein structural rationale for the cataract phenotypic dichotomy: “distort motif, lose central vision”. PMID:23936409
A study of pH-dependence of shrink and stretch of tetrahedral DNA nanostructures.
Wang, Ping; Xia, Zhiwei; Yan, Juan; Liu, Xunwei; Yao, Guangbao; Pei, Hao; Zuo, Xiaolei; Sun, Gang; He, Dannong
2015-04-21
We monitored the shrink and stretch of the tetrahedral DNA nanostructure (TDN) and of the i-motif-connected TDN structure at pH 8.5 and pH 4.5. We found that not only the i-motif but also the TDN and the DNA double helix change their structures when the pH changes.
Vives-Adrian, Laia; Lujan, Celia; Oliva, Baldo; van der Linden, Lonneke; Selisko, Barbara; Coutard, Bruno; Canard, Bruno; van Kuppeveld, Frank J. M.
2014-01-01
Encephalomyocarditis virus (EMCV) is a member of the Cardiovirus genus within the large Picornaviridae family, which includes a number of important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for viral genome replication. In this study, we report the X-ray structures of two different crystal forms of the EMCV RdRp determined at 2.8- and 2.15-Å resolution. The in vitro elongation and VPg uridylylation activities of the purified enzyme have also been demonstrated. Although the overall structure of EMCV 3Dpol is shown to be similar to that of the known RdRps of other members of the Picornaviridae family, structural comparisons show a large reorganization of the active-site cavity in one of the crystal forms. The rearrangement affects mainly motif A, where the conserved residue Asp240, involved in ribonucleoside triphosphate (rNTP) selection, and its neighbor residue, Phe239, move about 10 Å from their expected positions within the ribose binding pocket toward the entrance of the rNTP tunnel. This altered conformation of motif A is stabilized by a cation-π interaction established between the aromatic ring of Phe239 and the side chain of Lys56 within the finger domain. Other contacts, involving Phe239 and different residues of motif F, are also observed. The movement of motif A is connected with important conformational changes in the finger region flanked by residues 54 to 63, harboring Lys56, and in the polymerase N terminus. The structures determined in this work provide essential information for studies on the cardiovirus RNA replication process and may have important implications for the development of new antivirals targeting the altered conformation of motif A. IMPORTANCE: The Picornaviridae family is one of the largest virus families known, including many important human and animal pathogens. 
The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for picornavirus genome replication and a validated target for the development of antiviral therapies. Solving the X-ray structure of the first cardiovirus RdRp, EMCV 3Dpol, we captured an altered conformation of a conserved motif in the polymerase active site (motif A) containing the aspartic acid residue involved in rNTP selection and binding. This altered conformation of motif A, which interferes with the correct positioning of the rNTP substrate in the active site, is stabilized by a number of residues strictly conserved among picornaviruses. The rearrangements observed suggest that this motif A segment is a dynamic element that can be modulated by external effectors, either activating or inhibiting enzyme activity, and this type of modulation appears to be general to all picornaviruses. PMID:24600002
Vives-Adrian, Laia; Lujan, Celia; Oliva, Baldo; van der Linden, Lonneke; Selisko, Barbara; Coutard, Bruno; Canard, Bruno; van Kuppeveld, Frank J M; Ferrer-Orta, Cristina; Verdaguer, Núria
2014-05-01
Encephalomyocarditis virus (EMCV) is a member of the Cardiovirus genus within the large Picornaviridae family, which includes a number of important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for viral genome replication. In this study, we report the X-ray structures of two different crystal forms of the EMCV RdRp determined at 2.8- and 2.15-Å resolution. The in vitro elongation and VPg uridylylation activities of the purified enzyme have also been demonstrated. Although the overall structure of EMCV 3Dpol is shown to be similar to that of the known RdRps of other members of the Picornaviridae family, structural comparisons show a large reorganization of the active-site cavity in one of the crystal forms. The rearrangement affects mainly motif A, where the conserved residue Asp240, involved in ribonucleoside triphosphate (rNTP) selection, and its neighbor residue, Phe239, move about 10 Å from their expected positions within the ribose binding pocket toward the entrance of the rNTP tunnel. This altered conformation of motif A is stabilized by a cation-π interaction established between the aromatic ring of Phe239 and the side chain of Lys56 within the finger domain. Other contacts, involving Phe239 and different residues of motif F, are also observed. The movement of motif A is connected with important conformational changes in the finger region flanked by residues 54 to 63, harboring Lys56, and in the polymerase N terminus. The structures determined in this work provide essential information for studies on the cardiovirus RNA replication process and may have important implications for the development of new antivirals targeting the altered conformation of motif A. The Picornaviridae family is one of the largest virus families known, including many important human and animal pathogens. 
The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for picornavirus genome replication and a validated target for the development of antiviral therapies. Solving the X-ray structure of the first cardiovirus RdRp, EMCV 3Dpol, we captured an altered conformation of a conserved motif in the polymerase active site (motif A) containing the aspartic acid residue involved in rNTP selection and binding. This altered conformation of motif A, which interferes with the correct positioning of the rNTP substrate in the active site, is stabilized by a number of residues strictly conserved among picornaviruses. The rearrangements observed suggest that this motif A segment is a dynamic element that can be modulated by external effectors, either activating or inhibiting enzyme activity, and this type of modulation appears to be general to all picornaviruses.
Self-Organization of Microcircuits in Networks of Spiking Neurons with Plastic Synapses.
Ocker, Gabriel Koch; Litwin-Kumar, Ashok; Doiron, Brent
2015-08-01
The synaptic connectivity of cortical networks features an overrepresentation of certain wiring motifs compared to simple random-network models. This structure is shaped, in part, by synaptic plasticity that promotes or suppresses connections between neurons depending on their joint spiking activity. Frequently, theoretical studies focus on how feedforward inputs drive plasticity to create this network structure. We study the complementary scenario of self-organized structure in a recurrent network, with spike timing-dependent plasticity driven by spontaneous dynamics. We develop a self-consistent theory for the evolution of network structure by combining fast spiking covariance with a slow evolution of synaptic weights. Through a finite-size expansion of network dynamics we obtain a low-dimensional set of nonlinear differential equations for the evolution of two-synapse connectivity motifs. With this theory in hand, we explore how the form of the plasticity rule drives the evolution of microcircuits in cortical networks. When potentiation and depression are in approximate balance, synaptic dynamics depend on weighted divergent, convergent, and chain motifs. For additive, Hebbian STDP these motif interactions create instabilities in synaptic dynamics that either promote or suppress the initial network structure. Our work provides a consistent theoretical framework for studying how spiking activity in recurrent networks interacts with synaptic plasticity to determine network structure.
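The "additive, Hebbian STDP" and the "approximate balance" of potentiation and depression mentioned in this abstract can be illustrated with the standard pair-based rule with exponential windows. This is a textbook form chosen for the sketch, not necessarily the exact rule or parameters used by the authors; the amplitudes and time constants below are arbitrary illustrative values.

```python
import math


def stdp_weight_change(dt_ms, a_plus=0.01, a_minus=0.01,
                       tau_plus=20.0, tau_minus=20.0):
    """Pair-based additive STDP for one spike pair, with dt_ms = t_post - t_pre.

    Hebbian convention: pre-before-post (dt_ms > 0) potentiates,
    post-before-pre (dt_ms < 0) depresses. The change does not depend on
    the current weight, which is what "additive" refers to here.
    """
    if dt_ms > 0:
        return a_plus * math.exp(-dt_ms / tau_plus)
    if dt_ms < 0:
        return -a_minus * math.exp(dt_ms / tau_minus)
    return 0.0


def window_integral(a_plus=0.01, a_minus=0.01, tau_plus=20.0, tau_minus=20.0):
    """Net area under the STDP window: a_plus * tau_plus - a_minus * tau_minus.

    A value near zero corresponds to balanced potentiation and depression.
    """
    return a_plus * tau_plus - a_minus * tau_minus


print(stdp_weight_change(10.0))   # positive: potentiation
print(stdp_weight_change(-10.0))  # negative: depression
print(window_integral())          # 0.0 for these symmetric parameters
```

With a balanced window, uncorrelated spiking produces little net drift in the weights, so the structure of spike-train correlations has more influence on which synapses grow.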
Self-Organization of Microcircuits in Networks of Spiking Neurons with Plastic Synapses
Ocker, Gabriel Koch; Litwin-Kumar, Ashok; Doiron, Brent
2015-01-01
The synaptic connectivity of cortical networks features an overrepresentation of certain wiring motifs compared to simple random-network models. This structure is shaped, in part, by synaptic plasticity that promotes or suppresses connections between neurons depending on their joint spiking activity. Frequently, theoretical studies focus on how feedforward inputs drive plasticity to create this network structure. We study the complementary scenario of self-organized structure in a recurrent network, with spike timing-dependent plasticity driven by spontaneous dynamics. We develop a self-consistent theory for the evolution of network structure by combining fast spiking covariance with a slow evolution of synaptic weights. Through a finite-size expansion of network dynamics we obtain a low-dimensional set of nonlinear differential equations for the evolution of two-synapse connectivity motifs. With this theory in hand, we explore how the form of the plasticity rule drives the evolution of microcircuits in cortical networks. When potentiation and depression are in approximate balance, synaptic dynamics depend on weighted divergent, convergent, and chain motifs. For additive, Hebbian STDP these motif interactions create instabilities in synaptic dynamics that either promote or suppress the initial network structure. Our work provides a consistent theoretical framework for studying how spiking activity in recurrent networks interacts with synaptic plasticity to determine network structure. PMID:26291697
Li, Jun; Shi, Jian-Li; Wu, Xiao-Yan; Fu, Fang; Yu, Jiang; Yuan, Xiao-Yuan; Peng, Zhe; Cong, Xiao-Yan; Xu, Shao-Jian; Sun, Wen-Bo; Cheng, Kai-Hui; Du, Yi-Jun; Wu, Jia-Qiang; Wang, Jin-Bao; Huang, Bao-Hua
2015-06-01
Adjuvants remain important for boosting immunity and improving resistance in animals. To boost the immunity conferred by a porcine circovirus type 2 (PCV2) DNA vaccine, CpG motifs were inserted. In this study, the dose effect was examined, and the immunity induced by PCV2 DNA vaccines combining the recombinant open reading frame 2 (ORF2) gene with CpG motifs was evaluated. Three-week-old Changbai piglets were inoculated intramuscularly with 200 μg, 400 μg, or 800 μg of DNA vaccine containing either 14 or 18 CpG motifs. Average weight gain and rectal temperature were recorded every day during the experiments. Blood was collected from the piglets every week after vaccination to detect changes in specific antibodies, interleukin-2, and immune cells. Tissues were collected for histopathology and polymerase chain reaction. The results indicated that, in contrast to the control piglets, all doses of both DNA vaccines induced PCV2-specific antibodies. A cellular immunity test showed that PCV2-specific lymphocytes proliferated and that the numbers of TH, TC, and CD3+ T cells rose in the blood of the DNA-vaccinated groups. No distinct pathological damage or viremia occurred in pigs inoculated with the DNA vaccines, but there was some minor pathological damage in the control group. The results demonstrated that CpG motifs as an adjuvant could boost the humoral and cellular immunity of pigs to PCV2, especially cellular immunity. Of the two DNA vaccines constructed, the one containing 18 CpG motifs was more effective. This is the first report that CpG motifs inserted into a PCV2 DNA vaccine as an adjuvant can boost immunity.
Papadopoulos, Dimitrios K.; Reséndez-Pérez, Diana; Cárdenas-Chávez, Diana L.; Villanueva-Segura, Karina; Canales-del-Castillo, Ricardo; Felix, Daniel A.; Fünfschilling, Raphael; Gehring, Walter J.
2011-01-01
Segmental identity along the anteroposterior axis of bilateral animals is specified by Hox genes. These genes encode transcription factors, harboring the conserved homeodomain and, generally, a YPWM motif, which binds Hox cofactors and increases Hox transcriptional specificity in vivo. Here we derive synthetic Drosophila Antennapedia genes, consisting only of the YPWM motif and homeodomain, and investigate their functional role throughout development. Synthetic peptides and full-length Antennapedia proteins cause head-to-thorax transformations in the embryo, as well as antenna-to-tarsus and eye-to-wing transformations in the adult, thus converting the entire head to a mesothorax. This conversion is achieved by repression of genes required for head and antennal development and ectopic activation of genes promoting thoracic and tarsal fates, respectively. Synthetic Antennapedia peptides bind DNA specifically and interact with Extradenticle and Bric-à-brac interacting protein 2 cofactors in vitro and ex vivo. Substitution of the YPWM motif by alanines abolishes Antennapedia homeotic function, whereas substitution of YPWM by the WRPW repressor motif, which binds the transcriptional corepressor Groucho, allows all proteins to act as repressors only. Finally, naturally occurring variations in the size of the linker between the homeodomain and YPWM motif enhance Antennapedia repressive or activating efficiency, emphasizing the importance of linker size, rather than sequence, for specificity. Our results clearly show that synthetic Antennapedia genes are functional in vivo and therefore provide powerful tools for synthetic biology. Moreover, the YPWM motif is necessary—whereas the entire N terminus of the protein is dispensable—for Antennapedia homeotic function, indicating its dual role in transcriptional activation and repression by recruiting either coactivators or corepressors. PMID:21712439
Horchani, Habib; de Saint-Jean, Maud; Barelli, Hélène; Antonny, Bruno
2014-01-01
The yeast protein Spo20 contains a regulatory amphipathic motif that has been suggested to recognize phosphatidic acid, a lipid involved in signal transduction, lipid metabolism and membrane fusion. We have investigated the interaction of the Spo20 amphipathic motif with lipid membranes using a bioprobe strategy that consists of appending this motif to the end of a long coiled-coil, which can be coupled to a GFP reporter for visualization in cells. The resulting construct is amenable to in vitro and in vivo experiments and allows unbiased comparison between amphipathic helices of different chemistry. In vitro, the Spo20 bioprobe responded to small variations in the amount of phosphatidic acid. However, this response was not specific. The membrane binding of the probe depended on the presence of phosphatidylethanolamine and also integrated the contribution of other anionic lipids, including phosphatidylserine and phosphatidyl-inositol-(4,5)bisphosphate. Inverting the sequence of the Spo20 motif neither affected the ability of the probe to interact with anionic liposomes nor modified its cellular localization, making a stereospecific mode of phosphatidic acid recognition unlikely. Nevertheless, the lipid binding properties and the cellular localization of the Spo20 alpha-helix differed markedly from those of another amphipathic motif, Amphipathic Lipid Packing Sensor (ALPS), suggesting that even in the absence of stereospecific interactions, amphipathic helices can act as subcellular membrane targeting determinants in a cellular context.
Crystal structure of yeast allantoicase reveals a repeated jelly roll motif.
Leulliot, Nicolas; Quevillon-Cheruel, Sophie; Sorel, Isabelle; Graille, Marc; Meyer, Philippe; Liger, Dominique; Blondeau, Karine; Janin, Joël; van Tilbeurgh, Herman
2004-05-28
Allantoicase (EC 3.5.3.4) catalyzes the conversion of allantoate into ureidoglycolate and urea, one of the final steps in the degradation of purines to urea. Although this pathway has been known for a long time, the mechanism of most enzymes involved in it is unknown. In this paper we describe the three-dimensional crystal structure of the yeast allantoicase determined at a resolution of 2.6 Å by single anomalous diffraction. This constitutes the first structure for an enzyme of this pathway. The structure reveals a repeated jelly roll beta-sheet motif, also present in proteins of unrelated biochemical function. Allantoicase has a hexameric arrangement in the crystal (dimer of trimers). Analysis of the protein sequence against the structural data reveals the presence of two totally conserved surface patches, one on each jelly roll motif. The hexameric packing concentrates these patches into conserved pockets that probably constitute the active site.
NASA Astrophysics Data System (ADS)
Prasanna, M. D.; Row, T. N. Guru
2001-05-01
The crystal structure of Flunazirine, an anticonvulsant drug, is analyzed in terms of intermolecular interactions involving fluorine. The structure displays motifs formed only by the weak interactions C-H⋯F and C-H⋯π. The motifs thus generated show cavities, which could serve as hosts for complexation. Haloperidol, an antipsychotic drug, shows F⋯F interactions in the crystalline lattice in lieu of Cl⋯Cl interactions. However, strong O-H⋯N interactions dominate packing. The salient features of the two structures in terms of intermolecular interactions reveal that, even though organic fluorine has a lower tendency to engage in hydrogen bonding and F⋯F interactions, these interactions could play a significant role in the design of molecular assemblies via crystal engineering.
Differential pleiotropy and HOX functional organization.
Sivanantharajah, Lovesha; Percival-Smith, Anthony
2015-02-01
Key studies led to the idea that transcription factors are composed of defined modular protein motifs or domains, each with separable, unique function. During evolution, the recombination of these modular domains could give rise to transcription factors with new properties, as has been shown using recombinant molecules. This archetypic, modular view of transcription factor organization is based on the analyses of a few transcription factors such as GAL4, which may represent extreme exemplars rather than an archetype or the norm. Recent work with a set of Homeotic selector (HOX) proteins has revealed differential pleiotropy: the observation that highly-conserved HOX protein motifs and domains make small, additive, tissue specific contributions to HOX activity. Many of these differentially pleiotropic HOX motifs may represent plastic sequence elements called short linear motifs (SLiMs). The coupling of differential pleiotropy with SLiMs, suggests that protein sequence changes in HOX transcription factors may have had a greater impact on morphological diversity during evolution than previously believed. Furthermore, differential pleiotropy may be the genetic consequence of an ensemble nature of HOX transcription factor allostery, where HOX proteins exist as an ensemble of states with the capacity to integrate an extensive array of developmental information. Given a new structural model for HOX functional domain organization, the properties of the archetypic TF may require reassessment. Copyright © 2014 Elsevier Inc. All rights reserved.
Lu, Shun-Wen; Chen, Shiyan; Wang, Jianying; Yu, Hang; Chronis, Demosthenis; Mitchum, Melissa G; Wang, Xiaohong
2009-09-01
Plant CLAVATA3/ESR-related (CLE) peptides have diverse roles in plant growth and development. Here, we report the isolation and functional characterization of five new CLE genes from the potato cyst nematode Globodera rostochiensis. Unlike typical plant CLE peptides that contain a single CLE motif, four of the five Gr-CLE genes encode CLE proteins with multiple CLE motifs. These Gr-CLE genes were found to be specifically expressed within the dorsal esophageal gland cell of nematode parasitic stages, suggesting a role for their encoded proteins in plant parasitism. Overexpression phenotypes of Gr-CLE genes in Arabidopsis mimicked those of plant CLE genes, and Gr-CLE proteins could rescue the Arabidopsis clv3-2 mutant phenotype when expressed within meristems. A short root phenotype was observed when synthetic GrCLE peptides were exogenously applied to roots of Arabidopsis or potato similar to the overexpression of Gr-CLE genes in Arabidopsis and potato hairy roots. These results reveal that G. rostochiensis CLE proteins with either single or multiple CLE motifs function similarly to plant CLE proteins and that CLE signaling components are conserved in both Arabidopsis and potato roots. Furthermore, our results provide evidence to suggest that the evolution of multiple CLE motifs may be an important mechanism for generating functional diversity in nematode CLE proteins to facilitate parasitism.
Connecting Interface Structure to Energy Level Alignment at Aqueous Semiconductor Interfaces
NASA Astrophysics Data System (ADS)
Hybertsen, Mark
Understanding structure-function relationships at aqueous semiconductor interfaces presents fundamental challenges, including the discovery of the key interface structure motifs themselves. Important examples include the alignment of electrochemical redox levels with the semiconductor band edges and the identification of catalytic active sites. We have developed a multistep approach, initially demonstrated for GaN, ZnO and their alloys, motivated by measured high efficiency for photocatalytic water oxidation. The interface structure is simulated using ab initio molecular dynamics (AIMD). The calculated, average interface dipole is combined with the GW approach from many-body perturbation theory to calculate the energy level alignment between the semiconductor band edges and the centroid of the occupied 1b1 energy level of water and thus, the electrochemical levels. Cluster models are used to study reaction pathways. The emergent interface motif is the full (GaN) or partial (ZnO) dissociated interface water layer. Here I will focus on the aqueous interfaces to the stable TiO2 anatase (101) and rutile (110) facets. The AIMD calculations reveal interface water dissociation and reassociation processes through distinct pathways: one direct at the interface and the other via a spectator water molecule from the hydration layer. Comparison between the two interfaces shows that the energy landscape for these pathways depends on the local hydrogen bonding patterns and the interplay with the interface template. Combined results from different initial conditions and AIMD temperatures demonstrate a partially dissociated interface water layer in both cases. Specifically for rutile, structure and the GW-based analysis of the interface energy level alignment agree with experiment. Finally, hole localization at different interface structure motifs will be discussed. Work performed in collaboration with J. Lyons, N. Kharche, M. Ertem and J. Muckerman, done in part at the CFN, which is a U.S. DOE Office of Science Facility, at BNL under Contract No. DE-SC0012704 and with resources from NERSC under Contract No. DE-AC02-05CH11231.
Specificity and non-specificity in RNA–protein interactions
Jankowsky, Eckhard; Harris, Michael E.
2016-01-01
Gene expression is regulated by complex networks of interactions between RNAs and proteins. Proteins that interact with RNA have been traditionally viewed as either specific or non-specific; specific proteins interact preferentially with defined RNA sequence or structure motifs, whereas non-specific proteins interact with RNA sites devoid of such characteristics. Recent studies indicate that the binary “specific vs. non-specific” classification is insufficient to describe the full spectrum of RNA–protein interactions. Here, we review new methods that enable quantitative measurements of protein binding to large numbers of RNA variants, and the concepts aimed at describing resulting binding spectra: affinity distributions, comprehensive binding models and free energy landscapes. We discuss how these new methodologies and associated concepts enable work towards inclusive, quantitative models for specific and non-specific RNA–protein interactions. PMID:26285679
Lee, Il Joon; Kim, Byeang Hyean
2012-02-18
Pairs of pyrene-modified deoxyadenosine ((Py)A) units induce a stable interstrand i-motif structure, which can be characterized by a change in the fluorescence λ(max), with an exciplex emission that is not observable in its single-strand structure. This journal is © The Royal Society of Chemistry 2012
Classification of proteins with shared motifs and internal repeats in the ECOD database
Kinch, Lisa N.; Liao, Yuxing
2016-01-01
Proteins and their domains evolve by a set of events commonly including the duplication and divergence of small motifs. The presence of short repetitive regions in domains has generally constituted a difficult case for structural domain classifications and their hierarchies. We developed the Evolutionary Classification Of protein Domains (ECOD) in part to implement a new schema for the classification of these types of proteins. Here we document the ways in which ECOD classifies proteins with small internal repeats, widespread functional motifs, and assemblies of small domain‐like fragments in its evolutionary schema. We illustrate the ways in which the structural genomics project impacted the classification and characterization of new structural domains and sequence families over the decade. PMID:26833690
Pan, Xiaoyong; Shen, Hong-Bin
2017-02-28
RNAs play key roles in cells through the interactions with proteins known as the RNA-binding proteins (RBP) and their binding motifs enable crucial understanding of the post-transcriptional regulation of RNAs. How the RBPs correctly recognize the target RNAs and why they bind specific positions is still far from clear. Machine learning-based algorithms are widely acknowledged to be capable of speeding up this process. Although many automatic tools have been developed to predict the RNA-protein binding sites from the rapidly growing multi-resource data, e.g. sequence, structure, their domain specific features and formats have posed significant computational challenges. One current difficulty is that the cross-source shared common knowledge is at a higher abstraction level beyond the observed data, resulting in a low efficiency of direct integration of observed data across domains. The other difficulty is how to interpret the prediction results. Existing approaches tend to terminate after outputting the potential discrete binding sites on the sequences, but how to assemble them into meaningful binding motifs is a topic worthy of further investigation. In view of these challenges, we propose a deep learning-based framework (iDeep) by using a novel hybrid convolutional neural network and deep belief network to predict the RBP interaction sites and motifs on RNAs. This new protocol is featured by transforming the original observed data into a high-level abstraction feature space using multiple layers of learning blocks, where the shared representations across different domains are integrated. To validate our iDeep method, we performed experiments on 31 large-scale CLIP-seq datasets, and our results show that by integrating multiple sources of data, the average AUC can be improved by 8% compared to the best single-source-based predictor; and through cross-domain knowledge integration at an abstraction level, it outperforms the state-of-the-art predictors by 6%. Besides the overall enhanced prediction performance, the convolutional neural network module embedded in iDeep is also able to automatically capture the interpretable binding motifs for RBPs. Large-scale experiments demonstrate that these mined binding motifs agree well with the experimentally verified results, suggesting iDeep is a promising approach in real-world applications. The iDeep framework can not only outperform the state-of-the-art predictors, but also easily capture interpretable binding motifs. iDeep is available at http://www.csbio.sjtu.edu.cn/bioinf/iDeep.
Statistical tests to compare motif count exceptionalities
Robin, Stéphane; Schbath, Sophie; Vandewalle, Vincent
2007-01-01
Background: Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Just comparing the motif count p-values in each sequence is indeed not sufficient to decide if this motif is significantly more exceptional in one sequence compared to the other one. A statistical test is required. Results: We develop and analyze two statistical tests, an exact binomial one and an asymptotic likelihood ratio test, to decide whether the exceptionality of a given motif is equivalent or significantly different in two sequences of interest. For that purpose, motif occurrences are modeled by Poisson processes, with special care for overlapping motifs. Both tests can take the sequence compositions into account. As an illustration, we compare the octamer exceptionalities in the Escherichia coli K-12 backbone versus variable strain-specific loops. Conclusion: The exact binomial test is particularly adapted for small counts. For large counts, we advise using the likelihood ratio test, which is asymptotic but strongly correlated with the exact binomial test and very simple to use. PMID:17346349
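The idea behind an exact binomial comparison of two Poisson counts can be sketched in a few lines. This is a minimal illustration, not necessarily the authors' exact formulation: it assumes the two motif counts are independent Poisson variables whose means are a common exceptionality factor times the expected counts `e1` and `e2` (hypothetical inputs obtained from some background model), and it ignores the overlap correction mentioned in the abstract. Under that null hypothesis, the first count conditioned on the total is binomial with success probability `e1 / (e1 + e2)`. The function names are illustrative only.

```python
from math import comb


def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n Bernoulli(p) trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def exact_binomial_test(n1, n2, e1, e2):
    """Two-sided p-value for 'the motif is equally exceptional in both sequences'.

    n1, n2: observed motif counts in sequence 1 and sequence 2.
    e1, e2: expected counts under the background model (assumed inputs).
    Given n1 + n2, n1 is Binomial(n1 + n2, e1 / (e1 + e2)) under the null.
    """
    n = n1 + n2
    p = e1 / (e1 + e2)
    observed = binomial_pmf(n1, n, p)
    # Sum the probabilities of all outcomes at most as likely as the observed one.
    total = 0.0
    for k in range(n + 1):
        pk = binomial_pmf(k, n, p)
        if pk <= observed * (1 + 1e-9):  # small tolerance for float ties
            total += pk
    return min(1.0, total)
```

For example, `exact_binomial_test(10, 0, 1.0, 1.0)` treats ten occurrences in one sequence and none in the other, with equal expectations, as the two most extreme outcomes out of 2^10 equally weighted assignments.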
SCOPE: a web server for practical de novo motif discovery.
Carlson, Jonathan M; Chakravarty, Arijit; DeZiel, Charles E; Gross, Robert H
2007-07-01
SCOPE is a novel parameter-free method for the de novo identification of potential regulatory motifs in sets of coordinately regulated genes. The SCOPE algorithm combines the output of three component algorithms, each designed to identify a particular class of motifs. Using an ensemble learning approach, SCOPE identifies the best candidate motifs from its component algorithms. In tests on experimentally determined datasets, SCOPE identified motifs with a significantly higher level of accuracy than a number of other web-based motif finders run with their default parameters. Because SCOPE has no adjustable parameters, the web server has an intuitive interface, requiring only a set of gene names or FASTA sequences and a choice of species. The most significant motifs found by SCOPE are displayed graphically on the main results page with a table containing summary statistics for each motif. Detailed motif information, including the sequence logo, PWM, consensus sequence and specific matching sites can be viewed through a single click on a motif. SCOPE's efficient, parameter-free search strategy has enabled the development of a web server that is readily accessible to the practising biologist while providing results that compare favorably with those of other motif finders. The SCOPE web server is at
Solution structure and DNA-binding properties of the C-terminal domain of UvrC from E.coli
Singh, S.; Folkers, G.E.; Bonvin, A.M.J.J.; Boelens, R.; Wechselberger, R.; Niztayev, A.; Kaptein, R.
2002-01-01
The C-terminal domain of the UvrC protein (UvrC CTD) is essential for 5′ incision in the prokaryotic nucleotide excision repair process. We have determined the three-dimensional structure of the UvrC CTD using heteronuclear NMR techniques. The structure shows two helix–hairpin–helix (HhH) motifs connected by a small connector helix. The UvrC CTD is shown to mediate structure-specific DNA binding. The domain binds to a single-stranded–double-stranded junction DNA, with a strong specificity towards looped duplex DNA that contains at least six unpaired bases per loop (‘bubble DNA’). Using chemical shift perturbation experiments, the DNA-binding surface is mapped to the first hairpin region encompassing the conserved glycine–valine–glycine residues followed by lysine–arginine–arginine, a positively charged surface patch and the second hairpin region consisting of glycine–isoleucine–serine. A model for the protein–DNA complex is proposed that accounts for this specificity. PMID:12426397
Nicoludis, John M; Lau, Sze-Yi; Schärfe, Charlotta P I; Marks, Debora S; Weihofen, Wilhelm A; Gaudet, Rachelle
2015-11-03
Clustered protocadherin (Pcdh) proteins mediate dendritic self-avoidance in neurons via specific homophilic interactions in their extracellular cadherin (EC) domains. We determined crystal structures of EC1-EC3, containing the homophilic specificity-determining region, of two mouse clustered Pcdh isoforms (PcdhγA1 and PcdhγC3) to investigate the nature of the homophilic interaction. Within the crystal lattices, we observe antiparallel interfaces consistent with a role in trans cell-cell contact. Antiparallel dimerization is supported by evolutionary correlations. Two interfaces, located primarily on EC2-EC3, involve distinctive clustered Pcdh structure and sequence motifs, lack predicted glycosylation sites, and contain residues highly conserved in orthologs but not paralogs, pointing toward their biological significance as homophilic interaction interfaces. These two interfaces are similar yet distinct, reflecting a possible difference in interaction architecture between clustered Pcdh subfamilies. These structures initiate a molecular understanding of clustered Pcdh assemblies that are required to produce functional neuronal networks. Copyright © 2015 Elsevier Ltd. All rights reserved.
Majumder, P; Choudhury, A; Banerjee, M; Lahiri, A; Bhattacharyya, N P
2007-08-01
To investigate the mechanism of increased expression of caspase-1 caused by exogenous Hippi, observed earlier in HeLa and Neuro2A cells, in this work we identified a specific motif AAAGACATG (-101 to -93) at the caspase-1 gene upstream sequence where HIPPI could bind. Various mutations in this specific sequence compromised the interaction, showing the specificity of the interactions. In the luciferase reporter assay, when the reporter gene was driven by caspase-1 gene upstream sequences (-151 to -92) with the mutation G to T at position -98, luciferase activity was decreased significantly in green fluorescent protein-Hippi-expressing HeLa cells in comparison to that obtained with the wild-type caspase-1 gene 60 bp upstream sequence, indicating the biological significance of such binding. It was observed that the C-terminal 'pseudo' death effector domain of HIPPI interacted with the 60 bp (-151 to -92) upstream sequence of the caspase-1 gene containing the motif. We further observed that expression of caspase-8 and caspase-10 was increased in green fluorescent protein-Hippi-expressing HeLa cells. In addition, HIPPI interacted in vitro with putative promoter sequences of these genes, containing a similar motif. In summary, we identified a novel function of HIPPI; it binds to specific upstream sequences of the caspase-1, caspase-8 and caspase-10 genes and alters the expression of the genes. This result showed the motif-specific interaction of HIPPI with DNA, and indicates that it could act as a transcription regulator.
Structural and sequence features of two residue turns in beta-hairpins.
Madan, Bharat; Seo, Sung Yong; Lee, Sun-Gu
2014-09-01
Beta-turns in beta-hairpins have been implicated as important sites in protein folding. In particular, two residue β-turns, the most abundant connecting elements in beta-hairpins, have been a major target for engineering protein stability and folding. In this study, we attempted to investigate and update the structural and sequence properties of two residue turns in beta-hairpins with a large data set. For this, 3977 beta-turns were extracted from 2394 nonhomologous protein chains and analyzed. First, the distribution, dihedral angles and twists of two residue turn types were determined, and compared with previous data. The trend of turn type occurrence and most structural features of the turn types were similar to previous results, but for the first time Type II turns in beta-hairpins were identified. Second, sequence motifs for the turn types were devised based on amino acid positional potentials of two-residue turns, and their distributions were examined. From this study, we could identify code-like sequence motifs for the two residue beta-turn types. Finally, structural and sequence properties of beta-strands in the beta-hairpins were analyzed, which revealed that the beta-strands showed no specific sequence and structural patterns for turn types. The analytical results in this study are expected to be a reference in the engineering or design of beta-hairpin turn structures and sequences. © 2014 Wiley Periodicals, Inc.
Gherghe, Cristina; Lombo, Tania; Leonard, Christopher W.; Datta, Siddhartha A. K.; Bess, Julian W.; Gorelick, Robert J.; Rein, Alan; Weeks, Kevin M.
2010-01-01
All retroviral genomic RNAs contain a cis-acting packaging signal by which dimeric genomes are selectively packaged into nascent virions. However, it is not understood how Gag (the viral structural protein) interacts with these signals to package the genome with high selectivity. We probed the structure of murine leukemia virus RNA inside virus particles using SHAPE, a high-throughput RNA structure analysis technology. These experiments showed that NC (the nucleic acid binding domain derived from Gag) binds within the virus to the sequence UCUG-UR-UCUG. Recombinant Gag and NC proteins bound to this same RNA sequence in dimeric RNA in vitro; in all cases, interactions were strongest with the first U and final G in each UCUG element. The RNA structural context is critical: High-affinity binding requires base-paired regions flanking this motif, and two UCUG-UR-UCUG motifs are specifically exposed in the viral RNA dimer. Mutating the guanosine residues in these two motifs—only four nucleotides per genomic RNA—reduced packaging 100-fold, comparable to the level of nonspecific packaging. These results thus explain the selective packaging of dimeric RNA. This paradigm has implications for RNA recognition in general, illustrating how local context and RNA structure can create information-rich recognition signals from simple single-stranded sequence elements in large RNAs. PMID:20974908
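A sequence-level scan for the motif named in this abstract might look like the sketch below. It assumes that the hyphens in UCUG-UR-UCUG merely separate contiguous elements and that R follows the IUPAC convention (purine, A or G); both are reading assumptions, and the function name and test sequences are hypothetical. As the abstract stresses, a sequence match alone does not imply binding, since the flanking base-paired context matters.

```python
import re

# R = purine (A or G). The lookahead lets overlapping matches be reported.
PACKAGING_MOTIF = re.compile(r"(?=(UCUGU[AG]UCUG))")


def find_motif(rna):
    """Return 0-based start positions of every UCUG-UR-UCUG match.

    Accepts RNA or DNA-style input; T is converted to U before scanning.
    """
    normalized = rna.upper().replace("T", "U")
    return [m.start() for m in PACKAGING_MOTIF.finditer(normalized)]
```

Calling `find_motif` on a toy string such as `"AAUCUGUAUCUGCC"` reports the single match starting at index 2; a pyrimidine at the R position yields no match.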
Programmable assembly of nanoarchitectures using genetically engineered viruses.
Huang, Yu; Chiang, Chung-Yi; Lee, Soo Kwan; Gao, Yan; Hu, Evelyn L; De Yoreo, James; Belcher, Angela M
2005-07-01
Biological systems possess inherent molecular recognition and self-assembly capabilities and are attractive templates for constructing complex material structures with molecular precision. Here we report the assembly of various nanoarchitectures including nanoparticle arrays, hetero-nanoparticle architectures, and nanowires utilizing highly engineered M13 bacteriophage as templates. The genome of M13 phage can be rationally engineered to produce viral particles with distinct substrate-specific peptides expressed on the filamentous capsid and the ends, providing a generic template for programmable assembly of complex nanostructures. Phage clones with gold-binding motifs on the capsid and streptavidin-binding motifs at one end are created and used to assemble Au and CdSe nanocrystals into ordered one-dimensional arrays and more complex geometries. Initial studies show such nanoparticle arrays can further function as templates to nucleate highly conductive nanowires that are important for addressing/interconnecting individual nanostructures.
Panczyk, Tomasz; Wolski, Pawel
2018-06-01
This work deals with a molecular dynamics analysis of the protonated and deprotonated states of the natural sequence d[(CCCTAA)3CCCT] of the telomeric DNA forming the intercalated i-motif or paired with the sequence d[(CCCTAA)3CCCT] and forming the Watson-Crick (WC) duplex. By utilizing the AMBER force field for nucleic acids we built the i-motif and the WC duplex either with native cytosines or using their protonated forms. We studied, by applying molecular dynamics simulations, the role of hydrogen bonds between cytosines or in cytosine-guanine pairs in the stabilization of both structures in the physiological fluid. We found that hydrogen bonds exist in the case of the protonated i-motif and in the standard form of the WC duplex. They, however, vanish in the case of the deprotonated i-motif and the protonated form of the WC duplex. By determining potentials of mean force in the enforced unwrapping of these structures we found that the protonated i-motif is thermodynamically the most stable. Its deprotonation leads to spontaneous unfolding of the i-motif to the hairpin structure at normal temperature, which was observed directly in the unbiased calculations. The WC duplex is stable in its standard form and its slight destabilization is observed at acidic pH. However, the protonated WC duplex unwraps very slowly at 310 K and its decomposition was not observed in the unbiased calculations. At higher temperatures (ca. 400 K or more) the WC duplex unwraps spontaneously. Copyright © 2018. Published by Elsevier B.V.
Fan, Jiqiang; Song, Yongbo; Chai, Jinsong; Yang, Sha; Chen, Tao; Rao, Bo; Yu, Haizhu; Zhu, Manzhou
2016-08-18
We report the observation of new doping behavior in Au36-xAgx(SR)24 nanoclusters (NCs) with x = 1 to 8. The atomic arrangements of Au and Ag atoms are determined by X-ray crystallography. The new gold-silver bimetallic NCs share the same framework as that of the homogold counterpart, i.e. possessing an fcc-type Au28 kernel, four dimeric AuAg(SR)3 staple motifs and twelve simple bridging SR ligands. Interestingly, all the Ag dopants in the Au36-xAgx(SR)24 NCs are selectively incorporated into the surface motifs, which is in contrast to the previously reported Au-Ag alloy structures with the Ag dopants preferentially displacing the core gold atoms. This distinct doping behavior implies that the previous assignments of an fcc Au28 core with four dimers and 12 bridging thiolates for Au36(SR)24 are more justified than other assignments of core vs. surface motifs. The UV-Vis absorption spectrum of Au36-xAgx(SR)24 is almost the same as that of Au36(SR)24, indicating that the Ag dopants in the motifs do not change the optical properties. The similar UV-Vis spectra are further confirmed by TD-DFT calculations. DFT also reveals that the energies of the HOMO and LUMO of the motif-doped AuAg alloy NC are comparable to those of the homogold Au36 NC, indicating that the electronic structure is not disturbed by the motif Ag dopants. Overall, this study reveals a new silver-doping mode in alloy NCs.
Syntactic structures in languages and biology.
Horn, David
2008-08-01
Both natural languages and cell biology make use of one-dimensional encryption. Their investigation calls for syntactic deciphering of the text and semantic understanding of the resulting structures. Here we discuss recently published algorithms that allow for such searches: automatic distillation of structure (ADIOS) that is successful in discovering syntactic structures in linguistic texts and its motif extraction (MEX) component that can be used for uncovering motifs in DNA and protein sequences. The underlying principles of these syntactic algorithms and some of their results will be described.
Kim, Yoonjung; Lee, Myeongsang; Choi, Hyunsung; Baek, Inchul; Kim, Jae In; Na, Sungsoo
2018-04-01
Silk materials are receiving significant attention as base materials for various functional nanomaterials and nanodevices, due to their exceptionally high mechanical properties, biocompatibility, and degradability. Although crystalline silk regions are composed of various repetitive motifs with differing amino acid sequences, how humidity affects each motif and its structural characteristics remains unclear. We report molecular dynamics (MD) simulations of various silkworm fibroins composed of the major motifs (i.e. (GAGAGS)n, (GAGAGA)n, and (GAGAGY)n) at varying degrees of hydration. The MD simulations reveal how each major motif changes at each degree of hydration, and steered molecular dynamics simulations characterize its structural properties from a mechanical perspective. Our results explain what effects humidity can have on nanoscale materials and devices consisting of crystalline silk materials.
Collet, Jean-Francois; Peisach, Daniel; Bardwell, James C.A.; Xu, Zhaohui
2005-01-01
Escherichia coli thioredoxin is a small monomeric protein that reduces disulfide bonds in cytoplasmic proteins. Two cysteine residues present in a conserved CGPC motif are essential for this activity. Recently, we identified mutations of this motif that changed thioredoxin into a homodimer bridged by a [2Fe-2S] iron–sulfur cluster. When exported to the periplasm, these thioredoxin mutants could restore disulfide bond formation in strains lacking the entire periplasmic oxidative pathway. Essential for the assembly of the iron–sulfur was an additional cysteine that replaced the proline at position three of the CGPC motif. We solved the crystalline structure at 2.3 Å for one of these variants, TrxA(CACA). The mutant protein crystallized as a dimer in which the iron–sulfur cluster is replaced by two intermolecular disulfide bonds. The catalytic site, which forms the dimer interface, crystallized in two different conformations. In one of them, the replacement of the CGPC motif by CACA has a dramatic effect on the structure and causes the unraveling of an extended α-helix. In both conformations, the second cysteine residue of the CACA motif is surface-exposed, which contrasts with wildtype thioredoxin where the second cysteine of the CXXC motif is buried. This exposure of a pair of vicinal cysteine residues apparently allows thioredoxin to acquire an iron–sulfur cofactor at its active site, and thus a new activity and mechanism of action. PMID:15987909
Collet, Jean-Francois; Peisach, Daniel; Bardwell, James C A; Xu, Zhaohui
2005-07-01
Escherichia coli thioredoxin is a small monomeric protein that reduces disulfide bonds in cytoplasmic proteins. Two cysteine residues present in a conserved CGPC motif are essential for this activity. Recently, we identified mutations of this motif that changed thioredoxin into a homodimer bridged by a [2Fe-2S] iron-sulfur cluster. When exported to the periplasm, these thioredoxin mutants could restore disulfide bond formation in strains lacking the entire periplasmic oxidative pathway. Essential for the assembly of the iron-sulfur was an additional cysteine that replaced the proline at position three of the CGPC motif. We solved the crystalline structure at 2.3 Angstroms for one of these variants, TrxA(CACA). The mutant protein crystallized as a dimer in which the iron-sulfur cluster is replaced by two intermolecular disulfide bonds. The catalytic site, which forms the dimer interface, crystallized in two different conformations. In one of them, the replacement of the CGPC motif by CACA has a dramatic effect on the structure and causes the unraveling of an extended alpha-helix. In both conformations, the second cysteine residue of the CACA motif is surface-exposed, which contrasts with wildtype thioredoxin where the second cysteine of the CXXC motif is buried. This exposure of a pair of vicinal cysteine residues apparently allows thioredoxin to acquire an iron-sulfur cofactor at its active site, and thus a new activity and mechanism of action.
Florence, Alastair J; Johnston, Andrea; Price, Sarah L; Nowell, Harriott; Kennedy, Alan R; Shankland, Norman
2006-09-01
An automated parallel crystallisation search for physical forms of carbamazepine, covering 66 solvents and five crystallisation protocols, identified three anhydrous polymorphs (forms I-III), one hydrate and eight organic solvates, including the single-crystal structures of three previously unreported solvates (N,N-dimethylformamide (1:1); hemi-furfural; hemi-1,4-dioxane). Correlation of physical form outcome with the crystallisation conditions demonstrated that the solvent adopts a relatively nonspecific role in determining which polymorph is obtained, and that the previously reported effect of a polymer template facilitating the formation of form IV could not be reproduced by solvent crystallisation alone. In the accompanying computational search, approximately half of the energetically feasible predicted crystal structures exhibit the C=O...H--N R2(2)(8) dimer motif that is observed in the known polymorphs, with the most stable correctly corresponding to form III. Most of the other energetically feasible structures, including the global minimum, have a C=O...H--N C(4) chain hydrogen bond motif. No such chain structures were observed in this or any other previously published work, suggesting that kinetic, rather than thermodynamic, factors determine which of the energetically feasible crystal structures are observed experimentally, with the kinetics apparently favouring nucleation of crystal structures based on the CBZ-CBZ R2(2)(8) motif. (c) 2006 Wiley-Liss, Inc. and the American Pharmacists Association.
Golebiowski, Jérôme; Antonczak, Serge; Di-Giorgio, Audrey; Condom, Roger; Cabrol-Bass, Daniel
2004-02-01
The dynamic behavior of the HCV IRES IIId domain is analyzed by means of a 2.6-ns molecular dynamics simulation, starting from an NMR structure. The simulation is carried out in explicit water with Na+ counterions, and particle-mesh Ewald summation is used for the electrostatic interactions. In this work, we analyze selected patterns of the helix that are crucial for IRES activity and that could be considered as targets for the intervention of inhibitors, such as the hexanucleotide terminal loop (more particularly its three consecutive guanines) and the loop-E motif. The simulation has allowed us to analyze the dynamics of the loop substructure and has revealed a behavior among the guanine bases that might explain the different role of the third guanine of the GGG triplet upon molecular recognition. The accessibility of the loop-E motif and the loop major and minor groove is also examined, as well as the effect of Na+ or Mg2+ counterion within the simulation. The electrostatic analysis reveals several ion pockets, not discussed in the experimental structure. The positions of these ions are useful for locating specific electrostatic recognition sites for potential inhibitor binding.
Chaotic Motifs in Gene Regulatory Networks
Zhang, Zhaoyang; Ye, Weiming; Qian, Yu; Zheng, Zhigang; Huang, Xuhui; Hu, Gang
2012-01-01
Chaos should occur often in gene regulatory networks (GRNs) which have been widely described by nonlinear coupled ordinary differential equations, if their dimensions are no less than 3. It is therefore puzzling that chaos has never been reported in GRNs in nature and is also extremely rare in models of GRNs. On the other hand, the topic of motifs has attracted great attention in studying biological networks, and network motifs are suggested to be elementary building blocks that carry out some key functions in the network. In this paper, chaotic motifs (subnetworks with chaos) in GRNs are systematically investigated. The conclusion is that: (i) chaos can only appear through competitions between different oscillatory modes with rivaling intensities. Conditions required for chaotic GRNs are found to be very strict, which make chaotic GRNs extremely rare. (ii) Chaotic motifs are explored as the simplest few-node structures capable of producing chaos, and serve as the intrinsic source of chaos of random few-node GRNs. Several optimal motifs causing chaos with atypically high probability are figured out. (iii) Moreover, we discovered that a number of special oscillators can never produce chaos. These structures bring some advantages on rhythmic functions and may help us understand the robustness of diverse biological rhythms. (iv) The methods of dominant phase-advanced driving (DPAD) and DPAD time fraction are proposed to quantitatively identify chaotic motifs and to explain the origin of chaotic behaviors in GRNs. PMID:22792171
Building a stable RNA U-turn with a protonated cytidine.
Gottstein-Schmidtke, Sina R; Duchardt-Ferner, Elke; Groher, Florian; Weigand, Julia E; Gottstein, Daniel; Suess, Beatrix; Wöhnert, Jens
2014-08-01
The U-turn is a classical three-dimensional RNA folding motif first identified in the anticodon and T-loops of tRNAs. It also occurs frequently as a building block in other functional RNA structures in many different sequence and structural contexts. U-turns induce sharp changes in the direction of the RNA backbone and often conform to the 3-nt consensus sequence 5'-UNR-3' (N = any nucleotide, R = purine). The canonical U-turn motif is stabilized by a hydrogen bond between the N3 imino group of the U residue and the 3' phosphate group of the R residue as well as a hydrogen bond between the 2'-hydroxyl group of the uridine and the N7 nitrogen of the R residue. Here, we demonstrate that a protonated cytidine can functionally and structurally replace the uridine at the first position of the canonical U-turn motif in the apical loop of the neomycin riboswitch. Using NMR spectroscopy, we directly show that the N3 imino group of the protonated cytidine forms a hydrogen bond with the backbone phosphate 3' from the third nucleotide of the U-turn analogously to the imino group of the uridine in the canonical motif. In addition, we compare the stability of the hydrogen bonds in the mutant U-turn motif to the wild type and describe the NMR signature of the C+-phosphate interaction. Our results have implications for the prediction of RNA structural motifs and suggest simple approaches for the experimental identification of hydrogen bonds between protonated C-imino groups and the phosphate backbone. © 2014 Gottstein-Schmidtke et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
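The 5'-UNR-3' consensus described above can be turned into a simple sequence scan. The following is a minimal sketch, not the authors' method: the function name `find_unr_sites` and the toy sequences are hypothetical, and a sequence match only flags a candidate, since whether a U-turn actually forms depends on the structural context.

```python
import re

# IUPAC codes used in the abstract: N = any nucleotide, R = purine (A or G).
# A lookahead is used so that overlapping candidates are all reported.
UNR_PATTERN = re.compile(r"(?=(U[ACGU][AG]))")


def find_unr_sites(rna):
    """Return (0-based start, triplet) for every 5'-UNR-3' match in rna.

    Hypothetical helper: DNA-style input is accepted by converting T to U.
    """
    rna = rna.upper().replace("T", "U")
    return [(m.start(), m.group(1)) for m in UNR_PATTERN.finditer(rna)]


if __name__ == "__main__":
    # Toy sequence, not taken from the neomycin riboswitch.
    print(find_unr_sites("GGUGAC"))
```

Extending the first position to `[UC]` would also flag candidate sites for the cytidine-containing variant reported here, at the cost of many more false positives, because protonation of the cytidine cannot be inferred from sequence.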
Busslinger, M; Portmann, R; Irminger, J C; Birnstiel, M L
1980-01-01
The DNA sequences of the entire structural H4, H3, H2A and H2B genes and of their 5' flanking regions have been determined in the histone DNA clone h19 of the sea urchin Psammechinus miliaris. In clone h19 the polarity of transcription and the relative arrangement of the histone genes are identical to those in clone h22 of the same species. The histone proteins encoded by h19 DNA differ in their primary structure from those encoded by clone h22 and have been compared to histone protein sequences of other sea urchin species as well as other eukaryotes. A comparative analysis of the 5' flanking DNA sequences of the structural histone genes in both clones revealed four ubiquitous sequence motifs: a pentameric element GATCC, followed at short distance by the Hogness box GTATAAATAG, a conserved sequence PyCATTCPu, in or near which the 5' ends of the mRNAs map in h22 DNA, and lastly a sequence A, containing the initiation codon. These sequences are also found, sometimes in modified form, in front of other eukaryotic genes transcribed by polymerase II. When prelude sequences of isocoding histone genes in clones h19 and h22 are compared, areas of homology are seen to extend beyond the ubiquitous sequence motifs towards the divergent AT-rich spacer and terminate between approximately 140 and 240 nucleotides away from the structural gene. These prelude regions contain quite large conservative sequence blocks which are specific for each type of histone gene. PMID:7443547
Computation-Guided Backbone Grafting of a Discontinuous Motif onto a Protein Scaffold
DOE Office of Scientific and Technical Information (OSTI.GOV)
Azoitei, Mihai L.; Correia, Bruno E.; Ban, Yih-En Andrew
2012-02-07
The manipulation of protein backbone structure to control interaction and function is a challenge for protein engineering. We integrated computational design with experimental selection for grafting the backbone and side chains of a two-segment HIV gp120 epitope, targeted by the cross-neutralizing antibody b12, onto an unrelated scaffold protein. The final scaffolds bound b12 with high specificity and with affinity similar to that of gp120, and crystallographic analysis of a scaffold bound to b12 revealed high structural mimicry of the gp120-b12 complex structure. The method can be generalized to design other functional proteins through backbone grafting.
Han, Dianwei; Zhang, Jun; Tang, Guiliang
2012-01-01
An accurate prediction of the pre-microRNA secondary structure is important in miRNA informatics. Based on a recently proposed model, nucleotide cyclic motifs (NCM), to predict RNA secondary structure, we propose and implement a Modified NCM (MNCM) model with a physics-based scoring strategy to tackle the problem of pre-microRNA folding. Our microRNAfold is implemented using a global optimal algorithm based on the bottom-up local optimal solutions. Our experimental results show that microRNAfold outperforms the current leading prediction tools in terms of True Negative rate, False Negative rate, Specificity, and Matthews coefficient ratio.
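For readers unfamiliar with the metrics named above, the Matthews correlation coefficient and specificity can be computed from a confusion matrix as in the sketch below. The function names and the counts are illustrative assumptions; none of the numbers come from the microRNAfold evaluation.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when any marginal total is zero, a common convention
    for the otherwise undefined 0/0 case.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


def specificity(tn, fp):
    """True negative rate: fraction of actual negatives predicted negative."""
    return tn / (tn + fp)


if __name__ == "__main__":
    # Hypothetical base-pair counts for one predicted secondary structure.
    print(matthews_corrcoef(tp=40, tn=50, fp=5, fn=5))
    print(specificity(tn=50, fp=5))
```

Unlike specificity alone, the coefficient uses all four cells of the confusion matrix, so it is less easily inflated when unpaired positions greatly outnumber paired ones.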
SiteBinder: an improved approach for comparing multiple protein structural motifs.
Sehnal, David; Vařeková, Radka Svobodová; Huber, Heinrich J; Geidl, Stanislav; Ionescu, Crina-Maria; Wimmerová, Michaela; Koča, Jaroslav
2012-02-27
There is a paramount need to develop new techniques and tools that will extract as much information as possible from the ever growing repository of protein 3D structures. We report here on the development of a software tool for the multiple superimposition of large sets of protein structural motifs. Our superimposition methodology performs a systematic search for the atom pairing that provides the best fit. During this search, the RMSD values for all chemically relevant pairings are calculated by quaternion algebra. The number of evaluated pairings is markedly decreased by using PDB annotations for atoms. This approach guarantees that the best fit will be found and can be applied even when sequence similarity is low or does not exist at all. We have implemented this methodology in the Web application SiteBinder, which is able to process up to thousands of protein structural motifs in a very short time, and which provides an intuitive and user-friendly interface. Our benchmarking analysis has shown the robustness, efficiency, and versatility of our methodology and its implementation by the successful superimposition of 1000 experimentally determined structures for each of 32 eukaryotic linear motifs. We also demonstrate the applicability of SiteBinder using three case studies. We first compared the structures of 61 PA-IIL sugar binding sites containing nine different sugars, and we found that the sugar binding sites of PA-IIL and its mutants have a conserved structure despite their binding different sugars. We then superimposed over 300 zinc finger central motifs and revealed that the molecular structure in the vicinity of the Zn atom is highly conserved. Finally, we superimposed 12 BH3 domains from pro-apoptotic proteins. Our findings come to support the hypothesis that there is a structural basis for the functional segregation of BH3-only proteins into activators and enablers.
Mechanisms of Lin28-Mediated miRNA and mRNA Regulation—A Structural and Functional Perspective
Mayr, Florian; Heinemann, Udo
2013-01-01
Lin28 is an essential RNA-binding protein that is ubiquitously expressed in embryonic stem cells. Its physiological function has been linked to the regulation of differentiation, development, and oncogenesis as well as glucose metabolism. Lin28 mediates these pleiotropic functions by inhibiting let-7 miRNA biogenesis and by modulating the translation of target mRNAs. Both activities strongly depend on Lin28’s RNA-binding domains (RBDs), an N-terminal cold-shock domain (CSD) and a C-terminal Zn-knuckle domain (ZKD). Recent biochemical and structural studies revealed the mechanisms of how Lin28 controls let-7 biogenesis. Lin28 binds to the terminal loop of pri- and pre-let-7 miRNA and represses their processing by Drosha and Dicer. Several biochemical and structural studies showed that the specificity of this interaction is mainly mediated by the ZKD with a conserved GGAGA or GGAGA-like motif. Further RNA crosslinking and immunoprecipitation coupled to high-throughput sequencing (CLIP-seq) studies confirmed this binding motif and uncovered a large number of new mRNA binding sites. Here we review exciting recent progress in our understanding of how Lin28 binds structurally diverse RNAs and fulfills its pleiotropic functions. PMID:23939427
Taylor, Gregory K.; Stoddard, Barry L.
2012-01-01
Homing endonucleases (HEs) are highly specific DNA-cleaving enzymes that are encoded by invasive DNA elements (usually mobile introns or inteins) within the genomes of phage, bacteria, archaea, protista and eukaryotic organelles. Six unique structural HE families, that collectively span four distinct nuclease catalytic motifs, have been characterized to date. Members of each family display structural homology and functional relationships to a wide variety of proteins from various organisms. The biological functions of those proteins are highly disparate and include non-specific DNA-degradation enzymes, restriction endonucleases, DNA-repair enzymes, resolvases, intron splicing factors and transcription factors. These relationships suggest that modern day HEs share common ancestors with proteins involved in genome fidelity, maintenance and gene expression. This review summarizes the results of structural studies of HEs and corresponding proteins from host organisms that have illustrated the manner in which these factors are related. PMID:22406833
Lu, Cheng-Tsung; Huang, Kai-Yao; Su, Min-Gang; Lee, Tzong-Yi; Bretaña, Neil Arvin; Chang, Wen-Chi; Chen, Yi-Ju; Chen, Yu-Ju; Huang, Hsien-Da
2013-01-01
Protein modification is an extremely important post-translational regulation that adjusts the physical and chemical properties, conformation, stability and activity of a protein, thus altering protein function. Due to the high throughput of mass spectrometry (MS)-based methods in identifying site-specific post-translational modifications (PTMs), dbPTM (http://dbPTM.mbc.nctu.edu.tw/) is updated to integrate experimental PTMs obtained from public resources as well as manually curated MS/MS peptides associated with PTMs from research articles. Version 3.0 of dbPTM aims to be an informative resource for investigating the substrate specificity of PTM sites and functional association of PTMs between substrates and their interacting proteins. In order to investigate the substrate specificity for modification sites, a newly developed statistical method has been applied to identify the significant substrate motifs for each type of PTM containing sufficient experimental data. According to the data statistics in dbPTM, >60% of PTM sites are located in the functional domains of proteins. It is known that most PTMs can create binding sites for specific protein-interaction domains that work together for cellular function. Thus, this update integrates protein-protein interaction and domain-domain interaction to determine the functional association of PTM sites located in protein-interacting domains. Additionally, the information of structural topologies on transmembrane (TM) proteins is integrated in dbPTM in order to delineate the structural correlation between the reported PTM sites and TM topologies. To facilitate the investigation of PTMs on TM proteins, the PTM substrate sites and the structural topology are graphically represented. Literature information related to PTMs, orthologous conservation and substrate motifs of PTMs are also provided in the resource. Finally, this version features an improved web interface to facilitate convenient access to the resource.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schormann, Norbert; Zhukovskaya, Natalia; Bedwell, Gregory
We report that uracil-DNA glycosylases are ubiquitous enzymes, which play a key role in repairing damage in DNA and in maintaining genomic integrity by catalyzing the first step in the base excision repair pathway. Within the superfamily of uracil-DNA glycosylases, family I enzymes, or UNGs, are specific for recognizing and removing uracil from DNA. These enzymes feature conserved structural folds and active site residues, and use common motifs for DNA binding, uracil recognition and catalysis. Within this family the enzymes of poxviruses are unique and most remarkable in terms of amino acid sequences, characteristic motifs and, more importantly, their novel non-enzymatic function in DNA replication. UNG of vaccinia virus, also known as D4, is the most extensively characterized UNG of the poxvirus family. D4 forms an unusual heterodimeric processivity factor by attaching to a poxvirus-specific protein A20, which also binds to the DNA polymerase E9 and recruits other proteins necessary for replication. D4 is thus integrated in the DNA polymerase complex, and its DNA-binding and DNA scanning abilities couple DNA processivity and DNA base excision repair at the replication fork. In conclusion, the adaptations necessary for taking on the new function are reflected in the amino acid sequence and the three-dimensional structure of D4. We provide an overview of the current state of the knowledge on the structure-function relationship of D4.
The Janus Kinase (JAK) FERM and SH2 Domains: Bringing Specificity to JAK-Receptor Interactions.
Ferrao, Ryan; Lupardus, Patrick J
2017-01-01
The Janus kinases (JAKs) are non-receptor tyrosine kinases essential for signaling in response to cytokines and interferons and thereby control many essential functions in growth, development, and immune regulation. JAKs are unique among tyrosine kinases for their constitutive yet non-covalent association with class I and II cytokine receptors, which upon cytokine binding bring together two JAKs to create an active signaling complex. JAK association with cytokine receptors is facilitated by N-terminal FERM and SH2 domains, both of which are classical mediators of peptide interactions. Together, the JAK FERM and SH2 domains mediate a bipartite interaction with two distinct receptor peptide motifs, the proline-rich "Box1" and hydrophobic "Box2," which are present in the intracellular domain of cytokine receptors. While the general sidechain chemistry of Box1 and Box2 peptides is conserved between receptors, they share very weak primary sequence homology, making it impossible to posit why certain JAKs preferentially interact with and signal through specific subsets of cytokine receptors. Here, we review the structure and function of the JAK FERM and SH2 domains in light of several recent studies that reveal their atomic structure and elucidate interaction mechanisms with both the Box1 and Box2 receptor motifs. These crystal structures demonstrate how evolution has repurposed the JAK FERM and SH2 domains into a receptor-binding module that facilitates interactions with multiple receptors possessing diverse primary sequences.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Knappenberger, Andrew John; Reiss, Caroline Wetherington; Strobel, Scott A.
Two classes of riboswitches related to the ykkC guanidine-I riboswitch bind phosphoribosyl pyrophosphate (PRPP) and guanosine tetraphosphate (ppGpp). Here we report the co-crystal structure of the PRPP aptamer and its ligand. We also report the structure of the G96A point mutant that prefers ppGpp over PRPP with a dramatic 40,000-fold switch in specificity. The ends of the aptamer form a helix that is not present in the guanidine aptamer and is involved in the expression platform. In the mutant, the base of ppGpp replaces G96 in three-dimensional space. This disrupts the S-turn, which is a primary structural feature of the ykkC RNA motif. These dramatic differences in ligand specificity are achieved with minimal mutations. ykkC aptamers are therefore a prime example of an RNA fold with a rugged fitness landscape. The ease with which the ykkC aptamer acquires new specificity represents a striking case of evolvability in RNA.
Knappenberger, Andrew John; Reiss, Caroline Wetherington; Strobel, Scott A
2018-06-07
Two classes of riboswitches related to the ykkC guanidine-I riboswitch bind phosphoribosyl pyrophosphate (PRPP) and guanosine tetraphosphate (ppGpp). Here we report the co-crystal structure of the PRPP aptamer and its ligand. We also report the structure of the G96A point mutant that prefers ppGpp over PRPP with a dramatic 40,000-fold switch in specificity. The ends of the aptamer form a helix that is not present in the guanidine aptamer and is involved in the expression platform. In the mutant, the base of ppGpp replaces G96 in three-dimensional space. This disrupts the S-turn, which is a primary structural feature of the ykkC RNA motif. These dramatic differences in ligand specificity are achieved with minimal mutations. ykkC aptamers are therefore a prime example of an RNA fold with a rugged fitness landscape. The ease with which the ykkC aptamer acquires new specificity represents a striking case of evolvability in RNA. © 2018, Knappenberger et al.
Analysis of zinc binding sites in protein crystal structures.
Alberts, I L; Nadassy, K; Wodak, S J
1998-08-01
The geometrical properties of zinc binding sites in a dataset of high quality protein crystal structures deposited in the Protein Data Bank have been examined to identify important differences between zinc sites that are directly involved in catalysis and those that play a structural role. Coordination angles in the zinc primary coordination sphere are compared with ideal values for each coordination geometry, and zinc coordination distances are compared with those in small zinc complexes from the Cambridge Structural Database as a guide of expected trends. We find that distances and angles in the primary coordination sphere are in general close to the expected (or ideal) values. Deviations occur primarily for oxygen coordinating atoms and are found to be mainly due to H-bonding of the oxygen coordinating ligand to protein residues, bidentate binding arrangements, and multi-zinc sites. We find that H-bonding of oxygen containing residues (or water) to zinc bound histidines is almost universal in our dataset and defines the elec-His-Zn motif. Analysis of the stereochemistry shows that carboxyl elec-His-Zn motifs are geometrically rigid, while water elec-His-Zn motifs show the most geometrical variation. As catalytic motifs have a higher proportion of carboxyl elec atoms than structural motifs, they provide a more rigid framework for zinc binding. This is understood biologically, as a small distortion in the zinc position in an enzyme can have serious consequences on the enzymatic reaction. We also analyze the sequence pattern of the zinc ligands and residues that provide elecs, and identify conserved hydrophobic residues in the endopeptidases that also appear to contribute to stabilizing the catalytic zinc site. A zinc binding template in protein crystal structures is derived from these observations.
Statistical Methods for Identifying Sequence Motifs Affecting Point Mutations
Zhu, Yicheng; Neeman, Teresa; Yap, Von Bing; Huttley, Gavin A.
2017-01-01
Mutation processes differ between types of point mutation, genomic locations, cells, and biological species. For some point mutations, specific neighboring bases are known to be mechanistically influential. Beyond these cases, numerous questions remain unresolved, including: what are the sequence motifs that affect point mutations? How large are the motifs? Are they strand symmetric? And, do they vary between samples? We present new log-linear models that allow explicit examination of these questions, along with sequence logo style visualization to enable identifying specific motifs. We demonstrate the performance of these methods by analyzing mutation processes in human germline and malignant melanoma. We recapitulate the known CpG effect, and identify novel motifs, including a highly significant motif associated with A→G mutations. We show that major effects of neighbors on germline mutation lie within ±2 of the mutating base. Models are also presented for contrasting the entire mutation spectra (the distribution of the different point mutations). We show the spectra vary significantly between autosomes and X-chromosome, with a difference in T→C transition dominating. Analyses of malignant melanoma confirmed reported characteristic features of this cancer, including statistically significant strand asymmetry, and markedly different neighboring influences. The methods we present are made freely available as a Python library https://bitbucket.org/pycogent3/mutationmotif. PMID:27974498
A single thiazole orange molecule forms an exciplex in a DNA i-motif.
Xu, Baochang; Wu, Xiangyang; Yeow, Edwin K L; Shao, Fangwei
2014-06-18
A fluorescent exciplex of thiazole orange (TO) is formed in a single-dye conjugated DNA i-motif. The exciplex fluorescence exhibits a large Stokes shift, high quantum yield, robust response to pH oscillation and little structural disturbance to the DNA quadruplex, which can be used to monitor the folding of high-order DNA structures.
Rigoutsos, Isidore; Riek, Peter; Graham, Robert M; Novotny, Jiri
2003-08-01
One of the promising methods of protein structure prediction involves the use of amino acid sequence-derived patterns. Here we report on the creation of non-degenerate motif descriptors derived through data mining of training sets of residues taken from the transmembrane-spanning segments of polytopic proteins. These residues correspond to short regions in which there is a deviation from the regular alpha-helical character (i.e. pi-helices, 3(10)-helices and kinks). A 'search engine' derived from these motif descriptors correctly identifies, and discriminates amongst instances of the above 'non-canonical' helical motifs contained in the SwissProt/TrEMBL database of protein primary structures. Our results suggest that deviations from alpha-helicity are encoded locally in sequence patterns only about 7-9 residues long and can be determined in silico directly from the amino acid sequence. Delineation of such variations in helical habit is critical to understanding the complex structure-function relationships of polytopic proteins and for drug discovery. The success of our current methodology foretells development of similar prediction tools capable of identifying other structural motifs from sequence alone. The method described here has been implemented and is available on the World Wide Web at http://cbcsrv.watson.ibm.com/Ttkw.html.
Wustman, Brandon A; Santos, Rudolpho; Zhang, Bo; Evans, John Spencer
2002-12-05
Fracture resistance in biomineralized structures has been linked to the presence of proteins, some of which possess sequences that are associated with elastic behavior. One such protein superfamily, the Pro,Gly-rich sea urchin intracrystalline spicule matrix proteins, form protein-protein supramolecular assemblies that modify the microstructure and fracture-resistant properties of the calcium carbonate mineral phase within embryonic sea urchin spicules and adult sea urchin spines. In this report, we detail the identification of a repetitive keratin-like "glycine-loop"- or coil-like structure within the 34-AA (AA: amino acid) N-terminal domain, (PGMG)(8)PG, of the spicule matrix protein, PM27. The identification of this repetitive structural motif was accomplished using two capped model peptides: a 9-AA sequence, GPGMGPGMG, and a 34-AA peptide representing the entire motif. Using CD, NMR spectrometry, and molecular dynamics simulated annealing/minimization simulations, we have determined that the 9-AA model peptide adopts a loop-like structure at pH 7.4. The structure of the 34-AA polypeptide resembles a coil structure consisting of repeating loop motifs that do not exhibit long-range ordering. Given that loop structures have been associated with protein elastic behavior and protein motion, it is plausible that the 34-AA Pro,Gly,Met repeat sequence motif in PM27 represents a putative elastic or mobile domain. Copyright 2002 Wiley Periodicals, Inc.
Blind prediction of noncanonical RNA structure at atomic accuracy.
Watkins, Andrew M; Geniesse, Caleb; Kladwang, Wipapat; Zakrevsky, Paul; Jaeger, Luc; Das, Rhiju
2018-05-01
Prediction of RNA structure from nucleotide sequence remains an unsolved grand challenge of biochemistry and requires distinct concepts from protein structure prediction. Despite extensive algorithmic development in recent years, modeling of noncanonical base pairs of new RNA structural motifs has not been achieved in blind challenges. We report a stepwise Monte Carlo (SWM) method with a unique add-and-delete move set that enables predictions of noncanonical base pairs of complex RNA structures. A benchmark of 82 diverse motifs establishes the method's general ability to recover noncanonical pairs ab initio, including multistrand motifs that have been refractory to prior approaches. In a blind challenge, SWM models predicted nucleotide-resolution chemical mapping and compensatory mutagenesis experiments for three in vitro selected tetraloop/receptors with previously unsolved structures (C7.2, C7.10, and R1). As a final test, SWM blindly and correctly predicted all noncanonical pairs of a Zika virus double pseudoknot during a recent community-wide RNA-Puzzle. Stepwise structure formation, as encoded in the SWM method, enables modeling of noncanonical RNA structure in a variety of previously intractable problems.
Rampello, Anthony J; Glynn, Steven E
2017-03-24
The i-AAA protease is a component of the mitochondrial quality control machinery that regulates respiration, mitochondrial dynamics, and protein import. The protease is required to select specific substrates for degradation from among the diverse complement of proteins present in mitochondria, yet the rules that govern this selection are unclear. Here, we reconstruct the yeast i-AAA protease, Yme1p, to examine the in vitro degradation of two intermembrane space chaperone subunits, Tim9 and Tim10. Yme1p degrades Tim10 more rapidly than Tim9 despite high sequence and structural similarity, and loss of Tim10 is accelerated by the disruption of conserved disulfide bonds within the substrate. An unstructured N-terminal region of Tim10 is necessary and sufficient to target the substrate to the protease through recognition of a short phenylalanine-rich motif, and the presence of similar motifs in other small Tim proteins predicts robust degradation by the protease. Together, these results identify the first specific degron sequence within a native i-AAA protease substrate. Copyright © 2017 Elsevier Ltd. All rights reserved.
Mitrea, Diana M; Cika, Jaclyn A; Guy, Clifford S; Ban, David; Banerjee, Priya R; Stanley, Christopher B; Nourse, Amanda; Deniz, Ashok A; Kriwacki, Richard W
2016-01-01
The nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identify multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus. DOI: http://dx.doi.org/10.7554/eLife.13571.001 PMID:26836305
Beusch, Irene; Barraud, Pierre; Moursy, Ahmed; Cléry, Antoine; Allain, Frédéric Hai-Trieu
2017-01-01
HnRNP A1 regulates many alternative splicing events by the recognition of splicing silencer elements. Here, we provide the solution structures of its two RNA recognition motifs (RRMs) in complex with short RNA. In addition, we show by NMR that both RRMs of hnRNP A1 can bind simultaneously to a single bipartite motif of the human intronic splicing silencer ISS-N1, which controls survival of motor neuron exon 7 splicing. RRM2 binds to the upstream motif and RRM1 to the downstream motif. Combining the insights from the structure with in cell splicing assays we show that the architecture and organization of the two RRMs is essential to hnRNP A1 function. The disruption of the inter-RRM interaction or the loss of RNA binding capacity of either RRM impairs splicing repression by hnRNP A1. Furthermore, both binding sites within the ISS-N1 are important for splicing repression and their contributions are cumulative rather than synergistic. DOI: http://dx.doi.org/10.7554/eLife.25736.001 PMID:28650318
Mitrea, Diana M.; Cika, Jaclyn A.; Guy, Clifford S.; ...
2016-02-02
In this study, the nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identify multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chan,K.; Fedorov, A.; Almo, S.
2008-01-01
Enzymes that share the (β/α)8-barrel fold catalyze a diverse range of reactions. Many utilize phosphorylated substrates and share a conserved C-terminal (β/α)2-quarter barrel subdomain that provides a binding motif for the dianionic phosphate group. We recently reported functional and structural studies of d-ribulose 5-phosphate 3-epimerase (RPE) from Streptococcus pyogenes that catalyzes the equilibration of the pentulose 5-phosphates d-ribulose 5-phosphate and d-xylulose 5-phosphate in the pentose phosphate pathway [J. Akana, A. A. Fedorov, E. Fedorov, W. R. P. Novack, P. C. Babbitt, S. C. Almo, and J. A. Gerlt (2006) Biochemistry 45, 2493-2503]. We now report functional and structural studies of d-allulose 6-phosphate 3-epimerase (ALSE) from Escherichia coli K-12 that catalyzes the equilibration of the hexulose 6-phosphates d-allulose 6-phosphate and d-fructose 6-phosphate in a catabolic pathway for d-allose. ALSE and RPE prefer their physiological substrates but are promiscuous for each other's substrate. The active sites (RPE complexed with d-xylitol 5-phosphate and ALSE complexed with d-glucitol 6-phosphate) are superimposable (as expected from their 39% sequence identity), with the exception of the phosphate binding motif. The loop following the eighth β-strand in ALSE is one residue longer than the homologous loop in RPE, so the binding site for the hexulose 6-phosphate substrate/product in ALSE is elongated relative to that for the pentulose 5-phosphate substrate/product in RPE. We constructed three single-residue deletion mutants of the loop in ALSE, ΔT196, ΔS197 and ΔG198, to investigate the structural bases for the differing substrate specificities; for each, the promiscuity is altered so that d-ribulose 5-phosphate is the preferred substrate. The changes in kcat/Km are dominated by changes in kcat, suggesting that substrate discrimination results from differential transition state stabilization. In both ALSE and RPE, the phosphate group hydrogen bonds not only with the conserved motif but also with an active site loop following the sixth β-strand, providing a potential structural mechanism for coupling substrate binding with catalysis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ding, Jun; Ma, Evan; Asta, Mark
Using molecular dynamics simulations, we have studied the atomic correlations characterizing the second peak in the radial distribution function (RDF) of metallic glasses and liquids. The analysis was conducted from the perspective of different connection schemes of atomic packing motifs, based on the number of shared atoms between two linked coordination polyhedra. The results demonstrate that the cluster connections by face-sharing, specifically with three common atoms, are most favored when transitioning from the liquid to glassy state, and exhibit the stiffest elastic response during shear deformation. These properties of the connections and the resultant atomic correlations are generally the same for different types of packing motifs in different alloys. Splitting of the second RDF peak was observed for the inherent structure of the equilibrium liquid, originating solely from cluster connections; this trait can then be inherited in the metallic glass formed via subsequent quenching of the parent liquid through the glass transition, in the absence of any additional type of local structural order. Increasing ordering and cluster connection during cooling, however, may tune the position and intensity of the split peaks.
Anion induced conformational preference of Cα NN motif residues in functional proteins.
Patra, Piya; Ghosh, Mahua; Banerjee, Raja; Chakrabarti, Jaydeb
2017-12-01
Among different ligand binding motifs, the anion binding CαNN motif, consisting of peptide backbone atoms of three consecutive residues, is observed to be important for recognition of free anions, like sulphate or biphosphate, and participates in different key functions. Here we study the interaction of sulphate and biphosphate with the CαNN motif present in different proteins. Instead of the whole protein, a peptide fragment has been studied, keeping the CαNN motif flanked by other residues. We use classical force field based molecular dynamics simulations to understand the stability of this motif. Our data indicate fluctuations in conformational preferences of the motif residues in the absence of the anion. The anion gives stability to one of these conformations. However, the anion induced conformational preferences are highly sequence dependent and specific to the type of anion. In particular, the polar residues are more favourable compared to the other residues for recognising the anion. © 2017 Wiley Periodicals, Inc.
Huang, Kezhen; Wang, Yue-Hao; Brown, Alex; Sun, Gongqin
2009-01-01
Csk and Src protein tyrosine kinases are structurally homologous, but use opposite regulatory strategies. The isolated catalytic domain of Csk is intrinsically inactive and is activated by interactions with the regulatory SH3 and SH2 domains, while the isolated catalytic domain of Src is intrinsically active and is suppressed by interactions with the regulatory SH3 and SH2 domains. The structural basis for why one isolated catalytic domain is intrinsically active while the other is inactive is not clear. In this current study, we identify the structural elements in the N-terminal lobe of the catalytic domain that render the Src catalytic domain active. These structural elements include the α-helix C region, a β-turn between the β-4 and β-5 strands, and an Arg residue at the beginning of the catalytic domain. These three motifs interact with each other to activate the Src catalytic domain, but the equivalent motifs in Csk directly interact with the regulatory domains that are important for Csk activation. The Src motifs can be grafted to the Csk catalytic domain to obtain an active Csk catalytic domain. These results, together with available Src and Csk tertiary structures, reveal an important structural switch that determines the kinase activity of a catalytic domain and dictates the regulatory strategy of a kinase. PMID:19244618
Structural basis for the facilitative diffusion mechanism by SemiSWEET transporter
NASA Astrophysics Data System (ADS)
Lee, Yongchan; Nishizawa, Tomohiro; Yamashita, Keitaro; Ishitani, Ryuichiro; Nureki, Osamu
2015-01-01
SWEET family proteins mediate sugar transport across biological membranes and play crucial roles in plants and animals. The SWEETs and their bacterial homologues, the SemiSWEETs, are related to the PQ-loop family, which is characterized by highly conserved proline and glutamine residues (PQ-loop motif). Although the structures of the bacterial SemiSWEETs were recently reported, the conformational transition and the significance of the conserved motif in the transport cycle have remained elusive. Here we report crystal structures of SemiSWEET from Escherichia coli, in both the inward-open and outward-open states. A structural comparison revealed that SemiSWEET undergoes an intramolecular conformational change in each protomer. The conserved PQ-loop motif serves as a molecular hinge that enables the ‘binder clip-like’ motion of SemiSWEET. The present work provides the framework for understanding the overall transport cycles of SWEET and PQ-loop family proteins.
Searching for statistically significant regulatory modules.
Bailey, Timothy L; Noble, William Stafford
2003-10-01
The regulatory machinery controlling gene expression is complex, frequently requiring multiple, simultaneous DNA-protein interactions. The rate at which a gene is transcribed may depend upon the presence or absence of a collection of transcription factors bound to the DNA near the gene. Locating transcription factor binding sites in genomic DNA is difficult because the individual sites are small and tend to occur frequently by chance. True binding sites may be identified by their tendency to occur in clusters, sometimes known as regulatory modules. We describe an algorithm for detecting occurrences of regulatory modules in genomic DNA. The algorithm, called mcast, takes as input a DNA database and a collection of binding site motifs that are known to operate in concert. mcast uses a motif-based hidden Markov model with several novel features. The model incorporates motif-specific p-values, thereby allowing scores from motifs of different widths and specificities to be compared directly. The p-value scoring also allows mcast to only accept motif occurrences with significance below a user-specified threshold, while still assigning better scores to motif occurrences with lower p-values. mcast can search long DNA sequences, modeling length distributions between motifs within a regulatory module, but ignoring length distributions between modules. The algorithm produces a list of predicted regulatory modules, ranked by E-value. We validate the algorithm using simulated data as well as real data sets from fruitfly and human. http://meme.sdsc.edu/MCAST/paper
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rajan, Rakhi; Taneja, Bhupesh; Mondragón, Alfonso
Topoisomerase V is an archaeal type I topoisomerase that is unique among topoisomerases due to the presence of both topoisomerase and DNA repair activities in the same protein. It is organized as an N-terminal topoisomerase domain followed by 24 tandem helix-hairpin-helix (HhH) motifs. Structural studies have shown that the active site is buried by the (HhH) motifs. Here we show that the N-terminal domain can relax DNA in the absence of any HhH motifs and that the HhH motifs are required for stable protein-DNA complex formation. Crystal structures of various topoisomerase V fragments show changes in the relative orientation of the domains mediated by a long bent linker helix, and these movements are essential for the DNA to enter the active site. Phosphate ions bound to the protein near the active site helped model DNA in the topoisomerase domain and show how topoisomerase V may interact with DNA.
Temporal motifs reveal homophily, gender-specific patterns, and group talk in call sequences.
Kovanen, Lauri; Kaski, Kimmo; Kertész, János; Saramäki, Jari
2013-11-05
Recent studies on electronic communication records have shown that human communication has complex temporal structure. We study how communication patterns that involve multiple individuals are affected by attributes such as sex and age. To this end, we represent the communication records as a colored temporal network where node color is used to represent individuals' attributes, and identify patterns known as temporal motifs. We then construct a null model for the occurrence of temporal motifs that takes into account the interaction frequencies and connectivity between nodes of different colors. This null model allows us to detect significant patterns in call sequences that cannot be observed in a static network that uses interaction frequencies as link weights. We find sex-related differences in communication patterns in a large dataset of mobile phone records and show the existence of temporal homophily, the tendency of similar individuals to participate in communication patterns beyond what would be expected on the basis of their average interaction frequencies. We also show that temporal patterns differ between dense and sparse neighborhoods in the network. Because also this result is independent of interaction frequencies, it can be seen as an extension of Granovetter's hypothesis to temporal networks.
Temporal motifs reveal homophily, gender-specific patterns, and group talk in call sequences
Kovanen, Lauri; Kaski, Kimmo; Kertész, János; Saramäki, Jari
2013-01-01
Recent studies on electronic communication records have shown that human communication has complex temporal structure. We study how communication patterns that involve multiple individuals are affected by attributes such as sex and age. To this end, we represent the communication records as a colored temporal network where node color is used to represent individuals’ attributes, and identify patterns known as temporal motifs. We then construct a null model for the occurrence of temporal motifs that takes into account the interaction frequencies and connectivity between nodes of different colors. This null model allows us to detect significant patterns in call sequences that cannot be observed in a static network that uses interaction frequencies as link weights. We find sex-related differences in communication patterns in a large dataset of mobile phone records and show the existence of temporal homophily, the tendency of similar individuals to participate in communication patterns beyond what would be expected on the basis of their average interaction frequencies. We also show that temporal patterns differ between dense and sparse neighborhoods in the network. Because also this result is independent of interaction frequencies, it can be seen as an extension of Granovetter’s hypothesis to temporal networks. PMID:24145424
Yeast One-Hybrid Gγ Recruitment System for Identification of Protein Lipidation Motifs
Fukuda, Nobuo; Doi, Motomichi; Honda, Shinya
2013-01-01
Fatty acids and isoprenoids can be covalently attached to a variety of proteins. These lipid modifications regulate protein structure, localization and function. Here, we describe a yeast one-hybrid approach based on the Gγ recruitment system that is useful for identifying sequence motifs that influence lipid modification to recruit proteins to the plasma membrane. Our approach facilitates the isolation of yeast cells expressing lipid-modified proteins via a simple and easy growth selection assay utilizing G-protein signaling that induces diploid formation. In the current study, we selected the N-terminal sequence of Gα subunits as a model case to investigate dual lipid modification, i.e., myristoylation and palmitoylation, a modification that is widely conserved from yeast to higher eukaryotes. Our results suggest that both lipid modifications are required for restoration of G-protein signaling. Although we could not differentiate between myristoylation and palmitoylation, N-terminal positions 7 and 8 play a critical role. Moreover, we tested the preference for specific amino-acid residues at positions 7 and 8 using library-based screening. This new approach will be useful to explore protein-lipid associations and to determine the corresponding sequence motifs. PMID:23922919
Analysis of the interactome of the Ser/Thr Protein Phosphatase type 1 in Plasmodium falciparum.
Hollin, Thomas; De Witte, Caroline; Lenne, Astrid; Pierrot, Christine; Khalife, Jamal
2016-03-17
Protein Phosphatase 1 (PP1) is an enzyme essential to cell viability in the malaria parasite Plasmodium falciparum (Pf). The activity of PP1 is regulated by the binding of regulatory subunits, of which there are up to 200 in humans, but only 3 have been so far reported for the parasite. To better understand the P. falciparum PP1 (PfPP1) regulatory network, we here report the use of three strategies to characterize the PfPP1 interactome: co-affinity purified proteins identified by mass spectrometry, yeast two-hybrid (Y2H) screening and in silico analysis of the P. falciparum predicted proteome. Co-affinity purification followed by MS analysis identified 6 PfPP1 interacting proteins (Pips) of which 3 contained the RVxF consensus binding, 2 with a Fxx[RK]x[RK] motif, also shown to be a PP1 binding motif and one with both binding motifs. The Y2H screens identified 134 proteins of which 30 present the RVxF binding motif and 20 have the Fxx[RK]x[RK] binding motif. The in silico screen of the Pf predicted proteome using a consensus RVxF motif as template revealed the presence of 55 potential Pips. As further demonstration, 35 candidate proteins were validated as PfPP1 interacting proteins in an ELISA-based assay. To the best of our knowledge, this is the first study on PfPP1 interactome. The data reports several conserved PP1 interacting proteins as well as a high number of specific interactors to PfPP1. Their analysis indicates a high diversity of biological functions for PP1 in Plasmodium. Based on the present data and on an earlier study of the Pf interactome, a potential implication of Pips in protein folding/proteolysis, transcription and pathogenicity networks is proposed. The present work provides a starting point for further studies on the structural basis of these interactions and their functions in P. falciparum.
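The in silico screen described above amounts to scanning protein sequences for short linear motifs. A minimal sketch of such a scan follows, assuming a literal reading of the two motifs named in the abstract ('x' taken as any residue); the degenerate consensus actually used in the study is not given here, and the toy sequence and function name are hypothetical.

```python
import re

# Literal readings of the two motifs named in the abstract; 'x' = any residue.
# Assumption: the study's real (degenerate) RVxF consensus may differ.
# Lookaheads are used so that overlapping occurrences are all reported.
MOTIFS = {
    "RVxF": re.compile(r"(?=(RV.F))"),
    "Fxx[RK]x[RK]": re.compile(r"(?=(F..[RK].[RK]))"),
}

def find_motifs(sequence):
    """Return {motif name: [(1-based start, matched text), ...]}."""
    hits = {}
    for name, pattern in MOTIFS.items():
        hits[name] = [(m.start() + 1, m.group(1))
                      for m in pattern.finditer(sequence)]
    return hits

# Hypothetical toy sequence, not a real P. falciparum protein.
toy = "MKRVTFAAFSSKLRGG"
print(find_motifs(toy))
```

A sequence match alone only nominates a candidate; the abstract's ELISA-based validation step is what confirms an interaction.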
RGAugury: a pipeline for genome-wide prediction of resistance gene analogs (RGAs) in plants.
Li, Pingchuan; Quan, Xiande; Jia, Gaofeng; Xiao, Jin; Cloutier, Sylvie; You, Frank M
2016-11-02
Resistance gene analogs (RGAs), such as NBS-encoding proteins, receptor-like protein kinases (RLKs) and receptor-like proteins (RLPs), are potential R-genes that contain specific conserved domains and motifs. Thus, RGAs can be predicted based on their conserved structural features using bioinformatics tools. Computer programs have been developed for the identification of individual domains and motifs from the protein sequences of RGAs but none offer a systematic assessment of the different types of RGAs. A user-friendly and efficient pipeline is needed for large-scale genome-wide RGA predictions of the growing number of sequenced plant genomes. An integrative pipeline, named RGAugury, was developed to automate RGA prediction. The pipeline first identifies RGA-related protein domains and motifs, namely nucleotide binding site (NB-ARC), leucine rich repeat (LRR), transmembrane (TM), serine/threonine and tyrosine kinase (STTK), lysin motif (LysM), coiled-coil (CC) and Toll/Interleukin-1 receptor (TIR). RGA candidates are identified and classified into four major families based on the presence of combinations of these RGA domains and motifs: NBS-encoding, TM-CC, and membrane associated RLP and RLK. All time-consuming analyses of the pipeline are parallelized to improve performance. The pipeline was evaluated using the well-annotated Arabidopsis genome. A total of 98.5, 85.2, and 100% of the reported NBS-encoding genes, membrane associated RLPs and RLKs were validated, respectively. The pipeline was also successfully applied to predict RGAs for 50 sequenced plant genomes. A user-friendly web interface was implemented to ease command line operations, facilitate visualization and simplify result management for multiple datasets. RGAugury is an efficient, integrative bioinformatics tool for large scale genome-wide identification of RGAs. It is freely available at Bitbucket: https://bitbucket.org/yaanlpc/rgaugury .
Distribution and diversity of ribosome binding sites in prokaryotic genomes.
Omotajo, Damilola; Tate, Travis; Cho, Hyuk; Choudhary, Madhusudan
2015-08-14
Prokaryotic translation initiation involves the proper docking, anchoring, and accommodation of mRNA to the 30S ribosomal subunit. Three initiation factors (IF1, IF2, and IF3) and some ribosomal proteins mediate the assembly and activation of the translation initiation complex. Although the interaction between Shine-Dalgarno (SD) sequence and its complementary sequence in the 16S rRNA is important in initiation, some genes lacking an SD ribosome binding site (RBS) are still well expressed. The objective of this study is to examine the pattern of distribution and diversity of RBS in fully sequenced bacterial genomes. The following three hypotheses were tested: SD motifs are prevalent in bacterial genomes; all previously identified SD motifs are uniformly distributed across prokaryotes; and genes with specific cluster of orthologous gene (COG) functions differ in their use of SD motifs. Data for 2,458 bacterial genomes, previously generated by Prodigal (PROkaryotic DYnamic programming Gene-finding ALgorithm) and currently available at the National Center for Biotechnology Information (NCBI), were analyzed. Of the total genes examined, ~77.0% use an SD RBS, while ~23.0% have no RBS. Majority of the genes with the most common SD motifs are distributed in a manner that is representative of their abundance for each COG functional category, while motifs 13 (5'-GGA-3'/5'-GAG-3'/5'-AGG-3') and 27 (5'-AGGAGG-3') appear to be predominantly used by genes for information storage and processing, and translation and ribosome biogenesis, respectively. These findings suggest that an SD sequence is not obligatory for translation initiation; instead, other signals, such as the RBS spacer, may have an overarching influence on translation of mRNAs. Subsequent analyses of the 5' secondary structure of these mRNAs may provide further insight into the translation initiation mechanism.
RNA motif search with data-driven element ordering.
Rampášek, Ladislav; Jimenez, Randi M; Lupták, Andrej; Vinař, Tomáš; Brejová, Broňa
2016-05-18
In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, which enables better characterization of ribozymes and aptamers. A typical RNA motif consists of multiple elements and the running time of the algorithm is highly dependent on their ordering. By approaching the element ordering problem in a principled way, we demonstrate more than 100-fold speedup of the search for complex motifs compared to previously published tools. We have developed a new method for RNA motif search that allows for a significant speedup of the search of complex motifs that include pseudoknots. Such speed improvements are crucial at a time when the rate of DNA sequencing outpaces growth in computing. RNArobo is available at http://compbio.fmph.uniba.sk/rnarobo .
QuateXelero: An Accelerated Exact Network Motif Detection Algorithm
Khakabimamaghani, Sahand; Sharafuddin, Iman; Dichter, Norbert; Koch, Ina; Masoudi-Nejad, Ali
2013-01-01
Finding motifs in biological, social, technological, and other types of networks has become a widespread method to gain more knowledge about these networks’ structure and function. However, this task is very computationally demanding, because it is closely tied to graph isomorphism, which is an NP problem (not yet known to belong to the P or NP-complete subsets). Accordingly, this research endeavors to decrease the need to call the NAUTY isomorphism detection method, which is the most time-consuming step in many existing algorithms. The work provides an extremely fast motif detection algorithm called QuateXelero, which has a Quaternary Tree data structure at its heart. The proposed algorithm is based on the well-known ESU (FANMOD) motif detection algorithm. The results of experiments on some standard model networks confirm the overall superiority of the proposed algorithm, namely QuateXelero, compared with two of the fastest existing algorithms, G-Tries and Kavosh. QuateXelero is especially fast in constructing the central data structure of the algorithm from scratch based on the input network. PMID:23874498
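For orientation, the ESU enumeration step that the abstract names as its basis can be sketched as follows. This is a simplified illustration for undirected graphs, not QuateXelero itself: it has no quaternary tree and no NAUTY call, and it classifies 3-node subgraphs simply by edge count (path versus triangle), which is an assumption that only works for this small undirected case. The function names and the toy graph are hypothetical.

```python
from collections import Counter

def esu(adj, k):
    """Yield each connected k-node induced subgraph (a frozenset) exactly once.

    adj maps every node to the set of its neighbours (undirected graph,
    comparable node labels).
    """
    def extend(sub, ext, root):
        if len(sub) == k:
            yield frozenset(sub)
            return
        ext = set(ext)
        # Nodes already in the subgraph or adjacent to it; a new candidate
        # must lie outside this set (the "exclusive neighbourhood" rule).
        closed = set(sub).union(*(adj[u] for u in sub))
        while ext:
            w = ext.pop()
            new_ext = ext | {u for u in adj[w] if u > root and u not in closed}
            yield from extend(sub | {w}, new_ext, root)

    for v in adj:
        yield from extend({v}, {u for u in adj[v] if u > v}, v)

def edge_count(adj, nodes):
    """Number of edges of the subgraph induced by nodes."""
    return sum(1 for u in nodes for w in adj[u] if w in nodes and u < w)

# Toy graph: a triangle 1-2-3 with a pendant node 4 attached to node 3.
graph = {1: {2, 3}, 2: {1, 3}, 3: {1, 2, 4}, 4: {3}}
classes = Counter(edge_count(graph, s) for s in esu(graph, 3))
print(classes)  # keys: 3 edges = triangle, 2 edges = open path
```

For larger or directed subgraphs, edge count no longer separates isomorphism classes, which is where canonical labelling (NAUTY in the tools discussed above) comes in.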
A motif detection and classification method for peptide sequences using genetic programming.
Tomita, Yasuyuki; Kato, Ryuji; Okochi, Mina; Honda, Hiroyuki
2008-08-01
An exploration of common rules (property motifs) in amino acid sequences has been required for the design of novel sequences and elucidation of the interactions between molecules controlled by the structural or physical environment. In the present study, we developed a new method to search for property motifs that are common in peptide sequence data. Our method has the following two characteristics: (i) the automatic determination of the position and length of common property motifs by calculating the physicochemical similarity of amino acids, and (ii) the quick and effective exploration of motif candidates that discriminate the positives from the negatives by the introduction of genetic programming (GP). Our method was evaluated on two types of model data sets. First, property motifs were searched for in artificially derived peptide data containing intentionally buried property motifs. As a result, the expected property motifs were correctly extracted by our algorithm. Second, the peptide data that interact with MHC class II molecules were analyzed as one of the models of biologically active peptides with buried motifs of various lengths. Twice as many MHC class II binding peptides were identified with the rule obtained by our method as with the existing scoring matrix method. In conclusion, our GP based motif searching approach made it possible to obtain knowledge of functional aspects of the peptides without any prior knowledge.
Petitdemange, Caroline; Achour, Abla; Dispinseri, Stefania; Malet, Isabelle; Sennepin, Alexis; Ho Tsong Fang, Raphaël; Crouzet, Joël; Marcelin, Anne-Geneviève; Calvez, Vincent; Scarlatti, Gabriella; Debré, Patrice; Vieillard, Vincent
2013-09-01
The induction of neutralizing antibodies against conserved regions of the human immunodeficiency virus type 1 (HIV-1) envelope protein is a major goal of vaccine strategies. We previously identified 3S, a critical conserved motif of gp41 that induces the NKp44L ligand of an activating NK receptor. In vivo, anti-3S antibodies protect against the natural killer (NK) cell-mediated CD4 depletion that occurs without efficient viral neutralization. Specific substitutions within the 3S peptide motif were prepared by directed mutagenesis. Virus production was monitored by measuring the p24 production. Neutralization assays were performed with immune-purified antibodies from immunized mice and a cohort of HIV-infected patients. Expression of NKp44L on CD4(+) T cells and degranulation assay on activating NK cells were both performed by flow cytometry. Here, we show that specific substitutions in the 3S motif reduce viral infection without affecting gp41 production, while decreasing both its capacity to induce NKp44L expression on CD4(+) T cells and its sensitivity to autologous NK cells. Generation of antibodies in mice against the W614 specific position in the 3S motif elicited a capacity to neutralize cross-clade viruses, notable in its magnitude, breadth, and durability. Antibodies against this 3S variant were also detected in sera from some HIV-1-infected patients, demonstrating both neutralization activity and protection against CD4 depletion. These findings suggest that a specific substitution in a 3S-based immunogen might allow the generation of specific antibodies, providing a foundation for a rational vaccine that combines a capacity to neutralize HIV-1 and to protect CD4(+) T cells.
Tumlirsch, Tony; Jendrossek, Dieter
2017-04-01
On the basis of bioinformatic evidence, we suspected that proteins with a CYTH (CyaB thiamine triphosphatase) domain and/or a CHAD (conserved histidine α-helical domain) motif might represent polyphosphate (polyP) granule-associated proteins. We found no evidence of polyP targeting by proteins with CYTH domains. In contrast, two CHAD motif-containing proteins from Ralstonia eutropha H16 (A0104 and B1017) that were expressed as fusions with enhanced yellow fluorescent protein (eYFP) colocalized with polyP granules. While the expression of B1017 was not detectable, the A0104 protein was specifically identified in an isolated polyP granule fraction by proteome analysis. Moreover, eYFP fusions with the CHAD motif-containing proteins MGMSRV2-1987 from Magnetospirillum gryphiswaldense and PP2307 from Pseudomonas putida also colocalized with polyP granules in a transspecies-specific manner. These data indicated that CHAD-containing proteins are generally attached to polyP granules. Together with the findings from four previously polyP-attached proteins (polyP kinases), the results of this study raised the number of polyP-associated proteins in R. eutropha to six. We suggest designating polyP granule-bound proteins with CHAD motifs as phosins (phosphate), analogous to phasins and oleosins that are specifically bound to the surface of polyhydroxyalkanoate (PHA) granules in PHA-accumulating bacteria and to oil droplets in oil seed plants, respectively. IMPORTANCE The importance of polyphosphate (polyP) for life is evident from the ubiquitous presence of polyP in all species on earth. In unicellular eukaryotic microorganisms, polyP is located in specific membrane-enclosed organelles, called acidocalcisomes. However, in most prokaryotes, polyP is present as insoluble granules that have been designated previously as volutin granules. Almost nothing is known regarding the macromolecular composition of polyP granules. Particularly, the absence or presence of cellular compounds on the surface of polyP granules has not yet been investigated. In this study, we identified a novel class of proteins that are attached to the surface of polyP granules in three model species of Alphaproteobacteria, Betaproteobacteria, and Gammaproteobacteria. These proteins are characterized by the presence of a CHAD (conserved histidine α-helical domain) motif that functions as a polyP granule-targeting signal. We suggest designating CHAD motif-containing proteins as phosins [analogous to phasins for poly(3-hydroxybutyrate)-associated proteins and to oleosins for oil droplet-associated proteins in oil seed plants]. The expression of phosins in different species confirmed their polyP-targeting function in a transspecies-specific manner. We postulate that polyP granules in prokaryotic species generally have a complex surface structure that consists of one to several polyP kinases and phosin proteins. We suggest differentiating polyP granules from acidocalcisomes by designating them as polyphosphatosomes. Copyright © 2017 American Society for Microbiology.
Discriminative motif discovery via simulated evolution and random under-sampling.
Song, Tao; Gu, Hong
2014-01-01
Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs) training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.
Deciphering functional glycosaminoglycan motifs in development.
Townley, Robert A; Bülow, Hannes E
2018-03-23
Glycosaminoglycans (GAGs) such as heparan sulfate, chondroitin/dermatan sulfate, and keratan sulfate are linear glycans, which when attached to protein backbones form proteoglycans. GAGs are essential components of the extracellular space in metazoans. Extensive modifications of the glycans such as sulfation, deacetylation and epimerization create structural GAG motifs. These motifs regulate protein-protein interactions and are thereby responsible for many of the essential functions of GAGs. This review focusses on recent genetic approaches to characterize GAG motifs and their function in defined signaling pathways during development. We discuss a coding approach for GAGs that would enable computational analyses of GAG sequences such as alignments and the computation of position weight matrices to describe GAG motifs. Copyright © 2018 Elsevier Ltd. All rights reserved.
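To make the idea of a position weight matrix concrete, here is a minimal sketch that builds a log-odds matrix from already aligned, letter-coded sequences. The one-letter code used in the toy data ("A", "B", "C") is purely hypothetical and is not the GAG coding scheme proposed in the review; the uniform background, the pseudocount and the function names are likewise assumptions made for illustration.

```python
import math
from collections import Counter

def position_weight_matrix(aligned, alphabet, pseudocount=1.0):
    """Log2-odds PWM against a uniform background, one dict per column."""
    length = len(aligned[0])
    assert all(len(s) == length for s in aligned), "sequences must be aligned"
    background = 1.0 / len(alphabet)
    total = len(aligned) + pseudocount * len(alphabet)
    pwm = []
    for i in range(length):
        counts = Counter(s[i] for s in aligned)  # symbol counts in column i
        pwm.append({a: math.log2(((counts[a] + pseudocount) / total) / background)
                    for a in alphabet})
    return pwm

def score(pwm, seq):
    """Sum of per-position log-odds for a sequence of the motif's length."""
    return sum(col[ch] for col, ch in zip(pwm, seq))

# Hypothetical letter-coded, pre-aligned motif instances.
instances = ["ABBA", "ABCA", "ACBA"]
pwm = position_weight_matrix(instances, "ABC")
print(score(pwm, "ABBA"), score(pwm, "CCCC"))
```

Sequences resembling the aligned instances receive higher scores than unrelated ones, which is what lets a matrix of this kind describe a motif rather than a single exact string.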
Ca2+-binding Motif of βγ-Crystallins*
Srivastava, Shanti Swaroop; Mishra, Amita; Krishnan, Bal; Sharma, Yogendra
2014-01-01
βγ-Crystallin-type double clamp (N/D)(N/D)XX(S/T)S motif is an established but sparsely investigated motif for Ca2+ binding. A βγ-crystallin domain is formed of two Greek key motifs, accommodating two Ca2+-binding sites. βγ-Crystallins make a separate class of Ca2+-binding proteins (CaBP), apparently a major group of CaBP in bacteria. Paralleling the diversity in βγ-crystallin domains, these motifs also show great diversity, both in structure and in function. Although the expression of some of them has been associated with stress, virulence, and adhesion, the functional implications of Ca2+ binding to βγ-crystallins in mediating biological processes are yet to be elucidated. PMID:24567326
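The (N/D)(N/D)XX(S/T)S consensus quoted in this abstract translates directly into a regular expression, with X read as any residue. The sketch below is only an illustration of that translation; the toy sequence is invented, and a sequence match alone says nothing about whether a site actually binds Ca2+.

```python
import re

# (N/D)(N/D)XX(S/T)S with X = any residue; the lookahead allows overlapping hits.
CLAMP = re.compile(r"(?=([ND][ND]..[ST]S))")

def find_clamp_sites(seq):
    """Return (1-based start, matched segment) for every consensus match."""
    return [(m.start() + 1, m.group(1)) for m in CLAMP.finditer(seq)]

toy = "MKANDGSTSAGDDKLSSQ"   # made-up sequence for illustration
print(find_clamp_sites(toy))  # [(4, 'NDGSTS'), (12, 'DDKLSS')]
```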
Web server to identify similarity of amino acid motifs to compounds (SAAMCO).
Casey, Fergal P; Davey, Norman E; Baran, Ivan; Varekova, Radka Svobodova; Shields, Denis C
2008-07-01
Protein-protein interactions are fundamental in mediating biological processes including metabolism, cell growth, and signaling. To be able to selectively inhibit or induce protein activity or complex formation is a key feature in controlling disease. For those situations in which protein-protein interactions derive substantial affinity from short linear peptide sequences, or motifs, we can develop search algorithms for peptidomimetic compounds that resemble the short peptide's structure but are not compromised by poor pharmacological properties. SAAMCO is a Web service (http://bioware.ucd.ie/~saamco) that facilitates the screening of motifs with known structures against bioactive compound databases. It is built on an algorithm that defines compound similarity based on the presence of appropriate amino acid side chain fragments and a favorable Root Mean Squared Deviation (RMSD) between compound and motif structure. The methodology is efficient as the available compound databases are preprocessed and fast regular expression searches filter potential matches before time-intensive 3D superposition is performed. The required input information is minimal, and the compound databases have been selected to maximize the availability of information on biological activity. "Hits" are accompanied with a visualization window and links to source database entries. Motif matching can be defined on partial or full similarity which will increase or reduce respectively the number of potential mimetic compounds. The Web server provides the functionality for rapid screening of known or putative interaction motifs against prepared compound libraries using a novel search algorithm. The tabulated results can be analyzed by linking to appropriate databases and by visualization.
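The RMSD criterion named in this abstract can be sketched as follows. This is not SAAMCO's implementation: the function assumes the two point sets are already paired atom-for-atom and expressed in the same coordinate frame, so it omits the optimal superposition step that the server performs, and the coordinates are invented.

```python
import math

def rmsd(coords_a, coords_b):
    """Root mean squared deviation between two paired lists of (x, y, z)
    points. No superposition is done; inputs are assumed pre-aligned."""
    if not coords_a or len(coords_a) != len(coords_b):
        raise ValueError("need two non-empty, equal-length coordinate lists")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))

# Hypothetical side-chain fragment positions (angstroms) and a copy
# translated by 1 angstrom along x.
motif = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
compound = [(x + 1.0, y, z) for x, y, z in motif]
print(rmsd(motif, compound))  # 1.0
```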
DOE Office of Scientific and Technical Information (OSTI.GOV)
Collet, Jean-Francois; Peisach, Daniel; Bardwell, James C.A.
2010-07-13
Escherichia coli thioredoxin is a small monomeric protein that reduces disulfide bonds in cytoplasmic proteins. Two cysteine residues present in a conserved CGPC motif are essential for this activity. Recently, we identified mutations of this motif that changed thioredoxin into a homodimer bridged by a [2Fe-2S] iron-sulfur cluster. When exported to the periplasm, these thioredoxin mutants could restore disulfide bond formation in strains lacking the entire periplasmic oxidative pathway. Essential for the assembly of the iron-sulfur was an additional cysteine that replaced the proline at position three of the CGPC motif. We solved the crystalline structure at 2.3 Å for one of these variants, TrxA(CACA). The mutant protein crystallized as a dimer in which the iron-sulfur cluster is replaced by two intermolecular disulfide bonds. The catalytic site, which forms the dimer interface, crystallized in two different conformations. In one of them, the replacement of the CGPC motif by CACA has a dramatic effect on the structure and causes the unraveling of an extended α-helix. In both conformations, the second cysteine residue of the CACA motif is surface-exposed, which contrasts with wildtype thioredoxin where the second cysteine of the CXXC motif is buried. This exposure of a pair of vicinal cysteine residues apparently allows thioredoxin to acquire an iron-sulfur cofactor at its active site, and thus a new activity and mechanism of action.
Khandaker, Md Shahriar K; Dudek, Daniel M; Beers, Eric P; Dillard, David A; Bevan, David R
2016-08-01
The mechanisms responsible for the properties of disordered elastomeric proteins are not well known. To better understand the relationship between elastomeric behavior and amino acid sequence, we investigated resilin, a disordered rubber-like protein found in specialized regions of the cuticle of insects. Resilin of Drosophila melanogaster contains Gly-rich repetitive motifs composed of the amino acids PSSSYGAPGGGNGGR, which confer elastic properties to resilin. The repetitive motifs of insect resilin can be divided into smaller partially conserved building blocks: PSS, SYGAP, GGGN and GGR. Using molecular dynamics (MD) simulations, we studied the relative roles of SYGAP, and its less common variants SYSAP and TYGAP, on the elastomeric properties of resilin. Results showed that SYGAP adopts a bent structure that is one-half to one-third the end-to-end length of the other motifs having an equal number of amino acids but containing SYSAP or TYGAP substituted for SYGAP. The bent structure of SYGAP forms due to conformational freedom of glycine, and hydrogen bonding within the motif apparently plays a role in maintaining this conformation. These structural features of SYGAP result in higher extensibility compared to other motifs, which may contribute to elastic properties at the macroscopic level. Overall, the results are consistent with a role for the SYGAP building block in the elastomeric properties of these disordered proteins. What we learned from simulating the repetitive motifs of resilin may be applicable to the biology and mechanics of other elastomeric biomaterials, and may provide a deeper understanding of their unique properties. Copyright © 2016 Elsevier Ltd. All rights reserved.
Genome-wide colonization of gene regulatory elements by G4 DNA motifs
Du, Zhuo; Zhao, Yiqiang; Li, Ning
2009-01-01
G-quadruplex (or G4 DNA), a stable four-stranded structure found in guanine-rich regions, is implicated in the transcriptional regulation of genes involved in growth and development. Previous studies on the role of G4 DNA in gene regulation mostly focused on genomic regions proximal to transcription start sites (TSSs). To gain a more comprehensive understanding of the regulatory role of G4 DNA, we examined the landscape of potential G4 DNA motifs (PG4Ms) in the human genome and found that G4 motifs, not restricted to those found in the TSS-proximal regions, are biased toward gene-associated regions. Significantly, analyses of G4 motifs in seven types of well-known gene regulatory elements revealed a constitutive enrichment pattern and the clusters of G4 motifs tend to be colocalized with regulatory elements. Considering our analysis from a genome evolutionary perspective, we found evidence that the occurrence and accumulation of certain progenitors and canonical G4 DNA motifs within regulatory regions were progressively favored by natural selection. Our results suggest that G4 DNA motifs are ‘colonized’ in regulatory regions, supporting a likely genome-wide role of G4 DNA in gene regulation. We hypothesize that G4 DNA is a regulatory apparatus situated in regulatory elements, acting as a molecular switch that can modulate the role of the host functional regions, by transition in DNA structure. PMID:19759215
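A genome scan for potential G4 motifs is commonly done with a pattern of four G-runs separated by short loops. The abstract does not state which definition the authors used, so the pattern below (runs of at least three G, loops of 1 to 7 nt) is an assumption taken from the widely used folding rule, and the toy sequence is invented. The sketch scans only the given strand; the complementary strand would need a C-run pattern or a reverse-complemented input.

```python
import re

# Assumed definition: G{3,} N{1,7} G{3,} N{1,7} G{3,} N{1,7} G{3,}
PG4 = re.compile(r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}")

def find_pg4(seq):
    """Return (start, end, matched text) for non-overlapping matches
    on the given strand, using 0-based half-open coordinates."""
    seq = seq.upper()
    return [(m.start(), m.end(), m.group()) for m in PG4.finditer(seq)]

toy = "ATTGGGAGGGTTAGGGCTGGGAAT"   # made-up example
print(find_pg4(toy))  # [(3, 21, 'GGGAGGGTTAGGGCTGGG')]
```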
Henry, Kelli F.; Kawashima, Tomokazu; Goldberg, Robert B.
2015-03-22
Little is known about the molecular mechanisms by which the embryo proper and suspensor of plant embryos activate specific gene sets shortly after fertilization. We analyzed the upstream region of the Scarlet Runner Bean (Phaseolus coccineus) G564 gene in order to understand how genes are activated specifically in the suspensor during early embryo development. Previously, we showed that a 54-bp fragment of the G564 upstream region is sufficient for suspensor transcription and contains at least three required cis-regulatory sequences, including the 10-bp motif (5'-GAAAAGCGAA-3'), the 10 bp-like motif (5'-GAAAAACGAA-3'), and Region 2 motif (partial sequence 5'-TTGGT-3'). Here, we use site-directed mutagenesis experiments in transgenic tobacco globular-stage embryos to identify two additional cis-regulatory elements within the 54-bp cis-regulatory module that are required for G564 suspensor transcription: the Fifth motif (5'-GAGTTA-3') and a third 10-bp-related sequence (5'-GAAAACCACA-3'). Further deletion of the 54-bp fragment revealed that a 47-bp fragment containing the five motifs (the 10-bp, 10-bp-like, 10-bp-related, Region 2 and Fifth motifs) is sufficient for suspensor transcription, and represents a cis-regulatory module. A consensus sequence for each type of motif was determined by comparing motif sequences shown to activate suspensor transcription. Phylogenetic analyses suggest that the regulation of G564 is evolutionarily conserved. Lastly, a homologous cis-regulatory module was found upstream of the G564 ortholog in the Common Bean (Phaseolus vulgaris), indicating that the regulation of G564 is evolutionarily conserved in closely related bean species.
Henry, Kelli F; Kawashima, Tomokazu; Goldberg, Robert B
2015-06-01
Little is known about the molecular mechanisms by which the embryo proper and suspensor of plant embryos activate specific gene sets shortly after fertilization. We analyzed the upstream region of the Scarlet Runner Bean (Phaseolus coccineus) G564 gene in order to understand how genes are activated specifically in the suspensor during early embryo development. Previously, we showed that a 54-bp fragment of the G564 upstream region is sufficient for suspensor transcription and contains at least three required cis-regulatory sequences, including the 10-bp motif (5'-GAAAAGCGAA-3'), the 10 bp-like motif (5'-GAAAAACGAA-3'), and Region 2 motif (partial sequence 5'-TTGGT-3'). Here, we use site-directed mutagenesis experiments in transgenic tobacco globular-stage embryos to identify two additional cis-regulatory elements within the 54-bp cis-regulatory module that are required for G564 suspensor transcription: the Fifth motif (5'-GAGTTA-3') and a third 10-bp-related sequence (5'-GAAAACCACA-3'). Further deletion of the 54-bp fragment revealed that a 47-bp fragment containing the five motifs (the 10-bp, 10-bp-like, 10-bp-related, Region 2 and Fifth motifs) is sufficient for suspensor transcription, and represents a cis-regulatory module. A consensus sequence for each type of motif was determined by comparing motif sequences shown to activate suspensor transcription. Phylogenetic analyses suggest that the regulation of G564 is evolutionarily conserved. A homologous cis-regulatory module was found upstream of the G564 ortholog in the Common Bean (Phaseolus vulgaris), indicating that the regulation of G564 is evolutionarily conserved in closely related bean species.
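The three 10-bp-type sequences quoted in this abstract differ from one another by only a few bases, which suggests a mismatch-tolerant scan when looking for related sites. The sketch below is one simple way to do that with a Hamming distance; it is not the authors' method, and the flanking bases in the toy sequence are invented.

```python
def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))

def scan(seq, motif, max_mismatches=1):
    """Return (0-based start, window, mismatches) for every window of seq
    within max_mismatches of motif (forward strand only)."""
    k = len(motif)
    hits = []
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        d = hamming(window, motif)
        if d <= max_mismatches:
            hits.append((i, window, d))
    return hits

# Motif sequences as given in the abstract.
TEN_BP = "GAAAAGCGAA"
TEN_BP_LIKE = "GAAAAACGAA"
TEN_BP_RELATED = "GAAAACCACA"

print(hamming(TEN_BP, TEN_BP_LIKE))     # 1
print(hamming(TEN_BP, TEN_BP_RELATED))  # 3

# Toy sequence with made-up flanks around two of the motifs.
toy = "TT" + TEN_BP + "CCGT" + TEN_BP_LIKE + "AT"
hits = scan(toy, TEN_BP, max_mismatches=1)
```

With one allowed mismatch the scan recovers both the 10-bp motif and the 10-bp-like motif; finding the 10-bp-related sequence would require allowing three mismatches, at the cost of more spurious hits.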
Chen, Haimei; Zhang, Jianhui; Yuan, George; Liu, Chang
2014-01-01
Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggests that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs include a TATA box–like motif (CPGDMM1, “TATANNNATNA”), and an unknown motif (CPGDMM2, “WNYANTGAW”). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed a significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1 motif (p<0.01). Taken together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in the chloroplast genome. PMID:24914614
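The two modification motifs are written with IUPAC ambiguity codes (N = any base, W = A or T, Y = C or T), so locating them in a sequence amounts to expanding each code into a character class. The sketch below shows that expansion and rechecks the two percentages quoted above; the toy sequence is invented and only the forward strand is scanned.

```python
import re

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

def iupac_to_regex(motif):
    """Compile an IUPAC motif into a regex; the lookahead keeps overlapping hits."""
    return re.compile("(?=(" + "".join(IUPAC[ch] for ch in motif) + "))")

def find_sites(motif, seq):
    """Return 0-based start positions of motif matches on the given strand."""
    return [m.start() for m in iupac_to_regex(motif).finditer(seq.upper())]

toy = "GGTATACGGATCAACCATTGATGG"            # made-up sequence
print(find_sites("TATANNNATNA", toy))      # [2]
print(find_sites("WNYANTGAW", toy))        # [13]

# Fractions of significantly modified motifs reported in the abstract.
print(round(100 * 35 / 97, 1), round(100 * 91 / 369, 1))  # 36.1 24.7
```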
Automated extraction and classification of RNA tertiary structure cyclic motifs
Lemieux, Sébastien; Major, François
2006-01-01
A minimum cycle basis of the tertiary structure of a large ribosomal subunit (LSU) X-ray crystal structure was analyzed. Most cycles are small, as they are composed of 3- to 5 nt, and repeated across the LSU tertiary structure. We used hierarchical clustering to quantify and classify the 4 nt cycles. One class is defined by the GNRA tetraloop motif. The inspection of the GNRA class revealed peculiar instances in sequence. First is the presence of UA, CA, UC and CC base pairs that substitute the usual sheared GA base pair. Second is the revelation of GNR(Xn)A tetraloops, where Xn is bulged out of the classical GNRA structure, and of GN/RA formed by the two strands of interior-loops. We were able to unambiguously characterize the cycle classes using base stacking and base pairing annotations. The cycles identified correspond to small and cyclic motifs that compose most of the LSU RNA tertiary structure and contribute to its thermodynamic stability. Consequently, the RNA minimum cycles could well be used as the basic elements of RNA tertiary structure prediction methods. PMID:16679452
Topological impact of noncanonical DNA structures on Klenow fragment of DNA polymerase.
Takahashi, Shuntaro; Brazier, John A; Sugimoto, Naoki
2017-09-05
Noncanonical DNA structures that stall DNA replication can cause errors in genomic DNA. Here, we investigated how the noncanonical structures formed by sequences in genes associated with a number of diseases impacted DNA polymerization by the Klenow fragment of DNA polymerase. Replication of a DNA sequence forming an i-motif from a telomere, hypoxia-induced transcription factor, and an insulin-linked polymorphic region was effectively inhibited. On the other hand, replication of a mixed-type G-quadruplex (G4) from a telomere was less inhibited than that of the antiparallel type or parallel type. Interestingly, the i-motif was a better inhibitor of replication than were mixed-type G4s or hairpin structures, even though all had similar thermodynamic stabilities. These results indicate that both the stability and topology of structures formed in DNA templates impact the processivity of a DNA polymerase. This suggests that i-motif formation may trigger genomic instability by stalling the replication of DNA, causing intractable diseases.
Topological impact of noncanonical DNA structures on Klenow fragment of DNA polymerase
Takahashi, Shuntaro; Brazier, John A.; Sugimoto, Naoki
2017-01-01
Noncanonical DNA structures that stall DNA replication can cause errors in genomic DNA. Here, we investigated how the noncanonical structures formed by sequences in genes associated with a number of diseases impacted DNA polymerization by the Klenow fragment of DNA polymerase. Replication of a DNA sequence forming an i-motif from a telomere, hypoxia-induced transcription factor, and an insulin-linked polymorphic region was effectively inhibited. On the other hand, replication of a mixed-type G-quadruplex (G4) from a telomere was less inhibited than that of the antiparallel type or parallel type. Interestingly, the i-motif was a better inhibitor of replication than were mixed-type G4s or hairpin structures, even though all had similar thermodynamic stabilities. These results indicate that both the stability and topology of structures formed in DNA templates impact the processivity of a DNA polymerase. This suggests that i-motif formation may trigger genomic instability by stalling the replication of DNA, causing intractable diseases. PMID:28827350
Wu, Yifei; Chin, William W; Wang, Yong; Burris, Thomas P
2003-03-07
The activation function 2 (AF-2)-dependent recruitment of coactivator is essential for gene activation by nuclear receptors. We show that the peroxisome proliferator-activated receptor gamma (PPARgamma) (NR1C3) coactivator-1 (PGC-1) requires both the intact AF-2 domain of PPARgamma and the LXXLL domain of PGC-1 for ligand-dependent and ligand-independent interaction and coactivation. Although the AF-2 domain of PPARgamma is absolutely required for PGC-1-mediated coactivation, this coactivator displayed a unique lack of requirement for the charge clamp of the ligand-binding domain of the receptor that is thought to be essential for LXXLL motif recognition. The mutation of a single serine residue adjacent to the core LXXLL motif of PGC-1 led to restoration of the typical charge clamp requirement. Thus, the unique structural features of the PGC-1 LXXLL motif appear to mediate an atypical mode of interaction with PPARgamma. Unexpectedly, we discovered that various ligands display variability in terms of their requirement for the charge clamp of PPARgamma for coactivation by PGC-1. This ligand-selective variable requirement for the charge clamp was coactivator-specific. Thus, distinct structural determinants, which may be unique for a particular ligand, are utilized by the receptor to recognize the coactivator. Our data suggest that even subtle differences in ligand structure are perceived by the receptor and translated into a unique display of the coactivator-binding surface of the ligand-binding domain, allowing for differential recognition of coactivators that may underlie distinct pharmacological profiles observed for ligands of a particular nuclear receptor.
Midzak, Andrew; Rammouz, Georges; Papadopoulos, Vassilios
2012-11-01
Steroids metabolically derive from lipid cholesterol, and vertebrate steroids additionally derive from the steroid pregnenolone. Pregnenolone is derived from cholesterol by hydrolytic cleavage of the aliphatic tail by mitochondrial cytochrome P450 enzyme CYP11A1, located in the inner mitochondrial membrane. Delivery of cholesterol to CYP11A1 comprises the principal control step of steroidogenesis, and requires a series of proteins spanning the mitochondrial double membranes. A critical member of this cholesterol translocation machinery is the integral outer mitochondrial membrane translocator protein (18kDa, TSPO), a high-affinity drug- and cholesterol-binding protein. The cholesterol-binding site of TSPO consists of a phylogenetically conserved cholesterol recognition/interaction amino acid consensus (CRAC). Previous studies from our group identified 5-androsten-3β,17,19-triol (19-Atriol) as drug ligand for the TSPO CRAC motif inhibiting cholesterol binding to CRAC domain and steroidogenesis. To further understand 19-Atriol's mechanism of action as well as the molecular recognition by the TSPO CRAC motif, we undertook structure-activity relationship (SAR) analysis of the 19-Atriol molecule with a variety of substituted steroids oxygenated at positions around the steroid backbone. We found that in addition to steroids hydroxylated at carbon C19, hydroxylations at C4, C7, and C11 contributed to inhibition of cAMP-mediated steroidogenesis in a minimal steroidogenic cell model. However, only substituted steroids with C19 hydroxylations exhibited specificity to TSPO, its CRAC motif, and mitochondrial cholesterol transport, as the C4, C7, and C11 hydroxylated steroids inhibited the metabolic transformation of cholesterol by CYP11A1. We thus provide new insights into structure-activity relationships of steroids inhibiting mitochondrial cholesterol transport and steroidogenic cholesterol metabolic enzymes. Copyright © 2012 Elsevier Inc. All rights reserved.
NASA Technical Reports Server (NTRS)
Childs-Disney, Jessica L. (Inventor); Disney, Matthew D. (Inventor)
2017-01-01
Disclosed are methods for identifying a nucleic acid (e.g., RNA, DNA, etc.) motif which interacts with a ligand. The method includes providing a plurality of ligands immobilized on a support, wherein each particular ligand is immobilized at a discrete location on the support; contacting the plurality of immobilized ligands with a nucleic acid motif library under conditions effective for one or more members of the nucleic acid motif library to bind with the immobilized ligands; and identifying members of the nucleic acid motif library that are bound to a particular immobilized ligand. Also disclosed are methods for selecting, from a plurality of candidate ligands, one or more ligands that have increased likelihood of binding to a nucleic acid molecule comprising a particular nucleic acid motif, as well as methods for identifying a nucleic acid which interacts with a ligand.
NASA Technical Reports Server (NTRS)
Sassanfar, M.; Szostak, J. W.
1993-01-01
RNAs that contain specific high-affinity binding sites for small molecule ligands immobilized on a solid support are present at a frequency of roughly one in 10^10 to 10^11 in pools of random sequence RNA molecules. Here we describe a new in vitro selection procedure designed to ensure the isolation of RNAs that bind the ligand of interest in solution as well as on a solid support. We have used this method to isolate a remarkably small RNA motif that binds ATP, a substrate in numerous biological reactions and the universal biological high-energy intermediate. The selected ATP-binding RNAs contain a consensus sequence, embedded in a common secondary structure. The binding properties of ATP analogues and modified RNAs show that the binding interaction is characterized by a large number of close contacts between the ATP and RNA, and by a change in the conformation of the RNA.
Characterization of substrate binding of the WW domains in human WWP2 protein.
Jiang, Jiahong; Wang, Nan; Jiang, Yafei; Tan, Hongwei; Zheng, Jimin; Chen, Guangju; Jia, Zongchao
2015-07-08
WW domains harbor substrates containing proline-rich motifs, but the substrate specificity and binding mechanism remain elusive for those WW domains less amenable for structural studies, such as human WWP2 (hWWP2). Herein we have employed multiple techniques to investigate the second WW domain (WW2) in hWWP2. Our results show that hWWP2 is a specialized E3 for PPxY motif-containing substrates only and does not recognize other amino acids and phospho-residues. The strongest binding affinity of WW2, and the incompatibility between each WW domain, imply a novel relationship, and our SPR experiment reveals a dynamic binding mode in Class-I WW domains for the first time. The results from alanine-scanning mutagenesis and modeling further point to functionally conserved residues in WW2. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Doxey, Andrew C; Cheng, Zhenyu; Moffatt, Barbara A; McConkey, Brendan J
2010-08-03
Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB) for coplanar aromatic motifs similar to those found in known glycan-binding proteins. The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO) enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192) in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3D structure.
Farhan, Hesso; Reiterer, Veronika; Kriz, Alexander; Hauri, Hans-Peter; Pavelka, Margit; Sitte, Harald H.; Freissmuth, Michael
2015-01-01
The C-terminus of GABA transporter 1 (GAT1, SLC6A1) is required for trafficking of the protein through the secretory pathway to reach its final destination, i.e. the rim of the synaptic specialization. We identified a motif of three hydrophobic residues (569VMI571) that was required for export of GAT1 from the ER-Golgi intermediate compartment (ERGIC). This conclusion was based on the following observations: (i) GAT1-SSS, the mutant in which 569VMI571 was replaced by serine residues, was exported from the ER in a COPII-dependent manner but accumulated in punctate structures and failed to reach the Golgi; (ii) under appropriate conditions (imposing a block at 15°C, disruption of COPI), these structures also contained ERGIC53; (iii) the punctae were part of a dynamic compartment, because it was accessible to a second anterograde cargo [the temperature-sensitive variant of vesicular stomatitis virus G protein (VSV-G)] and because GAT1-SSS could be retrieved from the punctate structures by addition of a KKxx-based retrieval motif, which supported retrograde transport to the ER. To the best of our knowledge, the VMI-motif of GAT1 provides the first example of a cargo-based motif that specifies export from the ERGIC. PMID:18285449
Genome-wide identification of the SWEET gene family in wheat.
Gao, Yue; Wang, Zi Yuan; Kumar, Vikranth; Xu, Xiao Feng; Yuan, De Peng; Zhu, Xiao Feng; Li, Tian Ya; Jia, Baolei; Xuan, Yuan Hu
2018-02-05
The SWEET (sugars will eventually be exported transporter) family is a newly characterized group of sugar transporters. In plants, the key roles of SWEETs in phloem transport, nectar secretion, pollen nutrition, stress tolerance, and plant-pathogen interactions have been identified. SWEET family genes have been characterized in many plant species, but a comprehensive analysis of SWEET members has not yet been performed in wheat. Here, 59 wheat SWEETs (hereafter TaSWEETs) were identified through homology searches. Analyses of phylogenetic relationships, numbers of transmembrane helices (TMHs), gene structures, and motifs showed that TaSWEETs carrying 3-7 TMHs could be classified into four clades with 10 different types of motifs. Examination of the expression patterns of 18 SWEET genes revealed that a few are tissue-specific while most are ubiquitously expressed. In addition, the stem rust-mediated expression patterns of SWEET genes were monitored using a stem rust-susceptible cultivar, 'Little Club' (LC). The resulting data showed that the expression of five out of the 18 SWEETs tested was induced following inoculation. In conclusion, we provide the first comprehensive analysis of the wheat SWEET gene family. Information regarding the phylogenetic relationships, gene structures, and expression profiles of SWEET genes in different tissues and following stem rust disease inoculation will be useful in identifying the potential roles of SWEETs in specific developmental and pathogenic processes. Copyright © 2017 Elsevier B.V. All rights reserved.
Audit, Benjamin; Zaghloul, Lamia; Baker, Antoine; Arneodo, Alain; Chen, Chun-Long; d'Aubenton-Carafa, Yves; Thermes, Claude
2013-01-01
In higher eukaryotes, the absence of specific sequence motifs marking the origins of replication has been a serious hindrance to the understanding of (i) the mechanisms that regulate the spatio-temporal replication program, and (ii) the links between origin activation, chromatin structure and transcription. In this chapter, we review the partitioning of the human genome into megabase-sized replication domains delineated as N-shaped motifs in the strand compositional asymmetry profiles. They collectively span 28.3% of the genome and are bordered by more than 1,000 putative replication origins. We recapitulate the comparison of this partition of the human genome with high-resolution experimental data that confirms that replication domain borders are likely to be preferential replication initiation zones in the germline. In addition, we highlight the specific distribution of experimental and numerical chromatin marks along replication domains. Domain borders correspond to particular open chromatin regions, possibly encoded in the DNA sequence, and around which replication and transcription are highly coordinated. These regions also present a high evolutionary breakpoint density, suggesting that susceptibility to breakage might be linked to local open chromatin fiber state. Altogether, this chapter presents a compartmentalization of the human genome into replication domains that are landmarks of the human genome organization and are likely to play a key role in genome dynamics during evolution and in pathological situations.
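The strand compositional asymmetry profile mentioned in this abstract can be illustrated with a short sketch. The abstract does not give the formula, so the definition used here, S = (T-A)/(T+A) + (G-C)/(G+C) computed over non-overlapping windows, is an assumption based on the usual form of nucleotide skew analysis; the function name `skew_profile` and the window size are hypothetical.

```python
def skew_profile(seq, window=1000):
    """Return the total skew S = S_TA + S_GC for consecutive windows.

    Assumed definitions (not stated in the abstract):
      S_TA = (T - A) / (T + A),  S_GC = (G - C) / (G + C)
    """
    seq = seq.upper()
    profile = []
    # Step through the sequence in non-overlapping windows; a trailing
    # partial window is ignored.
    for start in range(0, len(seq) - window + 1, window):
        w = seq[start:start + window]
        a, c, g, t = (w.count(base) for base in "ACGT")
        s_ta = (t - a) / (t + a) if (t + a) else 0.0
        s_gc = (g - c) / (g + c) if (g + c) else 0.0
        profile.append(s_ta + s_gc)
    return profile
```

In a profile of this kind, an N-shaped domain would appear as a sharp upward jump at each border followed by a roughly linear decrease across the domain; detecting such shapes is beyond this sketch.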
Exploitation of peptide motif sequences and their use in nanobiotechnology.
Shiba, Kiyotaka
2010-08-01
Short amino acid sequences extracted from natural proteins or created using in vitro evolution systems are sometimes associated with particular biological functions. These peptides, called peptide motifs, can serve as functional units for the creation of various tools for nanobiotechnology. In particular, peptide motifs that have the ability to specifically recognize the surfaces of solid materials and to mineralize certain inorganic materials have been linking biological science to material science. Here, I review how these peptide motifs have been isolated from natural proteins or created using in vitro evolution systems, and how they have been used in the nanobiotechnology field. Copyright © 2010 Elsevier Ltd. All rights reserved.
DNA nanotechnology based on i-motif structures.
Dong, Yuanchen; Yang, Zhongqiang; Liu, Dongsheng
2014-06-17
CONSPECTUS: Most biological processes happen at the nanometer scale, and understanding the energy transformations and material transportation mechanisms within living organisms has proved challenging. To better understand the secrets of life, researchers have investigated artificial molecular motors and devices over the past decade because such systems can mimic certain biological processes. DNA nanotechnology based on i-motif structures is one system that has played an important role in these investigations. In this Account, we summarize recent advances in functional DNA nanotechnology based on i-motif structures. The i-motif is a DNA quadruplex that occurs when four stretches of cytosine repeat sequences form C·CH(+) base pairs, and its stabilization requires slightly acidic conditions. This unique property has produced the first DNA molecular motor driven by pH changes. The motor is reliable, and studies show that it is capable of millisecond running speeds, comparable to the speed of natural protein motors. With careful design, the output of these types of motors was combined to drive the bending of micrometer-sized cantilevers. Using established DNA nanostructure assembly and functionalization methods, researchers can easily integrate the motor within other DNA assembled structures and functional units, producing DNA molecular devices with new functions such as suprahydrophobic/suprahydrophilic smart surfaces that switch, intelligent nanopores triggered by pH changes, molecular logic gates, and DNA nanosprings. Recently, researchers have produced motors driven by light and electricity, which have allowed DNA motors to be integrated within silicon-based nanodevices. Moreover, some devices based on i-motif structures have proven useful for investigating processes within living cells. The pH-responsiveness of the i-motif structure also provides a way to control the stepwise assembly of DNA nanostructures.
In addition, because of the stability of the i-motif, this structure can serve as the stem of one-dimensional nanowires, and a four-strand stem can provide a new basis for three-dimensional DNA structures such as pillars. By sacrificing some accuracy in assembly, we used these properties to prepare the first fast-responding pure DNA supramolecular hydrogel. This hydrogel does not swell and cannot encapsulate small molecules. These unique properties could lead to new developments in smart materials based on DNA assembly and support important applications in fields such as tissue engineering. We expect that DNA nanotechnology will continue to develop rapidly. At a fundamental level, further studies should lead to greater understanding of the energy transformation and material transportation mechanisms at the nanometer scale. In terms of applications, we expect that many of these elegant molecular devices will soon be used in vivo. These further studies could demonstrate the power of DNA nanotechnology in biology, material science, chemistry, and physics.
Lu, Ruipeng; Mucaki, Eliseos J; Rogan, Peter K
2017-03-17
Data from ChIP-seq experiments can derive the genome-wide binding specificities of transcription factors (TFs) and other regulatory proteins. We analyzed 765 ENCODE ChIP-seq peak datasets of 207 human TFs with a novel motif discovery pipeline based on recursive, thresholded entropy minimization. This approach, while obviating the need to compensate for skewed nucleotide composition, distinguishes true binding motifs from noise, quantifies the strengths of individual binding sites based on computed affinity and detects adjacent cofactor binding sites that coordinate with the targets of primary, immunoprecipitated TFs. We obtained contiguous and bipartite information theory-based position weight matrices (iPWMs) for 93 sequence-specific TFs, discovered 23 cofactor motifs for 127 TFs and revealed six high-confidence novel motifs. The reliability and accuracy of these iPWMs were determined via four independent validation methods, including the detection of experimentally proven binding sites, explanation of effects of characterized SNPs, comparison with previously published motifs and statistical analyses. We also predict previously unreported TF coregulatory interactions (e.g. TF complexes). These iPWMs constitute a powerful tool for predicting the effects of sequence variants in known binding sites, performing mutation analysis on regulatory SNPs and predicting previously unrecognized binding sites and target genes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
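As a rough illustration of what an information-theory-based weight matrix measures, the sketch below computes per-position information content (in bits) from a set of aligned binding sites. It is not the recursive, thresholded entropy minimization pipeline described in the abstract; it assumes a uniform background, applies no small-sample correction, and the name `information_content` is hypothetical.

```python
import math
from collections import Counter


def information_content(sites):
    """Per-position information (bits) for equal-length aligned DNA sites.

    Uses R_i = 2 - H_i, where H_i is the Shannon entropy of the base
    frequencies at position i (uniform background assumed).
    """
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    bits = []
    for i in range(length):
        counts = Counter(s[i].upper() for s in sites)
        n = sum(counts.values())
        entropy = -sum((c / n) * math.log2(c / n) for c in counts.values())
        bits.append(2.0 - entropy)
    return bits
```

A fully conserved position carries 2 bits and a position where all four bases are equally frequent carries 0 bits, so summing the list gives a simple estimate of how specific the motif is.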
Li, Wan; Chen, Lina; Li, Xia; Jia, Xu; Feng, Chenchen; Zhang, Liangcai; He, Weiming; Lv, Junjie; He, Yuehan; Li, Weiguo; Qu, Xiaoli; Zhou, Yanyan; Shi, Yuchen
2013-12-01
Network motifs in central positions are considered to not only have more in-coming and out-going connections but are also localized in an area where more paths reach the networks. These central motifs have been extensively investigated to determine their consistent functions or associations with specific function categories. However, their functional potentials in the maintenance of cross-talk between different functional communities are unclear. In this paper, we constructed an integrated human signaling network from the Pathway Interaction Database. We identified 39 essential cancer-related motifs in central roles, which we called cancer-related marketing centrality motifs, using combined centrality indices on the system level. Our results demonstrated that these cancer-related marketing centrality motifs were pivotal units in the signaling network, and could mediate cross-talk between 61 biological pathways (25 could be mediated by one motif on average), most of which were cancer-related pathways. Further analysis showed that molecules of most marketing centrality motifs were in the same or adjacent subcellular localizations, such as the motif containing PI3K, PDK1 and AKT1 in the plasma membrane, to mediate signal transduction between 32 cancer-related pathways. Finally, we analyzed the pivotal roles of cancer genes in these marketing centrality motifs in the pathogenesis of cancers, and found that non-cancer genes were potential cancer-related genes.
Functionalizing Designer DNA Crystals
NASA Astrophysics Data System (ADS)
Chandrasekaran, Arun Richard
Three-dimensional crystals have been self-assembled from a DNA tensegrity triangle via sticky end interaction. The tensegrity triangle is a rigid DNA motif containing three double helical edges connected pair-wise by three four-arm junctions. The symmetric triangle contains 3 unique strands combined in a 3:3:1 ratio: 3 crossover, 3 helical and 1 central. The length of the sticky end reported previously was two nucleotides (nt) (GA:TC) and the motif with 2-helical turns of DNA per edge diffracted to 4.9 Å at beam line NSLS-X25 and to 4 Å at beam line ID19 at APS. The purpose of these self-assembled DNA crystals is that they can be used as a framework for hosting external guests for use in crystallographic structure solving or the periodic positioning of molecules for nanoelectronics. This thesis describes strategies to improve the resolution and to incorporate guests into the 3D lattice. The first chapter describes the effect of varying sticky end lengths and the influence of 5'-phosphate addition on crystal formation and resolution. X-ray diffraction data from beam line NSLS-X25 revealed that the crystal resolution for 1-nt (G:C) sticky end was 3.4 Å. Motifs with every possible combination of 1-nt and 2-nt sticky-ended phosphorylated strands were crystallized and X-ray data were collected. The position of the 5'-phosphate on either the crossover (strand 1), helical (strand 2), or central strand (3) had an impact on the resolution of the self-assembled crystals with the 1-nt 1P-2-3 system diffracting to 2.62 Å at APS and 3.1 Å at NSLS-X25. The second chapter describes the sequence-specific recognition of DNA motifs with triplex-forming oligonucleotides (TFOs). This study examined the feasibility of using TFOs to bind to specific locations within a 3-turn DNA tensegrity triangle motif. The TFO 5'-TTCTTTCTTCTCT was used to target the tensegrity motif containing an appropriately embedded oligopurine·oligopyrimidine binding site.
As triplex formation involving cytidine nucleotides is usually pH dependent (pH < 6) four different TFOs were examined: TFO-1 was unmodified while TFOs 2-4 contained additional stabilizing analogues capable of extending triplex formation to pH 7. In addition, each of the TFOs contained a Cy5 dye at the 5'-end of the oligonucleotide to aid in characterization of TFO binding - crystals were obtained with all four variations of TFOs. Formation of DNA triplex in the motif was characterized by an electrophoretic mobility shift assay (EMSA), UV melting studies and FRET. Crystals containing TFO-1 (unmodified) and TFO-2 (with 2'-amino ethoxy modification) were isolated and flash-frozen in liquid nitrogen for X-ray data collection at beam line NSLS-X25. X-ray data was also collected for crystals of the 3-turn triangle without any TFO bound to it. Difference maps were done between the crystals with TFO against the one without to identify any additional electron density corresponding to the third strand in the triplex binding region. The data from the crystal containing TFO-2 was used to further analyze if the additional density can match the expected position of the TFO on the triangle motif. Since the additional density did not correspond to the entire binding region, 2Fo-Fc, 3Fo-2Fc and 4Fo-3Fc maps were done to check for missing pieces of the electron density. From the resulting 2Fo-Fc map, the asymmetric unit from the 3-turn triangle (31-bp duplex model based on previous structure 3UBI) was inserted into the density as a reference. However, the electron density corresponding to the TFO was still not continuous throughout the 13-nt triplex binding region and allowed only a partial fit of the TFO. The third nucleotide in positions 1, 3, 4, 6, 7 were fit into the density in the major groove of the underlying duplex with proper triplex configuration. 
The third chapter describes the triplex approach to position a functional group (the UV cross-linking agent psoralen) within a pre-formed DNA motif. Triplex formation and psoralen cross-linking of the motif were analyzed by native and denaturing gel electrophoresis respectively. Motifs containing the Psoralen-TFO were also successfully crystallized and the crosslinking shown by analyzing the denatured crystals on a gel. The end goal would be to form a crosslinked designed DNA crystal that can diffract to a higher resolution. The fourth chapter describes the use of serial femtosecond crystallography for structure determination of designed DNA lattices. X-ray diffraction data from self-assembled 3D DNA microcrystals were collected from a stream of crystals in solution. Serial femtosecond crystallography eliminates the need for large crystals and the need for freezing, thus overcoming any associated crystal defects and radiation damage. Self-assembled nano/microcrystals were successfully made and were diffracted at room temperature. The best diffraction was from the 1-nt SE motif to an extent of 3.5 Å in resolution.
Leisy, D.J.; Rasmussen, C.; Owusu, E.O.; Rohrmann, G.F.
1997-01-01
The Autographa californica multinucleocapsid nuclear polyhedrosis virus (AcMNPV) ie-1 gene product (IE-1) is thought to play a central role in stimulating early viral transcription. IE-1 has been demonstrated to activate several early viral gene promoters and to negatively regulate the promoters of two other AcMNPV regulatory genes, ie-0 and ie-2. Our results indicate that IE-1 negatively regulates the expression of certain genes by binding directly, or as part of a complex, to promoter regions containing a specific IE-1-binding motif (5'-ACBYGTAA-3') near their mRNA start sites. The IE-1 binding motif was also found within the palindromic sequences of AcMNPV homologous repeat (hr) regions that have been shown to bind IE-1. The role of this IE-1 binding motif in the regulation of the ie-2 and pe-38 promoters was examined by introducing mutations in these promoters in which the central 6 bp were replaced with BglII sites. GUS reporter constructs containing ie-2 and pe-38 promoter fragments with and without these specific mutations were cotransfected into Sf9 cells with various amounts of an ie-1-containing plasmid (pIe-1). Comparisons of GUS expression produced by the mutant and wild-type constructs demonstrated that the IE-1 binding motif mediated a significant decrease in expression from the ie-2 and pe-38 promoters in response to increasing pIe-1 concentrations. Electrophoretic mobility shift assays with pIe-1-transfected cell extracts and supershift assays with IE-1-specific antiserum demonstrated that IE-1 binds to promoter fragments containing the IE-1 binding motif but does not bind to promoter fragments lacking this motif.
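The motif 5'-ACBYGTAA-3' is written with IUPAC degenerate codes (B = C, G or T; Y = C or T). A minimal sketch of how such a motif could be located in a promoter sequence is shown below; it scans only the given strand, and the helper names `motif_regex` and `find_motif` are hypothetical, not part of any tool used in the study.

```python
import re

# Standard IUPAC nucleotide ambiguity codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def motif_regex(motif):
    """Compile an IUPAC motif into a regex; the lookahead allows overlapping hits."""
    pattern = "".join(IUPAC[base] for base in motif.upper())
    return re.compile("(?=(" + pattern + "))")


def find_motif(seq, motif="ACBYGTAA"):
    """Return 0-based start positions of the motif on the given strand."""
    return [m.start() for m in motif_regex(motif).finditer(seq.upper())]
```

For example, `find_motif("TTACGCGTAATT")` reports a hit at position 2, whereas a site with A at the B position, such as `ACACGTAA`, is not matched. Scanning the reverse complement as well would be needed to cover both strands.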
ITS2 sequence-structure phylogeny reveals diverse endophytic Pseudocercospora fungi on poplars.
Yan, Dong-Hui; Gao, Qian; Sun, Xiaoming; Song, Xiaoyu; Li, Hongchang
2018-04-01
To match the new fungal nomenclature, which abolishes pleomorphic names for a single fungus, the genus Pseudocercospora s. str. was proposed to accommodate holomorphic Pseudocercospora fungi. However, additional phylogenetic loci are needed to clarify the taxonomy and diversity of existing and newly described Pseudocercospora species. Internal transcribed spacer 2 (ITS2) secondary structures have shown promise for characterizing species phylogeny in plants, animals and fungi. In the present study, a conserved model of ITS2 secondary structure was confirmed for fungi in the genus Pseudocercospora s. str. using the RNAshape program. The model has the typical eukaryotic four-helix ITS2 secondary structure. However, in Pseudocercospora fungi a single U base occurred in the conserved U-U mismatch motif of Helix 2, and a UG occurred in the UGGU motif of Helix 3. The phylogenetic analyses based on ITS2 sequence-secondary structures with compensatory base change characterizations were able to delimit more species in Pseudocercospora s. str. than phylogenetic inferences from traditional multi-locus alignments. The model was then used to explore the diversity of endophytic Pseudocercospora fungi in poplar trees. The results showed that endophytic Pseudocercospora fungi were diverse in species and had evolved a specific lineage in poplar trees. This work suggests that ITS2 sequence-structures could serve as additional significant loci for phylogenetic and taxonomic studies of Pseudocercospora fungi, and that Pseudocercospora endophytes could play important roles in the evolution and ecological function of Pseudocercospora fungi.
Fukutomi, Toshiaki; Takagi, Kenji; Mizushima, Tsunehiro; Ohuchi, Noriaki
2014-01-01
Transcription factor Nrf2 (NF-E2-related factor 2) coordinately regulates cytoprotective gene expression, but under unstressed conditions, Nrf2 is degraded rapidly through Keap1 (Kelch-like ECH-associated protein 1)-mediated ubiquitination. Nrf2 harbors two Keap1-binding motifs, DLG and ETGE. Interactions between these two motifs and Keap1 constitute a key regulatory nexus for cellular Nrf2 activity through the formation of a two-site binding hinge-and-latch mechanism. In this study, we determined the minimum Keap1-binding sequence of the DLG motif, the low-affinity latch site, and defined a new DLGex motif that covers a sequence much longer than that previously defined. We have successfully clarified the crystal structure of the Keap1-DC-DLGex complex at 1.6 Å. DLGex possesses a complicated helix structure, which interprets well the human-cancer-derived loss-of-function mutations in DLGex. In thermodynamic analyses, Keap1-DLGex binding is characterized as enthalpy and entropy driven, while Keap1-ETGE binding is characterized as purely enthalpy driven. In kinetic analyses, Keap1-DLGex binding follows a fast-association and fast-dissociation model, while Keap1-ETGE binding contains a slow-reaction step that leads to a stable conformation. These results demonstrate that the mode of DLGex binding to Keap1 is distinct from that of ETGE structurally, thermodynamically, and kinetically and support our contention that the DLGex motif serves as a converter transmitting environmental stress to Nrf2 induction as the latch site. PMID:24366543
Deletion of transcription factor binding motifs using the CRISPR/spCas9 system in the β-globin LCR.
Kim, Yea Woon; Kim, AeRi
2017-07-20
Transcription factors play roles in gene transcription through direct binding to their motifs in the genome, and inhibiting this binding provides an effective strategy for studying their roles. Here we applied the CRISPR/spCas9 system to mutate the binding motifs of transcription factors. Binding motifs for erythroid-specific transcription factors were mutated in the locus control region hypersensitive sites of the human β-globin locus. Guide RNAs targeting binding motifs were cloned into a lentiviral CRISPR vector containing the spCas9 gene, and transduced into MEL/ch11 cells carrying a human chromosome 11. DNA mutations in clonal cells were initially screened by quantitative PCR in genomic DNA and then clarified by sequencing. Mutations in binding motifs reduced occupancy by transcription factors in a chromatin environment. Characterization of mutations revealed that the CRISPR/spCas9 system mainly induced deletions in short regions of <20 bp and preferentially deleted nucleotides around the fifth nucleotide upstream of protospacer adjacent motifs. These results indicate that the CRISPR/Cas9 system is suitable for mutating the binding motifs of transcription factors, and, consequently, would contribute to elucidating the direct roles of transcription factors. ©2017 The Author(s).
ERIC Educational Resources Information Center
Morin, Erica A.
2013-01-01
As a graduate instructor for HIST 152: United States Since 1877, the author structures the entire course around the motif of the newspaper. She models her curriculum after the newspaper both visually and symbolically and uses it as a theme throughout the class. The newspaper is not a gimmick or cliche, but rather a recurring stylistic theme, an…
Rigoutsos, Isidore; Riek, Peter; Graham, Robert M.; Novotny, Jiri
2003-01-01
One of the promising methods of protein structure prediction involves the use of amino acid sequence-derived patterns. Here we report on the creation of non-degenerate motif descriptors derived through data mining of training sets of residues taken from the transmembrane-spanning segments of polytopic proteins. These residues correspond to short regions in which there is a deviation from the regular α-helical character (i.e. π-helices, 3₁₀-helices and kinks). A ‘search engine’ derived from these motif descriptors correctly identifies, and discriminates amongst instances of the above ‘non-canonical’ helical motifs contained in the SwissProt/TrEMBL database of protein primary structures. Our results suggest that deviations from α-helicity are encoded locally in sequence patterns only about 7–9 residues long and can be determined in silico directly from the amino acid sequence. Delineation of such variations in helical habit is critical to understanding the complex structure–function relationships of polytopic proteins and for drug discovery. The success of our current methodology foretells development of similar prediction tools capable of identifying other structural motifs from sequence alone. The method described here has been implemented and is available on the World Wide Web at http://cbcsrv.watson.ibm.com/Ttkw.html. PMID:12888523
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bade-Döding, Christina; Theodossis, Alex; Gras, Stephanie
2011-09-28
Polymorphic differences between human leukocyte antigen (HLA) molecules affect the specificity and conformation of their bound peptides and lead to differential selection of the T-cell repertoire. Mismatching during allogeneic transplantation can, therefore, lead to immunological reactions. We investigated the structure-function relationships of six members of the HLA-B*41 allelic group that differ by six polymorphic amino acids, including positions 80, 95, 97 and 114 within the antigen-binding cleft. Peptide-binding motifs for B*41:01, *41:02, *41:03, *41:04, *41:05 and *41:06 were determined by sequencing self-peptides from recombinant B*41 molecules by electrospray ionization tandem mass spectrometry. The crystal structures of HLA-B*41:03 bound to a natural 16-mer self-ligand (AEMYGSVTEHPSPSPL) and HLA-B*41:04 bound to a natural 11-mer self-ligand (HEEAVSVDRVL) were solved. Peptide analysis revealed that all B*41 alleles have an identical anchor motif at peptide position 2 (glutamic acid), but differ in their choice of C-terminal pΩ anchor (proline, valine, leucine). Additionally, B*41:04 displayed a greater preference for long peptides (>10 residues) when compared to the other B*41 allomorphs, while the longest peptide to be eluted from the allelic group (a 16mer) was obtained from B*41:03. The crystal structures of HLA-B*41:03 and HLA-B*41:04 revealed that both alleles interact in a highly conserved manner with the terminal regions of their respective ligands, while micropolymorphism-induced changes in the steric and electrostatic properties of the antigen-binding cleft account for differences in peptide repertoire and auxiliary anchoring.
Differences in peptide repertoire, and peptide length specificity reflect the significant functional evolution of these closely related allotypes and signal their importance in allogeneic transplantation, especially B*41:03 and B*41:04, which accommodate longer peptides, creating structurally distinct peptide-HLA complexes.
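The anchor pattern described in this abstract (glutamic acid at peptide position 2, and proline, valine or leucine at the C-terminus) can be expressed as a simple filter. This is only an illustrative sketch: the abstract notes that the alleles differ in which C-terminal residue they prefer, so the allowed set is left as a parameter, and the function name `has_b41_anchors` is hypothetical.

```python
def has_b41_anchors(peptide, c_terminal=("P", "V", "L")):
    """Check for Glu at position 2 and an allowed residue at the C-terminus.

    `c_terminal` defaults to the union of anchors named in the abstract;
    narrow it to model a single allele's preference.
    """
    pep = peptide.upper()
    return len(pep) >= 2 and pep[1] == "E" and pep[-1] in c_terminal
```

Both crystallized self-ligands, AEMYGSVTEHPSPSPL and HEEAVSVDRVL, satisfy the check, since each has glutamic acid at position 2 and leucine at the C-terminus.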
Gregor, Craig R.; Cerasoli, Eleonora; Schouten, James; Ravi, Jascindra; Slootstra, Jerry; Horgan, Adrian; Martyna, Glenn J.; Ryadnov, Maxim G.; Davis, Paul; Crain, Jason
2011-01-01
Human chorionic gonadotropin (hCG) is an important biomarker in pregnancy and oncology, where it is routinely detected and quantified by specific immunoassays. Intelligent epitope selection is essential to achieving the required assay performance. We present binding affinity measurements demonstrating that a typical β3-loop-specific monoclonal antibody (8G5) is highly selective in competitive immunoassays and distinguishes between hCGβ66–80 and the closely related luteinizing hormone (LH) fragment LHβ86–100, which differ only by a single amino acid residue. A combination of optical spectroscopic measurements and atomistic computer simulations on these free peptides reveals differences in turn type stabilized by specific hydrogen bonding motifs. We propose that these structural differences are the basis for the observed selectivity in the full protein. PMID:21592960
A peptide affinity column for the identification of integrin alpha IIb-binding proteins.
Daxecker, Heide; Raab, Markus; Bernard, Elise; Devocelle, Marc; Treumann, Achim; Moran, Niamh
2008-03-01
To understand the regulation of integrin alpha(IIb)beta(3), a critical platelet adhesion molecule, we have developed a peptide affinity chromatography method using the known integrin regulatory motif, LAMWKVGFFKR. Using standard Fmoc chemistry, this peptide was synthesized onto a Toyopearl AF-Amino-650 M resin on a 6-aminohexanoic acid (Ahx) linker. Peptide density was controlled by acetylation of 83% of the Ahx amino groups. Four recombinant human proteins (CIB1, PP1, ICln and RN181), previously identified as binding to this integrin regulatory motif, were specifically retained by the column containing the integrin peptide but not by a column presenting an irrelevant peptide. Hemoglobin, creatine kinase, bovine serum albumin, fibrinogen and alpha-tubulin failed to bind under the chosen conditions. Immunodetection methods confirmed the binding of endogenous platelet proteins, including CIB1, PP1, ICln, RN181, AUP-1 and beta3-integrin, from a detergent-free platelet lysate. Thus, we describe a reproducible method that facilitates the reliable extraction of specific integrin-binding proteins from complex biological matrices. This methodology may enable the sensitive and specific identification of proteins that interact with linear, membrane-proximal peptide motifs such as the integrin regulatory motif LAMWKVGFFKR.
Specific material recognition by small peptides mediated by the interfacial solvent structure.
Schneider, Julian; Ciacchi, Lucio Colombi
2012-02-01
We present evidence that specific material recognition by small peptides is governed by local solvent density variations at solid/liquid interfaces, sensed by the side-chain residues with atomic-scale precision. In particular, we unveil the origin of the selectivity of the binding motif RKLPDA for Ti over Si using a combination of metadynamics and steered molecular dynamics simulations, obtaining adsorption free energies and adhesion forces in quantitative agreement with corresponding experiments. For an accurate description, we employ realistic models of the natively oxidized surfaces which go beyond the commonly used perfect crystal surfaces. These results have profound implications for nanotechnology and materials science applications, offering a previously missing structure-function relationship for the rational design of materials-selective peptide sequences. © 2011 American Chemical Society
Biomimetic virus-based colourimetric sensors.
Oh, Jin-Woo; Chung, Woo-Jae; Heo, Kwang; Jin, Hyo-Eon; Lee, Byung Yang; Wang, Eddie; Zueger, Chris; Wong, Winnie; Meyer, Joel; Kim, Chuntae; Lee, So-Young; Kim, Won-Geun; Zemla, Marcin; Auer, Manfred; Hexemer, Alexander; Lee, Seung-Wuk
2014-01-01
Many materials in nature change colours in response to stimuli, making them attractive for use as sensor platforms. However, both natural materials and their synthetic analogues lack selectivity towards specific chemicals, and introducing such selectivity remains a challenge. Here we report the self-assembly of genetically engineered viruses (M13 phage) into target-specific, colourimetric biosensors. The sensors are composed of phage-bundle nanostructures and exhibit viewing-angle independent colour, similar to collagen structures in turkey skin. On exposure to various volatile organic chemicals, the structures rapidly swell and undergo distinct colour changes. Furthermore, sensors composed of phage displaying trinitrotoluene (TNT)-binding peptide motifs identified from a phage display selectively distinguish TNT down to 300 p.p.b. over similarly structured chemicals. Our tunable, colourimetric sensors can be useful for the detection of a variety of harmful toxicants and pathogens to protect human health and national security.