Dimeric PROP1 binding to diverse palindromic TAAT sequences promotes its transcriptional activity.
Nakayama, Michie; Kato, Takako; Susa, Takao; Sano, Akiko; Kitahara, Kousuke; Kato, Yukio
2009-08-13
Mutations in the Prop1 gene are responsible for murine Ames dwarfism and human combined pituitary hormone deficiency with hypogonadism. Recently, we reported that PROP1 is a possible transcription factor for gonadotropin subunit genes through plural cis-acting sites composed of AT-rich sequences containing a TAAT motif which differs from its consensus binding sequence known as PRDQ9 (TAATTGAATTA). This study aimed to verify the binding specificity and sequence of PROP1 by applying the method of SELEX (Systematic Evolution of Ligands by EXponential enrichment), EMSA (electrophoretic mobility shift assay) and transient transfection assay. SELEX, after 5, 7 and 9 generations of selection using a random sequence library, showed that nucleotides containing one or two TAAT motifs were accumulated and accounted for 98.5% at the 9th generation. Aligned sequences and EMSA demonstrated that PROP1 binds preferentially to 11 nucleotides composed of an inverted TAAT motif separated by 3 nucleotides with variation in the half site of palindromic TAAT motifs and with preferential requirement of T at the nucleotide number 5 immediately 3' to a TAAT motif. Transient transfection assay demonstrated first that dimeric binding of PROP1 to an inverted TAAT motif and its cognates resulted in transcriptional activation, whereas monomeric binding of PROP1 to a single TAAT motif and an inverted ATTA motif did not mediate activation. Thus, this study demonstrated that dimeric binding of PROP1 is able to recognize diverse palindromic TAAT sequences separated by 3 nucleotides and to exhibit its transcriptional activity.
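The site architecture described above (a TAAT motif, three spacer nucleotides, then the inverted ATTA motif, 11 nucleotides in total, as in PRDQ9) can be illustrated with a short scan. This is a minimal sketch, not the authors' analysis: the function name and the exact-match rule are assumptions, and it ignores the half-site variation that the study reports PROP1 tolerates.

```python
import re

# Assumed pattern: TAAT, any 3 nucleotides, then ATTA (reverse complement of TAAT).
# The lookahead lets overlapping candidate sites be reported as well.
PALINDROME = re.compile(r"(?=(TAAT[ACGT]{3}ATTA))")


def find_palindromic_taat(seq):
    """Return (0-based start, 11-nt site, T_at_position_5) for each exact match.

    T_at_position_5 flags whether the nucleotide immediately 3' to the first
    TAAT motif is a T, the preference noted in the abstract.
    """
    seq = seq.upper()
    hits = []
    for m in PALINDROME.finditer(seq):
        site = m.group(1)
        hits.append((m.start(), site, site[4] == "T"))
    return hits


if __name__ == "__main__":
    # PRDQ9 consensus embedded in arbitrary flanking sequence
    print(find_palindromic_taat("GGTAATTGAATTACC"))
```

A scoring scheme that allows mismatches in one half site would be closer to the binding behaviour reported here, but the tolerated substitutions are not listed in the abstract, so they are left out.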
Evolving nucleotide binding surfaces
NASA Technical Reports Server (NTRS)
Kieber-Emmons, T.; Rein, R.
1981-01-01
An analysis is presented of the stability and nature of binding of a nucleotide to several known dehydrogenases. The employed approach includes calculation of hydrophobic stabilization of the binding motif and its intermolecular interaction with the ligand. The evolutionary changes of the binding motif are studied by calculating the Euclidean deviation of the respective dehydrogenases. Attention is given to the possible structural elements involved in the origin of nucleotide recognition by non-coded primordial polypeptides.
TFBSshape: a motif database for DNA shape features of transcription factor binding sites
Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W.; Gordân, Raluca; Rohs, Remo
2014-01-01
Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein–DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone. PMID:24214955
Deletion of transcription factor binding motifs using the CRISPR/spCas9 system in the β-globin LCR.
Kim, Yea Woon; Kim, AeRi
2017-07-20
Transcription factors regulate gene transcription by binding directly to their motifs in the genome, and inhibiting this binding provides an effective strategy for studying their roles. Here we applied the CRISPR/spCas9 system to mutate the binding motifs of transcription factors. Binding motifs for erythroid-specific transcription factors were mutated in the locus control region hypersensitive sites of the human β-globin locus. Guide RNAs targeting binding motifs were cloned into a lentiviral CRISPR vector containing the spCas9 gene and transduced into MEL/ch11 cells carrying a human chromosome 11. DNA mutations in clonal cells were initially screened by quantitative PCR of genomic DNA and then confirmed by sequencing. Mutations in binding motifs reduced occupancy by transcription factors in a chromatin environment. Characterization of mutations revealed that the CRISPR/spCas9 system mainly induced deletions in short regions of <20 bp and preferentially deleted nucleotides around the fifth nucleotide upstream of protospacer adjacent motifs. These results indicate that the CRISPR/Cas9 system is suitable for mutating the binding motifs of transcription factors and, consequently, would help elucidate the direct roles of transcription factors.
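To make the geometry of a target site concrete, the sketch below lists forward-strand spCas9 candidates (a 20-nt protospacer followed by an NGG PAM) and reports the index of the fifth nucleotide upstream of the PAM, the position around which deletions clustered in this study. The function name, the returned fields and the forward-strand-only search are illustrative assumptions, not the authors' guide-design procedure.

```python
import re


def find_spcas9_sites(seq, protospacer_len=20):
    """List forward-strand protospacer/PAM candidates for an NGG PAM."""
    seq = seq.upper()
    sites = []
    # The lookahead keeps overlapping PAMs, e.g. inside runs of G.
    for m in re.finditer(r"(?=[ACGT]GG)", seq):
        pam_start = m.start()
        if pam_start < protospacer_len:
            continue  # not enough upstream sequence for a full protospacer
        sites.append({
            "protospacer": seq[pam_start - protospacer_len:pam_start],
            "pam": seq[pam_start:pam_start + 3],
            "pam_start": pam_start,
            # 0-based index of the fifth nucleotide upstream of the PAM
            "fifth_upstream": pam_start - 5,
        })
    return sites


if __name__ == "__main__":
    for site in find_spcas9_sites("A" * 20 + "TGG" + "AAAA"):
        print(site)
```

A practical search would also scan the reverse complement and check that the protospacer overlaps the binding motif to be disrupted.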
A Novel Protein Interaction between Nucleotide Binding Domain of Hsp70 and p53 Motif
Elengoe, Asita; Naser, Mohammed Abu; Hamdan, Salehhuddin
2015-01-01
Currently, the interaction of the Homo sapiens nucleotide binding domain (NBD) of heat shock 70 kDa protein (PDB: 1HJO) with the p53 motif remains to be elucidated. The NBD-p53 motif complex enhances p53 stabilization, thereby increasing tumor suppression activity in cancer treatment. Therefore, we identified the interaction between NBD and p53 using the STRING version 9.1 program. Then, we modeled the three-dimensional structure of the p53 motif through homology modeling and determined the binding affinity and stability of the NBD-p53 motif complex structure via molecular docking and molecular dynamics (MD) simulation. The human DNA binding domain of the p53 motif (SCMGGMNR), retrieved from UniProt (UniProtKB: P04637), was docked with the NBD protein using the Autodock version 4.2 program. The binding energy and intermolecular energy for the NBD-p53 motif complex were −0.44 kcal/mol and −9.90 kcal/mol, respectively. Moreover, RMSD, RMSF, hydrogen bond, salt bridge, and secondary structure analyses revealed that the NBD protein had a strong bond with the p53 motif and that the protein-ligand complex was stable. Thus, the current data would be highly encouraging for designing Hsp70 structure-based drugs for cancer therapy. PMID:26098630
Jauch, Ralf; Ng, Calista K L; Narasimhan, Kamesh; Kolatkar, Prasanna R
2012-04-01
It has recently been proposed that the sequence preferences of DNA-binding TFs (transcription factors) can be well described by models that include the positional interdependence of the nucleotides of the target sites. Such binding models allow for multiple motifs to be invoked, such as principal and secondary motifs differing at two or more nucleotide positions. However, the structural mechanisms underlying the accommodation of such variant motifs by TFs remain elusive. In the present study we examine the crystal structure of the HMG (high-mobility group) domain of Sox4 [Sry (sex-determining region on the Y chromosome)-related HMG box 4] bound to DNA. By comparing this structure with previously solved structures of Sox17 and Sox2, we observed subtle conformational differences at the DNA-binding interface. Furthermore, using quantitative electrophoretic mobility-shift assays we validated the positional interdependence of two nucleotides and the presence of a secondary Sox motif in the affinity landscape of Sox4. These results suggest that a concerted rearrangement of two interface amino acids enables Sox4 to accommodate primary and secondary motifs. The structural adaptations lead to altered dinucleotide preferences that mutually reinforce each other. These analyses underline the complexity of the DNA recognition by TFs and provide an experimental validation for the conceptual framework of positional interdependence and secondary binding motifs.
Netz, Daili J. A.; Pierik, Antonio J.; Stümpfig, Martin; Bill, Eckhard; Sharma, Anil K.; Pallesen, Leif J.; Walden, William E.; Lill, Roland
2012-01-01
The essential P-loop NTPases Cfd1 and Nbp35 of the cytosolic iron-sulfur (Fe-S) protein assembly machinery perform a scaffold function for Fe-S cluster synthesis. Both proteins contain a nucleotide binding motif of unknown function and a C-terminal motif with four conserved cysteine residues. The latter motif defines the Mrp/Nbp35 subclass of P-loop NTPases and is suspected to be involved in transient Fe-S cluster binding. To elucidate the function of these two motifs, we first created cysteine mutant proteins of Cfd1 and Nbp35 and investigated the consequences of these mutations by genetic, cell biological, biochemical, and spectroscopic approaches. The two central cysteine residues (CPXC) of the C-terminal motif were found to be crucial for cell viability, protein function, coordination of a labile [4Fe-4S] cluster, and Cfd1-Nbp35 hetero-tetramer formation. Surprisingly, the two proximal cysteine residues were dispensable for all these functions, despite their strict evolutionary conservation. Several lines of evidence suggest that the C-terminal CPXC motifs of Cfd1-Nbp35 coordinate a bridging [4Fe-4S] cluster. Upon mutation of the nucleotide binding motifs Fe-S clusters could no longer be assembled on these proteins unless wild-type copies of Cfd1 and Nbp35 were present in trans. This result indicated that Fe-S cluster loading on these scaffold proteins is a nucleotide-dependent step. We propose that the bridging coordination of the C-terminal Fe-S cluster may be ideal for its facile assembly, labile binding, and efficient transfer to target Fe-S apoproteins, a step facilitated by the cytosolic iron-sulfur (Fe-S) protein assembly proteins Nar1 and Cia1 in vivo. PMID:22362766
A Comparison Study for DNA Motif Modeling on Protein Binding Microarray.
Wong, Ka-Chun; Li, Yue; Peng, Chengbin; Wong, Hau-San
2016-01-01
Transcription factor binding sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, protein binding microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k = 8∼10). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build TFBS (also known as DNA motif) models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement if choosing di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.
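The difference between mono-nucleotide and di-nucleotide modeling comes down to which features describe each k-mer. The following sketch, with hypothetical function names and a plain one-hot encoding chosen for illustration, enumerates all DNA k-mers and builds both feature sets. It is not any of the optimization methods compared in the study.

```python
from itertools import product

ALPHABET = "ACGT"
DINUCLEOTIDES = ["".join(p) for p in product(ALPHABET, repeat=2)]


def all_kmers(k):
    """Generate every DNA k-mer (4**k strings) in lexicographic order."""
    return ("".join(p) for p in product(ALPHABET, repeat=k))


def mono_features(kmer):
    """Positional one-hot encoding: 4 indicator features per position."""
    feats = []
    for base in kmer:
        feats.extend(1 if base == b else 0 for b in ALPHABET)
    return feats


def di_features(kmer):
    """One-hot encoding of adjacent pairs: 16 features per adjacent pair."""
    feats = []
    for i in range(len(kmer) - 1):
        pair = kmer[i:i + 2]
        feats.extend(1 if pair == d else 0 for d in DINUCLEOTIDES)
    return feats


if __name__ == "__main__":
    print(sum(1 for _ in all_kmers(8)))    # number of 8-mers
    print(len(mono_features("ACGTACGT")))  # mono-nucleotide feature count
    print(len(di_features("ACGTACGT")))    # di-nucleotide feature count
```

The di-nucleotide encoding has more parameters per k-mer, which is one plausible reason it can capture dependencies between neighbouring positions that a mono-nucleotide model cannot.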
Non-thiolate ligation of nickel by nucleotide-free UreG of Klebsiella aerogenes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martin-Diaconescu, Vlad; Joseph, Crisjoe A.; Boer, Jodi L.
Nickel-dependent ureases are activated by a multiprotein complex that includes the GTPase UreG. Prior studies showed that nucleotide-free UreG from Klebsiella aerogenes is monomeric and binds one nickel or zinc ion with near-equivalent affinity using an undefined binding site, whereas nucleotide-free UreG from Helicobacter pylori selectively binds one zinc ion per dimer via a universally conserved Cys-Pro-His motif in each protomer. Iodoacetamide-treated K. aerogenes UreG was nearly unaffected in nickel binding compared to non-treated sample, suggesting the absence of thiolate ligands to the metal. X-ray absorption spectroscopy of nickel-bound UreG showed the metal possessed four-coordinate geometry with all O/N donor ligands including one imidazole, thus confirming the absence of thiolate ligation. The nickel site in Strep-tag II-modified protein possessed six-coordinate geometry, again with all O/N donor ligands, but now including two or three imidazoles. An identical site was noted for the Strep-tag II-modified H74A variant, substituted in the Cys-Pro-His motif, ruling out coordination by this His residue. These results are consistent with metal binding to both His6 and a His residue of the fusion peptide in Strep-tagged K. aerogenes UreG. We conclude that the nickel- and zinc-binding site in nucleotide-free K. aerogenes UreG is distinct from that of nucleotide-free H. pylori UreG and does not involve the Cys-Pro-His motif. Further, we show the Strep-tag II can perturb metal coordination of this protein.
Gene Isolation Using Degenerate Primers Targeting Protein Motif: A Laboratory Exercise
ERIC Educational Resources Information Center
Yeo, Brandon Pei Hui; Foong, Lian Chee; Tam, Sheh May; Lee, Vivian; Hwang, Siaw San
2018-01-01
Structures and functions of protein motifs are widely included in many biology-based course syllabi. However, little emphasis is placed to link this knowledge to applications in biotechnology to enhance the learning experience. Here, the conserved motifs of nucleotide binding site-leucine rich repeats (NBS-LRR) proteins, successfully used for the…
Selection of the simplest RNA that binds isoleucine
LOZUPONE, CATHERINE; CHANGAYIL, SHANKAR; MAJERFELD, IRENE; YARUS, MICHAEL
2003-01-01
We have identified the simplest RNA binding site for isoleucine using selection-amplification (SELEX), by shrinking the size of the randomized region until affinity selection is extinguished. Such a protocol can be useful because selection does not necessarily make the simplest active motif most prominent, as is often assumed. We find an isoleucine binding site that behaves exactly as predicted for the site that requires fewest nucleotides. This UAUU motif (16 highly conserved positions; 27 total), is also the most abundant site in successful selections on short random tracts. The UAUU site, now isolated independently at least 63 times, is a small asymmetric internal loop. Conserved loop sequences include isoleucine codon and anticodon triplets, whose nucleotides are required for amino acid binding. This reproducible association between isoleucine and its coding sequences supports the idea that the genetic code is, at least in part, a stereochemical residue of the most easily isolated RNA–amino acid binding structures. PMID:14561881
Garcia, Fernando; Lopez, Francisco J; Cano, Carlos; Blanco, Armando
2009-01-01
Background Regulatory motifs describe sets of related transcription factor binding sites (TFBSs) and can be represented as position frequency matrices (PFMs). De novo identification of TFBSs is a crucial problem in computational biology which includes the issue of comparing putative motifs with one another and with motifs that are already known. The relative importance of each nucleotide within a given position in the PFMs should be considered in order to compute PFM similarities. Furthermore, biological data are inherently noisy and imprecise. Fuzzy set theory is particularly suitable for modeling imprecise data, whereas fuzzy integrals are highly appropriate for representing the interaction among different information sources. Results We propose FISim, a new similarity measure between PFMs, based on the fuzzy integral of the distance of the nucleotides with respect to the information content of the positions. Unlike existing methods, FISim is designed to consider the higher contribution of better conserved positions to the binding affinity. FISim provides excellent results when dealing with sets of randomly generated motifs, and outperforms the remaining methods when handling real datasets of related motifs. Furthermore, we propose a new cluster methodology based on kernel theory together with FISim to obtain groups of related motifs potentially bound by the same TFs, providing more robust results than existing approaches. Conclusion FISim corrects a design flaw of the most popular methods, whose measures favour similarity of low information content positions. We use our measure to successfully identify motifs that describe binding sites for the same TF and to solve real-life problems. In this study the reliability of fuzzy technology for motif comparison tasks is proven. PMID:19615102
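The information content of a PFM position, which FISim uses to weight better conserved positions more heavily, can be computed as below. This is only a sketch of the standard per-column information content under an assumed uniform background and without small-sample correction. It is not the FISim measure itself, and the function names are illustrative.

```python
import math


def column_information(counts):
    """Information content (bits) of one PFM column given A, C, G, T counts.

    Assumes a uniform background, so the maximum is log2(4) = 2 bits.
    """
    total = sum(counts)
    ic = 2.0
    for c in counts:
        if c > 0:  # a zero count contributes nothing (p * log2 p -> 0)
            p = c / total
            ic += p * math.log2(p)
    return ic


def pfm_information(pfm):
    """Per-position information content for a PFM given as a list of columns."""
    return [column_information(col) for col in pfm]


if __name__ == "__main__":
    # Three columns: fully conserved, two equally likely bases, uniform
    example_pfm = [[10, 0, 0, 0], [5, 5, 0, 0], [5, 5, 5, 5]]
    print(pfm_information(example_pfm))
```

A similarity measure that ignores these values treats an uninformative column the same as a fully conserved one, which is the design flaw the abstract attributes to the most popular methods.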
DOE Office of Scientific and Technical Information (OSTI.GOV)
Porebski, Przemyslaw J.; Klimecka, Maria; Chruszcz, Maksymilian
2012-07-11
Dethiobiotin synthetase (DTBS) is involved in the biosynthesis of biotin in bacteria, fungi, and plants. As humans lack this pathway, DTBS is a promising antimicrobial drug target. We determined structures of DTBS from Helicobacter pylori (hpDTBS) bound with cofactors and a substrate analog, and described its unique characteristics relative to other DTBS proteins. Comparison with bacterial DTBS orthologs revealed considerable structural differences in nucleotide recognition. The C-terminal region of DTBS proteins, which contains two nucleotide-recognition motifs, differs greatly among DTBS proteins from different species. The structure of hpDTBS revealed that this protein is unique and does not contain a C-terminal region containing one of the motifs. The single nucleotide-binding motif in hpDTBS is similar to its counterpart in GTPases; however, isothermal titration calorimetry binding studies showed that hpDTBS has a strong preference for ATP. The structural determinants of ATP specificity were assessed with X-ray crystallographic studies of hpDTBS-ATP and hpDTBS-GTP complexes. The unique mode of nucleotide recognition in hpDTBS makes this protein a good target for H. pylori-specific inhibitors of the biotin synthesis pathway.
Jia, Min; Li, Jianchao; Zhu, Jinwei; Wen, Wenyu; Zhang, Mingjie; Wang, Wenning
2012-01-01
GoLoco (GL) motif-containing proteins regulate G protein signaling by binding to Gα subunit and acting as guanine nucleotide dissociation inhibitors. GLs of LGN are also known to bind the GDP form of Gαi/o during asymmetric cell division. Here, we show that the C-terminal GL domain of LGN binds four molecules of Gαi·GDP. The crystal structures of Gαi·GDP in complex with LGN GL3 and GL4, respectively, reveal distinct GL/Gαi interaction features when compared with the only high resolution structure known with GL/Gαi interaction between RGS14 and Gαi1. Only a few residues C-terminal to the conserved GL sequence are required for LGN GLs to bind to Gαi·GDP. A highly conserved “double Arg finger” sequence (RΨ(D/E)(D/E)QR) is responsible for LGN GL to bind to GDP bound to Gαi. Together with the sequence alignment, we suggest that the LGN GL/Gαi interaction represents a general binding mode between GL motifs and Gαi. We also show that LGN GLs are potent guanine nucleotide dissociation inhibitors. PMID:22952234
Velagapudi, Sai Pradeep; Disney, Matthew D
2013-10-15
RNA is an extremely important target for the development of chemical probes of function or small molecule therapeutics. Aminoglycosides are the most well studied class of small molecules to target RNA. However, the RNA motifs outside of the bacterial rRNA A-site that are likely to be bound by these compounds in biological systems are largely unknown. If such information were known, it could allow for aminoglycosides to be exploited to target other RNAs and, in addition, could provide invaluable insights into potential bystander targets of these clinically used drugs. We utilized two-dimensional combinatorial screening (2DCS), a library-versus-library screening approach, to select the motifs displayed in a 3×3 nucleotide internal loop library and in a 6-nucleotide hairpin library that bind with high affinity and selectivity to six aminoglycoside derivatives. The selected RNA motifs were then analyzed using structure-activity relationships through sequencing (StARTS), a statistical approach that defines the privileged RNA motif space that binds a small molecule. StARTS allowed for the facile annotation of the selected RNA motif-aminoglycoside interactions in terms of affinity and selectivity. The interactions selected by 2DCS generally have nanomolar affinities, which is higher affinity than the binding of aminoglycosides to a mimic of their therapeutic target, the bacterial rRNA A-site.
He, Qiye; Johnston, Jeff; Zeitlinger, Julia
2014-01-01
Understanding how eukaryotic enhancers are bound and regulated by specific combinations of transcription factors is still a major challenge. To better map transcription factor binding genome-wide at nucleotide resolution in vivo, we have developed a robust ChIP-exo protocol called ChIP experiments with nucleotide resolution through exonuclease, unique barcode and single ligation (ChIP-nexus), which utilizes an efficient DNA self-circularization step during library preparation. Application of ChIP-nexus to four proteins—human TBP and Drosophila NFkB, Twist and Max— demonstrates that it outperforms existing ChIP protocols in resolution and specificity, pinpoints relevant binding sites within enhancers containing multiple binding motifs and allows the analysis of in vivo binding specificities. Notably, we show that Max frequently interacts with DNA sequences next to its motif, and that this binding pattern correlates with local DNA sequence features such as DNA shape. ChIP-nexus will be broadly applicable to studying in vivo transcription factor binding specificity and its relationship to cis-regulatory changes in humans and model organisms. PMID:25751057
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thomas, P.M.; Wohllk, N.; Huang, E.
1996-09-01
Familial persistent hyperinsulinemic hypoglycemia of infancy is a disorder of glucose homeostasis characterized by unregulated insulin secretion and profound hypoglycemia. Loss-of-function mutations in the second nucleotide-binding fold of the sulfonylurea receptor, a subunit of the pancreatic-islet β-cell ATP-dependent potassium channel, have been demonstrated to be causative for persistent hyperinsulinemic hypoglycemia of infancy. We now describe three additional mutations in the first nucleotide-binding fold of the sulfonylurea-receptor gene. One point mutation disrupts the highly conserved Walker A motif of the first nucleotide-binding-fold region. The other two mutations occur in noncoding sequences required for RNA processing and are predicted to disrupt the normal splicing pathway of the sulfonylurea-receptor mRNA precursor. These data suggest that both nucleotide-binding-fold regions of the sulfonylurea receptor are required for normal regulation of β-cell ATP-dependent potassium channel activity and insulin secretion. 32 refs., 4 figs., 1 tab.
Lu, Ruipeng; Mucaki, Eliseos J; Rogan, Peter K
2017-03-17
Data from ChIP-seq experiments can derive the genome-wide binding specificities of transcription factors (TFs) and other regulatory proteins. We analyzed 765 ENCODE ChIP-seq peak datasets of 207 human TFs with a novel motif discovery pipeline based on recursive, thresholded entropy minimization. This approach, while obviating the need to compensate for skewed nucleotide composition, distinguishes true binding motifs from noise, quantifies the strengths of individual binding sites based on computed affinity and detects adjacent cofactor binding sites that coordinate with the targets of primary, immunoprecipitated TFs. We obtained contiguous and bipartite information theory-based position weight matrices (iPWMs) for 93 sequence-specific TFs, discovered 23 cofactor motifs for 127 TFs and revealed six high-confidence novel motifs. The reliability and accuracy of these iPWMs were determined via four independent validation methods, including the detection of experimentally proven binding sites, explanation of effects of characterized SNPs, comparison with previously published motifs and statistical analyses. We also predict previously unreported TF coregulatory interactions (e.g. TF complexes). These iPWMs constitute a powerful tool for predicting the effects of sequence variants in known binding sites, performing mutation analysis on regulatory SNPs and predicting previously unrecognized binding sites and target genes.
Conserved binding of GCAC motifs by MEC-8, couch potato, and the RBPMS protein family
Soufari, Heddy
2017-01-01
Precise regulation of mRNA processing, translation, localization, and stability relies on specific interactions with RNA-binding proteins whose biological function and target preference are dictated by their preferred RNA motifs. The RBPMS family of RNA-binding proteins is defined by a conserved RNA recognition motif (RRM) domain found in metazoan RBPMS/Hermes and RBPMS2, Drosophila couch potato, and MEC-8 from Caenorhabditis elegans. In order to determine the parameters of RNA sequence recognition by the RBPMS family, we have first used the N-terminal domain from MEC-8 in binding assays and have demonstrated a preference for two GCAC motifs optimally separated by >6 nucleotides (nt). We have also determined the crystal structure of the dimeric N-terminal RRM domain from MEC-8 in the unbound form, and in complex with an oligonucleotide harboring two copies of the optimal GCAC motif. The atomic details reveal the molecular network that provides specificity to all four bases in the motif, including multiple hydrogen bonds to the initial guanine. Further studies with human RBPMS, as well as Drosophila couch potato, confirm a general preference for this double GCAC motif by other members of the protein family and the presence of this motif in known targets. PMID:28003515
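The spacing preference reported here (two GCAC motifs separated by more than 6 nucleotides) can be checked on a candidate RNA with a few lines of code. The sketch below is an illustration only: the function name is hypothetical, the gap is counted from the end of the first motif to the start of the second, and no upper limit on spacing is imposed because the abstract does not state one.

```python
def find_gcac_pairs(seq, min_gap=7):
    """Return (start1, start2, gap) for GCAC pairs separated by >= min_gap nt.

    min_gap=7 encodes the ">6 nucleotides" spacing; positions are 0-based.
    """
    seq = seq.upper()
    starts = [i for i in range(len(seq) - 3) if seq[i:i + 4] == "GCAC"]
    pairs = []
    for idx, first in enumerate(starts):
        for second in starts[idx + 1:]:
            gap = second - (first + 4)  # nucleotides between the two motifs
            if gap >= min_gap:
                pairs.append((first, second, gap))
    return pairs


if __name__ == "__main__":
    print(find_gcac_pairs("GCAC" + "U" * 7 + "GCAC"))
```

Because GCAC cannot overlap a second copy of itself, the computed gap is never negative.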
CircularLogo: A lightweight web application to visualize intra-motif dependencies.
Ye, Zhenqing; Ma, Tao; Kalmbach, Michael T; Dasari, Surendra; Kocher, Jean-Pierre A; Wang, Liguo
2017-05-22
The sequence logo has been widely used to represent DNA or RNA motifs for more than three decades. Despite its intelligibility and intuitiveness, the traditional sequence logo is unable to display the intra-motif dependencies and therefore is insufficient to fully characterize nucleotide motifs. Many methods have been developed to quantify the intra-motif dependencies, but fewer tools are available for visualization. We developed CircularLogo, a web-based interactive application, which is able to not only visualize the position-specific nucleotide consensus and diversity but also display the intra-motif dependencies. Applying CircularLogo to HNF6 binding sites and tRNA sequences demonstrated its ability to show intra-motif dependencies and intuitively reveal biomolecular structure. CircularLogo is implemented in JavaScript and Python based on the Django web framework. The program's source code and user's manual are freely available at http://circularlogo.sourceforge.net. The CircularLogo web server can be accessed from http://bioinformaticstools.mayo.edu/circularlogo/index.html. CircularLogo is an innovative web application that is specifically designed to visualize and interactively explore intra-motif dependencies.
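One common way to quantify a dependency between two motif positions is the mutual information of their nucleotides across aligned sites. The sketch below assumes that measure for illustration. The abstract does not say which statistic CircularLogo uses, and the function name is hypothetical.

```python
import math
from collections import Counter


def mutual_information(seqs, i, j):
    """Mutual information (bits) between columns i and j of aligned sequences.

    Uses plain observed frequencies with no pseudocounts or bias correction.
    """
    n = len(seqs)
    col_i = Counter(s[i] for s in seqs)
    col_j = Counter(s[j] for s in seqs)
    joint = Counter((s[i], s[j]) for s in seqs)
    mi = 0.0
    for (a, b), count in joint.items():
        p_ab = count / n
        mi += p_ab * math.log2(p_ab / ((col_i[a] / n) * (col_j[b] / n)))
    return mi


if __name__ == "__main__":
    coupled = ["AA", "CC", "GG", "TT"]      # position 2 always copies position 1
    independent = ["AA", "AC", "CA", "CC"]  # positions vary independently
    print(mutual_information(coupled, 0, 1))
    print(mutual_information(independent, 0, 1))
```

A traditional sequence logo would draw both example alignments from per-column frequencies alone and could not show that only the first has coupled positions.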
Sequence, Structure, and Context Preferences of Human RNA Binding Proteins.
Dominguez, Daniel; Freese, Peter; Alexis, Maria S; Su, Amanda; Hochman, Myles; Palden, Tsultrim; Bazile, Cassandra; Lambert, Nicole J; Van Nostrand, Eric L; Pratt, Gabriel A; Yeo, Gene W; Graveley, Brenton R; Burge, Christopher B
2018-06-07
RNA binding proteins (RBPs) orchestrate the production, processing, and function of mRNAs. Here, we present the affinity landscapes of 78 human RBPs using an unbiased assay that determines the sequence, structure, and context preferences of these proteins in vitro by deep sequencing of bound RNAs. These data enable construction of "RNA maps" of RBP activity without requiring crosslinking-based assays. We found an unexpectedly low diversity of RNA motifs, implying frequent convergence of binding specificity toward a relatively small set of RNA motifs, many with low compositional complexity. Offsetting this trend, however, we observed extensive preferences for contextual features distinct from short linear RNA motifs, including spaced "bipartite" motifs, biased flanking nucleotide composition, and bias away from or toward RNA structure. Our results emphasize the importance of contextual features in RNA recognition, which likely enable targeting of distinct subsets of transcripts by different RBPs that recognize the same linear motif.
SSMART: Sequence-structure motif identification for RNA-binding proteins.
Munteanu, Alina; Mukherjee, Neelanjan; Ohler, Uwe
2018-06-11
RNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in eukaryotic genomes, and each recognizes its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized. We developed SSMART, an RNA motif finder that simultaneously models the primary sequence and the structural properties of the RNA target sites. The sequence-structure motifs are represented as consensus strings over a degenerate alphabet, extending the IUPAC codes for nucleotides to account for secondary structure preferences. Evaluation on synthetic data showed that SSMART is able to recover both sequence and structure motifs implanted into 3'UTR-like sequences, for various degrees of structured/unstructured binding sites. In addition, we successfully used SSMART on high-throughput in vivo and in vitro data, showing that we not only recover the known sequence motif, but also gain insight into the structural preferences of the RBP. Availability: SSMART is freely available at https://ohlerlab.mdc-berlin.de/software/SSMART_137/. Supplementary data are available at Bioinformatics online.
Elengoe, Asita; Hamdan, Salehhuddin
2017-12-01
In this study, we explored the possibility of determining the synergistic interactions between nucleotide-binding domain (NBD) of Homo sapiens heat-shock 70 kDa protein (Hsp70) and E1A 32 kDa of adenovirus serotype 5 motif (PNLVP) in the efficiency of killing of tumor cells in cancer treatment. At present, the protein interaction between NBD and PNLVP motif is still unknown, but believed to enhance the rate of virus replication in tumor cells. Three mutant models (E229V, H225P and D230C) were built and simulated, and their interactions with PNLVP motif were studied. The PNLVP motif showed the binding energy and intermolecular energy values with the novel E229V mutant at -7.32 and -11.2 kcal/mol. The E229V mutant had the highest number of hydrogen bonds (7). Based on the root mean square deviation, root mean square fluctuation, hydrogen bonds, salt bridge, secondary structure, surface-accessible solvent area, potential energy and distance matrices analyses, it was proved that the E229V had the strongest and most stable interaction with the PNLVP motif among all the four protein-ligand complex structures. The knowledge of this protein-ligand complex model would help in designing Hsp70 structure-based drug for cancer therapy.
GIV/Girdin activates Gαi and inhibits Gαs via the same motif
Gupta, Vijay; Bhandari, Deepali; Leyme, Anthony; Aznar, Nicolas; Midde, Krishna K.; Lo, I-Chung; Ear, Jason; Niesman, Ingrid; López-Sánchez, Inmaculada; Blanco-Canosa, Juan Bautista; von Zastrow, Mark; Garcia-Marcos, Mikel; Farquhar, Marilyn G.; Ghosh, Pradipta
2016-01-01
We previously showed that guanine nucleotide-binding (G) protein α subunit (Gα)-interacting vesicle-associated protein (GIV), a guanine-nucleotide exchange factor (GEF), transactivates Gα activity-inhibiting polypeptide 1 (Gαi) proteins in response to growth factors, such as EGF, using a short C-terminal motif. Subsequent work demonstrated that GIV also binds Gαs and that inactive Gαs promotes maturation of endosomes and shuts down mitogenic MAPK–ERK1/2 signals from endosomes. However, the mechanism and consequences of dual coupling of GIV to two G proteins, Gαi and Gαs, remained unknown. Here we report that GIV is a bifunctional modulator of G proteins; it serves as a guanine nucleotide dissociation inhibitor (GDI) for Gαs using the same motif that allows it to serve as a GEF for Gαi. Upon EGF stimulation, GIV modulates Gαi and Gαs sequentially: first, a key phosphomodification favors the assembly of GIV–Gαi complexes and activates GIV’s GEF function; then a second phosphomodification terminates GIV’s GEF function, triggers the assembly of GIV–Gαs complexes, and activates GIV’s GDI function. By comparing WT and GIV mutants, we demonstrate that GIV inhibits Gαs activity in cells responding to EGF. Consequently, the cAMP→PKA→cAMP response element-binding protein signaling axis is inhibited, the transit time of EGF receptor through early endosomes is accelerated, mitogenic MAPK–ERK1/2 signals are rapidly terminated, and proliferation is suppressed. These insights define a paradigm in G-protein signaling in which a pleiotropically acting modulator uses the same motif both to activate and to inhibit G proteins. Our findings also illuminate how such modulation of two opposing Gα proteins integrates downstream signals and cellular responses. PMID:27621449
Velagapudi, Sai Pradeep; Disney, Matthew D.
2013-01-01
RNA is an extremely important target for the development of chemical probes of function or small molecule therapeutics. Aminoglycosides are the best-studied class of small molecules that target RNA. However, the RNA motifs outside of the bacterial rRNA A-site that are likely to be bound by these compounds in biological systems are largely unknown. If such information were known, it could allow for aminoglycosides to be exploited to target other RNAs and, in addition, could provide invaluable insights into potential bystander targets of these clinically used drugs. We utilized two-dimensional combinatorial screening (2DCS), a library-versus-library screening approach, to select the motifs displayed in a 3 × 3 nucleotide internal loop library and in a 6-nucleotide hairpin library that bind with high affinity and selectivity to six aminoglycoside derivatives. The selected RNA motifs were then analyzed using structure–activity relationships through sequencing (StARTS), a statistical approach that defines the privileged RNA motif space that binds a small molecule. StARTS allowed for the facile annotation of the selected RNA motif–aminoglycoside interactions in terms of affinity and selectivity. The interactions selected by 2DCS generally have nanomolar affinities, which is higher affinity than the binding of aminoglycosides to a mimic of their therapeutic target, the bacterial rRNA A-site. PMID:23719281
Finding specific RNA motifs: Function in a zeptomole world?
KNIGHT, ROB; YARUS, MICHAEL
2003-01-01
We have developed a new method for estimating the abundance of any modular (piecewise) RNA motif within a longer random region. We have used this method to estimate the size of the active motifs available to modern SELEX experiments (picomoles of unique sequences) and to a plausible RNA World (zeptomoles of unique sequences: 1 zmole = 602 sequences). Unexpectedly, activities such as specific isoleucine binding are almost certainly present in zeptomoles of molecules, and even ribozymes such as self-cleavage motifs may appear (depending on assumptions about the minimal structures). The number of specified nucleotides is not the only important determinant of a motif’s rarity: The number of modules into which it is divided, and the details of this division, are also crucial. We propose three maxims for easily isolated motifs: the Maxim of Minimization, the Maxim of Multiplicity, and the Maxim of the Median. These maxims together state that selected motifs should be small and composed of as many separate, equally sized modules as possible. For evenly divided motifs with four modules, the largest accessible activity in picomole scale (1–1000 pmole) pools of length 100 is about 34 nucleotides; while for zeptomole scale (1–1000 zmole) pools it is about 20 specific nucleotides (50% probability of occurrence). This latter figure includes some ribozymes and aptamers. Consequently, an RNA metabolism apparently could have begun with only zeptomoles of RNA molecules. PMID:12554865
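The conversion quoted in the abstract (1 zmole = 602 sequences) follows directly from Avogadro's number. The short sketch below reproduces only that unit conversion; the function name `molecules` is a hypothetical helper introduced here for illustration, and the authors' motif-abundance method itself is not implemented.

```python
# Illustrative unit conversion only; not the authors' abundance estimator.
AVOGADRO = 6.02214076e23  # molecules per mole (exact SI value)


def molecules(amount, prefix_exponent):
    """Number of molecules in `amount` units of 10**prefix_exponent mol.

    prefix_exponent is -21 for zeptomoles and -12 for picomoles.
    """
    return amount * 10.0 ** prefix_exponent * AVOGADRO


one_zmol = molecules(1, -21)   # about 602 molecules
one_pmol = molecules(1, -12)   # about 6.02e11 molecules

print(f"1 zmol  = {one_zmol:.0f} molecules")
print(f"1 pmol  = {one_pmol:.3e} molecules")
print(f"pmol/zmol ratio = {one_pmol / one_zmol:.0e}")
```

The nine orders of magnitude between the two pool sizes is what separates the picomole-scale SELEX pools from the zeptomole-scale pools discussed in the abstract.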
Karow, Anne R; Theissen, Bettina; Klostermeier, Dagmar
2007-01-01
RNA helicases mediate structural rearrangements of RNA or RNA-protein complexes at the expense of ATP hydrolysis. Members of the DEAD box helicase family consist of two flexibly connected helicase domains. They share nine conserved sequence motifs that are involved in nucleotide binding and hydrolysis, RNA binding, and helicase activity. Most of these motifs line the cleft between the two helicase domains, and extensive communication between them is required for RNA unwinding. The two helicase domains of the Bacillus subtilis RNA helicase YxiN were produced separately as intein fusions, and a functional RNA helicase was generated by expressed protein ligation. The ligated helicase binds adenine nucleotides with very similar affinities to the wild-type protein. Importantly, its intrinsically low ATPase activity is stimulated by RNA, and the Michaelis-Menten parameters are similar to those of the wild-type. Finally, ligated YxiN unwinds a minimal RNA substrate to an extent comparable to that of the wild-type helicase, confirming authentic interdomain communication.
Tomar, Navneet; Mishra, Akhilesh; Mrinal, Nirotpal; Jayaram, B.
2016-01-01
Transcription factors (TFs) bind at multiple sites in the genome and regulate expression of many genes. Regulating TF binding in a gene specific manner remains a formidable challenge in drug discovery because the same binding motif may be present at multiple locations in the genome. Here, we present Onco-Regulon (http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm), an integrated database of regulatory motifs of cancer genes clubbed with Unique Sequence-Predictor (USP) a software suite that identifies unique sequences for each of these regulatory DNA motifs at the specified position in the genome. USP works by extending a given DNA motif, in 5′→3′, 3′ →5′ or both directions by adding one nucleotide at each step, and calculates the frequency of each extended motif in the genome by Frequency Counter programme. This step is iterated till the frequency of the extended motif becomes unity in the genome. Thus, for each given motif, we get three possible unique sequences. Closest Sequence Finder program predicts off-target drug binding in the genome. Inclusion of DNA-Protein structural information further makes Onco-Regulon a highly informative repository for gene specific drug development. We believe that Onco-Regulon will help researchers to design drugs which will bind to an exclusive site in the genome with no off-target effects, theoretically. Database URL: http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm PMID:27515825
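The extension procedure described for USP (grow a motif one nucleotide at a time until it occurs exactly once in the genome) can be sketched as follows. This is a minimal illustration under stated assumptions, not the Onco-Regulon implementation: the function names are hypothetical, only one strand is searched, and the "genome" is a toy string.

```python
def count_occurrences(genome, motif):
    """Count overlapping occurrences of `motif` in `genome` (one strand only)."""
    count, start = 0, genome.find(motif)
    while start != -1:
        count += 1
        start = genome.find(motif, start + 1)
    return count


def extend_until_unique(genome, start, end, direction="3prime"):
    """Extend genome[start:end] one base per step until it occurs exactly once.

    direction: "3prime" (grow rightwards), "5prime" (grow leftwards) or "both".
    Returns the (start, end) slice bounds of the unique sequence, or None if
    the sequence cannot be extended any further and is still not unique.
    """
    while count_occurrences(genome, genome[start:end]) > 1:
        grew = False
        if direction in ("5prime", "both") and start > 0:
            start -= 1
            grew = True
        if direction in ("3prime", "both") and end < len(genome):
            end += 1
            grew = True
        if not grew:
            return None
    return start, end


toy_genome = "ACGTTAATCCGTAATGG"   # toy sequence; TAAT occurs twice
for direction in ("3prime", "5prime", "both"):
    s, e = extend_until_unique(toy_genome, 4, 8, direction)
    print(direction, toy_genome[s:e])
```

Running the three directions for one motif instance yields the three candidate unique sequences mentioned in the abstract.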
Mlýnský, Vojtěch; Bussi, Giovanni
2018-01-18
The function of RNA molecules usually depends on their overall fold and on the presence of specific structural motifs. Chemical probing methods are routinely used in combination with nearest-neighbor models to determine RNA secondary structure. Among the available methods, SHAPE is relevant due to its capability to probe all RNA nucleotides and the possibility to be used in vivo. However, the structural determinants for SHAPE reactivity and its mechanism of reaction are still unclear. Here molecular dynamics simulations and enhanced sampling techniques are used to predict the accessibility of nucleotide analogs and larger RNA structural motifs to SHAPE reagents. We show that local RNA reconformations are crucial in allowing reagents to reach the 2'-OH group of a particular nucleotide and that sugar pucker is a major structural factor influencing SHAPE reactivity.
Rigden, Daniel J.; Woodhead, Duncan D.; Wong, Prudence W. H.; Galperin, Michael Y.
2011-01-01
Binding of calcium ions (Ca2+) to proteins can have profound effects on their structure and function. Common roles of calcium binding include structure stabilization and regulation of activity. It is known that diverse families – EF-hands being one of at least twelve – use a Dx[DN]xDG linear motif to bind calcium in near-identical fashion. Here, four novel structural contexts for the motif are described. Existing experimental data for one of them, a thermophilic archaeal subtilisin, demonstrate for the first time a role for Dx[DN]xDG-bound calcium in protein folding. An integrin-like embedding of the motif in the blade of a β-propeller fold – here named the calcium blade – is discovered in structures of bacterial and fungal proteins. Furthermore, sensitive database searches suggest a common origin for the calcium blade in β-propeller structures of different sizes and a pan-kingdom distribution of these proteins. Factors favouring the multiple convergent evolution of the motif appear to include its general Asp-richness, the regular spacing of the Asp residues and the fact that change of Asp into Gly and vice versa can occur through a single nucleotide change. Among the known structural contexts for the Dx[DN]xDG motif, only the calcium blade and the EF-hand are currently found intracellularly in large numbers, perhaps because the higher extracellular concentration of Ca2+ allows for easier fixing of newly evolved motifs that have acquired useful functions. The analysis presented here will inform ongoing efforts toward prediction of similar calcium-binding motifs from sequence information alone. PMID:21720552
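The Dx[DN]xDG pattern is a linear motif in which x stands for any residue, so it maps directly onto a regular expression. The sketch below is an illustration only: the function name and the test sequence are made up, and a sequence match alone does not establish calcium binding.

```python
import re

# Dx[DN]xDG: "x" = any residue. The lookahead lets overlapping hits be reported.
CALCIUM_MOTIF = re.compile(r"(?=(D.[DN].DG))")


def find_calcium_motifs(protein_seq):
    """Return (0-based start, matched hexapeptide) for every Dx[DN]xDG hit."""
    return [(m.start(), m.group(1)) for m in CALCIUM_MOTIF.finditer(protein_seq)]


toy_protein = "AADKDGDGAADTNGDGK"   # invented sequence, not a real protein
for start, hit in find_calcium_motifs(toy_protein):
    print(start, hit)
```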
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fukumoto, Yasunori, E-mail: fukumoto@faculty.chiba-u.jp; Ikeuchi, Masayoshi; Nakayama, Yuji
The ATR-dependent DNA damage checkpoint is the major DNA damage checkpoint against UV irradiation and DNA replication stress. The Rad17–RFC and Rad9–Rad1–Hus1 (9–1–1) complexes interact with each other to contribute to ATR signaling; however, the precise regulatory mechanism of the interaction has not been established. Here, we identified a conserved sequence motif, KYxxL, in the AAA+ domain of Rad17 protein, and demonstrated that this motif is essential for the interaction with the 9–1–1 complex. We also show that UV-induced Rad17 phosphorylation is increased in the Rad17 KYxxL mutants. These data indicate that the interaction with the 9–1–1 complex is not required for Rad17 protein to be an efficient substrate for the UV-induced phosphorylation. Our data also raise the possibility that the 9–1–1 complex plays a negative regulatory role in the Rad17 phosphorylation. We also show that the nucleotide-binding activity of Rad17 is required for its nuclear localization. Highlights: • We have identified a conserved KYxxL motif in Rad17 protein. • The KYxxL motif is crucial for the interaction with the 9–1–1 complex. • The KYxxL motif is dispensable or inhibitory for UV-induced Rad17 phosphorylation. • Nucleotide binding of Rad17 is required for its nuclear localization.
Mira, Nuno P.; Henriques, Sílvia F.; Keller, Greg; Teixeira, Miguel C.; Matos, Rute G.; Arraiano, Cecília M.; Winge, Dennis R.; Sá-Correia, Isabel
2011-01-01
The transcription factor Haa1 is the main player in reprogramming yeast genomic expression in response to acetic acid stress. Mapping of the promoter region of one of the Haa1-activated genes, TPO3, allowed the identification of an acetic acid responsive element (ACRE) to which Haa1 binds in vivo. The in silico analysis of the promoter regions of the genes of the Haa1-regulon led to the identification of an Haa1-responsive element (HRE) 5′-GNN(G/C)(A/C)(A/G)G(A/G/C)G-3′. Using surface plasmon resonance experiments and electrophoretic mobility shift assays it is demonstrated that Haa1 interacts with high affinity (KD of 2 nM) with the HRE motif present in the ACRE region of TPO3 promoter. No significant interaction was found between Haa1 and HRE motifs having adenine nucleotides at positions 6 and 8 (KD of 396 and 6780 nM, respectively) suggesting that Haa1p does not recognize these motifs in vivo. A lower affinity of Haa1 toward HRE motifs having mutations in the guanine nucleotides at position 7 and 9 (KD of 21 and 119 nM, respectively) was also observed. Altogether, the results obtained indicate that the minimal functional binding site of Haa1 is 5′-(G/C)(A/C)GG(G/C)G-3′. The Haa1-dependent transcriptional regulatory network active in yeast response to acetic acid stress is proposed. PMID:21586585
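The bracketed consensus notation used for the HRE and for the minimal Haa1 site can be converted mechanically into regular expressions for scanning promoter sequences. The following is a minimal sketch; the helper names and the test sequence are hypothetical, only the given strand is scanned, and a pattern match says nothing about binding affinity.

```python
import re


def consensus_to_regex(consensus):
    """Turn notation like 'GNN(G/C)(A/C)' into the regex 'G[ACGT][ACGT][GC][AC]'."""
    pattern = re.sub(
        r"\(([ACGT/]+)\)",
        lambda m: "[" + m.group(1).replace("/", "") + "]",
        consensus,
    )
    return pattern.replace("N", "[ACGT]")


def scan(sequence, consensus):
    """Return (0-based start, match) for every, possibly overlapping, hit."""
    regex = re.compile("(?=(" + consensus_to_regex(consensus) + "))")
    return [(m.start(), m.group(1)) for m in regex.finditer(sequence)]


HRE = "GNN(G/C)(A/C)(A/G)G(A/G/C)G"      # consensus quoted in the abstract
MINIMAL_SITE = "(G/C)(A/C)GG(G/C)G"      # minimal functional site from the abstract

toy_promoter = "TTGAAGAGGGGTT"           # invented sequence for illustration
print(scan(toy_promoter, HRE))
print(scan(toy_promoter, MINIMAL_SITE))
```

Note that the minimal site is the HRE positions 4 to 9 with adenine excluded at positions 6 and 8, which matches the low affinities reported for those substitutions.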
Overlapping ETS and CRE Motifs (G/CCGGAAGTGACGTCA) Preferentially Bound by GABPα and CREB Proteins
Chatterjee, Raghunath; Zhao, Jianfei; He, Ximiao; Shlyakhtenko, Andrey; Mann, Ishminder; Waterfall, Joshua J.; Meltzer, Paul; Sathyanarayana, B. K.; FitzGerald, Peter C.; Vinson, Charles
2012-01-01
Previously, we identified 8-bps long DNA sequences (8-mers) that localize in human proximal promoters and grouped them into known transcription factor binding sites (TFBS). We now examine split 8-mers consisting of two 4-mers separated by 1-bp to 30-bps (X4-N1-30-X4) to identify pairs of TFBS that localize in proximal promoters at a precise distance. These include two overlapping TFBS: the ETS⇔ETS motif (C/GCCGGAAGCGGAA) and the ETS⇔CRE motif (C/GCGGAAGTGACGTCAC). The nucleotides in bold are part of both TFBS. Molecular modeling shows that the ETS⇔CRE motif can be bound simultaneously by both the ETS and the B-ZIP domains without protein-protein clashes. The electrophoretic mobility shift assay (EMSA) shows that the ETS protein GABPα and the B-ZIP protein CREB preferentially bind to the ETS⇔CRE motif only when the two TFBS overlap precisely. In contrast, the ETS domain of ETV5 and CREB interfere with each other for binding the ETS⇔CRE. The 11-mer (CGGAAGTGACG), the conserved part of the ETS⇔CRE motif, occurs 226 times in the human genome and 83% are in known regulatory regions. In vivo GABPα and CREB ChIP-seq peaks identified the ETS⇔CRE as the most enriched motif occurring in promoters of genes involved in mRNA processing, cellular catabolic processes, and stress response, suggesting that a specific class of genes is regulated by this composite motif. PMID:23050235
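The split 8-mer search (two 4-mers separated by 1 to 30 bp, X4-N1-30-X4) can be illustrated with a small scan that records each left half-site and the gap to a matching right half-site. This is a sketch under assumptions: the function name is hypothetical, the example half-sites and sequence are chosen here for illustration, and the enrichment statistics used in the study are not reproduced.

```python
def find_split_kmers(seq, left, right, min_gap=1, max_gap=30):
    """Find `left`, then `right` after a gap of min_gap..max_gap bases.

    Returns a list of (0-based start of `left`, gap length) tuples.
    """
    hits = []
    pos = seq.find(left)
    while pos != -1:
        for gap in range(min_gap, max_gap + 1):
            r = pos + len(left) + gap
            if seq[r:r + len(right)] == right:
                hits.append((pos, gap))
        pos = seq.find(left, pos + 1)
    return hits


# Toy sequence containing the 11-mer CGGAAGTGACG quoted in the abstract.
toy_seq = "AACGGAAGTGACGTCATT"
print(find_split_kmers(toy_seq, "GGAA", "ACGT"))
```

Tabulating the gap lengths over many promoters would show whether a pair of half-sites prefers a precise spacing, which is the kind of signal the abstract describes.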
Inokuchi, Shota; Yamashita, Yasuhiro; Nishimura, Kazuma; Nakanishi, Hiroaki; Saito, Kazuyuki
2017-11-01
Phenomena known as null alleles and peak imbalance can occur because of mutations in the primer binding sites used for DNA typing. In these cases, an accurate statistical evaluation of DNA typing is difficult. The estimated likelihood ratio is incorrectly calculated because of the null allele and allele dropout caused by mutation-induced peak imbalance. Although a number of studies have attempted to uncover examples of these phenomena, few reports are available on the human identification kit manufactured by Qiagen. In this study, 196 Japanese individuals who were heterozygous at D2S1360 were genotyped using an Investigator HDplex Kit with optimal amounts of DNA. A peak imbalance was frequently observed at the D2S1360 locus. We performed a sequencing analysis of the area surrounding the D2S1360 repeat motif to identify the cause for peak imbalance. A point mutation (G>A transition) 136 nucleotides upstream from the D2S1360 repeat motif was discovered in a number of samples. The allele frequency of the mutation was 0.0566 in the Japanese population. Therefore, human identification or kinship testing using the Investigator HDplex Kit requires caution because of the higher frequency of single nucleotide polymorphisms at the primer binding site of D2S1360 locus in the Japanese population.
BlockLogo: visualization of peptide and sequence motif conservation
Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian; Sun, Jing; Schönbach, Christian; Reinherz, Ellis L.; Zhang, Guang Lan; Brusic, Vladimir
2013-01-01
BlockLogo is a web-server application for visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, selection of motif positions, type of sequence, and output format definition. The output has BlockLogo along with the sequence logo, and a table of motif frequencies. We deployed BlockLogo as an online application and have demonstrated its utility through examples that show visualization of T-cell epitopes and B-cell epitopes (both continuous and discontinuous). Our additional example shows a visualization and analysis of structural motifs that determine specificity of peptide binding to HLA-DR molecules. The BlockLogo server also employs selected experimentally validated prediction algorithms to enable on-the-fly prediction of MHC binding affinity to 15 common HLA class I and class II alleles as well as visual analysis of discontinuous epitopes from multiple sequence alignments. It enables the visualization and analysis of structural and functional motifs that are usually described as regular expressions. It provides a compact view of discontinuous motifs composed of distant positions within biological sequences. BlockLogo is available at: http://research4.dfci.harvard.edu/cvc/blocklogo/ and http://methilab.bu.edu/blocklogo/ PMID:24001880
Velagapudi, Sai Pradeep; Seedhouse, Steven J.; French, Jonathan
2011-01-01
RNA is an important therapeutic target; however, RNA targets are generally underexploited due to a lack of understanding of the small molecules that bind RNA and the RNA motifs that bind small molecules. Herein, we describe the identification of the RNA internal loops derived from a 4096-member 3×3 nucleotide loop library that are the most specific and highest affinity binders to a series of four designer, drug-like benzimidazoles. These studies establish a potentially general protocol to define the highest affinity and most specific RNA motif targets for heterocyclic small molecules. Such information could be used to target functionally important RNAs in genomic sequence. PMID:21604752
Robart, Aaron R; O'Connor, Catherine M; Collins, Kathleen
2010-03-01
Telomerase adds simple-sequence repeats to chromosome 3' ends to compensate for the loss of repeats with each round of genome replication. To accomplish this de novo DNA synthesis, telomerase uses a template within its integral RNA component. In addition to providing the template, the telomerase RNA subunit (TER) also harbors nontemplate motifs that contribute to the specialized telomerase catalytic cycle of reiterative repeat synthesis. Most nontemplate TER motifs function through linkage with the template, but in ciliate and vertebrate telomerases, a stem-loop motif binds telomerase reverse transcriptase (TERT) and reconstitutes full activity of the minimal recombinant TERT+TER RNP, even when physically separated from the template. Here, we resolve the functional requirements for this motif of ciliate TER in physiological RNP context using the Tetrahymena thermophila p65-TER-TERT core RNP reconstituted in vitro and the holoenzyme reconstituted in vivo. Contrary to expectation based on assays of the minimal recombinant RNP, we find that none of a panel of individual loop IV nucleotide substitutions impacts the profile of telomerase product synthesis when reconstituted as physiological core RNP or holoenzyme RNP. However, loop IV nucleotide substitutions do variably reduce assembly of TERT with the p65-TER complex in vitro and reduce the accumulation and stability of telomerase RNP in endogenous holoenzyme context. Our results point to a unifying model of a conformational activation role for this TER motif in the telomerase RNP enzyme.
Survey of protein–DNA interactions in Aspergillus oryzae on a genomic scale
Wang, Chao; Lv, Yangyong; Wang, Bin; Yin, Chao; Lin, Ying; Pan, Li
2015-01-01
The genome-scale delineation of in vivo protein–DNA interactions is key to understanding genome function. Only ∼5% of transcription factors (TFs) in the Aspergillus genus have been identified using traditional methods. Although the Aspergillus oryzae genome contains >600 TFs, knowledge of the in vivo genome-wide TF-binding sites (TFBSs) in aspergilli remains limited because of the lack of high-quality antibodies. We investigated the landscape of in vivo protein–DNA interactions across the A. oryzae genome through coupling the DNase I digestion of intact nuclei with massively parallel sequencing and the analysis of cleavage patterns in protein–DNA interactions at single-nucleotide resolution. The resulting map identified overrepresented de novo TF-binding motifs from genomic footprints, and provided the detailed chromatin remodeling patterns and the distribution of digital footprints near transcription start sites. The TFBSs of 19 known Aspergillus TFs were also identified based on DNase I digestion data surrounding potential binding sites in conjunction with TF binding specificity information. We observed that the cleavage patterns of TFBSs were dependent on the orientation of TF motifs and independent of strand orientation, consistent with the DNA shape features of binding motifs with flanking sequences. PMID:25883143
Zeng, Danyun; Shen, Qingliang; Cho, Jae-Hyun
2017-02-26
Biological functions of intrinsically disordered proteins (IDPs) and proteins containing intrinsically disordered regions (IDRs) are often mediated by short linear motifs, such as proline-rich motifs (PRMs). Upon binding to their target proteins, IDPs undergo a disorder-to-order transition, which is accompanied by a large conformational entropy penalty. Hence, the molecular mechanisms underlying control of conformational entropy are critical for understanding the binding affinity and selectivity of IDP-mediated protein-protein interactions (PPIs). Here, we investigated the backbone conformational entropy change accompanying binding of the N-terminal SH3 domain (nSH3) of CrkII and a PRM derived from guanine nucleotide exchange factor 1 (C3G). In particular, we focused on the estimation of the conformational entropy change of the disordered PRM upon binding to the nSH3 domain. Quantitative characterization of conformational dynamics of disordered peptides like PRMs is limited. Hence, we combined various methods, including NMR model-free analysis, δ2D, DynaMine, and structure-based calculation of entropy loss. This study demonstrates that the contribution of backbone conformational entropy change is significant in the PPIs mediated by IDPs/IDRs. Copyright © 2017 Elsevier Inc. All rights reserved.
A G-Quadruplex-Containing RNA Activates Fluorescence in a GFP-Like Fluorophore
Huang, Hao; Suslov, Nikolai B.; Li, Nan-Sheng; Shelke, Sandip A.; Evans, Molly E.; Koldobskaya, Yelena; Rice, Phoebe A.; Piccirilli, Joseph A.
2014-01-01
Spinach is an in vitro selected RNA aptamer that binds a GFP-like ligand and activates its green fluorescence. Spinach is thus an RNA analog of GFP, and has potentially widespread applications for in vivo labeling and imaging. We used antibody-assisted crystallography to determine the structures of Spinach both with and without bound fluorophore at 2.2 and 2.4 Å resolution, respectively. Spinach RNA has an elongated structure containing two helical domains separated by an internal bulge that folds into a G-quadruplex motif of unusual topology. The G-quadruplex motif and adjacent nucleotides comprise a partially pre-formed binding site for the fluorophore. The fluorophore binds in a planar conformation and makes extensive aromatic stacking and hydrogen bond interactions with the RNA. Our findings provide a foundation for structure-based engineering of new fluorophore-binding RNA aptamers. PMID:24952597
Milenkovic, Stefan; Bondar, Ana-Nicoleta
2016-02-01
SecA uses the energy yielded by the binding and hydrolysis of adenosine triphosphate (ATP) to push secretory pre-proteins across the plasma membrane in bacteria. Hydrolysis of ATP occurs at the nucleotide-binding site, which contains the conserved carboxylate groups of the DEAD-box helicases. Although crystal structures provide valuable snapshots of SecA along its reaction cycle, the mechanism that ensures conformational coupling between the nucleotide-binding site and the other domains of SecA remains unclear. The observation that SecA contains numerous hydrogen-bonding groups raises important questions about the role of hydrogen-bonding networks and hydrogen-bond dynamics in long-distance conformational couplings. To address these questions, we explored the molecular dynamics of SecA from three different organisms, with and without bound nucleotide, in water. By computing two-dimensional hydrogen-bonding maps we identify networks of hydrogen bonds that connect the nucleotide-binding site to remote regions of the protein, and sites in the protein that respond to specific perturbations. We find that the nucleotide-binding site of ADP-bound SecA has a preferred geometry whereby the first two carboxylates of the DEAD motif bridge via hydrogen-bonding water. Simulations of a mutant with perturbed ATP hydrolysis highlight the water-bridged geometry as a key structural element of the reaction path. Copyright © 2015. Published by Elsevier B.V.
Guilfoyle, Amy P; Deshpande, Chandrika N; Schenk, Gerhard; Maher, Megan J; Jormakka, Mika
2014-12-12
GDP release from GTPases is usually extremely slow and is in general assisted by external factors, such as association with guanine exchange factors or membrane-embedded GPCRs (G protein-coupled receptors), which accelerate the release of GDP by several orders of magnitude. Intrinsic factors can also play a significant role; a single amino acid substitution in one of the guanine nucleotide recognition motifs, G5, results in a drastically altered GDP release rate, indicating that the sequence composition of this motif plays an important role in spontaneous GDP release. In the present study, we used the GTPase domain from EcNFeoB (Escherichia coli FeoB) as a model and applied biochemical and structural approaches to evaluate the role of all the individual residues in the G5 loop. Our study confirms that several of the residues in the G5 motif have an important role in the intrinsic affinity and release of GDP. In particular, a T151A mutant (third residue of the G5 loop) leads to a reduced nucleotide affinity and provokes a drastically accelerated dissociation of GDP.
Cyr, Normand; de la Fuente, Cynthia; Lecoq, Lauriane; Guendel, Irene; Chabot, Philippe R.; Kehn-Hall, Kylene; Omichinski, James G.
2015-01-01
Rift Valley fever virus (RVFV) is a single-stranded RNA virus capable of inducing fatal hemorrhagic fever in humans. A key component of RVFV virulence is its ability to form nuclear filaments through interactions between the viral nonstructural protein NSs and the host general transcription factor TFIIH. Here, we identify an interaction between a ΩXaV motif in NSs and the p62 subunit of TFIIH. This motif in NSs is similar to ΩXaV motifs found in nucleotide excision repair (NER) factors and transcription factors known to interact with p62. Structural and biophysical studies demonstrate that NSs binds to p62 in a similar manner as these other factors. Functional studies in RVFV-infected cells show that the ΩXaV motif is required for both nuclear filament formation and degradation of p62. Consistent with the fact that the RVFV can be distinguished from other Bunyaviridae-family viruses due to its ability to form nuclear filaments in infected cells, the motif is absent in the NSs proteins of other Bunyaviridae-family viruses. Taken together, our studies demonstrate that p62 binding to NSs through the ΩXaV motif is essential for degrading p62, forming nuclear filaments and enhancing RVFV virulence. In addition, these results show how the RVFV incorporates a simple motif into the NSs protein that enables it to functionally mimic host cell proteins that bind the p62 subunit of TFIIH. PMID:25918396
Canard, Bruno
2018-01-01
Viral RNA-dependent RNA polymerases (RdRps) play a central role not only in viral replication, but also in the genetic evolution of viral RNAs. After binding to an RNA template and selecting 5′-triphosphate ribonucleosides, viral RdRps synthesize an RNA copy according to Watson-Crick base-pairing rules. The copy process sometimes deviates from both the base-pairing rules specified by the template and the natural ribose selectivity and, thus, the process is error-prone due to the intrinsic (in)fidelity of viral RdRps. These enzymes share a number of conserved amino-acid sequence strings, called motifs A–G, which can be defined from a structural and functional point-of-view. A co-relation is gradually emerging between mutations in these motifs and viral genome evolution or observed mutation rates. Here, we review our current knowledge on these motifs and their role on the structural and mechanistic basis of the fidelity of nucleotide selection and RNA synthesis by Flavivirus RdRps. PMID:29385764
RNA 3D Structural Motifs: Definition, Identification, Annotation, and Database Searching
NASA Astrophysics Data System (ADS)
Nasalean, Lorena; Stombaugh, Jesse; Zirbel, Craig L.; Leontis, Neocles B.
Structured RNA molecules resemble proteins in the hierarchical organization of their global structures, folding and broad range of functions. Structured RNAs are composed of recurrent modular motifs that play specific functional roles. Some motifs direct the folding of the RNA or stabilize the folded structure through tertiary interactions. Others bind ligands or proteins or catalyze chemical reactions. Therefore, it is desirable, starting from the RNA sequence, to be able to predict the locations of recurrent motifs in RNA molecules. Conversely, the potential occurrence of one or more known 3D RNA motifs may indicate that a genomic sequence codes for a structured RNA molecule. To identify known RNA structural motifs in new RNA sequences, precise structure-based definitions are needed that specify the core nucleotides of each motif and their conserved interactions. By comparing instances of each recurrent motif and applying base pair isostericity relations, one can identify neutral mutations that preserve its structure and function in the contexts in which it occurs.
PCR Cloning of Partial "nbs" Sequences from Grape ("Vitis aestivalis" Michx)
ERIC Educational Resources Information Center
Chang, Ming-Mei; DiGennaro, Peter; Macula, Anthony
2009-01-01
Plants defend themselves against pathogens via the expressions of disease resistance (R) genes. Many plant R gene products contain the characteristic nucleotide-binding site (NBS) and leucine-rich repeat (LRR) domains. There are highly conserved motifs within the NBS domain which could be targeted for polymerase chain reaction (PCR) cloning of R…
Shepard, A R; Zhang, W; Eberhardt, N L
1994-01-21
We established the cis-acting elements which mediate cAMP responsiveness of the human growth hormone (hGH) gene in transiently transfected rat anterior pituitary tumor GC cells. Analysis of the intact hGH gene or hGH 5'-flanking DNA (5'-FR) coupled to the hGH cDNA or chloramphenicol acetyltransferase or luciferase genes, indicated that cAMP primarily stimulated hGH promoter activity. Cotransfection of a protein kinase A inhibitory protein cDNA demonstrated that the cAMP response was mediated by protein kinase A. Mutational analysis of the hGH promoter identified two core cAMP response element motifs (CGTCA) located at nucleotides -187/-183 (distal cAMP response element; dCRE) and -99/-95 (proximal cAMP response element; pCRE) and a pituitary-specific transcription factor (GHF1/Pit1) binding site at nucleotides -123/-112 (dGHF1) which were required for cAMP responsiveness. GHF1 was not a limiting factor, since overexpression of GHF1 in cotransfections increased basal but not forskolin induction levels. Gel shift analyses indicated that similar, ubiquitous, thermostable protein(s) specifically bound the pCRE and dCRE motifs. The CGTCA motif-binding factors were cAMP response element binding protein (CREB)/activating transcription factor-1 (ATF-1)-related, since the DNA-protein complex was competed by unlabeled CREB consensus oligonucleotide, specifically supershifted by antisera to CREB and ATF-1 but not ATF-2, and was bound by purified CREB with the same relative binding affinity (pCRE < dCRE < CREB) and mobility as the GC nuclear extract. UV cross-linking and Southwestern blot analyses revealed multiple DNA-protein interactions of which approximately 100- and approximately 45-kDa proteins were predominant; the approximately 45-kDa protein may represent CREB. These results indicate that CREB/ATF-1-related factors act coordinately with the cell-specific factor GHF1 to mediate cAMP-dependent regulation of hGH-1 gene transcription in anterior pituitary somatotrophs.
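As a rough illustration of how core CRE pentamers such as CGTCA could be located in a promoter sequence, the sketch below scans both strands with plain string matching. The function names and the test sequence are hypothetical; the sequence is made up and is not the hGH promoter, and this is not the mutational analysis used in the study.

```python
# Sketch: locate core CRE pentamers (CGTCA) on either strand of a DNA sequence.
# Assumptions: plain string matching, 0-based offsets on the given strand, and a
# made-up test sequence (not the hGH promoter, not the authors' method).

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_motif_both_strands(seq: str, motif: str = "CGTCA"):
    """Return sorted (offset, strand) pairs for every occurrence of the motif.

    A '-' hit means the reverse complement of the motif was found at that
    offset of the given (top) strand.
    """
    seq = seq.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(pattern, start + 1)
    return sorted(hits)


hits = find_motif_both_strands("AACGTCATTTGACGAA")
```

Converting offsets to promoter coordinates such as -187/-183 would additionally require knowing where the transcription start site lies in the scanned sequence, which is left out here.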
Boussardon, Clément; Avon, Alexandra; Kindgren, Peter; Bond, Charles S; Challenor, Michael; Lurin, Claire; Small, Ian
2014-09-01
In flowering plants, RNA editing involves deamination of specific cytidines to uridines in both mitochondrial and chloroplast transcripts. Pentatricopeptide repeat (PPR) proteins and multiple organellar RNA editing factor (MORF) proteins have been shown to be involved in RNA editing but none have been shown to possess cytidine deaminase activity. The DYW domain of some PPR proteins contains a highly conserved signature resembling the zinc-binding active site motif of known nucleotide deaminases. We modified these highly conserved amino acids in the DYW motif of DYW1, an editing factor required for editing of the ndhD-1 site in Arabidopsis chloroplasts. We demonstrate that several amino acids of this signature motif are required for RNA editing in vivo and for zinc binding in vitro. We conclude that the DYW domain of DYW1 has features in common with cytidine deaminases, reinforcing the hypothesis that this domain forms part of the active enzyme that carries out RNA editing in plants. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.
Yang, Lingna; Wang, Chongyuan; Li, Fudong; Zhang, Jiahai; Nayab, Anam; Wu, Jihui; Shi, Yunyu; Gong, Qingguo
2017-09-29
MEX-3 is a K-homology (KH) domain-containing RNA-binding protein first identified as a translational repressor in Caenorhabditis elegans, and its four orthologs (MEX-3A-D) in human and mouse were subsequently found to have E3 ubiquitin ligase activity mediated by a RING domain and critical for RNA degradation. Current evidence implicates human MEX-3C in many essential biological processes and suggests a strong connection with immune diseases and carcinogenesis. The highly conserved dual KH domains in MEX-3 proteins enable RNA binding and are essential for the recognition of the 3'-UTR and post-transcriptional regulation of MEX-3 target transcripts. However, the molecular mechanisms of translational repression and the consensus RNA sequence recognized by the MEX-3C KH domain are unknown. Here, using X-ray crystallography and isothermal titration calorimetry, we investigated the RNA-binding activity and selectivity of human MEX-3C dual KH domains. Our high-resolution crystal structures of individual KH domains complexed with a noncanonical U-rich and a GA-rich RNA sequence revealed that the KH1/2 domains of human MEX-3C bound MRE10, a 10-mer RNA (5'-CAGAGUUUAG-3') consisting of an eight-nucleotide MEX-3-recognition element (MRE) motif, with high affinity. Of note, we also identified a consensus RNA motif recognized by human MEX-3C. The potential RNA-binding sites in the 3'-UTR of the human leukocyte antigen serotype (HLA-A2) mRNA were mapped with this RNA-binding motif and further confirmed by fluorescence polarization. The binding motif identified here will provide valuable information for future investigations of the functional pathways controlled by human MEX-3C and for predicting potential mRNAs regulated by this enzyme. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Hohl, Michael; Hürlimann, Lea M; Böhm, Simon; Schöppe, Jendrik; Grütter, Markus G; Bordignon, Enrica; Seeger, Markus A
2014-07-29
ATP binding cassette (ABC) transporters mediate vital transport processes in every living cell. ATP hydrolysis, which fuels transport, displays positive cooperativity in numerous ABC transporters. In particular, heterodimeric ABC exporters exhibit pronounced allosteric coupling between a catalytically impaired degenerate site, where nucleotides bind tightly, and a consensus site, at which ATP is hydrolyzed in every transport cycle. Whereas the functional phenomenon of cooperativity is well described, its structural basis remains poorly understood. Here, we present the apo structure of the heterodimeric ABC exporter TM287/288 and compare it to the previously solved structure with adenosine 5'-(β,γ-imido)triphosphate (AMP-PNP) bound at the degenerate site. In contrast to other ABC exporter structures, the nucleotide binding domains (NBDs) of TM287/288 remain in molecular contact even in the absence of nucleotides, and the arrangement of the transmembrane domains (TMDs) is not influenced by AMP-PNP binding, a notion confirmed by double electron-electron resonance (DEER) measurements. Nucleotide binding at the degenerate site results in structural rearrangements, which are transmitted to the consensus site via two D-loops located at the NBD interface. These loops owe their name from a highly conserved aspartate and are directly connected to the catalytically important Walker B motif. The D-loop at the degenerate site ties the NBDs together even in the absence of nucleotides and substitution of its aspartate by alanine is well-tolerated. By contrast, the D-loop of the consensus site is flexible and the aspartate to alanine mutation and conformational restriction by cross-linking strongly reduces ATP hydrolysis and substrate transport.
Limitations and potentials of current motif discovery algorithms
Hu, Jianjun; Li, Bin; Kihara, Daisuke
2005-01-01
Computational methods for de novo identification of gene regulation elements, such as transcription factor binding sites, have proved to be useful for deciphering genetic regulatory networks. However, despite the availability of a large number of algorithms, their strengths and weaknesses are not sufficiently understood. Here, we designed a comprehensive set of performance measures and benchmarked five modern sequence-based motif discovery algorithms using large datasets generated from Escherichia coli RegulonDB. Factors that affect the prediction accuracy, scalability and reliability are characterized. It is revealed that the nucleotide and the binding site level accuracy are very low, while the motif level accuracy is relatively high, which indicates that the algorithms can usually capture at least one correct motif in an input sequence. To exploit diverse predictions from multiple runs of one or more algorithms, a consensus ensemble algorithm has been developed, which achieved 6–45% improvement over the base algorithms by increasing both the sensitivity and specificity. Our study illustrates limitations and potentials of existing sequence-based motif discovery algorithms. Taking advantage of the revealed potentials, several promising directions for further improvements are discussed. Since the sequence-based algorithms are the baseline of most of the modern motif discovery algorithms, this paper suggests substantial improvements would be possible for them. PMID:16284194
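To make the idea of nucleotide-level accuracy concrete, the sketch below scores predicted binding sites against annotated ones position by position. The definitions used (sensitivity = TP / annotated positions, positive predictive value = TP / predicted positions) are common conventions and are an assumption here; the abstract does not state the exact formulas of the benchmark, and the function names and intervals are hypothetical.

```python
# Sketch: nucleotide-level sensitivity and positive predictive value (PPV)
# for predicted binding sites versus annotated sites on one sequence.
# Assumptions: sites are half-open (start, end) intervals; the metric
# definitions are common conventions, not necessarily those of the paper.


def site_positions(sites):
    """Expand half-open (start, end) intervals into a set of positions."""
    positions = set()
    for start, end in sites:
        positions.update(range(start, end))
    return positions


def nucleotide_level_scores(true_sites, predicted_sites):
    """Return (sensitivity, ppv) computed over individual nucleotides."""
    truth = site_positions(true_sites)
    pred = site_positions(predicted_sites)
    tp = len(truth & pred)  # nucleotides that are both annotated and predicted
    sensitivity = tp / len(truth) if truth else 0.0
    ppv = tp / len(pred) if pred else 0.0
    return sensitivity, ppv


# One annotated site covering positions 10..14, one prediction covering 12..17.
sens, ppv = nucleotide_level_scores([(10, 15)], [(12, 18)])
```

In this toy case three of the five annotated nucleotides are recovered and three of the six predicted nucleotides are correct, which shows how a prediction can overlap a true site and still score poorly at the nucleotide level.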
Interactions between the R2R3-MYB Transcription Factor, AtMYB61, and Target DNA Binding Sites
Prouse, Michael B.; Campbell, Malcolm M.
2013-01-01
Despite the prominent roles played by R2R3-MYB transcription factors in the regulation of plant gene expression, little is known about the details of how these proteins interact with their DNA targets. For example, while Arabidopsis thaliana R2R3-MYB protein AtMYB61 is known to alter transcript abundance of a specific set of target genes, little is known about the specific DNA sequences to which AtMYB61 binds. To address this gap in knowledge, DNA sequences bound by AtMYB61 were identified using cyclic amplification and selection of targets (CASTing). The DNA targets identified using this approach corresponded to AC elements, sequences enriched in adenosine and cytosine nucleotides. The preferred target sequence that bound with the greatest affinity to AtMYB61 recombinant protein was ACCTAC, the AC-I element. Mutational analyses based on the AC-I element showed that ACC nucleotides in the AC-I element served as the core recognition motif, critical for AtMYB61 binding. Molecular modelling predicted interactions between AtMYB61 amino acid residues and corresponding nucleotides in the DNA targets. The affinity between AtMYB61 and specific target DNA sequences did not correlate with AtMYB61-driven transcriptional activation with each of the target sequences. CASTing-selected motifs were found in the regulatory regions of genes previously shown to be regulated by AtMYB61. Taken together, these findings are consistent with the hypothesis that AtMYB61 regulates transcription from specific cis-acting AC elements in vivo. The results shed light on the specifics of DNA binding by an important family of plant-specific transcriptional regulators. PMID:23741471
A G-quadruplex-containing RNA activates fluorescence in a GFP-like fluorophore
DOE Office of Scientific and Technical Information (OSTI.GOV)
Huang, Hao; Suslov, Nikolai B.; Li, Nan-Sheng
2014-08-21
Spinach is an in vitro–selected RNA aptamer that binds a GFP-like ligand and activates its green fluorescence. Spinach is thus an RNA analog of GFP and has potentially widespread applications for in vivo labeling and imaging. We used antibody-assisted crystallography to determine the structures of Spinach both with and without bound fluorophore at 2.2-Å and 2.4-Å resolution, respectively. Spinach RNA has an elongated structure containing two helical domains separated by an internal bulge that folds into a G-quadruplex motif of unusual topology. The G-quadruplex motif and adjacent nucleotides comprise a partially preformed binding site for the fluorophore. The fluorophore binds in a planar conformation and makes extensive aromatic stacking and hydrogen bond interactions with the RNA. Our findings provide a foundation for structure-based engineering of new fluorophore-binding RNA aptamers.
Guilfoyle, Amy P; Deshpande, Chandrika N; Vincent, Kimberley; Pedroso, Marcelo M; Schenk, Gerhard; Maher, Megan J; Jormakka, Mika
2014-05-01
GTPases (G proteins) hydrolyze the conversion of GTP to GDP and free phosphate, comprising an integral part of prokaryotic and eukaryotic signaling, protein biosynthesis and cell division, as well as membrane transport processes. The G protein cycle is brought to a halt after GTP hydrolysis, and requires the release of GDP before a new cycle can be initiated. For eukaryotic heterotrimeric Gαβγ proteins, the interaction with a membrane-bound G protein-coupled receptor catalyzes the release of GDP from the Gα subunit. Structural and functional studies have implicated one of the nucleotide binding sequence motifs, the G5 motif, as playing an integral part in this release mechanism. Indeed, a Gαs G5 mutant (A366S) was shown to have an accelerated GDP release rate, mimicking a G protein-coupled receptor catalyzed release state. In the present study, we investigate the role of the equivalent residue in the G5 motif (residue A143) in the prokaryotic membrane protein FeoB from Streptococcus thermophilus, which includes an N-terminal soluble G protein domain. The structure of this domain has previously been determined in the apo and GDP-bound states and in the presence of a transition state analogue, revealing conformational changes in the G5 motif. The A143 residue was mutated to a serine and analyzed with respect to changes in GTPase activity, nucleotide release rate, GDP affinity and structural alterations. We conclude that the identity of the residue at this position in the G5 loop plays a key role in the nucleotide release rate by allowing the correct positioning and hydrogen bonding of the nucleotide base. © 2014 FEBS.
Presence of a consensus DNA motif at nearby DNA sequence of the mutation susceptible CG nucleotides.
Chowdhury, Kaushik; Kumar, Suresh; Sharma, Tanu; Sharma, Ankit; Bhagat, Meenakshi; Kamai, Asangla; Ford, Bridget M; Asthana, Shailendra; Mandal, Chandi C
2018-01-10
Complexity in tissues affected by cancer arises from somatic mutations and epigenetic modifications in the genome. The mutation susceptible hotspots present within the genome indicate a non-random nature and/or a position specific selection of mutation. An association exists between the occurrence of mutations and epigenetic DNA methylation. This study is primarily aimed at determining mutation status, and identifying a signature for predicting mutation prone zones of tumor suppressor (TS) genes. Nearby sequences from the top five positions having a higher mutation frequency in each gene of 42 TS genes were selected from the COSMIC database and were considered as mutation prone zones. The conserved motifs present in the mutation prone DNA fragments were identified. Molecular docking studies were done to determine putative interactions between the identified conserved motifs and enzyme methyltransferase DNMT1. Collective analysis of 42 TS genes found GC as the most commonly replaced and AT as the most commonly formed residues after mutation. Analysis of the top 5 mutated positions of each gene (210 DNA segments for 42 TS genes) identified that CG nucleotides of the amino acid codons (e.g., Arginine) are most susceptible to mutation, and found a consensus DNA "T/AGC/GAGGA/TG" sequence present in these mutation prone DNA segments. Similar to TS genes, analysis of 54 oncogenes not only found CG nucleotides of the amino acid Arg as the most susceptible to mutation, but also identified the presence of similar consensus DNA motifs in the mutation prone DNA fragments (270 DNA segments for 54 oncogenes) of oncogenes. Docking studies depicted that, upon binding of DNMT1 methylates to this consensus DNA motif (C residues of CpG islands), mutation was likely to occur.
Thus, this study proposes that DNMT1 mediated methylation in chromosomal DNA may decrease if a foreign DNA segment containing this consensus sequence along with CG nucleotides is exogenously introduced to dividing cancer cells. Copyright © 2017 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Rivard, Brea R.; Cooper, Sarah J.; Stubbs, John M.
2018-02-01
DNA duplexes consisting of a 25mer together with shorter complementary sequences were studied over a range of temperature and surface binding motifs using a coarse-grained two-site nucleotide model. Results were analyzed in terms of hydrogen bonding interactions and structural characteristics and indicate that hybridization is most stable when furthest from the surface binding site. Strand elongation and straightening near the bound end are found to be correlated to duplex destabilization.
Motif finding in DNA sequences based on skipping nonconserved positions in background Markov chains.
Zhao, Xiaoyan; Sze, Sing-Hoi
2011-05-01
One strategy to identify transcription factor binding sites is through motif finding in upstream DNA sequences of potentially co-regulated genes. Despite extensive efforts, none of the existing algorithms perform very well. We consider a string representation that allows arbitrary ignored positions within the nonconserved portion of single motifs, and use O(2^l) Markov chains to model the background distributions of motifs of length l while skipping these positions within each Markov chain. By focusing initially on positions that have fixed nucleotides to define core occurrences, we develop an algorithm to identify motifs of moderate lengths. We compare the performance of our algorithm to other motif finding algorithms on a few benchmark data sets, and show that significant improvement in accuracy can be obtained when the sites are sufficiently conserved within a given sample, while comparable performance is obtained when the site conservation rate is low. A software program (PosMotif) and detailed results are available online at http://faculty.cse.tamu.edu/shsze/posmotif.
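The string representation with ignored positions can be pictured with a small matcher. In the sketch below, "." marks an ignored (nonconserved) position and every other character must match exactly. The "." notation, the function names, and the test sequence are assumptions for illustration; this is not the PosMotif program and it does not model the background Markov chains.

```python
# Sketch: match a motif string in which some positions are ignored.
# Assumptions: "." marks an ignored position; all other positions are fixed
# nucleotides. Illustrative only; not the PosMotif implementation.


def matches(window: str, pattern: str) -> bool:
    """True if the window agrees with the pattern at every fixed position."""
    return len(window) == len(pattern) and all(
        p == "." or p == w for p, w in zip(pattern, window)
    )


def occurrences(seq: str, pattern: str):
    """Return 0-based start offsets of all windows that match the pattern."""
    l = len(pattern)
    return [i for i in range(len(seq) - l + 1) if matches(seq[i:i + l], pattern)]


# Two fixed cores (TGA and TCA) separated by two ignored positions.
found = occurrences("TTGACGTCAATGAAATCA", "TGA..TCA")
```

The two hits differ at the ignored positions (CG versus AA) but share the fixed core, which is the kind of occurrence the abstract describes as defined by positions with fixed nucleotides.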
Iakhiaeva, Elena; Wower, Jacek; Wower, Iwona K.; Zwieb, Christian
2008-01-01
The signal recognition particle (SRP) plays a pivotal role in transporting proteins to cell membranes. In higher eukaryotes, SRP consists of an RNA molecule and six proteins. The largest of the SRP proteins, SRP72, was found previously to bind to the SRP RNA. A fragment of human SRP72 (72c′) bound effectively to human SRP RNA but only weakly to the similar SRP RNA of the archaeon Methanococcus jannaschii. Chimeras between the human and M. jannaschii SRP RNAs were constructed and used as substrates for 72c′. SRP RNA helical section 5e contained the 72c′ binding site. Systematic alteration within 5e revealed that the A240G and A240C changes dramatically reduced the binding of 72c′. Human SRP RNA with a single A240G change was unable to form a complex with full-length human SRP72. Two small RNA fragments, one composed of helical section 5ef, the other of section 5e, competed equally well for the binding of 72c′, demonstrating that no other regions of the SRP RNA were required. The biochemical data completely agreed with the nucleotide conservation pattern observed across the phylogenetic spectrum. Thus, most eukaryotic SRP RNAs are likely to require for function an adenosine within their 5e motifs. The human 5ef RNA was remarkably resistant to ribonucleolytic attack, suggesting that the 240-AUC-242 “loop” and its surrounding nucleotides form a peculiar compact structure recognized only by SRP72. PMID:18441046
Substrate specificity and reaction kinetics of an X-motif ribozyme
LAZAREV, DENIS; PUSKARZ, IZABELA; BREAKER, RONALD R.
2003-01-01
The X-motif is an in vitro-selected ribozyme that catalyzes RNA cleavage by an internal phosphoester transfer reaction. This ribozyme class is distinguished by the fact that it emerged as the dominant clone among at least 12 different classes of ribozymes when in vitro selection was conducted to favor the isolation of high-speed catalysts. We have examined the structural and kinetic properties of the X-motif in order to provide a framework for its application as an RNA-cleaving agent and to explore how this ribozyme catalyzes phosphoester transfer with a predicted rate constant that is similar to those exhibited by the four natural self-cleaving ribozymes. The secondary structure of the X-motif includes four stem elements that form a central unpaired junction. In a bimolecular format, two of these base-paired arms define the substrate specificity of the ribozyme and can be changed to target different RNAs for cleavage. The requirements for nucleotide identity at the cleavage site are GD, where D = G, A, or U and cleavage occurs between the two nucleotides. The ribozyme has an absolute requirement for a divalent cation cofactor and exhibits kinetic behavior that is consistent with the obligate binding of at least two metal ions. PMID:12756327
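The cleavage-site rule stated above (GD, where D = G, A, or U, with cleavage between the two nucleotides) can be sketched as a simple scan. The function name and the test RNA are hypothetical, and the sketch checks only the dinucleotide rule; it ignores the base-paired arms that define substrate specificity.

```python
# Sketch: list candidate X-motif cleavage sites in an RNA substrate.
# Assumption: only the "GD" rule from the abstract is checked (D = G, A, or U);
# pairing with the ribozyme's substrate-binding arms is not modelled.


def gd_cleavage_sites(rna: str):
    """Return 1-based positions of each G that is followed by G, A, or U.

    A returned position i means cleavage would fall between nucleotides
    i and i + 1.
    """
    rna = rna.upper()
    return [
        i + 1
        for i in range(len(rna) - 1)
        if rna[i] == "G" and rna[i + 1] in "GAU"
    ]


sites = gd_cleavage_sites("ACGUGCGGA")
```

For the made-up substrate above, the G at position 5 is skipped because it is followed by C, which is excluded by the D = G, A, or U requirement.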
cWINNOWER Algorithm for Finding Fuzzy DNA Motifs
NASA Technical Reports Server (NTRS)
Liang, Shoudan
2003-01-01
The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in protein-binding signals. A signal is defined as any short nucleotide pattern having up to d mutations differing from a motif of length l. The algorithm finds such motifs if multiple mutated copies of the motif (i.e., the signals) are present in the DNA sequence in sufficient abundance. The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker signals. We studied the minimum number of detectable motifs qc as a function of sequence length N for random sequences. We found that qc increases linearly with N for a fast version of the algorithm based on counting three-member sub-cliques. Imposing consensus constraints reduces qc, by a factor of three in this case, which makes the algorithm dramatically more sensitive. Our most sensitive algorithm, which counts four-member sub-cliques, needs a minimum of only 13 signals to detect motifs in a sequence of length N = 12000 for (l,d) = (15,4).
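The definition of a signal (a pattern with up to d mutations relative to a motif of length l) can be illustrated with a Hamming-distance scan. Note the assumption: the sketch takes the motif as known, whereas cWINNOWER has to discover it, so this shows only the (l, d) definition and not the clique-based algorithm. Function names and sequences are hypothetical.

```python
# Sketch: find (l, d) signals of a KNOWN motif in a sequence.
# Assumption: the motif is given. cWINNOWER itself must discover the motif;
# this only illustrates what counts as a signal with up to d mutations.


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_signals(sequence: str, motif: str, d: int):
    """Return (offset, window) for every window within d mismatches of motif."""
    l = len(motif)
    return [
        (i, sequence[i:i + l])
        for i in range(len(sequence) - l + 1)
        if hamming(sequence[i:i + l], motif) <= d
    ]


# (l, d) = (4, 1): one exact copy and one copy with a single substitution.
signals = find_signals("AAACGTAAACCTAA", "ACGT", 1)
```

With larger (l, d), such as the (15, 4) case mentioned above, two signals of the same motif can differ from each other at up to 2d positions, which is part of what makes the discovery problem hard.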
Role of sequence encoded κB DNA geometry in gene regulation by Dorsal
Mrinal, Nirotpal; Tomar, Archana; Nagaraju, Javaregowda
2011-01-01
Many proteins of the Rel family can act as both transcriptional activators and repressors. However, the mechanism that discerns the ‘activator/repressor’ functions of Rel-proteins such as Dorsal (Drosophila homologue of mammalian NFκB) is not understood. Using genomic, biophysical and biochemical approaches, we demonstrate that the underlying principle of this functional specificity lies in the ‘sequence-encoded structure’ of the κB-DNA. We show that Dorsal-binding motifs exist in distinct activator and repressor conformations. Molecular dynamics of DNA-Dorsal complexes revealed that repressor κB-motifs typically have an A-tract and a flexible conformation that facilitates interaction with co-repressors. The deformable structure of repressor motifs is due to changes in the hydrogen bonding in the A:T pair in the ‘A-tract’ core. The sixth nucleotide in the nonameric κB-motif, ‘A’ (A6) in the repressor motifs and ‘T’ (T6) in the activator motifs, is critical to confer this functional specificity, as the A6 → T6 mutation transformed the flexible repressor conformation into a rigid activator conformation. These results highlight that ‘sequence encoded κB DNA-geometry’ regulates gene expression by exerting an allosteric effect on binding of Rel proteins, which in turn regulates interaction with co-regulators. Further, we identified and characterized putative repressor motifs in Dl-target genes, which can potentially aid in functional annotation of the Dorsal gene regulatory network. PMID:21890896
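The sixth-position rule described above (A6 in repressor motifs, T6 in activator motifs) can be expressed as a very small classifier. This is only a labelling sketch under the assumption that position 6 alone is inspected; the function name and the example nonamers are made up for illustration and are not taken from the study.

```python
# Sketch: label a nonameric kappaB motif by its sixth nucleotide.
# Assumptions: only position 6 is inspected (A6 -> repressor-like,
# T6 -> activator-like); the example nonamers below are made up.


def classify_kb_motif(motif: str) -> str:
    """Return a coarse label based on the sixth nucleotide of a 9-nt motif."""
    motif = motif.upper()
    if len(motif) != 9:
        raise ValueError("expected a nonameric (9-nt) motif")
    sixth = motif[5]  # 1-based position 6
    if sixth == "A":
        return "repressor-like (A6)"
    if sixth == "T":
        return "activator-like (T6)"
    return "unclassified"


label_a6 = classify_kb_motif("GGGAAAACC")
label_t6 = classify_kb_motif("GGGAATTCC")
```

A real annotation would also need to account for the DNA geometry discussed in the abstract, which a single-position check cannot capture.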
Co-regulation analysis of co-expressed modules under cold and pathogen stress conditions in tomato.
Abedini, Davar; Rashidi Monfared, Sajad
2018-06-01
A primary mechanism for controlling the development of multicellular organisms is transcriptional regulation, which is carried out by transcription factors (TFs) that recognize and bind to their binding sites in promoter regions. The distance from the translation start site and the order, orientation, and spacing of cis elements are key factors in the concentration of active nuclear TFs and the transcriptional regulation of target genes. In this study, overrepresented motifs in cold- and pathogenesis-responsive genes were scanned using the Gibbs sampling method, which detects overrepresented motifs by means of a stochastic optimization strategy that searches over all possible sets of short DNA segments. The identified motifs were then checked against the TRANSFAC, PLACE and Soft Berry databases to identify putative TFs that interact with the motifs. Several cis/trans regulatory elements were found using these databases. Moreover, cross-talk between cold- and pathogenesis-responsive genes was confirmed. Statistical analysis was used to determine the distribution of the identified motifs across the promoter region. In addition, the co-regulation analysis showed that genes in the pathogenesis-responsive module are divided into two main groups. The promoter region was also divided into six subareas in order to map the distribution of motifs among promoter subareas. The results showed that the majority of motifs are concentrated within 700 nucleotides upstream of the translational start site (ATG). In contrast, this was not the case in the other group: there was no difference between total and compartmentalized regions in cold-responsive genes.
Hickey, Anthony; Esnault, Caroline; Majumdar, Anasuya; Chatterjee, Atreyi Ghatak; Iben, James R; McQueen, Philip G; Yang, Andrew X; Mizuguchi, Takeshi; Grewal, Shiv I S; Levin, Henry L
2015-11-01
Transposable elements (TEs) constitute a substantial fraction of the eukaryotic genome and, as a result, have a complex relationship with their host that is both adversarial and dependent. To minimize damage to cellular genes, TEs possess mechanisms that target integration to sequences of low importance. However, the retrotransposon Tf1 of Schizosaccharomyces pombe integrates with a surprising bias for promoter sequences of stress-response genes. The clustering of integration in specific promoters suggests that Tf1 possesses a targeting mechanism that is important for evolutionary adaptation to changes in environment. We report here that Sap1, an essential DNA-binding protein, plays an important role in Tf1 integration. A mutation in Sap1 resulted in a 10-fold drop in Tf1 transposition, and measures of transposon intermediates support the argument that the defect occurred in the process of integration. Published ChIP-Seq data on Sap1 binding combined with high-density maps of Tf1 integration that measure independent insertions at single-nucleotide positions show that 73.4% of all integration occurs at genomic sequences bound by Sap1. This represents high selectivity because Sap1 binds just 6.8% of the genome. A genome-wide analysis of promoter sequences revealed that Sap1 binding and amounts of integration correlate strongly. More important, an alignment of the DNA-binding motif of Sap1 revealed integration clustered on both sides of the motif and showed high levels specifically at positions +19 and -9. These data indicate that Sap1 contributes to the efficiency and position of Tf1 integration. Copyright © 2015 by the Genetics Society of America.
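The selectivity claim above can be turned into a rough fold-enrichment figure by dividing the share of integrations at Sap1-bound sequences by the share of the genome that Sap1 binds. The calculation assumes, as a simplification not stated in the abstract, that integration would otherwise be uniform across the genome.

```python
# Sketch: fold enrichment of Tf1 integration at Sap1-bound sequences.
# Assumption: uniform integration across the genome is used as the null
# expectation; the two fractions are the values quoted in the abstract.

integration_fraction = 0.734  # share of integrations at Sap1-bound sequences
genome_fraction = 0.068       # share of the genome bound by Sap1

fold_enrichment = integration_fraction / genome_fraction
print(f"fold enrichment ~ {fold_enrichment:.1f}")  # prints: fold enrichment ~ 10.8
```

Under that assumption, integration at Sap1-bound sequences is roughly 11 times more frequent than expected by chance.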
Hickey, Anthony; Esnault, Caroline; Majumdar, Anasuya; Chatterjee, Atreyi Ghatak; Iben, James R.; McQueen, Philip G.; Yang, Andrew X.; Mizuguchi, Takeshi; Grewal, Shiv I. S.; Levin, Henry L.
2015-01-01
Transposable elements (TEs) constitute a substantial fraction of the eukaryotic genome and, as a result, have a complex relationship with their host that is both adversarial and dependent. To minimize damage to cellular genes, TEs possess mechanisms that target integration to sequences of low importance. However, the retrotransposon Tf1 of Schizosaccharomyces pombe integrates with a surprising bias for promoter sequences of stress-response genes. The clustering of integration in specific promoters suggests that Tf1 possesses a targeting mechanism that is important for evolutionary adaptation to changes in environment. We report here that Sap1, an essential DNA-binding protein, plays an important role in Tf1 integration. A mutation in Sap1 resulted in a 10-fold drop in Tf1 transposition, and measures of transposon intermediates support the argument that the defect occurred in the process of integration. Published ChIP-Seq data on Sap1 binding combined with high-density maps of Tf1 integration that measure independent insertions at single-nucleotide positions show that 73.4% of all integration occurs at genomic sequences bound by Sap1. This represents high selectivity because Sap1 binds just 6.8% of the genome. A genome-wide analysis of promoter sequences revealed that Sap1 binding and amounts of integration correlate strongly. More important, an alignment of the DNA-binding motif of Sap1 revealed integration clustered on both sides of the motif and showed high levels specifically at positions +19 and −9. These data indicate that Sap1 contributes to the efficiency and position of Tf1 integration. PMID:26358720
Regions of extreme synonymous codon selection in mammalian genes
Schattner, Peter; Diekhans, Mark
2006-01-01
Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911
Kumar, Charanya; Eichmiller, Robin; Wang, Bangchen; Williams, Gregory M; Bianco, Piero R; Surtees, Jennifer A
2014-06-01
In Saccharomyces cerevisiae, Msh2-Msh3-mediated mismatch repair (MMR) recognizes and targets insertion/deletion loops for repair. Msh2-Msh3 is also required for 3' non-homologous tail removal (3'NHTR) in double-strand break repair. In both pathways, Msh2-Msh3 binds double-strand/single-strand junctions and initiates repair in an ATP-dependent manner. However, we recently demonstrated that the two pathways have distinct requirements with respect to Msh2-Msh3 activities. We identified a set of aromatic residues in the nucleotide binding pocket (FLY motif) of Msh3 that, when mutated, disrupted MMR, but left 3'NHTR largely intact. One of these mutations, msh3Y942A, was predicted to disrupt the nucleotide sandwich and allow altered positioning of ATP within the pocket. To develop a mechanistic understanding of the differential requirements for ATP binding and/or hydrolysis in the two pathways, we characterized Msh2-Msh3 and Msh2-msh3Y942A ATP binding and hydrolysis activities in the presence of MMR and 3'NHTR DNA substrates. We observed distinct, substrate-dependent ATP hydrolysis and nucleotide turnover by Msh2-Msh3, indicating that the MMR and 3'NHTR DNA substrates differentially modify the ATP binding/hydrolysis activities of Msh2-Msh3. Msh2-msh3Y942A retained the ability to bind DNA and ATP but exhibited altered ATP hydrolysis and nucleotide turnover. We propose that both ATP and structure-specific repair substrates cooperate to direct Msh2-Msh3-mediated repair and suggest an explanation for the msh3Y942A separation-of-function phenotype. Copyright © 2014 Elsevier B.V. All rights reserved. PMID:24746922
Li, Tong; Johansson, Ingegerd; Hay, Donald I.; Strömberg, Nicklas
1999-01-01
Oral strains of Actinomyces spp. express type 1 fimbriae, which are composed of major FimP subunits, and bind preferentially to salivary acidic proline-rich proteins (APRPs) or to statherin. We have mapped genetic differences in the fimP subunit genes and the peptide recognition motifs within the host proteins associated with these differential binding specificities. The fimP genes were amplified by PCR from Actinomyces viscosus ATCC 19246, with preferential binding to statherin, and from Actinomyces naeslundii LY7, P-1-K, and B-1-K, with preferential binding to APRPs. The fimP gene from the statherin-binding strain 19246 is novel and has about 80% nucleotide and amino acid sequence identity to the highly conserved fimP genes of the APRP-binding strains (about 98 to 99% sequence identity). The novel FimP protein contains an amino-terminal signal peptide, randomly distributed single-amino-acid substitutions, and structurally different segments and ends with a cell wall-anchoring and a membrane-spanning region. When agarose beads with CNBr-linked host determinant-specific decapeptides were used, A. viscosus 19246 bound to the Thr42Phe43 terminus of statherin and A. naeslundii LY7 bound to the Pro149Gln150 termini of APRPs. Furthermore, while the APRP-binding A. naeslundii strains originate from the human mouth, A. viscosus strains isolated from the oral cavity of rat and hamster hosts showed preferential binding to statherin and contained the novel fimP gene. Thus, A. viscosus and A. naeslundii display structurally variant fimP genes whose protein products are likely to interact with different peptide motifs and to determine animal host tropism. PMID:10225854
cWINNOWER algorithm for finding fuzzy dna motifs
NASA Technical Reports Server (NTRS)
Liang, S.; Samanta, M. P.; Biegel, B. A.
2004-01-01
The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in protein-binding signals. A signal is defined as any short nucleotide pattern having up to d mutations differing from a motif of length l. The algorithm finds such motifs if a clique consisting of a sufficiently large number of mutated copies of the motif (i.e., the signals) is present in the DNA sequence. The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker signals. We studied the minimum detectable clique size qc as a function of sequence length N for random sequences. We found that qc increases linearly with N for a fast version of the algorithm based on counting three-member sub-cliques. Imposing consensus constraints reduces qc by a factor of three in this case, which makes the algorithm dramatically more sensitive. Our most sensitive algorithm, which counts four-member sub-cliques, needs a minimum of only 13 signals to detect motifs in a sequence of length N = 12,000 for (l, d) = (15, 4). Copyright Imperial College Press.
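The signal definition used above (a pattern of length l differing from the motif at up to d positions) can be made concrete with a Hamming-distance scan. This is only a sketch of the definition, not of the cWINNOWER clique search itself; the function names are illustrative, and the motif is assumed to be known in advance, which is exactly what the real algorithm does not require.

```python
def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("strings must have equal length")
    return sum(x != y for x, y in zip(a, b))


def find_signals(sequence, motif, d):
    """Return (position, l-mer) for every window within d mutations of motif."""
    l = len(motif)
    signals = []
    for i in range(len(sequence) - l + 1):
        window = sequence[i:i + l]
        if hamming(window, motif) <= d:
            signals.append((i, window))
    return signals


signals = find_signals("ACGTTTTTACCT", "ACGT", d=1)
print(signals)
```

Because every signal lies within d mutations of the same motif, any two signals differ from each other at no more than 2d positions, which is why copies of one motif can form a clique in a graph whose edges join sufficiently similar l-mers.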
Using Maximum Entropy to Find Patterns in Genomes
NASA Astrophysics Data System (ADS)
Liu, Sophia; Hockenberry, Adam; Lancichinetti, Andrea; Jewett, Michael; Amaral, Luis
The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. To accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. This approach can also easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes. National Institute of General Medical Science, Northwestern University Presidential Fellowship, National Science Foundation, David and Lucile Packard Foundation, Camille Dreyfus Teacher Scholar Award.
Ranganathan, Sridevi; Cheung, Jonah; Cassidy, Michael; Ginter, Christopher; Pata, Janice D; McDonough, Kathleen A
2018-01-09
Mycobacterium tuberculosis (Mtb) encodes two CRP/FNR family transcription factors (TF) that contribute to virulence, Cmr (Rv1675c) and CRPMt (Rv3676). Prior studies identified distinct chromosomal binding profiles for each TF despite their recognizing overlapping DNA motifs. The present study shows that Cmr binding specificity is determined by discriminator nucleotides at motif positions 4 and 13. X-ray crystallography and targeted mutational analyses identified an arginine-rich loop that expands Cmr's DNA interactions beyond the classical helix-turn-helix contacts common to all CRP/FNR family members and facilitates binding to imperfect DNA sequences. Cmr binding to DNA results in a pronounced asymmetric bending of the DNA and its high level of cooperativity is consistent with DNA-facilitated dimerization. A unique N-terminal extension inserts between the DNA binding and dimerization domains, partially occluding the site where the canonical cAMP binding pocket is found. However, an unstructured region of this N-terminus may help modulate Cmr activity in response to cellular signals. Cmr's multiple levels of DNA interaction likely enhance its ability to integrate diverse gene regulatory signals, while its novel structural features establish Cmr as an atypical CRP/FNR family member. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Miura, Keiji; Kurosawa, Yoshikazu; Hirai, Momoki
1996-06-01
Nucleobindin (Nuc) was first identified as a secreted protein of 55 kDa that promotes production of DNA-specific antibodies in lupus-prone MRL/lpr mice. Analysis of cDNA that encoded Nuc revealed that the protein is composed of a signal peptide, a DNA-binding site, two calcium-binding motifs (EF-hand motifs), and a leucine zipper. In the present study, we analysed the organization of the human gene for Nuc (NUC). It consists of 13 exons that are distributed in a region of 32 kb. The functional motifs listed above are encoded in corresponding exons. NUC was expressed in all organs examined. Comparison of nucleotide sequences in the promoter regions between human and mouse NUC genes revealed several conserved sequences. Among them, two Sp1-binding sites and a CCAAT box are of particular interest. The promoter is of the TATA-less type, and transcription starts at multiple sites in both the human and the mouse genes. These features suggest that NUC might normally play a role as a housekeeping gene. NUC was located at human chromosome 19q13.2-q13.4. 25 refs., 4 figs., 1 tab.
NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents.
Liu, Sophia S; Hockenberry, Adam J; Lancichinetti, Andrea; Jewett, Michael C; Amaral, Luís A N
2016-11-01
The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems.
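A minimal sketch of the idea, assuming a fixed amino acid sequence and a single GC constraint: under maximum entropy, a constraint on mean GC count gives each synonymous codon a probability proportional to exp(lam * GC count). The tilt parameter lam and all function names here are hypothetical; this is not the API of the NullSeq package, and fitting lam to hit a target GC content is left out.

```python
import itertools
import math
import random

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}

# Group synonymous codons by the amino acid they encode.
SYNONYMS = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def gc_count(codon):
    """Number of G or C bases in a codon."""
    return sum(base in "GC" for base in codon)


def random_coding_sequence(protein, lam=0.0, rng=None):
    """Sample one codon per residue with weight exp(lam * GC count).

    lam = 0 picks synonymous codons uniformly; lam > 0 favours GC-rich codons.
    """
    rng = rng or random.Random()
    codons = []
    for aa in protein:
        options = SYNONYMS[aa]
        weights = [math.exp(lam * gc_count(c)) for c in options]
        codons.append(rng.choices(options, weights=weights, k=1)[0])
    return "".join(codons)


def translate(dna):
    """Translate a coding sequence whose length is a multiple of three."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


null_sequence = random_coding_sequence("MKWLVAG", lam=1.0, rng=random.Random(0))
print(null_sequence, translate(null_sequence))
```

Every sampled sequence encodes the same protein, so only synonymous codon choice, and with it GC content, varies between draws.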
Sánchez-Navarro, J A; Pallás, V
1997-01-01
The complete nucleotide sequence of an isolate of prunus necrotic ringspot virus (PNRSV) RNA 3 has been determined. Elucidation of the amino acid sequence of the proteins encoded by the two large open reading frames (ORFs) allowed us to carry out comparative and phylogenetic studies on the movement (MP) and coat (CP) proteins in the ilarvirus group. Amino acid sequence comparison of the MP revealed a highly conserved basic sequence motif with an amphipathic alpha-helical structure preceding the conserved motif of the '30K superfamily' proposed by Mushegian and Koonin [26] for MP's. Within this '30K' motif a strictly conserved transmembrane domain is present in all ilarviruses sequenced so far. At the amino-terminal end, prune dwarf virus (PDV) has an extension not present in other ilarviruses but which is observed in all bromo- and cucumoviruses, suggesting a common ancestor or a recombinational event in the Bromoviridae family. Examination of the N-terminus of the CP's of all ilarviruses revealed a highly basic region, part of which resembles the Arg-rich motif that has been characterized in the RNA-binding protein family. This motif has also been found in the other members of the Bromoviridae family, suggesting its involvement in a structural function. Furthermore this region is required for infectivity in ilarviruses. The similarities found in this Arg-rich motif are discussed in terms of this process known as genome activation. Finally, phylogenetic analysis of both the MP and CP proteins revealed a higher relationship of AlMV to PNRSV, apple mosaic virus (ApMV) and PDV than any other member of the ilarvirus group. In that sense, AlMV should be considered as a true ilarvirus instead of forming a distinct group of viruses.
Functionalizing Designer DNA Crystals
NASA Astrophysics Data System (ADS)
Chandrasekaran, Arun Richard
Three-dimensional crystals have been self-assembled from a DNA tensegrity triangle via sticky end interaction. The tensegrity triangle is a rigid DNA motif containing three double helical edges connected pair-wise by three four-arm junctions. The symmetric triangle contains 3 unique strands combined in a 3:3:1 ratio: 3 crossover, 3 helical and 1 central. The length of the sticky end reported previously was two nucleotides (nt) (GA:TC) and the motif with 2-helical turns of DNA per edge diffracted to 4.9 A at beam line NSLS-X25 and to 4 A at beam line ID19 at APS. The purpose of these self-assembled DNA crystals is that they can be used as a framework for hosting external guests for use in crystallographic structure solving or the periodic positioning of molecules for nanoelectronics. This thesis describes strategies to improve the resolution and to incorporate guests into the 3D lattice. The first chapter describes the effect of varying sticky end lengths and the influence of 5'-phosphate addition on crystal formation and resolution. X-ray diffraction data from beam line NSLS-X25 revealed that the crystal resolution for 1-nt (G:C) sticky end was 3.4 A. Motifs with every possible combination of 1-nt and 2-nt sticky-ended phosphorylated strands were crystallized and X-ray data were collected. The position of the 5'-phosphate on either the crossover (strand 1), helical (strand 2), or central strand (3) had an impact on the resolution of the self-assembled crystals with the 1-nt 1P-2-3 system diffracting to 2.62 A at APS and 3.1 A at NSLS-X25. The second chapter describes the sequence-specific recognition of DNA motifs with triplex-forming oligonucleotides (TFOs). This study examined the feasibility of using TFOs to bind to specific locations within a 3-turn DNA tensegrity triangle motif. The TFO 5'-TTCTTTCTTCTCT was used to target the tensegrity motif containing an appropriately embedded oligopurine.oligopyrimidine binding site. 
As triplex formation involving cytidine nucleotides is usually pH dependent (pH < 6) four different TFOs were examined: TFO-1 was unmodified while TFOs 2-4 contained additional stabilizing analogues capable of extending triplex formation to pH 7. In addition, each of the TFOs contained a Cy5 dye at the 5'-end of the oligonucleotide to aid in characterization of TFO binding - crystals were obtained with all four variations of TFOs. Formation of DNA triplex in the motif was characterized by an electrophoretic mobility shift assay (EMSA), UV melting studies and FRET. Crystals containing TFO-1 (unmodified) and TFO-2 (with 2'-amino ethoxy modification) were isolated and flash-frozen in liquid nitrogen for X-ray data collection at beam line NSLS-X25. X-ray data was also collected for crystals of the 3-turn triangle without any TFO bound to it. Difference maps were done between the crystals with TFO against the one without to identify any additional electron density corresponding to the third strand in the triplex binding region. The data from the crystal containing TFO-2 was used to further analyze if the additional density can match the expected position of the TFO on the triangle motif. Since the additional density did not correspond to the entire binding region, 2Fo-Fc, 3Fo-2Fc and 4Fo-3Fc maps were done to check for missing pieces of the electron density. From the resulting 2Fo-Fc map, the asymmetric unit from the 3-turn triangle (31-bp duplex model based on previous structure 3UBI) was inserted into the density as a reference. However, the electron density corresponding to the TFO was still not continuous throughout the 13-nt triplex binding region and allowed only a partial fit of the TFO. The third nucleotide in positions 1, 3, 4, 6, 7 were fit into the density in the major groove of the underlying duplex with proper triplex configuration. 
The third chapter describes the triplex approach to position a functional group (the UV cross-linking agent psoralen) within a pre-formed DNA motif. Triplex formation and psoralen cross-linking of the motif were analyzed by native and denaturing gel electrophoresis respectively. Motifs containing the Psoralen-TFO were also successfully crystallized and the crosslinking shown by analyzing the denatured crystals on a gel. The end goal would be to form a crosslinked designed DNA crystal that can diffract to a higher resolution. The fourth chapter describes the use of serial femtosecond crystallography for structure determination of designed DNA lattices. X-ray diffraction data from self-assembled 3D DNA microcrystals were collected from a stream of crystals in solution. Serial femtosecond crystallography eliminates the need for large crystals and the need for freezing, thus overcoming any associated crystal defects and radiation damage. Self-assembled nano/microcrystals were successfully made and were diffracted at room temperature. The best diffraction was from the 1-nt SE motif to an extent of 3.5 A in resolution.
Identifying mRNA sequence elements for target recognition by human Argonaute proteins
Li, Jingjing; Kim, TaeHyung; Nutiu, Razvan; Ray, Debashish; Hughes, Timothy R.; Zhang, Zhaolei
2014-01-01
It is commonly known that mammalian microRNAs (miRNAs) guide the RNA-induced silencing complex (RISC) to target mRNAs through the seed-pairing rule. However, recent experiments that coimmunoprecipitate the Argonaute proteins (AGOs), the central catalytic component of RISC, have consistently revealed extensive AGO-associated mRNAs that lack seed complementarity with miRNAs. We herein test the hypothesis that AGO has its own binding preference within target mRNAs, independent of guide miRNAs. By systematically analyzing the data from in vivo cross-linking experiments with human AGOs, we have identified a structurally accessible and evolutionarily conserved region (∼10 nucleotides in length) that alone can accurately predict AGO–mRNA associations, independent of the presence of miRNA binding sites. Within this region, we further identified an enriched motif that was replicable on independent AGO-immunoprecipitation data sets. We used RNAcompete to enumerate the RNA-binding preference of human AGO2 to all possible 7-mer RNA sequences and validated the AGO motif in vitro. These findings reveal a novel function of AGOs as sequence-specific RNA-binding proteins, which may aid miRNAs in recognizing their targets with high specificity. PMID:24663241
Mitrasinovic, Petar M
2006-03-01
RNA structure can be viewed as both a construct composed of various structural motifs and a flexible polymer that is substantially influenced by its environment. In this light, the present paper represents an attempt to reconcile the two standpoints. By using the 3D structures both of four (16S and 23S) portions of unbound 50S, H50S, and T30S ribosomal subunits and of 38 large ribonucleoligand complexes as the starting point, the behavior, which is induced by ligand binding, of 73 hairpin triloops with closing g-c and c-g base pairs was investigated using root-mean-square deviation (RMSD) approach and pseudotorsional (eta,theta) convention at the nucleotide-by-nucleotide level. Triloops were annotated in accordance with a recent proposal of geometric nomenclature. A simple measure for the determination of the strain of a triloop is introduced. It is believed that a possible classification of the interior triloops, based on the 2D eta-theta unique path, will aid to conceive their local behavior upon ligand binding. All rRNA residues in contact with ligands as well as regions of considerable conformational changes upon complex formation were identified. The analysis offers the answer to: how proximal to and how far from the actual ligand-binding sites the structural changes occur?
The phzA2-G2 Transcript Exhibits Direct RsmA-Mediated Activation in Pseudomonas aeruginosa M18
Ren, Bin; Shen, Huifeng; Lu, Zhi John; Liu, Haiming; Xu, Yuquan
2014-01-01
In bacteria, RNA-binding proteins of the RsmA/CsrA family act as post-transcriptional regulators that modulate translation initiation at target transcripts. The Pseudomonas aeruginosa genome contains two phenazine biosynthetic (phz) gene clusters, phzA1-G1 (phz1) and phzA2-G2 (phz2), each of which is responsible for phenazine-1-carboxylic acid (PCA) biosynthesis. In the present study, we show that RsmA exhibits differential gene regulation on two phz clusters in P. aeruginosa M18 at the post-transcriptional level. Based on the sequence analysis, four GGA motifs, the potential RsmA binding sites, are found on the 5′-untranslated region (UTR) of the phz2 transcript. Studies with a series of lacZ reporter fusions, and gel mobility shift assays suggest that the third GGA motif (S3), located 21 nucleotides upstream of the Shine-Dalgarno (SD) sequence, is involved in direct RsmA-mediated activation of phz2 expression. We therefore propose a novel model in which the binding of RsmA to the target S3 results in the destabilization of the stem-loop structure and the enhancement of ribosome access. This model could be fully supported by RNA structure prediction, free energy calculations, and nucleotide replacement studies. In contrast, various RsmA-mediated translation repression mechanisms have been identified in which RsmA binds near the SD sequence of target transcripts, thereby blocking ribosome access. Similarly, RsmA is shown to negatively regulate phz1 expression. Our new findings suggest that the differential regulation exerted by RsmA on the two phz clusters may confer an advantage to P. aeruginosa over other pseudomonads containing only a single phz cluster in their genomes. PMID:24586939
Suciu, Maria C.; Telenius, Jelena
2017-01-01
In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k-mer-based analysis of DNase footprints to determine any k-mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome. PMID:28904015
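The k-mer-based comparison described above rests on listing the k-mers that overlap a variant position in the reference and in the alternate allele. The sketch below shows only that enumeration step, with illustrative function names and 0-based positions; how Sasquatch scores each k-mer from DNase footprints is not reproduced here.

```python
def kmers_spanning(seq, pos, k):
    """All k-mers of seq that include the 0-based position pos."""
    first = max(0, pos - k + 1)
    last = min(pos, len(seq) - k)
    return [seq[start:start + k] for start in range(first, last + 1)]


def variant_kmer_pairs(ref_seq, pos, alt_base, k):
    """Pair each reference k-mer covering pos with its alternate-allele k-mer."""
    alt_seq = ref_seq[:pos] + alt_base + ref_seq[pos + 1:]
    return list(zip(kmers_spanning(ref_seq, pos, k),
                    kmers_spanning(alt_seq, pos, k)))


pairs = variant_kmer_pairs("ACGTACGT", pos=3, alt_base="G", k=3)
for ref_kmer, alt_kmer in pairs:
    print(ref_kmer, "->", alt_kmer)
```

A per-k-mer footprint score, looked up for both members of each pair, would then give an estimate of how the variant changes binding potential.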
NASA Technical Reports Server (NTRS)
Ji, C.; Chen, Y.; McCarthy, T. L.; Centrella, M.
1999-01-01
Transforming growth factor-beta binds to three high affinity cell surface molecules that directly or indirectly regulate its biological effects. The type III receptor (TRIII) is a proteoglycan that lacks significant intracellular signaling or enzymatic motifs but may facilitate transforming growth factor-beta binding to other receptors, stabilize multimeric receptor complexes, or segregate growth factor from activating receptors. Because various agents or events that regulate osteoblast function rapidly modulate TRIII expression, we cloned the 5' region of the rat TRIII gene to assess possible control elements. DNA fragments from this region directed high reporter gene expression in osteoblasts. Sequencing showed no consensus TATA or CCAAT boxes, whereas several nuclear factors binding sequences within the 3' region of the promoter co-mapped with multiple transcription initiation sites, DNase I footprints, gel mobility shift analysis, or loss of activity by deletion or mutation. An upstream enhancer was evident 5' proximal to nucleotide -979, and a silencer region occurred between nucleotides -2014 and -2194. Glucocorticoid sensitivity mapped between nucleotides -687 and -253, whereas bone morphogenetic protein 2 sensitivity co-mapped within the silencer region. Thus, the TRIII promoter contains cooperative basal elements and dispersed growth factor- and hormone-sensitive regulatory regions that can control TRIII expression by osteoblasts.
Liseron-Monfils, Christophe; Lewis, Tim; Ashlock, Daniel; McNicholas, Paul D; Fauteux, François; Strömvik, Martina; Raizada, Manish N
2013-03-15
The discovery of genetic networks and cis-acting DNA motifs underlying their regulation is a major objective of transcriptome studies. The recent release of the maize genome (Zea mays L.) has facilitated in silico searches for regulatory motifs. Several algorithms exist to predict cis-acting elements, but none have been adapted for maize. A benchmark data set was used to evaluate the accuracy of three motif discovery programs: BioProspector, Weeder and MEME. Analysis showed that each motif discovery tool had limited accuracy and appeared to retrieve a distinct set of motifs. Therefore, using the benchmark, statistical filters were optimized to reduce the false discovery ratio, and then remaining motifs from all programs were combined to improve motif prediction. These principles were integrated into a user-friendly pipeline for motif discovery in maize called Promzea, available at http://www.promzea.org and on the Discovery Environment of the iPlant Collaborative website. Promzea was subsequently expanded to include rice and Arabidopsis. Within Promzea, a user enters cDNA sequences or gene IDs; corresponding upstream sequences are retrieved from the maize genome. Predicted motifs are filtered, combined and ranked. Promzea searches the chosen plant genome for genes containing each candidate motif, providing the user with the gene list and corresponding gene annotations. Promzea was validated in silico using a benchmark data set: the Promzea pipeline showed a 22% increase in nucleotide sensitivity compared to the best standalone program tool, Weeder, with equivalent nucleotide specificity. Promzea was also validated by its ability to retrieve the experimentally defined binding sites of transcription factors that regulate the maize anthocyanin and phlobaphene biosynthetic pathways. 
Promzea predicted additional promoter motifs, and genome-wide motif searches by Promzea identified 127 non-anthocyanin/phlobaphene genes that each contained all five predicted promoter motifs in their promoters, perhaps uncovering a broader co-regulated gene network. Promzea was also tested against tissue-specific microarray data from maize. An online tool customized for promoter motif discovery in plants has been generated called Promzea. Promzea was validated in silico by its ability to retrieve benchmark motifs and experimentally defined motifs and was tested using tissue-specific microarray data. Promzea predicted broader networks of gene regulation associated with the historic anthocyanin and phlobaphene biosynthetic pathways. Promzea is a new bioinformatics tool for understanding transcriptional gene regulation in maize and has been expanded to include rice and Arabidopsis.
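The abstract compares tools by nucleotide sensitivity and specificity without defining them. The sketch below assumes the usual per-nucleotide definitions, sensitivity = TP / (TP + FN) and specificity = TN / (TN + FP), computed over positions of a single sequence; the function name and the half-open (start, end) interval convention are illustrative and may differ from the benchmark actually used.

```python
def nucleotide_sensitivity_specificity(true_sites, predicted_sites, sequence_length):
    """Per-nucleotide sensitivity and specificity for half-open (start, end) sites.

    Assumes at least one true-site position and one non-site position exist.
    """
    true_positions = set()
    for start, end in true_sites:
        true_positions.update(range(start, end))
    predicted_positions = set()
    for start, end in predicted_sites:
        predicted_positions.update(range(start, end))

    tp = len(true_positions & predicted_positions)
    fn = len(true_positions - predicted_positions)
    fp = len(predicted_positions - true_positions)
    tn = sequence_length - len(true_positions | predicted_positions)
    return tp / (tp + fn), tn / (tn + fp)


sensitivity, specificity = nucleotide_sensitivity_specificity(
    true_sites=[(10, 20)], predicted_sites=[(15, 25)], sequence_length=100
)
print(f"sensitivity={sensitivity:.2f}, specificity={specificity:.3f}")
```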
Structural Characterization of Two Metastable ATP-Bound States of P-Glycoprotein
O’Mara, Megan L.; Mark, Alan E.
2014-01-01
ATP Binding Cassette (ABC) transporters couple the binding and hydrolysis of ATP to the transport of substrate molecules across the membrane. The mechanism by which ATP binding and/or hydrolysis drives the conformational changes associated with substrate transport has not yet been characterized fully. Here, changes in the conformation of the ABC export protein P-glycoprotein on ATP binding are examined in a series of molecular dynamics simulations. When one molecule of ATP is placed at the ATP binding site associated with each of the two nucleotide binding domains (NBDs), the membrane-embedded P-glycoprotein crystal structure adopts two distinct metastable conformations. In one, each ATP molecule interacts primarily with the Walker A motif of the corresponding NBD. In the other, the ATP molecules interact with both the Walker A motif of one NBD and the Signature motif of the opposite NBD, inducing the partial dimerization of the NBDs. This interaction is more extensive in one of the two ATP binding sites, leading to an asymmetric structure. The overall conformation of the transmembrane domains is not altered in either of these metastable states, indicating that the conformational changes associated with ATP binding observed in the simulations in the absence of substrate do not lead to the outward-facing conformation and thus would be insufficient in themselves to drive transport. Nevertheless, the metastable intermediate ATP-bound conformations observed are compatible with a wide range of experimental cross-linking data, demonstrating that the simulations do capture physiologically important conformations. Analysis of the interaction between ATP and its cofactor Mg2+ with each NBD indicates that the coordination of ATP and Mg2+ differs between the two NBDs. The role structural asymmetry may play in ATP binding and hydrolysis is discussed. Furthermore, we demonstrate that our results are not heavily influenced by the crystal structure chosen for initiation of the simulations.
PMID:24632881
Swellix: a computational tool to explore RNA conformational space.
Sloat, Nathan; Liu, Jui-Wen; Schroeder, Susan J
2017-11-21
The sequence of nucleotides in an RNA determines the possible base pairs for an RNA fold and thus also determines the overall shape and function of an RNA. The Swellix program presented here combines a helix abstraction with a combinatorial approach to the RNA folding problem in order to compute all possible non-pseudoknotted RNA structures for RNA sequences. The Swellix program builds on the Crumple program and can include experimental constraints on global RNA structures such as the minimum number and lengths of helices from crystallography, cryoelectron microscopy, or in vivo crosslinking and chemical probing methods. The conceptual advance in Swellix is to count helices and generate all possible combinations of helices rather than counting and combining base pairs. Swellix bundles similar helices and includes improvements in memory use and efficient parallelization. Biological applications of Swellix are demonstrated by computing the reduction in conformational space and entropy due to naturally modified nucleotides in tRNA sequences and by motif searches in Human Endogenous Retroviral (HERV) RNA sequences. The Swellix motif search reveals occurrences of protein and drug binding motifs in the HERV RNA ensemble that do not occur in minimum free energy or centroid predicted structures. Swellix presents significant improvements over Crumple in terms of efficiency and memory use. The efficient parallelization of Swellix enables the computation of sequences as long as 418 nucleotides with sufficient experimental constraints. Thus, Swellix provides a practical alternative to free energy minimization tools when multiple structures, kinetically determined structures, or complex RNA-RNA and RNA-protein interactions are present in an RNA folding problem.
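The idea of counting helices rather than individual base pairs can be illustrated with a minimal sketch. It is not the Swellix or Crumple algorithm; the function name `enumerate_helices`, the pairing set (Watson-Crick plus G-U wobble), and the `min_len` and `min_loop` defaults are assumptions made only for this illustration. The sketch lists every maximal run of stacked pairs in a sequence, which is the kind of building block a combinatorial enumerator would then combine.

```python
from typing import List, Tuple

# Allowed pairs: Watson-Crick plus G-U wobble (an assumption of this sketch).
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def enumerate_helices(seq: str, min_len: int = 3, min_loop: int = 3) -> List[Tuple[int, int, int]]:
    """Return maximal helices as (i, j, length).

    A helix (i, j, length) consists of the pairs (i + k, j - k) for
    k in range(length), using 0-based positions. At least `min_loop`
    unpaired nucleotides must remain between the innermost pair.
    """
    n = len(seq)
    helices = []
    for i in range(n):
        for j in range(i + min_loop + 1, n):
            if (seq[i], seq[j]) not in PAIRS:
                continue
            # Skip pairs that merely continue a helix starting further out.
            if i > 0 and j < n - 1 and (seq[i - 1], seq[j + 1]) in PAIRS:
                continue
            length = 0
            while (i + length < j - length - min_loop
                   and (seq[i + length], seq[j - length]) in PAIRS):
                length += 1
            if length >= min_len:
                helices.append((i, j, length))
    return helices


if __name__ == "__main__":
    print(enumerate_helices("GGGAAAACCC"))
```

Enumerating helices first keeps the search space far smaller than enumerating single base pairs, which is the motivation the abstract gives for the helix abstraction.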
DEEP MOTIF DASHBOARD: VISUALIZING AND UNDERSTANDING GENOMIC SEQUENCES USING DEEP NEURAL NETWORKS.
Lanchantin, Jack; Singh, Ritambhara; Wang, Beilun; Qi, Yanjun
2017-01-01
Deep neural network (DNN) models have recently obtained state-of-the-art prediction accuracy for the transcription factor binding site (TFBS) classification task. However, it remains unclear how these approaches identify meaningful DNA sequence signals and give insights as to why TFs bind to certain locations. In this paper, we propose a toolkit called the Deep Motif Dashboard (DeMo Dashboard) which provides a suite of visualization strategies to extract motifs, or sequence patterns, from deep neural network models for TFBS classification. We demonstrate how to visualize and understand three important DNN models: convolutional, recurrent, and convolutional-recurrent networks. Our first visualization method is finding a test sequence's saliency map, which uses first-order derivatives to describe the importance of each nucleotide in making the final prediction. Second, considering that recurrent models make predictions in a temporal manner (from one end of a TFBS sequence to the other), we introduce temporal output scores, indicating the prediction score of a model over time for a sequential input. Lastly, a class-specific visualization strategy finds the optimal input sequence for a given TFBS positive class via stochastic gradient optimization. Our experimental results indicate that a convolutional-recurrent architecture performs the best among the three architectures. The visualization techniques indicate that CNN-RNN makes predictions by modeling both motifs and the dependencies among them.
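A saliency map based on first-order derivatives can be sketched on a toy model. The example below is not the DeMo Dashboard code: it assumes a single-layer logistic model over a one-hot encoded sequence, with hypothetical names `one_hot`, `predict` and `saliency` and hand-picked weights. For this model the gradient of the output with respect to the input is p(1 - p) times the weights, and each position is scored by the magnitude of the gradient at the nucleotide actually present.

```python
import math
from typing import List

ALPHABET = "ACGT"


def one_hot(seq: str) -> List[List[float]]:
    """Encode a DNA string as a list of 4-element one-hot rows."""
    return [[1.0 if base == letter else 0.0 for letter in ALPHABET] for base in seq]


def predict(x: List[List[float]], weights: List[List[float]], bias: float) -> float:
    """Toy model: sigmoid of a weighted sum over all positions and letters."""
    z = bias + sum(w * v for wrow, xrow in zip(weights, x) for w, v in zip(wrow, xrow))
    return 1.0 / (1.0 + math.exp(-z))


def saliency(seq: str, weights: List[List[float]], bias: float) -> List[float]:
    """Per-position saliency: |d(output)/d(input)| at the observed nucleotide."""
    x = one_hot(seq)
    p = predict(x, weights, bias)
    grad_scale = p * (1.0 - p)  # derivative of the sigmoid at the current output
    return [
        abs(sum(grad_scale * w * v for w, v in zip(wrow, xrow)))
        for wrow, xrow in zip(weights, x)
    ]


if __name__ == "__main__":
    # Hypothetical weights rewarding T, A, A at positions 0-2 and ignoring position 3.
    toy_weights = [[0, 0, 0, 2], [2, 0, 0, 0], [2, 0, 0, 0], [0, 0, 0, 0]]
    print(saliency("TAAT", toy_weights, bias=-3.0))
```

In a real network the gradient would come from automatic differentiation rather than a closed form, but the interpretation is the same: positions with larger gradient magnitude influence the prediction more.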
Deep Motif Dashboard: Visualizing and Understanding Genomic Sequences Using Deep Neural Networks
Lanchantin, Jack; Singh, Ritambhara; Wang, Beilun; Qi, Yanjun
2018-01-01
Deep neural network (DNN) models have recently obtained state-of-the-art prediction accuracy for the transcription factor binding site (TFBS) classification task. However, it remains unclear how these approaches identify meaningful DNA sequence signals and give insights as to why TFs bind to certain locations. In this paper, we propose a toolkit called the Deep Motif Dashboard (DeMo Dashboard) which provides a suite of visualization strategies to extract motifs, or sequence patterns, from deep neural network models for TFBS classification. We demonstrate how to visualize and understand three important DNN models: convolutional, recurrent, and convolutional-recurrent networks. Our first visualization method is finding a test sequence’s saliency map, which uses first-order derivatives to describe the importance of each nucleotide in making the final prediction. Second, considering that recurrent models make predictions in a temporal manner (from one end of a TFBS sequence to the other), we introduce temporal output scores, indicating the prediction score of a model over time for a sequential input. Lastly, a class-specific visualization strategy finds the optimal input sequence for a given TFBS positive class via stochastic gradient optimization. Our experimental results indicate that a convolutional-recurrent architecture performs the best among the three architectures. The visualization techniques indicate that CNN-RNN makes predictions by modeling both motifs and the dependencies among them. PMID:27896980
A MicroRNA Superfamily Regulates Nucleotide Binding Site–Leucine-Rich Repeats and Other mRNAs
Shivaprasad, Padubidri V.; Chen, Ho-Ming; Patel, Kanu; Bond, Donna M.; Santos, Bruno A.C.M.; Baulcombe, David C.
2012-01-01
Analysis of tomato (Solanum lycopersicum) small RNA data sets revealed the presence of a regulatory cascade affecting disease resistance. The initiators of the cascade are microRNA members of an unusually diverse superfamily in which miR482 and miR2118 are prominent members. Members of this superfamily are variable in sequence and abundance in different species, but all variants target the coding sequence for the P-loop motif in the mRNA sequences for disease resistance proteins with nucleotide binding site (NBS) and leucine-rich repeat (LRR) motifs. We confirm, using transient expression in Nicotiana benthamiana, that miR482 targets mRNAs for NBS-LRR disease resistance proteins with coiled-coil domains at their N terminus. The targeting causes mRNA decay and production of secondary siRNAs in a manner that depends on RNA-dependent RNA polymerase 6. At least one of these secondary siRNAs targets other mRNAs of a defense-related protein. The miR482-mediated silencing cascade is suppressed in plants infected with viruses or bacteria so that expression of mRNAs with miR482 or secondary siRNA target sequences is increased. We propose that this process allows pathogen-inducible expression of NBS-LRR proteins and that it contributes to a novel layer of defense against pathogen attack. PMID:22408077
GBshape: a genome browser database for DNA shape annotations
Chiu, Tsu-Pei; Yang, Lin; Zhou, Tianyin; Main, Bradley J.; Parker, Stephen C.J.; Nuzhdin, Sergey V.; Tullius, Thomas D.; Rohs, Remo
2015-01-01
Many regulatory mechanisms require a high degree of specificity in protein-DNA binding. Nucleotide sequence does not provide an answer to the question of why a protein binds only to a small subset of the many putative binding sites in the genome that share the same core motif. Whereas higher-order effects, such as chromatin accessibility, cooperativity and cofactors, have been described, DNA shape recently gained attention as another feature that fine-tunes the DNA binding specificities of some transcription factor families. Our Genome Browser for DNA shape annotations (GBshape; freely available at http://rohslab.cmb.usc.edu/GBshape/) provides minor groove width, propeller twist, roll, helix twist and hydroxyl radical cleavage predictions for the entire genomes of 94 organisms. Additional genomes can easily be added using the GBshape framework. GBshape can be used to visualize DNA shape annotations qualitatively in a genome browser track format, and to download quantitative values of DNA shape features as a function of genomic position at nucleotide resolution. As biological applications, we illustrate the periodicity of DNA shape features that are present in nucleosome-occupied sequences from human, fly and worm, and we demonstrate structural similarities between transcription start sites in the genomes of four Drosophila species. PMID:25326329
Sperry, Justin B.; Ryan, Zachary C.; Kumar, Rajiv; Gross, Michael L.
2012-01-01
Xeroderma pigmentosum (XP) is a genetic disease affecting 1 in 10,000-100,000 and predisposes people to early-age skin cancer, a disease that is increasing. Those with XP have decreased ability to repair UV-induced DNA damage, leading to increased susceptibility to cancerous non-melanomas and melanomas. A vital, heterotrimeric protein complex is linked to the nucleotide excision repair pathway for the damaged DNA. The complex consists of XPC protein, human centrin 2, and RAD23B. One of the members, human centrin 2, is a ubiquitous, acidic, Ca2+-binding protein belonging to the calmodulin superfamily. The XPC protein contains a sequence motif specific for binding to human centrin 2. We report here the Ca2+-binding properties of human centrin 2 and its interaction with the XPC peptide motif. We utilized a region-specific H/D exchange protocol to localize the interaction of the XPC peptide with the C-terminal domain of centrin, the binding of which is different from that of calmodulin complexes. The binding dynamics of human centrin 2 to the XPC peptide in the absence and presence of Ca2+ are revealed by the observation of the EX1 H/D exchange regime, indicating that a locally unfolded population exists in solution and undergoes fast H/D exchange. PMID:23439742
Rodriguez Parkitna, Jan M; Ozyhar, Andrzej; Wiśniewski, Jacek R; Kochman, Marian
2002-09-01
Juvenile hormone binding proteins (JHBPs) serve as specific carriers of juvenile hormone (JH) in insect hemolymph. As shown in this report, Galleria mellonella JHBP is encoded by a cDNA of 1063 nucleotides. The pre-protein consists of 245 amino acids with a 20 amino acid leader sequence. The concentration of the JHBP mRNA reaches a maximum on the third day of the last larval instar, and decreases five-fold towards pupation. Comparison of amino acid sequences of JHBPs from Bombyx mori, Heliothis virescens, Manduca sexta and G. mellonella shows that 57 positions out of 226 are occupied by identical amino acids. A phylogenetic tree was constructed from 32 proteins whose function could be associated with JH. It has three major branches: (i) ligand binding domains of nuclear receptors, (ii) JHBPs and JH esterases (JHEs), and (iii) hypothetical proteins found in the Drosophila melanogaster genome. Despite the close positioning of JHEs and JHBPs on the tree, which probably arises from the presence of a common JH binding motif, these proteins are unlikely to belong to the same family. Detailed analysis of the secondary structure modeling shows that JHBPs may contain a beta-barrel motif flanked by alpha-helices and thus be evolutionarily related to the same superfamily as calycins.
Andrabi, Munazah; Hutchins, Andrew Paul; Miranda-Saavedra, Diego; Kono, Hidetoshi; Nussinov, Ruth; Mizuguchi, Kenji; Ahmad, Shandar
2017-06-22
DNA shape is emerging as an important determinant of transcription factor binding beyond just the DNA sequence. The only tool for large-scale DNA shape estimates, DNAshape, was derived from Monte-Carlo simulations and predicts four broad and static DNA shape features: Propeller twist, Helical twist, Minor groove width and Roll. The contributions of other shape features, e.g. Shift, Slide and Opening, cannot be evaluated using DNAshape. Here, we report a novel method, DynaSeq, which predicts molecular dynamics-derived ensembles of a more exhaustive set of DNA shape features. We compared the DNAshape and DynaSeq predictions for the common features and applied both to predict the genome-wide binding sites of 1312 TFs available from protein interaction quantification (PIQ) data. The results indicate a good agreement between the two methods for the common shape features and point to advantages in using DynaSeq. Predictive models employing ensembles from individual conformational parameters revealed that base-pair opening - known to be important in strand separation - was the best predictor of transcription factor-binding sites (TFBS), followed by features employed by DNAshape. Of note, TFBS could be predicted not only from the features at the target motif sites, but also from those as far as 200 nucleotides away from the motif.
Rewriting nature's assembly manual for a ssRNA virus.
Patel, Nikesh; Wroblewski, Emma; Leonov, German; Phillips, Simon E V; Tuma, Roman; Twarock, Reidun; Stockley, Peter G
2017-11-14
Satellite tobacco necrosis virus (STNV) is one of the smallest viruses known. Its genome encodes only its coat protein (CP) subunit, relying on the polymerase of its helper virus TNV for replication. The genome has been shown to contain a cryptic set of dispersed assembly signals in the form of stem-loops that each present a minimal CP-binding motif AXXA in the loops. The genomic fragment encompassing nucleotides 1-127 is predicted to contain five such packaging signals (PSs). We have used mutagenesis to determine the critical assembly features in this region. These include the CP-binding motif, the relative placement of PS stem-loops, their number, and their folding propensity. CP binding has an electrostatic contribution, but assembly nucleation is dominated by the recognition of the folded PSs in the RNA fragment. Mutation to remove all AXXA motifs in PSs throughout the genome yields an RNA that is unable to assemble efficiently. In contrast, when a synthetic 127-nt fragment encompassing improved PSs is swapped onto the RNA otherwise lacking CP recognition motifs, assembly is partially restored, although the virus-like particles created are incomplete, implying that PSs outside this region are required for correct assembly. Swapping this improved region into the wild-type STNV1 sequence results in a better assembly substrate than the viral RNA, producing complete capsids and outcompeting the wild-type genome in head-to-head competition. These data confirm details of the PS-mediated assembly mechanism for STNV and identify an efficient approach for production of stable virus-like particles encapsidating nonnative RNAs or other cargoes. Copyright © 2017 the Author(s). Published by PNAS.
Bunka, David H J; Lane, Stephen W; Lane, Claire L; Dykeman, Eric C; Ford, Robert J; Barker, Amy M; Twarock, Reidun; Phillips, Simon E V; Stockley, Peter G
2011-10-14
Using a recombinant, T=1 Satellite Tobacco Necrosis Virus (STNV)-like particle expressed in Escherichia coli, we have established conditions for in vitro disassembly and reassembly of the viral capsid. In vivo assembly is dependent on the presence of the coat protein (CP) N-terminal region, and in vitro assembly requires RNA. Using immobilised CP monomers under reassembly conditions with "free" CP subunits, we have prepared a range of partially assembled CP species for RNA aptamer selection. SELEX directed against the RNA-binding face of the STNV CP resulted in the isolation of several clones, one of which (B3) matches the STNV-1 genome in 16 out of 25 nucleotide positions, including across a statistically significant 10/10 stretch. This 10-base region folds into a stem-loop displaying the motif ACAA and has been shown to bind to STNV CP. Analysis of the other aptamer sequences reveals that the majority can be folded into stem-loops displaying versions of this motif. Using a sequence and secondary structure search motif to analyse the genomic sequence of STNV-1, we identified 30 stem-loops displaying the sequence motif AxxA. The implication is that there are many stem-loops in the genome carrying essential recognition features for binding STNV CP. Secondary structure predictions of the genomic RNA using Mfold showed that only 8 out of 30 of these stem-loops would be formed in the lowest-energy structure. These results are consistent with an assembly mechanism based on kinetically driven folding of the RNA. Copyright © 2011 Elsevier Ltd. All rights reserved.
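The sequence half of an AxxA motif search can be sketched with a regular expression. This is only an illustration, not the search tool used in the study: the function name `find_axxa` is hypothetical, the example sequence is made up, and the secondary-structure requirement (the motif being displayed in the loop of a stem-loop) is deliberately left out. A lookahead is used so that overlapping occurrences are all reported.

```python
import re
from typing import List, Tuple

# A, any two nucleotides, A. The lookahead lets overlapping matches be found.
AXXA = re.compile(r"(?=(A[ACGU]{2}A))")


def find_axxa(rna: str) -> List[Tuple[int, str]]:
    """Return (0-based start, matched tetranucleotide) for every AxxA occurrence."""
    return [(m.start(), m.group(1)) for m in AXXA.finditer(rna.upper())]


if __name__ == "__main__":
    # Made-up fragment containing the ACAA version of the motif.
    print(find_axxa("GGACAAGG"))
```

Candidate hits from such a scan would still need to be checked against predicted or measured stem-loop positions before being treated as packaging signals.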
Li, Xiaoze; Johansson, Cecilia; Glahder, Jacob; Mossberg, Ann-Kristin; Schwartz, Stefan
2013-01-01
Human papillomavirus type 16 (HPV-16) 5′-splice site SD3632 is used exclusively to produce late L1 mRNAs. We identified a 34-nt splicing inhibitory element located immediately upstream of HPV-16 late 5′-splice site SD3632. Two AUAGUA motifs located in these 34 nt inhibited SD3632. Two nucleotide substitutions in each of the HPV-16 specific AUAGUA motifs alleviated splicing inhibition and induced late L1 mRNA production from episomal forms of the HPV-16 genome in primary human keratinocytes. The AUAGUA motifs bind specifically not only to the heterogeneous nuclear RNP (hnRNP) D family of RNA-binding proteins including hnRNP D/AUF, hnRNP DL and hnRNP AB but also to hnRNP A2/B1. Knock-down of these proteins induced HPV-16 late L1 mRNA expression, and overexpression of hnRNP A2/B1, hnRNP AB, hnRNP DL and the two hnRNP D isoforms hnRNP D37 and hnRNP D40 further suppressed L1 mRNA expression. This inhibition may allow HPV-16 to hide from the immune system and establish long-term persistent infections with enhanced risk of progressing to cancer. There is an inverse correlation between expression of hnRNP D proteins and hnRNP A2/B1 and HPV-16 L1 production in the cervical epithelium, as well as in cervical cancer, supporting the conclusion that hnRNP D proteins and A2/B1 inhibit HPV-16 L1 mRNA production. PMID:24013563
Kumar, Charanya; Williams, Gregory M; Havens, Brett; Dinicola, Michelle K; Surtees, Jennifer A
2013-06-12
In Saccharomyces cerevisiae, repair of insertion/deletion loops is carried out by Msh2-Msh3-mediated mismatch repair (MMR). Msh2-Msh3 is also required for 3' non-homologous tail removal (3' NHTR) in double-strand break repair. In both pathways, Msh2-Msh3 binds double-strand/single-strand junctions and initiates repair in an ATP-dependent manner. However, the kinetics of the two processes appear different; MMR is likely rapid in order to coordinate with the replication fork, whereas 3' NHTR has been shown to be a slower process. To understand the molecular requirements in both repair pathways, we performed an in vivo analysis of well-conserved residues in Msh3 that are hypothesized to be required for MMR and/or 3' NHTR. These residues are predicted to be involved in either communication between the DNA-binding and ATPase domains within the complex or nucleotide binding and/or exchange within Msh2-Msh3. We identified a set of aromatic residues within the FLY motif of the predicted Msh3 nucleotide binding pocket that are essential for Msh2-Msh3-mediated MMR but are largely dispensable for 3' NHTR. In contrast, mutations in other regions gave similar phenotypes in both assays. Based on these results, we suggest that the two pathways have distinct requirements with respect to the position of the bound ATP within Msh3. We propose that the differences are related, at least in part, to the kinetics of each pathway. Proper binding and positioning of ATP is required to induce rapid conformational changes at the replication fork, but is less important when more time is available for repair, as in 3' NHTR. Copyright © 2013 Elsevier Ltd. All rights reserved.
Kumar, Charanya; Williams, Gregory M.; Havens, Brett; Dinicola, Michelle; Surtees, Jennifer A.
2013-01-01
In Saccharomyces cerevisiae, repair of insertion/deletion loops is carried out by Msh2-Msh3-mediated mismatch repair (MMR). Msh2-Msh3 is also required for 3’ non-homologous tail removal (3’NHTR) in double-strand break repair. In both pathways, Msh2-Msh3 binds double-strand/single-strand junctions and initiates repair in an ATP-dependent manner. However, the kinetics of the two processes appear different; MMR is likely rapid in order to coordinate with the replication fork, whereas 3’ NHTR has been shown to be a slower process. To understand the molecular requirements in both repair pathways, we performed an in vivo analysis of well conserved residues in Msh3 that are hypothesized to be required for MMR and/or 3’NHTR. These residues are predicted to be involved in either communication between the DNA-binding and ATPase domains within the complex or nucleotide binding and/or exchange within Msh2-Msh3. We identified a set of aromatic residues within the FLY motif of the predicted Msh3 nucleotide binding pocket that are essential for Msh2-Msh3-mediated MMR but are largely dispensable for 3’NHTR. In contrast, mutations in other regions gave similar phenotypes in both assays. Based on these results, we suggest the two pathways have distinct requirements with respect to the position of the bound ATP within Msh3. We propose that the differences are related, at least in part, to the kinetics of each pathway. Proper binding and positioning of ATP is required to induce rapid conformational changes at the replication fork, but is less important when more time is available for repair, as in 3’ NHTR. PMID:23458407
Functional and Structural Analysis of the Conserved EFhd2 Protein
Acosta, Yancy Ferrer; Rodríguez Cruz, Eva N.; Vaquer, Ana del C.; Vega, Irving E.
2013-01-01
EFhd2 is a novel protein conserved from C. elegans to H. sapiens. This novel protein was originally identified in cells of the immune and central nervous systems. However, it is most abundant in the central nervous system, where it has been found associated with pathological forms of the microtubule-associated protein tau. The physiological or pathological roles of EFhd2 are poorly understood. In this study, a functional and structural analysis was carried out to characterize the molecular requirements for EFhd2’s calcium binding activity. The results showed that mutations of a conserved aspartate on either EF-hand motif disrupted the calcium binding activity, indicating that these motifs work as a pair to form a functional calcium binding domain. Furthermore, characterization of an identified single-nucleotide polymorphism (SNP) that introduced a missense mutation indicates the importance of a conserved phenylalanine for EFhd2 calcium binding activity. Structural analysis revealed that EFhd2 is predominantly composed of alpha helix and random coil structures and that this novel protein is thermostable. EFhd2’s thermostability depends on its N-terminus. In the absence of the N-terminus, calcium binding restored EFhd2’s thermal stability. Overall, these studies contribute to our understanding of EFhd2 functional and structural properties, and introduce it into the family of canonical EF-hand domain containing proteins. PMID:22973849
Schwessinger, Ron; Suciu, Maria C; McGowan, Simon J; Telenius, Jelena; Taylor, Stephen; Higgs, Doug R; Hughes, Jim R
2017-10-01
In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k-mer-based analysis of DNase footprints to determine any k-mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome. © 2017 Schwessinger et al.; Published by Cold Spring Harbor Laboratory Press.
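The k-mer bookkeeping behind a variant comparison can be sketched as follows. This does not reproduce Sasquatch's footprint scoring; it only shows, under assumed names (`kmers_overlapping`, `variant_kmers`) and a made-up sequence, how the k-mers covering a single-nucleotide variant can be paired between the reference and alternative alleles so that any per-k-mer score could then be compared.

```python
from typing import List, Tuple


def kmers_overlapping(seq: str, pos: int, k: int) -> List[str]:
    """Return every k-mer of `seq` that includes the 0-based position `pos`."""
    start_min = max(0, pos - k + 1)
    start_max = min(pos, len(seq) - k)
    return [seq[s:s + k] for s in range(start_min, start_max + 1)]


def variant_kmers(ref_seq: str, pos: int, alt: str, k: int) -> List[Tuple[str, str]]:
    """Pair each reference k-mer covering `pos` with its alternative-allele counterpart."""
    alt_seq = ref_seq[:pos] + alt + ref_seq[pos + 1:]
    return list(zip(kmers_overlapping(ref_seq, pos, k),
                    kmers_overlapping(alt_seq, pos, k)))


if __name__ == "__main__":
    # Made-up sequence with a T>A substitution at position 3 and k = 3.
    for ref_kmer, alt_kmer in variant_kmers("ACGTACG", 3, "A", 3):
        print(ref_kmer, "->", alt_kmer)
```

A variant in the interior of a sequence is covered by k distinct k-mers, so a SNP changes k scores at once; summarising those paired differences is one way a k-mer-based method can estimate a variant's effect.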
Iakhiaeva, Elena; Iakhiaev, Alexei; Zwieb, Christian
2010-11-13
Human cells depend critically on the signal recognition particle (SRP) for the sorting and delivery of their proteins. The SRP is a ribonucleoprotein complex which binds to signal sequences of secretory polypeptides as they emerge from the ribosome. Among the six proteins of the eukaryotic SRP, the largest protein, SRP72, is essential for protein targeting and possesses a poorly characterized RNA binding domain. We delineated the minimal region of SRP72 capable of forming a stable complex with an SRP RNA fragment. The region encompassed residues 545 to 585 of the full-length human SRP72 and contained a lysine-rich cluster (KKKKKKKKGK) at positions 552 to 561 as well as a conserved Pfam motif with the sequence PDPXRWLPXXER at positions 572 to 583. We demonstrated by site-directed mutagenesis that both regions participated in the formation of a complex with the RNA. In agreement with biochemical data and results from chymotryptic digestion experiments, molecular modeling of SRP72 implied that the invariant W577 was located inside the predicted structure of an RNA binding domain. The 11-nucleotide 5e motif contained within the SRP RNA fragment was shown by comparative electrophoresis on native polyacrylamide gels to conform to an RNA kink-turn. The model of the complex suggested that the conserved A240 of the K-turn, previously identified as being essential for the binding to SRP72, could protrude into a groove of the SRP72 RNA binding domain, similar but not identical to how other K-turn recognizing proteins interact with RNA. The results from the presented experiments provided insights into the molecular details of a functionally important and structurally interesting RNA-protein interaction. A model for how a ligand binding pocket of SRP72 can accommodate a new RNA K-turn in the 5e region of the eukaryotic SRP RNA is proposed.
2010-01-01
Background Human cells depend critically on the signal recognition particle (SRP) for the sorting and delivery of their proteins. The SRP is a ribonucleoprotein complex which binds to signal sequences of secretory polypeptides as they emerge from the ribosome. Among the six proteins of the eukaryotic SRP, the largest protein, SRP72, is essential for protein targeting and possesses a poorly characterized RNA binding domain. Results We delineated the minimal region of SRP72 capable of forming a stable complex with an SRP RNA fragment. The region encompassed residues 545 to 585 of the full-length human SRP72 and contained a lysine-rich cluster (KKKKKKKKGK) at positions 552 to 561 as well as a conserved Pfam motif with the sequence PDPXRWLPXXER at positions 572 to 583. We demonstrated by site-directed mutagenesis that both regions participated in the formation of a complex with the RNA. In agreement with biochemical data and results from chymotryptic digestion experiments, molecular modeling of SRP72 implied that the invariant W577 was located inside the predicted structure of an RNA binding domain. The 11-nucleotide 5e motif contained within the SRP RNA fragment was shown by comparative electrophoresis on native polyacrylamide gels to conform to an RNA kink-turn. The model of the complex suggested that the conserved A240 of the K-turn, previously identified as being essential for the binding to SRP72, could protrude into a groove of the SRP72 RNA binding domain, similar but not identical to how other K-turn recognizing proteins interact with RNA. Conclusions The results from the presented experiments provided insights into the molecular details of a functionally important and structurally interesting RNA-protein interaction. A model for how a ligand binding pocket of SRP72 can accommodate a new RNA K-turn in the 5e region of the eukaryotic SRP RNA is proposed. PMID:21073748
Cleator, John H; Wells, Christopher A; Dingus, Jane; Kurtz, David T; Hildebrandt, John D
2018-05-01
Ser54 of Gsα binds guanine nucleotide and Mg2+ as part of a conserved sequence motif in GTP binding proteins. Mutating the homologous residue in small and heterotrimeric G proteins generates dominant-negative proteins, but by protein-specific mechanisms. For αi/o, this results from persistent binding of α to βγ, whereas for small GTP binding proteins and αs this results from persistent binding to guanine nucleotide exchange factor or receptor. This work examined the role of βγ interactions in mediating the properties of the Ser54-like mutants of Gα subunits. Unexpectedly, WT-αs or N54-αs coexpressed with α1B-adrenergic receptor in human embryonic kidney 293 cells decreased receptor stimulation of IP3 production by a cAMP-independent mechanism, but WT-αs was more effective than the mutant. One explanation for this result would be that αs, like Ser47 αi/o, blocks receptor activation by sequestering βγ, implying that N54-αs has reduced affinity for βγ since it was less effective at blocking IP3 production. This possibility was more directly supported by the observation that WT-αs was more effective than the mutant in inhibiting βγ activation of phospholipase Cβ2. Further, in vitro synthesized N54-αs bound biotinylated-βγ with lower apparent affinity than did WT-αs. The Cys54 mutation also decreased βγ binding but less effectively than N54-αs. Substitution of the conserved Ser in αo with Cys or Asn increased βγ binding, with the Cys mutant being more effective. This suggests that Ser54 of αs is involved in coupling changes in nucleotide binding with altered subunit interactions, and has important implications for how receptors activate G proteins. Copyright © 2018 by The American Society for Pharmacology and Experimental Therapeutics.
Wang, Hsiu-Yu; Chang, Hao-Teng; Pai, Tun-Wen; Wu, Chung-I; Lee, Yuan-Hung; Chang, Yen-Hsin; Tai, Hsiu-Ling; Tang, Chuan-Yi; Chou, Wei-Yao; Chang, Margaret Dah-Tsyr
2007-01-01
Background Human eosinophil-derived neurotoxin (edn) and eosinophil cationic protein (ecp) are members of a subfamily of primate ribonuclease (rnase) genes. Although they were generated by a gene duplication event, distinct edn and ecp expression profiles in various tissues have been reported. Results In this study, we obtained the upstream promoter sequences of several representative primate eosinophil rnases. Bioinformatic analysis revealed the presence of a shared 34-nucleotide (nt) sequence stretch located at -81 to -48 in all edn promoters and the macaque ecp promoter. Such a unique sequence motif constituted a region essential for transactivation of human edn in hepatocellular carcinoma cells. Gel electrophoretic mobility shift assay, transient transfection and scanning mutagenesis experiments allowed us to identify binding sites for two transcription factors, Myc-associated zinc finger protein (MAZ) and SV-40 protein-1 (Sp1), within the 34-nt segment. Subsequent in vitro and in vivo binding assays demonstrated a direct molecular interaction between this 34-nt region and MAZ and Sp1. Interestingly, overexpression of MAZ and Sp1 respectively repressed and enhanced edn promoter activity. The regulatory transactivation motif was mapped to the evolutionarily conserved -74/-65 region of the edn promoter, which was guanine-rich and critical for recognition by both transcription factors. Conclusion Our results provide the first direct evidence that MAZ and Sp1 play important roles in the transcriptional activation of the human edn promoter through specific binding to a 34-nt segment present in representative primate eosinophil rnase promoters. PMID:17927842
Esmaeili, Rezvan; Abdoli, Nasrin; Yadegari, Fatemeh; Neishaboury, Mohamadreza; Farahmand, Leila; Kaviani, Ahmad; Majidzadeh-A, Keivan
2018-01-01
CD44, encoded by a single gene, is a cell surface transmembrane glycoprotein. Exon 2 is one of the exons important for binding of the CD44 protein to hyaluronan. Experimental evidence shows that the hyaluronan-CD44 interaction intensifies the proliferation, migration, and invasion of breast cancer cells. Therefore, the current study aimed at investigating the association between specific polymorphisms in exon 2 of CD44 and its flanking region with predisposition to breast cancer. In the current study, 175 Iranian female patients with breast cancer and 175 age-matched healthy controls were recruited from the biobank of the Breast Cancer Research Center, Tehran, Iran. Single nucleotide polymorphisms of CD44 exon 2 and its flanking region were analyzed via polymerase chain reaction and gene sequencing techniques. The association of the observed variation with breast cancer risk and clinico-pathological characteristics was studied. Subsequently, bioinformatics analysis was conducted to predict potential exonic splicing enhancer (ESE) motifs changed as the result of a mutation. A unique polymorphism of the gene encoding CD44 was identified 14 nucleotides upstream of exon 2 (A37692→G) by the sequencing method. The A > G polymorphism exhibited a significant association with higher grades of breast cancer, although no significant relation was found between this polymorphism and breast cancer risk. Finally, computational analysis revealed that the intronic mutation generated a new consensus-binding motif for the splicing factor SC35 within intron 1. The current study results indicated that the A > G polymorphism was associated with breast cancer development; in addition, in silico analysis with ESE finder prediction software showed that the change created a new SC35 binding site.
Molecular insights into the recruitment of TFIIH to sites of DNA damage
Oksenych, Valentyn; de Jesus, Bruno Bernardes; Zhovmer, Alexander; Egly, Jean-Marc; Coin, Frédéric
2009-01-01
XPB and XPD subunits of TFIIH are central genome caretakers involved in nucleotide excision repair (NER), although their respective role within this DNA repair pathway remains difficult to delineate. To obtain insight into the function of XPB and XPD, we studied cell lines expressing XPB or XPD ATPase-deficient complexes. We show the involvement of XPB, but not XPD, in the accumulation of TFIIH to sites of DNA damage. Recruitment of TFIIH occurs independently of the helicase activity of XPB, but requires two recently identified motifs, a R-E-D residue loop and a Thumb-like domain. Furthermore, we show that these motifs are specifically involved in the DNA-induced stimulation of the ATPase activity of XPB. Together, our data demonstrate that the recruitment of TFIIH to sites of damage is an active process, under the control of the ATPase motifs of XPB and suggest that this subunit functions as an ATP-driven hook to stabilize the binding of the TFIIH to damaged DNA. PMID:19713942
Oswald, Christine; Jenewein, Stefan; Smits, Sander H J; Holland, I Barry; Schmitt, Lutz
2008-04-01
TNP-modified nucleotides have been used extensively to study protein-nucleotide interactions. In the case of ABC-ATPases, application of these powerful tools has been greatly restricted due to the significantly higher affinity of the TNP-nucleotide for the corresponding ABC-ATPase in comparison to the non-modified nucleotides. To understand the molecular changes occurring upon binding of the TNP-nucleotide to an ABC-ATPase, we have determined the crystal structure of the TNP-ADP/HlyB-NBD complex at 1.6A resolution. Despite the higher affinity of TNP-ADP, no direct fluorophore-protein interactions were observed. Unexpectedly, only water-mediated interactions were detected between the TNP moiety and Tyr(477), that is engaged in pi-pi stacking with the adenine ring, as well as with two serine residues (Ser(504) and Ser(509)) of the Walker A motif. Interestingly, the side chains of these two serine residues adopt novel conformations that are not observed in the corresponding ADP structure. However, in the crystal structure of the S504A mutant, which binds TNP-ADP with similar affinity to the wild type enzyme, a novel TNP-water interaction compensates for the missing serine side chain. Since this water molecule is not present in the wild type enzyme, these results suggest that only water-mediated interactions provide a structural explanation for the increased affinity of TNP-nucleotides towards ABC-ATPases. However, our results also imply that in silico approaches such as docking or modeling cannot directly be applied to generate 'affinity-adopted' ADP- or ATP-analogs for ABC-ATPases.
Structural and Biochemical Determinants of Ligand Binding by the c-di-GMP Riboswitch
DOE Office of Scientific and Technical Information (OSTI.GOV)
Smith, K.; Lipchock, S; Livingston,
2010-01-01
The bacterial second messenger c-di-GMP is used in many species to control essential processes that allow the organism to adapt to its environment. The c-di-GMP riboswitch (GEMM) is an important downstream target in this signaling pathway and alters gene expression in response to changing concentrations of c-di-GMP. The riboswitch selectively recognizes its second messenger ligand primarily through contacts with two critical nucleotides. However, these two nucleotides are not the most highly conserved residues within the riboswitch sequence. Instead, nucleotides that stack with c-di-GMP and that form tertiary RNA contacts are the most invariant. Biochemical and structural evidence reveals that the most common natural variants are able to make alternative pairing interactions with both guanine bases of the ligand. Additionally, a high-resolution (2.3 Å) crystal structure of the native complex reveals that a single metal coordinates the c-di-GMP backbone. Evidence is also provided that after transcription of the first nucleotide on the 3′-side of the P1 helix, which is predicted to be the molecular switch, the aptamer is functional for ligand binding. Although large energetic effects occur when several residues in the RNA are altered, mutations at the most conserved positions, rather than at positions that base pair with c-di-GMP, have the most detrimental effects on binding. Many mutants retain sufficient c-di-GMP affinity for the RNA to remain biologically relevant, which suggests that this motif is quite resilient to mutation.
The BaMM web server for de-novo motif discovery and regulatory sequence analysis.
Kiesel, Anja; Roth, Christian; Ge, Wanwan; Wess, Maximilian; Meier, Markus; Söding, Johannes
2018-05-28
The BaMM web server offers four tools: (i) de-novo discovery of enriched motifs in a set of nucleotide sequences, (ii) scanning a set of nucleotide sequences with motifs to find motif occurrences, (iii) searching with an input motif for similar motifs in our BaMM database with motifs for >1000 transcription factors, trained from the GTRD ChIP-seq database and (iv) browsing and keyword searching the motif database. In contrast to most other servers, we represent sequence motifs not by position weight matrices (PWMs) but by Bayesian Markov Models (BaMMs) of order 4, which we showed previously to perform substantially better in ROC analyses than PWMs or first order models. To address the inadequacy of P- and E-values as measures of motif quality, we introduce the AvRec score, the average recall over the TP-to-FP ratio between 1 and 100. The BaMM server is freely accessible without registration at https://bammmotif.mpibpc.mpg.de.
Ferrer-Orta, Cristina; de la Higuera, Ignacio; Caridi, Flavia; Sánchez-Aparicio, María Teresa; Moreno, Elena; Perales, Celia; Singh, Kamalendra; Sarafianos, Stefan G; Sobrino, Francisco; Domingo, Esteban; Verdaguer, Nuria
2015-07-01
The N-terminal region of the foot-and-mouth disease virus (FMDV) 3D polymerase contains the sequence MRKTKLAPT (residues 16 to 24) that acts as a nuclear localization signal. A previous study showed that substitutions K18E and K20E diminished the transport to the nucleus of 3D and 3CD and severely impaired virus infectivity. These residues have also been implicated in template binding, as seen in the crystal structures of different 3D-RNA elongation complexes. Here, we report the biochemical and structural characterization of different mutant polymerases harboring substitutions at residues 18 and 20, in particular, K18E, K18A, K20E, K20A, and the double mutant K18A K20A (KAKA). All mutant enzymes exhibit low RNA binding activity, low processivity, and alterations in nucleotide recognition, including increased incorporation of ribavirin monophosphate (RMP) relative to the incorporation of cognate nucleotides compared with the wild-type enzyme. The structural analysis shows an unprecedented flexibility of the 3D mutant polymerases, including both global rearrangements of the closed-hand architecture and local conformational changes at loop β9-α11 (within the polymerase motif B) and at the template-binding channel. Specifically, in 3D bound to RNA, both K18E and K20E induced the opening of new pockets in the template channel where the downstream templating nucleotide at position +2 binds. The comparisons of free and RNA-bound enzymes suggest that the structural rearrangements may occur in a concerted mode to regulate RNA replication, processivity, and fidelity. Thus, the N-terminal region of FMDV 3D that acts as a nuclear localization signal (NLS) and in template binding is also involved in nucleotide recognition and can affect the incorporation of nucleotide analogues. The study documents multifunctionality of a nuclear localization signal (NLS) located at the N-terminal region of the foot-and-mouth disease viral polymerase (3D). 
Amino acid substitutions at this polymerase region can impair the transport of 3D to the nucleus, reduce 3D binding to RNA, and alter the relative incorporation of standard nucleoside monophosphate versus ribavirin monophosphate. Structural data reveal that the conformational changes in this region, forming part of the template channel entry, would be involved in nucleotide discrimination. The results have implications for the understanding of viral polymerase function and for lethal mutagenesis mechanisms. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Ferrer-Orta, Cristina; de la Higuera, Ignacio; Caridi, Flavia; Sánchez-Aparicio, María Teresa; Moreno, Elena; Perales, Celia; Singh, Kamalendra; Sarafianos, Stefan G.; Sobrino, Francisco; Domingo, Esteban
2015-01-01
ABSTRACT The N-terminal region of the foot-and-mouth disease virus (FMDV) 3D polymerase contains the sequence MRKTKLAPT (residues 16 to 24) that acts as a nuclear localization signal. A previous study showed that substitutions K18E and K20E diminished the transport to the nucleus of 3D and 3CD and severely impaired virus infectivity. These residues have also been implicated in template binding, as seen in the crystal structures of different 3D-RNA elongation complexes. Here, we report the biochemical and structural characterization of different mutant polymerases harboring substitutions at residues 18 and 20, in particular, K18E, K18A, K20E, K20A, and the double mutant K18A K20A (KAKA). All mutant enzymes exhibit low RNA binding activity, low processivity, and alterations in nucleotide recognition, including increased incorporation of ribavirin monophosphate (RMP) relative to the incorporation of cognate nucleotides compared with the wild-type enzyme. The structural analysis shows an unprecedented flexibility of the 3D mutant polymerases, including both global rearrangements of the closed-hand architecture and local conformational changes at loop β9-α11 (within the polymerase motif B) and at the template-binding channel. Specifically, in 3D bound to RNA, both K18E and K20E induced the opening of new pockets in the template channel where the downstream templating nucleotide at position +2 binds. The comparisons of free and RNA-bound enzymes suggest that the structural rearrangements may occur in a concerted mode to regulate RNA replication, processivity, and fidelity. Thus, the N-terminal region of FMDV 3D that acts as a nuclear localization signal (NLS) and in template binding is also involved in nucleotide recognition and can affect the incorporation of nucleotide analogues. IMPORTANCE The study documents multifunctionality of a nuclear localization signal (NLS) located at the N-terminal region of the foot-and-mouth disease viral polymerase (3D). 
Amino acid substitutions at this polymerase region can impair the transport of 3D to the nucleus, reduce 3D binding to RNA, and alter the relative incorporation of standard nucleoside monophosphate versus ribavirin monophosphate. Structural data reveal that the conformational changes in this region, forming part of the template channel entry, would be involved in nucleotide discrimination. The results have implications for the understanding of viral polymerase function and for lethal mutagenesis mechanisms. PMID:25903341
GBshape: a genome browser database for DNA shape annotations.
Chiu, Tsu-Pei; Yang, Lin; Zhou, Tianyin; Main, Bradley J; Parker, Stephen C J; Nuzhdin, Sergey V; Tullius, Thomas D; Rohs, Remo
2015-01-01
Many regulatory mechanisms require a high degree of specificity in protein-DNA binding. Nucleotide sequence does not provide an answer to the question of why a protein binds only to a small subset of the many putative binding sites in the genome that share the same core motif. Whereas higher-order effects, such as chromatin accessibility, cooperativity and cofactors, have been described, DNA shape recently gained attention as another feature that fine-tunes the DNA binding specificities of some transcription factor families. Our Genome Browser for DNA shape annotations (GBshape; freely available at http://rohslab.cmb.usc.edu/GBshape/) provides minor groove width, propeller twist, roll, helix twist and hydroxyl radical cleavage predictions for the entire genomes of 94 organisms. Additional genomes can easily be added using the GBshape framework. GBshape can be used to visualize DNA shape annotations qualitatively in a genome browser track format, and to download quantitative values of DNA shape features as a function of genomic position at nucleotide resolution. As biological applications, we illustrate the periodicity of DNA shape features that are present in nucleosome-occupied sequences from human, fly and worm, and we demonstrate structural similarities between transcription start sites in the genomes of four Drosophila species. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Nawaz, Zarqa; Kakar, Kaleem Ullah; Saand, Mumtaz A; Shu, Qing-Yao
2014-10-04
Cyclic nucleotide-gated channels (CNGCs) are Ca2+-permeable cation transport channels, which are present in both animal and plant systems. They have been implicated in the uptake of both essential and toxic cations, Ca2+ signaling, pathogen defense, and thermotolerance in plants. To date there has not been a genome-wide overview of the CNGC gene family in any economically important crop, including rice (Oryza sativa L.). There is an urgent need for a thorough genome-wide analysis and experimental verification of this gene family in rice. In this study, a total of 16 full length rice CNGC genes distributed on chromosomes 1-6, 9 and 12, were identified by employing comprehensive bioinformatics analyses. Based on phylogeny, the family of OsCNGCs was classified into four major groups (I-IV) and two sub-groups (IV-A and IV- B). Likewise, the CNGCs from all plant lineages clustered into four groups (I-IV), where group II was conserved in all land plants. Gene duplication analysis revealed that both chromosomal segmentation (OsCNGC1 and 2, 10 and 11, 15 and 16) and tandem duplications (OsCNGC1 and 2) significantly contributed to the expansion of this gene family. Motif composition and protein sequence analysis revealed that the CNGC specific domain "cyclic nucleotide-binding domain (CNBD)" comprises a "phosphate binding cassette" (PBC) and a "hinge" region that is highly conserved among the OsCNGCs. In addition, OsCNGC proteins also contain various other functional motifs and post-translational modification sites. We successively built a stringent motif: (LI-X(2)-[GS]-X-[FV]-X-G-[1]-ELL-X-W-X(12,22)-SA-X(2)-T-X(7)-[EQ]-AF-X-L) that recognizes the rice CNGCs specifically. 
Prediction of cis-acting regulatory elements in 5' upstream sequences and expression analyses through quantitative qPCR demonstrated that OsCNGC genes were highly responsive to multiple stimuli including hormonal (abscisic acid, indoleacetic acid, kinetin and ethylene), biotic (Pseudomonas fuscovaginae and Xanthomonas oryzae pv. oryzae) and abiotic (cold) stress. There are 16 CNGC genes in rice, which were probably expanded through chromosomal segmentation and tandem duplications and comprise a PBC and a "hinge" region in the CNBD domain, featured by a stringent motif. The various cis-acting regulatory elements in the upstream sequences may be responsible for responding to multiple stimuli, including hormonal, biotic and abiotic stresses.
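The stringent motif quoted in the abstract above is written in a PROSITE-like notation, which maps fairly directly onto a regular expression. The following is a minimal sketch of that translation, not the authors' tool: the function name `prosite_to_regex` and the toy sequence are hypothetical, and only the fragment of the motif from `ELL` onward is used because the `[1]` element in the quoted pattern is unclear as printed and is not guessed at here.

```python
import re

def prosite_to_regex(pattern: str) -> str:
    """Translate a dash-separated, PROSITE-like pattern into a Python regex.

    X        -> any single residue
    X(n)     -> exactly n residues
    X(n,m)   -> between n and m residues
    anything else (literal residues, [..] classes) is passed through unchanged.
    """
    parts = []
    for element in pattern.split("-"):
        m = re.fullmatch(r"X(?:\((\d+)(?:,(\d+))?\))?", element)
        if m is None:
            parts.append(element)            # literal residues or a [..] class
        elif m.group(1) is None:
            parts.append(".")                # bare X
        elif m.group(2) is None:
            parts.append(".{%s}" % m.group(1))
        else:
            parts.append(".{%s,%s}" % (m.group(1), m.group(2)))
    return "".join(parts)

# Fragment of the motif as printed in the abstract (the part after the unclear "[1]").
FRAGMENT = "ELL-X-W-X(12,22)-SA-X(2)-T-X(7)-[EQ]-AF-X-L"
regex = prosite_to_regex(FRAGMENT)

# Hypothetical toy protein sequence built only to exercise the pattern.
toy = "MK" + "ELL" + "A" + "W" + "G" * 15 + "SA" + "GG" + "T" + "G" * 7 + "E" + "AF" + "G" + "L" + "KK"
hit = re.search(regex, toy)
```

In this sketch a variable-length gap such as `X(12,22)` becomes a bounded quantifier, so a candidate sequence with a shorter spacer between `W` and `SA` would simply fail to match.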
Structure-Templated Predictions of Novel Protein Interactions from Sequence Information
Betel, Doron; Breitkreuz, Kevin E; Isserlin, Ruth; Dewar-Darch, Danielle; Tyers, Mike; Hogue, Christopher W. V
2007-01-01
The multitude of functions performed in the cell are largely controlled by a set of carefully orchestrated protein interactions often facilitated by specific binding of conserved domains in the interacting proteins. Interacting domains commonly exhibit distinct binding specificity to short and conserved recognition peptides called binding profiles. Although many conserved domains are known in nature, only a few have well-characterized binding profiles. Here, we describe a novel predictive method known as domain–motif interactions from structural topology (D-MIST) for elucidating the binding profiles of interacting domains. A set of domains and their corresponding binding profiles were derived from extant protein structures and protein interaction data and then used to predict novel protein interactions in yeast. A number of the predicted interactions were verified experimentally, including new interactions of the mitotic exit network, RNA polymerases, nucleotide metabolism enzymes, and the chaperone complex. These results demonstrate that new protein interactions can be predicted exclusively from sequence information. PMID:17892321
MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.
Ozaki, Haruka; Iwasaki, Wataru
2016-08-01
As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.
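The idea of describing every k-mer in ChIP-Seq-derived sequences can be illustrated with a small enumeration. The sketch below is not the MOCCS algorithm (which is implemented in Perl and R at the URL given above); it is a simplified stand-in that assumes sequences are windows centered on peak summits and reports, for each k-mer, how often it occurs and how close to the window center it lies on average. The function name `kmer_center_stats` and the three toy windows are invented for illustration.

```python
from collections import defaultdict

def kmer_center_stats(sequences, k):
    """Return {kmer: (count, mean absolute distance from the window center)}.

    Assumes each sequence is a window centered on a ChIP-Seq peak, so k-mers
    that are truly bound are expected to sit near the center more often.
    """
    distances = defaultdict(list)
    for seq in sequences:
        center = (len(seq) - 1) / 2.0
        for start in range(len(seq) - k + 1):
            kmer = seq[start:start + k]
            kmer_mid = start + (k - 1) / 2.0   # midpoint of this k-mer occurrence
            distances[kmer].append(abs(kmer_mid - center))
    return {kmer: (len(d), sum(d) / len(d)) for kmer, d in distances.items()}

# Hypothetical toy windows (11 nt each) that share a central 4-mer.
windows = ["ACGTTAATGCA", "GGCATAATCCG", "TTGCTAATAGC"]
stats = kmer_center_stats(windows, 4)
count, mean_dist = stats["TAAT"]
```

Ranking k-mers by a combination of count and closeness to the center gives a rough per-k-mer view that does not depend on a single consensus matrix, which is the kind of question the abstract says position weight matrix models leave open.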
Footprinting of Chlorella virus DNA ligase bound at a nick in duplex DNA.
Odell, M; Shuman, S
1999-05-14
The 298-amino acid ATP-dependent DNA ligase of Chlorella virus PBCV-1 is the smallest eukaryotic DNA ligase known. The enzyme has intrinsic specificity for binding to nicked duplex DNA. To delineate the ligase-DNA interface, we have footprinted the enzyme binding site on DNA and the DNA binding site on ligase. The size of the exonuclease III footprint of ligase bound at a single nick in duplex DNA is 19-21 nucleotides. The footprint is asymmetric, extending 8-9 nucleotides on the 3'-OH side of the nick and 11-12 nucleotides on the 5'-phosphate side. The 5'-phosphate moiety is essential for the binding of Chlorella virus ligase to nicked DNA. Here we show that the 3'-OH moiety is not required for nick recognition. The Chlorella virus ligase binds to a nicked ligand containing 2',3'-dideoxy and 5'-phosphate termini, but cannot catalyze adenylation of the 5'-end. Hence, the 3'-OH is important for step 2 chemistry even though it is not itself chemically transformed during DNA-adenylate formation. A 2'-OH cannot substitute for the essential 3'-OH in adenylation at a nick or even in strand closure at a preadenylated nick. The protein side of the ligase-DNA interface was probed by limited proteolysis of ligase with trypsin and chymotrypsin in the presence and absence of nicked DNA. Protease-accessible sites are clustered within a short segment from amino acids 210-225 located distal to conserved motif V. The ligase is protected from proteolysis by nicked DNA. Protease cleavage of the native enzyme prior to DNA addition results in loss of DNA binding. These results suggest a bipartite domain structure in which the interdomain segment either comprises part of the DNA binding site or undergoes a conformational change upon DNA binding. The domain structure of Chlorella virus ligase inferred from the solution experiments is consistent with the structure of T7 DNA ligase determined by x-ray crystallography.
NASA Astrophysics Data System (ADS)
Raab, Monika; Cai, Yun-Cai; Bunnell, Stephen C.; Heyeck, Stephanie D.; Berg, Leslie J.; Rudd, Christopher E.
1995-09-01
T-cell activation requires cooperative signals generated by the T-cell antigen receptor ζ-chain complex (TCRζ-CD3) and the costimulatory antigen CD28. CD28 interacts with three intracellular proteins: phosphatidylinositol 3-kinase (PI 3-kinase), T cell-specific protein-tyrosine kinase ITK (formerly TSK or EMT), and the complex between growth factor receptor-bound protein 2 and son of sevenless guanine nucleotide exchange protein (GRB-2-SOS). PI 3-kinase and GRB-2 bind to the CD28 phosphotyrosine-based Tyr-Met-Asn-Met motif by means of intrinsic Src-homology 2 (SH2) domains. The requirement for tyrosine phosphorylation of the Tyr-Met-Asn-Met motif for SH2 domain binding implicates an intervening protein-tyrosine kinase in the recruitment of PI 3-kinase and GRB-2 by CD28. Candidate kinases include p56Lck, p59Fyn, ζ-chain-associated 70-kDa protein (ZAP-70), and ITK. In this study, we demonstrate in coexpression studies that p56Lck and p59Fyn phosphorylate CD28 primarily at Tyr-191 of the Tyr-Met-Asn-Met motif, inducing a 3- to 8-fold increase in p85 (subunit of PI 3-kinase) and GRB-2 SH2 binding to CD28. Phosphatase digestion of CD28 eliminated binding. In contrast to Src kinases, ZAP-70 and ITK failed to induce these events. Further, ITK binding to CD28 was dependent on the presence of p56Lck, and ITK is thus likely to act downstream of p56Lck/p59Fyn in a signaling cascade. p56Lck is therefore likely to be a central switch in T-cell activation, with the dual function of regulating CD28-mediated costimulation as well as TCR-CD3-CD4 signaling.
The UNG2 Arg88Cys variant abrogates RPA-mediated recruitment of UNG2 to single-stranded DNA.
Torseth, Kathrin; Doseth, Berit; Hagen, Lars; Olaisen, Camilla; Liabakk, Nina-Beate; Græsmann, Heidi; Durandy, Anne; Otterlei, Marit; Krokan, Hans E; Kavli, Bodil; Slupphaug, Geir
2012-06-01
In human cell nuclei, UNG2 is the major uracil-DNA glycosylase initiating DNA base excision repair of uracil. In activated B cells it has an additional role in facilitating mutagenic processing of AID-induced uracil at Ig loci and UNG-deficient patients develop hyper-IgM syndrome characterized by impaired class-switch recombination and disturbed somatic hypermutation. How UNG2 is recruited to either error-free or mutagenic uracil processing remains obscure, but likely involves regulated interactions with other proteins. The UNG2 N-terminal domain contains binding motifs for both proliferating cell nuclear antigen (PCNA) and replication protein A (RPA), but the relative contribution of these interactions to genomic uracil processing is not understood. Interestingly, a heterozygous germline single-nucleotide variant leading to Arg88Cys (R88C) substitution in the RPA-interaction motif of UNG2 has been observed in humans, but with unknown functional relevance. Here we demonstrate that UNG2-R88C protein is expressed from the variant allele in a lymphoblastoid cell line derived from a heterozygous germ line carrier. Enzyme activity as well as localization in replication foci of UNG2-R88C was similar to that of WT. However, binding to RPA was essentially abolished by the R88C substitution, whereas binding to PCNA was unaffected. Moreover, we show that disruption of the PCNA-binding motif impaired recruitment of UNG2 to S-phase replication foci, demonstrating that PCNA is a major factor for recruitment of UNG2 to unperturbed replication forks. Conversely, in cells treated with hydroxyurea, RPA mediated recruitment of UNG2 to stalled replication forks independently of functional PCNA binding. Modulation of PCNA- versus RPA-binding may thus constitute a functional switch for UNG2 in cells subsequent to genotoxic stress and potentially also during the processing of uracil at the immunoglobulin locus in antigen-stimulated B cells. Copyright © 2012 Elsevier B.V. 
Liu, Q; Astell, C R
1996-10-01
During replication of the minute virus of mice (MVM) genome, a dimer replicative form (RF) intermediate is resolved into two monomer RF molecules in such a way as to retain a unique sequence within the left hand hairpin terminus of the viral genome. Although the proposed mechanism for resolution of the dimer RF remains uncertain, it likely involves site-specific nicking of the dimer bridge. The RF contains two double-stranded copies of the viral genome joined by the extended 3' hairpin. Minor sequence asymmetries within the 3' hairpin allow the two halves of the dimer bridge to be distinguished. The A half contains the sequence [sequence: see text], whereas the B half contains the sequence [sequence: see text]. Using an in vitro assay, we show that only the B half of the MVM dimer bridge is nicked site-specifically when incubated with crude NS-1 protein (expressed in insect cells) and mouse LA9 cellular extract. When highly purified NS-1, the major nonstructural protein of MVM, is used in this nicking reaction, there is an absolute requirement for the LA9 cellular extract, suggesting a cellular factor (or factors) is (are) required. A series of mutations were created in the putative host factor binding region (HFBR) on the B half of the MVM dimer bridge adjacent to the NS-1 binding site. Nicking assays of these B half mutants showed that two CG motifs displaced by 10 nucleotides are important for nicking. Gel mobility shift assays demonstrated that a host factor(s) can bind to the HFBR of the B half of the dimer bridge and efficient binding depends on the presence of both CG motifs. Competitor DNA containing the wild-type HFBR sequence is able to specifically inhibit nicking of the B half, indicating that the host factor(s) bound to the HFBR is(are) essential for site-specific nicking to occur.
The PUF binding landscape in metazoan germ cells
Prasad, Aman; Porter, Douglas F.; Kroll-Conner, Peggy L.; Mohanty, Ipsita; Ryan, Anne R.; Crittenden, Sarah L.; Wickens, Marvin; Kimble, Judith
2016-01-01
PUF (Pumilio/FBF) proteins are RNA-binding proteins and conserved stem cell regulators. The Caenorhabditis elegans PUF proteins FBF-1 and FBF-2 (collectively FBF) regulate mRNAs in germ cells. Without FBF, adult germlines lose all stem cells. A major gap in our understanding of PUF proteins, including FBF, is a global view of their binding sites in their native context (i.e., their “binding landscape”). To understand the interactions underlying FBF function, we used iCLIP (individual-nucleotide resolution UV crosslinking and immunoprecipitation) to determine binding landscapes of C. elegans FBF-1 and FBF-2 in the germline tissue of intact animals. Multiple iCLIP peak-calling methods were compared to maximize identification of both established FBF binding sites and positive control target mRNAs in our iCLIP data. We discovered that FBF-1 and FBF-2 bind to RNAs through canonical as well as alternate motifs. We also analyzed crosslinking-induced mutations to map binding sites precisely and to identify key nucleotides that may be critical for FBF–RNA interactions. FBF-1 and FBF-2 can bind sites in the 5′UTR, coding region, or 3′UTR, but have a strong bias for the 3′ end of transcripts. FBF-1 and FBF-2 have strongly overlapping target profiles, including mRNAs and noncoding RNAs. From a statistically robust list of 1404 common FBF targets, 847 were previously unknown, 154 were related to cell cycle regulation, three were lincRNAs, and 335 were shared with the human PUF protein PUM2. PMID:27165521
Kumar, Amit; Parkesh, Raman; Sznajder, Lukasz J; Childs-Disney, Jessica L; Sobczak, Krzysztof; Disney, Matthew D
2012-03-16
Recently, it was reported that expanded r(CAG) triplet repeats (r(CAG)(exp)) associated with untreatable neurological diseases cause pre-mRNA mis-splicing likely due to sequestration of muscleblind-like 1 (MBNL1) splicing factor. Bioactive small molecules that bind the 5'CAG/3'GAC motif found in r(CAG)(exp) hairpin structure were identified by using RNA binding studies and virtual screening/chemical similarity searching. Specifically, a benzylguanidine-containing small molecule was found to improve pre-mRNA alternative splicing of MBNL1-sensitive exons in cells expressing the toxic r(CAG)(exp). The compound was identified by first studying the binding of RNA 1 × 1 nucleotide internal loops to small molecules known to have affinity for nucleic acids. Those studies identified 4',6-diamidino-2-phenylindole (DAPI) as a specific binder to RNAs with the 5'CAG/3'GAC motif. DAPI was then used as a query molecule in a shape- and chemistry alignment-based virtual screen to identify compounds with improved properties, which identified 4-guanidinophenyl 4-guanidinobenzoate, a small molecule that improves pre-mRNA splicing defects associated with the r(CAG)(exp)-MBNL1 complex. This compound may facilitate the development of therapeutics to treat diseases caused by r(CAG)(exp) and could serve as a useful chemical tool to dissect the mechanisms of r(CAG)(exp) toxicity. The approach used in these studies, defining the small RNA motifs that bind small molecules with known affinity for nucleic acids and then using virtual screening to optimize them for bioactivity, may be generally applicable for designing small molecules that target other RNAs in the human genomic sequence.
Kumar, Amit; Parkesh, Raman; Sznajder, Lukasz J.; Childs-Disney, Jessica; Sobczak, Krzysztof; Disney, Matthew D.
2012-01-01
Recently, it was reported that expanded r(CAG) triplet repeats (r(CAG)exp) associated with untreatable neurological diseases cause pre-mRNA mis-splicing likely due to sequestration of muscleblind-like 1 (MBNL1) splicing factor. Bioactive small molecules that bind the 5’CAG/3’GAC motif found in r(CAG)exp hairpin structure were identified by using RNA binding studies and virtual screening/chemical similarity searching. Specifically, a benzylguanidine-containing small molecule was found to improve pre-mRNA alternative splicing of MBNL1-sensitive exons in cells expressing the toxic r(CAG)exp. The compound was identified by first studying the binding of RNA 1×1 nucleotide internal loops to small molecules known to have affinity for nucleic acids. Those studies identified 4',6-diamidino-2-phenylindole (DAPI) as a specific binder to RNAs with the 5’CAG/3’GAC motif. DAPI was then used as a query molecule in a shape- and chemistry alignment-based virtual screen to identify compounds with improved properties, which identified 4-guanidinophenyl 4-guanidinobenzoate as a small molecule capable of improving pre-mRNA splicing defects associated with the r(CAG)exp-MBNL1 complex. This compound may facilitate the development of therapeutics to treat diseases caused by r(CAG)exp and could serve as a useful chemical tool to dissect the mechanisms of r(CAG)exp toxicity. The approach used in these studies, defining the small RNA motifs that bind known nucleic acid binders and then using virtual screening to optimize them for bioactivity, may be generally applicable for designing small molecules that target other RNAs in the human genomic sequence. PMID:22252896
Leder, Verena; Lummer, Martina; Tegeler, Kathrin; Humpert, Fabian; Lewinski, Martin; Schüttpelz, Mark; Staiger, Dorothee
2014-10-10
Arabidopsis thaliana glycine-rich RNA binding protein 7 (AtGRP7) is part of a negative feedback loop through which it regulates alternative splicing and steady-state abundance of its pre-mRNA. Here we use fluorescence correlation spectroscopy to investigate the requirements for AtGRP7 binding to its intron using fluorescently-labelled synthetic oligonucleotides. By systematically introducing point mutations we identify three nucleotides that lead to an increased Kd value when mutated and thus are critical for AtGRP7 binding. Simultaneous mutation of all three residues abrogates binding. The paralogue AtGRP8 binds to an overlapping motif but with a different sequence preference, in line with overlapping but not identical functions of this protein pair. Truncation of the glycine-rich domain reduces the binding affinity of AtGRP7, showing for the first time that the glycine-rich stretch of a plant hnRNP-like protein contributes to binding. Mutation of the conserved R(49) that is crucial for AtGRP7 function in pathogen defence and splicing abolishes binding. Copyright © 2014 Elsevier Inc. All rights reserved.
Marcu, M G; Chadli, A; Bouhouche, I; Catelli, M; Neckers, L M
2000-11-24
Heat shock protein 90 (Hsp90), one of the most abundant chaperones in eukaryotes, participates in folding and stabilization of signal-transducing molecules including steroid hormone receptors and protein kinases. The amino terminus of Hsp90 contains a non-conventional nucleotide-binding site, related to the ATP-binding motif of bacterial DNA gyrase. The anti-tumor agents geldanamycin and radicicol bind specifically at this site and induce destabilization of Hsp90-dependent client proteins. We recently demonstrated that the gyrase inhibitor novobiocin also interacts with Hsp90, altering the affinity of the chaperone for geldanamycin and radicicol and causing in vitro and in vivo depletion of key regulatory Hsp90-dependent kinases including v-Src, Raf-1, and p185(ErbB2). In the present study we used deletion/mutation analysis to identify the site of interaction of novobiocin with Hsp90, and we demonstrate that the novobiocin-binding site resides in the carboxyl terminus of the chaperone. Surprisingly, this motif also recognizes ATP, and ATP and novobiocin efficiently compete with each other for binding to this region of Hsp90. Novobiocin interferes with association of the co-chaperones Hsc70 and p23 with Hsp90. These results identify a second site on Hsp90 where the binding of small molecule inhibitors can significantly impact the function of this chaperone, and they support the hypothesis that both amino- and carboxyl-terminal domains of Hsp90 interact to modulate chaperone activity.
NASA Astrophysics Data System (ADS)
Lawrenz, Morgan E.; Salter, E. A.; Wierzbicki, Andrzej; Thompson, W. J.
Cyclic nucleotide phosphodiesterases (PDEs) comprise a superfamily of enzymes that hydrolyze the second messengers adenosine and guanosine 3',5'-cyclic monophosphate (cAMP and cGMP) to their noncyclic nucleotides (5'-AMP and 5'-GMP). Selective inhibitors of all 11 gene families of PDEs are being sought based on the different biochemical properties of the different isoforms, including their substrate specificities. The PDE4 gene family consists of cAMP-specific isoforms; selective PDE4 inhibitors such as rolipram have been developed, and related agents are used clinically as anti-inflammatory agents for asthma and COPD. The known crystal structures of PDE4 bound with rolipram and IBMX have allowed us to define plausible binding orientations for a novel class of benzylpyridazinone-based PDE4 inhibitors represented by EMD 94360 and EMD 95832 that are structurally distinct from rolipram. Molecular mechanics modeling with autodocking is used to explore energetically favorable binding orientations within the PDE4 catalytic site. We present two putative orientations for EMD 94360/95832 inhibitor binding. Our estimated interaction energies for rolipram, IBMX, EMD 94360, and EMD 95832 are consistent with the experimental data for their IC50 values. Key binding residues and interactions in these orientations are identified and compared with known binding motifs proposed for rolipram. The experimentally observed improved strength of inhibition exhibited by this novel class of PDE4 inhibitors is explained by the molecular modeling reported here.
Bahramnejad, Bahman
2014-01-01
P. atlantica subsp. Kurdica, with the local name of Baneh, is a wild medicinal plant which grows in Kurdistan, Iran. The identification of resistance gene analogs holds great promise for the development of resistant cultivars. A PCR approach with degenerate primers designed according to conserved NBS-LRR (nucleotide binding site-leucine rich repeat) regions of known disease-resistance (R) genes was used to amplify and clone homologous sequences from P. atlantica subsp. Kurdica. A DNA fragment of the expected 500-bp size was amplified. The amplicon was sequenced, and comparison of its predicted amino acid sequence with the amino acid sequences of known R-genes revealed significant sequence similarity. Alignment of the deduced amino acid sequence of the P. atlantica subsp. Kurdica resistance gene analog (RGA) showed strong identity, ranging from 68% to 77%, to the non-toll interleukin receptor (non-TIR) R-gene subfamily from other plants. A P-loop motif (GMMGGEGKTT), a conserved hydrophobic motif (GLPLAL), a kinase-2a motif (LLVLDDV, replaced by IAVFDDI in PAKRGA1) and a kinase-3a motif (FGPGSRIII) were present in all RGAs. A phylogenetic tree based on the deduced amino acid sequences of PAKRGA1 and RGAs from different species indicated that they separated into two clusters, with PAKRGA1 in cluster II. The isolated NBS analogs may eventually be used as guidelines to isolate numerous R-genes in pistachio. PMID:27843981
Molecular principles underlying dual RNA specificity in the Drosophila SNF protein.
Weber, Gert; DeKoster, Gregory T; Holton, Nicole; Hall, Kathleen B; Wahl, Markus C
2018-06-07
The first RNA recognition motif of the Drosophila SNF protein is an example of an RNA binding protein with multi-specificity. It binds different RNA hairpin loops in spliceosomal U1 or U2 small nuclear RNAs, and only in the latter case requires the auxiliary U2A' protein. Here we investigate its functions by crystal structures of SNF alone and bound to U1 stem-loop II, U2A' or U2 stem-loop IV and U2A', SNF dynamics from NMR spectroscopy, and structure-guided mutagenesis in binding studies. We find that different loop-closing base pairs and a nucleotide exchange at the tips of the loops contribute to differential SNF affinity for the RNAs. U2A' immobilizes SNF and RNA residues to restore U2 stem-loop IV binding affinity, while U1 stem-loop II binding does not require such adjustments. Our findings show how U2A' can modulate RNA specificity of SNF without changing SNF conformation or relying on direct RNA contacts.
Fragment-based modelling of single stranded RNA bound to RNA recognition motif containing proteins
de Beauchene, Isaure Chauvot; de Vries, Sjoerd J.; Zacharias, Martin
2016-01-01
Protein-RNA complexes are important for many biological processes. However, structural modeling of such complexes is hampered by the high flexibility of RNA. Particularly challenging is the docking of single-stranded RNA (ssRNA). We have developed a fragment-based approach to model the structure of ssRNA bound to a protein, based on only the protein structure, the RNA sequence and conserved contacts. The conformational diversity of each RNA fragment is sampled by an exhaustive library of trinucleotides extracted from all known experimental protein–RNA complexes. The method was applied to ssRNA with up to 12 nucleotides which bind to dimers of the RNA recognition motifs (RRMs), a highly abundant eukaryotic RNA-binding domain. The fragment-based docking allows a precise de novo atomic modeling of protein-bound ssRNA chains. On a benchmark of seven experimental ssRNA–RRM complexes, near-native models (with a mean heavy-atom deviation of <3 Å from experiment) were generated for six out of seven bound RNA chains, and even more precise models (deviation < 2 Å) were obtained for five out of seven cases, a significant improvement compared to the state of the art. The method is not restricted to RRMs but was also successfully applied to Pumilio RNA binding proteins. PMID:27131381
A study on the application of topic models to motif finding algorithms.
Basha Gutierrez, Josep; Nakai, Kenta
2016-12-22
Topic models are statistical algorithms which try to discover the structure of a set of documents according to the abstract topics contained in them. Here we try to apply this approach to the discovery of the structure of the transcription factor binding sites (TFBS) contained in a set of biological sequences, which is a fundamental problem in molecular biology research for the understanding of transcriptional regulation. Here we present two methods that make use of topic models for motif finding. First, we developed an algorithm in which first a set of biological sequences are treated as text documents, and the k-mers contained in them as words, to then build a correlated topic model (CTM) and iteratively reduce its perplexity. We also used the perplexity measurement of CTMs to improve our previous algorithm based on a genetic algorithm and several statistical coefficients. The algorithms were tested with 56 data sets from four different species and compared to 14 other methods by the use of several coefficients both at nucleotide and site level. The results of our first approach showed a performance comparable to the other methods studied, especially at site level and in sensitivity scores, in which it scored better than any of the 14 existing tools. In the case of our previous algorithm, the new approach with the addition of the perplexity measurement clearly outperformed all of the other methods in sensitivity, both at nucleotide and site level, and in overall performance at site level. The statistics obtained show that the performance of a motif finding method based on the use of a CTM is satisfying enough to conclude that the application of topic models is a valid method for developing motif finding algorithms. Moreover, the addition of topic models to a previously developed method dramatically increased its performance, suggesting that this combined algorithm can be a useful tool to successfully predict motifs in different kinds of sets of DNA sequences.
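The preprocessing idea described in this abstract (sequences as documents, k-mers as words) can be sketched as follows. This is only a minimal illustration under stated assumptions: the function names `kmer_document` and `kmer_count_matrix` and the toy sequences are hypothetical, and the correlated topic model fit and perplexity reduction reported by the authors are not reproduced here.

```python
from collections import Counter

def kmer_document(sequence, k):
    """Return the overlapping k-mers ("words") of one sequence ("document")."""
    seq = sequence.upper()
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]

def kmer_count_matrix(sequences, k):
    """Build a document-term count matrix: one row per sequence, one column per k-mer."""
    docs = [Counter(kmer_document(s, k)) for s in sequences]
    vocabulary = sorted(set().union(*docs))  # all distinct k-mers, in sorted order
    matrix = [[doc[word] for word in vocabulary] for doc in docs]
    return vocabulary, matrix

# Toy input (hypothetical, not data from the study).
sequences = ["ACGTACG", "TTACG"]
vocabulary, matrix = kmer_count_matrix(sequences, k=3)
print(vocabulary)  # ['ACG', 'CGT', 'GTA', 'TAC', 'TTA']
print(matrix)      # [[2, 1, 1, 1, 0], [1, 0, 0, 1, 1]]
```

A count matrix of this shape is the usual input to a topic model library; which library and which value of k the authors used is not stated in the abstract.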
Richardson, Kris; Schnitzler, Gavin R; Lai, Chao-Qiang; Ordovas, Jose M
2015-12-01
Cardiovascular disease and type 2 diabetes mellitus represent overlapping diseases where a large portion of the variation attributable to genetics remains unexplained. An important player in their pathogenesis is peroxisome proliferator-activated receptor γ (PPARγ) that is involved in lipid and glucose metabolism and maintenance of metabolic homeostasis. We used a functional genomics methodology to interrogate human chromatin immunoprecipitation-sequencing, genome-wide association studies, and expression quantitative trait locus data to inform selection of candidate functional single nucleotide polymorphisms (SNPs) falling in PPARγ motifs. We derived 27 328 chromatin immunoprecipitation-sequencing peaks for PPARγ in human adipocytes through meta-analysis of 3 data sets. The PPARγ consensus motif showed greatest enrichment and mapped to 8637 peaks. We identified 146 SNPs in these motifs. This number was significantly less than would be expected by chance, and Inference of Natural Selection from Interspersed Genomically coHerent elemenTs analysis indicated that these motifs are under weak negative selection. A screen of these SNPs against genome-wide association studies for cardiometabolic traits revealed significant enrichment with 16 SNPs. A screen against the MuTHER expression quantitative trait locus data revealed 8 of these were significantly associated with altered gene expression in human adipose, more than would be expected by chance. Several SNPs fall close, or are linked by expression quantitative trait locus to lipid-metabolism loci including CYP26A1. We demonstrated the use of functional genomics to identify SNPs of potential function. Specifically, that SNPs within PPARγ motifs that bind PPARγ in adipocytes are significantly associated with cardiometabolic disease and with the regulation of transcription in adipose. 
This method may be used to uncover functional SNPs that do not reach significance thresholds in the agnostic approach of genome-wide association studies. © 2015 American Heart Association, Inc.
Han, Han; Monroe, Nicole; Votteler, Jörg; Shakya, Binita; Sundquist, Wesley I; Hill, Christopher P
2015-05-22
The endosomal sorting complexes required for transport (ESCRT) pathway drives reverse topology membrane fission events within multiple cellular pathways, including cytokinesis, multivesicular body biogenesis, repair of the plasma membrane, nuclear membrane vesicle formation, and HIV budding. The AAA ATPase Vps4 is recruited to membrane necks shortly before fission, where it catalyzes disassembly of the ESCRT-III lattice. The N-terminal Vps4 microtubule-interacting and trafficking (MIT) domains initially bind the C-terminal MIT-interacting motifs (MIMs) of ESCRT-III subunits, but it is unclear how the enzyme then remodels these substrates in response to ATP hydrolysis. Here, we report quantitative binding studies that demonstrate that residues from helix 5 of the Vps2p subunit of ESCRT-III bind to the central pore of an asymmetric Vps4p hexamer in a manner that is dependent upon the presence of flexible nucleotide analogs that can mimic multiple states in the ATP hydrolysis cycle. We also find that substrate engagement is autoinhibited by the Vps4p MIT domain and that this inhibition is relieved by binding of either Type 1 or Type 2 MIM elements, which bind the Vps4p MIT domain through different interfaces. These observations support the model that Vps4 substrates are initially recruited by an MIM-MIT interaction that activates the Vps4 central pore to engage substrates and generate force, thereby triggering ESCRT-III disassembly. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Han, Han; Monroe, Nicole; Votteler, Jörg; Shakya, Binita; Sundquist, Wesley I.; Hill, Christopher P.
2015-01-01
The endosomal sorting complexes required for transport (ESCRT) pathway drives reverse topology membrane fission events within multiple cellular pathways, including cytokinesis, multivesicular body biogenesis, repair of the plasma membrane, nuclear membrane vesicle formation, and HIV budding. The AAA ATPase Vps4 is recruited to membrane necks shortly before fission, where it catalyzes disassembly of the ESCRT-III lattice. The N-terminal Vps4 microtubule-interacting and trafficking (MIT) domains initially bind the C-terminal MIT-interacting motifs (MIMs) of ESCRT-III subunits, but it is unclear how the enzyme then remodels these substrates in response to ATP hydrolysis. Here, we report quantitative binding studies that demonstrate that residues from helix 5 of the Vps2p subunit of ESCRT-III bind to the central pore of an asymmetric Vps4p hexamer in a manner that is dependent upon the presence of flexible nucleotide analogs that can mimic multiple states in the ATP hydrolysis cycle. We also find that substrate engagement is autoinhibited by the Vps4p MIT domain and that this inhibition is relieved by binding of either Type 1 or Type 2 MIM elements, which bind the Vps4p MIT domain through different interfaces. These observations support the model that Vps4 substrates are initially recruited by an MIM-MIT interaction that activates the Vps4 central pore to engage substrates and generate force, thereby triggering ESCRT-III disassembly. PMID:25833946
Identifying DNA-binding proteins using structural motifs and the electrostatic potential
Shanahan, Hugh P.; Garcia, Mario A.; Jones, Susan; Thornton, Janet M.
2004-01-01
Robust methods to detect DNA-binding proteins from structures of unknown function are important for structural biology. This paper describes a method for identifying such proteins that have (i) a solvent-accessible structural motif necessary for DNA binding and (ii) a positive electrostatic potential around the binding region. We focus on three structural motifs: helix–turn–helix (HTH), helix–hairpin–helix (HhH) and helix–loop–helix (HLH). We find that the combination of these variables detects 78% of proteins with an HTH motif, which is a substantial improvement over previous work based purely on structural templates and is comparable to more complex methods of identifying DNA-binding proteins. Similar true positive fractions are achieved for the HhH and HLH motifs. We see evidence of wide evolutionary diversity for DNA-binding proteins with an HTH motif, and much smaller diversity for those with an HhH or HLH motif. PMID:15356290
Rouka, Evgenia; Simister, Philip C.; Janning, Melanie; Kumbrink, Joerg; Konstantinou, Tassos; Muniz, João R. C.; Joshi, Dhira; O'Reilly, Nicola; Volkmer, Rudolf; Ritter, Brigitte; Knapp, Stefan; von Delft, Frank; Kirsch, Kathrin H.; Feller, Stephan M.
2015-01-01
CD2AP is an adaptor protein involved in membrane trafficking, with essential roles in maintaining podocyte function within the kidney glomerulus. CD2AP contains three Src homology 3 (SH3) domains that mediate multiple protein-protein interactions. However, a detailed comparison of the molecular binding preferences of each SH3 remained unexplored, as well as the discovery of novel interactors. Thus, we studied the binding properties of each SH3 domain to the known interactor Casitas B-lineage lymphoma protein (c-CBL), conducted a peptide array screen based on the recognition motif PxPxPR and identified 40 known or novel candidate binding proteins, such as RIN3, a RAB5-activating guanine nucleotide exchange factor. CD2AP SH3 domains 1 and 2 generally bound with similar characteristics and specificities, whereas the SH3-3 domain bound more weakly to most peptide ligands tested yet recognized an unusually extended sequence in ALG-2-interacting protein X (ALIX). RIN3 peptide scanning arrays revealed two CD2AP binding sites, recognized by all three SH3 domains, but SH3-3 appeared non-functional in precipitation experiments. RIN3 recruited CD2AP to RAB5a-positive early endosomes via these interaction sites. Permutation arrays and isothermal titration calorimetry data showed that the preferred binding motif is Px(P/A)xPR. Two high-resolution crystal structures (1.65 and 1.11 Å) of CD2AP SH3-1 and SH3-2 solved in complex with RIN3 epitopes 1 and 2, respectively, indicated that another extended motif is relevant in epitope 2. In conclusion, we have discovered novel interaction candidates for CD2AP and characterized subtle yet significant differences in the recognition preferences of its three SH3 domains for c-CBL, ALIX, and RIN3. PMID:26296892
Liu, Bingqiang; Zhang, Hanyuan; Zhou, Chuan; Li, Guojun; Fennell, Anne; Wang, Guanghui; Kang, Yu; Liu, Qi; Ma, Qin
2016-08-09
Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. 
Its application may enhance progress in elucidating transcription regulation mechanism, thus provide benefit to the genomic research community and prokaryotic genome researchers in particular.
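The abstract says that promoter scoring and pruning is done through motif voting by several complementary tools, without giving the rule. One plausible reading is sketched below as an assumption, not as the MP(3) implementation: each tool "votes" for the promoters in which it reports a motif, and promoters with fewer than `min_votes` votes are pruned. The tool names, promoter identifiers and threshold are all hypothetical.

```python
def vote_and_prune(predictions, min_votes):
    """Count, per promoter, how many tools reported a motif; keep well-supported promoters.

    predictions: dict mapping a tool name to the set of promoter ids
                 in which that tool reported at least one motif.
    """
    votes = {}
    for promoters in predictions.values():
        for promoter in promoters:
            votes[promoter] = votes.get(promoter, 0) + 1
    kept = sorted(p for p, v in votes.items() if v >= min_votes)
    return votes, kept

# Hypothetical predictions from three motif finders.
predictions = {
    "tool_a": {"p1", "p2", "p3"},
    "tool_b": {"p1", "p3"},
    "tool_c": {"p3", "p4"},
}
votes, kept = vote_and_prune(predictions, min_votes=2)
print(kept)  # ['p1', 'p3']
```

Requiring agreement between tools trades sensitivity for fewer false positives, which matches the stated goal of eliminating the effect of random noise.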
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shimomura, Tadanori; Miyamura, Norio; Hata, Shoji
2014-01-17
Highlights: •Loss of the PDZ-binding motif inhibits constitutively active YAP (5SA)-induced oncogenic cell transformation. •The PDZ-binding motif of YAP promotes its nuclear localization in cultured cells and mouse liver. •Loss of the PDZ-binding motif inhibits YAP (5SA)-induced CTGF transcription in cultured cells and mouse liver. -- Abstract: YAP is a transcriptional co-activator that acts downstream of the Hippo signaling pathway and regulates multiple cellular processes, including proliferation. Hippo pathway-dependent phosphorylation of YAP negatively regulates its function. Conversely, attenuation of Hippo-mediated phosphorylation of YAP increases its ability to stimulate proliferation and eventually induces oncogenic transformation. The C-terminus of YAP contains a highly conserved PDZ-binding motif that regulates YAP’s functions in multiple ways. However, to date, the importance of the PDZ-binding motif to the oncogenic cell transforming activity of YAP has not been determined. In this study, we disrupted the PDZ-binding motif in the YAP (5SA) protein, in which the sites normally targeted by Hippo pathway-dependent phosphorylation are mutated. We found that loss of the PDZ-binding motif significantly inhibited the oncogenic transformation of cultured cells induced by YAP (5SA). In addition, the increased nuclear localization of YAP (5SA) and its enhanced activation of TEAD-dependent transcription of the cell proliferation gene CTGF were strongly reduced when the PDZ-binding motif was deleted. Similarly, in mouse liver, deletion of the PDZ-binding motif suppressed nuclear localization of YAP (5SA) and YAP (5SA)-induced CTGF expression. Taken together, our results indicate that the PDZ-binding motif of YAP is critical for YAP-mediated oncogenesis, and that this effect is mediated by YAP’s co-activation of TEAD-mediated CTGF transcription.
Structure of the myotonic dystrophy type 2 RNA and designed small molecules that reduce toxicity.
Childs-Disney, Jessica L; Yildirim, Ilyas; Park, HaJeung; Lohman, Jeremy R; Guan, Lirui; Tran, Tuan; Sarkar, Partha; Schatz, George C; Disney, Matthew D
2014-02-21
Myotonic dystrophy type 2 (DM2) is an incurable neuromuscular disorder caused by a r(CCUG) expansion (r(CCUG)(exp)) that folds into an extended hairpin with periodically repeating 2×2 nucleotide internal loops (5'CCUG/3'GUCC). We designed multivalent compounds that improve DM2-associated defects using information about RNA-small molecule interactions. We also report the first crystal structure of r(CCUG) repeats refined to 2.35 Å. Structural analysis of the three 5'CCUG/3'GUCC repeat internal loops (L) reveals that the CU pairs in L1 are each stabilized by one hydrogen bond and a water-mediated hydrogen bond, while CU pairs in L2 and L3 are stabilized by two hydrogen bonds. Molecular dynamics (MD) simulations reveal that the CU pairs are dynamic and stabilized by Na(+) and water molecules. MD simulations of the binding of the small molecule to r(CCUG) repeats reveal that the lowest free energy binding mode occurs via the major groove, in which one C residue is unstacked and the cross-strand nucleotides are displaced. Moreover, we modeled the binding of our dimeric compound to two 5'CCUG/3'GUCC motifs, which shows that the scaffold on which the RNA-binding modules are displayed provides an optimal distance to span two adjacent loops.
Structure of the Myotonic Dystrophy Type 2 RNA and Designed Small Molecules That Reduce Toxicity
Park, HaJeung; Lohman, Jeremy R.; Guan, Lirui; Tran, Tuan; Sarkar, Partha; Schatz, George C.; Disney, Matthew D.
2014-01-01
Myotonic dystrophy type 2 (DM2) is an untreatable neuromuscular disorder caused by a r(CCUG) expansion (r(CCUG)exp) that folds into an extended hairpin with periodically repeating 2×2 nucleotide internal loops (5’CCUG/3’GUCC). We designed multivalent compounds that improve DM2-associated defects using information about RNA-small molecule interactions. We also report the first crystal structure of r(CCUG)exp refined to 2.35 Å. Structural analysis of the three 5’CCUG/3’GUCC repeat internal loops (L) reveals that the CU pairs in L1 are each stabilized by one hydrogen bond and a water-mediated hydrogen bond while CU pairs in L2 and L3 are stabilized by two hydrogen bonds. Molecular dynamics (MD) simulations reveal that the CU pairs are dynamic and stabilized by Na+ and water molecules. MD simulations of the binding of the small molecule to r(CCUG) repeats reveal that the lowest free energy binding mode occurs via the major groove, in which one C residue is unstacked and the cross-strand nucleotides are displaced. Moreover, we modeled the binding of our dimeric compound to two 5’CCUG/3’GUCC motifs, which shows that the scaffold on which the RNA-binding modules are displayed provides an optimal distance to span two adjacent loops. PMID:24341895
The substrate binding interface of alkylpurine DNA glycosylase AlkD.
Mullins, Elwood A; Rubinson, Emily H; Eichman, Brandt F
2014-01-01
Tandem helical repeats have emerged as an important DNA binding architecture. DNA glycosylase AlkD, which excises N3- and N7-alkylated nucleobases, uses repeating helical motifs to bind duplex DNA and to selectively pause at non-Watson-Crick base pairs. Remodeling of the DNA backbone promotes nucleotide flipping of the lesion and the complementary base into the solvent and toward the protein surface, respectively. The important features of this new DNA binding architecture that allow AlkD to distinguish between damaged and normal DNA without contacting the lesion are poorly understood. Here, we show through extensive mutational analysis that DNA binding and N3-methyladenine (3mA) and N7-methylguanine (7mG) excision are dependent upon each residue lining the DNA binding interface. Disrupting electrostatic or hydrophobic interactions with the DNA backbone substantially reduced binding affinity and catalytic activity. These results demonstrate that residues seemingly only involved in general DNA binding are important for catalytic activity and imply that base excision is driven by binding energy provided by the entire substrate interface of this novel DNA binding architecture. Copyright © 2013 Elsevier B.V. All rights reserved.
Principles of regulatory information conservation between mouse and human.
Cheng, Yong; Ma, Zhihai; Kim, Bong-Hyun; Wu, Weisheng; Cayting, Philip; Boyle, Alan P; Sundaram, Vasavi; Xing, Xiaoyun; Dogan, Nergiz; Li, Jingjing; Euskirchen, Ghia; Lin, Shin; Lin, Yiing; Visel, Axel; Kawli, Trupti; Yang, Xinqiong; Patacsil, Dorrelyn; Keller, Cheryl A; Giardine, Belinda; Kundaje, Anshul; Wang, Ting; Pennacchio, Len A; Weng, Zhiping; Hardison, Ross C; Snyder, Michael P
2014-11-20
To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human-mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.
SARNAclust: Semi-automatic detection of RNA protein binding motifs from immunoprecipitation data
Dotu, Ivan; Adamson, Scott I.; Coleman, Benjamin; Fournier, Cyril; Ricart-Altimiras, Emma; Eyras, Eduardo
2018-01-01
RNA-protein binding is critical to gene regulation, controlling fundamental processes including splicing, translation, localization and stability, and aberrant RNA-protein interactions are known to play a role in a wide variety of diseases. However, molecular understanding of RNA-protein interactions remains limited; in particular, identification of RNA motifs that bind proteins has long been challenging, especially when such motifs depend on both sequence and structure. Moreover, although RNA binding proteins (RBPs) often contain more than one binding domain, algorithms capable of identifying more than one binding motif simultaneously have not been developed. In this paper we present a novel pipeline to determine binding peaks in crosslinking immunoprecipitation (CLIP) data, to discover multiple possible RNA sequence/structure motifs among them, and to experimentally validate such motifs. At the core is a new semi-automatic algorithm SARNAclust, the first unsupervised method to identify and deconvolve multiple sequence/structure motifs simultaneously. SARNAclust computes similarity between sequence/structure objects using a graph kernel, providing the ability to isolate the impact of specific features through the bulge graph formalism. Application of SARNAclust to synthetic data shows its capability of clustering 5 motifs at once with a V-measure value of over 0.95, while GraphClust achieves only a V-measure of 0.083 and RNAcontext cannot detect any of the motifs. When applied to existing eCLIP sets, SARNAclust finds known motifs for SLBP and HNRNPC and novel motifs for several other RBPs such as AGGF1, AKAP8L and ILF3. We demonstrate an experimental validation protocol, a targeted Bind-n-Seq-like high-throughput sequencing approach that relies on RNA inverse folding for oligo pool design, that can validate the components within the SLBP motif. Finally, we use this protocol to experimentally interrogate the SARNAclust motif predictions for protein ILF3. Our results support a newly identified partially double-stranded UUUUUGAGA motif similar to that known for the splicing factor HNRNPC. PMID:29596423
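The V-measure quoted in the abstract above is a standard clustering score: the harmonic mean of homogeneity and completeness, both derived from conditional entropies. The following is a minimal sketch of that textbook definition, not code from SARNAclust; the function names and the toy labels in the usage note are illustrative assumptions.

```python
from collections import Counter
from math import log


def _entropy(labels):
    """Shannon entropy (natural log) of a label sequence."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def _conditional_entropy(target, given):
    """H(target | given) for two equal-length label sequences."""
    n = len(target)
    joint = Counter(zip(given, target))
    given_counts = Counter(given)
    return -sum((c / n) * log(c / given_counts[g]) for (g, _), c in joint.items())


def v_measure(true_labels, cluster_labels):
    """Harmonic mean of homogeneity and completeness (hypothetical helper)."""
    h_true = _entropy(true_labels)
    h_clust = _entropy(cluster_labels)
    # Homogeneity: each cluster holds members of a single true class.
    homogeneity = 1.0 if h_true == 0 else (
        1.0 - _conditional_entropy(true_labels, cluster_labels) / h_true)
    # Completeness: each true class falls into a single cluster.
    completeness = 1.0 if h_clust == 0 else (
        1.0 - _conditional_entropy(cluster_labels, true_labels) / h_clust)
    if homogeneity + completeness == 0:
        return 0.0
    return 2 * homogeneity * completeness / (homogeneity + completeness)
```

A clustering that recovers the true classes up to relabeling scores 1, while lumping all items into one cluster scores 0, which is the scale on which the reported values of 0.95 and 0.083 should be read.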
Bubis, José; Martínez, Juan Carlos; Calabokis, Maritza; Ferreira, Joilyneth; Sanz-Rodríguez, Carlos E; Navas, Victoria; Escalona, José Leonardo; Guo, Yurong; Taylor, Susan S
2018-03-01
The full gene sequence encoding the Trypanosoma equiperdum ortholog of the cAMP-dependent protein kinase (PKA) regulatory (R) subunits was cloned. A poly-His tagged construct was generated [TeqR-like(His)8], and the protein was expressed in bacteria and purified to homogeneity. The size of the purified TeqR-like(His)8 was determined to be ∼57,000 Da by molecular exclusion chromatography, indicating that the parasite protein is a monomer. Limited proteolysis with various proteases showed that the T. equiperdum R-like protein possesses a hinge region very susceptible to proteolysis. The recombinant TeqR-like(His)8 did not bind either [3H]cAMP or [3H]cGMP up to concentrations of 0.40 and 0.65 μM, respectively, and neither the parasite protein nor its proteolytically generated carboxy-terminal large fragments were capable of binding to a cAMP-Sepharose affinity column. Bioinformatics analyses predicted that the carboxy-terminal region of the trypanosomal R-like protein appears to fold similarly to the analogous region of all known PKA R subunits. However, the protein amino-terminal portion seems to be unrelated and shows homology with proteins that contain Leu-rich repeats, a folding motif that is particularly appropriate for protein-protein interactions. In addition, the three-dimensional structure of the T. equiperdum protein was modeled using the crystal structure of the bovine PKA RIα subunit as template. Molecular docking experiments predicted critical changes in the environment of the two putative nucleotide binding clefts of the parasite protein, and the resulting binding energy differences support the lack of cyclic nucleotide binding in the trypanosomal R-like protein. Copyright © 2017 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.
Sydor, Andrew M.; Lebrette, Hugo; Ariyakumaran, Rishikesh; Cavazza, Christine; Zamble, Deborah B.
2014-01-01
The pathogen Helicobacter pylori requires two nickel-containing enzymes, urease and [NiFe]-hydrogenase, for efficient colonization of the human gastric mucosa. These enzymes possess complex metallocenters that are assembled by teams of proteins in multistep pathways. One essential accessory protein is the GTPase HypB, which is required for Ni(II) delivery to [NiFe]-hydrogenase and participates in urease maturation. Ni(II) or Zn(II) binding to a site embedded in the GTPase domain of HypB modulates the enzymatic activity, suggesting a mechanism of regulation. In this study, biochemical and structural analyses of H. pylori HypB (HpHypB) revealed an intricate link between nucleotide and metal binding. HpHypB nickel coordination, stoichiometry, and affinity were modulated by GTP and GDP, an effect not observed for zinc, and biochemical evidence suggests that His-107 coordination to nickel toggles on and off in a nucleotide-dependent manner. These results are consistent with the crystal structure of HpHypB loaded with Ni(II), GDP, and Pi, which reveals a nickel site distinct from that of zinc-loaded Methanocaldococcus jannaschii HypB as well as subtle changes to the protein structure. Furthermore, Cys-142, a metal ligand from the Switch II GTPase motif, was identified as a key component of the signal transduction between metal binding and the enzymatic activity. Finally, potassium accelerated the enzymatic activity of HpHypB but had no effect on the other biochemical properties of the protein. Altogether, this molecular level information about HpHypB provides insight into its cellular function and illuminates a possible mechanism of metal ion discrimination. PMID:24338018
Sydor, Andrew M; Lebrette, Hugo; Ariyakumaran, Rishikesh; Cavazza, Christine; Zamble, Deborah B
2014-02-14
The pathogen Helicobacter pylori requires two nickel-containing enzymes, urease and [NiFe]-hydrogenase, for efficient colonization of the human gastric mucosa. These enzymes possess complex metallocenters that are assembled by teams of proteins in multistep pathways. One essential accessory protein is the GTPase HypB, which is required for Ni(II) delivery to [NiFe]-hydrogenase and participates in urease maturation. Ni(II) or Zn(II) binding to a site embedded in the GTPase domain of HypB modulates the enzymatic activity, suggesting a mechanism of regulation. In this study, biochemical and structural analyses of H. pylori HypB (HpHypB) revealed an intricate link between nucleotide and metal binding. HpHypB nickel coordination, stoichiometry, and affinity were modulated by GTP and GDP, an effect not observed for zinc, and biochemical evidence suggests that His-107 coordination to nickel toggles on and off in a nucleotide-dependent manner. These results are consistent with the crystal structure of HpHypB loaded with Ni(II), GDP, and Pi, which reveals a nickel site distinct from that of zinc-loaded Methanocaldococcus jannaschii HypB as well as subtle changes to the protein structure. Furthermore, Cys-142, a metal ligand from the Switch II GTPase motif, was identified as a key component of the signal transduction between metal binding and the enzymatic activity. Finally, potassium accelerated the enzymatic activity of HpHypB but had no effect on the other biochemical properties of the protein. Altogether, this molecular level information about HpHypB provides insight into its cellular function and illuminates a possible mechanism of metal ion discrimination.
2012-01-01
Background Discovering a compound that inhibits multiple proteins (i.e. polypharmacological targets) is a new paradigm for treating complex diseases (e.g. cancers and diabetes). In general, polypharmacological proteins often share similar local binding environments and motifs. With the exponential growth in the number of protein structures, finding similar structural binding motifs (pharma-motifs) is an urgent task for drug discovery (e.g. side effects and new uses for old drugs) and for understanding protein functions. Results We have developed a Space-Related Pharmamotifs method (called SRPmotif) to recognize binding motifs by searching against a protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs, which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% of interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% of the polypharmacological targets of each protein-ligand complex are annotated with the same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play key roles in protein functions. SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. Conclusions SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservation of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery.
PMID:23281852
Chiu, Yi-Yuan; Lin, Chun-Yu; Lin, Chih-Ta; Hsu, Kai-Cheng; Chang, Li-Zen; Yang, Jinn-Moon
2012-01-01
Discovering a compound that inhibits multiple proteins (i.e. polypharmacological targets) is a new paradigm for treating complex diseases (e.g. cancers and diabetes). In general, polypharmacological proteins often share similar local binding environments and motifs. With the exponential growth in the number of protein structures, finding similar structural binding motifs (pharma-motifs) is an urgent task for drug discovery (e.g. side effects and new uses for old drugs) and for understanding protein functions. We have developed a Space-Related Pharmamotifs method (called SRPmotif) to recognize binding motifs by searching against a protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs, which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% of interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% of the polypharmacological targets of each protein-ligand complex are annotated with the same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play key roles in protein functions. SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservation of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery.
Solution structure and DNA-binding properties of the C-terminal domain of UvrC from E.coli
Singh, S.; Folkers, G.E.; Bonvin, A.M.J.J.; Boelens, R.; Wechselberger, R.; Niztayev, A.; Kaptein, R.
2002-01-01
The C-terminal domain of the UvrC protein (UvrC CTD) is essential for 5′ incision in the prokaryotic nucleotide excision repair process. We have determined the three-dimensional structure of the UvrC CTD using heteronuclear NMR techniques. The structure shows two helix–hairpin–helix (HhH) motifs connected by a small connector helix. The UvrC CTD is shown to mediate structure-specific DNA binding. The domain binds to a single-stranded–double-stranded junction DNA, with a strong specificity towards looped duplex DNA that contains at least six unpaired bases per loop (‘bubble DNA’). Using chemical shift perturbation experiments, the DNA-binding surface is mapped to the first hairpin region encompassing the conserved glycine–valine–glycine residues followed by lysine–arginine–arginine, a positively charged surface patch and the second hairpin region consisting of glycine–isoleucine–serine. A model for the protein– DNA complex is proposed that accounts for this specificity. PMID:12426397
A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data
2014-01-01
Abstract ChIP-Seq (chromatin immunoprecipitation sequencing) has made motif finding easier because ChIP-Seq experiments narrow the search to binding site locations. Recent motif finding tools facilitate motif detection by providing user-friendly Web interfaces. In this work, we reviewed nine motif finding Web tools that are capable of detecting binding site motifs in ChIP-Seq data. We showed that each motif finding Web tool has its own advantages for detecting motifs that other tools may not discover. We recommend that users apply multiple motif finding Web tools that implement different algorithms to obtain significant motifs, overlapping similar motifs, and non-overlapping motifs. Finally, we provide suggestions for the future development of motif finding Web tools that better assist researchers in finding motifs in ChIP-Seq data. Reviewers This article was reviewed by Prof. Sandor Pongor, Dr. Yuriy Gusev, and Dr. Shyam Prabhakar (nominated by Prof. Limsoon Wong). PMID:24555784
DOE Office of Scientific and Technical Information (OSTI.GOV)
Misic, Ana M.; Satyshur, Kenneth A.; Forest, Katrina T.
Type IV pili are bacterial extracellular filaments that can be retracted to create force and motility. Retraction is accomplished by the motor protein PilT. Crystal structures of Pseudomonas aeruginosa PilT with and without bound β,γ-methyleneadenosine-5′-triphosphate have been solved at 2.6 Å and 3.1 Å resolution, respectively, revealing an interlocking hexamer formed by the action of a crystallographic 2-fold symmetry operator on three subunits in the asymmetric unit and held together by extensive ionic interactions. The roles of two invariant carboxylates, Asp Box motif Glu163 and Walker B motif Glu204, have been assigned to Mg²⁺ binding and catalysis, respectively. The nucleotide ligands in each of the subunits in the asymmetric unit of the β,γ-methyleneadenosine-5′-triphosphate-bound PilT are not equally well ordered. Similarly, the three subunits in the asymmetric unit of both structures exhibit differing relative conformations of the two domains. The 12° and 20° domain rotations indicate motions that occur during the ATP-coupled mechanism of the disassembly of pili into membrane-localized pilin monomers. Integrating these observations, we propose a three-state 'Ready, Active, Release' model for the action of PilT.
Structural basis for the binding of tryptophan-based motifs by δ-COP
Suckling, Richard J.; Poon, Pak Phi; Travis, Sophie M.; Majoul, Irina V.; Hughson, Frederick M.; Evans, Philip R.; Duden, Rainer; Owen, David J.
2015-01-01
Coatomer consists of two subcomplexes: the membrane-targeting, ADP ribosylation factor 1 (Arf1):GTP-binding βγδζ-COP F-subcomplex, which is related to the adaptor protein (AP) clathrin adaptors, and the cargo-binding αβ’ε-COP B-subcomplex. We present the structure of the C-terminal μ-homology domain of the yeast δ-COP subunit in complex with the WxW motif from its binding partner, the endoplasmic reticulum-localized Dsl1 tether. The motif binds at a site distinct from that used by the homologous AP μ subunits to bind YxxΦ cargo motifs with its two tryptophan residues sitting in compatible pockets. We also show that the Saccharomyces cerevisiae Arf GTPase-activating protein (GAP) homolog Gcs1p uses a related WxxF motif at its extreme C terminus to bind to δ-COP at the same site in the same way. Mutations designed on the basis of the structure in conjunction with isothermal titration calorimetry confirm the mode of binding and show that mammalian δ-COP binds related tryptophan-based motifs such as that from ArfGAP1 in a similar manner. We conclude that δ-COP subunits bind Wxn(1–6)[WF] motifs within unstructured regions of proteins that influence the lifecycle of COPI-coated vesicles; this conclusion is supported by the observation that, in the context of a sensitizing domain deletion in Dsl1p, mutating the tryptophan-based motif-binding site in yeast causes defects in both growth and carboxypeptidase Y trafficking/processing. PMID:26578768
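The Wxn(1–6)[WF] pattern named in the abstract above (a tryptophan, one to six residues of any kind, then tryptophan or phenylalanine) can be written as a regular expression over one-letter amino acid codes. The sketch below is only an illustration of that pattern; the function name and the short test peptides are hypothetical and are not sequences from the study.

```python
import re

# W, then 1-6 arbitrary residues (lazy, so the shortest spacer is reported),
# then W or F. The lookahead lets candidates starting at different
# positions overlap instead of consuming each other.
WXN_WF = re.compile(r"(?=(W[A-Z]{1,6}?[WF]))")


def find_trp_motifs(seq):
    """Return (start, match) pairs for Wxn(1-6)[WF] candidates in a protein sequence."""
    return [(m.start(), m.group(1)) for m in WXN_WF.finditer(seq.upper())]
```

A sequence match of this kind only flags candidates. The abstract locates functional motifs within unstructured regions, which a regular expression alone cannot assess.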
Niknezhad, Zhila; Hassani, Leila; Norouzi, Davood
2016-01-01
c-MYC DNA is an attractive target for drug design, especially for cancer chemotherapy. Around 90% of c-MYC transcription is controlled by NHE III1, whose 27-nt purine-rich strand has the ability to form a G-quadruplex structure. In this investigation, the interaction of ActD with the 27-nt G-rich strand (G/c-MYC), its equimolar mixture with the complementary sequence (GC/c-MYC), and the related C-rich oligonucleotide (C/c-MYC) was evaluated. Molecular dynamics simulations showed that the phenoxazine and lactone rings of ActD come close to the outer G-tetrad nucleotides, indicating that ActD binds through end-stacking to the quadruplex DNA. RMSD and RMSF revealed that fluctuation of the quadruplex DNA increases upon interaction with the drug. The results of spectrophotometry and spectrofluorometry indicated that ActD most probably binds to the c-MYC quadruplex and duplex DNA via end-stacking and intercalation, respectively, and that the polarity of the ActD environment decreases upon interaction. It was also found that binding of ActD to the GC-rich DNA is stronger than to the two other forms of DNA. Circular dichroism results showed that the structural type of the three DNA forms does not change, but their compactness is altered by interaction with ActD. Finally, it can be concluded that ActD binds differently to double stranded DNA, quadruplex DNA and i-motif. Copyright © 2015 Elsevier B.V. All rights reserved.
Lofgren, Michael; Koutmos, Markos; Banerjee, Ruma
2013-10-25
MeaB is an accessory GTPase protein involved in the assembly, protection, and reactivation of 5'-deoxyadenosyl cobalamin-dependent methylmalonyl-CoA mutase (MCM). Mutations in the human ortholog of MeaB result in methylmalonic aciduria, an inborn error of metabolism. G-proteins typically utilize conserved switch I and II motifs for signaling to effector proteins via conformational changes elicited by nucleotide binding and hydrolysis. Our recent discovery that MeaB utilizes an unusual switch III region for bidirectional signaling with MCM raised questions about the roles of the switch I and II motifs in MeaB. In this study, we addressed the functions of conserved switch II residues by performing alanine-scanning mutagenesis. Our results demonstrate that the GTPase activity of MeaB is autoinhibited by switch II and that this loop is important for coupling nucleotide-sensitive conformational changes in switch III to elicit the multiple chaperone functions of MeaB. Furthermore, we report the structure of MeaB·GDP crystallized in the presence of AlFx(-) to form the putative transition state analog, GDP·AlF4(-). The resulting crystal structure and its comparison with related G-proteins support the conclusion that the catalytic site of MeaB is incomplete in the absence of the GTPase-activating protein MCM and therefore unable to stabilize the transition state analog. Favoring an inactive conformation in the absence of the client MCM protein might represent a strategy for suppressing the intrinsic GTPase activity of MeaB in which the switch II loop plays an important role.
The Verrucomicrobia LexA-Binding Motif: Insights into the Evolutionary Dynamics of the SOS Response.
Erill, Ivan; Campoy, Susana; Kılıç, Sefa; Barbé, Jordi
2016-01-01
The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division, and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.
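The consensus TGTTC-N4-GAACA reported above is palindromic in the DNA sense: the right half-site is the reverse complement of the left one, separated by four unconstrained bases. As a rough illustration, assuming exact matching to the consensus (real LexA operators tolerate mismatches, which this sketch ignores), the site can be located with a regular expression. The function names and the example sequence in the usage note are hypothetical.

```python
import re

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Two 5-bp half-sites around a 4-bp spacer: 5 + 4 + 5 = 14 bp.
LEXA_SITE = re.compile(r"TGTTC[ACGT]{4}GAACA")


def find_sites(seq):
    """Return (start, site) pairs for exact consensus matches."""
    return [(m.start(), m.group()) for m in LEXA_SITE.finditer(seq.upper())]
```

For example, `find_sites("ccTGTTCacgtGAACAgg")` reports one 14-bp site starting at index 2. In practice a position weight matrix would usually replace the exact pattern so that degenerate operators are also scored.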
Switch I-dependent allosteric signaling in a G-protein chaperone-B12 enzyme complex.
Campanello, Gregory C; Lofgren, Michael; Yokom, Adam L; Southworth, Daniel R; Banerjee, Ruma
2017-10-27
G-proteins regulate various processes ranging from DNA replication and protein synthesis to cytoskeletal dynamics and cofactor assimilation and serve as models for uncovering strategies deployed for allosteric signal transduction. MeaB is a multifunctional G-protein chaperone, which gates loading of the active 5'-deoxyadenosylcobalamin cofactor onto methylmalonyl-CoA mutase (MCM) and precludes loading of inactive cofactor forms. MeaB also safeguards MCM, which uses radical chemistry, against inactivation and rescues MCM inactivated during catalytic turnover by using the GTP-binding energy to offload inactive cofactor. The conserved switch I and II signaling motifs used by G-proteins are predicted to mediate allosteric regulation in response to nucleotide binding and hydrolysis in MeaB. Herein, we targeted conserved residues in the MeaB switch I motif to interrogate the function of this loop. Unexpectedly, the switch I mutations had only modest effects on GTP binding and on GTPase activity and did not perturb stability of the MCM-MeaB complex. However, these mutations disrupted multiple MeaB chaperone functions, including cofactor editing, loading, and offloading. Hence, although residues in the switch I motif are not essential for catalysis, they are important for allosteric regulation. Furthermore, single-particle EM analysis revealed, for the first time, the overall architecture of the MCM-MeaB complex, which exhibits a 2:1 stoichiometry. These EM studies also demonstrate that the complex exhibits considerable conformational flexibility. In conclusion, the switch I element does not significantly stabilize the MCM-MeaB complex or influence the affinity of MeaB for GTP but is required for transducing signals between MeaB and MCM. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
2004-01-01
The nucleotide pyrophosphatases/phosphodiesterases NPP1 and NPP2/autotaxin are structurally related eukaryotic ecto-enzymes, but display a very different substrate specificity. NPP1 releases nucleoside 5′-monophosphates from various nucleotides, whereas NPP2 mainly functions as a lysophospholipase D. We have used a domain-swapping approach to map substrate-specifying determinants of NPP1 and NPP2. The catalytic domain of NPP1 fused to the N- and C-terminal domains of NPP2 was hyperactive as a nucleotide phosphodiesterase, but did not show any lysophospholipase D activity. In contrast, chimaeras of the catalytic domain of NPP2 and the N- and/or C-terminal domains of NPP1 were completely inactive. These data indicate that the catalytic domain as well as both extremities of NPP2 contain lysophospholipid-specifying sequences. Within the catalytic domain of NPP1 and NPP2, we have mapped residues close to the catalytic site that determine the activities towards nucleotides and lysophospholipids. We also show that the conserved Gly/Phe-Xaa-Gly-Xaa-Xaa-Gly (G/FXGXXG) motif near the catalytic site is required for metal binding, but is not involved in substrate-specification. Our data suggest that the distinct activities of NPP1 and NPP2 stem from multiple differences throughout the polypeptide chain. PMID:15096095
Structure of Rot, a global regulator of virulence genes in Staphylococcus aureus.
Zhu, Yuwei; Fan, Xiaojiao; Zhang, Xu; Jiang, Xuguang; Niu, Liwen; Teng, Maikun; Li, Xu
2014-09-01
Staphylococcus aureus is a highly versatile pathogen that can infect human tissue by producing a large arsenal of virulence factors that are tightly regulated by a complex regulatory network. Rot, which shares sequence similarity with SarA homologues, is a global regulator that regulates numerous virulence genes. However, the recognition model of Rot for the promoter region of target genes and the putative regulation mechanism remain elusive. In this study, the 1.77 Å resolution X-ray crystal structure of Rot is reported. The structure reveals that two Rot molecules form a compact homodimer, each of which contains a typical helix-turn-helix module and a β-hairpin motif connected by a flexible loop. Fluorescence polarization results indicate that Rot preferentially recognizes AT-rich dsDNA with ~30-base-pair nucleotides and that the conserved positively charged residues on the winged-helix motif are vital for binding to the AT-rich dsDNA. It is proposed that the DNA-recognition model of Rot may be similar to that of SarA, SarR and SarS, in which the helix-turn-helix motifs of each monomer interact with the major grooves of target dsDNA and the winged motifs contact the minor grooves. Interestingly, the structure shows that Rot adopts a novel dimerization model that differs from that of other SarA homologues. As expected, perturbation of the dimer interface abolishes the dsDNA-binding ability of Rot, suggesting that Rot functions as a dimer. In addition, the results have been further confirmed in vivo by measuring the transcriptional regulation of α-toxin, a major virulence factor produced by most S. aureus strains.
Ghosh, Supratim; Mallick, Sumana; Das, Upasana; Verma, Ajay; Pal, Uttam; Chatterjee, Sabyasachi; Nandy, Abhishek; Saha, Krishna D; Maiti, Nakul Chandra; Baishya, Bikash; Suresh Kumar, G; Gmeiner, William H
2018-03-01
We report, based on biophysical studies and molecular mechanical calculations, that curcumin binds a DNA hairpin in the minor groove adjacent to the loop region, forming a stable complex. UV-Vis and fluorescence spectroscopy indicated interaction of curcumin with the DNA hairpin. In this novel binding motif, two γH of the curcumin heptadiene chain are closely positioned to A16-H8 and A17-H8, while G12-H8 is located in close proximity to the curcumin αH. Molecular dynamics (MD) simulations suggest that the complex is stabilized by noncovalent forces including π-π stacking, H-bonding and hydrophobic interactions. Nuclear magnetic resonance (NMR) spectroscopy in combination with molecular dynamics simulations indicated that curcumin is bound in the minor groove, while circular dichroism (CD) spectra suggested a minute enhancement in base stacking and a small change in DNA helicity, without significant conformational change of the DNA hairpin structure. The DNA:curcumin complex formed with FdU nucleotides rather than thymidine demonstrated enhanced cytotoxicity towards oral cancer cells relative to the FdU-substituted hairpin alone. Fluorescence co-localization demonstrated stability of the complex in biologically relevant conditions, including its cellular uptake. Acridine orange/EtBr staining further confirmed the enhanced cytotoxic effects of the complex, suggesting apoptosis as the mode of cell death. Thus, curcumin can be noncovalently complexed to a small DNA hairpin for cellular delivery, and the complex showed increased cytotoxicity in combination with FdU nucleotides, demonstrating its potential for advanced cancer therapy. Copyright © 2017 Elsevier B.V. All rights reserved.
Yang, Peng; Wu, Min; Guo, Jing; Kwoh, Chee Keong; Przytycka, Teresa M; Zheng, Jie
2014-02-17
As a fundamental genomic element, meiotic recombination hotspot plays important roles in life sciences. Thus uncovering its regulatory mechanisms has broad impact on biomedical research. Despite the recent identification of the zinc finger protein PRDM9 and its 13-mer binding motif as major regulators for meiotic recombination hotspots, other regulators remain to be discovered. Existing methods for finding DNA sequence motifs of recombination hotspots often rely on the enrichment of co-localizations between hotspots and short DNA patterns, which ignore the cross-individual variation of recombination rates and sequence polymorphisms in the population. Our objective in this paper is to capture signals encoded in genetic variations for the discovery of recombination-associated DNA motifs. Recently, an algorithm called "LDsplit" has been designed to detect the association between single nucleotide polymorphisms (SNPs) and proximal meiotic recombination hotspots. The association is measured by the difference of population recombination rates at a hotspot between two alleles of a candidate SNP. Here we present an open source software tool of LDsplit, with integrative data visualization for recombination hotspots and their proximal SNPs. Applying LDsplit on SNPs inside an established 7-mer motif bound by PRDM9 we observed that SNP alleles preserving the original motif tend to have higher recombination rates than the opposite alleles that disrupt the motif. Running on SNP windows around hotspots each containing an occurrence of the 7-mer motif, LDsplit is able to guide the established motif finding algorithm of MEME to recover the 7-mer motif. In contrast, without LDsplit the 7-mer motif could not be identified. LDsplit is a software tool for the discovery of cis-regulatory DNA sequence motifs stimulating meiotic recombination hotspots by screening and narrowing down to hotspot associated SNPs. It is the first computational method that utilizes the genetic variation of recombination hotspots among individuals, opening a new avenue for motif finding. Tested on an established motif and simulated datasets, LDsplit shows promise to discover novel DNA motifs for meiotic recombination hotspots.
2014-01-01
Background As a fundamental genomic element, meiotic recombination hotspot plays important roles in life sciences. Thus uncovering its regulatory mechanisms has broad impact on biomedical research. Despite the recent identification of the zinc finger protein PRDM9 and its 13-mer binding motif as major regulators for meiotic recombination hotspots, other regulators remain to be discovered. Existing methods for finding DNA sequence motifs of recombination hotspots often rely on the enrichment of co-localizations between hotspots and short DNA patterns, which ignore the cross-individual variation of recombination rates and sequence polymorphisms in the population. Our objective in this paper is to capture signals encoded in genetic variations for the discovery of recombination-associated DNA motifs. Results Recently, an algorithm called “LDsplit” has been designed to detect the association between single nucleotide polymorphisms (SNPs) and proximal meiotic recombination hotspots. The association is measured by the difference of population recombination rates at a hotspot between two alleles of a candidate SNP. Here we present an open source software tool of LDsplit, with integrative data visualization for recombination hotspots and their proximal SNPs. Applying LDsplit on SNPs inside an established 7-mer motif bound by PRDM9 we observed that SNP alleles preserving the original motif tend to have higher recombination rates than the opposite alleles that disrupt the motif. Running on SNP windows around hotspots each containing an occurrence of the 7-mer motif, LDsplit is able to guide the established motif finding algorithm of MEME to recover the 7-mer motif. In contrast, without LDsplit the 7-mer motif could not be identified. Conclusions LDsplit is a software tool for the discovery of cis-regulatory DNA sequence motifs stimulating meiotic recombination hotspots by screening and narrowing down to hotspot associated SNPs. It is the first computational method that utilizes the genetic variation of recombination hotspots among individuals, opening a new avenue for motif finding. Tested on an established motif and simulated datasets, LDsplit shows promise to discover novel DNA motifs for meiotic recombination hotspots. PMID:24533858
A novel paired domain DNA recognition motif can mediate Pax2 repression of gene transcription.
Håvik, B; Ragnhildstveit, E; Lorens, J B; Saelemyr, K; Fauske, O; Knudsen, L K; Fjose, A
1999-12-20
The paired domain (PD) is an evolutionarily conserved DNA-binding domain encoded by the Pax gene family of developmental regulators. The Pax proteins are transcription factors and are involved in a variety of processes such as brain development, patterning of the central nervous system (CNS), and B-cell development. In this report we demonstrate that the zebrafish Pax2 PD can interact with a novel type of DNA sequences in vitro, the triple-A motif, consisting of a heptameric nucleotide sequence G/CAAACA/TC with an invariant core of three adjacent adenosines. This recognition sequence was found to be conserved in known natural Pax5 repressor elements involved in controlling the expression of the p53 and J-chain genes. By identifying similar high affinity binding sites in potential target genes of the Pax2 protein, including the pax2 gene itself, we obtained further evidence that the triple-A sites are biologically significant. The putative natural target sites also provide a basis for defining an extended consensus recognition sequence. In addition, we observed in transformation assays a direct correlation between Pax2 repressor activity and the presence of triple-A sites. The results suggest that a transcriptional regulatory function of Pax proteins can be modulated by PD binding to different categories of target sequences. Copyright 1999 Academic Press.
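As an illustration only (not the authors' method), the heptameric triple-A pattern G/CAAACA/TC quoted above can be written as the regular expression `[GC]AAAC[AT]C` and searched on both strands of a DNA sequence. The function names and the example sequence below are hypothetical.

```python
import re

# Triple-A heptamer from the abstract: G/C, A, A, A, C, A/T, C.
# The lookahead lets overlapping occurrences be reported as well.
TRIPLE_A = re.compile(r"(?=([GC]AAAC[AT]C))")
MOTIF_LEN = 7

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_triple_a_sites(seq: str):
    """Return (strand, start, match) tuples.

    `start` is the 0-based position of the leftmost base of the site
    on the forward strand, whichever strand carries the motif.
    """
    seq = seq.upper()
    hits = [("+", m.start(), m.group(1)) for m in TRIPLE_A.finditer(seq)]
    rc = reverse_complement(seq)
    for m in TRIPLE_A.finditer(rc):
        # Map the reverse-strand coordinate back onto the forward strand.
        hits.append(("-", len(seq) - m.start() - MOTIF_LEN, m.group(1)))
    return sorted(hits, key=lambda h: h[1])


if __name__ == "__main__":
    # Hypothetical test sequence containing one forward-strand site.
    print(find_triple_a_sites("TTGAAACACGG"))
```

A pattern match of this kind only flags candidate sites; it says nothing about binding affinity, which the abstract assessed experimentally.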
Okuda, Ken-ichi; Yanagihara, Sae; Sugayama, Tomomichi; Zendo, Takeshi; Nakayama, Jiro; Sonomoto, Kenji
2010-06-01
Lantibiotics are peptide-derived antibacterial substances produced by some Gram-positive bacteria and characterized by the presence of unusual amino acids, such as lanthionines and dehydrated amino acids. Because lantibiotic producers may be attacked by self-produced lantibiotics, they express immunity proteins on the cytoplasmic membrane. An ATP-binding cassette (ABC) transport system mediated by the LanFEG protein complex is a major system in lantibiotic immunity. Multiple-sequence alignment analysis revealed that LanF proteins contain the E loop, a variant of the Q loop, which is a well-conserved motif in the nucleotide-binding domains (NBDs) of general ABC transporters. To elucidate E loop function, we introduced a mutation in the NukF protein, which is involved in the nukacin-ISK-1 immunity system. Replacement of the glutamic acid in the E loop with glutamine (E85Q) resulted in slight decreases in the immunity level and transport activity. Additionally, the E85A mutation severely impaired the immunity level and transport activity. On the other hand, the ATPase activities of the purified E85Q and E85A mutants were nearly the same as that of the wild type. These results suggest that the E loop found in ABC transporters involved in lantibiotic immunity plays a significant role in the function of these transporters, especially in the structural change of the transmembrane domains.
Gherghe, Cristina; Lombo, Tania; Leonard, Christopher W.; Datta, Siddhartha A. K.; Bess, Julian W.; Gorelick, Robert J.; Rein, Alan; Weeks, Kevin M.
2010-01-01
All retroviral genomic RNAs contain a cis-acting packaging signal by which dimeric genomes are selectively packaged into nascent virions. However, it is not understood how Gag (the viral structural protein) interacts with these signals to package the genome with high selectivity. We probed the structure of murine leukemia virus RNA inside virus particles using SHAPE, a high-throughput RNA structure analysis technology. These experiments showed that NC (the nucleic acid binding domain derived from Gag) binds within the virus to the sequence UCUG-UR-UCUG. Recombinant Gag and NC proteins bound to this same RNA sequence in dimeric RNA in vitro; in all cases, interactions were strongest with the first U and final G in each UCUG element. The RNA structural context is critical: High-affinity binding requires base-paired regions flanking this motif, and two UCUG-UR-UCUG motifs are specifically exposed in the viral RNA dimer. Mutating the guanosine residues in these two motifs—only four nucleotides per genomic RNA—reduced packaging 100-fold, comparable to the level of nonspecific packaging. These results thus explain the selective packaging of dimeric RNA. This paradigm has implications for RNA recognition in general, illustrating how local context and RNA structure can create information-rich recognition signals from simple single-stranded sequence elements in large RNAs. PMID:20974908
A regulatory gene (ECO-orf4) required for ECO-0501 biosynthesis in Amycolatopsis orientalis.
Shen, Yang; Huang, He; Zhu, Li; Luo, Minyu; Chen, Daijie
2014-02-01
ECO-0501 is a novel linear polyene antibiotic discovered in Amycolatopsis orientalis. A recent study of the ECO-0501 biosynthesis pathway revealed the presence of a regulatory gene, ECO-orf4. The A. orientalis ECO-orf4 gene from the ECO-0501 biosynthesis cluster was analyzed, and its deduced protein (ECO-orf4) was found to have amino acid sequence homology with the large ATP-binding regulators of the LuxR (LAL) family. Database comparison revealed two hypothetical domains: a LuxR-type helix-turn-helix (HTH) DNA-binding motif near the C-terminus and an N-terminal nucleotide triphosphate (NTP) binding motif. Deletion of the corresponding gene (ECO-orf4) resulted in complete loss of ECO-0501 production. Complementation with one copy of intact ECO-orf4 restored polyene biosynthesis, demonstrating that ECO-orf4 is required for ECO-0501 biosynthesis. The effect of ECO-orf4 overexpression on ECO-0501 production indicated that it is a positive regulatory gene. Gene expression analysis by reverse transcription PCR of the ECO-0501 gene cluster showed that the transcription of ECO-orf4 correlates with that of genes involved in polyketide biosynthesis. These results demonstrated that ECO-orf4 is a pathway-specific positive regulatory gene that is essential for ECO-0501 biosynthesis. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A conserved intronic U1 snRNP-binding sequence promotes trans-splicing in Drosophila
Gao, Jun-Li; Fan, Yu-Jie; Wang, Xiu-Ye; Zhang, Yu; Pu, Jia; Li, Liang; Shao, Wei; Zhan, Shuai; Hao, Jianjiang
2015-01-01
Unlike typical cis-splicing, trans-splicing joins exons from two separate transcripts to produce chimeric mRNA and has been detected in most eukaryotes. Trans-splicing in trypanosomes and nematodes has been characterized as a spliced leader RNA-facilitated reaction; in contrast, its mechanism in higher eukaryotes remains unclear. Here we investigate mod(mdg4), a classic trans-spliced gene in Drosophila, and report that two critical RNA sequences in the middle of the last 5′ intron, TSA and TSB, promote trans-splicing of mod(mdg4). In TSA, a 13-nucleotide (nt) core motif is conserved across Drosophila species and is essential and sufficient for trans-splicing, which binds U1 small nuclear RNP (snRNP) through strong base-pairing with U1 snRNA. In TSB, a conserved secondary structure acts as an enhancer. Deletions of TSA and TSB using the CRISPR/Cas9 system result in developmental defects in flies. Although it is not clear how the 5′ intron finds the 3′ introns, compensatory changes in U1 snRNA rescue trans-splicing of TSA mutants, demonstrating that U1 recruitment is critical to promote trans-splicing in vivo. Furthermore, TSA core-like motifs are found in many other trans-spliced Drosophila genes, including lola. These findings represent a novel mechanism of trans-splicing, in which RNA motifs in the 5′ intron are sufficient to bring separate transcripts into close proximity to promote trans-splicing. PMID:25838544
James, Kevin A.; Verkhivker, Gennady M.
2014-01-01
The ErbB protein tyrosine kinases are among the most important cell signaling families and mutation-induced modulation of their activity is associated with diverse functions in biological networks and human disease. We have combined molecular dynamics simulations of the ErbB kinases with the protein structure network modeling to characterize the reorganization of the residue interaction networks during conformational equilibrium changes in the normal and oncogenic forms. Structural stability and network analyses have identified local communities integrated around high centrality sites that correspond to the regulatory spine residues. This analysis has provided a quantitative insight to the mechanism of mutation-induced “superacceptor” activity in oncogenic EGFR dimers. We have found that kinase activation may be determined by allosteric interactions between modules of structurally stable residues that synchronize the dynamics in the nucleotide binding site and the αC-helix with the collective motions of the integrating αF-helix and the substrate binding site. The results of this study have pointed to a central role of the conserved His-Arg-Asp (HRD) motif in the catalytic loop and the Asp-Phe-Gly (DFG) motif as key mediators of structural stability and allosteric communications in the ErbB kinases. We have determined that residues that are indispensable for kinase regulation and catalysis often corresponded to the high centrality nodes within the protein structure network and could be distinguished by their unique network signatures. The optimal communication pathways are also controlled by these nodes and may ensure efficient allosteric signaling in the functional kinase state. Structure-based network analysis has quantified subtle effects of ATP binding on conformational dynamics and stability of the EGFR structures. 
Consistent with the NMR studies, we have found that nucleotide-induced modulation of the residue interaction networks is not limited to the ATP site, and may enhance allosteric cooperativity with the substrate binding region by increasing communication capabilities of mediating residues. PMID:25427151
Ca2+-binding Motif of βγ-Crystallins
Srivastava, Shanti Swaroop; Mishra, Amita; Krishnan, Bal; Sharma, Yogendra
2014-01-01
βγ-Crystallin-type double clamp (N/D)(N/D)XX(S/T)S motif is an established but sparsely investigated motif for Ca2+ binding. A βγ-crystallin domain is formed of two Greek key motifs, accommodating two Ca2+-binding sites. βγ-Crystallins make a separate class of Ca2+-binding proteins (CaBP), apparently a major group of CaBP in bacteria. Paralleling the diversity in βγ-crystallin domains, these motifs also show great diversity, both in structure and in function. Although the expression of some of them has been associated with stress, virulence, and adhesion, the functional implications of Ca2+ binding to βγ-crystallins in mediating biological processes are yet to be elucidated. PMID:24567326
Massive GGAAs in genomic repetitive sequences serve as a nuclear reservoir of NF-κB.
Wu, Jian; Wang, Qiao; Dai, Wei; Wang, Wei; Yue, Ming; Wang, Jinke
2018-04-13
Nuclear factor κB (NF-κB) is a DNA-binding transcription factor. Characterizing its genomic binding sites is crucial for understanding its gene regulatory function and mechanism in cells. This study characterized the binding sites of NF-κB RelA/p65 in tumor necrosis factor-α (TNFα)-stimulated HeLa cells by precise chromatin immunoprecipitation sequencing (ChIP-seq). The results revealed that NF-κB binds nontraditional motifs (nt-motifs) containing a conserved GGAA quadruplet. Moreover, nt-motifs are mainly distributed in peaks near centromeres, which contain a larger number of repetitive elements such as satellites, simple repeats and short interspersed nuclear elements (SINEs). This intracellular binding pattern was then confirmed by in vitro detection, indicating that NF-κB dimers can bind the nontraditional κB (nt-κB) sites with low affinity. However, this binding hardly activates transcription. This study thus deduced that NF-κB binding to nt-motifs may serve functions other than the gene regulation carried out by NF-κB binding to traditional motifs (t-motifs). To test this deduction, ChIP-seq data from many other cell lines were then analyzed. The results indicate that NF-κB binding to nt-motifs is also widely present in other cells. The ChIP-seq data analysis also revealed that nt-motifs are more widely distributed in peaks with low-fold enrichment. Importantly, it was also found that NF-κB binding to nt-motifs is mainly present in resting cells, whereas NF-κB binding to t-motifs is mainly present in stimulated cells. Strikingly, no known function was enriched in the gene annotation of nt-motif peaks. Based on these results, this study proposed that the nt-κB sites, which are extensively distributed in large numbers of repeat elements, function as a nuclear reservoir of NF-κB. The nuclear NF-κB proteins stored at nt-κB sites in resting cells may be recruited to the t-κB sites to regulate target genes upon stimulation. 
Copyright © 2018 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. Published by Elsevier Ltd. All rights reserved.
Liang, Chunyang; Xiong, Ke; Szulwach, Keith E.; Zhang, Yi; Wang, Zhaohui; Peng, Junmin; Fu, Mingui; Jin, Peng; Suzuki, Hiroshi I.; Liu, Qinghua
2013-01-01
MicroRNAs (miRNA) control numerous physiological and pathological processes. Typically, the primary miRNA (pri-miRNA) transcripts are processed by nuclear Drosha complex into ∼70-nucleotide stem-loop precursor miRNAs (pre-miRNA), which are further cleaved by cytoplasmic Dicer complex into ∼21-nucleotide mature miRNAs. However, it is unclear how nascent pre-miRNAs are protected from ribonucleases, such as MCPIP1, that degrade pre-miRNAs to abort miRNA production. Here, we identify Sjögren syndrome antigen B (SSB)/La as a pre-miRNA-binding protein that regulates miRNA processing in vitro. All three RNA-binding motifs (LAM, RRM1, and RRM2) of La/SSB are required for efficient pre-miRNA binding. Intriguingly, La/SSB recognizes the characteristic stem-loop structure of pre-miRNAs, of which the majority lack a 3′ UUU terminus. Moreover, La/SSB associates with endogenous pri-/pre-miRNAs and promotes miRNA biogenesis by stabilizing pre-miRNAs from nuclease (e.g. MCPIP1)-mediated decay in mammalian cells. Accordingly, we observed positive correlations between the expression status of La/SSB and Dicer in human cancer transcriptome and prognosis. These studies identify an important function of La/SSB as a global regulator of miRNA expression, and implicate stem-loop recognition as a major mechanism that mediates association between La/SSB and diverse RNA molecules. PMID:23129761
Boehm, Elizabeth M.; Powers, Kyle T.; Kondratick, Christine M.; Spies, Maria; Houtman, Jon C. D.; Washington, M. Todd
2016-01-01
Y-family DNA polymerases, such as polymerase η, polymerase ι, and polymerase κ, catalyze the bypass of DNA damage during translesion synthesis. These enzymes are recruited to sites of DNA damage by interacting with the essential replication accessory protein proliferating cell nuclear antigen (PCNA) and the scaffold protein Rev1. In most Y-family polymerases, these interactions are mediated by one or more conserved PCNA-interacting protein (PIP) motifs that bind in a hydrophobic pocket on the front side of PCNA as well as by conserved Rev1-interacting region (RIR) motifs that bind in a hydrophobic pocket on the C-terminal domain of Rev1. Yeast polymerase η, a prototypical translesion synthesis polymerase, binds both PCNA and Rev1. It possesses a single PIP motif but not an RIR motif. Here we show that the PIP motif of yeast polymerase η mediates its interactions both with PCNA and with Rev1. Moreover, the PIP motif of polymerase η binds in the hydrophobic pocket on the Rev1 C-terminal domain. We also show that the RIR motif of human polymerase κ and the PIP motif of yeast Msh6 bind both PCNA and Rev1. Overall, these findings demonstrate that PIP motifs and RIR motifs have overlapping specificities and can interact with both PCNA and Rev1 in structurally similar ways. These findings also suggest that PIP motifs are a more versatile protein interaction motif than previously believed. PMID:26903512
Lindfors, Hanna E; Venkata, Bharat Somireddy; Drijfhout, Jan W; Ubbink, Marcellus
2011-02-18
The interaction between a peptide encompassing the SH3 and SH2 binding motifs of focal adhesion kinase (FAK) and the Src SH3-SH2 domains has been investigated with NMR spectroscopy and calorimetry. The binding to both motifs is anti-cooperative. Reduction of the long linker connecting the motifs does not lead to cooperativity. Short linkers that do not allow simultaneous intramolecular binding of the peptide to both motifs cause peptide-mediated dimerisation, even with a linker of only three amino acids. The role of the SH3 binding motif is discussed in view of the independent nature of the SH interactions. Copyright © 2011 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Xu, Yu-Xin; Ma, Anna; Liu, Li
2013-01-01
GDP-fucose transporter plays a crucial role in fucosylation of glycoproteins by providing activated fucose donor, GDP-fucose, for fucosyltransferases in the lumen of the Golgi apparatus. Fucose-containing glycans are involved in many biological processes, which are essential for growth and development. Mutations in the GDP-fucose transporter gene cause leukocyte adhesion deficiency syndrome II, a disease characterized by slow growth, mental retardation and immunodeficiency. However, no information is available regarding its transcriptional regulation. Here, by using human cells, we show that TGF-β1 specifically induces the GDP-fucose transporter expression, but not other transporters tested such as CMP-sialic acid transporter, suggesting a diversity of regulatory pathways for the expression of these transporters. The regulatory elements that are responsive to the TGF-β1 stimulation are present in the region between bp −330 and −268 in the GDP-fucose transporter promoter. We found that this region contains two identical octamer GC-rich motifs (GGGGCGTG) that were demonstrated to be essential for the transporter expression. We also show that the transcription factor Sp1 specifically binds to the GC-rich motifs in vitro and Sp1 coupled with phospho-Smad2 is associated with the promoter region covering the Sp1-binding motifs in vivo using chromatin immunoprecipitation (ChIP) assays. In addition, we further confirmed that Sp1 is essential for the GDP-fucose transporter expression stimulated by TGF-β1 using a luciferase reporter system. These results highlight the role of TGF-β signaling in regulation of the GDP-fucose transporter expression via activating Sp1. This is the first transcriptional study for any nucleotide sugar transporters that have been identified so far. Notably, TGF-β1 receptor itself is known to be modified by fucosylation. 
Given the essential role of GDP-fucose transporter in fucosylation, the finding that TGF-β1 stimulates the expression of this transporter, suggests a possible intracellular link between the function of nucleotide sugar transporter and the TGF-β signaling pathway. PMID:24069312
MotifMark: Finding regulatory motifs in DNA sequences.
Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L; Wang, May D
2017-07-01
The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity between proteins and DNA motifs. Despite their success, these technologies have their own limitations and fall short in precise characterization of motifs, and as a result, require further downstream analysis to extract useful and interpretable information from a haystack of noisy and inaccurate data. Here we propose MotifMark, a new algorithm based on graph theory and machine learning, that can find binding sites on candidate probes and rank their specificity in regard to the underlying transcription factor. We developed a pipeline to analyze experimental data derived from compact universal protein binding microarrays and benchmarked it against two of the most accurate motif search methods. Our results indicate that MotifMark can be a viable alternative technique for prediction of motif from protein binding microarrays and possibly other related high-throughput techniques.
Structure-Function Model for Kissing Loop Interactions That Initiate Dimerization of Ty1 RNA
Gamache, Eric R.; Doh, Jung H.; Ritz, Justin; Laederach, Alain; Bellaousov, Stanislav; Mathews, David H.; Curcio, M. Joan
2017-01-01
The genomic RNA of the retrotransposon Ty1 is packaged as a dimer into virus-like particles. The 5′ terminus of Ty1 RNA harbors cis-acting sequences required for translation initiation, packaging and initiation of reverse transcription (TIPIRT). To identify RNA motifs involved in dimerization and packaging, a structural model of the TIPIRT domain in vitro was developed from single-nucleotide resolution RNA structural data. In general agreement with previous models, the first 326 nucleotides of Ty1 RNA form a pseudoknot with a 7-bp stem (S1), a 1-nucleotide interhelical loop and an 8-bp stem (S2) that delineate two long, structured loops. Nucleotide substitutions that disrupt either pseudoknot stem greatly reduced helper-Ty1-mediated retrotransposition of a mini-Ty1, but only mutations in S2 destabilized mini-Ty1 RNA in cis and helper-Ty1 RNA in trans. Nested in different loops of the pseudoknot are two hairpins with complementary 7-nucleotide motifs at their apices. Nucleotide substitutions in either motif also reduced retrotransposition and destabilized mini- and helper-Ty1 RNA. Compensatory mutations that restore base-pairing in the S2 stem or between the hairpins rescued retrotransposition and RNA stability in cis and trans. These data inform a model whereby a Ty1 RNA kissing complex with two intermolecular kissing-loop interactions initiates dimerization and packaging. PMID:28445416
Maurer-Stroh, Sebastian; Gao, He; Han, Hao; Baeten, Lies; Schymkowitz, Joost; Rousseau, Frederic; Zhang, Louxin; Eisenhaber, Frank
2013-02-01
Data mining in protein databases, derivatives from more fundamental protein 3D structure and sequence databases, has considerable unearthed potential for the discovery of sequence motif--structural motif--function relationships as the finding of the U-shape (Huf-Zinc) motif, originally a small student's project, exemplifies. The metal ion zinc is critically involved in universal biological processes, ranging from protein-DNA complexes and transcription regulation to enzymatic catalysis and metabolic pathways. Proteins have evolved a series of motifs to specifically recognize and bind zinc ions. Many of these, so called zinc fingers, are structurally independent globular domains with discontinuous binding motifs made up of residues mostly far apart in sequence. Through a systematic approach starting from the BRIX structure fragment database, we discovered that there exists another predictable subset of zinc-binding motifs that not only have a conserved continuous sequence pattern but also share a characteristic local conformation, despite being included in totally different overall folds. While this does not allow general prediction of all Zn binding motifs, a HMM-based web server, Huf-Zinc, is available for prediction of these novel, as well as conventional, zinc finger motifs in protein sequences. The Huf-Zinc webserver can be freely accessed through this URL (http://mendel.bii.a-star.edu.sg/METHODS/hufzinc/).
Lee, Jae Hoon; Sundin, George W; Zhao, Youfu
2016-06-01
The type III secretion system (T3SS) is a key pathogenicity factor in Erwinia amylovora. Previous studies have demonstrated that the T3SS in E. amylovora is transcriptionally regulated by an RpoN-HrpL sigma factor cascade, which is activated by the bacterial alarmone (p)ppGpp. In this study, the binding site of HrpS, an enhancer binding protein, was identified for the first time in plant-pathogenic bacteria. Complementation of the hrpL mutant with promoter deletion constructs of the hrpL gene and promoter activity analyses using various lengths of the hrpL promoter fused to a promoter-less green fluorescent protein (gfp) reporter gene delineated the upstream region for HrpS binding. Sequence analysis revealed a dyad symmetry sequence between -138 and -125 nucleotides (TGCAA-N4-TTGCA) as the potential HrpS binding site, which is conserved in the promoter of the hrpL gene among plant enterobacterial pathogens. Results of quantitative real-time reverse transcription-polymerase chain reaction (qRT-PCR) and electrophoresis mobility shift assay coupled with site-directed mutagenesis (SDM) analysis showed that the intact dyad symmetry sequence was essential for HrpS binding, full activation of T3SS gene expression and virulence. In addition, the role of the GAYTGA motif (RpoN binding site) of HrpS in the regulation of T3SS gene expression in E. amylovora was characterized by complementation of the hrpS mutant using mutant variants generated by SDM. Results showed that a Y100F substitution of HrpS complemented the hrpS mutant, whereas Y100A and Y101A substitutions did not. These results suggest that tyrosine (Y) and phenylalanine (F) function interchangeably in the conserved GAYTGA motif of HrpS in E. amylovora. © 2015 BSPP AND JOHN WILEY & SONS LTD.
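The dyad symmetry described above (TGCAA-N4-TTGCA) means that the right arm is the reverse complement of the left arm, separated by a four-nucleotide spacer. The sketch below shows one way to scan a sequence for such inverted repeats; it is a hypothetical helper for illustration, not the analysis used in the study, and the example sequence is invented.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_dyads(seq: str, arm: int = 5, spacer: int = 4):
    """Find inverted repeats of the form ARM-N{spacer}-revcomp(ARM).

    Returns a list of (start, left_arm, spacer_seq, right_arm) tuples,
    with `start` as a 0-based index into `seq`.
    """
    seq = seq.upper()
    total = 2 * arm + spacer
    hits = []
    for i in range(len(seq) - total + 1):
        left = seq[i:i + arm]
        right = seq[i + arm + spacer:i + total]
        # Skip arms containing ambiguous bases such as N.
        if set(left) <= set("ACGT") and right == reverse_complement(left):
            hits.append((i, left, seq[i + arm:i + arm + spacer], right))
    return hits


if __name__ == "__main__":
    # Invented sequence with one TGCAA-N4-TTGCA element starting at index 2.
    print(find_dyads("GGTGCAACCGGTTGCAAT"))
```

With the default arm length of 5 and spacer of 4, each hit spans 14 nucleotides, which matches the extent of the element reported between positions -138 and -125.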
The role of RNA structure in the interaction of U1A protein with U1 hairpin II RNA
Law, Michael J.; Rice, Andrew J.; Lin, Patti; Laird-Offringa, Ite A.
2006-01-01
The N-terminal RNA Recognition Motif (RRM1) of the spliceosomal protein U1A interacting with its target U1 hairpin II (U1hpII) has been used as a paradigm for RRM-containing proteins interacting with their RNA targets. U1A binds to U1hpII via direct interactions with a 7-nucleotide (nt) consensus binding sequence at the 5′ end of a 10-nt loop, and via hydrogen bonds with the closing C–G base pair at the top of the RNA stem. Using surface plasmon resonance (Biacore), we have examined the role of structural features of U1hpII in binding to U1A RRM1. Mutational analysis of the closing base pair suggests it plays a minor role in binding and mainly prevents “breathing” of the loop. Lengthening the stem and nontarget part of the loop suggests that the increased negative charge of the RNA might slightly aid association. However, this is offset by an increase in dissociation, which may be caused by attraction of the RRM to nontarget parts of the RNA. Studies of a single stranded target and RNAs with untethered loops indicate that structure is not very relevant for association but is important for complex stability. In particular, breaking the link between the stem and the 5′ side of the loop greatly increases complex dissociation, presumably by hindering simultaneous contacts between the RRM and stem and loop nucleotides. While binding of U1A to a single stranded target is much weaker than to U1hpII, it occurs with nanomolar affinity, supporting recent evidence that binding of unstructured RNA by U1A has physiological significance. PMID:16738410
Plaga, W; Lottspeich, F; Oesterhelt, D
1992-04-01
An improved purification procedure, including nickel chelate affinity chromatography, is reported which resulted in a crystallizable pyruvate:ferredoxin oxidoreductase preparation from Halobacterium halobium. Crystals of the enzyme were obtained using potassium citrate as the precipitant. The genes coding for pyruvate:ferredoxin oxidoreductase were cloned and their nucleotide sequences determined. The genes of both subunits were adjacent to one another on the halobacterial genome. The derived amino acid sequences were confirmed by partial primary structure analysis of the purified protein. The structural motif of thiamin-diphosphate-binding enzymes was unequivocally located in the deduced amino acid sequence of the small subunit.
A genome-wide structure-based survey of nucleotide binding proteins in M. tuberculosis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bhagavat, Raghu; Kim, Heung -Bok; Kim, Chang -Yub
Nucleoside tri-phosphates (NTP) form an important class of small molecule ligands that participate in, and are essential to a large number of biological processes. Here, we seek to identify the NTP binding proteome (NTPome) in M. tuberculosis (M.tb), a deadly pathogen. Identifying the NTPome is useful not only for gaining functional insights of the individual proteins but also for identifying useful drug targets. From an earlier study, we had structural models of M.tb at a proteome scale from which a set of 13,858 small molecule binding pockets were identified. We use a set of NTP binding sub-structural motifs derived from a previous study and scan the M.tb pocketome, and find that 1,768 proteins or 43% of the proteome can theoretically bind NTP ligands. Using an experimental proteomics approach involving dye-ligand affinity chromatography, we confirm NTP binding to 47 different proteins, of which 4 are hypothetical proteins. Our analysis also provides the precise list of binding site residues in each case, and the probable ligand binding pose. In conclusion, as the list includes a number of known and potential drug targets, the identification of NTP binding can directly facilitate structure-based drug design of these targets.
A genome-wide structure-based survey of nucleotide binding proteins in M. tuberculosis
Bhagavat, Raghu; Kim, Heung -Bok; Kim, Chang -Yub; ...
2017-10-02
Nucleoside tri-phosphates (NTP) form an important class of small molecule ligands that participate in, and are essential to a large number of biological processes. Here, we seek to identify the NTP binding proteome (NTPome) in M. tuberculosis (M.tb), a deadly pathogen. Identifying the NTPome is useful not only for gaining functional insights of the individual proteins but also for identifying useful drug targets. From an earlier study, we had structural models of M.tb at a proteome scale from which a set of 13,858 small molecule binding pockets were identified. We use a set of NTP binding sub-structural motifs derived from a previous study and scan the M.tb pocketome, and find that 1,768 proteins or 43% of the proteome can theoretically bind NTP ligands. Using an experimental proteomics approach involving dye-ligand affinity chromatography, we confirm NTP binding to 47 different proteins, of which 4 are hypothetical proteins. Our analysis also provides the precise list of binding site residues in each case, and the probable ligand binding pose. In conclusion, as the list includes a number of known and potential drug targets, the identification of NTP binding can directly facilitate structure-based drug design of these targets.
Russo Krauss, Irene; Ramaswamy, Sneha; Neidle, Stephen; Haider, Shozeb; Parkinson, Gary N
2016-02-03
We report here on an X-ray crystallographic and molecular modeling investigation into the complex 3' interface formed between putative parallel stranded G-quadruplexes and a duplex DNA sequence constructed from the human telomeric repeat sequence TTAGGG. Our crystallographic approach provides a detailed snapshot of a telomeric 3' quadruplex-duplex junction: a junction that appears to have the potential to form a unique molecular target for small molecule binding and interference with telomere-related functions. This unique target is particularly relevant as current high-affinity compounds that bind putative G-quadruplex forming sequences only rarely have a high degree of selectivity for a particular quadruplex. Here DNA junctions were assembled using different putative quadruplex-forming scaffolds linked at the 3' end to a telomeric duplex sequence and annealed to a complementary strand. We successfully generated a series of G-quadruplex-duplex containing crystals, both alone and in the presence of ligands. The structures demonstrate the formation of a parallel folded G-quadruplex and a B-form duplex DNA stacked coaxially. Most strikingly, structural data reveals the consistent formation of a TAT triad platform between the two motifs. This triad allows for a continuous stack of bases to link the quadruplex motif with the duplex region. For these crystal structures formed in the absence of ligands, the TAT triad interface occludes ligand binding at the 3' quadruplex-duplex interface, in agreement with in silico docking predictions. However, with the rearrangement of a single nucleotide, a stable pocket can be produced, thus providing an opportunity for the binding of selective molecules at the interface.
Structure, function, and evolution of bacterial ATP-binding cassette systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
Davidson, A.L.; Dassa, E.; Orelle, C.
2010-07-27
The ATP-binding cassette (ABC) systems constitute one of the largest superfamilies of paralogous sequences. All ABC systems share a highly conserved ATP-hydrolyzing domain or protein (the ABC; also referred to as a nucleotide-binding domain [NBD]) that is unequivocally characterized by three short sequence motifs (Fig. 1): these are the Walker A and Walker B motifs, indicative of the presence of a nucleotide-binding site, and the signature motif, unique to ABC proteins, located upstream of the Walker B motif (426). Other motifs diagnostic of ABC proteins are also indicated in Fig. 1. The biological significance of these motifs is discussed in Structure, Function, and Dynamics of the ABC. ABC systems are widespread among living organisms and have been detected in all genera of the three kingdoms of life, with remarkable conservation in the primary sequence of the cassette and in the organization of the constitutive domains or subunits (203, 420). ABC systems couple the energy of ATP hydrolysis to an impressively large variety of essential biological phenomena, comprising not only transmembrane (TM) transport, for which they are best known, but also several non-transport-related processes, such as translation elongation (62) and DNA repair (174). Although ABC systems deserve much attention because they are involved in severe human inherited diseases (107), they were first discovered and characterized in detail in prokaryotes, as early as the 1970s (13, 148, 238, 468). The most extensively analyzed systems were the high-affinity histidine and maltose uptake systems of Salmonella enterica serovar Typhimurium and Escherichia coli.
Over 2 decades ago, after the completion of the nucleotide sequences encoding these transporters in the respective laboratories of Giovanna Ames and Maurice Hofnung, Hiroshi Nikaido and colleagues noticed that the two systems displayed a global similarity in the nature of their components and, moreover, that the primary sequences of MalK and HisP, the proteins suspected to energize these transporters, shared as much as 32% identity in amino acid residues when their sequences were aligned (171). Later, it was found that several bacterial proteins involved in uptake of nutrients, export of toxins, cell division, bacterial nodulation of plants, and DNA repair displayed the same similarity in their sequences (127, 196). This led to the notion that the conserved protein, which had been shown to bind ATP (198, 201), would probably energize the systems mentioned above by coupling the energy of ATP hydrolysis to transport. The latter was demonstrated with the maltose and histidine transporters by use of isolated membrane vesicles (105, 379) and purified transporters reconstituted into proteoliposomes (30, 98). The determination of the sequence of the first eukaryotic protein strongly similar to these bacterial transporters (the P-glycoprotein, involved in resistance of cancer cells to multiple drugs) (169, 179) demonstrated that these proteins were not restricted to prokaryotes. Two names, 'traffic ATPases' (15) and the more accepted name 'ABC transporters' (193, 218), were proposed for members of this new superfamily. ABC systems can be divided into three main functional categories, as follows. Importers mediate the uptake of nutrients in prokaryotes. The nature of the substrates that are transported is very wide, including mono- and oligosaccharides, organic and inorganic ions, amino acids, peptides, iron-siderophores, metals, polyamine cations, opines, and vitamins.
Exporters are involved in the secretion of various molecules, such as peptides, lipids, hydrophobic drugs, polysaccharides, and proteins, including toxins such as hemolysin. The third category of systems is apparently not involved in transport, with some members being involved in translation of mRNA and in DNA repair. Despite the large, diverse population of substrates handled and the difference in the polarity of transport, importers and exporters share a common organization made of two hydrophobic membrane-spanning or integral membrane (IM) domains and two hydrophilic domains carrying the ABC peripherally associated with the IM domains on the cytosolic side of the membrane (26). In importers, these four domains are almost always independent polypeptide chains that come together to form a multimeric complex. In most exporters, including the E. coli hemolysin exporter HlyB, the N-terminal IM and the C-terminal ABC domains are fused as a single polypeptide chain (IM-ABC). An inverted organization in which the IM domain is C-terminal with respect to the ABC domain (ABC-IM) exists, such as in the MacB protein, involved in macrolide resistance in E. coli. No IM domain partners have been identified for ABC proteins falling into the third category, and these proteins consist of two ABCs fused together (ABC2).
Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities
Narasimhan, Kamesh; Lambert, Samuel A; Yang, Ally WH; Riddell, Jeremy; Mnaimneh, Sanie; Zheng, Hong; Albu, Mihai; Najafabadi, Hamed S; Reece-Hoyes, John S; Fuxman Bass, Juan I; Walhout, Albertha JM; Weirauch, Matthew T; Hughes, Timothy R
2015-01-01
Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs. DOI: http://dx.doi.org/10.7554/eLife.06967.001 PMID:25905672
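The coverage figures quoted in this abstract can be rechecked with a few lines of arithmetic. The sketch below only recomputes percentages from the counts stated in the abstract (71 previously known, 129 measured, 292 after prediction, out of an estimated 763 TFs); the names `TOTAL_TFS`, `COUNTS`, and `coverage` are illustrative and do not come from the paper.

```python
# Counts taken from the abstract; all identifiers here are illustrative.
TOTAL_TFS = 763  # estimated number of sequence-specific TFs in C. elegans

COUNTS = {
    "motifs known before the study": 71,
    "motifs obtained by protein binding microarray": 129,
    "TFs covered after motif prediction": 292,
}


def coverage(n, total=TOTAL_TFS):
    """Return n as a percentage of total."""
    return 100.0 * n / total


if __name__ == "__main__":
    for label, n in COUNTS.items():
        print(f"{label}: {n}/{TOTAL_TFS} = {coverage(n):.1f}%")
```

The final ratio comes out at roughly 38%, which is consistent with the approximate "∼40%" given in the abstract.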
Kresoja-Rakic, Jelena; Felley-Bosco, Emanuela
2018-04-25
The in vitro RNA-pulldown is still largely used in the first steps of protocols aimed at identifying RNA-binding proteins that recognize specific RNA structures and motifs. In this RNA-pulldown protocol, commercially synthesized RNA probes are labeled with a modified form of biotin, desthiobiotin, at the 3' terminus of the RNA strand, which reversibly binds to streptavidin and thus allows elution of proteins under more physiological conditions. The RNA-desthiobiotin is immobilized through interaction with streptavidin on magnetic beads, which are used to pull down proteins that specifically interact with the RNA of interest. Non-denatured and active proteins from the cytosolic fraction of mesothelioma cells are used as the source of proteins. The method described here can be applied to detect the interaction between known RNA binding proteins and a 25-nucleotide (nt) long RNA probe containing a sequence of interest. This is useful to complete the functional characterization of stabilizing or destabilizing elements present in RNA molecules achieved using a reporter vector assay.
Principles of regulatory information conservation between mouse and human
Cheng, Yong; Ma, Zhihai; Kim, Bong-Hyun; ...
2014-11-19
To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human–mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Lastly, single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wallen,J.; Paige, C.; Mallett, T.
2008-01-01
We have recently reported that CoASH is the major low-molecular weight thiol in Bacillus anthracis, and we have now characterized the kinetic and redox properties of the B. anthracis coenzyme A-disulfide reductase (CoADR, BACoADR) and determined the crystal structure at 2.30 Angstroms resolution. While the Staphylococcus aureus and Borrelia burgdorferi CoADRs exhibit strong preferences for NADPH and NADH, respectively, B. anthracis CoADR can use either pyridine nucleotide equally well. Sequence elements within the respective NAD(P)H-binding motifs correctly reflect the preferences for S. aureus and Bo. burgdorferi CoADRs, but leave questions as to how BACoADR can interact with both pyridine nucleotides. The structures of the NADH and NADPH complexes at ca. 2.3 Angstroms resolution reveal that a loop consisting of residues Glu180-Thr187 becomes ordered and changes conformation on NAD(P)H binding. NADH and NADPH interact with nearly identical conformations of this loop; the latter interaction, however, involves a novel binding mode in which the 2'-phosphate of NADPH points out toward solvent. In addition, the NAD(P)H-reduced BACoADR structures provide the first view of the reduced form (Cys42-SH/CoASH) of the Cys42-SSCoA redox center. The Cys42-SH side chain adopts a new conformation in which the conserved Tyr367'-OH and Tyr425'-OH interact with the nascent thiol(ate) on the flavin si-face. Kinetic data with Y367F, Y425F, and Y367, 425F BACoADR mutants indicate that Tyr425' is the primary proton donor in catalysis, with Tyr367' functioning as a cryptic alternate donor in the absence of Tyr425'.
Nirasawa, Satoru; Nakahara, Kazuhiko; Takahashi, Saori
2018-02-27
Paenidase is the first microorganism-derived D-aspartyl endopeptidase that specifically recognizes an internal D-Asp residue to cleave [D-Asp]-X peptide bonds. Using peptide sequences obtained from the protein, we performed PCR with degenerate primers to amplify the paenidase I-encoding gene. Nucleotide sequencing revealed that mature paenidase I consists of 322 amino acid residues and that the protein is encoded as a pro-protein with a 197-amino-acid N-terminal extension compared to the mature protein. Paenidase I exhibits amino acid sequence similarity to several penicillin-binding proteins. In addition, paenidase I was classified into peptidase family S12 based on a MEROPS database search. Family S12 contains serine-type D-Ala-D-Ala carboxypeptidases that have three active site residues (Ser, Lys, and Tyr) in the conserved motifs Ser-Xaa-Thr-Lys and Tyr-Xaa-Asn. These motifs were conserved in the primary structure of paenidase I, and the role of these residues was confirmed by site-directed mutagenesis.
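The two conserved family S12 motifs named in this abstract, Ser-Xaa-Thr-Lys and Tyr-Xaa-Asn, are short enough to locate with a regular expression. The following is a minimal sketch, assuming one-letter amino acid codes with Xaa standing for any residue; the function name `find_motifs` and the toy sequence are hypothetical, and the toy sequence is not the paenidase I sequence.

```python
import re

# One-letter patterns for the motifs named in the abstract.
# "." stands for Xaa (any residue). Names are illustrative only.
MOTIFS = {
    "Ser-Xaa-Thr-Lys": "S.TK",
    "Tyr-Xaa-Asn": "Y.N",
}


def find_motifs(sequence):
    """Return {motif name: [1-based start positions]} for `sequence`."""
    seq = sequence.upper()
    hits = {}
    for name, pattern in MOTIFS.items():
        # A lookahead reports overlapping occurrences as well.
        hits[name] = [m.start() + 1 for m in re.finditer(f"(?={pattern})", seq)]
    return hits


if __name__ == "__main__":
    toy = "MASLTKGGYANLLSATKQ"  # made-up sequence, for illustration only
    print(find_motifs(toy))
```

A pattern match of this kind only flags candidate positions; whether the residues actually form the active site has to be established experimentally, as the abstract reports was done by site-directed mutagenesis.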
Tran, Tuan; Disney, Matthew D
2012-01-01
RNA is an important therapeutic target but information about RNA-ligand interactions is limited. Here, we report a screening method that probes over 3,000,000 combinations of RNA motif-small molecule interactions to identify the privileged RNA structures and chemical spaces that interact. Specifically, a small molecule library biased for binding RNA was probed for binding to over 70,000 unique RNA motifs in a high throughput solution-based screen. The RNA motifs that specifically bind each small molecule were identified by microarray-based selection. In this library-versus-library or multidimensional combinatorial screening approach, hairpin loops (among a variety of RNA motifs) were the preferred RNA motif space that binds small molecules. Furthermore, it was shown that indole, 2-phenyl indole, 2-phenyl benzimidazole and pyridinium chemotypes allow for specific recognition of RNA motifs. As targeting RNA with small molecules is an extremely challenging area, these studies provide new information on RNA-ligand interactions that has many potential uses.
Tran, Tuan; Disney, Matthew D.
2012-01-01
RNA is an important therapeutic target but information about RNA-ligand interactions is limited. Here we report a screening method that probes over 3,000,000 combinations of RNA motif-small molecule interactions to identify the privileged RNA structures and chemical spaces that interact. Specifically, a small molecule library biased for binding RNA was probed for binding to over 70,000 unique RNA motifs in a high throughput solution-based screen. The RNA motifs that specifically bind each small molecule were identified by microarray-based selection. In this library-versus-library or multidimensional combinatorial screening approach, hairpin loops (amongst a variety of RNA motifs) were the preferred RNA motif space that binds small molecules. Furthermore, it was shown that indole, 2-phenyl indole, 2-phenyl benzimidazole, and pyridinium chemotypes allow for specific recognition of RNA motifs. Since targeting RNA with small molecules is an extremely challenging area, these studies provide new information on RNA-ligand interactions that has many potential uses. PMID:23047683
Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins
Kinjo, Akira R.; Nakamura, Haruki
2012-01-01
Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. PMID:22347478
Livingstone, Mark; Folkman, Lukas; Yang, Yuedong; Zhang, Ping; Mort, Matthew; Cooper, David N; Liu, Yunlong; Stantic, Bela; Zhou, Yaoqi
2017-10-01
Synonymous single-nucleotide variants (SNVs), although they do not alter the encoded protein sequences, have been implicated in many genetic diseases. Experimental studies indicate that synonymous SNVs can lead to changes in the secondary and tertiary structures of DNA and RNA, thereby affecting translational efficiency, cotranslational protein folding as well as the binding of DNA-/RNA-binding proteins. However, the importance of these various features in disease phenotypes is not clearly understood. Here, we have built a support vector machine (SVM) model (termed DDIG-SN) as a means to discriminate disease-causing synonymous variants. The model was trained and evaluated on nearly 900 disease-causing variants. The method achieves robust performance with the area under the receiver operating characteristic curve of 0.84 and 0.85 for protein-stratified 10-fold cross-validation and independent testing, respectively. We were able to show that the disease-causing effects in the immediate proximity to exon-intron junctions (1-3 bp) are driven by the loss of splicing motif strength, whereas the gain of splicing motif strength is the primary cause in regions further away from the splice site (4-69 bp). The method is available as a part of the DDIG server at http://sparks-lab.org/ddig. © 2017 Wiley Periodicals, Inc.
Harrison, Thomas; Ruiz, Jaime; Sloan, Daniel B.; Ben-Hur, Asa; Boucher, Christina
2016-01-01
Pentatricopeptide repeat containing proteins (PPRs) bind to RNA transcripts originating from mitochondria and plastids. There are two classes of PPR proteins. The P class contains tandem P-type motif sequences, and the PLS class contains alternating P, L and S type sequences. In this paper, we describe a novel tool that predicts PPR-RNA interaction; specifically, our method, which we call aPPRove, determines where and how a PLS-class PPR protein will bind to RNA when given a PPR and one or more RNA transcripts by using a combinatorial binding code for site specificity proposed by Barkan et al. Our results demonstrate that aPPRove successfully locates how and where a PPR protein belonging to the PLS class can bind to RNA. For each binding event it outputs the binding site, the amino-acid-nucleotide interaction, and its statistical significance. Furthermore, we show that our method can be used to predict binding events for PLS-class proteins using a known edit site and the statistical significance of aligning the PPR protein to that site. In particular, we use our method to make a conjecture regarding an interaction between CLB19 and the second intronic region of ycf3. The aPPRove web server can be found at www.cs.colostate.edu/~approve. PMID:27560805
DOE Office of Scientific and Technical Information (OSTI.GOV)
K Kucera; A Koblansky; L Saunders
Profilins promote actin polymerization by exchanging ADP for ATP on monomeric actin and delivering ATP-actin to growing filament barbed ends. Apicomplexan protozoa such as Toxoplasma gondii invade host cells using an actin-dependent gliding motility. Toll-like receptor (TLR) 11 generates an innate immune response upon sensing T. gondii profilin (TgPRF). The crystal structure of TgPRF reveals a parasite-specific surface motif consisting of an acidic loop, followed by a long β-hairpin. A series of structure-based profilin mutants show that TLR11 recognition of the acidic loop is responsible for most of the interleukin (IL)-12 secretion response to TgPRF in peritoneal macrophages. Deletion of both the acidic loop and the β-hairpin completely abrogates IL-12 secretion. Insertion of the T. gondii acidic loop and β-hairpin into yeast profilin is sufficient to generate TLR11-dependent signaling. Substitution of the acidic loop in TgPRF with the homologous loop from the apicomplexan parasite Cryptosporidium parvum does not affect TLR11-dependent IL-12 secretion, while substitution with the acidic loop from Plasmodium falciparum results in reduced but significant IL-12 secretion. We conclude that the parasite-specific motif in TgPRF is the key molecular pattern recognized by TLR11. Unlike other profilins, TgPRF slows nucleotide exchange on monomeric rabbit actin and binds rabbit actin weakly. The putative TgPRF actin-binding surface includes the β-hairpin and diverges widely from the actin-binding surfaces of vertebrate profilins.
Gachon, F; Thebault, S; Peleraux, A; Devaux, C; Mesnard, J M
2000-05-01
The human T-cell leukemia virus type 1 (HTLV-1) Tax protein activates viral transcription through three 21-bp repeats located in the U3 region of the HTLV-1 long terminal repeat and called Tax-responsive elements (TxREs). Each TxRE contains nucleotide sequences corresponding to imperfect cyclic AMP response elements (CRE). In this study, we demonstrate that the bZIP transcriptional factor CREB-2 is able to bind in vitro to the TxREs and that CREB-2 binding to each of the 21-bp motifs is enhanced by Tax. We also demonstrate that Tax can weakly interact with CREB-2 bound to a cellular palindromic CRE motif such as that found in the somatostatin promoter. Mutagenesis of Tax and CREB-2 demonstrates that both N- and C-terminal domains of Tax and the C-terminal region of CREB-2 are required for direct interaction between the two proteins. In addition, the Tax mutant M47, defective for HTLV-1 activation, is unable to form in vitro a ternary complex with CREB-2 and TxRE. In agreement with recent results suggesting that Tax can recruit the coactivator CREB-binding protein (CBP) on the HTLV-1 promoter, we provide evidence that Tax, CREB-2, and CBP are capable of cooperating to stimulate viral transcription. Taken together, our data highlight the major role played by CREB-2 in Tax-mediated transactivation.
Gachon, Frederic; Thebault, Sabine; Peleraux, Annick; Devaux, Christian; Mesnard, Jean-Michel
2000-01-01
The human T-cell leukemia virus type 1 (HTLV-1) Tax protein activates viral transcription through three 21-bp repeats located in the U3 region of the HTLV-1 long terminal repeat and called Tax-responsive elements (TxREs). Each TxRE contains nucleotide sequences corresponding to imperfect cyclic AMP response elements (CRE). In this study, we demonstrate that the bZIP transcriptional factor CREB-2 is able to bind in vitro to the TxREs and that CREB-2 binding to each of the 21-bp motifs is enhanced by Tax. We also demonstrate that Tax can weakly interact with CREB-2 bound to a cellular palindromic CRE motif such as that found in the somatostatin promoter. Mutagenesis of Tax and CREB-2 demonstrates that both N- and C-terminal domains of Tax and the C-terminal region of CREB-2 are required for direct interaction between the two proteins. In addition, the Tax mutant M47, defective for HTLV-1 activation, is unable to form in vitro a ternary complex with CREB-2 and TxRE. In agreement with recent results suggesting that Tax can recruit the coactivator CREB-binding protein (CBP) on the HTLV-1 promoter, we provide evidence that Tax, CREB-2, and CBP are capable of cooperating to stimulate viral transcription. Taken together, our data highlight the major role played by CREB-2 in Tax-mediated transactivation. PMID:10779337
Identification and preliminary characterization of a protein motif related to the zinc finger.
Lovering, R; Hanson, I M; Borden, K L; Martin, S; O'Reilly, N J; Evan, G I; Rahman, D; Pappin, D J; Trowsdale, J; Freemont, P S
1993-01-01
We have identified a protein motif, related to the zinc finger, which defines a newly discovered family of proteins. The motif was found in the sequence of the human RING1 gene, which is proximal to the major histocompatibility complex region on chromosome six. We propose naming this motif the "RING finger" and it is found in 27 proteins, all of which have putative DNA binding functions. We have synthesized a peptide corresponding to the RING1 motif and examined a number of properties, including metal and DNA binding. We provide evidence to support the suggestion that the RING finger motif is the DNA binding domain of this newly defined family of proteins. PMID:7681583
Maurer, B; Bannert, H; Darai, G; Flügel, R M
1988-01-01
The nucleotide sequence of the human spumaretrovirus (HSRV) genome was determined. The 5' long terminal repeat region was analyzed by strong stop cDNA synthesis and S1 nuclease mapping. The length of the RU5 region was determined and found to be 346 nucleotides long. The 5' long terminal repeat is 1,123 base pairs long and is bound by an 18-base-pair primer-binding site complementary to the 3' end of mammalian lysine-1,2-specific tRNA. Open reading frames for gag and pol genes were identified. Surprisingly, the HSRV gag protein does not contain the cysteine motif of the nucleic acid-binding proteins found in and typical of all other retroviral gag proteins; instead the HSRV gag gene encodes a strongly basic protein reminiscent of those of hepatitis B virus and retrotransposons. The carboxy-terminal part of the HSRV gag gene products encodes a protease domain. The pol gene overlaps the gag gene and is postulated to be synthesized as a gag/pol precursor via translational frameshifting analogous to that of Rous sarcoma virus, with 7 nucleotides immediately upstream of the termination codons of gag conserved between the two viral genomes. The HSRV pol gene is 2,730 nucleotides long, and its deduced protein sequence is readily subdivided into three well-conserved domains, the reverse transcriptase, the RNase H, and the integrase. Although the degree of homology of the HSRV reverse transcriptase domain is highest to that of murine leukemia virus, the HSRV genomic organization is more similar to that of human and simian immunodeficiency viruses. The data justify classifying the spumaretroviruses as a third subfamily of Retroviridae. PMID:2451755
The Rho ADP-ribosylating C3 exoenzyme binds cells via an Arg-Gly-Asp motif.
Rohrbeck, Astrid; Höltje, Markus; Adolf, Andrej; Oms, Elisabeth; Hagemann, Sandra; Ahnert-Hilger, Gudrun; Just, Ingo
2017-10-27
The Rho ADP-ribosylating C3 exoenzyme (C3bot) is a bacterial protein toxin devoid of a cell-binding or -translocation domain. Nevertheless, C3 can efficiently enter intact cells, including neurons, but the mechanism of C3 binding and uptake is not yet understood. Previously, we identified the intermediate filament vimentin as an extracellular membranous interaction partner of C3. However, uptake of C3 into cells still occurs (although reduced) in the absence of vimentin, indicating involvement of an additional host cell receptor. C3 harbors an Arg-Gly-Asp (RGD) motif, which is the major integrin-binding site, present in a variety of integrin ligands. To check whether the RGD motif of C3 is involved in binding to cells, we performed a competition assay with C3 and RGD peptide or with a monoclonal antibody binding to β1-integrin subunit and binding assays in different cell lines, primary neurons, and synaptosomes with C3-RGD mutants. Here, we report that preincubation of cells with the GRGDNP peptide strongly reduced C3 binding to cells. Moreover, mutation of the RGD motif reduced C3 binding to intact cells and also to recombinant vimentin. Anti-integrin antibodies also lowered the C3 binding to cells. Our results indicate that the RGD motif of C3 is at least one essential C3 motif for binding to host cells and that integrin is an additional receptor for C3 besides vimentin. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Molecular analysis of the human SLC13A4 sulfate transporter gene promoter
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jefferis, J.; Rakoczy, J.; School of Biomedical Sciences, University of Queensland, St. Lucia, Queensland
2013-03-29
Highlights: ► Basal promoter activity of SLC13A4 −57 to −192 nt upstream of transcription initiation site. ► Human SLC13A4 5′-flanking region has conserved motifs with other placental species. ► Putative NFY, SP1 and KLF7 motifs in SLC13A4 5′-flanking region enhance transcription. -- Abstract: The human solute linked carrier (SLC) 13A4 gene is primarily expressed in the placenta where it is proposed to mediate the transport of nutrient sulfate from mother to fetus. The molecular mechanisms involved in the regulation of SLC13A4 expression remain unknown. To investigate the regulation of SLC13A4 gene expression, we analysed the transcriptional activity of the human SLC13A4 5′-flanking region in the JEG-3 placental cell line using luciferase reporter assays. Basal transcriptional activity was identified in the region −57 to −192 nucleotides upstream of the SLC13A4 transcription initiation site. Mutational analysis of the minimal promoter region identified Nuclear factor Y (NFY), Specificity protein 1 (SP1) and Krüppel like factor 7 (KLF7) motifs which conferred positive transcriptional activity, as well as Zinc finger protein of the cerebellum 2 (ZIC2) and helix–loop–helix protein 1 (HEN1) motifs that repressed transcription. The conservation of the NFY, SP1, KLF7, ZIC2 and HEN1 motifs in the SLC13A4 promoter of placental species but not in non-placental species suggests a potential role for these putative transcriptional factor binding motifs in the physiological control of SLC13A4 mRNA expression.
Improved prediction of MHC class I and class II epitopes using a novel Gibbs sampling approach.
Nielsen, Morten; Lundegaard, Claus; Worning, Peder; Hvid, Christina Sylvester; Lamberth, Kasper; Buus, Søren; Brunak, Søren; Lund, Ole
2004-06-12
Prediction of which peptides will bind a specific major histocompatibility complex (MHC) constitutes an important step in identifying potential T-cell epitopes suitable as vaccine candidates. MHC class II binding peptides have a broad length distribution complicating such predictions. Thus, identifying the correct alignment is a crucial part of identifying the core of an MHC class II binding motif. In this context, we wish to describe a novel Gibbs motif sampler method ideally suited for recognizing such weak sequence motifs. The method is based on the Gibbs sampling method, and it incorporates novel features optimized for the task of recognizing the binding motif of MHC classes I and II. The method locates the binding motif in a set of sequences and characterizes the motif in terms of a weight-matrix. Subsequently, the weight-matrix can be applied to identifying effectively potential MHC binding peptides and to guiding the process of rational vaccine design. We apply the motif sampler method to the complex problem of MHC class II binding. The input to the method is amino acid peptide sequences extracted from the public databases of SYFPEITHI and MHCPEP and known to bind to the MHC class II complex HLA-DR4(B1*0401). Prior identification of information-rich (anchor) positions in the binding motif is shown to improve the predictive performance of the Gibbs sampler. Similarly, a consensus solution obtained from an ensemble average over suboptimal solutions is shown to outperform the use of a single optimal solution. In a large-scale benchmark calculation, the performance is quantified using relative operating characteristics curve (ROC) plots and we make a detailed comparison of the performance with that of both the TEPITOPE method and a weight-matrix derived using the conventional alignment algorithm of ClustalW. 
The calculation demonstrates that the predictive performance of the Gibbs sampler is higher than that of ClustalW and in most cases also higher than that of the TEPITOPE method.
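The weight-matrix step described in the abstract above can be illustrated with a minimal Python sketch. It covers only how a log-odds matrix built from already aligned binding cores is used to score and locate the best core in a new peptide; it is not the Gibbs sampler itself and not the authors' implementation. The aligned cores, the uniform background frequency, the pseudocount and the function names are all assumptions made for illustration.

```python
import math
from collections import Counter

# Hypothetical toy alignment of 9-residue binding cores (not real data).
ALIGNED_CORES = ["FVKQNAAAL", "YVKQNTLKL", "FRKQNPGSL", "WVKQSAAAL"]
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_weight_matrix(cores, pseudocount=1.0):
    """Return one {residue: log-odds score} dict per alignment column.

    Assumes a uniform background frequency of 1/20 for simplicity.
    """
    background = 1.0 / len(AMINO_ACIDS)
    total = len(cores) + pseudocount * len(AMINO_ACIDS)
    matrix = []
    for pos in range(len(cores[0])):
        counts = Counter(core[pos] for core in cores)
        column = {
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        }
        matrix.append(column)
    return matrix


def best_core(peptide, matrix):
    """Slide the matrix along a peptide; return (score, offset, core) of the best window."""
    width = len(matrix)
    best = None
    for start in range(len(peptide) - width + 1):
        window = peptide[start:start + width]
        score = sum(matrix[i][aa] for i, aa in enumerate(window))
        if best is None or score > best[0]:
            best = (score, start, window)
    return best


matrix = build_weight_matrix(ALIGNED_CORES)
score, offset, core = best_core("GGFVKQNAAALGG", matrix)
```

In a sampler of the kind the abstract describes, the alignment itself would be revised iteratively rather than fixed in advance as it is here.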
Contessa, Gian Marco; Orsale, Maria; Melino, Sonia; Torre, Vincent; Paci, Maurizio; Desideri, Alessandro; Cicero, Daniel O
2005-03-01
The NMR high-resolution structure of calmodulin complexed with a fragment of the olfactory cyclic-nucleotide gated channel is described. This structure shows features that are unique for this complex, including an active role of the linker connecting the N- and C-lobes of calmodulin upon binding of the peptide. This linker is not only involved in the formation of a hydrophobic pocket to accommodate a bulky peptide residue, but it also provides a positively charged region complementary to a negative charge of the target. This complex of calmodulin with a target not belonging to the kinase family was used to test the residual dipolar coupling (RDC) approach for the determination of calmodulin binding modes to peptides. Although the complex characterized here belongs to the (1--14) family, high Q values were obtained with all the 1:1 complexes for which crystalline structures are available. Reduction of the RDC data set used for the correlation analysis to structured regions of the complex allowed a clear identification of the binding mode. Excluded regions comprise calcium binding loops and loops connecting the EF-hand motifs.
A structural-alphabet-based strategy for finding structural motifs across protein families
Wu, Chih Yuan; Chen, Yao Chi; Lim, Carmay
2010-01-01
Proteins with insignificant sequence and overall structure similarity may still share locally conserved contiguous structural segments; i.e. structural/3D motifs. Most methods for finding 3D motifs require a known motif to search for other similar structures or functionally/structurally crucial residues. Here, without requiring a query motif or essential residues, a fully automated method for discovering 3D motifs of various sizes across protein families with different folds based on a 16-letter structural alphabet is presented. It was applied to structurally non-redundant proteins bound to DNA, RNA, obligate/non-obligate proteins as well as free DNA-binding proteins (DBPs) and proteins with known structures but unknown function. Its usefulness was illustrated by analyzing the 3D motifs found in DBPs. A non-specific motif was found with a ‘corner’ architecture that confers a stable scaffold and enables diverse interactions, making it suitable for binding not only DNA but also RNA and proteins. Furthermore, DNA-specific motifs present ‘only’ in DBPs were discovered. The motifs found can provide useful guidelines in detecting binding sites and computational protein redesign. PMID:20525797
Affinity, Avidity, and Kinetics of Target Sequence Binding to LC8 Dynein Light Chain Isoforms*
Radnai, László; Rapali, Péter; Hódi, Zsuzsa; Süveges, Dániel; Molnár, Tamás; Kiss, Bence; Bécsi, Bálint; Erdödi, Ferenc; Buday, László; Kardos, József; Kovács, Mihály; Nyitray, László
2010-01-01
LC8 dynein light chain (DYNLL) is a highly conserved eukaryotic hub protein with dozens of binding partners and various functions beyond being a subunit of dynein and myosin Va motor proteins. Here, we compared the kinetic and thermodynamic parameters of binding of both mammalian isoforms, DYNLL1 and DYNLL2, to two putative consensus binding motifs (KXTQTX and XG(I/V)QVD) and report only subtle differences. Peptides containing either of the above motifs bind to DYNLL2 with micromolar affinity, whereas a myosin Va peptide (lacking the conserved Gln) and the noncanonical Pak1 peptide bind with Kd values of 9 and 40 μM, respectively. Binding of the KXTQTX motif is enthalpy-driven, although that of all other peptides is both enthalpy- and entropy-driven. Moreover, the KXTQTX motif shows a strikingly slower off-rate constant than the other motifs. As most DYNLL partners are homodimeric, we also assessed the binding of bivalent ligands to DYNLL2. Compared with monovalent ligands, a significant avidity effect was found as follows: Kd values of 37 and 3.5 nM for a dimeric myosin Va fragment and a Leu zipper dimerized KXTQTX motif, respectively. Ligand binding kinetics of DYNLL can best be described by a conformational selection model consisting of a slow isomerization and a rapid binding step. We also studied the binding of the phosphomimetic S88E mutant of DYNLL2 to the dimeric myosin Va fragment, and we found a significantly lower apparent Kd value (3 μM). We conclude that the thermodynamic and kinetic fine-tuning of binding of various ligands to DYNLL could have physiological relevance in its interaction network. PMID:20889982
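To make the reported avidity effect concrete, the following Python sketch converts dissociation constants into a fold change and a standard binding free energy. It assumes simple 1:1 equilibrium binding, a temperature of 298.15 K and a 1 M reference state; none of these are stated in the abstract, which favors a conformational selection model for the kinetics. The comparison of the 9 μM myosin Va peptide with the 37 nM dimeric myosin Va fragment is illustrative only, since the two constructs may differ in more than valency.

```python
import math

R_KJ_PER_MOL_K = 8.314e-3  # gas constant in kJ/(mol*K)


def binding_free_energy(kd_molar, temperature_k=298.15):
    """Standard binding free energy dG = RT ln(Kd), Kd in mol/L (1 M reference state)."""
    return R_KJ_PER_MOL_K * temperature_k * math.log(kd_molar)


def fraction_bound(free_ligand_molar, kd_molar):
    """Fraction of occupied sites for simple 1:1 binding at a given free ligand concentration."""
    return free_ligand_molar / (kd_molar + free_ligand_molar)


# Kd values quoted in the abstract (monovalent peptide vs. dimeric fragment).
kd_monovalent = 9e-6   # 9 uM, myosin Va peptide
kd_bivalent = 37e-9    # 37 nM, dimeric myosin Va fragment

# Fold decrease in Kd when going from the monovalent to the bivalent ligand.
avidity_gain = kd_monovalent / kd_bivalent

# Difference in binding free energy between the two ligands (kJ/mol, negative = tighter).
ddg = binding_free_energy(kd_bivalent) - binding_free_energy(kd_monovalent)
```

At a free ligand concentration equal to Kd, `fraction_bound` returns 0.5, which is the usual operational definition of the dissociation constant.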
Xu, Hongyun; Shi, Xinxin; Wang, Zhibo; Gao, Caiqiu; Wang, Chao; Wang, Yucheng
2017-08-01
WRKY transcription factors play important roles in many biological processes, and mainly bind to the W-box element to regulate gene expression. Previously, we characterized a WRKY gene from Tamarix hispida, ThWRKY4, in response to abiotic stress, and showed that it bound to the W-box motif. However, whether ThWRKY4 could bind to other motifs remains unknown. In this study, we employed a Transcription Factor-Centered Yeast one Hybrid (TF-Centered Y1H) screen to study the motifs recognized by ThWRKY4. In addition to the W-box core cis-element (termed W-box), we identified that ThWRKY4 could bind to two other motifs: the RAV1A element (CAACA) and a novel motif with sequence of GTCTA (W-box like sequence, WLS). The distributions of these motifs were screened in the promoter regions of genes regulated by some WRKYs. The results showed that the W-box, RAV1A, and WLS motifs were all present in high numbers, suggesting that they play key roles in gene expression mediated by WRKYs. Furthermore, five WRKY proteins from different WRKY subfamilies in Arabidopsis thaliana were selected and confirmed to bind to the RAV1A and WLS motifs, indicating that they are recognized commonly by WRKYs. These findings will help to further reveal the functions of WRKY proteins. Copyright © 2017 Elsevier B.V. All rights reserved.
Zehra, Rabail; Abbasi, Amir Ali
2018-03-01
Empirical assessments of human accelerated noncoding DNA fragments have delineated the presence of many cis-regulatory elements. Enhancers make up an important category of such accelerated cis-regulatory elements that efficiently control the spatiotemporal expression of many developmental genes. Establishing plausible reasons for accelerated enhancer sequence divergence in Homo sapiens has been termed significant in various previously published studies. This acceleration by including closely related primates and archaic human data has the potential to open up evolutionary avenues for deducing present-day brain structure. This study relied on empirically confirmed brain exclusive enhancers to avoid any misjudgments about their regulatory status and categorized among them a subset of enhancers with an exceptionally accelerated rate of lineage specific divergence in humans. In this assorted set, 13 distinct transcription factor binding sites were located that possessed unique existence in humans. Three of 13 such sites belonging to transcription factors SOX2, RUNX1/3, and FOS/JUND possessed single nucleotide variants that made them unique to H. sapiens upon comparisons with Neandertal and Denisovan orthologous sequences. These variants modifying the binding sites in modern human lineage were further substantiated as single nucleotide polymorphisms via exploiting 1000 Genomes Project Phase3 data. Long range haplotype based tests laid out evidence of positive selection to be governing in African population on two of the modern human motif modifying alleles with strongest results for SOX2 binding site. In sum, our study acknowledges acceleration in noncoding regulatory landscape of the genome and highlights functional parts within it to have undergone accelerated divergence in present-day human population.
Nicholson, Judith; Scherl, Alex; Way, Luke; Blackburn, Elizabeth A; Walkinshaw, Malcolm D; Ball, Kathryn L; Hupp, Ted R
2014-06-01
Linear motifs mediate protein-protein interactions (PPI) that allow expansion of a target protein interactome at a systems level. This study uses a proteomics approach and linear motif sub-stratifications to expand on PPIs of MDM2. MDM2 is a multi-functional protein with over one hundred known binding partners not stratified by hierarchy or function. A new linear motif based on a MDM2 interaction consensus is used to select novel MDM2 interactors based on Nutlin-3 responsiveness in a cell-based proteomics screen. MDM2 binds a subset of peptide motifs corresponding to real proteins with a range of allosteric responses to MDM2 ligands. We validate cyclophilin B as a novel protein with a consensus MDM2 binding motif that is stabilised by Nutlin-3 in vivo, thus identifying one of the few known interactors of MDM2 that is stabilised by Nutlin-3. These data invoke two modes of peptide binding at the MDM2 N-terminus that rely on a consensus core motif to control the equilibrium between MDM2 binding proteins. This approach stratifies MDM2 interacting proteins based on the linear motif feature and provides a new biomarker assay to define clinically relevant Nutlin-3 responsive MDM2 interactors. Copyright © 2014 Elsevier Inc. All rights reserved.
Zhang, Lu; Xu, Jinhao; Ma, Jinbiao
2016-07-25
RNA-binding proteins exert important biological functions by specifically recognizing RNA motifs. SELEX (Systematic evolution of ligands by exponential enrichment), an in vitro selection method, can obtain consensus motifs with high affinity and specificity for many target molecules from DNA or RNA libraries. Here, we combined SELEX with next-generation sequencing to study protein-RNA interactions in vitro. A pool of RNAs with 20 bp random sequences was transcribed from a T7 promoter, and the target protein was inserted into a plasmid containing an SBP-tag, which can be captured by streptavidin beads. After only one cycle, the specific RNA motif could be obtained, which dramatically improved the selection efficiency. Using this method, we found that the human hnRNP A1 RRMs domain (UP1 domain) bound RNA motifs containing AGG and AG sequences. The EMSA experiment indicated that hnRNP A1 RRMs could bind the obtained RNA motif. Taken together, this method provides a rapid and effective way to study the RNA binding specificity of proteins.
2012-01-01
Background Discovery of functionally significant short, statistically overrepresented subsequence patterns (motifs) in a set of sequences is a challenging problem in bioinformatics. Oftentimes, not all sequences in the set contain a motif. These non-motif-containing sequences complicate the algorithmic discovery of motifs. Filtering the non-motif-containing sequences from the larger set of sequences while simultaneously determining the identity of the motif is, therefore, desirable and a non-trivial problem in motif discovery research. Results We describe MotifCatcher, a framework that extends the sensitivity of existing motif-finding tools by employing random sampling to effectively remove non-motif-containing sequences from the motif search. We developed two implementations of our algorithm; each built around a commonly used motif-finding tool, and applied our algorithm to three diverse chromatin immunoprecipitation (ChIP) data sets. In each case, the motif finder with the MotifCatcher extension demonstrated improved sensitivity over the motif finder alone. Our approach organizes candidate functionally significant discovered motifs into a tree, which allowed us to make additional insights. In all cases, we were able to support our findings with experimental work from the literature. Conclusions Our framework demonstrates that additional processing at the sequence entry level can significantly improve the performance of existing motif-finding tools. For each biological data set tested, we were able to propose novel biological hypotheses supported by experimental work from the literature. Specifically, in Escherichia coli, we suggested binding site motifs for 6 non-traditional LexA protein binding sites; in Saccharomyces cerevisiae, we hypothesize 2 disparate mechanisms for novel binding sites of the Cse4p protein; and in Halobacterium sp. NRC-1, we discovered subtle differences in a general transcription factor (GTF) binding site motif across several data sets. 
We suggest that small differences in our discovered motif could confer specificity for one or more homologous GTF proteins. We offer a free implementation of the MotifCatcher software package at http://www.bme.ucdavis.edu/facciotti/resources_data/software/. PMID:23181585
Karnik, Rahul; Beer, Michael A.
2015-01-01
The generation of genomic binding or accessibility data from massively parallel sequencing technologies such as ChIP-seq and DNase-seq continues to accelerate. Yet state-of-the-art computational approaches for the identification of DNA binding motifs often yield motifs of weak predictive power. Here we present a novel computational algorithm called MotifSpec, designed to find predictive motifs, in contrast to over-represented sequence elements. The key distinguishing feature of this algorithm is that it uses a dynamic search space and a learned threshold to find discriminative motifs in combination with the modeling of motifs using a full PWM (position weight matrix) rather than k-mer words or regular expressions. We demonstrate that our approach finds motifs corresponding to known binding specificities in several mammalian ChIP-seq datasets, and that our PWMs classify the ChIP-seq signals with accuracy comparable to, or marginally better than motifs from the best existing algorithms. In other datasets, our algorithm identifies novel motifs where other methods fail. Finally, we apply this algorithm to detect motifs from expression datasets in C. elegans using a dynamic expression similarity metric rather than fixed expression clusters, and find novel predictive motifs. PMID:26465884
Bioinformatics Analysis of NBS-LRR Encoding Resistance Genes in Setaria italica.
Zhao, Yan; Weng, Qiaoyun; Song, Jinhui; Ma, Hailian; Yuan, Jincheng; Dong, Zhiping; Liu, Yinghui
2016-06-01
In plants, resistance (R) genes are involved in pathogen recognition and subsequent activation of innate immune responses. The nucleotide-binding site-leucine-rich repeat (NBS-LRR) gene family forms the largest R-gene family among plant genomes and plays an important role in plant disease resistance. In this paper, a comprehensive analysis of NBS-encoding genes is performed in the whole Setaria italica genome. A total of 96 NBS-LRR genes are identified, and a comprehensive overview of the NBS-LRR genes is undertaken, including phylogenetic analysis, chromosome locations, conserved motifs of proteins, and gene expression. Based on the domain, these genes are divided into two groups and distributed in all Setaria italica chromosomes. Most NBS-LRR genes are located at the distal tip of the long arms of the chromosomes. Setaria italica NBS-LRR proteins share at least one nucleotide-binding domain and one leucine-rich repeat domain. Our results also show that the duplication of NBS-LRR genes in Setaria italica is related to their gene structure.
Alenton, Rod Russel R.; Koiwai, Keiichiro; Miyaguchi, Kohei; Kondo, Hidehiro; Hirono, Ikuo
2017-04-04
C-type lectins (CTLs) are calcium-dependent carbohydrate-binding proteins known to assist the innate immune system as pattern recognition receptors (PRRs). The binding specificity of CTLs lies in the motif of their carbohydrate recognition domain (CRD); the tripeptide motifs EPN and QPD bind to mannose and galactose, respectively. However, variants of these motifs were discovered, including a QAP sequence reported in shrimp and believed to have the same carbohydrate specificity as QPD. Here, we characterized a novel C-type lectin (MjGCTL) possessing a CRD with a QAP motif. The recombinant MjGCTL has a calcium-dependent agglutinating capability against both Gram-negative and Gram-positive bacteria, and its sugar specificity did not involve either mannose or galactose. In an encapsulation assay, agarose beads coated with rMjGCTL were immediately encapsulated from 0 h followed by melanization at 4 h post-incubation with hemocytes. These results confirm that MjGCTL functions as a classical CTL. The structure of the QAP motif and the carbohydrate specificity of rMjGCTL were found to be different from those of both EPN and QPD, suggesting that QAP is a new motif. Furthermore, MjGCTL acts as a PRR binding to hemocytes to activate their adherent state and initiate encapsulation. PMID:28374848
Quantifying domain-ligand affinities and specificities by high-throughput holdup assay
Vincentelli, Renaud; Luck, Katja; Poirson, Juline; Polanowska, Jolanta; Abdat, Julie; Blémont, Marilyne; Turchetto, Jeremy; Iv, François; Ricquier, Kevin; Straub, Marie-Laure; Forster, Anne; Cassonnet, Patricia; Borg, Jean-Paul; Jacob, Yves; Masson, Murielle; Nominé, Yves; Reboul, Jérôme; Wolff, Nicolas; Charbonnier, Sebastian; Travé, Gilles
2015-01-01
Many protein interactions are mediated by small linear motifs interacting specifically with defined families of globular domains. Quantifying the specificity of a motif requires measuring and comparing its binding affinities to all its putative target domains. To this aim, we developed the high-throughput holdup assay, a chromatographic approach that can measure up to a thousand domain-motif equilibrium binding affinities per day. Extracts of overexpressed domains are incubated with peptide-coated resins and subjected to filtration. Binding affinities are deduced from microfluidic capillary electrophoresis of flow-throughs. After benchmarking the approach on 210 PDZ-peptide pairs with known affinities, we determined the affinities of two viral PDZ-binding motifs derived from Human Papillomavirus E6 oncoproteins for 209 PDZ domains covering 79% of the human PDZome. We obtained exquisite sequence-dependent binding profiles, describing quantitatively the PDZome recognition specificity of each motif. This approach, applicable to many categories of domain-ligand interactions, has a wide potential for quantifying the specificities of interactomes. PMID:26053890
Binding properties of SUMO-interacting motifs (SIMs) in yeast.
Jardin, Christophe; Horn, Anselm H C; Sticht, Heinrich
2015-03-01
Small ubiquitin-like modifier (SUMO) conjugation and interaction play an essential role in many cellular processes. A large number of yeast proteins are known to interact non-covalently with SUMO via short SUMO-interacting motifs (SIMs), but the structural details of this interaction are still poorly characterized. In the present work, sequence analysis of a large dataset of 148 yeast SIMs revealed the existence of a hydrophobic core binding motif and a preference for acidic residues either within or adjacent to the core motif. Thus the sequence properties of yeast SIMs are highly similar to those described for human. Molecular dynamics simulations were performed to investigate the binding preferences for four representative SIM peptides differing in the number and distribution of acidic residues. Furthermore, the relative stability of two previously observed alternative binding orientations (parallel, antiparallel) was assessed. For all SIMs investigated, the antiparallel binding mode remained stable in the simulations and the SIMs were tightly bound via their hydrophobic core residues supplemented by polar interactions of the acidic residues. In contrast, the stability of the parallel binding mode is more dependent on the sequence features of the SIM, such as the number and position of acidic residues or the presence of additional adjacent interaction motifs. This information should be helpful to enhance the prediction of SIMs and their binding properties in different organisms to facilitate the reconstruction of the SUMO interactome.
Feliciano, Daniel; Tolsma, Thomas O.; Farrell, Kristen B.; Aradi, Al; Di Pietro, Santiago M.
2018-01-01
During clathrin-mediated endocytosis (CME), actin assembly provides force to drive vesicle internalization. Members of the Wiskott–Aldrich syndrome protein (WASP) family play a fundamental role stimulating actin assembly. WASP family proteins contain a WH2 motif that binds globular actin (G-actin) and a central-acidic motif that binds the Arp2/3 complex, thus promoting the formation of branched actin filaments. Yeast WASP (Las17) is the strongest of five factors promoting Arp2/3-dependent actin polymerization during CME. It was suggested that this strong activity may be caused by a putative second G-actin-binding motif in Las17. Here, we describe the in vitro and in vivo characterization of such Las17 G-actin-binding motif (LGM) and its dependence on a group of conserved arginine residues. Using the yeast two-hybrid system, GST-pulldown, fluorescence polarization and pyrene-actin polymerization assays, we show that LGM binds G-actin and is necessary for normal Arp2/3-mediated actin polymerization in vitro. Live-cell fluorescence microscopy experiments demonstrate that LGM is required for normal dynamics of actin polymerization during CME. Further, LGM is necessary for normal dynamics of endocytic machinery components that are recruited at early, intermediate and late stages of endocytosis, as well as for optimal endocytosis of native CME cargo. Both in vitro and in vivo experiments show that LGM has relatively lower potency compared to the previously known Las17 G-actin-binding motif, WH2. These results establish a second G-actin-binding motif in Las17 and advance our knowledge on the mechanism of actin assembly during CME. PMID:25615019
DOE Office of Scientific and Technical Information (OSTI.GOV)
Teplova, Marianna; Farazi, Thalia A.; Tuschl, Thomas
Abstract RNA-binding protein with multiple splicing (designated RBPMS) is a higher vertebrate mRNA-binding protein containing a single RNA recognition motif (RRM). RBPMS has been shown to be involved in mRNA transport, localization and stability, with key roles in axon guidance, smooth muscle plasticity, as well as regulation of cancer cell proliferation and migration. We report on structure-function studies of the RRM domain of RBPMS bound to a CAC-containing single-stranded RNA. These results provide insights into potential topologies of complexes formed by the RBPMS RRM domain and the tandem CAC repeat binding sites as detected by photoactivatable-ribonucleoside-enhanced crosslinking and immunoprecipitation. These studies establish that the RRM domain of RBPMS forms a symmetrical dimer in the free state, with each monomer binding sequence-specifically to all three nucleotides of a CAC segment in the RNA bound state. Structure-guided mutations within the dimerization and RNA-binding interfaces of RBPMS RRM on RNA complex formation resulted in both disruption of dimerization and a decrease in RNA-binding affinity as observed by size exclusion chromatography and isothermal titration calorimetry. As anticipated from biochemical binding studies, over-expressed dimerization or RNA-binding mutants of Flag-HA-tagged RBPMS were no longer able to track with stress granules in HEK293 cells, thereby documenting the deleterious effects of such mutations in vivo.
Beltrán-Valero de Bernabé, D; Jimenez, F J; Aquaron, R; Rodríguez de Córdoba, S
1999-01-01
We recently showed that alkaptonuria (AKU) is caused by loss-of-function mutations in the homogentisate 1,2 dioxygenase gene (HGO). Herein we describe haplotype and mutational analyses of HGO in seven new AKU pedigrees. These analyses identified two novel single-nucleotide polymorphisms (INV4+31A-->G and INV11+18A-->G) and six novel AKU mutations (INV1-1G-->A, W60G, Y62C, A122D, P230T, and D291E), which further illustrates the remarkable allelic heterogeneity found in AKU. Reexamination of all 29 mutations and polymorphisms thus far described in HGO shows that these nucleotide changes are not randomly distributed; the CCC sequence motif and its inverted complement, GGG, are preferentially mutated. These analyses also demonstrated that the nucleotide substitutions in HGO do not involve CpG dinucleotides, which illustrates important differences between HGO and other genes for the occurrence of mutation at specific short-sequence motifs. Because the CCC sequence motifs comprise a significant proportion (34.5%) of all mutated bases that have been observed in HGO, we conclude that the CCC triplet is a mutational hot spot in HGO. PMID:10205262
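The observation that the CCC motif and its inverted complement GGG are preferentially mutated rests on locating both forms in the gene sequence. A minimal Python sketch of that lookup follows; the example sequence and function names are hypothetical and the code is not taken from the study.

```python
# Translation table for complementing DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_motif_sites(sequence, motif="CCC"):
    """Return (position, strand) pairs for overlapping motif hits on both strands.

    "+" marks the motif itself; "-" marks its reverse complement, i.e. the
    motif as it appears on the opposite strand. Positions are 0-based.
    """
    sequence = sequence.upper()
    motif = motif.upper()
    rc = reverse_complement(motif)
    hits = []
    for i in range(len(sequence) - len(motif) + 1):
        window = sequence[i:i + len(motif)]
        if window == motif:
            hits.append((i, "+"))
        if window == rc and rc != motif:
            hits.append((i, "-"))
    return hits


# Hypothetical toy sequence, not taken from HGO.
sites = find_motif_sites("ACCCCTGGGA")
```

Overlapping hits are reported separately, so a run of four C residues counts as two CCC sites. Whether overlapping sites should be counted this way in a hot-spot analysis is a design choice the abstract does not specify.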
Sztuba-Solinska, Joanna; Diaz, Larissa; Kumar, Mia R.; Kolb, Gaëlle; Wiley, Michael R.; Jozwick, Lucas; Kuhn, Jens H.; Palacios, Gustavo; Radoshitzky, Sheli R.; J. Le Grice, Stuart F.; Johnson, Reed F.
2016-01-01
Ebola virus (EBOV) is a single-stranded negative-sense RNA virus belonging to the Filoviridae family. The leader and trailer non-coding regions of the EBOV genome likely regulate its transcription, replication, and progeny genome packaging. We investigated the cis-acting RNA signals involved in RNA–RNA and RNA–protein interactions that regulate replication of eGFP-encoding EBOV minigenomic RNA and identified heat shock cognate protein family A (HSC70) member 8 (HSPA8) as an EBOV trailer-interacting host protein. Mutational analysis of the trailer HSPA8 binding motif revealed that this interaction is essential for EBOV minigenome replication. Selective 2′-hydroxyl acylation analyzed by primer extension analysis of the secondary structure of the EBOV minigenomic RNA indicates formation of a small stem-loop composed of the HSPA8 motif, a 3′ stem-loop (nucleotides 1868–1890) that is similar to a previously identified structure in the replicative intermediate (RI) RNA and a panhandle domain involving a trailer-to-leader interaction. Results of minigenome assays and an EBOV reverse genetic system rescue support a role for both the panhandle domain and HSPA8 motif 1 in virus replication. PMID:27651462
Li, Yongquan; Huang, Shuangsheng; Zhang, Xiaosu; Huang, Tao; Li, Hongyu
2013-02-01
PilT is a hexameric ATPase required for type IV pili (Tfp) retraction in gram-negative bacteria. Retraction of Tfp mediates intimate attachment and motility on inorganic solid surfaces. We investigated the cloning and expression of the pilT and pilU genes of Acidithiobacillus ferrooxidans strain ATCC 23270, and the results indicate that PilT and PilU contain the canonical conserved AIRNLIRE and GMQTXXXXLXXL motifs that are characteristic of the PilT protein family; PilT and PilU also contain the canonical nucleotide-binding motifs, named the Walker A box (GxxGxGKT/S) and Walker B box (hhhhDE), respectively. The pilT and pilU genes were expressed to produce 37.1- and 42.0-kDa proteins, respectively, and their co-transcription was induced by 10% mineral powder. However, the ATPase activity of PilT was distinctly higher than that of PilU. These results indicated that the PilT protein was the real molecular motor of Tfp, while PilU could play a key role in the assembly, modification, and twitching motility of Tfp in A. ferrooxidans. PilT and PilU were nonetheless interrelated in the formation and function of the molecular motor of Tfp.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rha, Geun Bae; Wu, Guangteng; Shoelson, Steven E.
2010-04-15
Hepatocyte nuclear factor 4α (HNF4α) is a novel nuclear receptor that participates in a hierarchical network of transcription factors regulating the development and physiology of such vital organs as the liver, pancreas, and kidney. Among the various transcriptional coregulators with which HNF4α interacts, peroxisome proliferation-activated receptor γ (PPARγ) coactivator 1α (PGC-1α) represents a novel coactivator whose activation is unusually robust and whose binding mode appears to be distinct from that of canonical coactivators such as NCoA/SRC/p160 family members. To elucidate the potentially unique molecular mechanism of PGC-1α recruitment, we have determined the crystal structure of HNF4α in complex with a fragment of PGC-1α containing all three of its LXXLL motifs. Despite the presence of all three LXXLL motifs available for interactions, only one is bound at the canonical binding site, with no additional contacts observed between the two proteins. However, a close inspection of the electron density map indicates that the bound LXXLL motif is not a selected one but an averaged structure of more than one LXXLL motif. Further biochemical and functional studies show that the individual LXXLL motifs can bind but drive only minimal transactivation. Only when more than one LXXLL motif is involved can significant transcriptional activity be measured, and full activation requires all three LXXLL motifs. These findings led us to propose a model wherein each LXXLL motif has an additive effect, and the multiple binding modes by HNF4α toward the LXXLL motifs of PGC-1α could account for the apparent robust activation by providing a flexible mechanism for combinatorial recruitment of additional coactivators and mediators.
A site-directed mutagenesis analysis of tNOX functional domains
NASA Technical Reports Server (NTRS)
Chueh, Pin-Ju; Morre, Dorothy M.; Morre, D. James
2002-01-01
Constitutive NADH oxidase proteins of the mammalian cell surface exhibit two different activities, oxidation of hydroquinones (or NADH) and protein disulfide-thiol interchange which alternate to yield oscillatory patterns with period lengths of 24 min. A drug-responsive tNOX (tumor-associated NADH oxidase) has a period length of about 22 min. The tNOX cDNA has been cloned and expressed. These two proteins are representative of cycling oxidase proteins of the plant and animal cell surface. In this report, we describe a series of eight amino acid replacements in tNOX which, when expressed in Escherichia coli, were analyzed for enzymatic activity, drug response and period length. Replacement sites selected include six cysteines that lie within the processed plasma membrane (34 kDa) form of the protein, and amino acids located in putative drug and adenine nucleotide (NADH) binding domains. The latter, plus two of the cysteine replacements, resulted in a loss of enzymatic activity. The recombinant tNOX with the modified drug binding site retained activity but the activity was no longer drug-responsive. The four remaining cysteine replacements were of interest in that both activity and drug response were retained but the period length for both NADH oxidation and protein disulfide-thiol interchange was increased from 22 min to 36 or 42 min. The findings confirm the correctness of the drug and adenine nucleotide binding motifs within the tNOX protein and imply a potential critical role of cysteine residues in determining the period length.
Biswas, N; Weller, S K
2001-05-18
Herpes simplex virus type 1 encodes a heterotrimeric helicase-primase complex composed of the products of the UL5, UL52, and UL8 genes. The UL5 protein contains seven motifs found in all members of helicase Superfamily 1 (SF1), and the UL52 protein contains several conserved motifs found in primases; however, the contributions of each subunit to the biochemical activities of the subcomplex are not clear. In this work, the DNA binding properties of wild type and mutant subcomplexes were examined using single-stranded, duplex, and forked substrates. A gel mobility shift assay indicated that the UL5-UL52 subcomplex binds more efficiently to the forked substrate than to either single strand or duplex DNA. Although nucleotides are not absolutely required for DNA binding, ADP stimulated the binding of UL5-UL52 to single strand DNA whereas ATP, ADP, and adenosine 5'-O-(thiotriphosphate) stimulated the binding to a forked substrate. We have previously shown that both subunits contact single-stranded DNA in a photocross-linking assay (Biswas, N., and Weller, S. K. (1999) J. Biol. Chem. 274, 8068-8076). In this study, photocross-linking assays with forked substrates indicate that the UL5 and UL52 subunits contact the forked substrates at different positions, UL52 at the single-stranded DNA tail and UL5 near the junction between single-stranded and double-stranded DNA. Neither subunit was able to cross-link a forked substrate when 5-iododeoxyuridine was located within the duplex portion. Photocross-linking experiments with subcomplexes containing mutant versions of UL5 and wild type UL52 indicated that the integrity of the ATP binding region is important for DNA binding of both subunits. These results support our previous proposal that UL5 and UL52 exhibit a complex interdependence for DNA binding (Biswas, N., and Weller, S. K. (1999) J. Biol. Chem. 274, 8068-8076) and indicate that the UL52 subunit may play a more active role in helicase activity than had previously been thought.
Mushtaq, Ameeq Ul; Lee, Yejin; Hwang, Eunha; Bang, Jeong Kyu; Hong, Eunmi; Byun, Youngjoo; Song, Ji-Joon; Jeon, Young Ho
2018-01-01
MeCP2 is a chromatin-associated protein that is highly expressed in the brain and is associated with Rett syndrome (RTT). MeCP2 contains AT-hook motifs that can bind AT-rich DNA, suggesting a role in chromatin binding. Here, we report the identification and characterization of another AT-rich DNA binding motif (residues 295 to 313) from the C-terminal transcription repression domain of MeCP2 by nuclear magnetic resonance (NMR) and isothermal titration calorimetry (ITC). This motif shows micromolar affinity for AT-rich DNA, and it binds to the minor groove of DNA like AT-hook motifs. Together with previous studies, our results provide insight into a critical role of this motif in chromatin structure and function.
McDonald, Caleb B.; Seldeen, Kenneth L.; Deegan, Brian J.; Bhat, Vikas; Farooq, Amjad
2010-01-01
A ubiquitous component of cellular signaling machinery, Gab1 docker plays a pivotal role in routing extracellular information in the form of growth factors and cytokines to downstream targets such as transcription factors within the nucleus. Here, using isothermal titration calorimetry (ITC) in combination with macromolecular modeling (MM), we show that although Gab1 contains four distinct RXXK motifs, designated G1, G2, G3 and G4, only G1 and G2 motifs bind to the cSH3 domain of Grb2 adaptor and do so with distinct mechanisms. Thus, while the G1 motif strictly requires the PPRPPKP consensus sequence for high-affinity binding to the cSH3 domain, the G2 motif displays preference for the PXVXRXLKPXR consensus. Such sequential differences in the binding of G1 and G2 motifs arise from their ability to adopt distinct polyproline type II (PPII)- and 3₁₀-helical conformations upon binding to the cSH3 domain, respectively. Collectively, our study provides detailed biophysical insights into a key protein-protein interaction involved in a diverse array of signaling cascades central to health and disease. PMID:21472810
Karttunen, Mikko; Choy, Wing-Yiu; Cino, Elio A
2018-06-07
Nuclear factor erythroid 2-related factor 2 (Nrf2) is a transcription factor and principal regulator of the antioxidant pathway. The Kelch domain of Kelch-like ECH-associated protein 1 (Keap1) binds to motifs in the N-terminal region of Nrf2, promoting its degradation. There is interest in developing ligands that can compete with Nrf2 for binding to Kelch, thereby activating its transcriptional activities and increasing antioxidant levels. Using experimental ΔGbind values of Kelch-binding motifs determined previously, a revised hydrophobicity-based model was developed for estimating ΔGbind from amino acid sequence and applied to rank potential uncharacterized Kelch-binding motifs identified from interaction databases and BLAST searches. Model predictions and molecular dynamics (MD) simulations suggested that full-length MAD2A binds Kelch more favorably than a high-affinity 20-mer Nrf2 E78P peptide, but that the motif in isolation is not a particularly strong binder. Endeavoring to develop shorter peptides for activating Nrf2, new designs were created based on the E78P peptide, some of which showed considerable propensity to form binding-competent structures in MD, and were predicted to interact with Kelch more favorably than the E78P peptide. The peptides could be promising new ligands for enhancing the oxidative stress response.
Wang, Min; Hancock, Timothy P; Chamberlain, Amanda J; Vander Jagt, Christy J; Pryce, Jennie E; Cocks, Benjamin G; Goddard, Mike E; Hayes, Benjamin J
2018-05-24
Topological association domains (TADs) are chromosomal domains characterised by frequent internal DNA-DNA interactions. The transcription factor CTCF binds to conserved DNA sequence patterns called CTCF binding motifs to either prohibit or facilitate chromosomal interactions. TADs and CTCF binding motifs control gene expression, but they are not yet well defined in the bovine genome. In this paper, we sought to improve the annotation of bovine TADs and CTCF binding motifs, and assess whether the new annotation can reduce the search space for cis-regulatory variants. We used genomic synteny to map TADs and CTCF binding motifs from humans, mice, dogs and macaques to the bovine genome. We found that our mapped TADs exhibited the same hallmark properties of those sourced from experimental data, such as housekeeping genes, transfer RNA genes, CTCF binding motifs, short interspersed elements, H3K4me3 and H3K27ac. We showed that runs of genes with the same pattern of allele-specific expression (ASE) (either favouring paternal or maternal allele) were often located in the same TAD or between the same conserved CTCF binding motifs. Analyses of variance showed that when averaged across all bovine tissues tested, TADs explained 14% of ASE variation (standard deviation, SD: 0.056), while CTCF explained 27% (SD: 0.078). Furthermore, we showed that the quantitative trait loci (QTLs) associated with gene expression variation (eQTLs) or ASE variation (aseQTLs), which were identified from mRNA transcripts from 141 lactating cows' white blood and milk cells, were highly enriched at putative bovine CTCF binding motifs. The linearly-furthermost, and most-significant aseQTL and eQTL for each genic target were located within the same TAD as the gene more often than expected (Chi-Squared test P-value < 0.001). 
Our results suggest that genomic synteny can be used to functionally annotate conserved transcriptional components, and provides a tool to reduce the search space for causative regulatory variants in the bovine genome.
Crystal structure of bacterial cell-surface alginate-binding protein with an M75 peptidase motif
DOE Office of Scientific and Technical Information (OSTI.GOV)
Maruyama, Yukie; Ochiai, Akihito; Mikami, Bunzo
Research highlights: Bacterial alginate-binding Algp7 is similar to component EfeO of Fe2+ transporter. We determined the crystal structure of Algp7 with a metal-binding motif. Algp7 consists of two helical bundles formed through duplication of a single bundle. A deep cleft involved in alginate binding locates around the metal-binding site. Algp7 may function as a Fe2+-chelated alginate-binding protein. Abstract: A gram-negative Sphingomonas sp. A1 directly incorporates alginate polysaccharide into the cytoplasm via the cell-surface pit and ABC transporter. A cell-surface alginate-binding protein, Algp7, functions as a concentrator of the polysaccharide in the pit. Based on the primary structure and genetic organization in the bacterial genome, Algp7 was found to be homologous to an M75 peptidase motif-containing EfeO, a component of a ferrous ion transporter. Despite the presence of an M75 peptidase motif with high similarity, the Algp7 protein purified from recombinant Escherichia coli cells was inert on insulin B chain and N-benzoyl-Phe-Val-Arg-p-nitroanilide, both of which are substrates for a typical M75 peptidase, imelysin, from Pseudomonas aeruginosa. The X-ray crystallographic structure of Algp7 was determined at 2.10 Å resolution by single-wavelength anomalous diffraction. Although a metal-binding motif, HxxE, conserved in zinc ion-dependent M75 peptidases is also found in Algp7, the crystal structure of Algp7 contains no metal even at the motif. The protein consists of two structurally similar up-and-down helical bundles as the basic scaffold. A deep cleft between the bundles is sufficiently large to accommodate macromolecules such as alginate polysaccharide. This is the first structural report on a bacterial cell-surface alginate-binding protein with an M75 peptidase motif.
Peptide-binding motifs of two common equine class I MHC molecules in Thoroughbred horses.
Bergmann, Tobias; Lindvall, Mikaela; Moore, Erin; Moore, Eugene; Sidney, John; Miller, Donald; Tallmadge, Rebecca L; Myers, Paisley T; Malaker, Stacy A; Shabanowitz, Jeffrey; Osterrieder, Nikolaus; Peters, Bjoern; Hunt, Donald F; Antczak, Douglas F; Sette, Alessandro
2017-05-01
Quantitative peptide-binding motifs of MHC class I alleles provide a valuable tool to efficiently identify putative T cell epitopes. Detailed information on equine MHC class I alleles is still very limited, and to date, only a single equine MHC class I allele, Eqca-1*00101 (ELA-A3 haplotype), has been characterized. The present study extends the number of characterized ELA class I specificities in two additional haplotypes found commonly in the Thoroughbred breed. Accordingly, we here report quantitative binding motifs for the ELA-A2 allele Eqca-16*00101 and the ELA-A9 allele Eqca-1*00201. Utilizing analyses of endogenously bound and eluted ligands and the screening of positional scanning combinatorial libraries, detailed and quantitative peptide-binding motifs were derived for both alleles. Eqca-16*00101 preferentially binds peptides with aliphatic/hydrophobic residues in position 2 and at the C-terminus, and Eqca-1*00201 has a preference for peptides with arginine in position 2 and hydrophobic/aliphatic residues at the C-terminus. Interestingly, the Eqca-16*00101 motif resembles that of the human HLA A02-supertype, while the Eqca-1*00201 motif resembles that of the HLA B27-supertype and two macaque class I alleles. It is expected that the identified motifs will facilitate the selection of candidate epitopes for the study of immune responses in horses.
Lee, J H; Maeda, S; Angelos, K L; Kamita, S G; Ramachandran, C; Walsh, D A
1992-11-03
Active gamma subunit of skeletal muscle phosphorylase kinase has been obtained by expression of the rat soleus cDNA in a baculovirus system. The protein exhibited the expected pH 6.8/8.2 activity ratio of 0.6, and its activity was insensitive to Ca2+ addition, indicating that it was free gamma subunit and not a gamma subunit-calmodulin complex. It was stimulated approximately 2-fold by Ca(2+)-calmodulin addition, demonstrating that it had retained high-affinity calmodulin binding. By site-directed mutagenesis, we have examined the role of six of the amino acids that constitute the consensus ATP binding site of the protein kinase, which in the gamma subunit is represented by the sequence 26Gly.Arg.Gly.Val.Ser.Ser.Val.Val33. Changes were evaluated by the kinetic determination of the dissociation constants of gamma-ATP, gamma-ADP, gamma-AMP.PCP, and gamma-phosphorylase and the maximum catalytic activity. The mutants Ser26-gamma, Ser29-gamma, Phe30-gamma, and Gly31-gamma each exhibited an essentially identical dissociation constant for gamma subunit phosphorylase, indicating that these mutations had not caused a global alteration in the protein structure but were limited to changes in the nucleotide binding site domain. Substitution of either Val33 (by Gly) or Gly28 (by Ser), two of the most conserved residues in all protein kinases, resulted in enzyme with marginally detectable activity. In noted contrast, the Ser26 mutant, which substituted the first glycine of the consensus glycine trio motif, and which is also very highly conserved, retained at least 25% of the enzymatic activity. The Gly31 substitution, which restored a glycine to a position characteristic for most protein kinases, had little overall effect upon the maximum rate of catalysis. Restoration of Ser30 to the more typical phenylalanine, which is present in most protein kinases, had minimal effect on catalysis. 
These data provide the first direct evaluation of the roles that different residues play within this consensus glycine trio/valine motif of the protein kinases, which up to now have only been surmised to be of importance because of their conservation. Two unexpected findings are that for one residue that is very conserved (Gly26) there is some flexibility of substitution not apparent from the evolutionary conservation and that a second quite conserved residue in protein kinases (equivalent to Gly at position 31) does not produce a protein optimized for nucleotide binding.
Finding the target sites of RNA-binding proteins
Li, Xiao; Kazan, Hilal; Lipshitz, Howard D; Morris, Quaid D
2014-01-01
RNA–protein interactions differ from DNA–protein interactions because of the central role of RNA secondary structure. Some RNA-binding domains (RBDs) recognize their target sites mainly by their shape and geometry and others are sequence-specific but are sensitive to secondary structure context. A number of small- and large-scale experimental approaches have been developed to measure RNAs associated in vitro and in vivo with RNA-binding proteins (RBPs). Generalizing outside of the experimental conditions tested by these assays requires computational motif finding. Often RBP motif finding is done by adapting DNA motif finding methods; but modeling secondary structure context leads to better recovery of RBP-binding preferences. Genome-wide assessment of mRNA secondary structure has recently become possible, but these data must be combined with computational predictions of secondary structure before they add value in predicting in vivo binding. There are two main approaches to incorporating structural information into motif models: supplementing primary sequence motif models with preferred secondary structure contexts (e.g., MEMERIS and RNAcontext) and directly modeling secondary structure recognized by the RBP using stochastic context-free grammars (e.g., CMfinder and RNApromo). The former better reconstruct known binding preferences for sequence-specific RBPs but are not suitable for modeling RBPs that recognize shape and geometry of RNAs. Future work in RBP motif finding should incorporate interactions between multiple RBDs and multiple RBPs in binding to RNA. WIREs RNA 2014, 5:111–130. doi: 10.1002/wrna.1201 PMID:24217996
DNA motif elucidation using belief propagation.
Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei
2013-09-01
Protein-binding microarray (PBM) is a high-throughput platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k=8~10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. In particular, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/~wkc/kmerHMM.
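The preprocessing step mentioned in this abstract (reducing probe-level PBM signals to a median intensity per k-mer, then ranking the k-mers) can be illustrated with a minimal sketch. This is not the kmerHMM implementation: the function name `kmer_median_intensities`, the `(sequence, intensity)` input format, and the toy probes are assumptions made for illustration, and details such as reverse-complement handling or normalization are deliberately left out.

```python
from collections import defaultdict
from statistics import median


def kmer_median_intensities(probes, k):
    """Map each k-mer to the median intensity of the probes that contain it.

    `probes` is an iterable of (sequence, intensity) pairs; this input
    format is a hypothetical stand-in for real PBM probe data.
    """
    by_kmer = defaultdict(list)
    for seq, intensity in probes:
        # Count each k-mer once per probe, even if it occurs several times.
        kmers = {seq[i:i + k] for i in range(len(seq) - k + 1)}
        for kmer in kmers:
            by_kmer[kmer].append(intensity)
    return {kmer: median(values) for kmer, values in by_kmer.items()}


# Toy data (invented for illustration only).
probes = [("ACGTA", 10.0), ("CGTAC", 30.0), ("GTACG", 20.0)]
medians = kmer_median_intensities(probes, 3)

# Rank k-mers by decreasing median intensity, breaking ties alphabetically.
ranked = sorted(medians, key=lambda kmer: (-medians[kmer], kmer))
```

A ranked list of this kind would be the input to the alignment and HMM training stages the abstract describes; those stages are not sketched here.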
Regulation of TCF ETS-domain transcription factors by helix-loop-helix motifs.
Stinson, Julie; Inoue, Toshiaki; Yates, Paula; Clancy, Anne; Norton, John D; Sharrocks, Andrew D
2003-08-15
DNA binding by the ternary complex factor (TCF) subfamily of ETS-domain transcription factors is tightly regulated by intramolecular and intermolecular interactions. The helix-loop-helix (HLH)-containing Id proteins are trans-acting negative regulators of DNA binding by the TCFs. In the TCF, SAP-2/Net/ERP, intramolecular inhibition of DNA binding is promoted by the cis-acting NID region that also contains an HLH-like motif. The NID also acts as a transcriptional repression domain. Here, we have studied the role of HLH motifs in regulating DNA binding and transcription by the TCF protein SAP-1 and how Cdk-mediated phosphorylation affects the inhibitory activity of the Id proteins towards the TCFs. We demonstrate that the NID region of SAP-1 is an autoinhibitory motif that acts to inhibit DNA binding and also functions as a transcription repression domain. This region can be functionally replaced by fusion of Id proteins to SAP-1, whereby the Id moiety then acts to repress DNA binding in cis. Phosphorylation of the Ids by cyclin-Cdk complexes results in reduction in protein-protein interactions between the Ids and TCFs and relief of their DNA-binding inhibitory activity. In revealing distinct mechanisms through which HLH motifs modulate the activity of TCFs, our results therefore provide further insight into the role of HLH motifs in regulating TCF function and how the inhibitory properties of the trans-acting Id HLH proteins are themselves regulated by phosphorylation.
La-related protein 1 (LARP1) binds the mRNA cap, blocking eIF4F assembly on TOP mRNAs.
Lahr, Roni M; Fonseca, Bruno D; Ciotti, Gabrielle E; Al-Ashtal, Hiba A; Jia, Jian-Jun; Niklaus, Marius R; Blagden, Sarah P; Alain, Tommy; Berman, Andrea J
2017-04-07
The 5' terminal oligopyrimidine (5'TOP) motif is a cis-regulatory RNA element located immediately downstream of the 7-methylguanosine [m7G] cap of TOP mRNAs, which encode ribosomal proteins and translation factors. In eukaryotes, this motif coordinates the synchronous and stoichiometric expression of the protein components of the translation machinery. La-related protein 1 (LARP1) binds TOP mRNAs, regulating their stability and translation. We present crystal structures of the human LARP1 DM15 region in complex with a 5'TOP motif, a cap analog (m7GTP), and a capped cytidine (m7GpppC), resolved to 2.6, 1.8 and 1.7 Å, respectively. Our binding, competition, and immunoprecipitation data corroborate and elaborate on the mechanism of 5'TOP motif binding by LARP1. We show that LARP1 directly binds the cap and adjacent 5'TOP motif of TOP mRNAs, effectively impeding access of eIF4E to the cap and preventing eIF4F assembly. Thus, LARP1 is a specialized TOP mRNA cap-binding protein that controls ribosome biogenesis.
Rules for the recognition of dilysine retrieval motifs by coatomer
Ma, Wenfu; Goldberg, Jonathan
2013-01-01
Cytoplasmic dilysine motifs on transmembrane proteins are captured by coatomer α-COP and β′-COP subunits and packaged into COPI-coated vesicles for Golgi-to-ER retrieval. Numerous ER/Golgi proteins contain K(x)Kxx motifs, but the rules for their recognition are unclear. We present crystal structures of α-COP and β′-COP bound to a series of naturally occurring retrieval motifs—encompassing KKxx, KxKxx and non-canonical RKxx and viral KxHxx sequences. Binding experiments show that α-COP and β′-COP have generally the same specificity for KKxx and KxKxx, but only β′-COP recognizes the RKxx signal. Dilysine motif recognition involves lysine side-chain interactions with two acidic patches. Surprisingly, however, KKxx and KxKxx motifs bind differently, with their lysine residues transposed at the binding patches. We derive rules for retrieval motif recognition from key structural features: the reversed binding modes, the recognition of the C-terminal carboxylate group which enforces lysine positional context, and the tolerance of the acidic patches for non-lysine residues. PMID:23481256
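Because the abstract stresses that the C-terminal carboxylate fixes the positions of the lysines, the three lysine- and arginine-based signal classes it names (KKxx, KxKxx, RKxx) can be expressed as patterns anchored at the end of a protein sequence. The sketch below is only an illustration of that positional rule: the function name `classify_cterm` and the example sequences are hypothetical, and a pattern match says nothing about whether coatomer actually binds a given tail.

```python
import re

# Patterns anchored at the C-terminus ("$"); "." stands for any residue (x).
CTERM_PATTERNS = {
    "KKxx": re.compile(r"KK..$"),
    "KxKxx": re.compile(r"K.K..$"),
    "RKxx": re.compile(r"RK..$"),
}


def classify_cterm(protein_seq):
    """Return the names of the C-terminal signal patterns the sequence matches."""
    seq = protein_seq.upper()
    return [name for name, pattern in CTERM_PATTERNS.items() if pattern.search(seq)]


# Hypothetical example tails (not taken from the paper).
examples = {
    "MAAAKKTN": classify_cterm("MAAAKKTN"),
    "MAAAKTKLL": classify_cterm("MAAAKTKLL"),
    "MAAKKKAA": classify_cterm("MAAKKKAA"),
    "MAAARKTN": classify_cterm("MAAARKTN"),
}
```

A tail such as KKKxx matches both lysine patterns, which is why the function returns a list rather than a single label.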
Ao, Jingqun; Ding, Yang; Chen, Yuanyuan; Mu, Yinnan; Chen, Xinhua
2015-12-10
The C-type lectin-like receptors (CTLRs) play important roles in innate immunity as one type of pattern recognition receptors. Here, we cloned and characterized a C-type lectin-like receptor (LycCTLR) from large yellow croaker Larimichthys crocea. The full-length cDNA of LycCTLR is 880 nucleotides long, encoding a protein of 215 amino acids. The deduced LycCTLR contains a C-terminal C-type lectin-like domain (CTLD), an N-terminal cytoplasmic tail, and a transmembrane region. The CTLD of LycCTLR possesses six highly conserved cysteine residues (C1-C6), a conserved WI/MGL motif, and two sugar binding motifs, EPD (Glu-Pro-Asp) and WYD (Trp-Tyr-Asp). Ca(2+) binding sites 1 and 2 were also found in the CTLD. The LycCTLR gene consists of five exons and four introns, showing the same genomic organization as tilapia (Oreochromis niloticus) and guppy (Poecilia reticulata) CTLRs. LycCTLR was constitutively expressed in various tissues tested, and its transcripts significantly increased in the head kidney and spleen after stimulation with inactivated trivalent bacterial vaccine. Recombinant LycCTLR (rLycCTLR) protein produced in Escherichia coli BL21 exhibited not only hemagglutinating activity and a preference for galactose, but also agglutinating activity against two food-borne pathogenic bacteria, E. coli and Bacillus cereus, in a Ca(2+)-dependent manner. These results indicate that LycCTLR is a potential galactose-binding C-type lectin that may play a role in the antibacterial immunity in fish.
Brucet, Marina; Querol-Audí, Jordi; Serra, Maria; Ramirez-Espain, Ximena; Bertlik, Kamila; Ruiz, Lidia; Lloberas, Jorge; Macias, Maria J; Fita, Ignacio; Celada, Antonio
2007-05-11
TREX1 is the most abundant mammalian 3' --> 5' DNA exonuclease. It has been described to form part of the SET complex and is responsible for the Aicardi-Goutières syndrome in humans. Here we show that the exonuclease activity is correlated to the binding preferences toward certain DNA sequences. In particular, we have found three motifs that are selected, GAG, ACA, and CTGC. To elucidate how the discrimination occurs, we determined the crystal structures of two murine TREX1 complexes, with a nucleotide product of the exonuclease reaction, and with a single-stranded DNA substrate. Using confocal microscopy, we observed TREX1 both in nuclear and cytoplasmic subcellular compartments. Remarkably, the presence of TREX1 in the nucleus requires the loss of a C-terminal segment, which we named leucine-rich repeat 3. Furthermore, we detected the presence of a conserved proline-rich region on the surface of TREX1. This observation points to interactions with proline-binding domains. The potential interacting motif "PPPVPRPP" does not contain aromatic residues and thus resembles other sequences that select SH3 and/or Group 2 WW domains. By means of nuclear magnetic resonance titration experiments, we show that, indeed, a polyproline peptide derived from the murine TREX1 sequence interacted with the WW2 domain of the elongation transcription factor CA150. Co-immunoprecipitation studies confirmed this interaction with the full-length TREX1 protein, thereby suggesting that TREX1 participates in more functional complexes than previously thought.
Leisy, D.J.; Rasmussen, C.; Owusu, E.O.; Rohrmann, G.F.
1997-01-01
The Autographa californica multinucleocapsid nuclear polyhedrosis virus (AcMNPV) ie-1 gene product (IE-1) is thought to play a central role in stimulating early viral transcription. IE-1 has been demonstrated to activate several early viral gene promoters and to negatively regulate the promoters of two other AcMNPV regulatory genes, ie-0 and ie-2. Our results indicate that IE-1 negatively regulates the expression of certain genes by binding directly, or as part of a complex, to promoter regions containing a specific IE-1-binding motif (5'-ACBYGTAA-3') near their mRNA start sites. The IE-1 binding motif was also found within the palindromic sequences of AcMNPV homologous repeat (hr) regions that have been shown to bind IE-1. The role of this IE-1 binding motif in the regulation of the ie-2 and pe-38 promoters was examined by introducing mutations in these promoters in which the central 6 bp were replaced with BglII sites. GUS reporter constructs containing ie-2 and pe-38 promoter fragments with and without these specific mutations were cotransfected into Sf9 cells with various amounts of an ie-1-containing plasmid (pIe-1). Comparisons of GUS expression produced by the mutant and wild-type constructs demonstrated that the IE-1 binding motif mediated a significant decrease in expression from the ie-2 and pe-38 promoters in response to increasing pIe-1 concentrations. Electrophoretic mobility shift assays with pIe-1-transfected cell extracts and supershift assays with IE-1-specific antiserum demonstrated that IE-1 binds to promoter fragments containing the IE-1 binding motif but does not bind to promoter fragments lacking this motif.
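The motif 5'-ACBYGTAA-3' is written with IUPAC degenerate nucleotide codes (B = C, G or T; Y = C or T). As an illustration of how such a degenerate motif can be located in a sequence, the following sketch expands the codes into a regular expression and scans both strands. The helper names and test sequences are assumptions for illustration; this is not the search procedure used in the study.

```python
import re

# Standard IUPAC nucleotide codes expanded to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def iupac_to_regex(motif):
    """Translate an IUPAC motif such as ACBYGTAA into a regex string."""
    return "".join(IUPAC[base] for base in motif.upper())


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_motif(seq, motif):
    """Return (start, strand, matched site) for every hit on either strand.

    `start` is the 0-based position on the forward strand; the matched
    site is always reported 5'->3' on the strand where it was found.
    """
    seq = seq.upper()
    # A lookahead with a capture group also reports overlapping matches.
    pattern = re.compile("(?=(" + iupac_to_regex(motif) + "))")
    hits = [(m.start(), "+", m.group(1)) for m in pattern.finditer(seq)]
    for m in pattern.finditer(reverse_complement(seq)):
        start = len(seq) - m.start() - len(motif)
        hits.append((start, "-", m.group(1)))
    return hits


# Hypothetical test sequences (not from the AcMNPV genome).
plus_hits = find_motif("GGACGCGTAATT", "ACBYGTAA")
minus_hits = find_motif("CCTTACAAGTCC", "ACBYGTAA")
```

Scanning both strands matters here because the motif is not palindromic, so a site on the reverse strand would be missed by a forward-only search.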
Cotmore, S F; Christensen, J; Nüesch, J P; Tattersall, P
1995-01-01
A DNA fragment containing the minute virus of mice 3' replication origin was specifically coprecipitated in immune complexes containing the virally coded NS1, but not the NS2, polypeptide. Antibodies directed against the amino- or carboxy-terminal regions of NS1 precipitated the NS1-origin complexes, but antibodies directed against NS1 amino acids 284 to 459 blocked complex formation. Using affinity-purified histidine-tagged NS1 preparations, we have shown that the specific protein-DNA interaction is of moderate affinity, being stable in 0.1 M salt but rapidly lost at higher salt concentrations. In contrast, generalized (or nonspecific) DNA binding by NS1 could be demonstrated only in low salt. Addition of ATP or gamma S-ATP enhanced specific DNA binding by wild-type NS1 severalfold, but binding was lost under conditions which favored ATP hydrolysis. NS1 molecules with mutations in a critical lysine residue (amino acid 405) in the consensus ATP-binding site bound to the origin, but this binding could not be enhanced by ATP addition. DNase I protection assays carried out with wild-type NS1 in the presence of gamma S-ATP gave footprints which extended over 43 nucleotides on both DNA strands, from the middle of the origin bubble sequence to a position some 14 bp beyond the nick site. The DNA-binding site for NS1 was mapped to a 22-bp fragment from the middle of the 3' replication origin which contains the sequence ACCAACCA. This conforms to a reiterated motif (ACCA)2-3, which occurs, in more or less degenerate form, at many sites throughout the minute virus of mice genome (J. W. Bodner, Virus Genes 2:167-182, 1989). Insertion of a single copy of the sequence (ACCA)3 was shown to be sufficient to confer NS1 binding on an otherwise unrecognized plasmid fragment. The functions of NS1 in the viral life cycle are reevaluated in the light of this result. PMID:7853501
Ni2+-binding RNA motifs with an asymmetric purine-rich internal loop and a G-A base pair.
Hofmann, H P; Limmer, S; Hornung, V; Sprinzl, M
1997-01-01
RNA molecules with high affinity for immobilized Ni2+ were isolated from an RNA pool with 50 randomized positions by in vitro selection-amplification. The selected RNAs preferentially bind Ni2+ and Co2+ over other cations from first series transition metals. Conserved structure motifs, comprising about 15 nt, were identified that are likely to represent the Ni2+ binding sites. Two conserved motifs contain an asymmetric purine-rich internal loop and probably a mismatch G-A base pair. The structure of one of these motifs was studied with proton NMR spectroscopy and formation of the G-A pair at the junction of helix and internal loop was demonstrated. Using Ni2+ as a paramagnetic probe, a divalent metal ion binding site near this G-A base pair was identified. Ni2+ ions bound to this motif exert a specific stabilization effect. We propose that small asymmetric purine-rich loops that contain a G-A interaction may represent a divalent metal ion binding site in RNA. PMID:9409620
Barrijal, S; Perros, M; Gu, Z; Avalosse, B L; Belenguer, P; Amalric, F; Rommelaere, J
1992-01-01
Nucleolin, a major nucleolar protein, forms a specific complex with the genome (a single-stranded DNA molecule of minus polarity) of parvovirus MVMp in vitro. By means of South-western blotting experiments, we mapped the binding site to a 222-nucleotide motif within the non-structural transcription unit, referred to as NUBE (nucleolin-binding element). The specificity of the interaction was confirmed by competitive gel retardation assays. DNaseI and nuclease S1 probing showed that NUBE folds into a secondary structure, in agreement with a computer-assisted conformational prediction. The whole NUBE may be necessary for the interaction with nucleolin, as suggested by the failure of NUBE subfragments to bind the protein and by the nuclease footprinting experiments. The present work extends the previously reported ability of nucleolin to form a specific complex with ribosomal RNA, to a defined DNA substrate. Considering the tropism of MVMp DNA replication for host cell nucleoli, these data raise the possibility that nucleolin may contribute to the regulation of the parvoviral life-cycle. PMID:1408821
The Structure of the Human Centrin 2-Xeroderma Pigmentosum Group C Protein Complex
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thompson, J.; Ryan, Z.; Salisbury, J.
2006-01-01
Human centrin-2 plays a key role in centrosome function and stimulates nucleotide excision repair by binding to the xeroderma pigmentosum group C protein. To determine the structure of human centrin-2 and to develop an understanding of molecular interactions between centrin and xeroderma pigmentosum group C protein, we characterized the crystal structure of calcium-loaded full-length centrin-2 complexed with a xeroderma pigmentosum group C peptide. Our structure shows that the carboxyl-terminal domain of centrin-2 binds this peptide and two calcium atoms, whereas the amino-terminal lobe is in a closed conformation positioned distantly by an ordered α-helical linker. A stretch of the amino-terminal domain unique to centrins appears disordered. Two xeroderma pigmentosum group C peptides both bound to centrin-2 also interact to form an α-helical coiled-coil. The interface between centrin-2 and each peptide is predominantly nonpolar, and key hydrophobic residues of XPC have been identified that lead us to propose a novel binding motif for centrin.
Cold shock protein YB-1 is involved in hypoxia-dependent gene transcription.
Rauen, Thomas; Frye, Bjoern C; Wang, Jialin; Raffetseder, Ute; Alidousty, Christina; En-Nia, Abdelaziz; Floege, Jürgen; Mertens, Peter R
2016-09-16
Hypoxia-dependent gene regulation is largely orchestrated by hypoxia-inducible factors (HIFs), which associate with defined nucleotide sequences of hypoxia-responsive elements (HREs). Comparison of the regulatory HRE within the 3' enhancer of the human erythropoietin (EPO) gene with known binding motifs for cold shock protein Y-box (YB) protein-1 yielded strong similarities within the Y-box element and 3' adjacent sequences. DNA binding assays confirmed YB-1 binding to both single- and double-stranded HRE templates. Under hypoxia, we observed nuclear shuttling of YB-1, and co-immunoprecipitation assays demonstrated that YB-1 and HIF-1α physically interact with each other. Cellular YB-1 depletion using siRNA significantly induced hypoxia-dependent EPO production at both the promoter and mRNA levels. Vice versa, overexpressed YB-1 significantly reduced EPO-HRE-dependent gene transcription, whereas this effect was minor under normoxia. HIF-1α overexpression induced hypoxia-dependent gene transcription through the same element and accordingly, co-expression with YB-1 reduced HIF-1α-mediated EPO induction under hypoxic conditions. Taken together, we identified YB-1 as a novel binding factor for HREs that participates in fine-tuning of the hypoxia transcriptome. Copyright © 2016 Elsevier Inc. All rights reserved.
2011-01-01
Background Transcription factors (TFs) play a central role in regulating gene expression by interacting with cis-regulatory DNA elements associated with their target genes. Recent surveys have examined the DNA binding specificities of most Saccharomyces cerevisiae TFs, but a comprehensive evaluation of their data has been lacking. Results We analyzed in vitro and in vivo TF-DNA binding data reported in previous large-scale studies to generate a comprehensive, curated resource of DNA binding specificity data for all characterized S. cerevisiae TFs. Our collection comprises DNA binding site motifs and comprehensive in vitro DNA binding specificity data for all possible 8-bp sequences. Investigation of the DNA binding specificities within the basic leucine zipper (bZIP) and VHT1 regulator (VHR) TF families revealed unexpected plasticity in TF-DNA recognition: intriguingly, the VHR TFs, newly characterized by protein binding microarrays in this study, recognize bZIP-like DNA motifs, while the bZIP TF Hac1 recognizes a motif highly similar to the canonical E-box motif of basic helix-loop-helix (bHLH) TFs. We identified several TFs with distinct primary and secondary motifs, which might be associated with different regulatory functions. Finally, integrated analysis of in vivo TF binding data with protein binding microarray data lends further support for indirect DNA binding in vivo by sequence-specific TFs. Conclusions The comprehensive data in this curated collection allow for more accurate analyses of regulatory TF-DNA interactions, in-depth structural studies of TF-DNA specificity determinants, and future experimental investigations of the TFs' predicted target genes and regulatory roles. PMID:22189060
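The phrase "all possible 8-bp sequences" above can be made concrete with a small count. There are 4^8 = 65,536 single-stranded 8-mers, but a double-stranded site and its reverse complement describe the same DNA, so the number of distinct 8-bp sites is smaller. The Python sketch below enumerates them; treating a k-mer and its reverse complement as one site is an assumption about how such data are usually collapsed, not a detail stated in the abstract, and the function names are illustrative.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def count_kmers(k):
    """Return (all k-mers, strand-collapsed k-mers, self-complementary k-mers)."""
    canonical = set()
    palindromes = 0
    total = 0
    for letters in product("ACGT", repeat=k):
        kmer = "".join(letters)
        rc = reverse_complement(kmer)
        total += 1
        if kmer == rc:
            palindromes += 1
        # Keep one representative per k-mer / reverse-complement pair.
        canonical.add(min(kmer, rc))
    return total, len(canonical), palindromes


print(count_kmers(8))  # (65536, 32896, 256)
```

The middle figure follows from (4^8 + 4^4) / 2, because the 256 sequences that equal their own reverse complement are not halved.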
McDonald, Caleb B; Seldeen, Kenneth L; Deegan, Brian J; Bhat, Vikas; Farooq, Amjad
2011-01-01
A ubiquitous component of cellular signaling machinery, Gab1 docker plays a pivotal role in routing extracellular information in the form of growth factors and cytokines to downstream targets such as transcription factors within the nucleus. Here, using isothermal titration calorimetry (ITC) in combination with macromolecular modeling (MM), we show that although Gab1 contains four distinct RXXK motifs, designated G1, G2, G3, and G4, only G1 and G2 motifs bind to the cSH3 domain of Grb2 adaptor and do so with distinct mechanisms. Thus, while the G1 motif strictly requires the PPRPPKP consensus sequence for high-affinity binding to the cSH3 domain, the G2 motif displays preference for the PXVXRXLKPXR consensus. Such sequential differences in the binding of G1 and G2 motifs arise from their ability to adopt distinct polyproline type II (PPII)- and 3(10)-helical conformations upon binding to the cSH3 domain, respectively. Collectively, our study provides detailed biophysical insights into a key protein-protein interaction involved in a diverse array of signaling cascades central to health and disease. Copyright © 2010 John Wiley & Sons, Ltd.
RGAugury: a pipeline for genome-wide prediction of resistance gene analogs (RGAs) in plants.
Li, Pingchuan; Quan, Xiande; Jia, Gaofeng; Xiao, Jin; Cloutier, Sylvie; You, Frank M
2016-11-02
Resistance gene analogs (RGAs), such as NBS-encoding proteins, receptor-like protein kinases (RLKs) and receptor-like proteins (RLPs), are potential R-genes that contain specific conserved domains and motifs. Thus, RGAs can be predicted based on their conserved structural features using bioinformatics tools. Computer programs have been developed for the identification of individual domains and motifs from the protein sequences of RGAs, but none offers a systematic assessment of the different types of RGAs. A user-friendly and efficient pipeline is needed for large-scale genome-wide RGA predictions of the growing number of sequenced plant genomes. An integrative pipeline, named RGAugury, was developed to automate RGA prediction. The pipeline first identifies RGA-related protein domains and motifs, namely nucleotide binding site (NB-ARC), leucine rich repeat (LRR), transmembrane (TM), serine/threonine and tyrosine kinase (STTK), lysin motif (LysM), coiled-coil (CC) and Toll/Interleukin-1 receptor (TIR). RGA candidates are identified and classified into four major families based on the presence of combinations of these RGA domains and motifs: NBS-encoding, TM-CC, and membrane associated RLP and RLK. All time-consuming analyses of the pipeline are parallelized to improve performance. The pipeline was evaluated using the well-annotated Arabidopsis genome. A total of 98.5, 85.2, and 100% of the reported NBS-encoding genes, membrane associated RLPs and RLKs were validated, respectively. The pipeline was also successfully applied to predict RGAs for 50 sequenced plant genomes. A user-friendly web interface was implemented to ease command line operations, facilitate visualization and simplify result management for multiple datasets. RGAugury is an efficient, integrative bioinformatics tool for large-scale genome-wide identification of RGAs. It is freely available at Bitbucket: https://bitbucket.org/yaanlpc/rgaugury .
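The classification step described above, which assigns a candidate to one of four families from the combination of domains it carries, can be pictured as a small rule table. The Python sketch below is purely illustrative: the abstract does not state the exact decision rules, so the rules, their order, and the function name classify_rga are assumptions and should not be read as the pipeline's actual logic.

```python
def classify_rga(domains):
    """Assign a protein to an RGA family from its set of domain labels.

    The rules below are assumptions made for illustration; the abstract only
    says that families are defined by combinations of domains and motifs.
    """
    d = set(domains)
    if "NB-ARC" in d:
        return "NBS-encoding"
    has_receptor_domain = bool(d & {"LRR", "LysM"})
    if "TM" in d and "STTK" in d and has_receptor_domain:
        return "RLK"      # membrane-associated receptor-like kinase
    if "TM" in d and has_receptor_domain:
        return "RLP"      # membrane-associated receptor-like protein
    if "TM" in d and "CC" in d:
        return "TM-CC"
    return None           # not an RGA candidate under these assumed rules


print(classify_rga(["TIR", "NB-ARC", "LRR"]))  # NBS-encoding
print(classify_rga(["TM", "LRR", "STTK"]))     # RLK
```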
Effective Feature Selection for Classification of Promoter Sequences.
K, Kouser; P G, Lavanya; Rangarajan, Lalitha; K, Acharya Kshitish
2016-01-01
Exploring novel computational methods in making sense of biological data has not only been a necessity, but also productive. A part of this trend is the search for more efficient in silico methods/tools for analysis of promoters, which are parts of DNA sequences that are involved in regulation of expression of genes into other functional molecules. Promoter regions vary greatly in their function based on the sequence of nucleotides and the arrangement of protein-binding short-regions called motifs. In fact, the regulatory nature of the promoters seems to be largely driven by the selective presence and/or the arrangement of these motifs. Here, we explore computational classification of promoter sequences based on the pattern of motif distributions, as such classification can pave a new way of functional analysis of promoters and to discover the functionally crucial motifs. We make use of Position Specific Motif Matrix (PSMM) features for exploring the possibility of accurately classifying promoter sequences using some of the popular classification techniques. The classification results on the complete feature set are low, perhaps due to the huge number of features. We propose two ways of reducing features. Our test results show improvement in the classification output after the reduction of features. The results also show that decision trees outperform SVM (Support Vector Machine), KNN (K Nearest Neighbor) and ensemble classifier LibD3C, particularly with reduced features. The proposed feature selection methods outperform some of the popular feature transformation methods such as PCA and SVD. Also, the methods proposed are as accurate as MRMR (feature selection method) but much faster than MRMR. Such methods could be useful to categorize new promoters and explore regulatory mechanisms of gene expressions in complex eukaryotic species.
Brendolise, Cyril; Espley, Richard V; Lin-Wang, Kui; Laing, William; Peng, Yongyan; McGhie, Tony; Dejnoprat, Supinya; Tomes, Sumathi; Hellens, Roger P; Allan, Andrew C
2017-01-01
In apple, the MYB transcription factor MYB10 controls the accumulation of anthocyanins. MYB10 is able to auto-activate its expression by binding its own promoter at a specific motif, the R1 motif. In some apple accessions a natural mutation, termed R6, has more copies of this motif within the MYB10 promoter, resulting in stronger auto-activation and elevated anthocyanins. Here we show that other anthocyanin-related MYBs selected from apple, pear, strawberry, petunia, kiwifruit and Arabidopsis are able to activate promoters containing the R6 motif. To examine the specificity of this motif, members of the R2R3 MYB family were screened against a promoter harboring the R6 mutation. Only MYBs from subgroups 5 and 6 activate expression by binding the R6 motif, with these MYBs sharing conserved residues in their R2R3 DNA binding domains. Insertion of the apple R6 motif into orthologous promoters of MYB10 in pear (PcMYB10) and Arabidopsis (AtMYB75) elevated anthocyanin levels. Introduction of the R6 motif into the promoter region of an anthocyanin biosynthetic enzyme, F3'5'H of kiwifruit, imparts regulation by MYB10. This results in elevated levels of delphinidin in both tobacco and kiwifruit. Finally, an R6 motif inserted into the promoter of the vitamin C biosynthesis gene GDP-L-Gal phosphorylase increases vitamin C content in a MYB10-dependent manner. This motif therefore provides a tool to re-engineer novel MYB-regulated responses in plants.
Kluth, Marianne; Stindt, Jan; Dröge, Carola; Linnemann, Doris; Kubitz, Ralf; Schmitt, Lutz
2015-01-01
The human multidrug resistance protein 3 (MDR3/ABCB4) belongs to the ubiquitous family of ATP-binding cassette (ABC) transporters and is located in the canalicular membrane of hepatocytes. There it flops the phospholipids of the phosphatidylcholine (PC) family from the inner to the outer leaflet. Here, we report the characterization of wild type MDR3 and the Q1174E mutant, which was identified previously in a patient with progressive familial intrahepatic cholestasis type 3 (PFIC-3). We expressed different variants of MDR3 in the yeast Pichia pastoris, purified the proteins via tandem affinity chromatography, and determined MDR3-specific ATPase activity in the presence or absence of phospholipids. The ATPase activity of wild type MDR3 was stimulated 2-fold by liver PC or 1,2-dioleoyl-sn-glycero-3-phosphatidylethanolamine lipids. Furthermore, the cross-linking of MDR3 with a thiol-reactive fluorophore blocked ATP hydrolysis and exhibited no PC stimulation. Similarly, phosphatidylethanolamine, phosphatidylserine, and sphingomyelin lipids did not induce an increase of wild type MDR3 ATPase activity. The phosphate analogues beryllium fluoride and aluminum fluoride led to complete inhibition of ATPase activity, whereas orthovanadate inhibited exclusively the PC-stimulated ATPase activity of MDR3. The Q1174E mutation is located in the nucleotide-binding domain in direct proximity of the leucine of the ABC signature motif and extended the X loop, which is found in ABC exporters. Our data on the Q1174E mutant demonstrated basal ATPase activity, but PC lipids were incapable of stimulating ATPase activity highlighting the role of the extended X loop in the cross-talk of the nucleotide-binding domain and the transmembrane domain. PMID:25533467
Keilwagen, Jens; Grau, Jan; Paponov, Ivan A; Posch, Stefan; Strickert, Marc; Grosse, Ivo
2011-02-10
Transcription factors are a main component of gene regulation as they activate or repress gene expression by binding to specific binding sites in promoters. The de-novo discovery of transcription factor binding sites in target regions obtained by wet-lab experiments is a challenging problem in computational biology, which has not been fully solved yet. Here, we present a de-novo motif discovery tool called Dispom for finding differentially abundant transcription factor binding sites that models existing positional preferences of binding sites and adjusts the length of the motif in the learning process. Evaluating Dispom, we find that its prediction performance is superior to existing tools for de-novo motif discovery for 18 benchmark data sets with planted binding sites, and for a metazoan compendium based on experimental data from micro-array, ChIP-chip, ChIP-DSL, and DamID as well as Gene Ontology data. Finally, we apply Dispom to find binding sites differentially abundant in promoters of auxin-responsive genes extracted from Arabidopsis thaliana microarray data, and we find a motif that can be interpreted as a refined auxin responsive element predominately positioned in the 250-bp region upstream of the transcription start site. Using an independent data set of auxin-responsive genes, we find in genome-wide predictions that the refined motif is more specific for auxin-responsive genes than the canonical auxin-responsive element. In general, Dispom can be used to find differentially abundant motifs in sequences of any origin. However, the positional distribution learned by Dispom is especially beneficial if all sequences are aligned to some anchor point like the transcription start site in case of promoter sequences. We demonstrate that the combination of searching for differentially abundant motifs and inferring a position distribution from the data is beneficial for de-novo motif discovery. 
Hence, we make the tool freely available as a component of the open-source Java framework Jstacs and as a stand-alone application at http://www.jstacs.de/index.php/Dispom.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Lei; Zhang, Qing; Yang, Yu
Highlights: • RNA recognition motif domains of RBM5 are essential for cell proliferation inhibition. • RNA recognition motif domains of RBM5 are essential for apoptosis induction. • RNA recognition motif domains of RBM5 are essential for RNA binding. • RNA recognition motif domains of RBM5 are essential for caspase-2 alternative splicing. - Abstract: RBM5 is a known putative tumor suppressor gene that has been shown to function in cell growth inhibition by modulating apoptosis. RBM5 also plays a critical role in alternative splicing as an RNA binding protein. However, it is still unclear which domains of RBM5 are required for RNA binding and related functional activities. We hypothesized the two putative RNA recognition motif (RRM) domains of RBM5 spanning from amino acids 98–178 and 231–315 are essential for RBM5-mediated cell growth inhibition, apoptosis regulation, and RNA binding. To investigate this hypothesis, we evaluated the activities of the wild-type and mutant RBM5 gene transfer in low-RBM5 expressing A549 cells. We found that, unlike wild-type RBM5 (RBM5-wt), an RBM5 mutant lacking the two RRM domains (RBM5-ΔRRM) is unable to bind RNA, has compromised caspase-2 alternative splicing activity, and lacks cell proliferation inhibition and apoptosis induction function in A549 cells. These data provide direct evidence that the two RRM domains of RBM5 are required for RNA binding and the RNA binding activity of RBM5 contributes to its function on apoptosis induction and cell growth inhibition.
NASA Astrophysics Data System (ADS)
Parry, Christian S.; Gorski, Jack; Stern, Lawrence J.
2003-03-01
The stable binding of processed foreign peptide to a class II major histocompatibility (MHC) molecule and subsequent presentation to a T cell receptor is a central event in immune recognition and regulation. Polymorphic residues on the floor of the peptide binding site form pockets that anchor peptide side chains. These and other residues in the helical wall of the groove determine the specificity of each allele and define a motif. Allele specific motifs allow the prediction of epitopes from the sequence of pathogens. There are, however, known epitopes that do not satisfy these motifs: anchor motifs are not adequate for predicting epitopes as there are apparently major and minor motifs. We present crystallographic studies into the nature of the interactions that govern the binding of these so-called nonconforming peptides. We would like to understand the role of the P10 pocket and find out whether the peptides that do not obey the consensus anchor motif bind in the canonical conformation observed in prior structures of class II MHC-peptide complexes. HLA-DRB3*0101 complexed with peptide crystallized in unit cell 92.10 x 92.10 x 248.30 (90, 90, 90), P41212, and the diffraction data is reliable to 2.2 Å. We are complementing our studies with dynamical long time simulations to answer these questions, particularly the interplay of the anchor motifs in peptide binding, the range of protein and ligand conformations, and water hydration structures.
The Coding of Biological Information: From Nucleotide Sequence to Protein Recognition
NASA Astrophysics Data System (ADS)
Štambuk, Nikola
The paper reviews the classic results of Swanson, Dayhoff, Grantham, Blalock and Root-Bernstein, which link genetic code nucleotide patterns to the protein structure, evolution and molecular recognition. Symbolic representation of the binary addresses defining particular nucleotide and amino acid properties is discussed, with consideration of: structure and metric of the code, direct correspondence between amino acid and nucleotide information, and molecular recognition of the interacting protein motifs coded by the complementary DNA and RNA strands.
Plesniak, Leigh; Horiuchi, Yuki; Sem, Daniel; Meinenger, David; Stiles, Linda; Shaffer, Jennifer; Jennings, Patricia A; Adams, Joseph A
2002-11-26
EnvZ is a histidine protein kinase important for osmoregulation in bacteria. While structural data are available for this enzyme, the nucleotide binding pocket is not well characterized. The ATP binding domain (EnvZB) was expressed, and its ability to bind nucleotide derivatives was assessed using equilibrium and stopped-flow fluorescence spectroscopy. The fluorescence emission of the trinitrophenyl derivatives, TNP-ATP and TNP-ADP, increases upon binding to EnvZB. The fluorescence enhancements were quantitatively abolished in the presence of excess ADP, indicating that the fluorescent probes occupy the nucleotide binding pocket. Both TNP-ATP and TNP-ADP bind to EnvZB with high affinity (K(d) = 2-3 microM). The TNP moiety attached to the ribose ring does not impede access of the fluorescent nucleotide into the binding pocket. The association rate constant for TNP-ADP is 7 microM(-1) s(-1), a value consistent with those for natural nucleotides and the eucaryotic protein kinases. Using competition experiments, it was found that ATP and ADP bind 30- and 150-fold more poorly, respectively, than the corresponding TNP-derivatized forms. Surprisingly, the physiological metal Mg(2+) is not required for ADP binding and only enhances ATP affinity by 3-fold. Although portions of the nucleotide pocket are disordered, the recombinant enzyme is highly stable, unfolding only at temperatures in excess of 70 degrees C. The unusually high affinity of the TNP derivatives compared to the natural nucleotides suggests that hydrophobic substitutions on the ribose ring enforce an altered binding mode that may be exploited for drug design strategies.
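The figures quoted above imply approximate dissociation constants for the natural nucleotides. The Python sketch below works through that arithmetic under two stated assumptions that are not in the abstract: that "30- and 150-fold more poorly" means a 30- and 150-fold higher K(d), and that binding is a simple one-step process so that k_off = K(d) x k_on. The variable and function names are illustrative.

```python
def scale_range(value_range, factor):
    """Multiply both ends of a (low, high) range by a constant factor."""
    low, high = value_range
    return (low * factor, high * factor)


kd_tnp_uM = (2.0, 3.0)   # reported K(d) range for TNP-ATP and TNP-ADP, in microM
k_on_tnp_adp = 7.0       # reported association rate constant, microM^-1 s^-1

# Assumption: "binds N-fold more poorly" is read as an N-fold higher K(d).
kd_atp_uM = scale_range(kd_tnp_uM, 30)
kd_adp_uM = scale_range(kd_tnp_uM, 150)

# Assumption: one-step binding, so k_off = K(d) * k_on (units: s^-1).
k_off_tnp_adp = scale_range(kd_tnp_uM, k_on_tnp_adp)

print(kd_atp_uM)       # (60.0, 90.0)
print(kd_adp_uM)       # (300.0, 450.0)
print(k_off_tnp_adp)   # (14.0, 21.0)
```

On these assumptions the natural nucleotides would bind with K(d) values in the tens to hundreds of micromolar, which is the gap the abstract describes as unusually large.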
La-related protein 1 (LARP1) binds the mRNA cap, blocking eIF4F assembly on TOP mRNAs
Lahr, Roni M; Fonseca, Bruno D; Ciotti, Gabrielle E; Al-Ashtal, Hiba A; Jia, Jian-Jun; Niklaus, Marius R; Blagden, Sarah P; Alain, Tommy; Berman, Andrea J
2017-01-01
The 5’terminal oligopyrimidine (5’TOP) motif is a cis-regulatory RNA element located immediately downstream of the 7-methylguanosine [m7G] cap of TOP mRNAs, which encode ribosomal proteins and translation factors. In eukaryotes, this motif coordinates the synchronous and stoichiometric expression of the protein components of the translation machinery. La-related protein 1 (LARP1) binds TOP mRNAs, regulating their stability and translation. We present crystal structures of the human LARP1 DM15 region in complex with a 5’TOP motif, a cap analog (m7GTP), and a capped cytidine (m7GpppC), resolved to 2.6, 1.8 and 1.7 Å, respectively. Our binding, competition, and immunoprecipitation data corroborate and elaborate on the mechanism of 5’TOP motif binding by LARP1. We show that LARP1 directly binds the cap and adjacent 5’TOP motif of TOP mRNAs, effectively impeding access of eIF4E to the cap and preventing eIF4F assembly. Thus, LARP1 is a specialized TOP mRNA cap-binding protein that controls ribosome biogenesis. DOI: http://dx.doi.org/10.7554/eLife.24146.001 PMID:28379136
Two nucleotide binding sites modulate [3H]glyburide binding to rat cortex membranes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Johnson, D.E.; Gopalakrishnan, M.; Triggle, D.J.
1991-03-11
The effects of nucleotides on the binding of the ATP-dependent K+-channel antagonist [3H]glyburide (GLB) to rat cortex membranes were examined. Nucleotide triphosphates (NTPs) and nucleotide diphosphates (NDPs) inhibited the binding of GLB. This effect was dependent on the presence of dithiothreitol (DTT). Inhibition of binding by NTPs, with the exception of ATPγS, was dependent on the presence of Mg2+. GLB binding showed a biphasic response to ADP: up to 3 mM, ADP inhibited binding, and above this concentration GLB binding increased rapidly, and was restored to normal levels by 10 mM ADP. In the presence of Mg2+, ADP did not stimulate binding. Saturation analysis in the presence of Mg2+ and increasing concentrations of ADP showed that ADP results primarily in a change of the Bmax for GLB binding. The differential effects of NTPs and NDPs indicate that two nucleotide binding sites regulate GLB binding.
Motif-based analysis of large nucleotide data sets using MEME-ChIP
Ma, Wenxiu; Noble, William S; Bailey, Timothy L
2014-01-01
MEME-ChIP is a web-based tool for analyzing motifs in large DNA or RNA data sets. It can analyze peak regions identified by ChIP-seq, cross-linking sites identified by CLIP-seq and related assays, as well as sets of genomic regions selected using other criteria. MEME-ChIP performs de novo motif discovery, motif enrichment analysis, motif location analysis and motif clustering, providing a comprehensive picture of the DNA or RNA motifs that are enriched in the input sequences. MEME-ChIP performs two complementary types of de novo motif discovery: weight matrix–based discovery for high accuracy; and word-based discovery for high sensitivity. Motif enrichment analysis using DNA or RNA motifs from human, mouse, worm, fly and other model organisms provides even greater sensitivity. MEME-ChIP’s interactive HTML output groups and aligns significant motifs to ease interpretation. This protocol takes less than 3 h, and it provides motif discovery approaches that are distinct and complementary to other online methods. PMID:24853928
Delaunay, Jean-Louis; Bruneau, Alix; Hoffmann, Brice; Durand-Schneider, Anne-Marie; Barbu, Véronique; Jacquemin, Emmanuel; Maurice, Michèle; Housset, Chantal; Callebaut, Isabelle; Aït-Slimane, Tounsia
2017-02-01
ABCB4 (MDR3) is an adenosine triphosphate (ATP)-binding cassette (ABC) transporter expressed at the canalicular membrane of hepatocytes, where it mediates phosphatidylcholine (PC) secretion. Variations in the ABCB4 gene are responsible for several biliary diseases, including progressive familial intrahepatic cholestasis type 3 (PFIC3), a rare disease that can be lethal in the absence of liver transplantation. In this study, we investigated the effect and potential rescue of ABCB4 missense variations that reside in the highly conserved motifs of ABC transporters, involved in ATP binding. Five disease-causing variations in these motifs have been identified in ABCB4 (G535D, G536R, S1076C, S1176L, and G1178S), three of which are homologous to the gating mutations of cystic fibrosis transmembrane conductance regulator (CFTR or ABCC7; i.e., G551D, S1251N, and G1349D), that were previously shown to be function defective and corrected by ivacaftor (VX-770; Kalydeco), a clinically approved CFTR potentiator. Three-dimensional structural modeling predicted that all five ABCB4 variants would disrupt critical interactions in the binding of ATP and thereby impair ATP-induced nucleotide-binding domain dimerization and ABCB4 function. This prediction was confirmed by expression in cell models, which showed that the ABCB4 mutants were normally processed and targeted to the plasma membrane, whereas their PC secretion activity was dramatically decreased. As also hypothesized on the basis of molecular modeling, PC secretion activity of the mutants was rescued by the CFTR potentiator, ivacaftor (VX-770). Disease-causing variations in the ATP-binding sites of ABCB4 cause defects in PC secretion, which can be rescued by ivacaftor. These results provide the first experimental evidence that ivacaftor is a potential therapy for selected patients who harbor mutations in the ATP-binding sites of ABCB4. (Hepatology 2017;65:560-570). 
© 2016 by the American Association for the Study of Liver Diseases.
Comprehensive Human Transcription Factor Binding Site Map for Combinatory Binding Motifs Discovery
Müller-Molina, Arnoldo J.; Schöler, Hans R.; Araúzo-Bravo, Marcos J.
2012-01-01
To know the map between transcription factors (TFs) and their binding sites is essential to reverse engineer the regulation process. Only about 10%–20% of the transcription factor binding motifs (TFBMs) have been reported. This lack of data hinders understanding gene regulation. To address this drawback, we propose a computational method that exploits never used TF properties to discover the missing TFBMs and their sites in all human gene promoters. The method starts by predicting a dictionary of regulatory “DNA words.” From this dictionary, it distills 4098 novel predictions. To disclose the crosstalk between motifs, an additional algorithm extracts TF combinatorial binding patterns creating a collection of TF regulatory syntactic rules. Using these rules, we narrowed down a list of 504 novel motifs that appear frequently in syntax patterns. We tested the predictions against 509 known motifs confirming that our system can reliably predict ab initio motifs with an accuracy of 81%—far higher than previous approaches. We found that on average, 90% of the discovered combinatorial binding patterns target at least 10 genes, suggesting that to control in an independent manner smaller gene sets, supplementary regulatory mechanisms are required. Additionally, we discovered that the new TFBMs and their combinatorial patterns convey biological meaning, targeting TFs and genes related to developmental functions. Thus, among all the possible available targets in the genome, the TFs tend to regulate other TFs and genes involved in developmental functions. We provide a comprehensive resource for regulation analysis that includes a dictionary of “DNA words,” newly predicted motifs and their corresponding combinatorial patterns. Combinatorial patterns are a useful filter to discover TFBMs that play a major role in orchestrating other factors and thus, are likely to lock/unlock cellular functional clusters. PMID:23209563
Kishine, Masahiro; Tsutsumi, Katsuji; Kitta, Kazumi
2017-12-01
Simple sequence repeats (SSRs) are a popular tool for individual fingerprinting. Long-core motif SSRs (e.g. tetra-, penta-, and hexa-nucleotide) are preferred because they make it easier to separate and distinguish neighboring alleles. In the present study, a new set of 8 tetra-nucleotide SSRs in potato (Solanum tuberosum) is reported. By using these 8 markers, 72 out of 76 cultivars obtained from Japan and the United States were clearly discriminated, while two pairs, both of which arose from natural variation, showed identical profiles. The combined probability of identity between two random cultivars for the set of 8 SSR markers was estimated to be 1.10 × 10^-8, confirming the usefulness of the proposed SSR markers for fingerprinting analyses of potato.
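The abstract above does not say how the combined probability of identity was computed. A minimal sketch in Python, assuming the common convention that markers are treated as independent so that the combined value is the product of the per-marker values, is given below. The function name and the per-marker numbers are hypothetical and are not taken from the study.

```python
from math import prod


def combined_probability_of_identity(per_marker_pi):
    """Multiply per-marker probabilities of identity (PI).

    Assumption (not stated in the abstract): markers are independent,
    so the chance that two random cultivars match at every marker is
    the product of the per-marker match probabilities.
    """
    values = list(per_marker_pi)
    for value in values:
        if not 0.0 <= value <= 1.0:
            raise ValueError("each PI must lie between 0 and 1")
    return prod(values)


if __name__ == "__main__":
    # Eight hypothetical markers, each with PI = 0.1, give a combined
    # PI of about 1e-8, the same order of magnitude as the reported value.
    hypothetical_pi = [0.1] * 8
    print(combined_probability_of_identity(hypothetical_pi))
```

Under this assumption each additional informative marker multiplies the combined value by a factor below one, which is why a small panel can reach very low probabilities of identity.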
Doxey, Andrew C; Cheng, Zhenyu; Moffatt, Barbara A; McConkey, Brendan J
2010-08-03
Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB) for coplanar aromatic motifs similar to those found in known glycan-binding proteins. The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO) enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192) in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3D structure.
Mechanism for CARMIL Protein Inhibition of Heterodimeric Actin-capping Protein
Kim, Taekyung; Ravilious, Geoffrey E.; Sept, David; Cooper, John A.
2012-01-01
Capping protein (CP) controls the polymerization of actin filaments by capping their barbed ends. In lamellipodia, CP dissociates from the actin cytoskeleton rapidly, suggesting the possible existence of an uncapping factor, for which the protein CARMIL (capping protein, Arp2/3 and myosin-I linker) is a candidate. CARMIL binds to CP via two motifs. One, the CP interaction (CPI) motif, is found in a number of unrelated proteins; the other motif is unique to CARMILs, the CARMIL-specific interaction motif. A 115-aa fragment of CARMIL with both motifs, termed the CP-binding region (CBR), binds to CP with high affinity, inhibits capping, and causes uncapping. We wanted to understand the structural basis for this function. We used a collection of mutants affecting the actin-binding surface of CP to test the possibility of a steric-blocking model, which remained open because a region of CBR was not resolved in the CBR/CP co-crystal structure. The CP actin-binding mutants bound CBR normally. In addition, a CBR mutant with all residues of the unresolved region changed showed nearly normal binding to CP. Having ruled out a steric blocking model, we tested an allosteric model with molecular dynamics. We found that CBR binding induces changes in the conformation of the actin-binding surface of CP. In addition, ∼30-aa truncations on the actin-binding surface of CP decreased the affinity of CBR for CP. Thus, CARMIL promotes uncapping by binding to a freely accessible site on CP bound to a filament barbed end and inducing a change in the conformation of the actin-binding surface of CP. PMID:22411988
A kinesin-1 binding motif in vaccinia virus that is widespread throughout the human genome
Dodding, Mark P; Mitter, Richard; Humphries, Ashley C; Way, Michael
2011-01-01
Transport of cargoes by kinesin-1 is essential for many cellular processes. Nevertheless, the number of proteins known to recruit kinesin-1 via its cargo binding light chain (KLC) is still quite small. We also know relatively little about the molecular features that define kinesin-1 binding. We now show that a bipartite tryptophan-based kinesin-1 binding motif, originally identified in Calsyntenin is present in A36, a vaccinia integral membrane protein. This bipartite motif in A36 is required for kinesin-1-dependent transport of the virus to the cell periphery. Bioinformatic analysis reveals that related bipartite tryptophan-based motifs are present in over 450 human proteins. Using vaccinia as a surrogate cargo, we show that regions of proteins containing this motif can function to recruit KLC and promote virus transport in the absence of A36. These proteins interact with the kinesin light chain outside the context of infection and have distinct preferences for KLC1 and KLC2. Our observations demonstrate that KLC binding can be conferred by a common set of features that are found in a wide range of proteins associated with diverse cellular functions and human diseases. PMID:21915095
Brendolise, Cyril; Espley, Richard V.; Lin-Wang, Kui; Laing, William; Peng, Yongyan; McGhie, Tony; Dejnoprat, Supinya; Tomes, Sumathi; Hellens, Roger P.; Allan, Andrew C.
2017-01-01
In apple, the MYB transcription factor MYB10 controls the accumulation of anthocyanins. MYB10 is able to auto-activate its expression by binding its own promoter at a specific motif, the R1 motif. In some apple accessions a natural mutation, termed R6, has more copies of this motif within the MYB10 promoter resulting in stronger auto-activation and elevated anthocyanins. Here we show that other anthocyanin-related MYBs selected from apple, pear, strawberry, petunia, kiwifruit and Arabidopsis are able to activate promoters containing the R6 motif. To examine the specificity of this motif, members of the R2R3 MYB family were screened against a promoter harboring the R6 mutation. Only MYBs from subgroups 5 and 6 activate expression by binding the R6 motif, with these MYBs sharing conserved residues in their R2R3 DNA binding domains. Insertion of the apple R6 motif into orthologous promoters of MYB10 in pear (PcMYB10) and Arabidopsis (AtMYB75) elevated anthocyanin levels. Introduction of the R6 motif into the promoter region of an anthocyanin biosynthetic enzyme F3′5′H of kiwifruit imparts regulation by MYB10. This results in elevated levels of delphinidin in both tobacco and kiwifruit. Finally, an R6 motif inserted into the promoter of the vitamin C biosynthesis gene GDP-L-Gal phosphorylase increases vitamin C content in a MYB10-dependent manner. This motif therefore provides a tool to re-engineer novel MYB-regulated responses in plants. PMID:29163590
Kandimalla, Ekambar R; Bhagat, Lakshmi; Zhu, Fu-Gang; Yu, Dong; Cong, Yan-Ping; Wang, Daqing; Tang, Jimmy X; Tang, Jin-Yan; Knetter, Cathrine F; Lien, Egil; Agrawal, Sudhir
2003-11-25
Bacterial and synthetic DNAs containing CpG dinucleotides in specific sequence contexts activate the vertebrate immune system through Toll-like receptor 9 (TLR9). In the present study, we used a synthetic nucleoside with a bicyclic heterobase [1-(2'-deoxy-beta-d-ribofuranosyl)-2-oxo-7-deaza-8-methyl-purine; R] to replace the C in CpG, resulting in an RpG dinucleotide. The RpG dinucleotide was incorporated in mouse- and human-specific motifs in oligodeoxynucleotides (oligos) and 3'-3'-linked oligos, referred to as immunomers. Oligos containing the RpG motif induced cytokine secretion in mouse spleen-cell cultures. Immunomers containing RpG dinucleotides showed activity in transfected-HEK293 cells stably expressing mouse TLR9, suggesting direct involvement of TLR9 in the recognition of the RpG motif. In J774 macrophages, RpG motifs activated NF-kappa B and mitogen-activated protein kinase pathways. Immunomers containing the RpG dinucleotide induced high levels of IL-12 and IFN-gamma, but lower IL-6 in time- and concentration-dependent fashion in mouse spleen-cell cultures costimulated with IL-2. Importantly, immunomers containing GTRGTT and GARGTT motifs were recognized to a similar extent by both mouse and human immune systems. Additionally, both mouse- and human-specific RpG immunomers potently stimulated proliferation of peripheral blood mononuclear cells obtained from diverse vertebrate species, including monkey, pig, horse, sheep, goat, rat, and chicken. An immunomer containing the GTRGTT motif prevented conalbumin-induced and ragweed allergen-induced allergic inflammation in mice. We show that a synthetic bicyclic nucleotide is recognized in the C position of a CpG dinucleotide by immune cells from diverse vertebrate species without bias for flanking sequences, suggesting a divergent nucleotide motif recognition pattern of TLR9.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zoghbi, M. E.; Altenberg, G. A.
The functional unit of ATP-binding cassette (ABC) transporters consists of two transmembrane domains and two nucleotide-binding domains (NBDs). ATP binding elicits association of the two NBDs, forming a dimer in a head-to-tail arrangement, with two nucleotides “sandwiched” at the dimer interface. Each of the two nucleotide-binding sites is formed by residues from the two NBDs. We recently found that the prototypical NBD MJ0796 from Methanocaldococcus jannaschii dimerizes in response to ATP binding and dissociates completely following ATP hydrolysis. However, it is still unknown whether dissociation of NBD dimers follows ATP hydrolysis at one or both nucleotide-binding sites. Here, we used luminescence resonance energy transfer to study heterodimers formed by one active (donor-labeled) and one catalytically defective (acceptor-labeled) NBD. Rapid mixing experiments in a stop-flow chamber showed that NBD heterodimers with one functional and one inactive site dissociated at a rate indistinguishable from that of dimers with two hydrolysis-competent sites. Comparison of the rates of NBD dimer dissociation and ATP hydrolysis indicated that dissociation followed hydrolysis of one ATP. We conclude that ATP hydrolysis at one nucleotide-binding site drives NBD dimer dissociation.
Kirby, Thomas W.; Gassman, Natalie R.; Smith, Cassandra E.; ...
2015-08-25
We have characterized the nuclear localization signal (NLS) of XRCC1 structurally using X-ray crystallography and functionally using fluorescence imaging. Crystallography and binding studies confirm the bipartite nature of the XRCC1 NLS interaction with Importin α (Impα) in which the major and minor binding motifs are separated by >20 residues, and resolve previous inconsistent determinations. Binding studies of peptides corresponding to the bipartite NLS, as well as its major and minor binding motifs, to both wild-type and mutated forms of Impα reveal pronounced cooperative binding behavior that is generated by the proximity effect of the tethered major and minor motifs of the NLS. The cooperativity stems from the increased local concentration of the second motif near its cognate binding site that is a consequence of the stepwise binding behavior of the bipartite NLS. We predict that the stepwise dissociation of the NLS from Impα facilitates unloading by providing a partially complexed intermediate that is available for competitive binding by Nup50 or the Importin β binding domain. This behavior gives a basis for meeting the intrinsically conflicting high affinity and high flux requirements of an efficient nuclear transport system.
Saand, Mumtaz Ali; Xu, You-Ping; Munyampundu, Jean-Pierre; Li, Wen; Zhang, Xuan-Rui; Cai, Xin-Zhong
2015-01-01
Cyclic nucleotide-gated ion channels (CNGCs) are calcium-permeable channels that are involved in various biological functions. Nevertheless, phylogeny and function of plant CNGCs are not well understood. In this study, 333 CNGC genes from 15 plant species were identified using comprehensive bioinformatics approaches. Extensive bioinformatics analyses demonstrated that CNGCs of Group IVa were distinct from those of other groups in gene structure and amino acid sequence of the cyclic nucleotide-binding domain. A CNGC-specific motif that recognizes all identified plant CNGCs was generated. Phylogenetic analysis indicated that CNGC proteins of flowering plant species formed five groups. However, CNGCs of the non-vascular plant Physcomitrella patens clustered only in two groups (IVa and IVb), while those of the vascular non-flowering plant Selaginella moellendorffii gathered in four (IVa, IVb, I and II). These data suggest that Group IV CNGCs are the most ancient and Group III CNGCs are the most recently evolved in flowering plants. Furthermore, silencing analyses revealed that a set of CNGC genes might be involved in disease resistance and abiotic stress responses in tomato, and that the function of SlCNGCs does not correlate with the group to which they belong. Our results indicate that Group IVa CNGCs are structurally but not functionally unique among plant CNGCs. PMID:26546226
Loop L5 Assumes Three Distinct Orientations during the ATPase Cycle of the Mitotic Kinesin Eg5
Muretta, Joseph M.; Behnke-Parks, William M.; Major, Jennifer; Petersen, Karl J.; Goulet, Adeline; Moores, Carolyn A.; Thomas, David D.; Rosenfeld, Steven S.
2013-01-01
Members of the kinesin superfamily of molecular motors differ in several key structural domains, which probably allows these molecular motors to serve the different physiologies required of them. One of the most variable of these is a stem-loop motif referred to as L5. This loop is longest in the mitotic kinesin Eg5, and previous structural studies have shown that it can assume different conformations in different nucleotide states. However, enzymatic domains often consist of a mixture of conformations whose distribution shifts in response to substrate binding or product release, and this information is not available from the “static” images that structural studies provide. We have addressed this issue in the case of Eg5 by attaching a fluorescent probe to L5 and examining its fluorescence, using both steady state and time-resolved methods. This reveals that L5 assumes an equilibrium mixture of three orientations that differ in their local environment and segmental mobility. Combining these studies with transient state kinetics demonstrates that there is a major shift in this distribution during transitions that interconvert weak and strong microtubule binding states. Finally, in conjunction with previous cryo-EM reconstructions of Eg5·microtubule complexes, these fluorescence studies suggest a model in which L5 regulates both nucleotide and microtubule binding through a set of reversible interactions with helix α3. We propose that these features facilitate the production of sustained opposing force by Eg5, which underlies its role in supporting formation of a bipolar spindle in mitosis. PMID:24145034
Knihtila, Ryan; Holzapfel, Genevieve; Weiss, Kevin; Meilleur, Flora; Mattos, Carla
2015-01-01
RAS GTPase is a prototype for nucleotide-binding proteins that function by cycling between GTP and GDP, with hydrogen atoms playing an important role in the GTP hydrolysis mechanism. It is one of the most well studied proteins in the superfamily of small GTPases, which has representatives in a wide range of cellular functions. These proteins share a GTP-binding pocket with highly conserved motifs that promote hydrolysis to GDP. The neutron crystal structure of RAS presented here strongly supports a protonated γ-phosphate at physiological pH. This counters the notion that the phosphate groups of GTP are fully deprotonated at the start of the hydrolysis reaction, which has colored the interpretation of experimental and computational data in studies of the hydrolysis mechanism. The neutron crystal structure presented here puts in question our understanding of the pre-catalytic state associated with the hydrolysis reaction central to the function of RAS and other GTPases. PMID:26515069
Knihtila, Ryan; Holzapfel, Genevieve; Weiss, Kevin; ...
2015-10-29
RAS GTPase is a prototype for nucleotide-binding proteins that function by cycling between GTP and GDP, with hydrogen atoms playing an important role in the GTP hydrolysis mechanism. It is one of the most well studied proteins in the superfamily of small GTPases, which has representatives in a wide range of cellular functions. These proteins share a GTP-binding pocket with highly conserved motifs that promote hydrolysis to GDP. The neutron crystal structure of RAS presented here strongly supports a protonated gamma-phosphate at physiological pH. This counters the notion that the phosphate groups of GTP are fully deprotonated at the start of the hydrolysis reaction, which has colored the interpretation of experimental and computational data in studies of the hydrolysis mechanism. As a result, the neutron crystal structure presented here puts in question our understanding of the pre-catalytic state associated with the hydrolysis reaction central to the function of RAS and other GTPases.
Inforna 2.0: A Platform for the Sequence-Based Design of Small Molecules Targeting Structured RNAs.
Disney, Matthew D; Winkelsas, Audrey M; Velagapudi, Sai Pradeep; Southern, Mark; Fallahi, Mohammad; Childs-Disney, Jessica L
2016-06-17
The development of small molecules that target RNA is challenging yet, if successful, could advance the development of chemical probes to study RNA function or precision therapeutics to treat RNA-mediated disease. Previously, we described Inforna, an approach that can mine motifs (secondary structures) within target RNAs, which is deduced from the RNA sequence, and compare them to a database of known RNA motif-small molecule binding partners. Output generated by Inforna includes the motif found in both the database and the desired RNA target, lead small molecules for that target, and other related meta-data. Lead small molecules can then be tested for binding and affecting cellular (dys)function. Herein, we describe Inforna 2.0, which incorporates all known RNA motif-small molecule binding partners reported in the scientific literature, a chemical similarity searching feature, and an improved user interface and is freely available via an online web server. By incorporation of interactions identified by other laboratories, the database has been doubled, containing 1936 RNA motif-small molecule interactions, including 244 unique small molecules and 1331 motifs. Interestingly, chemotype analysis of the compounds that bind RNA in the database reveals features in small molecule chemotypes that are privileged for binding. Further, this updated database expanded the number of cellular RNAs to which lead compounds can be identified.
Willenborg, Jörg; de Greeff, Astrid; Jarek, Michael; Valentin-Weigand, Peter; Goethe, Ralph
2014-04-01
Streptococcus suis (S. suis) is a neglected zoonotic streptococcus causing fatal diseases in humans and in pigs. The transcriptional regulator CcpA (catabolite control protein A) is involved in the metabolic adaptation to different carbohydrate sources and virulence of S. suis and other pathogenic streptococci. In this study, we determined the DNA binding characteristics of CcpA and identified the CcpA regulon during growth of S. suis. Electrophoretic mobility shift analyses showed promiscuous DNA binding of CcpA to cognate cre sites in vitro. In contrast, sequencing of immunoprecipitated chromatin revealed two specific consensus motifs, a pseudo-palindromic cre motif (WWGAAARCGYTTTCWW) and a novel cre2 motif (TTTTYHWDHHWWTTTY), within the regulatory elements of the genes directly controlled by CcpA. Via these elements CcpA regulates expression of genes involved in carbohydrate uptake and conversion, and in addition in important metabolic pathways of the central carbon metabolism, like glycolysis, mixed-acid fermentation, and the fragmentary TCA cycle. Furthermore, our analyses provide evidence that CcpA regulates the genes of the central carbon metabolism by binding either the pseudo-palindromic cre motif or the cre2 motif in a HPr(Ser)∼P independent conformation. © 2014 John Wiley & Sons Ltd.
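The two consensus motifs quoted above are written with IUPAC ambiguity codes (W = A or T, R = A or G, Y = C or T, and so on). As a minimal sketch of how such a consensus could be scanned against a DNA sequence, the Python below expands each code into a regular-expression character class. This is an illustration only, not the method used in the study; the function names and the test sequence are hypothetical.

```python
import re

# Standard IUPAC nucleotide ambiguity codes as regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]",
    "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
    "N": "[ACGT]",
}

CRE = "WWGAAARCGYTTTCWW"    # pseudo-palindromic cre motif from the abstract
CRE2 = "TTTTYHWDHHWWTTTY"   # cre2 motif from the abstract


def consensus_to_pattern(consensus):
    """Expand an IUPAC consensus string into a regex pattern string."""
    return "".join(IUPAC[base] for base in consensus.upper())


def find_sites(consensus, sequence):
    """Return 0-based start positions of all matches, overlaps included.

    Only the given strand is scanned; a real analysis would also scan
    the reverse complement.
    """
    # A zero-width lookahead lets finditer report overlapping matches.
    lookahead = re.compile("(?=" + consensus_to_pattern(consensus) + ")")
    return [match.start() for match in lookahead.finditer(sequence.upper())]


if __name__ == "__main__":
    # Hypothetical sequence: one cre-like site flanked by two bases each side.
    print(find_sites(CRE, "CCATGAAAGCGCTTTCATGG"))
```

Exact consensus matching of this kind is stricter than the position weight matrices usually derived from ChIP-seq data, so it will miss sites that deviate from the consensus at even one position.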
Recognition of DNA bulges by dinuclear iron(II) metallosupramolecular helicates.
Malina, Jaroslav; Hannon, Michael J; Brabec, Viktor
2014-02-01
Bulged DNA structures are of general biological significance because of their important roles in a number of biochemical processes. Compounds capable of targeting bulged DNA sequences can be used as probes for studying their role in nucleic acid function, or could even have significant therapeutic potential. The interaction of [Fe(2)L(3)](4+) metallosupramolecular helicates (L = C(25)H(20)N(4)) with DNA duplexes containing bulges has been studied by measurement of the DNA melting temperature and gel electrophoresis. This study was aimed at exploring binding affinities of the helicates for DNA bulges of various sizes and nucleotide sequences. The studies reported herein reveal that both enantiomers of [Fe(2)L(3)](4+) bind to DNA bulges containing at least two unpaired nucleotides. In addition, these helicates show considerably enhanced affinity for duplexes containing unpaired pyrimidines in the bulge and/or pyrimidines flanking the bulge on both sides. We suggest that the bulge creates the structural motif, such as the triangular prismatic pocket formed by the unpaired bulge bases, to accommodate the [Fe(2)L(3)](4+) helicate molecule, and is probably responsible for the affinity for duplexes with a varying number of bulge bases. Our results reveal that DNA bulges represent another example of unusual DNA structures recognized by dinuclear iron(II) ([Fe(2)L(3)](4+)) supramolecular helicates. © 2013 FEBS.
ssHMM: extracting intuitive sequence-structure motifs from high-throughput RNA-binding protein data
Krestel, Ralf; Ohler, Uwe; Vingron, Martin; Marsico, Annalisa
2017-01-01
RNA-binding proteins (RBPs) play an important role in RNA post-transcriptional regulation and recognize target RNAs via sequence-structure motifs. The extent to which RNA structure influences protein binding in the presence or absence of a sequence motif is still poorly understood. Existing RNA motif finders either take the structure of the RNA only partially into account, or employ models which are not directly interpretable as sequence-structure motifs. We developed ssHMM, an RNA motif finder based on a hidden Markov model (HMM) and Gibbs sampling which fully captures the relationship between RNA sequence and secondary structure preference of a given RBP. Compared to previous methods which output separate logos for sequence and structure, it directly produces a combined sequence-structure motif when trained on a large set of sequences. ssHMM’s model is visualized intuitively as a graph and facilitates biological interpretation. ssHMM can be used to find novel bona fide sequence-structure motifs of uncharacterized RBPs, such as the one presented here for the YY1 protein. ssHMM reaches a high motif recovery rate on synthetic data, it recovers known RBP motifs from CLIP-Seq data, and scales linearly on the input size, being considerably faster than MEMERIS and RNAcontext on large datasets while being on par with GraphProt. It is freely available on GitHub and as a Docker image. PMID:28977546
Staufen1 dimerizes via a conserved motif and a degenerate dsRNA-binding domain to promote mRNA decay
Gleghorn, Michael L.; Gong, Chenguang; Kielkopf, Clara L.; Maquat, Lynne E.
2014-01-01
Staufen (STAU)1-mediated mRNA decay (SMD) degrades mammalian-cell mRNAs that bind the double-stranded (ds)RNA-binding protein STAU1 in their 3′-untranslated region. We report a new motif, which typifies STAU homologs from all vertebrate classes, that is responsible for human (h)STAU1 homodimerization. Our crystal structure and mutagenesis analyses reveal that this motif, now named the Staufen-swapping motif (SSM), and dsRNA-binding domain 5 (‘RBD’5) mediate protein dimerization: the two SSM α-helices of one molecule interact primarily through a hydrophobic patch with the two ‘RBD’5 α-helices of a second molecule. ‘RBD’5 adopts the canonical α-β-β-β-α fold of a functional RBD, but it lacks residues and features needed to bind duplex RNA. In cells, SSM-mediated hSTAU1 dimerization increases the efficiency of SMD by augmenting hSTAU1 binding to the ATP-dependent RNA helicase hUPF1. Dimerization regulates keratinocyte-mediated wound-healing and, undoubtedly, many other cellular processes. PMID:23524536
Mariani, Luca; Weinand, Kathryn; Vedenko, Anastasia; Barrera, Luis A; Bulyk, Martha L
2017-09-27
Transcription factors (TFs) control cellular processes by binding specific DNA motifs to modulate gene expression. Motif enrichment analysis of regulatory regions can identify direct and indirect TF binding sites. Here, we created a glossary of 108 non-redundant TF-8mer "modules" of shared specificity for 671 metazoan TFs from publicly available and new universal protein binding microarray data. Analysis of 239 ENCODE TF chromatin immunoprecipitation sequencing datasets and associated RNA sequencing profiles suggests the 8mer modules are more precise than position weight matrices in identifying indirect binding motifs and their associated tethering TFs. We also developed GENRE (genomically equivalent negative regions), a tunable tool for construction of matched genomic background sequences for analysis of regulatory regions. GENRE outperformed four state-of-the-art approaches to background sequence construction. We used our TF-8mer glossary and GENRE in the analysis of the indirect binding motifs for the co-occurrence of tethering factors, suggesting novel TF-TF interactions. We anticipate that these tools will aid in elucidating tissue-specific gene-regulatory programs. Copyright © 2017 Elsevier Inc. All rights reserved.
Binding Modes of Teixobactin to Lipid II: Molecular Dynamics Study.
Liu, Yang; Liu, Yaxin; Chan-Park, Mary B; Mu, Yuguang
2017-12-08
Teixobactin (TXB) is a newly discovered antibiotic targeting the bacterial cell wall precursor Lipid II (L II). In the present work, four binding modes of TXB on L II were identified by a contact-map based clustering method. The highly flexible binary complex ensemble was generated by parallel tempering metadynamics simulation in a well-tempered ensemble (PTMetaD-WTE). In agreement with experimental findings, the pyrophosphate group and the attached first sugar subunit of L II are found to be the minimal motif for stable TXB binding. Three of the four binding modes involve the ring structure of TXB and have relatively higher binding affinities, indicating the importance of the ring motif of TXB in L II recognition. TXB-L II complexes with a ratio of 2:1 are also predicted with configurations such that the ring motifs of two TXB molecules bind to the pyrophosphate-MurNAc moiety and the glutamic acid residue of one L II, respectively. Our findings disclose that the ring motif of TXB is critical to L II binding and novel antibiotics can be designed based on its mimetics.
The regulation of integrin function by divalent cations
Zhang, Kun; Chen, JianFeng
2012-01-01
Integrins are a family of α/β heterodimeric adhesion metalloprotein receptors and their functions are highly dependent on and regulated by different divalent cations. Recently advanced studies have revolutionized our perception of integrin metal ion-binding sites and their specific functions. Ligand binding to integrins is bridged by a divalent cation bound at the MIDAS motif on top of either α I domain in I domain-containing integrins or β I domain in α I domain-less integrins. The MIDAS motif in β I domain is flanked by ADMIDAS and SyMBS, the other two crucial metal ion binding sites playing pivotal roles in the regulation of integrin affinity and bidirectional signaling across the plasma membrane. The β-propeller domain of α subunit contains three or four β-hairpin loop-like Ca2+-binding motifs that have essential roles in integrin biogenesis. The function of another Ca2+-binding motif located at the genu of α subunit remains elusive. Here, we provide an overview of the integrin metal ion-binding sites and discuss their roles in the regulation of integrin functions. PMID:22647937
DiGiacomo, Vincent; Marivin, Arthur; Garcia-Marcos, Mikel
2018-01-23
Heterotrimeric G proteins are signal-transducing switches conserved across eukaryotes. In humans, they work as critical mediators of intercellular communication in the context of virtually any physiological process. While G protein regulation by G protein-coupled receptors (GPCRs) is well-established and has received much attention, it has become recently evident that heterotrimeric G proteins can also be activated by cytoplasmic proteins. However, this alternative mechanism of G protein regulation remains far less studied than GPCR-mediated signaling. This Viewpoint focuses on recent advances in the characterization of a group of nonreceptor proteins that contain a sequence dubbed the "Gα-binding and -activating (GBA) motif". So far, four proteins present in mammals [GIV (also known as Girdin), DAPLE, CALNUC, and NUCB2] and one protein in Caenorhabditis elegans (GBAS-1) have been described as possessing a functional GBA motif. The GBA motif confers guanine nucleotide exchange factor activity on Gαi subunits in vitro and activates G protein signaling in cells. The importance of this mechanism of signal transduction is highlighted by the fact that its dysregulation underlies human diseases, such as cancer, which has made the proteins attractive new candidates for therapeutic intervention. Here we discuss recent discoveries on the structural basis of GBA-mediated activation of G proteins and its evolutionary conservation and compare them with the better-studied mechanism mediated by GPCRs.
Sztuba-Solinska, Joanna; Diaz, Larissa; Kumar, Mia R; Kolb, Gaëlle; Wiley, Michael R; Jozwick, Lucas; Kuhn, Jens H; Palacios, Gustavo; Radoshitzky, Sheli R; J Le Grice, Stuart F; Johnson, Reed F
2016-11-16
Ebola virus (EBOV) is a single-stranded negative-sense RNA virus belonging to the Filoviridae family. The leader and trailer non-coding regions of the EBOV genome likely regulate its transcription, replication, and progeny genome packaging. We investigated the cis-acting RNA signals involved in RNA-RNA and RNA-protein interactions that regulate replication of eGFP-encoding EBOV minigenomic RNA and identified heat shock cognate protein family A (HSC70) member 8 (HSPA8) as an EBOV trailer-interacting host protein. Mutational analysis of the trailer HSPA8 binding motif revealed that this interaction is essential for EBOV minigenome replication. Selective 2'-hydroxyl acylation analyzed by primer extension analysis of the secondary structure of the EBOV minigenomic RNA indicates formation of a small stem-loop composed of the HSPA8 motif, a 3' stem-loop (nucleotides 1868-1890) that is similar to a previously identified structure in the replicative intermediate (RI) RNA and a panhandle domain involving a trailer-to-leader interaction. Results of minigenome assays and an EBOV reverse genetic system rescue support a role for both the panhandle domain and HSPA8 motif 1 in virus replication. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.
DIVERSITY in binding, regulation, and evolution revealed from high-throughput ChIP.
Mitra, Sneha; Biswas, Anushua; Narlikar, Leelavati
2018-04-01
Genome-wide in vivo protein-DNA interactions are routinely mapped using high-throughput chromatin immunoprecipitation (ChIP). ChIP-reported regions are typically investigated for enriched sequence-motifs, which are likely to model the DNA-binding specificity of the profiled protein and/or of co-occurring proteins. However, simple enrichment analyses can miss insights into the binding-activity of the protein. Note that ChIP reports regions making direct contact with the protein as well as those binding through intermediaries. For example, consider a ChIP experiment targeting protein X, which binds DNA at its cognate sites, but simultaneously interacts with four other proteins. Each of these proteins also binds to its own specific cognate sites along distant parts of the genome, a scenario consistent with the current view of transcriptional hubs and chromatin loops. Since ChIP will pull down all X-associated regions, the final reported data will be a union of five distinct sets of regions, each containing binding sites of one of the five proteins, respectively. Characterizing all five different motifs and the corresponding sets is important to interpret the ChIP experiment and ultimately, the role of X in regulation. We present DIVERSITY, which attempts exactly this: it partitions the data so that each partition can be characterized with its own de novo motif. DIVERSITY uses a Bayesian approach to identify the optimal number of motifs and the associated partitions, which together explain the entire dataset. This is in contrast to standard motif finders, which report motifs individually enriched in the data, but do not necessarily explain all reported regions. We show that the different motifs and associated regions identified by DIVERSITY give insights into the various complexes that may be forming along the chromatin, something that has so far not been attempted from ChIP data.
Webserver at http://diversity.ncl.res.in/; standalone (Mac OS X/Linux) from https://github.com/NarlikarLab/DIVERSITY/releases/tag/v1.0.0.
Molecular mechanism of ATP binding and ion channel activation in P2X receptors
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hattori, Motoyuki; Gouaux, Eric
P2X receptors are trimeric ATP-activated ion channels permeable to Na+, K+ and Ca2+. The seven P2X receptor subtypes are implicated in physiological processes that include modulation of synaptic transmission, contraction of smooth muscle, secretion of chemical transmitters and regulation of immune responses. Despite the importance of P2X receptors in cellular physiology, the three-dimensional composition of the ATP-binding site, the structural mechanism of ATP-dependent ion channel gating and the architecture of the open ion channel pore are unknown. Here we report the crystal structure of the zebrafish P2X4 receptor in complex with ATP and a new structure of the apo receptor. The agonist-bound structure reveals a previously unseen ATP-binding motif and an open ion channel pore. ATP binding induces cleft closure of the nucleotide-binding pocket, flexing of the lower body β-sheet and a radial expansion of the extracellular vestibule. The structural widening of the extracellular vestibule is directly coupled to the opening of the ion channel pore by way of an iris-like expansion of the transmembrane helices. The structural delineation of the ATP-binding site and the ion channel pore, together with the conformational changes associated with ion channel gating, will stimulate development of new pharmacological agents.
Santamaría-Hernando, Saray; Krell, Tino; Ramos-González, María-Isabel
2012-01-01
Proteins of the animal heme peroxidase (ANP) superfamily differ greatly in size since they have either one or two catalytic domains that match profile PS50292. The orf PP_2561 of Pseudomonas putida KT2440 that we have called PepA encodes a two-domain ANP. The alignment of these domains with those of PepA homologues revealed a variable number of insertions with the consensus G-x-D-G-x-x-[GN]-[TN]-x-D-D. This motif has also been detected in the structure of pseudopilin (pdb 3G20), where it was found to be involved in Ca(2+) coordination although a sequence analysis did not reveal the presence of any known calcium binding motifs in this protein. Isothermal titration calorimetry revealed that a peptide containing this consensus motif bound specifically calcium ions with affinities ranging between 33-79 µM depending on the pH. Microcalorimetric titrations of the purified N-terminal ANP-like domain of PepA revealed Ca(2+) binding with a K(D) of 12 µM and stoichiometry of 1.25 calcium ions per protein monomer. This domain exhibited peroxidase activity after its reconstitution with heme. These data led to the definition of a novel calcium binding motif that we have termed PERCAL and which was abundantly present in animal peroxidase-like domains of bacterial proteins. Bacterial heme peroxidases thus possess two different types of calcium binding motifs, namely PERCAL and the related hemolysin type calcium binding motif, with the latter being located outside the catalytic domains and in their C-terminal end. A phylogenetic tree of ANP-like catalytic domains of bacterial proteins with PERCAL motifs, including single domain peroxidases, was divided into two major clusters, representing domains with and without PERCAL motif containing insertions. We have verified that the recently reported classification of bacterial heme peroxidases in two families (cd09819 and cd09821) is unrelated to these insertions. Sequences matching PERCAL were detected in all kingdoms of life.
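The consensus G-x-D-G-x-x-[GN]-[TN]-x-D-D quoted above is written in a PROSITE-like notation, so it can be turned into a regular expression and used to scan a protein sequence. The following is a minimal sketch of that idea only; the helper names and the example sequence are hypothetical and are not taken from the study, and the authors' actual search procedure is not described in the abstract.

```python
import re

# Consensus as quoted in the abstract ("x" = any residue, [..] = alternatives).
PERCAL_PATTERN = "G-x-D-G-x-x-[GN]-[TN]-x-D-D"


def prosite_to_regex(pattern: str) -> str:
    """Convert a simple dash-separated consensus into a regex string.

    Only the two constructs used in the quoted consensus are handled:
    'x' (any residue) and bracketed alternatives such as [GN].
    """
    return "".join("." if element == "x" else element
                   for element in pattern.split("-"))


def find_motif(sequence: str, pattern: str = PERCAL_PATTERN):
    """Return (0-based start, matched text) for every hit, including overlaps."""
    # The lookahead lets overlapping occurrences be reported as well.
    regex = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    return [(m.start(), m.group(1)) for m in regex.finditer(sequence)]


if __name__ == "__main__":
    # Made-up sequence for illustration only; not a real protein.
    print(find_motif("AAGADGSSGTADDKK"))
```

A plain regex scan of this kind only reports exact matches to the consensus; it says nothing about whether a matching segment actually binds calcium.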
Beusch, Irene; Barraud, Pierre; Moursy, Ahmed; Cléry, Antoine; Allain, Frédéric Hai-Trieu
2017-01-01
HnRNP A1 regulates many alternative splicing events by the recognition of splicing silencer elements. Here, we provide the solution structures of its two RNA recognition motifs (RRMs) in complex with short RNA. In addition, we show by NMR that both RRMs of hnRNP A1 can bind simultaneously to a single bipartite motif of the human intronic splicing silencer ISS-N1, which controls survival of motor neuron exon 7 splicing. RRM2 binds to the upstream motif and RRM1 to the downstream motif. Combining the insights from the structure with in cell splicing assays we show that the architecture and organization of the two RRMs is essential to hnRNP A1 function. The disruption of the inter-RRM interaction or the loss of RNA binding capacity of either RRM impairs splicing repression by hnRNP A1. Furthermore, both binding sites within the ISS-N1 are important for splicing repression and their contributions are cumulative rather than synergistic. DOI: http://dx.doi.org/10.7554/eLife.25736.001 PMID:28650318
Functional interaction of proliferating cell nuclear antigen with MSH2-MSH6 and MSH2-MSH3 complexes.
Clark, A B; Valle, F; Drotschmann, K; Gary, R K; Kunkel, T A
2000-11-24
Eukaryotic DNA mismatch repair requires the concerted action of several proteins, including proliferating cell nuclear antigen (PCNA) and heterodimers of MSH2 complexed with either MSH3 or MSH6. Here we report that MSH3 and MSH6, but not MSH2, contain N-terminal sequence motifs characteristic of proteins that bind to PCNA. MSH3 and MSH6 peptides containing these motifs bound PCNA, as did the intact Msh2-Msh6 complex. This binding was strongly reduced when alanine was substituted for conserved residues in the motif. Yeast strains containing alanine substitutions in the PCNA binding motif of Msh6 or Msh3 had elevated mutation rates, indicating that these interactions are important for genome stability. When human MSH3 or MSH6 peptides containing the PCNA binding motif were added to a human cell extract, mismatch repair activity was inhibited at a step preceding DNA resynthesis. Thus, MSH3 and MSH6 interactions with PCNA may facilitate early steps in DNA mismatch repair and may also be important for other roles of these eukaryotic MutS homologs.
Mapping Hfq-RNA interaction surfaces using tryptophan fluorescence quenching
Robinson, Kirsten E.; Orans, Jillian; Kovach, Alexander R.; Link, Todd M.; Brennan, Richard G.
2014-01-01
Hfq is a posttranscriptional riboregulator and RNA chaperone that binds small RNAs and target mRNAs to effect their annealing and message-specific regulation in response to environmental stressors. Structures of Hfq-RNA complexes indicate that U-rich sequences prefer the proximal face and A-rich sequences the distal face; however, the Hfq-binding sites of most RNAs are unknown. Here, we present an Hfq-RNA mapping approach that uses single tryptophan-substituted Hfq proteins, all of which retain the wild-type Hfq structure, and tryptophan fluorescence quenching (TFQ) by proximal RNA binding. TFQ properly identified the respective distal and proximal binding of A15 and U6 RNA to Gram-negative Escherichia coli (Ec) Hfq and the distal face binding of (AA)3A, (AU)3A and (AC)3A to Gram-positive Staphylococcus aureus (Sa) Hfq. The inability of (GU)3G to bind the distal face of Sa Hfq reveals the (R-L)n binding motif is a more restrictive (A-L)n binding motif. Remarkably Hfq from Gram-positive Listeria monocytogenes (Lm) binds (GU)3G on its proximal face. TFQ experiments also revealed the Ec Hfq (A-R-N)n distal face-binding motif should be redefined as an (A-A-N)n binding motif. TFQ data also demonstrated that the 5′-untranslated region of hfq mRNA binds both the proximal and distal faces of Ec Hfq and the unstructured C-terminus. PMID:24288369
Zimmerman, Matthew D.; Proudfoot, Michael; Yakunin, Alexander; Minor, Wladek
2008-01-01
HD-domain phosphohydrolases have nucleotidase and phosphodiesterase activities and play important roles in the metabolism of nucleotides and in signaling. We present three 2.1 Å resolution crystal structures (one in the free state and two complexed with natural substrates) of a HD-domain phosphohydrolase, the E. coli 5′-nucleotidase YfbR. The free-state structure of YfbR contains a large cavity accommodating the metal-coordinating HD motif (H33, H68, D69, and D137) and other conserved residues (R18, E72, and D77). Alanine scanning mutagenesis confirms that these residues are important for activity. Two structures of the catalytically inactive mutant E72A complexed with Co2+ and either TMP or dAMP disclose the novel binding mode of deoxyribonucleotides in the active site. Residue R18 stabilizes the phosphate on the Co2+, and residue D77 forms a strong hydrogen bond critical for binding the ribose. The indole side chain of W19 is located close to the 2′-carbon atom of the deoxyribose moiety and is proposed to act as the selectivity switch for deoxyribonucleotide, which is supported by comparison to YfdR, another 5′-nucleotidase in E. coli. The nucleotide bases of both dAMP and TMP make no specific hydrogen bonds with the protein, explaining the lack of nucleotide base selectivity. The YfbR E72A substrate complex structures also suggest a plausible single-step nucleophilic substitution mechanism. This is the first proposed molecular mechanism for a HD-domain phosphohydrolase based directly on substrate-bound crystal structures. PMID:18353368
The major resistance gene cluster in lettuce is highly duplicated and spans several megabases.
Meyers, B C; Chin, D B; Shen, K A; Sivaramakrishnan, S; Lavelle, D O; Zhang, Z; Michelmore, R W
1998-01-01
At least 10 Dm genes conferring resistance to the oomycete downy mildew fungus Bremia lactucae map to the major resistance cluster in lettuce. We investigated the structure of this cluster in the lettuce cultivar Diana, which contains Dm3. A deletion breakpoint map of the chromosomal region flanking Dm3 was saturated with a variety of molecular markers. Several of these markers are components of a family of resistance gene candidates (RGC2) that encode a nucleotide binding site and a leucine-rich repeat region. These motifs are characteristic of plant disease resistance genes. Bacterial artificial chromosome clones were identified by using duplicated restriction fragment length polymorphism markers from the region, including the nucleotide binding site-encoding region of RGC2. Twenty-two distinct members of the RGC2 family were characterized from the bacterial artificial chromosomes; at least two additional family members exist. The RGC2 family is highly divergent; the nucleotide identity was as low as 53% between the most distantly related copies. These RGC2 genes span at least 3.5 Mb. Eighteen members were mapped on the deletion breakpoint map. A comparison between the phylogenetic and physical relationships of these sequences demonstrated that closely related copies are physically separated from one another and indicated that complex rearrangements have shaped this region. Analysis of low-copy genomic sequences detected no genes, including RGC2, in the Dm3 region, other than sequences related to retrotransposons and transposable elements. The related but divergent family of RGC2 genes may act as a resource for the generation of new resistance phenotypes through infrequent recombination or unequal crossing over. PMID:9811791
Affinity and specificity of interactions between Nedd4 isoforms and the epithelial Na+ channel.
Henry, Pauline C; Kanelis, Voula; O'Brien, M Christine; Kim, Brian; Gautschi, Ivan; Forman-Kay, Julie; Schild, Laurent; Rotin, Daniela
2003-05-30
The epithelial Na+ channel (αβγENaC) regulates salt and fluid homeostasis and blood pressure. Each ENaC subunit contains a PY motif (PPXY) that binds to the WW domains of Nedd4, a Hect family ubiquitin ligase containing 3-4 WW domains and usually a C2 domain. It has been proposed that Nedd4-2, but not Nedd4-1, isoforms can bind to and suppress ENaC activity. Here we challenge this notion and show that, instead, the presence of a unique WW domain (WW3*) in either Nedd4-2 or Nedd4-1 determines high affinity interactions and the ability to suppress ENaC. WW3* from either Nedd4-2 or Nedd4-1 binds ENaC-PY motifs equally well (e.g. Kd approximately 10 µM for α- or βENaC, 3-6-fold higher affinity than WW4), as determined by intrinsic tryptophan fluorescence. Moreover, dNedd4-1, which naturally contains a WW3* instead of WW2, is able to suppress ENaC function as effectively as Nedd4-2. Homology models of the WW3*.βENaC-PY complex revealed that a Pro and Ala conserved in all WW3*, but not other Nedd4-WW domains, help form the binding pocket for PY motif prolines. Extensive contacts are formed between the βENaC-PY motif and the Pro in WW3*, and the small Ala creates a large pocket to accommodate the peptide. Indeed, mutating the conserved Pro and Ala in WW3* reduces binding affinity 2-3-fold. Additionally, we demonstrate that mutations in PY motif residues that form contacts with the WW domain based on our previously solved structure either abolish or severely reduce binding affinity to the WW domain and that the extent of binding correlates with the level of ENaC suppression. Independently, we show that a peptide encompassing the PY motif of sgk1, previously proposed to bind to Nedd4-2 and alter its ability to regulate ENaC, does not bind (or binds poorly) the WW domains of Nedd4-2. Collectively, these results suggest that high affinity of WW domain-PY-motif interactions rather than affiliation with Nedd4-1/Nedd4-2 is critical for ENaC suppression by Nedd4 proteins.
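To give a feel for what a 3-6-fold difference in affinity means in terms of occupancy, the sketch below evaluates the standard single-site binding isotherm, fraction bound = [L] / ([L] + Kd). It assumes simple 1:1 binding with free ligand in excess, which the abstract does not state; the 10 µM free peptide concentration is an arbitrary illustrative choice, and the WW4 values are derived only from the quoted "3-6-fold" range, not from reported measurements.

```python
def fraction_bound(ligand_um: float, kd_um: float) -> float:
    """Single-site binding isotherm: fraction of domain occupied by ligand.

    Assumes 1:1 binding and that ligand_um is the free ligand concentration.
    """
    return ligand_um / (ligand_um + kd_um)


if __name__ == "__main__":
    kd_ww3_star = 10.0   # µM, approximate value quoted in the abstract
    ligand = 10.0        # µM, arbitrary illustrative free peptide concentration

    # fold = 1 is WW3* itself; 3 and 6 bracket the quoted range for WW4.
    for fold in (1, 3, 6):
        kd = kd_ww3_star * fold
        print(f"Kd = {kd:5.1f} µM -> fraction bound = {fraction_bound(ligand, kd):.3f}")
```

Under these assumptions a domain with a 3-6-fold weaker Kd is occupied roughly half to a third as much as WW3* at the same free peptide concentration.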
Singh, D D; Saikrishnan, K; Kumar, Prashant; Surolia, A; Sekar, K; Vijayan, M
2005-10-01
The crystal structure of a complex of methyl-alpha-D-mannoside with banana lectin from Musa paradisiaca reveals two primary binding sites in the lectin, unlike in other lectins with beta-prism I fold which essentially consists of three Greek key motifs. It has been suggested that the fold evolved through successive gene duplication and fusion of an ancestral Greek key motif. In other lectins, all from dicots, the primary binding site exists on one of the three motifs in the three-fold symmetric molecule. Banana is a monocot, and the three motifs have not diverged enough to obliterate sequence similarity among them. Two Greek key motifs in it carry one primary binding site each. A common secondary binding site exists on the third Greek key. Modelling shows that both the primary sites can support 1-2, 1-3, and 1-6 linked mannosides with the second residue interacting in each case primarily with the secondary binding site. Modelling also readily leads to a bound branched mannopentose with the nonreducing ends of the two branches anchored at the two primary binding sites, providing a structural explanation for the lectin's specificity for branched alpha-mannans. A comparison of the dimeric banana lectin with other beta-prism I fold lectins, provides interesting insights into the variability in their quaternary structure.
Fukutomi, Toshiaki; Takagi, Kenji; Mizushima, Tsunehiro; Ohuchi, Noriaki
2014-01-01
Transcription factor Nrf2 (NF-E2-related factor 2) coordinately regulates cytoprotective gene expression, but under unstressed conditions, Nrf2 is degraded rapidly through Keap1 (Kelch-like ECH-associated protein 1)-mediated ubiquitination. Nrf2 harbors two Keap1-binding motifs, DLG and ETGE. Interactions between these two motifs and Keap1 constitute a key regulatory nexus for cellular Nrf2 activity through the formation of a two-site binding hinge-and-latch mechanism. In this study, we determined the minimum Keap1-binding sequence of the DLG motif, the low-affinity latch site, and defined a new DLGex motif that covers a sequence much longer than that previously defined. We have successfully clarified the crystal structure of the Keap1-DC-DLGex complex at 1.6 Å. DLGex possesses a complicated helix structure, which interprets well the human-cancer-derived loss-of-function mutations in DLGex. In thermodynamic analyses, Keap1-DLGex binding is characterized as enthalpy and entropy driven, while Keap1-ETGE binding is characterized as purely enthalpy driven. In kinetic analyses, Keap1-DLGex binding follows a fast-association and fast-dissociation model, while Keap1-ETGE binding contains a slow-reaction step that leads to a stable conformation. These results demonstrate that the mode of DLGex binding to Keap1 is distinct from that of ETGE structurally, thermodynamically, and kinetically and support our contention that the DLGex motif serves as a converter transmitting environmental stress to Nrf2 induction as the latch site. PMID:24366543
Cherepanov, A V; de Vries, S
2001-01-01
The interaction of nucleotides with T4 DNA and RNA ligases has been characterized using ultraviolet-visible (UV-VIS) absorbance and fluorescence spectroscopy. Both enzymes bind nucleotides with the K(d) between 0.1 and 20 µM. Nucleotide binding results in a decrease of absorbance at 260 nm due to pi-stacking with an aromatic residue, possibly phenylalanine, and causes red-shifting of the absorbance maximum due to hydrogen bonding with the exocyclic amino group. T4 DNA ligase is shown to have, besides the catalytic ATP binding site, another noncovalent nucleotide binding site. ATP bound there alters the pi-stacking of the nucleotide in the catalytic site, increasing its optical extinction. The K(d) for the noncovalent site is approximately 1000-fold higher than for the catalytic site. Nucleotides quench the protein fluorescence showing that a tryptophan residue is located in the active site of the ligase. The decrease of absorbance around 298 nm suggests that the hydrogen bonding interactions of this tryptophan residue are weakened in the ligase-nucleotide complex. The excitation/emission properties of T4 RNA ligase indicate that its ATP binding pocket is in contact with solvent, which is excluded upon binding of the nucleotide. Overall, the spectroscopic analysis reveals important similarities between T4 ligases and related nucleotidyltransferases, despite the low sequence similarity. PMID:11721015
Mabrouk, T; Lemay, G
1994-01-01
It has been demonstrated that the sigma 3 protein of reovirus harbors a zinc-binding domain in its amino-terminal portion. A putative zinc finger in the CCHH form is located in this domain and was considered to be a good candidate for the zinc-binding motif. We performed site-directed mutagenesis to substitute amino acids in this region and demonstrated that many of these mutants, although expressed in COS cells, were unstable compared with the wild-type protein. Further analysis revealed that zinc-binding capability, as measured by retention on a zinc chelate affinity adsorbent, correlates with stability. These studies also allowed us to identify a CCHC box as the most probable zinc-binding motif. PMID:8035527
Modular and configurable optimal sequence alignment software: Cola.
Zamani, Neda; Sundström, Görel; Höppner, Marc P; Grabherr, Manfred G
2014-01-01
The fundamental challenge in optimally aligning homologous sequences is to define a scoring scheme that best reflects the underlying biological processes. Maximising the overall number of matches in the alignment does not always reflect the patterns by which nucleotides mutate. Efficiently implemented algorithms that can be parameterised to accommodate more complex non-linear scoring schemes are thus desirable. We present Cola, alignment software that implements different optimal alignment algorithms, also allowing for scoring contiguous matches of nucleotides in a nonlinear manner. The latter places more emphasis on short, highly conserved motifs, and less on the surrounding nucleotides, which can be more diverged. To illustrate the differences, we report results from aligning 14,100 sequences from 3' untranslated regions of human genes to 25 of their mammalian counterparts, where we found that a nonlinear scoring scheme is more consistent than a linear scheme in detecting short, conserved motifs. Cola is freely available under LGPL from https://github.com/nedaz/cola.
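The idea of scoring contiguous matches nonlinearly can be illustrated with a small global-alignment dynamic program. This is a sketch under stated assumptions and not Cola's algorithm, scoring function, or API: here the k-th consecutive match in a run is assumed to score min(k, cap), so an uninterrupted run of L matches earns more than L isolated matches, and setting cap=1 recovers ordinary linear match scoring. The function name, the mismatch and gap penalties, and the cap are all hypothetical choices.

```python
def align_score(a: str, b: str, cap: int = 4,
                mismatch: float = -1, gap: float = -2) -> float:
    """Optimal global alignment score with run-length-dependent match scores.

    best[i][j][r] is the best score for aligning a[:i] with b[:j] such that
    the alignment ends in a run of r consecutive matches (r is capped at
    `cap`); r == 0 means the last column is a gap or a mismatch.
    """
    neg = float("-inf")
    n, m = len(a), len(b)
    best = [[[neg] * (cap + 1) for _ in range(m + 1)] for _ in range(n + 1)]
    best[0][0][0] = 0

    for i in range(n + 1):
        for j in range(m + 1):
            if i == 0 and j == 0:
                continue
            cell = best[i][j]
            if i > 0:  # gap in b: any run is broken
                cell[0] = max(cell[0], max(best[i - 1][j]) + gap)
            if j > 0:  # gap in a: any run is broken
                cell[0] = max(cell[0], max(best[i][j - 1]) + gap)
            if i > 0 and j > 0:
                prev = best[i - 1][j - 1]
                if a[i - 1] == b[j - 1]:
                    # Extend each reachable run; the reward grows with run length.
                    for r in range(cap + 1):
                        if prev[r] == neg:
                            continue
                        new_r = min(r + 1, cap)
                        cell[new_r] = max(cell[new_r], prev[r] + new_r)
                else:
                    cell[0] = max(cell[0], max(prev) + mismatch)
    return max(best[n][m])


if __name__ == "__main__":
    print(align_score("ACGT", "ACGT", cap=4))  # nonlinear: 1 + 2 + 3 + 4
    print(align_score("ACGT", "ACGT", cap=1))  # linear: 1 per match
```

Tracking the capped run length as an extra DP state keeps the result optimal for this particular scoring rule, at the cost of a factor of (cap + 1) in time and memory relative to standard Needleman-Wunsch.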
The Runt domain of AML1 (RUNX1) binds a sequence-conserved RNA motif that mimics a DNA element.
Fukunaga, Junichi; Nomura, Yusuke; Tanaka, Yoichiro; Amano, Ryo; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Sakamoto, Taiichi; Kozu, Tomoko
2013-07-01
AML1 (RUNX1) is a key transcription factor for hematopoiesis that binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. Aberrations in the AML1 gene are frequently found in human leukemia. To better understand AML1 and its potential utility for diagnosis and therapy, we obtained RNA aptamers that bind specifically to the AML1 Runt domain. Enzymatic probing and NMR analyses revealed that Apt1-S, which is a truncated variant of one of the aptamers, has a CACG tetraloop and two stem regions separated by an internal loop. All the isolated aptamers were found to contain the conserved sequence motif 5'-NNCCAC-3' and 5'-GCGMGN'N'-3' (M:A or C; N and N' form Watson-Crick base pairs). The motif contains one AC mismatch and one base bulged out. Mutational analysis of Apt1-S showed that three guanines of the motif are important for Runt binding as are the three guanines of RDE, which are directly recognized by three arginine residues of the Runt domain. Mutational analyses of the Runt domain revealed that the amino acid residues used for Apt1-S binding were similar to those used for RDE binding. Furthermore, the aptamer competed with RDE for binding to the Runt domain in vitro. These results demonstrated that the Runt domain of the AML1 protein binds to the motif of the aptamer that mimics DNA. Our findings should provide new insights into RNA function and utility in both basic and applied sciences.
The Runt domain of AML1 (RUNX1) binds a sequence-conserved RNA motif that mimics a DNA element
Fukunaga, Junichi; Nomura, Yusuke; Tanaka, Yoichiro; Amano, Ryo; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Sakamoto, Taiichi; Kozu, Tomoko
2013-01-01
AML1 (RUNX1) is a key transcription factor for hematopoiesis that binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. Aberrations in the AML1 gene are frequently found in human leukemia. To better understand AML1 and its potential utility for diagnosis and therapy, we obtained RNA aptamers that bind specifically to the AML1 Runt domain. Enzymatic probing and NMR analyses revealed that Apt1-S, which is a truncated variant of one of the aptamers, has a CACG tetraloop and two stem regions separated by an internal loop. All the isolated aptamers were found to contain the conserved sequence motif 5′-NNCCAC-3′ and 5′-GCGMGN′N′-3′ (M:A or C; N and N′ form Watson–Crick base pairs). The motif contains one AC mismatch and one base bulged out. Mutational analysis of Apt1-S showed that three guanines of the motif are important for Runt binding as are the three guanines of RDE, which are directly recognized by three arginine residues of the Runt domain. Mutational analyses of the Runt domain revealed that the amino acid residues used for Apt1-S binding were similar to those used for RDE binding. Furthermore, the aptamer competed with RDE for binding to the Runt domain in vitro. These results demonstrated that the Runt domain of the AML1 protein binds to the motif of the aptamer that mimics DNA. Our findings should provide new insights into RNA function and utility in both basic and applied sciences. PMID:23709277
Cheatle Jarvela, Alys M.; Brubaker, Lisa; Vedenko, Anastasia; Gupta, Anisha; Armitage, Bruce A.; Bulyk, Martha L.; Hinman, Veronica F.
2014-01-01
Gene regulatory networks (GRNs) describe the progression of transcriptional states that take a single-celled zygote to a multicellular organism. It is well documented that GRNs can evolve extensively through mutations to cis-regulatory modules (CRMs). Transcription factor proteins that bind these CRMs may also evolve to produce novelty. Coding changes are considered to be rarer, however, because transcription factors are multifunctional and hence are more constrained to evolve in ways that will not produce widespread detrimental effects. Recent technological advances have unearthed a surprising variation in DNA-binding abilities, such that individual transcription factors may recognize both a preferred primary motif and an additional secondary motif. This provides a source of modularity in function. Here, we demonstrate that orthologous transcription factors can also evolve a changed preference for a secondary binding motif, thereby offering an unexplored mechanism for GRN evolution. Using protein-binding microarray, surface plasmon resonance, and in vivo reporter assays, we demonstrate an important difference in DNA-binding preference between Tbrain protein orthologs in two species of echinoderms, the sea star, Patiria miniata, and the sea urchin, Strongylocentrotus purpuratus. Although both orthologs recognize the same primary motif, only the sea star Tbr also has a secondary binding motif. Our in vivo assays demonstrate that this difference may allow for greater evolutionary change in timing of regulatory control. This uncovers a layer of transcription factor binding divergence that could exist for many pairs of orthologs. We hypothesize that this divergence provides modularity that allows orthologous transcription factors to evolve novel roles in GRNs through modification of binding to secondary sites. PMID:25016582
Transient α-helices in the disordered RPEL motifs of the serum response factor coactivator MKL1
NASA Astrophysics Data System (ADS)
Mizuguchi, Mineyuki; Fuju, Takahiro; Obita, Takayuki; Ishikawa, Mitsuru; Tsuda, Masaaki; Tabuchi, Akiko
2014-06-01
The megakaryoblastic leukemia 1 (MKL1) protein functions as a transcriptional coactivator of the serum response factor. MKL1 has three RPEL motifs (RPEL1, RPEL2, and RPEL3) in its N-terminal region. MKL1 binds to monomeric G-actin through RPEL motifs, and the dissociation of MKL1 from G-actin promotes the translocation of MKL1 to the nucleus. Although structural data are available for RPEL motifs of MKL1 in complex with G-actin, the structural characteristics of RPEL motifs in the free state have been poorly defined. Here we characterized the structures of free RPEL motifs using NMR and CD spectroscopy. NMR and CD measurements showed that free RPEL motifs are largely unstructured in solution. However, NMR analysis identified transient α-helices in the regions where helices α1 and α2 are induced upon binding to G-actin. Proline mutagenesis showed that the transient α-helices are locally formed without helix-helix interactions. The helix content is higher in the order of RPEL1, RPEL2, and RPEL3. The amount of preformed structure may correlate with the binding affinity between the intrinsically disordered protein and its target molecule.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tang, K.H.; /Ohio State U.; Niebuhr, M.
2009-04-30
We report small-angle X-ray scattering (SAXS) and sedimentation velocity (SV) studies on the enzyme-DNA complexes of rat DNA polymerase β (Pol β) and African swine fever virus DNA polymerase X (ASFV Pol X) with one-nucleotide gapped DNA. The results indicated formation of a 2:1 Pol β-DNA complex, whereas only 1:1 Pol X-DNA complex was observed. Three-dimensional structural models for the 2:1 Pol β-DNA and 1:1 Pol X-DNA complexes were generated from the SAXS experimental data to correlate with the functions of the DNA polymerases. The former indicates interactions of the 8 kDa 5′-dRP lyase domain of the second Pol β molecule with the active site of the 1:1 Pol β-DNA complex, while the latter demonstrates how ASFV Pol X binds DNA in the absence of DNA-binding motif(s). As ASFV Pol X has no 5′-dRP lyase domain, it is reasonable not to form a 2:1 complex. Based on the enhanced activities of the 2:1 complex and the observation that the 8 kDa domain is not in an optimal configuration for the 5′-dRP lyase reaction in the crystal structures of the closed ternary enzyme-DNA-dNTP complexes, we propose that the asymmetric 2:1 Pol β-DNA complex enhances the function of Pol β.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Biswas, Shyamasri; Buhrman, Greg; Gagnon, Keith
2012-07-11
Box C/D ribonucleoproteins (RNP) guide the 2'-O-methylation of targeted nucleotides in archaeal and eukaryotic rRNAs. The archaeal L7Ae and eukaryotic 15.5kD box C/D RNP core protein homologues initiate RNP assembly by recognizing kink-turn (K-turn) motifs. The crystal structure of the 15.5kD core protein from the primitive eukaryote Giardia lamblia is described here to a resolution of 1.8 Å. The Giardia 15.5kD protein exhibits the typical α-β-α sandwich fold exhibited by both archaeal L7Ae and eukaryotic 15.5kD proteins. Characteristic of eukaryotic homologues, the Giardia 15.5kD protein binds the K-turn motif but not the variant K-loop motif. The highly conserved residues of loop 9, critical for RNA binding, also exhibit conformations similar to those of the human 15.5kD protein when bound to the K-turn motif. However, comparative sequence analysis indicated a distinct evolutionary position between Archaea and Eukarya. Indeed, assessment of the Giardia 15.5kD protein in denaturing experiments demonstrated an intermediate stability in protein structure when compared with that of the eukaryotic mouse 15.5kD and archaeal Methanocaldococcus jannaschii L7Ae proteins. Most notable was the ability of the Giardia 15.5kD protein to assemble in vitro a catalytically active chimeric box C/D RNP utilizing the archaeal M. jannaschii Nop56/58 and fibrillarin core proteins. In contrast, a catalytically competent chimeric RNP could not be assembled using the mouse 15.5kD protein. Collectively, these analyses suggest that the G. lamblia 15.5kD protein occupies a unique position in the evolution of this box C/D RNP core protein retaining structural and functional features characteristic of both archaeal L7Ae and higher eukaryotic 15.5kD homologues.
Seo, Min-Duk; Park, Sung Jean; Kim, Hyun-Jung; Lee, Bong Jin
2007-01-09
Epstein-Barr virus latency is maintained by the latent membrane protein (LMP) 2A, which mimics the B-cell receptor (BCR) and perturbs BCR signaling. The cytoplasmic N-terminal domain of LMP2A is composed of 119 amino acids. The N-terminal domain of LMP2A (LMP2A NTD) contains two PY motifs (PPPPY) that interact with the WW domains of Nedd4 family ubiquitin-protein ligases. Based on our analysis of NMR data, we found that the LMP2A NTD adopts an overall random-coil structure in its native state. However, the region between residues 60 and 90 was relatively ordered, and seemed to form the hydrophobic core of the LMP2A NTD. This region resides between two PY motifs and is important for WW domain binding. Mapping of the residues involved in the interaction between the LMP2A NTD and WW domains was achieved by chemical shift perturbation, by the addition of WW2 and WW3 peptides. Interestingly, the binding of the WW domains mainly occurred in the hydrophobic core of the LMP2A NTD. In addition, we detected a difference in the binding modes of the two PY motifs against the two WW peptides. The binding of the WW3 peptide caused the resonances of five residues (Tyr(60), Glu(61), Asp(62), Trp(65), and Gly(66)) just behind the N-terminal PY motif of the LMP2A NTD to disappear. A similar result was obtained with WW2 binding. However, near the C-terminal PY motif, the chemical shift perturbation caused by WW2 binding was different from that due to WW3 binding, indicating that the residues near the PY motifs are involved in selective binding of WW domains. The present work represents the first structural study of the LMP2A NTD and provides fundamental structural information about its interaction with ubiquitin-protein ligase.
A robust methodology to subclassify pseudokinases based on their nucleotide-binding properties
Murphy, James M.; Zhang, Qingwei; Young, Samuel N.; Reese, Michael L.; Bailey, Fiona P.; Eyers, Patrick A.; Ungureanu, Daniela; Hammaren, Henrik; Silvennoinen, Olli; Varghese, Leila N.; Chen, Kelan; Tripaydonis, Anne; Jura, Natalia; Fukuda, Koichi; Qin, Jun; Nimchuk, Zachary; Mudgett, Mary Beth; Elowe, Sabine; Gee, Christine L.; Liu, Ling; Daly, Roger J.; Manning, Gerard; Babon, Jeffrey J.; Lucet, Isabelle S.
2017-01-01
Protein kinase-like domains that lack conserved residues known to catalyse phosphoryl transfer, termed pseudokinases, have emerged as important signalling domains across all kingdoms of life. Although predicted to function principally as catalysis-independent protein-interaction modules, several pseudokinase domains have been attributed unexpected catalytic functions, often amid controversy. We established a thermal-shift assay as a benchmark technique to define the nucleotide-binding properties of kinase-like domains. Unlike in vitro kinase assays, this assay is insensitive to the presence of minor quantities of contaminating kinases that may otherwise lead to incorrect attribution of catalytic functions to pseudokinases. We demonstrated the utility of this method by classifying 31 diverse pseudokinase domains into four groups: devoid of detectable nucleotide or cation binding; cation-independent nucleotide binding; cation binding; and nucleotide binding enhanced by cations. Whereas nine pseudokinases bound ATP in a divalent cation-dependent manner, over half of those examined did not detectably bind nucleotides, illustrating that pseudokinase domains predominantly function as non-catalytic protein-interaction modules within signalling networks and that only a small subset is potentially catalytically active. We propose that henceforth the thermal-shift assay be adopted as the standard technique for establishing the nucleotide-binding and catalytic potential of kinase-like domains. PMID:24107129
Carlini, Leslie E; Getz, Michael J; Strauch, Arthur R; Kelm, Robert J
2002-03-08
An asymmetric polypurine-polypyrimidine cis-element located in the 5' region of the mouse vascular smooth muscle alpha-actin gene serves as a binding site for multiple proteins with specific affinity for either single- or double-stranded DNA. Here, we test the hypothesis that single-stranded DNA-binding proteins are responsible for preventing a cryptic MCAT enhancer centered within this element from cooperating with a nearby serum response factor-interacting CArG motif to trans-activate the minimal promoter in fibroblasts and smooth muscle cells. DNA binding studies revealed that the core MCAT sequence mediates binding of transcription enhancer factor-1 to the double-stranded polypurine-polypyrimidine element while flanking nucleotides account for interaction of Pur alpha and Pur beta with the purine-rich strand and MSY1 with the complementary pyrimidine-rich strand. Mutations that selectively impaired high affinity single-stranded DNA binding by fibroblast or smooth muscle cell-derived Pur alpha, Pur beta, and MSY1 in vitro, released the cryptic MCAT enhancer from repression in transfected cells. Additional experiments indicated that Pur alpha, Pur beta, and MSY1 also interact specifically, albeit weakly, with double-stranded DNA and with transcription enhancer factor-1. These results are consistent with two plausible models of cryptic MCAT enhancer regulation by Pur alpha, Pur beta, and MSY1 involving either competitive single-stranded DNA binding or masking of MCAT-bound transcription enhancer factor-1.
Loimaranta, Vuokko; Hytönen, Jukka; Pulliainen, Arto T.; Sharma, Ashu; Tenovuo, Jorma; Strömberg, Nicklas; Finne, Jukka
2009-01-01
Scavenger receptors are innate immune molecules recognizing and inducing the clearance of non-host as well as modified host molecules. To recognize a wide pattern of invading microbes, many scavenger receptors bind to common pathogen-associated molecular patterns, such as lipopolysaccharides and lipoteichoic acids. Similarly, the gp340/DMBT1 protein, a member of the human scavenger receptor cysteine-rich protein family, displays a wide ligand repertoire. The peptide motif VEVLXXXXW derived from its scavenger receptor cysteine-rich domains is involved in some of these interactions, but most of the recognition mechanisms are unknown. In this study, we used mass spectrometry sequencing, gene inactivation, and recombinant proteins to identify Streptococcus pyogenes protein Spy0843 as a recognition receptor of gp340. Antibodies against Spy0843 are shown to protect against S. pyogenes infection, but no function or host receptor have been identified for the protein. Spy0843 belongs to the leucine-rich repeat (Lrr) family of eukaryotic and prokaryotic proteins. Experiments with truncated forms of the recombinant proteins confirmed that the Lrr region is needed in the binding of Spy0843 to gp340. The same motif of two other Lrr proteins, LrrG from the Gram-positive S. agalactiae and BspA from the Gram-negative Tannerella forsythia, also mediated binding to gp340. Moreover, inhibition of Spy0843 binding occurred with peptides containing the VEVLXXXXW motif, but also peptides devoid of the XXXXW motif inhibited binding of Lrr proteins. These results thus suggest that the conserved Lrr motif in bacterial proteins serves as a novel pattern recognition motif for unique core peptides of human scavenger receptor gp340. PMID:19465482
Broitman, S; Amosova, O; Dolinnaya, N G; Fresco, J R
1999-07-30
A DNA third strand with a 3'-psoralen substituent was designed to form a triplex with the sequence downstream of the T.A mutant base pair of the human sickle cell beta-globin gene. Triplex-mediated psoralen modification of the mutant T residue was sought as an approach to gene repair. The 24-nucleotide purine-rich target sequence switches from one strand to the other and has four pyrimidine interruptions. Therefore, a third strand sequence favorable to two triplex motifs was used, one parallel and the other antiparallel to it. To cope with the pyrimidine interruptions, which weaken third strand binding, 5-methylcytosine and 5-propynyluracil were used in the third strand. Further, a six residue "hook" complementary to an overhang of a linear duplex target was added to the 5'-end of the third strand via a T(4) linker. In binding to the overhang by Watson-Crick pairing, the hook facilitates triplex formation. This third strand also binds specifically to the target within a supercoiled plasmid. The psoralen moiety at the 3'-end of the third strand forms photoadducts to the targeted T with high efficiency. Such monoadducts are known to preferentially trigger reversion of the mutation by DNA repair enzymes.
Lemloh, Marie-Louise; Altintoprak, Klara; Wege, Christina; Weiss, Ingrid M; Rothenstein, Dirk
2017-01-28
Proteins regulate diverse biological processes by the specific interaction with, e.g., nucleic acids, proteins and inorganic molecules. The generation of inorganic hybrid materials, such as shell formation in mollusks, is a protein-controlled mineralization process. Moreover, inorganic-binding peptides are attractive for the bioinspired mineralization of non-natural inorganic functional materials for technical applications. However, it is still challenging to identify mineral-binding peptide motifs from biological systems as well as for technical systems. Here, three complementary approaches were combined to analyze protein motifs consisting of alternating positively and negatively charged amino acids: (i) the screening of natural biomineralization proteins; (ii) the selection of inorganic-binding peptides derived from phage display; and (iii) the mineralization of tobacco mosaic virus (TMV)-based templates. A respective peptide motif displayed on the TMV surface had a major impact on the SiO₂ mineralization. In addition, similar motifs were found in zinc oxide- and zirconia-binding peptides indicating a general binding feature. The comparative analysis presented here raises new questions regarding whether or not there is a common design principle based on acidic and basic amino acids for peptides interacting with minerals.
Jørgensen, Casper Møller; Fields, Christopher J.; Chander, Preethi; Watt, Desmond; Burgner, John W.; Smith, Janet L.; Switzer, Robert L.
2011-01-01
The PyrR protein regulates expression of pyrimidine biosynthetic (pyr) genes in many bacteria. PyrR binds to specific sites in the 5' leader RNA of target operons and favors attenuation of transcription. Filter binding and gel mobility assays were used to characterize the binding of PyrR from Bacillus caldolyticus to RNA sequences (binding loops) from the three attenuation regions of the B. caldolyticus pyr operon. Binding of PyrR to the three binding loops and modulation of RNA binding by nucleotides was similar for all three RNAs. Apparent dissociation constants at 0 °C ranged from 0.13 to 0.87 nM in the absence of effectors; dissociation constants were decreased 3- to 12-fold by uridine nucleotides and increased 40- to 200-fold by guanosine nucleotides. The binding data suggest that pyr operon expression is regulated by the ratio of intracellular uridine nucleotides to guanosine nucleotides; the effects of nucleoside addition to the growth medium on aspartate transcarbamylase (pyrB) levels in B. subtilis cells in vivo supported this conclusion. Analytical ultracentrifugation established that RNA binds to dimeric PyrR, even though the tetrameric form of unbound PyrR predominates in solution at the concentrations studied. PMID:18190533
Methods for decoding Cas9 protospacer adjacent motif (PAM) sequences: A brief overview.
Karvelis, Tautvydas; Gasiunas, Giedrius; Siksnys, Virginijus
2017-05-15
Recently, Cas9, an RNA-guided DNA endonuclease, emerged as a powerful tool for targeted genome manipulation. The Cas9 protein can be reprogrammed to cleave, bind or nick any DNA target simply by changing the crRNA sequence; however, a short nucleotide sequence, termed the PAM, is required to initiate crRNA hybridization to the DNA target. The PAM sequence is recognized by the Cas9 protein and must be determined experimentally for each Cas9 variant. Exploration of Cas9 orthologs could offer a diversity of PAM sequences and novel biochemical properties that may be beneficial for genome editing applications. Here we briefly review and compare Cas9 PAM identification assays that can be adopted for other PAM-dependent CRISPR-Cas systems.
Structural basis for genome wide recognition of 5-bp GC motifs by SMAD transcription factors.
Martin-Malpartida, Pau; Batet, Marta; Kaczmarska, Zuzanna; Freier, Regina; Gomes, Tiago; Aragón, Eric; Zou, Yilong; Wang, Qiong; Xi, Qiaoran; Ruiz, Lidia; Vea, Angela; Márquez, José A; Massagué, Joan; Macias, Maria J
2017-12-12
Smad transcription factors activated by TGF-β or by BMP receptors form trimeric complexes with Smad4 to target specific genes for cell fate regulation. The CAGAC motif has been considered as the main binding element for Smad2/3/4, whereas Smad1/5/8 have been thought to preferentially bind GC-rich elements. However, chromatin immunoprecipitation analysis in embryonic stem cells showed extensive binding of Smad2/3/4 to GC-rich cis-regulatory elements. Here, we present the structural basis for specific binding of Smad3 and Smad4 to GC-rich motifs in the goosecoid promoter, a nodal-regulated differentiation gene. The structures revealed a 5-bp consensus sequence GGC(GC)|(CG) as the binding site for both TGF-β and BMP-activated Smads and for Smad4. These 5GC motifs are highly represented as clusters in Smad-bound regions genome-wide. Our results provide a basis for understanding the functional adaptability of Smads in different cellular contexts, and their dependence on lineage-determining transcription factors to target specific genes in TGF-β and BMP pathways.
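The 5-bp consensus described in this abstract lends itself to a simple sequence scan. The following is a minimal sketch, not the authors' method: it assumes that the notation GGC(GC)|(CG) means the two sites GGCGC and GGCCG, and the function names are hypothetical. It reports matches on both strands using forward-strand coordinates.

```python
import re

# Assumption: "GGC(GC)|(CG)" is read as the two 5-bp sites GGCGC and GGCCG.
# The lookahead lets overlapping sites be reported.
FIVE_GC = re.compile(r"(?=(GGCGC|GGCCG))")

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase/lowercase DNA string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_5gc_sites(seq: str):
    """Return sorted (start, strand, site) tuples.

    start is 0-based and always refers to the forward strand.
    """
    seq = seq.upper()
    n = len(seq)
    hits = [(m.start(), "+", m.group(1)) for m in FIVE_GC.finditer(seq)]
    for m in FIVE_GC.finditer(reverse_complement(seq)):
        # Map the start on the reverse strand back to forward coordinates.
        hits.append((n - m.start() - 5, "-", m.group(1)))
    return sorted(hits)


if __name__ == "__main__":
    for hit in find_5gc_sites("AAGGCGCTTCGGCCAA"):
        print(hit)
```

A scan like this only locates candidate sites; it says nothing about occupancy, which in the abstract is established by structures and chromatin immunoprecipitation.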
Seong, Ki Moon; Park, Hweon; Kim, Seong Jung; Ha, Hyo Nam; Lee, Jae Yung; Kim, Joon
2007-06-01
A yeast transcriptional activator, Gcn4p, induces the expression of genes that are involved in amino acid and purine biosynthetic pathways under amino acid starvation. Gcn4p has an acidic activation domain in the central region and a bZIP domain in the C-terminus that is divided into the DNA-binding motif and the dimerization leucine zipper motif. In order to identify amino acids in the DNA-binding motif of Gcn4p that are involved in transcriptional activation, we constructed mutant libraries in the DNA-binding motif through an innovative application of random mutagenesis. A mutant library made from oligonucleotides that were mutated randomly according to the Poisson distribution showed an actual mutation frequency in good agreement with expected values. This method could save the time and effort needed to create a mutant library with a predictable mutation frequency. Based on the studies using the mutant libraries constructed by the new method, specific residues of the DNA-binding domain in Gcn4p appear to be involved in the transcriptional activities on a conserved binding site.
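The expected mutation frequencies mentioned above can be illustrated with the Poisson probability mass function. This is a minimal sketch under stated assumptions: the oligonucleotide length and per-base mutation rate below are hypothetical placeholders, not values from the study, and the function names are illustrative.

```python
import math


def poisson_pmf(k: int, mean: float) -> float:
    """Probability of exactly k events for a Poisson distribution with the given mean."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


def expected_library_composition(oligo_length: int, per_base_rate: float,
                                 max_mutations: int = 5) -> dict:
    """Expected fraction of clones carrying 0..max_mutations substitutions.

    Assumes independent substitutions, so the mean number of mutations
    per oligonucleotide is oligo_length * per_base_rate.
    """
    mean = oligo_length * per_base_rate
    return {k: poisson_pmf(k, mean) for k in range(max_mutations + 1)}


if __name__ == "__main__":
    # Hypothetical example: a 30-nt mutagenized region, 5% substitution per base.
    for k, fraction in expected_library_composition(30, 0.05).items():
        print(f"{k} mutations: {fraction:.3f}")
```

Comparing such expected fractions with the counts observed by sequencing a sample of clones is one way to check whether a library behaves as designed.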
NASA Astrophysics Data System (ADS)
Urbina-Navarrete, J.; Rothschild, L.
2016-12-01
End-of-life electronics waste (e-waste) containing toxic and valuable materials is a rapidly progressing human health and environmental issue. Using synthetic biology tools, we have developed a recycling method for e-waste. Our innovation is to use a recombinant version of a naturally-occurring silica-degrading enzyme to depolymerize the silica in metal- and glass-containing e-waste components, and subsequently, to use engineered bacterial surfaces to bind and separate metals from a solution. The bacteria with bound metals can then be used as "bio-ink" to print new circuits using a novel plasma jet electronics printing technology. Here, we present the results from our initial studies that focus on the specificity of metal-binding motifs for a cognate metal. The candidate motifs that show high affinity and specificity will be engineered into bacterial surfaces for downstream applications in biologically-mediated metal recycling. Since the chemistry and role of Cu in metalloproteins is relatively well-characterized, we are using Cu as a proxy to elucidate metal and biological ligand interactions with various metals in e-waste. We assess the binding parameters of 3 representative classes of Cu-binding motifs using isothermal titration calorimetry: 1) natural motifs found in metalloproteins, 2) consensus motifs, and 3) rationally designed peptides that are predicted, in silico, to bind Cu. Our results indicate that naturally-occurring motifs have relatively high affinity and specificity for Cu (association constant for Cu Ka 10^4 M^-1, Zn Ka 10^3 M^-1) when competing ions are present in the aqueous milieu. 
However, motifs developed through rational design by applying quantum mechanical methods that take into account complexation energies of the elemental binding partners and molecular geometry of the cognate metal not only show high affinity for the cognate metal (Cu Ka 10^6 M^-1), but also show specificity and discrimination against other metal ions that would be competitors for the same binding sites. This is an initial proof-of-concept study that focuses on Cu-binding; however, the overall objective of this research is to have peptides that selectively bind many metals from e-waste, which would allow for the separation of the metals from a solution at ambient temperatures and under non-toxic conditions.
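To put the association constants in this abstract on an energy scale, the standard relation ΔG° = -RT ln Ka can be applied. This is a minimal sketch, assuming a temperature of 298.15 K and a 1 M standard state (neither is stated in the abstract); the order-of-magnitude Ka values are the ones quoted above, and the function names are illustrative.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def binding_free_energy(ka_per_molar: float, temperature_k: float = 298.15) -> float:
    """Standard binding free energy in kcal/mol from an association constant in M^-1.

    Assumes a 1 M standard state; the default temperature is an assumption.
    """
    return -R_KCAL * temperature_k * math.log(ka_per_molar)


def selectivity(ka_target: float, ka_competitor: float) -> float:
    """Ratio of association constants, a simple measure of metal discrimination."""
    return ka_target / ka_competitor


if __name__ == "__main__":
    for label, ka in [("natural motif, Cu", 1e4),
                      ("natural motif, Zn", 1e3),
                      ("designed peptide, Cu", 1e6)]:
        print(f"{label}: Ka = {ka:.0e} M^-1, dG = {binding_free_energy(ka):.2f} kcal/mol")
    print("Cu/Zn selectivity of natural motif:", selectivity(1e4, 1e3))
```

Under these assumptions, each tenfold increase in Ka corresponds to roughly 1.4 kcal/mol of additional binding free energy.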
Zhang, ZhiZhuo; Chang, Cheng Wei; Hugo, Willy; Cheung, Edwin; Sung, Wing-Kin
2013-03-01
Although de novo motifs can be discovered through mining over-represented sequence patterns, this approach misses some real motifs and generates many false positives. To improve accuracy, one solution is to consider some additional binding features (i.e., position preference and sequence rank preference). This information is usually required from the user. This article presents a de novo motif discovery algorithm called SEME (sampling with expectation maximization for motif elicitation), which uses a pure probabilistic mixture model to model the motif's binding features and uses expectation maximization (EM) algorithms to simultaneously learn the sequence motif, position, and sequence rank preferences without asking for any prior knowledge from the user. SEME is both efficient and accurate thanks to two important techniques: variable motif length extension and importance sampling. Using 75 large-scale synthetic datasets, 32 metazoan compendium benchmark datasets, and 164 chromatin immunoprecipitation sequencing (ChIP-Seq) libraries, we demonstrated the superior performance of SEME over existing programs in finding transcription factor (TF) binding sites. SEME is further applied to a more difficult problem of finding the co-regulated TF (coTF) motifs in 15 ChIP-Seq libraries. It identified significantly more correct coTF motifs and, at the same time, predicted coTF motifs with better matching to the known motifs. Finally, we show that the learned position and sequence rank preferences of each coTF reveal potential interaction mechanisms between the primary TF and the coTF within these sites. Some of these findings were further validated by the ChIP-Seq experiments of the coTFs. The application is available online.
SMARTIV: combined sequence and structure de-novo motif discovery for in-vivo RNA binding data.
Polishchuk, Maya; Paz, Inbal; Yakhini, Zohar; Mandel-Gutfreund, Yael
2018-05-25
Gene expression regulation is highly dependent on binding of RNA-binding proteins (RBPs) to their RNA targets. Growing evidence supports the notion that both RNA primary sequence and its local secondary structure play a role in specific Protein-RNA recognition and binding. Despite the great advance in high-throughput experimental methods for identifying sequence targets of RBPs, predicting the specific sequence and structure binding preferences of RBPs remains a major challenge. We present a novel webserver, SMARTIV, designed for discovering and visualizing combined RNA sequence and structure motifs from high-throughput RNA-binding data, generated from in-vivo experiments. The uniqueness of SMARTIV is that it predicts motifs from enriched k-mers that combine information from ranked RNA sequences and their predicted secondary structure, obtained using various folding methods. Consequently, SMARTIV generates Position Weight Matrices (PWMs) in a combined sequence and structure alphabet with assigned P-values. SMARTIV concisely represents the sequence and structure motif content as a single graphical logo, which is informative and easy for visual perception. SMARTIV was examined extensively on a variety of high-throughput binding experiments for RBPs from different families, generated from different technologies, showing consistent and accurate results. Finally, SMARTIV is a user-friendly webserver, highly efficient in run-time and freely accessible via http://smartiv.technion.ac.il/.
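The idea of turning enriched k-mers into a position weight matrix can be sketched in a few lines. This is an illustrative sketch only, not SMARTIV's algorithm: it uses a plain RNA alphabet rather than the combined sequence and structure alphabet described above, assumes the k-mers are already aligned and of equal length, and uses a hypothetical pseudocount and uniform background.

```python
import math
from collections import Counter


def build_pwm(kmers, alphabet="ACGU", pseudocount=1.0, background=None):
    """Build a log-odds position weight matrix from equal-length aligned k-mers.

    Returns a list with one dict per position, mapping each symbol to
    log2(observed frequency / background frequency).
    """
    k = len(kmers[0])
    if any(len(s) != k for s in kmers):
        raise ValueError("all k-mers must have the same length")
    if background is None:
        # Assumption: uniform background over the alphabet.
        background = {a: 1.0 / len(alphabet) for a in alphabet}
    total = len(kmers) + pseudocount * len(alphabet)
    pwm = []
    for pos in range(k):
        counts = Counter(s[pos] for s in kmers)
        pwm.append({a: math.log2(((counts[a] + pseudocount) / total) / background[a])
                    for a in alphabet})
    return pwm


def score(pwm, site):
    """Sum the log-odds weights of a candidate site of the same length as the PWM."""
    return sum(column[symbol] for column, symbol in zip(pwm, site))


if __name__ == "__main__":
    # Hypothetical enriched k-mers, for illustration only.
    pwm = build_pwm(["UGCAUG", "UGCAUG", "AGCAUG"])
    print(round(score(pwm, "UGCAUG"), 2), round(score(pwm, "CCCCCC"), 2))
```

Extending the alphabet string to include structure-annotated symbols would be one way to approximate a combined representation, although the encoding used by the webserver is not specified here.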
Garamszegi, Sara; Franzosa, Eric A.; Xia, Yu
2013-01-01
A central challenge in host-pathogen systems biology is the elucidation of general, systems-level principles that distinguish host-pathogen interactions from within-host interactions. Current analyses of host-pathogen and within-host protein-protein interaction networks are largely limited by their resolution, treating proteins as nodes and interactions as edges. Here, we construct a domain-resolved map of human-virus and within-human protein-protein interaction networks by annotating protein interactions with high-coverage, high-accuracy, domain-centric interaction mechanisms: (1) domain-domain interactions, in which a domain in one protein binds to a domain in a second protein, and (2) domain-motif interactions, in which a domain in one protein binds to a short, linear peptide motif in a second protein. Analysis of these domain-resolved networks reveals, for the first time, significant mechanistic differences between virus-human and within-human interactions at the resolution of single domains. While human proteins tend to compete with each other for domain binding sites by means of sequence similarity, viral proteins tend to compete with human proteins for domain binding sites in the absence of sequence similarity. Independent of their previously established preference for targeting human protein hubs, viral proteins also preferentially target human proteins containing linear motif-binding domains. Compared to human proteins, viral proteins participate in more domain-motif interactions, target more unique linear motif-binding domains per residue, and contain more unique linear motifs per residue. Together, these results suggest that viruses surmount genome size constraints by convergently evolving multiple short linear motifs in order to effectively mimic, hijack, and manipulate complex host processes for their survival. 
Our domain-resolved analyses reveal unique signatures of pleiotropy, economy, and convergent evolution in viral-host interactions that are otherwise hidden in the traditional binary network, highlighting the power and necessity of high-resolution approaches in host-pathogen systems biology. PMID:24339775
Kiosze-Becker, Kristin; Ori, Alessandro; Gerovac, Milan; Heuer, André; Nürenberg-Goloub, Elina; Rashid, Umar Jan; Becker, Thomas; Beckmann, Roland; Beck, Martin; Tampé, Robert
2016-01-01
Ribosome recycling orchestrated by the ATP binding cassette (ABC) protein ABCE1 can be considered as the final—or the first—step within the cyclic process of protein synthesis, connecting translation termination and mRNA surveillance with re-initiation. An ATP-dependent tweezer-like motion of the nucleotide-binding domains in ABCE1 transfers mechanical energy to the ribosome and tears the ribosome subunits apart. The post-recycling complex (PRC) then re-initiates mRNA translation. Here, we probed the so far unknown architecture of the 1-MDa PRC (40S/30S·ABCE1) by chemical cross-linking and mass spectrometry (XL-MS). Our study reveals ABCE1 bound to the translational factor-binding (GTPase) site with multiple cross-link contacts of the helix–loop–helix motif to the S24e ribosomal protein. Cross-linking of the FeS cluster domain to the ribosomal protein S12 substantiates an extreme lever-arm movement of the FeS cluster domain during ribosome recycling. We were thus able to reconstitute and structurally analyse a key complex in the translational cycle, resembling the link between translation initiation and ribosome recycling. PMID:27824037
Predictive Structure and Topology of Peroxisomal ATP-Binding Cassette (ABC) Transporters
Andreoletti, Pierre; Raas, Quentin; Gondcaille, Catherine; Cherkaoui-Malki, Mustapha; Trompier, Doriane; Savary, Stéphane
2017-01-01
The peroxisomal ATP-binding Cassette (ABC) transporters, which are called ABCD1, ABCD2 and ABCD3, are transmembrane proteins involved in the transport of various lipids that allow their degradation inside the organelle. Defective ABCD1 leads to the accumulation of very long-chain fatty acids and is associated with a complex and severe neurodegenerative disorder called X-linked adrenoleukodystrophy (X-ALD). Although the nucleotide-binding domain is highly conserved and characterized within the ABC transporters family, solid data are missing for the transmembrane domain (TMD) of ABCD proteins. The lack of a clear consensus on the secondary and tertiary structure of the TMDs weakens any structure-function hypothesis based on the very diverse ABCD1 mutations found in X-ALD patients. Therefore, we first reinvestigated thoroughly the structure-function data available and performed refined alignments of ABCD protein sequences. Based on the 2.85 Å resolution crystal structure of the mitochondrial ABC transporter ABCB10, here we propose a structural model of peroxisomal ABCD proteins that specifies the position of the transmembrane and coupling helices, and highlight functional motifs and putative important amino acid residues. PMID:28737695
Deconvoluting AMP-activated protein kinase (AMPK) adenine nucleotide binding and sensing
Gu, Xin; Yan, Yan; Novick, Scott J.; Kovach, Amanda; Goswami, Devrishi; Ke, Jiyuan; Tan, M. H. Eileen; Wang, Lili; Li, Xiaodan; de Waal, Parker W.; Webb, Martin R.; Griffin, Patrick R.; Xu, H. Eric
2017-01-01
AMP-activated protein kinase (AMPK) is a central cellular energy sensor that adapts metabolism and growth to the energy state of the cell. AMPK senses the ratio of adenine nucleotides (adenylate energy charge) by competitive binding of AMP, ADP, and ATP to three sites (CBS1, CBS3, and CBS4) in its γ-subunit. Because these three binding sites are functionally interconnected, it remains unclear how nucleotides bind to individual sites, which nucleotides occupy each site under physiological conditions, and how binding to one site affects binding to the other sites. Here, we comprehensively analyze nucleotide binding to wild-type and mutant AMPK protein complexes by quantitative competition assays and by hydrogen-deuterium exchange MS. We also demonstrate that NADPH, in addition to the known AMPK ligand NADH, directly and competitively binds AMPK at the AMP-sensing CBS3 site. Our findings reveal how AMP binding to one site affects the conformation and adenine nucleotide binding at the other two sites and establish CBS3, and not CBS1, as the high affinity exchangeable AMP/ADP/ATP-binding site. We further show that AMP binding at CBS4 increases AMP binding at CBS3 by 2 orders of magnitude and reverses the AMP/ATP preference of CBS3. Together, these results illustrate how the three CBS sites collaborate to enable highly sensitive detection of cellular energy states to maintain the tight ATP homeostasis required for cellular metabolism. PMID:28615457
Hamed, Mazen Y; Arya, Gaurav
2016-05-01
Energy calculations based on MM-GBSA were employed to study the binding of various zinc finger protein (ZF) motifs to DNA. Mutants of both the DNA and the amino acids bound to it were studied. The calculated energies gave evidence for a relationship between binding energy and the affinity of ZF motifs for their sites on DNA. ΔG values were -15.82(12), -3.66(12), and -12.14(11.6) kcal/mol for finger one, finger two, and finger three, respectively. Mutations in the DNA bases reduced the magnitude of the negative binding energies (maximum ΔΔG = 42 kcal/mol for F1 when GCG was mutated to GGG, and ΔΔG = 22 kcal/mol for F2); the loss in total binding energy originated in the loss of electrostatic energy upon mutation (r = .98). Mutations of key amino acids in the ZF motif at positions -1, 2, 3, and 6 reduced the binding energies to DNA; the correlation coefficient between total free energy and electrostatic energy was .99, and that with van der Waals energy was .93. The results agree with experimentally found selectivity, which showed that arginine in position -1 is specific to G, while aspartic acid (D) in position 2 plays a complicated role in binding. There is a correlation between the MD-calculated free energies of binding and those obtained experimentally for prepared ZF motifs bound to triplet bases in other reports. Our results may help in the design of ZF motifs based on the established recognition codes, on binding energies, and on the individual contributions to the total energy.
Nomura, Yusuke; Tanaka, Yoichiro; Fukunaga, Jun-ichi; Fujiwara, Kazuya; Chiba, Manabu; Iibuchi, Hiroaki; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Kozu, Tomoko; Sakamoto, Taiichi
2013-12-01
AML1/RUNX1 is an essential transcription factor involved in the differentiation of hematopoietic cells. AML1 binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. In a previous study, we obtained RNA aptamers against the AML1 Runt domain by systematic evolution of ligands by exponential enrichment and revealed that RNA aptamers exhibit higher affinity for the Runt domain than that for RDE and possess the 5'-GCGMGNN-3' and 5'-N'N'CCAC-3' conserved motif (M: A or C; N and N' form Watson-Crick base pairs) that is important for Runt domain binding. In this study, to understand the structural basis of recognition of the Runt domain by the aptamer motif, the solution structure of a 22-mer RNA was determined using nuclear magnetic resonance. The motif contains the AH(+)-C mismatch and base triple and adopts an unusual backbone structure. Structural analysis of the aptamer motif indicated that the aptamer binds to the Runt domain by mimicking the RDE sequence and structure. Our data should enhance the understanding of the structural basis of DNA mimicry by RNA molecules.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Yong; Kovach, Amanda; Suino-Powell, Kelly
2008-07-23
The functional interaction between the peroxisome proliferator-activated receptor γ (PPARγ) and its coactivator PGC-1α is crucial for the normal physiology of PPARγ and its pharmacological response to antidiabetic treatment with rosiglitazone. Here we report the crystal structure of the PPARγ ligand-binding domain bound to rosiglitazone and to a large PGC-1α fragment that contains two LXXLL-related motifs. The structure reveals critical contacts mediated through the first LXXLL motif of PGC-1α and the PPARγ coactivator binding site. Through a combination of biochemical and structural studies, we demonstrate that the first LXXLL motif is the most potent among all nuclear receptor coactivator motifs tested, and only this motif of the two LXXLL-related motifs in PGC-1α is capable of binding to PPARγ. Our studies reveal that the strong interaction of PGC-1α and PPARγ is mediated through both hydrophobic and specific polar interactions. Mutations within the context of the full-length PGC-1α indicate that the first PGC-1α motif is necessary and sufficient for PGC-1α to coactivate PPARγ in the presence or absence of rosiglitazone. These results provide a molecular basis for specific recruitment and functional interplay between PPARγ and PGC-1α in glucose homeostasis and adipocyte differentiation.
Pan, Xiaoyong; Shen, Hong-Bin
2017-02-28
RNAs play key roles in cells through interactions with RNA-binding proteins (RBPs), and the binding motifs of these proteins are crucial to understanding the post-transcriptional regulation of RNAs. How RBPs correctly recognize their target RNAs and why they bind specific positions is still far from clear. Machine learning-based algorithms are widely acknowledged to be capable of speeding up this process. Although many automatic tools have been developed to predict RNA-protein binding sites from the rapidly growing multi-resource data, e.g. sequence and structure, their domain-specific features and formats have posed significant computational challenges. One current difficulty is that the common knowledge shared across sources sits at a higher abstraction level than the observed data, so directly integrating observed data across domains is inefficient. The other difficulty is how to interpret the prediction results. Existing approaches tend to terminate after outputting the potential discrete binding sites on the sequences, but how to assemble them into meaningful binding motifs is a topic worthy of further investigation. In view of these challenges, we propose a deep learning-based framework (iDeep) that uses a novel hybrid convolutional neural network and deep belief network to predict RBP interaction sites and motifs on RNAs. This new protocol transforms the original observed data into a high-level abstraction feature space using multiple layers of learning blocks, where the shared representations across different domains are integrated.
To validate our iDeep method, we performed experiments on 31 large-scale CLIP-seq datasets, and our results show that by integrating multiple sources of data, the average AUC can be improved by 8% compared to the best single-source-based predictor; and through cross-domain knowledge integration at an abstraction level, it outperforms the state-of-the-art predictors by 6%. Besides the overall enhanced prediction performance, the convolutional neural network module embedded in iDeep is also able to automatically capture interpretable binding motifs for RBPs. Large-scale experiments demonstrate that these mined binding motifs agree well with the experimentally verified results, suggesting iDeep is a promising approach for real-world applications. The iDeep framework not only achieves better performance than the state-of-the-art predictors, but also readily captures interpretable binding motifs. iDeep is available at http://www.csbio.sjtu.edu.cn/bioinf/iDeep.
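The abstract says a convolutional module can capture interpretable binding motifs. The following is a minimal sketch of why that is plausible, not the iDeep implementation: a single convolution filter slid over a one-hot encoded RNA acts like a motif detector. The sequence, the motif, the filter weights, and the function names `one_hot` and `conv1d_scan` are all assumptions chosen for illustration.

```python
# Toy illustration only: one hand-set filter, no training, no deep belief network.
ALPHABET = "ACGU"

def one_hot(seq):
    """Return one 4-element row per nucleotide, in A, C, G, U channel order."""
    return [[1.0 if base == letter else 0.0 for letter in ALPHABET] for base in seq]

def conv1d_scan(encoded, kernel):
    """Valid-mode 1D cross-correlation of an (L x 4) input with a (k x 4) kernel."""
    k = len(kernel)
    scores = []
    for start in range(len(encoded) - k + 1):
        score = 0.0
        for offset in range(k):
            for channel in range(4):
                score += encoded[start + offset][channel] * kernel[offset][channel]
        scores.append(score)
    return scores

# Hypothetical filter: weights equal to the one-hot encoding of an arbitrary toy motif.
toy_motif = "UGCAUG"
toy_kernel = one_hot(toy_motif)

toy_rna = "AAUGCAUGCC"  # hypothetical sequence containing the toy motif once
scores = conv1d_scan(one_hot(toy_rna), toy_kernel)
best = max(range(len(scores)), key=scores.__getitem__)
print(best, scores[best])
```

In a trained network the filter weights are learned rather than set by hand, and the highest-scoring windows are what one would align to read off a motif.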
Huang, Mengmeng; Mu, Changkao; Wu, Yuehong; Ye, Fei; Wang, Dan; Sun, Cong; Lv, Zhengbing; Han, Bingnan; Wang, Chunlin; Xu, Xue-Wei
2017-11-01
C-type lectins are a superfamily of Ca2+-dependent carbohydrate-recognition proteins, which play crucial roles in innate immunity including nonself-recognition and pathogen elimination. In the present study, two single-CRD containing C-type lectins were identified from swimming crab Portunus trituberculatus (designated as PtCTL-2 and PtCTL-3). The open reading frame (ORF) of PtCTL-2 encoded polypeptides of 485 amino acids with a signal peptide and a single carbohydrate-recognition domain (CRD), while PtCTL-3's ORF encoded polypeptides of 241 amino acids with a coiled-coil region and a single-CRD. The key motifs determining carbohydrate binding specificity in PtCTL-2 and PtCTL-3 were EPR (Glu-Pro-Arg) and QPD (Gln-Pro-Asp). EPR is a motif being identified for the first time, whereas QPD is a typical motif in C-type lectins. Different PAMP-binding features of the two recombinant proteins, PtCTL-2 (rPtCTL-2) and PtCTL-3 (rPtCTL-3), have been observed in our experiments. rPtCTL-2 could bind three pathogen-associated molecular patterns (PAMPs) with relatively high affinity, including glucan, lipopolysaccharide (LPS) and peptidoglycan (PGN), while rPtCTL-3 could barely bind any of them. However, rPtCTL-2 could bind seven kinds of microbes and rPtCTL-3 could bind six kinds in microbe binding assay. Moreover, rPtCTL-2 and rPtCTL-3 exhibited similar agglutination activity against Gram-positive bacteria, Gram-negative bacteria and fungi in agglutination assay. All these results illustrated that PtCTL-2 and PtCTL-3 could function as important pattern-recognition receptors (PRR) with broad nonself-recognition spectrum involved in immune defense against invaders. In addition, the results of carbohydrate binding specificity showed that PtCTL-2 with novel key motif had broad carbohydrate binding specificity, while PtCTL-3 with typical key motif possessed different carbohydrate binding specificity from the classical binding rule. 
Furthermore, PtCTL-2 and PtCTL-3 could also function as opsonin to enhance encapsulation of hemocytes against Ni-NTA beads. Copyright © 2017 Elsevier Ltd. All rights reserved.
Germline variant FGFR4 p.G388R exposes a membrane-proximal STAT3 binding site.
Ulaganathan, Vijay K; Sperl, Bianca; Rapp, Ulf R; Ullrich, Axel
2015-12-24
Variant rs351855-G/A is a commonly occurring single-nucleotide polymorphism of coding regions in exon 9 of the fibroblast growth factor receptor FGFR4 (CD334) gene (c.1162G>A). It results in an amino-acid change at codon 388 from glycine to arginine (p.Gly388Arg) in the transmembrane domain of the receptor. Despite compelling genetic evidence for the association of this common variant with cancers of the bone, breast, colon, prostate, skin, lung, head and neck, as well as soft-tissue sarcomas and non-Hodgkin lymphoma, the underlying biological mechanism has remained elusive. Here we show that substitution of the conserved glycine 388 residue to a charged arginine residue alters the transmembrane spanning segment and exposes a membrane-proximal cytoplasmic signal transducer and activator of transcription 3 (STAT3) binding site Y(390)-(P)XXQ(393). We demonstrate that such membrane-proximal STAT3 binding motifs in the germline of type I membrane receptors enhance STAT3 tyrosine phosphorylation by recruiting STAT3 proteins to the inner cell membrane. Remarkably, such germline variants frequently co-localize with somatic mutations in the Catalogue of Somatic Mutations in Cancer (COSMIC) database. Using Fgfr4 single nucleotide polymorphism knock-in mice and transgenic mouse models for breast and lung cancers, we validate the enhanced STAT3 signalling induced by the FGFR4 Arg388-variant in vivo. Thus, our findings elucidate the molecular mechanism behind the genetic association of rs351855 with accelerated cancer progression and suggest that germline variants of cell-surface molecules that recruit STAT3 to the inner cell membrane are a significant risk for cancer prognosis and disease progression.
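Two pieces of arithmetic in this abstract can be checked directly: that coding position c.1162 falls in codon 388, and that a Y-X-X-Q pattern (the shape of the Y(390)-(P)XXQ(393) site) can be located by a simple scan. The sketch below assumes 1-based positions counted from the first base of the start codon; the peptide string is hypothetical and is not the FGFR4 sequence, and the function names are illustrative.

```python
import re

def codon_number(cds_position):
    """Map a 1-based coding-sequence nucleotide position to its 1-based codon number."""
    return (cds_position - 1) // 3 + 1

def find_yxxq(protein):
    """Return 1-based start positions of every Y-x-x-Q match, including overlapping ones."""
    return [m.start() + 1 for m in re.finditer(r"(?=Y..Q)", protein)]

print(codon_number(1162))      # codon containing c.1162

toy_peptide = "GLRAYSPQLM"     # hypothetical peptide, for illustration only
print(find_yxxq(toy_peptide))  # start position(s) of Y-x-x-Q in the toy peptide
```

A sequence match only shows that the pattern is present; whether the site is exposed and functional is the experimental question the study addresses.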
Zhang, Shuwei; Ding, Feng; Peng, Hongxiang; Huang, Yu; Lu, Jiang
2018-02-01
Downy mildew, caused by Plasmopara viticola, can result in a substantial decrease in grapevine productivity. Vitis vinifera is a widely cultivated grapevine species, which is susceptible to this disease. Repeated pesticide applications are harmful for both the environment and human health. Thus, it is essential to develop varieties/cultivars that are resistant to downy mildew and other diseases. In our previous studies, we investigated the natural resistance of the Chinese wild grapevine V. quinquangularis accession 'PS' against P. viticola and obtained several candidate resistance (R) genes that may play important roles in plant disease resistance. In the present study, we isolated a CC-NBS-LRR-type R gene from 'PS' and designated it VqCN. Its open reading frame is 2676 bp which encodes a protein of 891 amino acids with a predicted molecular mass of 102.12 kDa and predicted isoelectric point of 6.53. Multiple alignments with other disease resistant (R) proteins revealed a conserved phosphate-binding loop (P-loop), resistance nucleotide binding site, a hydrophobic domain (GLPL) and methionine-histidine-aspartate (MHD) motifs, which are typical components of nucleotide-binding site leucine-rich repeat proteins, as well as a coiled-coil region in the N-terminus. Quantitative real-time polymerase chain reaction analysis showed that the transcript of VqCN was rapidly and highly induced after infection with P. viticola in 'PS'. Moreover, the leaves of susceptible 'Cabernet Sauvignon' transiently expressing VqCN manifested increased resistance to P. viticola. The results indicated that VqCN might play a positive role in protecting grapevine against infection with P. viticola. Cloning and functional analysis of a putative resistance gene provide a basis for disease-resistance breeding.
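The reported ORF length and protein length are consistent if the 2676 bp figure includes the stop codon, which is an assumption here since the abstract does not say so. A short sketch of that check, together with the average residue mass implied by the predicted molecular mass (function names are illustrative):

```python
def protein_length_from_orf(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of the given length in base pairs."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    # Assumption: the stop codon is counted in the ORF length but encodes no residue.
    return codons - 1 if includes_stop else codons

def average_residue_mass_da(mass_kda, n_residues):
    """Average mass per residue in daltons."""
    return mass_kda * 1000 / n_residues

n_residues = protein_length_from_orf(2676)
print(n_residues)
print(round(average_residue_mass_da(102.12, n_residues), 1))
```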
Ramírez-Iglesias, José Rubén; Pérez-Gordones, María Carolina; Del Castillo, Jesús Rafael; Mijares, Alfredo; Benaim, Gustavo; Mendoza, Marta
2018-05-09
The plasma membrane Ca2+-ATPase (PMCA) from trypanosomatids lacks a classical calmodulin (CaM) binding domain, although CaM-stimulated activities have been detected by biochemical assays. Recently we proposed that the Trypanosoma equiperdum CaM-sensitive PMCA (TePMCA) contains a potential 1-18 CaM-binding motif at the C-terminal region of the pump. In the present study, we evaluated the potential CaM-binding motifs using CaM from Trypanosoma cruzi and either the recombinant full length TePMCA C-terminal sequence (P14) or synthetic peptides comprising different regions of the C-terminal domain. We demonstrated that P14 and a synthetic peptide corresponding to residues 1037-1062 (which contains the predicted 1-18 binding motif) competed efficiently for binding to TcCaM, exhibiting similar IC50s of 200 nM. A stable complex of this peptide and TcCaM was formed in the presence of Ca2+, as determined by native-polyacrylamide gel electrophoresis. A predicted structure obtained by molecular docking showed an interaction of the 1-18 binding motif with the Ca2+/CaM complex. Moreover, when the peptide was incubated with CaM and Ca2+, a blue shift in the tryptophan fluorescence spectrum (from 350 to 329 nm) was observed. Substitutions at W1039 and F1056 strongly decreased both CaM-peptide interaction and the complex assembly. Our results demonstrated the presence of a functional 1-18 motif at the TePMCA C-terminal domain. Furthermore, on the basis of spectrofluorometric assays and the resulting structure modeled by docking we propose that the L1042 and W1060 residues might also participate as anchors to form a 1-4-18-22 motif. Copyright © 2018 Elsevier B.V. All rights reserved.
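The motif names "1-18" and "1-4-18-22" refer to the relative positions of anchor residues. Assuming W1039 is position 1 (which the residue numbers in the abstract imply but the abstract does not state outright), the sketch below maps motif positions to residue numbers; the function name is illustrative.

```python
def anchor_residues(first_residue, motif_positions):
    """Convert 1-based motif positions into absolute residue numbers."""
    return [first_residue + position - 1 for position in motif_positions]

# Assumption: position 1 of the motif is W1039.
print(anchor_residues(1039, (1, 18)))
print(anchor_residues(1039, (1, 4, 18, 22)))
```

Under that assumption the positions land on residues 1039, 1042, 1056 and 1060, the four residues named in the abstract.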
Molecular Basis for Failure of “Atypical” C1 Domain of Vav1 to Bind Diacylglycerol/Phorbol Ester
Geczy, Tamas; Peach, Megan L.; El Kazzouli, Saïd; Sigano, Dina M.; Kang, Ji-Hye; Valle, Christopher J.; Selezneva, Julia; Woo, Wonhee; Kedei, Noemi; Lewin, Nancy E.; Garfield, Susan H.; Lim, Langston; Mannan, Poonam; Marquez, Victor E.; Blumberg, Peter M.
2012-01-01
C1 domains, the recognition motif of the second messenger diacylglycerol and of the phorbol esters, are classified as typical (ligand-responsive) or atypical (not ligand-responsive). The C1 domain of Vav1, a guanine nucleotide exchange factor, plays a critical role in regulation of Vav activity through stabilization of the Dbl homology domain, which is responsible for exchange activity of Vav. Although the C1 domain of Vav1 is classified as atypical, it retains a binding pocket geometry homologous to that of the typical C1 domains of PKCs. This study clarifies the basis for its failure to bind ligands. Substituting Vav1-specific residues into the C1b domain of PKCδ, we identified five crucial residues (Glu9, Glu10, Thr11, Thr24, and Tyr26) along the rim of the binding cleft that weaken binding potency in a cumulative fashion. Reciprocally, replacing these incompatible residues in the Vav1 C1 domain with the corresponding residues from PKCδ C1b (δC1b) conferred high potency for phorbol ester binding. Computer modeling predicts that these unique residues in Vav1 increase the hydrophilicity of the rim of the binding pocket, impairing membrane association and thereby preventing formation of the ternary C1-ligand-membrane binding complex. The initial design of diacylglycerol-lactones to exploit these Vav1 unique residues showed enhanced selectivity for C1 domains incorporating these residues, suggesting a strategy for the development of ligands targeting Vav1. PMID:22351766
Kino, Tomoshige
2018-05-11
The human genome contains numerous single nucleotide variations (SNVs), and the human GR gene harbors ∼450 of these genetic changes. Among them, extremely rare non-synonymous variants known as pathologic GR gene mutations cause a characteristic pathologic condition, familial/sporadic generalized glucocorticoid resistance syndrome, by replacing amino acids critical for GR protein structure and function, whereas others known as pathologic polymorphisms cause mild manifestations, recognized mainly at the population level, by changing GR activity slightly. Recent progress in the structural analysis of the GR protein and subsequent computer-based structural simulation has revealed details of the molecular defects caused by such pathologic GR gene mutations, including their impact on the receptor's interaction with ligands, nuclear receptor coactivators (NCoAs) or DNA glucocorticoid response elements (GREs). Indeed, those found in the GR ligand-binding domain significantly damage the protein structure of the ligand-binding pocket and/or the activation function-2 transactivation domain and change their molecular interaction with glucocorticoids or the LxxLL signature motif of NCoAs. Two mutations found in the GR DBD also affect the interaction of the mutant receptors with GRE DNA by affecting the amino acid critical for the interaction or by changing the local hydrophobic environment. In this review, we discuss recent findings on the structural simulation of the pathologic GR mutants in connection with their functional and clinical impacts, along with a brief explanation of recent research achievements on the GR polymorphisms.
René, P; Lenne, F; Ventura, M A; Bertagna, X; de Keyzer, Y
2000-01-04
In the pituitary, vasopressin triggers ACTH release through a specific receptor subtype, termed V3 or V1b. We cloned the V3 cDNA and showed that its expression was almost exclusive to pituitary corticotrophs and some corticotroph tumors. To study the determinants of this tissue specificity, we have now cloned the gene for the human (h) V3 receptor and characterized its structure. It is composed of two exons, spanning 10 kb, with the coding region interrupted between transmembrane domains 6 and 7. We established that the transcription initiation site is located 498 nucleotides upstream of the initiator codon and showed that two polyadenylation sites may be used, of which the most downstream is the most frequently used. Sequence analysis of the promoter region showed no TATA box but identified consensus binding motifs for Sp1, CREB, and half sites of the estrogen receptor binding site. However, comparison with another corticotroph-specific gene, proopiomelanocortin, did not identify common regulatory elements in the two promoters except for a short GC-rich region. Unexpectedly, hV3 gene analysis revealed that a formerly cloned 'artifactual' hV3 cDNA indeed corresponded to a spliced antisense transcript, overlapping the 5' part of the coding sequence in exon 1 and the promoter region. This transcript, hV3rev, was detected in normal pituitary and in many corticotroph tumors expressing hV3 sense mRNA and may therefore play a role in hV3 gene expression.
Regulation of the alpha-glucuronidase-encoding gene (aguA) from Aspergillus niger.
de Vries, R P; van de Vondervoort, P J I; Hendriks, L; van de Belt, M; Visser, J
2002-09-01
The alpha-glucuronidase gene aguA from Aspergillus niger was cloned and characterised. Analysis of the promoter region of aguA revealed the presence of four putative binding sites for the major carbon catabolite repressor protein CREA and one putative binding site for the transcriptional activator XLNR. In addition, a sequence motif was detected which differed only in the last nucleotide from the XLNR consensus site. A construct in which part of the aguA coding region was deleted still resulted in production of a stable mRNA upon transformation of A. niger. The putative XLNR binding sites and two of the putative CREA binding sites were mutated individually in this construct and the effects on expression were examined in A. niger transformants. Northern analysis of the transformants revealed that the consensus XLNR site is not actually functional in the aguA promoter, whereas the sequence that diverges from the consensus at a single position is functional. This indicates that XLNR is also able to bind to the sequence GGCTAG, and the XLNR binding site consensus should therefore be changed to GGCTAR. Both CREA sites are functional, indicating that CREA has a strong influence on aguA expression. A detailed expression analysis of aguA in four genetic backgrounds revealed a second regulatory system involved in activation of aguA gene expression. This system responds to the presence of glucuronic and galacturonic acids, and is not dependent on XLNR.
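The proposed change of the XLNR consensus to GGCTAR uses the IUPAC code R (A or G). As a minimal sketch of what that widening means in practice, the code below translates an IUPAC motif into a regular expression and scans one strand of a made-up promoter fragment. The fragment and the function names are hypothetical, and GGCTAA as the earlier consensus is inferred from the abstract rather than stated in it.

```python
import re

# Subset of IUPAC nucleotide codes sufficient for this example.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "N": "[ACGT]",
}

def iupac_to_regex(motif):
    """Translate an IUPAC motif such as 'GGCTAR' into a regular expression."""
    return "".join(IUPAC[code] for code in motif)

def find_sites(motif, dna):
    """Return 0-based start positions of all (possibly overlapping) matches on one strand."""
    pattern = re.compile("(?=" + iupac_to_regex(motif) + ")")
    return [m.start() for m in pattern.finditer(dna)]

toy_promoter = "TTGGCTAATCCGGCTAGAA"  # hypothetical fragment, not the aguA promoter

print(find_sites("GGCTAA", toy_promoter))  # stricter motif
print(find_sites("GGCTAR", toy_promoter))  # widened motif also matches GGCTAG
```

A real promoter scan would also search the reverse complement; that is omitted here to keep the sketch short.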
Goring, Mark E; Leibovitch, Matthew; Gea-Mallorqui, Ester; Karls, Shawn; Richard, Francis; Hanic-Joyce, Pamela J; Joyce, Paul B M
2013-10-01
We report that the temperature-sensitive (ts) phenotype in Saccharomyces cerevisiae associated with a variant tRNA nucleotidyltransferase containing an amino acid substitution at position 189 results from a reduced ability to incorporate AMP and CMP into tRNAs. We show that this defect can be compensated for by a second-site suppressor converting residue arginine 64 to tryptophan. The R64W substitution does not alter the structure or thermal stability of the enzyme dramatically but restores catalytic activity in vitro and suppresses the ts phenotype in vivo. R64 is found in motif A known to be involved in catalysis and nucleotide triphosphate binding while E189 lies within motif C previously thought only to connect the head and neck domains of the protein. Although mutagenesis experiments indicate that residues R64 and E189 do not interact directly, our data suggest a critical role for residue E189 in enzyme structure and function. Both R64 and E189 may contribute to the organization of the catalytic domain of the enzyme. These results, along with overexpression and deletion analyses, show that the ts phenotype of cca1-E189F does not arise from thermal instability of the variant tRNA nucleotidyltransferase but instead from the inability of a partially active enzyme to support growth only at higher temperatures. © 2013.
Sequence requirement of the ade6-4095 meiotic recombination hotspot in Schizosaccharomyces pombe.
Foulis, Steven J; Fowler, Kyle R; Steiner, Walter W
2018-02-01
Homologous recombination occurs at a greatly elevated frequency in meiosis compared to mitosis and is initiated by programmed double-strand DNA breaks (DSBs). DSBs do not occur at uniform frequency throughout the genome in most organisms, but occur preferentially at a limited number of sites referred to as hotspots. The locations of hotspots have been determined at nucleotide-level resolution in both the budding and fission yeasts, and while several patterns have emerged regarding preferred locations for DSB hotspots, it remains unclear why particular sites experience DSBs at much higher frequency than other sites with seemingly similar properties. Short sequence motifs, which are often sites for binding of transcription factors, are known to be responsible for a number of hotspots. In this study we identified the minimum sequence required for activity of one such motif, identified in a screen of random sequences capable of producing recombination hotspots. The experimentally determined sequence, GGTCTRGACC, closely matches the previously inferred sequence. Full hotspot activity requires an effective sequence length of 9.5 bp, whereas moderate activity requires an effective sequence length of approximately 8.2 bp and shows significant association with DSB hotspots. In combination with our previous work, this result is consistent with a large number of different sequence motifs capable of producing recombination hotspots, and supports a model in which hotspots can be rapidly regenerated by mutation as they are lost through recombination.
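The degenerate motif GGTCTRGACC stands for two concrete sequences. The sketch below expands it and checks which of the two equals its own reverse complement. This is a sequence-level observation added for illustration, not a claim made in the abstract, and the function names are illustrative.

```python
from itertools import product

# Subset of IUPAC codes needed for this motif.
IUPAC_BASES = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "Y": "CT"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def expand(motif):
    """List every concrete DNA sequence represented by an IUPAC motif."""
    return ["".join(bases) for bases in product(*(IUPAC_BASES[code] for code in motif))]

def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

variants = expand("GGTCTRGACC")
palindromic = [v for v in variants if v == reverse_complement(v)]
print(variants)
print(palindromic)
```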
Mistri, Tapan Kumar; Arindrarto, Wibowo; Ng, Wei Ping; Wang, Choayang; Lim, Leng Hiong; Sun, Lili; Chambers, Ian; Wohland, Thorsten; Robson, Paul
2018-03-20
Oct4 and Sox2 regulate the expression of target genes such as Nanog, Fgf4, and Utf1, by binding to their respective regulatory motifs. Their functional cooperation is reflected in their ability to heterodimerize on adjacent cis regulatory motifs, the composite Sox/Oct motif. Given that Oct4 and Sox2 regulate many developmental genes, a quantitative analysis of their synergistic action on different Sox/Oct motifs would yield valuable insights into the mechanisms of early embryonic development. In the present study, we measured binding affinities of Oct4 and Sox2 to different Sox/Oct motifs using fluorescence correlation spectroscopy. We found that the synergistic binding interaction is driven mainly by the level of Sox2 in the case of the Fgf4 Sox/Oct motif. Taking into account that Sox2 expression levels fluctuate more than those of Oct4, our finding provides an explanation of how Sox2 controls the segregation of the epiblast and primitive endoderm populations within the inner cell mass of the developing rodent blastocyst. © 2018 The Author(s). Published by Portland Press Limited on behalf of the Biochemical Society.
Sandmann, Michael; Talbert, Paul; Demidov, Dmitri; Kuhlmann, Markus; Rutten, Twan; Conrad, Udo; Lermontova, Inna
2017-01-01
KINETOCHORE NULL2 (KNL2) is involved in recognition of centromeres and in centromeric localization of the centromere-specific histone cenH3. Our study revealed a cenH3 nucleosome binding CENPC-k motif at the C terminus of Arabidopsis thaliana KNL2, which is conserved among a wide spectrum of eukaryotes. Centromeric localization of KNL2 is abolished by deletion of the CENPC-k motif and by mutating single conserved amino acids, but can be restored by insertion of the corresponding motif of Arabidopsis CENP-C. We showed by electrophoretic mobility shift assay that the C terminus of KNL2 binds DNA sequence-independently and interacts with the centromeric transcripts in vitro. Chromatin immunoprecipitation with anti-KNL2 antibodies indicated that in vivo KNL2 is preferentially associated with the centromeric repeat pAL1. Complete deletion of the CENPC-k motif did not influence its ability to interact with DNA in vitro. Therefore, we suggest that KNL2 recognizes centromeric nucleosomes, similar to CENP-C, via the CENPC-k motif and binds adjoining DNA. © 2017 American Society of Plant Biologists. All rights reserved.
Suzuki, Masaharu; Ketterling, Matthew G; McCarty, Donald R
2005-09-01
We have developed a simple quantitative computational approach for objective analysis of cis-regulatory sequences in promoters of coregulated genes. The program, designated MotifFinder, identifies oligo sequences that are overrepresented in promoters of coregulated genes. We used this approach to analyze promoter sequences of Viviparous1 (VP1)/abscisic acid (ABA)-regulated genes and cold-regulated genes, respectively, of Arabidopsis (Arabidopsis thaliana). We detected significantly enriched sequences in up-regulated genes but not in down-regulated genes. This result suggests that gene activation but not repression is mediated by specific and common sequence elements in promoters. The enriched motifs include several known cis-regulatory sequences as well as previously unidentified motifs. With respect to known cis-elements, we dissected the flanking nucleotides of the core sequences of Sph element, ABA response elements (ABREs), and the C repeat/dehydration-responsive element. This analysis identified the motif variants that may correlate with qualitative and quantitative differences in gene expression. While both VP1 and cold responses are mediated in part by ABA signaling via ABREs, these responses correlate with unique ABRE variants distinguished by nucleotides flanking the ACGT core. ABRE and Sph motifs are tightly associated uniquely in the coregulated set of genes showing a strict dependence on VP1 and ABA signaling. Finally, analysis of distribution of the enriched sequences revealed a striking concentration of enriched motifs in a proximal 200-base region of VP1/ABA and cold-regulated promoters. Overall, each class of coregulated genes possesses a discrete set of the enriched motifs with unique distributions in their promoters that may account for the specificity of gene regulation.
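The core idea described here, finding oligo sequences that are overrepresented in promoters of coregulated genes, can be sketched as a k-mer count comparison between a foreground and a background promoter set. This is not the MotifFinder algorithm, whose scoring the abstract does not specify; the ratio with a pseudocount, the toy sequences, and the function names are all assumptions.

```python
from collections import Counter

def kmer_gene_counts(promoters, k):
    """For each k-mer, count how many promoters contain it at least once."""
    counts = Counter()
    for seq in promoters:
        counts.update({seq[i:i + k] for i in range(len(seq) - k + 1)})
    return counts

def enrichment(foreground, background, k, pseudocount=1.0):
    """Ratio of the fraction of foreground vs background promoters containing each k-mer."""
    fg = kmer_gene_counts(foreground, k)
    bg = kmer_gene_counts(background, k)
    ratios = {}
    for kmer, n_fg in fg.items():
        fg_freq = (n_fg + pseudocount) / (len(foreground) + pseudocount)
        bg_freq = (bg.get(kmer, 0) + pseudocount) / (len(background) + pseudocount)
        ratios[kmer] = fg_freq / bg_freq
    return ratios

# Hypothetical promoter fragments; the foreground set shares an ACGT-containing 5-mer.
coregulated = ["TTACGTGGCA", "GGACGTGTCC", "CACGTGTAAT"]
background = ["TTTTAAAACC", "GGCCTTAAGG", "AATTCCGGAA"]

ratios = enrichment(coregulated, background, k=5)
top = max(ratios, key=ratios.get)
print(top, ratios[top])
```

A real analysis would use genome-wide background promoters and a statistical test rather than a raw ratio, and would examine flanking nucleotides around each enriched core, as the abstract describes for the ACGT core of ABREs.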
NASA Astrophysics Data System (ADS)
Schechinger, Linda Sue
I. To investigate the delivery of nucleotide-based drugs, we are studying molecular recognition of nucleotide derivatives in environments that are similar to cell membranes. The Nowick group previously discovered that membrane-like surfactant micelles of tetradecyltrimethylammonium bromide (TTAB) facilitate molecular recognition of adenosine monophosphate (AMP). The micelles bind nucleotides by means of electrostatic interactions and hydrogen bonding. We observed binding by following 1H NMR chemical shift changes of unique hexylthymine protons upon addition of AMP. Cationic micelles are required for binding. In surfactant-free or sodium dodecylsulfate solutions, no hydrogen bonding is observed. These observations suggest that the cationic surfactant headgroups bind the nucleotide phosphate group, while the intramicellar base binds the nucleotide base. The micellar system was optimized to enhance binding and selectivity for adenosine nucleotides. The selectivity for adenosine and the number of phosphate groups attached to the adenosine were both investigated. Addition of cytidine, guanosine, or uridine monophosphates results in no significant downfield shifting of the NH resonance. Selectivity for the phosphate is limited, since adenosine mono-, di-, and triphosphates all have similar binding constants. We successfully achieved molecular recognition of adenosine nucleotides in micellar environments. There is a significant difference in the binding interactions between the adenosine nucleotides and the three other natural nucleotides. II. The UCI Chemistry Outreach Program (UCICOP) addresses the declining interest of the nation's youth in science. UCICOP brings fun and exciting chemistry experiments to local high schools, to remind students that science is fun and has many practical uses. Volunteer students and alumni of UCI perform the demonstrations using scripts and material provided by UCICOP. The preparation of scripts and materials is done by two coordinators. 
These coordinators organize the program and provide continuity to the program. The success of UCICOP can be measured by the high praise and gratitude expressed by the teachers, students and volunteers.
Petrov, Artem; Arzhanik, Vladimir; Makarov, Gennady; Koliasnikov, Oleg
2016-08-01
Antibodies are a family of proteins responsible for antigen recognition. Computational modeling of the interaction between an antigen and an antibody is very important when a crystallographic structure is unavailable. In this research, we have discovered a correlation between the amino acid sequence of an antibody and its specific binding characteristics, using the example of a novel conserved binding motif, which consists of four residues: Arg H52, Tyr H33, Thr H59, and Glu H61. These residues are specifically oriented in the binding site and interact with each other in a specific manner. The residues of the binding motif interact strictly with negatively charged groups of antigens and form a binding complex. The mechanism of interaction and the characteristics of the complex were also determined. The results of this research can be used to increase the accuracy of computational antibody-antigen interaction modeling and for post-modeling quality control of the modeled structures.
A peptide affinity column for the identification of integrin alpha IIb-binding proteins.
Daxecker, Heide; Raab, Markus; Bernard, Elise; Devocelle, Marc; Treumann, Achim; Moran, Niamh
2008-03-01
To understand the regulation of integrin alpha(IIb)beta(3), a critical platelet adhesion molecule, we have developed a peptide affinity chromatography method using the known integrin regulatory motif, LAMWKVGFFKR. Using standard Fmoc chemistry, this peptide was synthesized onto a Toyopearl AF-Amino-650 M resin on a 6-aminohexanoic acid (Ahx) linker. Peptide density was controlled by acetylation of 83% of the Ahx amino groups. Four recombinant human proteins (CIB1, PP1, ICln and RN181), previously identified as binding to this integrin regulatory motif, were specifically retained by the column containing the integrin peptide but not by a column presenting an irrelevant peptide. Hemoglobin, creatine kinase, bovine serum albumin, fibrinogen and alpha-tubulin failed to bind under the chosen conditions. Immunodetection methods confirmed the binding of endogenous platelet proteins, including CIB1, PP1, ICln, RN181, AUP-1 and beta3-integrin, from a detergent-free platelet lysate. Thus, we describe a reproducible method that facilitates the reliable extraction of specific integrin-binding proteins from complex biological matrices. This methodology may enable the sensitive and specific identification of proteins that interact with linear, membrane-proximal peptide motifs such as the integrin regulatory motif LAMWKVGFFKR.
NASA Technical Reports Server (NTRS)
Childs-Disney, Jessica L. (Inventor); Disney, Matthew D. (Inventor)
2017-01-01
Disclosed are methods for identifying a nucleic acid (e.g., RNA, DNA, etc.) motif which interacts with a ligand. The method includes providing a plurality of ligands immobilized on a support, wherein each particular ligand is immobilized at a discrete location on the support; contacting the plurality of immobilized ligands with a nucleic acid motif library under conditions effective for one or more members of the nucleic acid motif library to bind with the immobilized ligands; and identifying members of the nucleic acid motif library that are bound to a particular immobilized ligand. Also disclosed are methods for selecting, from a plurality of candidate ligands, one or more ligands that have increased likelihood of binding to a nucleic acid molecule comprising a particular nucleic acid motif, as well as methods for identifying a nucleic acid which interacts with a ligand.
de Keyzer, Jeanine; Steel, Gregor J.; Hale, Sarah J.; Humphries, Daniel; Stirling, Colin J.
2009-01-01
Protein translocation and folding in the endoplasmic reticulum of Saccharomyces cerevisiae involves two distinct Hsp70 chaperones, Lhs1p and Kar2p. Both proteins have the characteristic domain structure of the Hsp70 family consisting of a conserved N-terminal nucleotide binding domain and a C-terminal substrate binding domain. Kar2p is a canonical Hsp70 whose substrate binding activity is regulated by cochaperones that promote either ATP hydrolysis or nucleotide exchange. Lhs1p is a member of the Grp170/Lhs1p subfamily of Hsp70s and was previously shown to function as a nucleotide exchange factor (NEF) for Kar2p. Here we show that in addition to this NEF activity, Lhs1p can function as a holdase that prevents protein aggregation in vitro. Analysis of the nucleotide requirement of these functions demonstrates that nucleotide binding to Lhs1p stimulates the interaction with Kar2p and is essential for NEF activity. In contrast, Lhs1p holdase activity is nucleotide-independent and unaffected by mutations that interfere with ATP binding and NEF activity. In vivo, these mutants show severe protein translocation defects and are unable to support growth despite the presence of a second Kar2p-specific NEF, Sil1p. Thus, Lhs1p-dependent nucleotide exchange activity is vital for ER protein biogenesis in vivo. PMID:19759005
DOE Office of Scientific and Technical Information (OSTI.GOV)
Park, E.; Prakash, L.; Guzder, S.N.
1992-12-01
Xeroderma pigmentosum (XP) patients are extremely sensitive to ultraviolet (UV) light and suffer from a high incidence of skin cancers, due to a defect in nucleotide excision repair. The disease is genetically heterogeneous, and seven complementation groups, A-G, have been identified. Homologs of human excision repair genes ERCC1, XPDC/ERCC2, and XPAC have been identified in the yeast Saccharomyces cerevisiae. Since no homolog of human XPBC/ERCC3 existed among the known yeast genes, we cloned the yeast homolog by using XPBC cDNA as a hybridization probe. The yeast homolog, RAD25 (SSL2), encodes a protein of 843 amino acids (M(r) 95,356). The RAD25 (SSL2)- and XPBC-encoded proteins share 55% identical and 72% conserved amino acid residues, and the two proteins resemble one another in containing the conserved DNA helicase sequence motifs. A nonsense mutation at codon 799 that deletes the 45 C-terminal amino acid residues in RAD25 (SSL2) confers UV sensitivity. This mutation shows epistasis with genes in the excision repair group, whereas a synergistic increase in UV sensitivity occurs when it is combined with mutations in genes in other DNA repair pathways, indicating that RAD25 (SSL2) functions in excision repair but not in other repair pathways. We also show that RAD25 (SSL2) is an essential gene. A mutation of the Lys392 residue to arginine in the conserved Walker type A nucleotide-binding motif is lethal, suggesting an essential role of the putative RAD25 (SSL2) ATPase/DNA helicase activity in viability. 40 refs., 3 figs., 1 tab.
Anion induced conformational preference of CαNN motif residues in functional proteins.
Patra, Piya; Ghosh, Mahua; Banerjee, Raja; Chakrabarti, Jaydeb
2017-12-01
Among different ligand binding motifs, the anion-binding CαNN motif, consisting of peptide backbone atoms of three consecutive residues, is observed to be important for recognition of free anions such as sulphate or biphosphate and participates in different key functions. Here we study the interaction of sulphate and biphosphate with the CαNN motif present in different proteins. Instead of the total protein, a peptide fragment has been studied, keeping the CαNN motif flanked by other residues. We use classical force field based molecular dynamics simulations to understand the stability of this motif. Our data indicate fluctuations in conformational preferences of the motif residues in the absence of the anion. The anion gives stability to one of these conformations. However, the anion induced conformational preferences are highly sequence dependent and specific to the type of anion. In particular, the polar residues are more favourable compared to the other residues for recognising the anion. © 2017 Wiley Periodicals, Inc.
Pedersen, Kim Brint; Chodavarapu, Harshita
2017-01-01
Angiotensin-converting enzyme 2 (ACE2) has protective effects on a wide range of morbidities associated with elevated angiotensin-II signaling. Most tissues, including pancreatic islets, express ACE2 mainly from the proximal promoter region. We previously found that hepatocyte nuclear factors 1α and 1β stimulate ACE2 expression from three highly conserved hepatocyte nuclear factor 1 binding motifs in the proximal promoter region. We hypothesized that other highly conserved motifs would also affect ACE2 expression. By systematic mutation of conserved elements, we identified five regions affecting ACE2 expression, of which two regions bound transcriptional activators. One of these is a functional FOXA binding motif. We further identified the main protein binding the FOXA motif in 832/13 insulinoma cells as well as in mouse pancreatic islets as FOXA2. PMID:29082356
The crystal structure of NADPH:ferredoxin reductase from Azotobacter vinelandii.
Sridhar Prasad, G.; Kresge, N.; Muhlberg, A. B.; Shaw, A.; Jung, Y. S.; Burgess, B. K.; Stout, C. D.
1998-01-01
NADPH:ferredoxin reductase (AvFPR) is involved in the response to oxidative stress in Azotobacter vinelandii. The crystal structure of AvFPR has been determined at 2.0 Å resolution. The polypeptide fold is homologous with six other oxidoreductases whose structures have been solved including Escherichia coli flavodoxin reductase (EcFldR) and spinach, and Anabaena ferredoxin:NADP+ reductases (FNR). AvFPR is overall most homologous to EcFldR. The structure is comprised of an N-terminal six-stranded antiparallel beta-barrel domain, which binds FAD, and a C-terminal five-stranded parallel beta-sheet domain, which binds NADPH/NADP+ and has a classical nucleotide binding fold. The two domains associate to form a deep cleft where the NADPH and FAD binding sites are juxtaposed. The structure displays sequence conserved motifs in the region surrounding the two dinucleotide binding sites, which are characteristic of the homologous enzymes. The folded over conformation of FAD in AvFPR is similar to that in EcFldR due to stacking of Phe255 on the adenine ring of FAD, but it differs from that in the FNR enzymes, which lack a homologous aromatic residue. The structure of AvFPR displays three unique features in the environment of the bound FAD. Two features may affect the rate of reduction of FAD: the absence of an aromatic residue stacked on the isoalloxazine ring in the NADPH binding site; and the interaction of a carbonyl group with N10 of the flavin. Both of these features are due to the substitution of a conserved C-terminal tyrosine residue with alanine (Ala254) in AvFPR. An additional unique feature may affect the interaction of AvFPR with its redox partner ferredoxin I (FdI). This is the extension of the C-terminus by three residues relative to EcFldR and by four residues relative to FNR. The C-terminal residue, Lys258, interacts with the AMP phosphate of FAD. Consequently, both phosphate groups are paired with a basic group due to the simultaneous interaction of the FMN phosphate with Arg51 in a conserved FAD binding motif. The fourth feature, common to homologous oxidoreductases, is a concentration of 10 basic residues on the face of the protein surrounding the active site, in addition to Arg51 and Lys258. PMID:9865948
Informative priors based on transcription factor structural class improve de novo motif discovery.
Narlikar, Leelavati; Gordân, Raluca; Ohler, Uwe; Hartemink, Alexander J
2006-07-15
An important problem in molecular biology is to identify the locations at which a transcription factor (TF) binds to DNA, given a set of DNA sequences believed to be bound by that TF. In previous work, we showed that information in the DNA sequence of a binding site is sufficient to predict the structural class of the TF that binds it. In particular, this suggests that we can predict which locations in any DNA sequence are more likely to be bound by certain classes of TFs than others. Here, we argue that traditional methods for de novo motif finding can be significantly improved by adopting an informative prior probability that a TF binding site occurs at each sequence location. To demonstrate the utility of such an approach, we present priority, a powerful new de novo motif finding algorithm. Using data from TRANSFAC, we train three classifiers to recognize binding sites of basic leucine zipper, forkhead, and basic helix loop helix TFs. These classifiers are used to equip priority with three class-specific priors, in addition to a default prior to handle TFs of other classes. We apply priority and a number of popular motif finding programs to sets of yeast intergenic regions that are reported by ChIP-chip to be bound by particular TFs. priority identifies motifs the other methods fail to identify, and correctly predicts the structural class of the TF recognizing the identified binding sites. Supplementary material and code can be found at http://www.cs.duke.edu/~amink/.
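The general idea of weighting candidate binding-site locations by an informative prior can be sketched as below. This is only an illustration of combining a per-position prior with a motif likelihood; it is not the priority algorithm, and the function name `site_posterior`, the toy position weight matrix, and the prior values are all hypothetical assumptions rather than anything taken from the abstract.

```python
import math

def site_posterior(seq, pwm, prior, background=0.25):
    """Posterior over motif start positions for one sequence.

    pwm:   list of dicts (one per motif column) mapping base -> probability.
    prior: prior probability of a site starting at each candidate position.
    Assumes exactly one site per sequence and a uniform background model.
    """
    width = len(pwm)
    n_starts = len(seq) - width + 1
    if len(prior) != n_starts:
        raise ValueError("prior must have one entry per start position")
    weights = []
    for start in range(n_starts):
        # Log-likelihood ratio of this window: motif model vs. background.
        log_ratio = sum(
            math.log(pwm[j][seq[start + j]] / background) for j in range(width)
        )
        weights.append(prior[start] * math.exp(log_ratio))
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical 3-column motif that favours "CGT".
toy_pwm = [
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.1, "T": 0.7},
]
# "CGTACGT" contains the motif at positions 0 and 4; the prior favours 4.
posterior = site_posterior("CGTACGT", toy_pwm, [0.05, 0.05, 0.05, 0.05, 0.8])
```

With a uniform prior the two identical "CGT" windows would receive equal posterior mass; the skewed prior is what separates them, which is the role a class-specific prior would play in this simplified setting.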
Landès-Devauchelle, C; Bras, F; Dezélée, S; Teninges, D
1995-11-10
The nucleotide sequence of the genes 2 and 3 of the Drosophila rhabdovirus sigma was determined from cDNAs to viral genome and poly(A)+ mRNAs. Gene 2 comprises 1032 nucleotides and contains a long ORF encoding a molecular weight 35,208 polypeptide present in infected cells and in virions which migrates in SDS-PAGE as a doublet of M(r) about 60 kDa. The distribution of acidic charges as well as the electrophoretic properties of the protein are characteristic of the rhabdovirus P proteins. Gene 3 comprises 923 nucleotides and contains a long ORF capable of coding a polypeptide of 298 amino acids of MW 33,790. The putative protein (PP3) is similar in size to a minor component of the virions. Computer analysis shows that the sequence of PP3 contains three motifs related to the conserved motifs of reverse transcriptases.
Kluth, Marianne; Stindt, Jan; Dröge, Carola; Linnemann, Doris; Kubitz, Ralf; Schmitt, Lutz
2015-02-20
The human multidrug resistance protein 3 (MDR3/ABCB4) belongs to the ubiquitous family of ATP-binding cassette (ABC) transporters and is located in the canalicular membrane of hepatocytes. There it flops the phospholipids of the phosphatidylcholine (PC) family from the inner to the outer leaflet. Here, we report the characterization of wild type MDR3 and the Q1174E mutant, which was identified previously in a patient with progressive familial intrahepatic cholestasis type 3 (PFIC-3). We expressed different variants of MDR3 in the yeast Pichia pastoris, purified the proteins via tandem affinity chromatography, and determined MDR3-specific ATPase activity in the presence or absence of phospholipids. The ATPase activity of wild type MDR3 was stimulated 2-fold by liver PC or 1,2-dioleoyl-sn-glycero-3-phosphatidylethanolamine lipids. Furthermore, the cross-linking of MDR3 with a thiol-reactive fluorophore blocked ATP hydrolysis and exhibited no PC stimulation. Similarly, phosphatidylethanolamine, phosphatidylserine, and sphingomyelin lipids did not induce an increase of wild type MDR3 ATPase activity. The phosphate analogues beryllium fluoride and aluminum fluoride led to complete inhibition of ATPase activity, whereas orthovanadate inhibited exclusively the PC-stimulated ATPase activity of MDR3. The Q1174E mutation is located in the nucleotide-binding domain in direct proximity of the leucine of the ABC signature motif and extended the X loop, which is found in ABC exporters. Our data on the Q1174E mutant demonstrated basal ATPase activity, but PC lipids were incapable of stimulating ATPase activity highlighting the role of the extended X loop in the cross-talk of the nucleotide-binding domain and the transmembrane domain. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Characterization of Novel Calmodulin Binding Domains within IQ Motifs of IQGAP1
Jang, Deok-Jin; Ban, Byungkwan; Lee, Jin-A
2011-01-01
IQ motif-containing GTPase-activating protein 1 (IQGAP1), which is a well-known calmodulin (CaM) binding protein, is involved in a wide range of cellular processes including cell proliferation, tumorigenesis, adhesion, and migration. Interaction of IQGAP1 with CaM is important for its cellular functions. Although each IQ domain of IQGAP1 for CaM binding has been characterized in a Ca2+-dependent or -independent manner, it was not clear which IQ motifs are physiologically relevant for CaM binding in the cells. In this study, we performed immunoprecipitation using 3xFLAGhCaM in mammalian cell lines to characterize the domains of IQGAP1 that are key for CaM binding under physiological conditions. Interestingly, using this method, we identified two novel domains, IQ(2.7-3) and IQ(3.5-4.4), within IQGAP1 that were involved in Ca2+-independent or -dependent CaM binding, respectively. Mutant analysis clearly showed that the hydrophobic regions within IQ(2.7-3) were mainly involved in apoCaM binding, while the basic amino acids and hydrophobic region of IQ(3.5-4.4) were required for Ca2+/CaM binding. Finally, we showed that IQ(2.7-3) was the main apoCaM binding domain and both IQ(2.7-3) and IQ(3.5-4.4) were required for Ca2+/CaM binding within IQ(1-2-3-4). Thus, we identified and characterized novel direct CaM binding motifs essential for IQGAP1. This finding indicates that IQGAP1 plays a dynamic role via direct interactions with CaM in a Ca2+-dependent or -independent manner. PMID:22080369
Khund-Sayeed, Syed; He, Ximiao; Holzberg, Timothy; Wang, Jun; Rajagopal, Divya; Upadhyay, Shriyash; Durell, Stewart R; Mukherjee, Sanjit; Weirauch, Matthew T; Rose, Robert; Vinson, Charles
2016-09-12
We evaluated DNA binding of the B-HLH family members TCF4 and USF1 using protein binding microarrays (PBMs) containing double-stranded DNA probes with cytosine on both strands or 5-methylcytosine (5mC) or 5-hydroxymethylcytosine (5hmC) on one DNA strand and cytosine on the second strand. TCF4 preferentially bound the E-box motif (CAN|NTG) with strongest binding to the 8-mer CAG|GTGGT. 5mC uniformly decreases DNA binding of both TCF4 and USF1. The bulkier 5hmC also inhibited USF1 binding to DNA. In contrast, 5hmC dramatically enhanced TCF4 binding to E-box motifs ACAT|GTG and ACAC|GTG, being better bound than any 8-mer containing cytosine. Examination of X-ray structures of the closely related TCF3 and USF1 bound to DNA suggests TCF3 can undergo a conformational shift to preferentially bind to 5hmC while the USF1 basic region is bulkier and rigid precluding a conformation shift to bind 5hmC. These results greatly expand the regulatory DNA sequence landscape bound by TCF4.
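As a small illustration of the E-box pattern CAN|NTG mentioned above, the following sketch scans a plain DNA string for CANNTG hexamers. It is an assumption-laden toy: the function name `find_eboxes` is hypothetical, only one strand is scanned, and cytosine modifications (5mC, 5hmC) cannot be represented in an unannotated sequence string, so the binding effects described in the abstract are outside its scope.

```python
import re

# E-box consensus CANNTG; the lookahead lets overlapping hits be reported.
EBOX = re.compile(r"(?=(CA[ACGT]{2}TG))")

def find_eboxes(seq):
    """Return (start_index, hexamer) for every CANNTG match in seq."""
    return [(m.start(), m.group(1)) for m in EBOX.finditer(seq.upper())]

# The 8-mer CAGGTGGT from the abstract, with arbitrary flanking bases.
hits = find_eboxes("ttCAGGTGGTaa")
```

Here `hits` holds a single entry for the hexamer CAGGTG at index 2 of the example string.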
Cyclic nucleotide binding proteins in the Arabidopsis thaliana and Oryza sativa genomes
Bridges, Dave; Fraser, Marie E; Moorhead, Greg BG
2005-01-01
Background: Cyclic nucleotides are ubiquitous intracellular messengers. Until recently, the roles of cyclic nucleotides in plant cells have proven difficult to uncover. With an understanding of the protein domains which can bind cyclic nucleotides (CNB and GAF domains) we scanned the completed genomes of the higher plants Arabidopsis thaliana (mustard weed) and Oryza sativa (rice) for the effectors of these signalling molecules. Results: Our analysis found that several ion channels and a class of thioesterases constitute the possible cyclic nucleotide binding proteins in plants. Contrary to some reports, we found no biochemical or bioinformatic evidence for a plant cyclic nucleotide regulated protein kinase, suggesting that cyclic nucleotide functions in plants have evolved differently than in mammals. Conclusion: This paper provides a molecular framework for the discussion of cyclic nucleotide function in plants, and resolves a longstanding debate about the presence of a cyclic nucleotide dependent kinase in plants. PMID:15644130
Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome
Dresch, Jacqueline M.; Zellers, Rowan G.; Bork, Daniel K.; Drewell, Robert A.
2016-01-01
A long-standing objective in modern biology is to characterize the molecular components that drive the development of an organism. At the heart of eukaryotic development lies gene regulation. On the molecular level, much of the research in this field has focused on the binding of transcription factors (TFs) to regulatory regions in the genome known as cis-regulatory modules (CRMs). However, relatively little is known about the sequence-specific binding preferences of many TFs, especially with respect to the possible interdependencies between the nucleotides that make up binding sites. A particular limitation of many existing algorithms that aim to predict binding site sequences is that they do not allow for dependencies between nonadjacent nucleotides. In this study, we use a recently developed computational algorithm, MARZ, to compare binding site sequences using 32 distinct models in a systematic and unbiased approach to explore nucleotide dependencies within binding sites for 15 distinct TFs known to be critical to Drosophila development. Our results indicate that many of these proteins have varying levels of nucleotide interdependencies within their DNA recognition sequences, and that, in some cases, models that account for these dependencies greatly outperform traditional models that are used to predict binding sites. We also directly compare the ability of different models to identify the known KRUPPEL TF binding sites in CRMs and demonstrate that a more complex model that accounts for nucleotide interdependencies performs better when compared with simple models. This ability to identify TFs with critical nucleotide interdependencies in their binding sites will lead to a deeper understanding of how these molecular characteristics contribute to the architecture of CRMs and the precise regulation of transcription during organismal development. PMID:27330274
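The contrast between a position-independent model and one that allows a dependency between two (possibly nonadjacent) positions can be sketched as follows. This is not the MARZ algorithm or any of the 32 models from the study; the function names, the pseudocount scheme, and the toy binding sites are hypothetical, and an in-sample log-likelihood comparison like this one is far cruder than a held-out or penalized model comparison.

```python
import math
from collections import Counter

def column_loglik(sites, i, pseudo=1.0):
    """Log-likelihood of column i under an independent per-position model."""
    counts = Counter(s[i] for s in sites)
    n = len(sites) + 4 * pseudo  # 4 possible bases
    return sum(math.log((counts[s[i]] + pseudo) / n) for s in sites)

def pair_loglik(sites, i, j, pseudo=1.0):
    """Log-likelihood of columns i and j modelled jointly (16 base pairs)."""
    counts = Counter((s[i], s[j]) for s in sites)
    n = len(sites) + 16 * pseudo
    return sum(math.log((counts[(s[i], s[j])] + pseudo) / n) for s in sites)

def dependency_gain(sites, i, j, pseudo=1.0):
    """Gain in log-likelihood from modelling columns i and j jointly."""
    independent = column_loglik(sites, i, pseudo) + column_loglik(sites, j, pseudo)
    return pair_loglik(sites, i, j, pseudo) - independent

# Toy sites: positions 0 and 2 always agree, position 1 is constant.
toy_sites = ["AGA"] * 10 + ["TGT"] * 10
gain_dependent = dependency_gain(toy_sites, 0, 2)    # positive: joint model helps
gain_independent = dependency_gain(toy_sites, 0, 1)  # negative: extra parameters only cost
```

In this toy data the nonadjacent columns 0 and 2 are perfectly correlated, so the joint model fits better, whereas pairing column 0 with the constant column 1 only spreads pseudocount mass over unused base pairs.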
Building a stable RNA U-turn with a protonated cytidine
Gottstein-Schmidtke, Sina R.; Duchardt-Ferner, Elke; Groher, Florian; Weigand, Julia E.; Gottstein, Daniel; Suess, Beatrix; Wöhnert, Jens
2014-01-01
The U-turn is a classical three-dimensional RNA folding motif first identified in the anticodon and T-loops of tRNAs. It also occurs frequently as a building block in other functional RNA structures in many different sequence and structural contexts. U-turns induce sharp changes in the direction of the RNA backbone and often conform to the 3-nt consensus sequence 5′-UNR-3′ (N = any nucleotide, R = purine). The canonical U-turn motif is stabilized by a hydrogen bond between the N3 imino group of the U residue and the 3′ phosphate group of the R residue as well as a hydrogen bond between the 2′-hydroxyl group of the uridine and the N7 nitrogen of the R residue. Here, we demonstrate that a protonated cytidine can functionally and structurally replace the uridine at the first position of the canonical U-turn motif in the apical loop of the neomycin riboswitch. Using NMR spectroscopy, we directly show that the N3 imino group of the protonated cytidine forms a hydrogen bond with the backbone phosphate 3′ from the third nucleotide of the U-turn analogously to the imino group of the uridine in the canonical motif. In addition, we compare the stability of the hydrogen bonds in the mutant U-turn motif to the wild type and describe the NMR signature of the C+-phosphate interaction. Our results have implications for the prediction of RNA structural motifs and suggest simple approaches for the experimental identification of hydrogen bonds between protonated C-imino groups and the phosphate backbone. PMID:24951555
Common fold in helix–hairpin–helix proteins
Shao, Xuguang; Grishin, Nick V.
2000-01-01
Helix–hairpin–helix (HhH) is a widespread motif involved in non-sequence-specific DNA binding. The majority of HhH motifs function as DNA-binding modules, however, some of them are used to mediate protein–protein interactions or have acquired enzymatic activity by incorporating catalytic residues (DNA glycosylases). From sequence and structural analysis of HhH-containing proteins we conclude that most HhH motifs are integrated as a part of a five-helical domain, termed (HhH)2 domain here. It typically consists of two consecutive HhH motifs that are linked by a connector helix and displays pseudo-2-fold symmetry. (HhH)2 domains show clear structural integrity and a conserved hydrophobic core composed of seven residues, one residue from each α-helix and each hairpin, and deserves recognition as a distinct protein fold. In addition to known HhH in the structures of RuvA, RadA, MutY and DNA-polymerases, we have detected new HhH motifs in sterile alpha motif and barrier-to-autointegration factor domains, the α-subunit of Escherichia coli RNA-polymerase, DNA-helicase PcrA and DNA glycosylases. Statistically significant sequence similarity of HhH motifs and pronounced structural conservation argue for homology between (HhH)2 domains in different protein families. Our analysis helps to clarify how non-symmetric protein motifs bind to the double helix of DNA through the formation of a pseudo-2-fold symmetric (HhH)2 functional unit. PMID:10908318
Tamayo, Joel V; Teramoto, Takamasa; Chatterjee, Seema; Hall, Traci M Tanaka; Gavis, Elizabeth R
2017-04-04
The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3' untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo's RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subset of Glo's functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tseng, Quincy; Orans, Jillian; Hast, Michael A.
2012-03-16
MutSβ is a eukaryotic mismatch repair protein that preferentially targets extrahelical unpaired nucleotides and shares partial functional redundancy with MutSα (MSH2-MSH6). Although mismatch recognition by MutSα has been shown to involve a conserved Phe-X-Glu motif, little is known about the lesion-binding mechanism of MutSβ. Combined MSH3/MSH6 deficiency triggers a strong predisposition to cancer in mice and defects in msh2 and msh6 account for roughly half of hereditary nonpolyposis colorectal cancer mutations. These three MutS homologs are also believed to play a role in trinucleotide repeat instability, which is a hallmark of many neurodegenerative disorders. The baculovirus overexpression and purification of recombinant human MutSβ and three truncation mutants are presented here. Binding assays with heteroduplex DNA were carried out for biochemical characterization. Crystallization and preliminary X-ray diffraction analysis of the protein bound to a heteroduplex DNA substrate are also reported.
Comparative genomics of pyridoxal 5′-phosphate-dependent transcription factor regulons in Bacteria
Suvorova, Inna A.
2016-01-01
The MocR-subfamily transcription factors (MocR-TFs) characterized by the GntR-family DNA-binding domain and aminotransferase-like sensory domain are broadly distributed among certain lineages of Bacteria. Characterized MocR-TFs bind pyridoxal 5′-phosphate (PLP) and control transcription of genes involved in PLP, gamma aminobutyric acid (GABA) and taurine metabolism via binding specific DNA operator sites. To identify putative target genes and DNA binding motifs of MocR-TFs, we performed comparative genomics analysis of over 250 bacterial genomes. The reconstructed regulons for 825 MocR-TFs comprise structural genes from over 200 protein families involved in diverse biological processes. Using the genome context and metabolic subsystem analysis we tentatively assigned functional roles for 38 out of 86 orthologous groups of studied regulators. Most of these MocR-TF regulons are involved in PLP metabolism, as well as utilization of GABA, taurine and ectoine. The remaining studied MocR-TF regulators presumably control genes encoding enzymes involved in reduction/oxidation processes, various transporters and PLP-dependent enzymes, for example aminotransferases. Predicted DNA binding motifs of MocR-TFs are generally similar in each orthologous group and are characterized by two to four repeated sequences. Identified motifs were classified according to their structures. Motifs with direct and/or inverted repeat symmetry constitute the majority of inferred DNA motifs, suggesting preferable TF dimerization in head-to-tail or head-to-head configuration. The obtained genomic collection of in silico reconstructed MocR-TF motifs and regulons in Bacteria provides a basis for future experimental characterization of molecular mechanisms for various regulators in this family. PMID:28348826
Liu, Zihao; Ma, Shiqing; Duan, Shun; Xuliang, Deng; Sun, Yingchun; Zhang, Xi; Xu, Xinhua; Guan, Binbin; Wang, Chao; Hu, Meilin; Qi, Xingying; Zhang, Xu; Gao, Ping
2016-03-02
Bacterial adhesion and biofilm formation are the primary causes of implant-associated infection, which is difficult to eliminate and may induce failure in dental implants. Chimeric peptides with both binding and antimicrobial motifs may provide a promising alternative to inhibit biofilm formation on titanium surfaces. In this study, chimeric peptides were designed by connecting an antimicrobial motif (JH8194: KRLFRRWQWRMKKY) with a binding motif (minTBP-1: RKLPDA) directly or via flexible/rigid linkers to modify Ti surfaces. We evaluated the binding behavior of peptides using quartz crystal microbalance (QCM) and atomic force microscopy (AFM) techniques and investigated the effect of the modification of titanium surfaces with these peptides on the bioactivity of Streptococcus gordonii (S. gordonii) and Streptococcus sanguis (S. sanguis). Compared with the flexible linker (GGGGS), the rigid linker (PAPAP) significantly increased the adsorption of the chimeric peptide on titanium surfaces (p < 0.05). Concentration-dependent adsorption is consistent with a single Langmuir model, whereas time-dependent adsorption is in line with a two-domain Langmuir model. Additionally, the chimeric peptide with the rigid linker exhibited more effective antimicrobial ability than the peptide with the flexible linker. This finding was ascribed to the ability of the rigid linker to separate functional domains and reduce their interference to the maximum extent. Consequently, the performance of chimeric peptides with specific titanium-binding motifs and antimicrobial motifs against bacteria can be optimized by the proper selection of linkers. This rational design of chimeric peptides provides a promising alternative to inhibit the formation of biofilms on titanium surfaces with the potential to prevent peri-implantitis and peri-implant mucositis.
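The single Langmuir model mentioned above for concentration-dependent adsorption can be illustrated with a minimal sketch. The parameter values (`q_max`, `k_eq`) and the concentration units are hypothetical placeholders, not values from the study, and the two-domain model used for time-dependent adsorption is not reproduced here.

```python
def langmuir_adsorption(conc, q_max, k_eq):
    """Single-site Langmuir isotherm: adsorbed amount at equilibrium.

    conc  : free peptide concentration (arbitrary units, assumed)
    q_max : saturation coverage of the surface (hypothetical value)
    k_eq  : equilibrium binding constant in 1/conc units (hypothetical value)
    """
    if conc < 0:
        raise ValueError("concentration must be non-negative")
    return q_max * k_eq * conc / (1.0 + k_eq * conc)


# Hypothetical parameters, chosen only to show the shape of the curve.
Q_MAX = 100.0
K_EQ = 0.5

for c in (0.0, 1.0, 2.0, 10.0, 100.0):
    print(f"c = {c:6.1f}  adsorbed = {langmuir_adsorption(c, Q_MAX, K_EQ):6.2f}")
```

At `conc = 1 / k_eq` the model gives exactly half of `q_max`, which is a convenient sanity check when fitting measured adsorption data.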
Moriuchi, Hiromi; Unno, Hideaki; Goda, Shuichiro; Tateno, Hiroaki; Hirabayashi, Jun; Hatakeyama, Tomomitsu
2015-07-01
CEL-I is a galactose/N-acetylgalactosamine-specific C-type lectin isolated from the sea cucumber Cucumaria echinata. Its carbohydrate-binding site contains a QPD (Gln-Pro-Asp) motif, which is generally recognized as the galactose specificity-determining motif in the C-type lectins. In our previous study, replacement of the QPD motif by an EPN (Glu-Pro-Asn) motif led to a weak binding affinity for mannose. Therefore, we examined the effects of an additional mutation in the carbohydrate-binding site on the specificity of the lectin. Trp105 of EPN-CEL-I was replaced by a histidine residue using site-directed mutagenesis, and the binding affinity of the resulting mutant, EPNH-CEL-I, was examined by sugar-polyamidoamine dendrimer assay, isothermal titration calorimetry, and glycoconjugate microarray analysis. Tertiary structure of the EPNH-CEL-I/mannose complex was determined by X-ray crystallographic analysis. Sugar-polyamidoamine dendrimer assay and glycoconjugate microarray analysis revealed a drastic change in the specificity of EPNH-CEL-I from galactose/N-acetylgalactosamine to mannose. The association constant of EPNH-CEL-I for mannose was determined to be 3.17 × 10³ M⁻¹ at 25°C. Mannose specificity of EPNH-CEL-I was achieved by stabilization of the binding of mannose in a correct orientation, in which the EPN motif can form proper hydrogen bonds with 3- and 4-hydroxy groups of the bound mannose. Specificity of CEL-I can be engineered by mutating a limited number of amino acid residues in addition to the QPD/EPN motifs. Versatility of the C-type carbohydrate-recognition domain structure in the recognition of various carbohydrate chains could become a promising platform to develop novel molecular recognition proteins. Copyright © 2015 Elsevier B.V. All rights reserved.
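To put the reported association constant in more familiar terms, the short sketch below converts it to a dissociation constant and a standard binding free energy using the textbook relations Kd = 1/Ka and ΔG° = −RT ln Ka. The function names are illustrative, and a 1 M standard state is assumed; only the Ka value and the temperature come from the abstract.

```python
import math

GAS_CONSTANT = 8.314462618  # J/(mol*K)


def dissociation_constant(ka_per_molar):
    """Kd in mol/L from an association constant in L/mol."""
    return 1.0 / ka_per_molar


def standard_free_energy_kj(ka_per_molar, temp_celsius):
    """Standard binding free energy in kJ/mol (1 M standard state assumed)."""
    temp_kelvin = temp_celsius + 273.15
    return -GAS_CONSTANT * temp_kelvin * math.log(ka_per_molar) / 1000.0


ka = 3.17e3  # M^-1, value reported for EPNH-CEL-I and mannose at 25 C
kd = dissociation_constant(ka)
dg = standard_free_energy_kj(ka, 25.0)

print(f"Kd is about {kd * 1e6:.0f} micromolar")
print(f"Standard free energy is about {dg:.1f} kJ/mol")
```

The resulting Kd of roughly 0.3 mM is in keeping with the modest affinities generally seen for lectin binding to monosaccharides.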
Sahu, Santosh Kumar; Aradhyam, Gopala Krishna; Gummadi, Sathyanarayana N
2009-10-01
Phospholipid scramblases are a group of four homologous proteins conserved from C. elegans to humans. In humans, two members of the scramblase family, hPLSCR1 and hPLSCR3, are known to bring about Ca2+-dependent translocation of phosphatidylserine and cardiolipin, respectively, during apoptotic processes. However, the affinities of Ca2+/Mg2+ binding to human scramblases and the conformational changes taking place in them remain unknown. In the present study, we analyzed the Ca2+ and Mg2+ binding to the calcium binding motifs of hPLSCR1-4 and hPLSCR1 by spectroscopic methods and isothermal titration calorimetry. The results in this study show that (i) affinities of the peptides are in the order hPLSCR1>hPLSCR3>hPLSCR2>hPLSCR4 for Ca2+ and in the order hPLSCR1>hPLSCR2>hPLSCR3>hPLSCR4 for Mg2+, and (ii) binding of ions brings about a conformational change in the secondary structure of the peptides. The affinity of Ca2+ and Mg2+ binding to protein hPLSCR1 was similar to that of the peptide I. A sequence comparison shows the existence of scramblase-like motifs among other protein families. Based on the above results, we hypothesize that the Ca2+ binding motif of hPLSCR1 is a novel type of Ca2+ binding motif. Our findings will be relevant in understanding the calcium-dependent scrambling activity of hPLSCRs and their biological function.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Poat, J.A.; Cripps, H.E.; Iversen, L.L.
1988-05-01
Forskolin labelled with (³H) bound to high- and low-affinity sites in the rat brain. The high-affinity site was discretely located, with highest densities in the striatum, nucleus accumbens, olfactory tubercle, substantia nigra, hippocampus, and the molecular layers of the cerebellum. This site did not correlate well with the distribution of adenylate cyclase. The high-affinity striatal binding site may be associated with a stimulatory guanine nucleotide-binding protein. Thus, the number of sites was increased by the addition of Mg²⁺ and guanylyl imidodiphosphate. Cholera toxin stereotaxically injected into rat striatum increased the number of binding sites, and no further increase was noted following the subsequent addition of guanyl nucleotide. High-affinity forskolin binding sites in non-dopamine-rich brain areas (hippocampus and cerebellum) were modulated in a qualitatively different manner by guanyl nucleotides. In these areas the number of binding sites was significantly reduced by the addition of guanyl nucleotide. These results suggest that forskolin may have a potential role in identifying different functional/structural guanine nucleotide-binding proteins.
Ding, Zhong; Peng, Deliang; Huang, Wenkun; He, Wenting; Gao, Bida
2008-02-01
A cDNA, named Dd-ace-2, encoding an acetylcholinesterase (AChE, EC 3.1.1.7), was isolated from the sweet potato stem nematode, Ditylenchus destructor. The nucleotide and amino acid sequences of different nematode species were compared and analyzed with the DNAMAN 5.0 and MEGA 3.0 software packages. The results showed that the complete nucleotide sequence of the Dd-ace-2 gene of Ditylenchus destructor contains 2425 base pairs, from which a sequence of 734 amino acids was deduced (GenBank accession No. EF583058). The homology of the Dd-ace-2 amino acid sequence of Ditylenchus destructor with those of Meloidogyne incognita, Caenorhabditis elegans and Dictyocaulus viviparus was 48.0%, 42.7% and 42.1%, respectively. The mature acetylcholinesterase sequence of Ditylenchus destructor may correspond to the first 701 residues of the deduced 734 amino acids. The conserved motifs involved in the catalytic triad, the choline binding site and 10 aromatic residues lining the catalytic gorge were present in the Dd-ace-2 deduced protein. Phylogenetic analysis based on AChEs of other nematodes and species showed that the deduced AChE formed the same cluster with ACE-2s.
Sloma, Michael F.; Mathews, David H.
2016-01-01
RNA secondary structure prediction is widely used to analyze RNA sequences. In an RNA partition function calculation, free energy nearest neighbor parameters are used in a dynamic programming algorithm to estimate statistical properties of the secondary structure ensemble. Previously, partition functions have largely been used to estimate the probability that a given pair of nucleotides form a base pair, the conditional stacking probability, the accessibility to binding of a continuous stretch of nucleotides, or a representative sample of RNA structures. Here it is demonstrated that an RNA partition function can also be used to calculate the exact probability of formation of hairpin loops, internal loops, bulge loops, or multibranch loops at a given position. This calculation can also be used to estimate the probability of formation of specific helices. Benchmarking on a set of RNA sequences with known secondary structures indicated that loops that were calculated to be more probable were more likely to be present in the known structure than less probable loops. Furthermore, highly probable loops are more likely to be in the known structure than the set of loops predicted in the lowest free energy structures. PMID:27852924
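The idea of reading a loop probability off a partition function can be shown on a toy ensemble. The sketch below is not the dynamic programming algorithm described in the abstract: it explicitly enumerates a handful of hypothetical dot-bracket structures with made-up free energies and computes the Boltzmann-weighted probability that a particular hairpin loop is present.

```python
import math

# RT in kcal/mol at 37 C (310.15 K), using R = 0.0019872 kcal/(mol*K).
RT = 0.0019872 * 310.15


def has_hairpin(structure, i, j):
    """True if positions i and j (0-based) close a hairpin loop.

    In valid dot-bracket notation, '(' at i and ')' at j with only
    unpaired positions between them means that i pairs with j.
    """
    return (
        structure[i] == "("
        and structure[j] == ")"
        and all(c == "." for c in structure[i + 1:j])
    )


def feature_probability(ensemble, predicate):
    """Sum of Boltzmann weights of structures with the feature, divided by Z."""
    weights = [math.exp(-energy / RT) for _, energy in ensemble]
    partition_function = sum(weights)
    with_feature = sum(
        w for (structure, _), w in zip(ensemble, weights) if predicate(structure)
    )
    return with_feature / partition_function


# Hypothetical ensemble: (dot-bracket structure, free energy in kcal/mol).
TOY_ENSEMBLE = [
    ("((((....))))", -3.0),
    ("(((......)))", -2.1),
    ("..((....))..", -1.0),
    ("............", 0.0),
]

p_loop = feature_probability(TOY_ENSEMBLE, lambda s: has_hairpin(s, 3, 8))
print(f"P(hairpin closed by pair 3-8) = {p_loop:.3f}")
```

In this toy case the first and third structures both contain the loop, so its probability exceeds that of either structure alone, which is why a loop can be highly probable even when no single structure dominates the ensemble.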
Parag-Sharma, Kshitij; Leyme, Anthony; DiGiacomo, Vincent; Marivin, Arthur; Broselid, Stefan; Garcia-Marcos, Mikel
2016-12-30
GIV (aka Girdin) is a guanine nucleotide exchange factor that activates heterotrimeric G protein signaling downstream of RTKs and integrins, thereby serving as a platform for signaling cascade cross-talk. GIV is recruited to the cytoplasmic tail of receptors upon stimulation, but the mechanism of activation of its G protein regulatory function is not well understood. Here we used assays in humanized yeast models and G protein activity biosensors in mammalian cells to investigate the role of GIV subcellular compartmentalization in regulating its ability to promote G protein signaling. We found that in unstimulated cells GIV does not co-fractionate with its substrate G protein Gα i3 on cell membranes and that constitutive membrane anchoring of GIV in yeast cells or rapid membrane translocation in mammalian cells via chemically induced dimerization leads to robust G protein activation. We show that membrane recruitment of the GIV "Gα binding and activating" motif alone is sufficient for G protein activation and that it does not require phosphomodification. Furthermore, we engineered a synthetic protein to show that recruitment of the GIV "Gα binding and activating" motif to membranes via association with active RTKs, instead of via chemically induced dimerization, is also sufficient for G protein activation. These results reveal that recruitment of GIV to membranes in close proximity to its substrate G protein is a major mechanism responsible for the activation of its G protein regulatory function. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Takahashi, Takeshi; Kojima, Kyosuke; Zhang, Wei; Sasaki, Kanae; Ito, Masaru; Suzuki, Hironori; Kawasaki, Masato; Wakatsuki, Soichi; Takahara, Terunao; Shibata, Hideki; Maki, Masatoshi
2015-01-01
ALG-2, a 22-kDa penta-EF-hand protein, is involved in cell death, signal transduction, membrane trafficking, etc., by interacting with various proteins in mammalian cells in a Ca2+-dependent manner. Most known ALG-2-interacting proteins contain proline-rich regions in which either PPYPXnYP (type 1 motif) or PXPGF (type 2 motif) is commonly found. Previous X-ray crystal structural analysis of the complex between ALG-2 and an ALIX peptide revealed that the peptide binds to the two hydrophobic pockets. In the present study, we resolved the crystal structure of the complex between ALG-2 and a peptide of Sec31A (outer shell component of coat complex II, COPII; containing the type 2 motif) and found that the peptide binds to the third hydrophobic pocket (Pocket 3). While amino acid substitution of Phe85, a Pocket 3 residue, with Ala abrogated the interaction with Sec31A, it did not affect the interaction with ALIX. On the other hand, amino acid substitution of Tyr180, a Pocket 1 residue, with Ala caused loss of binding to ALIX, but maintained binding to Sec31A. We conclude that ALG-2 recognizes two types of motifs at different hydrophobic surfaces. Furthermore, based on the results of serial mutational analysis of the ALG-2-binding sites in Sec31A, the type 2 motif was newly defined. PMID:25667979
Chong, P. Andrew; Lin, Hong; Wrana, Jeffrey L.; Forman-Kay, Julie D.
2010-01-01
Smad ubiquitination regulatory factor 2 (Smurf2) is an E3 ubiquitin ligase that participates in degradation of TGF-β receptors and other targets. Smurf2 WW domains recognize PPXY (PY) motifs on ubiquitin ligase target proteins or on adapters, such as Smad7, that bind to E3 target proteins. We previously demonstrated that the isolated WW3 domain of Smurf2, but not the WW2 domain, can directly bind to a Smad7 PY motif. We show here that the WW2 augments this interaction by binding to the WW3 and making auxiliary contacts with the PY motif and a novel E/D-S/T-P motif, which is N-terminal to all Smad PY motifs. The WW2 likely enhances the selectivity of Smurf2 for the Smad proteins. NMR titrations confirm that Smad1 and Smad2 are bound by Smurf2 with the same coupled WW domain arrangement used to bind Smad7. The analogous WW domains in the short isoform of Smurf1 recognize the Smad7 PY peptide using the same coupled mechanism. However, a longer Smurf1 isoform, which has an additional 26 residues in the inter-WW domain linker, is only partially able to use the coupled WW domain binding mechanism. The longer linker results in a decrease in affinity for the Smad7 peptide. Interdomain coupling of WW domains enhances selectivity and enables the tuning of interactions by isoform switching. PMID:20937913
Structural Dynamics as a Contributor to Error-prone Replication by an RNA-dependent RNA Polymerase*
Moustafa, Ibrahim M.; Korboukh, Victoria K.; Arnold, Jamie J.; Smidansky, Eric D.; Marcotte, Laura L.; Gohara, David W.; Yang, Xiaorong; Sánchez-Farrán, María Antonieta; Filman, David; Maranas, Janna K.; Boehr, David D.; Hogle, James M.; Colina, Coray M.; Cameron, Craig E.
2014-01-01
RNA viruses encoding high- or low-fidelity RNA-dependent RNA polymerases (RdRp) are attenuated. The ability to predict residues of the RdRp required for faithful incorporation of nucleotides represents an essential step in any pipeline intended to exploit perturbed fidelity as the basis for rational design of vaccine candidates. We used x-ray crystallography, molecular dynamics simulations, NMR spectroscopy, and pre-steady-state kinetics to compare a mutator (H273R) RdRp from poliovirus to the wild-type (WT) enzyme. We show that the nucleotide-binding site toggles between the nucleotide binding-occluded and nucleotide binding-competent states. The conformational dynamics between these states were enhanced by binding to primed template RNA. For the WT, the occluded conformation was favored; for H273R, the competent conformation was favored. The resonance for Met-187 in our NMR spectra reported on the ability of the enzyme to check the correctness of the bound nucleotide. Kinetic experiments were consistent with the conformational dynamics contributing to the established pre-incorporation conformational change and fidelity checkpoint. For H273R, residues comprising the active site spent more time in the catalytically competent conformation and were more positively correlated than the WT. We propose that by linking the equilibrium between the binding-occluded and binding-competent conformations of the nucleotide-binding pocket and other active-site dynamics to the correctness of the bound nucleotide, faithful nucleotide incorporation is achieved. These studies underscore the need to apply multiple biophysical and biochemical approaches to the elucidation of the physical basis for polymerase fidelity. PMID:25378410
Analysis of the interactome of the Ser/Thr Protein Phosphatase type 1 in Plasmodium falciparum.
Hollin, Thomas; De Witte, Caroline; Lenne, Astrid; Pierrot, Christine; Khalife, Jamal
2016-03-17
Protein Phosphatase 1 (PP1) is an enzyme essential to cell viability in the malaria parasite Plasmodium falciparum (Pf). The activity of PP1 is regulated by the binding of regulatory subunits, of which there are up to 200 in humans, but only 3 have so far been reported for the parasite. To better understand the P. falciparum PP1 (PfPP1) regulatory network, we here report the use of three strategies to characterize the PfPP1 interactome: co-affinity purified proteins identified by mass spectrometry, yeast two-hybrid (Y2H) screening and in silico analysis of the P. falciparum predicted proteome. Co-affinity purification followed by MS analysis identified 6 PfPP1 interacting proteins (Pips), of which 3 contained the RVxF consensus binding motif, 2 contained a Fxx[RK]x[RK] motif, also shown to be a PP1 binding motif, and one contained both binding motifs. The Y2H screens identified 134 proteins, of which 30 present the RVxF binding motif and 20 have the Fxx[RK]x[RK] binding motif. The in silico screen of the Pf predicted proteome using a consensus RVxF motif as template revealed the presence of 55 potential Pips. As further demonstration, 35 candidate proteins were validated as PfPP1 interacting proteins in an ELISA-based assay. To the best of our knowledge, this is the first study on the PfPP1 interactome. The data report several conserved PP1 interacting proteins as well as a high number of specific interactors to PfPP1. Their analysis indicates a high diversity of biological functions for PP1 in Plasmodium. Based on the present data and on an earlier study of the Pf interactome, a potential implication of Pips in protein folding/proteolysis, transcription and pathogenicity networks is proposed. The present work provides a starting point for further studies on the structural basis of these interactions and their functions in P. falciparum.
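An in silico motif screen of this kind can be sketched with regular expressions. The example below reads the two motifs literally as written in the abstract (RVxF and Fxx[RK]x[RK], with x standing for any residue); the authors' actual consensus template may be more degenerate, and the protein sequences here are invented purely for illustration.

```python
import re

# Literal regex translations of the motifs named in the abstract (x = any residue).
# Lookahead groups are used so that overlapping occurrences are also reported.
PP1_MOTIFS = {
    "RVxF": re.compile(r"(?=(RV.F))"),
    "Fxx[RK]x[RK]": re.compile(r"(?=(F..[RK].[RK]))"),
}


def scan_sequence(sequence):
    """Return {motif name: [(1-based start position, matched residues), ...]}."""
    return {
        name: [(m.start() + 1, m.group(1)) for m in pattern.finditer(sequence)]
        for name, pattern in PP1_MOTIFS.items()
    }


# Hypothetical sequences, not taken from the P. falciparum proteome.
TOY_PROTEOME = {
    "toy_protein_A": "MSKRVSFAAKLRGG",
    "toy_protein_B": "MGGSSAAPLLE",
}

for protein_id, seq in TOY_PROTEOME.items():
    hits = scan_sequence(seq)
    found = {name: h for name, h in hits.items() if h}
    print(protein_id, found if found else "no motif found")
```

A sequence match alone is only a candidate; as in the study, hits would still need experimental validation.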
Novel DNA Motif Binding Activity Observed In Vivo With an Estrogen Receptor α Mutant Mouse
Li, Leping; Grimm, Sara A.; Winuthayanon, Wipawee; Hamilton, Katherine J.; Pockette, Brianna; Rubel, Cory A.; Pedersen, Lars C.; Fargo, David; Lanz, Rainer B.; DeMayo, Francesco J.; Schütz, Günther; Korach, Kenneth S.
2014-01-01
Estrogen receptor α (ERα) interacts with DNA directly or indirectly via other transcription factors, referred to as “tethering.” Evidence for tethering is based on in vitro studies and a widely used “KIKO” mouse model containing mutations that prevent direct estrogen response element DNA-binding. KIKO mice are infertile, due in part to the inability of estradiol (E2) to induce uterine epithelial proliferation. To elucidate the molecular events that prevent KIKO uterine growth, regulation of the pro-proliferative E2 target gene Klf4 and of Klf15, a progesterone (P4) target gene that opposes the pro-proliferative activity of KLF4, was evaluated. Klf4 induction was impaired in KIKO uteri; however, Klf15 was induced by E2 rather than by P4. Whole uterine chromatin immunoprecipitation-sequencing revealed enrichment of KIKO ERα binding to hormone response element (HRE) motifs. KIKO binding to HRE motifs was verified using reporter gene and DNA-binding assays. Because the KIKO ERα has HRE DNA-binding activity, we evaluated the “EAAE” ERα, which has more severe DNA-binding domain mutations, and demonstrated a lack of estrogen response element or HRE reporter gene induction or DNA-binding. The EAAE mouse has an ERα null–like phenotype, with impaired uterine growth and transcriptional activity. Our findings demonstrate that the KIKO mouse model, which has been used by numerous investigators, cannot be used to establish biological functions for ERα tethering, because KIKO ERα effectively stimulates transcription using HRE motifs. The EAAE-ERα DNA-binding domain mutant mouse demonstrates that ERα DNA-binding is crucial for biological and transcriptional processes in reproductive tissues and that ERα tethering may not contribute to estrogen responsiveness in vivo. PMID:24713037
Structural and Histone Binding Ability Characterizations of Human PWWP Domains
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, Hong; Zeng, Hong; Lam, Robert
2013-09-25
The PWWP domain was first identified as a structural motif of 100-130 amino acids in the WHSC1 protein and predicted to be a protein-protein interaction domain. It belongs to the Tudor domain 'Royal Family', which consists of Tudor, chromodomain, MBT and PWWP domains. While Tudor, chromodomain and MBT domains have long been known to bind methylated histones, PWWP was shown to exhibit histone binding ability only recently. The PWWP domain has been shown to be a DNA binding domain, but sequence analysis and previous structural studies show that the PWWP domain exhibits significant similarity to other 'Royal Family' members, implying that the PWWP domain has the potential to bind histones. In order to further explore the function of the PWWP domain, we used the protein family approach to determine the crystal structures of the PWWP domains from seven different human proteins. Our fluorescence polarization binding studies show that PWWP domains have weak histone binding ability, which is also confirmed by our NMR titration experiments. Furthermore, we determined the crystal structures of the BRPF1 PWWP domain in complex with H3K36me3, and HDGF2 PWWP domain in complex with H3K79me3 and H4K20me3. PWWP proteins constitute a new family of methyl lysine histone binders. The PWWP domain consists of three motifs: a canonical β-barrel core, an insertion motif between the second and third β-strands and a C-terminal α-helix bundle. Both the canonical β-barrel core and the insertion motif are directly involved in histone binding. The PWWP domain has been previously shown to be a DNA binding domain. Therefore, the PWWP domain exhibits dual functions: binding both DNA and methyllysine histones.
Molecular Control of Polyene Macrolide Biosynthesis
Santos-Aberturas, Javier; Vicente, Cláudia M.; Guerra, Susana M.; Payero, Tamara D.; Martín, Juan F.; Aparicio, Jesús F.
2011-01-01
Control of polyene macrolide production in Streptomyces natalensis is mediated by the transcriptional activator PimM. This regulator, which combines an N-terminal PAS domain with a C-terminal helix-turn-helix motif, is highly conserved among polyene biosynthetic gene clusters. PimM, truncated forms of the protein without the PAS domain (PimMΔPAS), and forms containing just the DNA-binding domain (DBD) (PimMDBD) were overexpressed in Escherichia coli as GST-fused proteins. GST-PimM binds directly to eight promoters of the pimaricin cluster, as demonstrated by electrophoretic mobility shift assays. Assays with truncated forms of the protein revealed that the PAS domain does not mediate specificity or the distinct recognition of target genes, which rely on the DBD domain, but significantly reduces binding affinity up to 500-fold. Transcription start points were identified by 5′-rapid amplification of cDNA ends, and the binding regions of PimMDBD were investigated by DNase I protection studies. In all cases, binding took place covering the −35 hexamer box of each promoter, suggesting an interaction of PimM and RNA polymerase to cause transcription activation. Information content analysis of the 16 sequences protected in target promoters was used to deduce the structure of the PimM-binding site. This site displays dyad symmetry, spans 14 nucleotides, and adjusts to the consensus TVGGGAWWTCCCBA. Experimental validation of this binding site was performed by using synthetic DNA duplexes. Binding of PimM to the promoter region of one of the polyketide synthase genes from the Streptomyces nodosus amphotericin cluster containing the consensus binding site was also observed, thus proving the applicability of the findings reported here to other antifungal polyketides. PMID:21187288
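The dyad symmetry of the reported consensus can be checked directly from its IUPAC codes. The sketch below expands the consensus TVGGGAWWTCCCBA into a regular expression and confirms that it equals its own reverse complement; the test DNA string is a made-up example, not one of the protected promoter sequences.

```python
import re

# Standard IUPAC nucleotide codes and their complements.
IUPAC_BASES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "W": "AT", "S": "CG", "R": "AG", "Y": "CT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}
IUPAC_COMPLEMENT = {
    "A": "T", "T": "A", "C": "G", "G": "C",
    "W": "W", "S": "S", "R": "Y", "Y": "R", "K": "M", "M": "K",
    "B": "V", "V": "B", "D": "H", "H": "D", "N": "N",
}


def consensus_to_regex(consensus):
    """Expand an IUPAC consensus into a regex character-class pattern."""
    parts = []
    for code in consensus:
        bases = IUPAC_BASES[code]
        parts.append(bases if len(bases) == 1 else "[" + bases + "]")
    return "".join(parts)


def reverse_complement(consensus):
    """Reverse complement of a sequence written in IUPAC codes."""
    return "".join(IUPAC_COMPLEMENT[code] for code in reversed(consensus))


PIMM_CONSENSUS = "TVGGGAWWTCCCBA"  # consensus given in the abstract
pimm_pattern = re.compile(consensus_to_regex(PIMM_CONSENSUS))

print(consensus_to_regex(PIMM_CONSENSUS))
print("dyad symmetric:", reverse_complement(PIMM_CONSENSUS) == PIMM_CONSENSUS)

# Hypothetical DNA fragment containing one site that fits the consensus.
toy_dna = "GGCCTAGGGAATTCCCGATT"
match = pimm_pattern.search(toy_dna)
print("site found at 0-based index:", match.start() if match else None)
```

Because the consensus is its own reverse complement, scanning one strand is enough to find sites on both.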
Cao, Shuang-Shuang; Du, Yu-Zhou
2014-09-15
The mitogenome of Chilo auricilius (Lepidoptera: Pyraloidea: Crambidae) was a circular molecule made up of 15,367 bp. Sesamia inferens, Chilo suppressalis, Tryporyza incertulas, and C. auricilius are closely related, well-known rice stem borers that are widely distributed in the main rice-growing regions of China. The gene order and orientation of all four stem borers were similar to those of other insect mitogenomes. Among the four stem borers, all AT contents were below 83%, while all AT contents of tRNA genes were above 80%. The genomes were compact, with only 121-257 bp of non-coding intergenic spacer. The Crambidae moths had 56 or 62 bp of overlapping nucleotides, whereas the noctuid moth S. inferens had only 25 bp. There was a conserved motif 'ATACTAAA' between trnS2 (UCN) and nad1 in Crambidae moths, but this same region was 'ATCATA' in the noctuid S. inferens. In addition, there was a 7-bp motif 'ATGATAA' of overlapping nucleotides, which was conserved in Lepidoptera, and a 14-bp motif 'TAAGCTATTTAAAT' conserved in the three Crambidae moths (C. suppressalis, C. auricilius and T. incertulas), but not in the noctuid. Finally, there were no stem-and-loop structures in the two Chilo moths. Copyright © 2014 Elsevier B.V. All rights reserved.
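The AT content figures quoted above are simple base-composition percentages. A minimal sketch of that calculation follows; it uses the short conserved motifs from the abstract as input only because no full sequences are given here, and the function name is illustrative.

```python
def at_content(sequence):
    """Percentage of A and T among unambiguous bases (A, C, G, T) in a sequence."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    at_count = sum(1 for b in bases if b in "AT")
    return 100.0 * at_count / len(bases)


# Motifs quoted in the abstract, used here only as short demonstration inputs.
for motif in ("ATACTAAA", "ATCATA", "TAAGCTATTTAAAT"):
    print(f"{motif}: length {len(motif)} bp, AT content {at_content(motif):.1f}%")
```

Applied to a whole mitogenome or to the concatenated tRNA genes, the same function would yield the kind of percentages compared among the four stem borers.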
Bergmann, Tobias; Moore, Carrie; Sidney, John; Miller, Donald; Tallmadge, Rebecca; Harman, Rebecca M.; Oseroff, Carla; Wriston, Amanda; Shabanowitz, Jeffrey; Hunt, Donald F.; Osterrieder, Nikolaus; Peters, Bjoern; Antczak, Douglas F.; Sette, Alessandro
2016-01-01
Here we describe a detailed quantitative peptide-binding motif for the common equine leukocyte antigen (ELA) class I allele Eqca-1*00101, present in roughly 25% of Thoroughbred horses. We determined a preliminary binding motif by sequencing endogenously bound ligands. Subsequently, a positional scanning combinatorial library (PSCL) was used to further characterize binding specificity and derive a quantitative motif involving aspartic acid in position 2 and hydrophobic residues at the C-terminus. Using this motif, we selected and tested 9- and 10-mer peptides derived from the equine herpesvirus type 1 (EHV-1) proteome for their capacity to bind Eqca-1*00101. PSCL predictions were very efficient, with a receiver operating characteristic (ROC) curve performance of 0.877, and 87 peptides derived from 40 different EHV-1 proteins were identified with affinities of 500 nM or higher. Quantitative analysis revealed that Eqca-1*00101 has a narrow peptide-binding repertoire, in comparison to those of most human, non-human primate, and mouse class I alleles. Peripheral blood mononuclear cells from six EHV-1-infected, or vaccinated but uninfected, Eqca-1*00101-positive horses were used in IFN-γ enzyme-linked immunospot (ELISPOT) assays. When we screened the 87 Eqca-1*00101-binding peptides for T cell reactivity, only one Eqca-1*00101 epitope, derived from the intermediate-early protein ICP4, was identified. Thus, despite its common occurrence in several horse breeds, Eqca-1*00101 is associated with a narrow binding repertoire and a similarly narrow T cell response to an important equine viral pathogen. Intriguingly, these features are shared with other human and macaque major histocompatibility complex (MHC) molecules with a similar specificity for D in position 2 or 3 in their main anchor motif. PMID:26399241
Structural and biochemical analysis of Bcl-2 interaction with the hepatitis B virus protein HBx.
Jiang, Tianyu; Liu, Minhao; Wu, Jianping; Shi, Yigong
2016-02-23
HBx is a hepatitis B virus protein that is required for viral infectivity and replication. Anti-apoptotic Bcl-2 family members are thought to be among the important host targets of HBx. However, the structure and function of HBx are poorly understood and the molecular mechanism of HBx-induced carcinogenesis remains unknown. In this study, we report biochemical and structural characterization of HBx. The recombinant HBx protein contains metal ions, in particular iron and zinc. A BH3-like motif in HBx (residues 110-135) binds Bcl-2 with a dissociation constant of ∼193 μM, which is drastically lower than that for a canonical BH3 motif from Bim or Bad. Structural analysis reveals that, similar to other BH3 motifs, the BH3-like motif of HBx adopts an amphipathic α-helix and binds the conserved BH3-binding groove on Bcl-2. Unlike the helical Bim or Bad BH3 motif, the C-terminal portion of the bound HBx BH3-like motif has an extended conformation and makes considerably fewer interactions with Bcl-2. These observations suggest that HBx may modulate Bcl-2 function in a way that is different from that of the classical BH3-only proteins.
The amino acid motif L/IIxxFE defines a novel actin-binding sequence in PDZ-RhoGEF
Banerjee, Jayashree; Fischer, Christopher C.; Wedegaertner, Philip B.
2009-01-01
PDZ-RhoGEF is a member of the regulator of G protein signaling (RGS) domain-containing RhoGEFs (RGS-RhoGEFs) that link activated heterotrimeric G protein α subunits of the G12 family to activation of the small GTPase RhoA. Unique among the RGS-RhoGEFs, PDZ-RhoGEF contains a short sequence that localizes the protein to the actin cytoskeleton. In this report, we demonstrate that the actin-binding domain, located between amino acids 561–585, directly binds to F-actin in vitro. Extensive mutagenesis identifies isoleucine 568, isoleucine 569, phenylalanine 572, and glutamic acid 573 as necessary for binding to actin and for co-localization with the actin cytoskeleton in cells. These results define a novel actin-binding sequence in PDZ-RhoGEF with a critical amino acid motif of IIxxFE. Moreover, sequence analysis identifies a similar actin-binding motif in the N-terminus of the RhoGEF frabin, and, as with PDZ-RhoGEF, mutagenesis and actin interaction experiments demonstrate a motif of LIxxFE, consisting of the key amino acids leucine 23, isoleucine 24, phenylalanine 27, and glutamic acid 28. Taken together, results with PDZ-RhoGEF and frabin identify a novel actin binding sequence. Lastly, inducible dimerization of the actin-binding region of PDZ-RhoGEF revealed a dimerization-dependent actin bundling activity in vitro. PDZ-RhoGEF exists in cells as a dimer, raising the possibility that PDZ-RhoGEF could influence actin structure independent of its ability to activate RhoA. PMID:19618964
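The L/IIxxFE motif described above can be written as a regular expression. The sketch below is only an illustration of how candidate sites could be located in a protein sequence; the function name and the toy sequence are hypothetical, and a sequence match alone does not demonstrate actin binding.

```python
import re

# [LI]I..FE covers both IIxxFE (PDZ-RhoGEF) and LIxxFE (frabin); x = any residue.
# The lookahead lets overlapping occurrences be reported as well.
ACTIN_MOTIF = re.compile(r"(?=([LI]I..FE))")


def find_actin_motifs(protein_seq):
    """Return (1-based start, matched hexapeptide) for every motif hit."""
    seq = protein_seq.upper()
    return [(m.start() + 1, m.group(1)) for m in ACTIN_MOTIF.finditer(seq)]


# Toy sequence invented for illustration; it is not PDZ-RhoGEF or frabin.
toy = "MKLIQQFEGGIIAAFEK"
print(find_actin_motifs(toy))
```

Positions are reported 1-based to match the residue numbering convention used in the abstract.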
Zhu, Li; Hwang, Peter; Witkowska, H. Ewa; Liu, Haichuan; Li, Wu
2014-01-01
Tooth enamel is the hardest tissue in vertebrate animals. Consisting of millions of carbonated hydroxyapatite crystals, this highly mineralized tissue develops from a protein matrix in which amelogenin is the predominant component. The enamel matrix proteins are eventually and completely degraded and removed by proteinases to form mineral-enriched tooth enamel. Identification of the apatite-binding motifs in amelogenin is critical for understanding the amelogenin–crystal interactions and amelogenin–proteinases interactions during tooth enamel biomineralization. A stepwise strategy is introduced to kinetically and quantitatively identify the crystal-binding motifs in amelogenin, including a peptide screening assay, a competitive adsorption assay, and a kinetic-binding assay using amelogenin and gene-engineered amelogenin mutants. A modified enzyme-linked immunosorbent assay on crystal surfaces is also applied to compare binding amounts of amelogenin and its mutants on different planes of apatite crystals. We describe the detailed protocols for these assays and provide the considerations for these experiments in this chapter. PMID:24188774
Chen, Yan; Carrington-Lawrence, Stacy D.; Bai, Ping; Weller, Sandra K.
2005-01-01
Herpes simplex virus type 1 (HSV-1) encodes a heterotrimeric helicase-primase (UL5/8/52) complex. UL5 contains seven motifs found in helicase superfamily 1, and UL52 contains conserved motifs found in primases. The contributions of each subunit to the biochemical activities of the complex, however, remain unclear. We have previously demonstrated that a mutation in the putative zinc finger at UL52 C terminus abrogates not only primase but also ATPase, helicase, and DNA-binding activities of a UL5/UL52 subcomplex, indicating a complex interdependence between the two subunits. To test this hypothesis and to further investigate the role of the zinc finger in the enzymatic activities of the helicase-primase, a series of mutations were constructed in this motif. They differed in their ability to complement a UL52 null virus: totally defective, partial complementation, and potentiating. In this study, four of these mutants were studied biochemically after expression and purification from insect cells infected with recombinant baculoviruses. All mutants show greatly reduced primase activity. Complementation-defective mutants exhibited severe defects in ATPase, helicase, and DNA-binding activities. Partially complementing mutants displayed intermediate levels of these activities, except that one showed a wild-type level of helicase activity. These data suggest that the UL52 zinc finger motif plays an important role in the activities of the helicase-primase complex. The observation that mutations in UL52 affected helicase, ATPase, and DNA-binding activities indicates that UL52 binding to DNA via the zinc finger may be necessary for loading UL5. Alternatively, UL5 and UL52 may share a DNA-binding interface. PMID:15994803
Chen, Yan; Carrington-Lawrence, Stacy D; Bai, Ping; Weller, Sandra K
2005-07-01
Herpes simplex virus type 1 (HSV-1) encodes a heterotrimeric helicase-primase (UL5/8/52) complex. UL5 contains seven motifs found in helicase superfamily 1, and UL52 contains conserved motifs found in primases. The contributions of each subunit to the biochemical activities of the complex, however, remain unclear. We have previously demonstrated that a mutation in the putative zinc finger at UL52 C terminus abrogates not only primase but also ATPase, helicase, and DNA-binding activities of a UL5/UL52 subcomplex, indicating a complex interdependence between the two subunits. To test this hypothesis and to further investigate the role of the zinc finger in the enzymatic activities of the helicase-primase, a series of mutations were constructed in this motif. They differed in their ability to complement a UL52 null virus: totally defective, partial complementation, and potentiating. In this study, four of these mutants were studied biochemically after expression and purification from insect cells infected with recombinant baculoviruses. All mutants show greatly reduced primase activity. Complementation-defective mutants exhibited severe defects in ATPase, helicase, and DNA-binding activities. Partially complementing mutants displayed intermediate levels of these activities, except that one showed a wild-type level of helicase activity. These data suggest that the UL52 zinc finger motif plays an important role in the activities of the helicase-primase complex. The observation that mutations in UL52 affected helicase, ATPase, and DNA-binding activities indicates that UL52 binding to DNA via the zinc finger may be necessary for loading UL5. Alternatively, UL5 and UL52 may share a DNA-binding interface.
Verma, Anjali; Rajagopalan, Pavithra; Lotke, Rishikesh; Varghese, Rebu; Selvam, Deepak; Kundu, Tapas K.
2016-01-01
Of the various genetic subtypes of human immunodeficiency virus types 1 and 2 (HIV-1 and HIV-2) and simian immunodeficiency virus (SIV), only in subtype C of HIV-1 is a genetically variant NF-κB binding site found at the core of the viral promoter in association with a subtype-specific Sp1III motif. How the subtype-associated variations in the core transcription factor binding sites (TFBS) influence gene expression from the viral promoter has not been examined previously. Using panels of infectious viral molecular clones, we demonstrate that subtype-specific NF-κB and Sp1III motifs have evolved for optimal gene expression, and neither of the motifs can be replaced by a corresponding TFBS variant. The variant NF-κB motif binds NF-κB with an affinity 2-fold higher than that of the generic NF-κB site. Importantly, in the context of an infectious virus, the subtype-specific Sp1III motif demonstrates a profound loss of function in association with the generic NF-κB motif. An additional substitution of the Sp1III motif fully restores viral replication, suggesting that the subtype C-specific Sp1III has evolved to function with the variant, but not generic, NF-κB motif. A change of only two base pairs in the central NF-κB motif completely suppresses viral transcription from the provirus and converts the promoter into heterochromatin refractory to tumor necrosis factor alpha (TNF-α) induction. The present work represents the first demonstration of functional incompatibility between an otherwise functional NF-κB motif and a unique Sp1 site in the context of an HIV-1 promoter. Our work provides important leads as to the evolution of the HIV-1 subtype C viral promoter with relevance for gene expression regulation and viral latency. IMPORTANCE Subtype-specific genetic variations provide a powerful tool to examine how these variations offer a replication advantage to specific viral subtypes, if any.
Only in subtype C of HIV-1 are two genetically distinct transcription factor binding sites positioned at the most critical location of the viral promoter. Since a single promoter regulates viral gene expression, the promoter variations can play a critical role in determining the replication fitness of the viral strains. Our work for the first time provides a scientific explanation for the presence of a unique NF-κB binding motif in subtype C, a major HIV-1 genetic family responsible for half of the global HIV-1 infections. The results offer compelling evidence that the subtype C viral promoter not only is stronger but also is endowed with a qualitative gain-of-function advantage. The genetically variant NF-κB and the Sp1III motifs may respond differently to specific cell signal pathways, and these mechanisms must be examined. PMID:27194770
Jaeger, Sébastien; Thieffry, Denis
2017-01-01
Transcription factor (TF) databases contain multitudes of binding motifs (TFBMs) from various sources, from which non-redundant collections are derived by manual curation. The advent of high-throughput methods stimulated the production of novel collections with increasing numbers of motifs. Meta-databases, built by merging these collections, contain redundant versions, because available tools are not suited to automatically identify and explore biologically relevant clusters among thousands of motifs. Motif discovery from genome-scale data sets (e.g. ChIP-seq) also produces redundant motifs, hampering the interpretation of results. We present matrix-clustering, a versatile tool that clusters similar TFBMs into multiple trees, and automatically creates non-redundant TFBM collections. A feature unique to matrix-clustering is its dynamic visualisation of aligned TFBMs, and its capability to simultaneously treat multiple collections from various sources. We demonstrate that matrix-clustering considerably simplifies the interpretation of combined results from multiple motif discovery tools, and highlights biologically relevant variations of similar motifs. We also ran a large-scale application to cluster ∼11 000 motifs from 24 entire databases, showing that matrix-clustering correctly groups motifs belonging to the same TF families, and drastically reduces motif redundancy. matrix-clustering is integrated within the RSAT suite (http://rsat.eu/), accessible through a user-friendly web interface or command-line for its integration in pipelines. PMID:28591841
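The general idea of grouping redundant motifs can be illustrated with a deliberately simplified sketch. It is not the matrix-clustering algorithm, which aligns motifs and builds hierarchical trees; here motifs are assumed to be pre-aligned and of equal width, similarity is a plain Pearson correlation, and grouping is a threshold-based single linkage. All names, the threshold, and the toy matrices are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def motif_similarity(m1, m2):
    """Correlation of two matrices (rows = positions, columns = A, C, G, T)."""
    if len(m1) != len(m2):
        raise ValueError("this sketch only compares motifs of equal width")
    flat1 = [v for row in m1 for v in row]
    flat2 = [v for row in m2 for v in row]
    return pearson(flat1, flat2)


def cluster_motifs(motifs, threshold=0.8):
    """Single-linkage grouping: a motif joins every cluster it resembles."""
    clusters = []
    for name, matrix in motifs.items():
        hits = [c for c in clusters
                if any(motif_similarity(matrix, motifs[o]) >= threshold for o in c)]
        merged = [name]
        for c in hits:
            merged.extend(c)
            clusters.remove(c)
        clusters.append(merged)
    return clusters


# Toy frequency matrices invented for illustration only.
toy_motifs = {
    "TAAT_a": [[0, 0, 0, 1], [1, 0, 0, 0], [1, 0, 0, 0], [0, 0, 0, 1]],
    "TAAT_b": [[0.1, 0, 0, 0.9], [0.9, 0.1, 0, 0], [1, 0, 0, 0], [0, 0, 0.1, 0.9]],
    "GGCC": [[0, 0, 1, 0], [0, 0, 1, 0], [0, 1, 0, 0], [0, 1, 0, 0]],
}
print(cluster_motifs(toy_motifs))
```

In this toy case the two TAAT-like matrices fall into one group and the unrelated matrix stays alone, which is the kind of redundancy reduction the abstract describes at much larger scale.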
Zheng, Heping; Shabalin, Ivan G.; Handing, Katarzyna B.; Bujnicki, Janusz M.; Minor, Wladek
2015-01-01
The ubiquitous presence of magnesium ions in RNA has long been recognized as a key factor governing RNA folding, and is crucial for many diverse functions of RNA molecules. In this work, Mg2+-binding architectures in RNA were systematically studied using a database of RNA crystal structures from the Protein Data Bank (PDB). Due to the abundance of poorly modeled or incorrectly identified Mg2+ ions, the set of all sites was comprehensively validated and filtered to identify a benchmark dataset of 15 334 ‘reliable’ RNA-bound Mg2+ sites. The normalized frequencies by which specific RNA atoms coordinate Mg2+ were derived for both the inner and outer coordination spheres. A hierarchical classification system of Mg2+ sites in RNA structures was designed and applied to the benchmark dataset, yielding a set of 41 types of inner-sphere and 95 types of outer-sphere coordinating patterns. This classification system has also been applied to describe six previously reported Mg2+-binding motifs and detect them in new RNA structures. Investigation of the most populous site types resulted in the identification of seven novel Mg2+-binding motifs, and all RNA structures in the PDB were screened for the presence of these motifs. PMID:25800744
Schuchardt, Brett J.; Bhat, Vikas; Mikles, David C.; McDonald, Caleb B.; Sudol, Marius; Farooq, Amjad
2014-01-01
The newly discovered transactivation function of ErbB4 receptor tyrosine kinase is believed to be mediated by virtue of the ability of its proteolytically-cleaved intracellular domain (ICD) to physically associate with YAP2 transcriptional regulator. In an effort to unearth the molecular basis of YAP2-ErbB4 interaction, we have conducted a detailed biophysical analysis of the binding of WW domains of YAP2 to PPXY motifs located within the ICD of ErbB4. Our data show that the WW1 domain of YAP2 binds to PPXY motifs within the ICD in a differential manner and that this behavior is by and large replicated by the WW2 domain. Remarkably, while both WW domains absolutely require the integrity of the PPXY consensus sequence, non-consensus residues within and flanking this motif do not appear to be critical for binding. In spite of this shared mode of binding, the WW domains of YAP2 display distinct conformational dynamics in complex with PPXY motifs derived from ErbB4. Collectively, our study lends new insights into the molecular basis of a key protein-protein interaction involved in a diverse array of cellular processes. PMID:24472438
Schuchardt, Brett J; Bhat, Vikas; Mikles, David C; McDonald, Caleb B; Sudol, Marius; Farooq, Amjad
2014-06-01
The newly discovered transactivation function of ErbB4 receptor tyrosine kinase is believed to be mediated by virtue of the ability of its proteolytically-cleaved intracellular domain (ICD) to physically associate with YAP2 transcriptional regulator. In an effort to unearth the molecular basis of YAP2-ErbB4 interaction, we have conducted a detailed biophysical analysis of the binding of WW domains of YAP2 to PPXY motifs located within the ICD of ErbB4. Our data show that the WW1 domain of YAP2 binds to PPXY motifs within the ICD in a differential manner and that this behavior is by and large replicated by the WW2 domain. Remarkably, while both WW domains absolutely require the integrity of the PPXY consensus sequence, non-consensus residues within and flanking this motif do not appear to be critical for binding. In spite of this shared mode of binding, the WW domains of YAP2 display distinct conformational dynamics in complex with PPXY motifs derived from ErbB4. Collectively, our study lends new insights into the molecular basis of a key protein-protein interaction involved in a diverse array of cellular processes. Copyright © 2014 Elsevier Masson SAS. All rights reserved.
Characterization of the mouse junD promoter--high basal level activity due to an octamer motif.
de Groot, R P; Karperien, M; Pals, C; Kruijer, W
1991-01-01
The product of the junD gene belongs to the Jun/Fos family of nuclear DNA binding transcription factors. This family regulates the expression of TPA responsive genes by binding to the TPA responsive element (TRE). Unlike its counterparts c-jun and junB, junD expression is hardly inducible by growth factors and phorbol esters. In fact, junD is constitutively expressed at high levels in a wide variety of cells. To unravel the molecular mechanisms underlying constitutive junD expression, we have cloned and characterized the mouse junD promoter. We show that the high constitutive expression is caused by multiple cis-acting elements in its promoter, including an SP1 binding site, an octamer motif, a CAAT box, a Zif268 binding site and a TRE-like sequence. The octamer motif is the major determinant of junD promoter activity, while somewhat smaller contributions are made by the TRE and Zif268 binding site. The SP1 and CAAT box are shown to be of minor importance. The junD TRE is in its behavior indistinguishable from previously identified TREs. However, the junD promoter is not TPA inducible due to the presence of the octamer motif. PMID:1714380
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gavel, O.Y.; Bursakov, S.A.; Rocco, G.Di
2009-05-18
Adenylate kinase (AK) mediates the reversible transfer of phosphate groups between the adenylate nucleotides and contributes to the maintenance of their constant cellular level, necessary for energy metabolism and nucleic acid synthesis. The AK were purified from crude extracts of two sulfate-reducing bacteria (SRB), Desulfovibrio (D.) gigas NCIB 9332 and Desulfovibrio desulfuricans ATCC 27774, and biochemically and spectroscopically characterized in the native and fully cobalt- or zinc-substituted forms. These are the first reported adenylate kinases that bind either zinc or cobalt and are related to the subgroup of metal-containing AK found, in most cases, in Gram-positive bacteria. The electronic absorption spectrum is consistent with tetrahedrally coordinated cobalt, predominantly via sulfur ligands, and is supported by EPR. The involvement of three cysteines in cobalt or zinc coordination was confirmed by chemical methods. Extended X-ray absorption fine structure (EXAFS) indicates that cobalt or zinc is bound by three cysteine residues and one histidine in the metal-binding site of the 'LID' domain. The sequence ¹²⁹Cys-X₅-His-X₁₅-Cys-X₂-Cys of the AK from D. gigas is involved in metal coordination and represents a new type of binding motif that differs from other known zinc-binding sites of AK. Cobalt and zinc play a structural role in stabilizing the LID domain.
Schuschke, Christian; Schwarz, Matthias; Hohner, Chantal; Silva, Thais N; Fromm, Lukas; Döpper, Tibor; Görling, Andreas; Libuda, Jörg
2018-04-19
We have studied the anchoring mechanism of a phosphonic acid on an atomically defined oxide surface. Using time-resolved infrared reflection absorption spectroscopy, we investigated the reaction of deuterated phenylphosphonic acid (DPPA, C₆H₅PO₃D₂) with an atomically defined Co₃O₄(111) surface in situ during film growth by physical vapor deposition. We show that the binding motif of the phosphonate anchor group changes as a function of coverage. At low coverage, DPPA binds in the form of a chelating tridentate phosphonate, while a transition to a chelating bidentate occurs close to monolayer saturation coverage. However, the coverage-dependent change in the binding motif is not associated with a major change of the molecular orientation, suggesting that the rigid phosphonate linker always maintains the DPPA in a strongly tilted orientation irrespective of the surface coverage.
Helix–hairpin–helix motifs confer salt resistance and processivity on chimeric DNA polymerases
Pavlov, Andrey R.; Belova, Galina I.; Kozyavkin, Sergei A.; Slesarev, Alexei I.
2002-01-01
Helix–hairpin–helix (HhH) is a widespread motif involved in sequence-nonspecific DNA binding. The majority of HhH motifs function as DNA-binding modules with typical occurrence of one HhH motif or one or two (HhH)2 domains in proteins. We recently identified 24 HhH motifs in DNA topoisomerase V (Topo V). Although these motifs are dispensable for the topoisomerase activity of Topo V, their removal narrows the salt concentration range for topoisomerase activity tenfold. Here, we demonstrate the utility of Topo V's HhH motifs for modulating DNA-binding properties of the Stoffel fragment of Taq DNA polymerase and Pfu DNA polymerase. Different HhH cassettes fused with either NH2 terminus or COOH terminus of DNA polymerases broaden the salt concentration range of the polymerase activity significantly (up to 0.5 M NaCl or 1.8 M potassium glutamate). We found that anions play a major role in the inhibition of DNA polymerase activity. The resistance of initial extension rates and the processivity of chimeric polymerases to salts depend on the structure of added HhH motifs. Regardless of the type of the construct, the thermal stability of chimeric Taq polymerases increases under the optimal ionic conditions, as compared with that of Taq DNA polymerase or its Stoffel fragment. Our approach to raise the salt tolerance, processivity, and thermostability of Taq and Pfu DNA polymerases may be applied to all pol1- and polB-type polymerases, as well as to other DNA processing enzymes. PMID:12368475
Nucleotide binding properties of bovine brain uncoating ATPase.
Gao, B; Emoto, Y; Greene, L; Eisenberg, E
1993-04-25
Many functions of the 70-kDa heat-shock proteins (hsp70s) appear to be regulated by bound nucleotide. In this study we examined the nucleotide binding properties of purified bovine brain uncoating ATPase, one of the constitutively expressed members of the hsp70 family. We found that uncoating ATPase purified by ATP-agarose column chromatography retained one ADP molecule bound per enzyme molecule which could not be removed by extensive dialysis. Since this bound ADP exchanged rapidly with free ADP or ATP, the inability to remove the bound nucleotide was not due to slow dissociation but rather to strong binding of the nucleotide to the uncoating ATPase. In confirmation of this view, equilibrium dialysis experiments suggested that the dissociation constants for both ADP and ATP were less than 0.1 μM. Schmid et al. (Schmid, S. L., Braell, W. A., and Rothman, J. E. (1985) J. Biol. Chem. 260, 10057-10062) suggested that the uncoating ATPase had two sites for bound nucleotide, one specific for ATP and one binding both ATP and ATP analogues but not ADP. In contrast, we found that enzyme with bound ADP did not bind further adenosine 5'-(beta,gamma-imino)triphosphate or dATP, nor did more than one ATP molecule bind per enzyme even in 200 μM free ATP. These results strongly suggest that the enzyme has only one binding site for nucleotide. During steady-state ATP hydrolysis, 85% of the bound nucleotide at this site was determined to be ATP and 15% ADP; this is consistent with the rate of ADP release determined in the exchange experiments noted above, where ADP release was found to be six times faster than the overall rate of ATP hydrolysis.
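The consistency noted in the final sentence can be checked with a short steady-state argument. This is a sketch under assumptions not spelled out in the abstract: a single nucleotide site, every turnover ending with release of one ADP, and the symbols $k_r$ (rate constant for ADP release), $v$ (overall hydrolysis rate per enzyme) and $f_{\mathrm{ADP}}$, $f_{\mathrm{ATP}}$ (fractions of sites holding each nucleotide) introduced here for illustration.

```latex
\begin{align}
  v &= k_r \, f_{\mathrm{ADP}}
      && \text{(at steady state, ADP leaves as fast as ATP is hydrolysed)} \\
  f_{\mathrm{ADP}} &= \frac{v}{k_r} = \frac{v}{6v} = \frac{1}{6} \approx 0.17
      && \text{(using } k_r = 6v \text{)} \\
  f_{\mathrm{ATP}} &= 1 - f_{\mathrm{ADP}} = \frac{5}{6} \approx 0.83
\end{align}
```

The predicted split of roughly 83% ATP and 17% ADP is close to the measured 85% and 15%, which is the agreement the abstract refers to.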
Kling, Ralf C.; Tschammer, Nuska; Lanig, Harald; Clark, Timothy; Gmeiner, Peter
2014-01-01
Partial agonists exhibit a submaximal capacity to enhance the coupling of one receptor to an intracellular binding partner. Although a multitude of studies have reported different ligand-specific conformations for a given receptor, little is known about the mechanism by which different receptor conformations are connected to the capacity to activate the coupling to G-proteins. We have now performed molecular-dynamics simulations employing our recently described active-state homology model of the dopamine D2 receptor-Gαi protein-complex coupled to the partial agonists aripiprazole and FAUC350, in order to understand the structural determinants of partial agonism better. We have compared our findings with our model of the D2R-Gαi-complex in the presence of the full agonist dopamine. The two partial agonists are capable of inducing different conformations of important structural motifs, including the extracellular loop regions, the binding pocket and, in particular, intracellular G-protein-binding domains. As G-protein-coupling to certain intracellular epitopes of the receptor is considered the key step of allosterically triggered nucleotide-exchange, it is tempting to assume that impaired coupling between the receptor and the G-protein caused by distinct ligand-specific conformations is a major determinant of partial agonist efficacy. PMID:24932547
Blind prediction of noncanonical RNA structure at atomic accuracy.
Watkins, Andrew M; Geniesse, Caleb; Kladwang, Wipapat; Zakrevsky, Paul; Jaeger, Luc; Das, Rhiju
2018-05-01
Prediction of RNA structure from nucleotide sequence remains an unsolved grand challenge of biochemistry and requires distinct concepts from protein structure prediction. Despite extensive algorithmic development in recent years, modeling of noncanonical base pairs of new RNA structural motifs has not been achieved in blind challenges. We report a stepwise Monte Carlo (SWM) method with a unique add-and-delete move set that enables predictions of noncanonical base pairs of complex RNA structures. A benchmark of 82 diverse motifs establishes the method's general ability to recover noncanonical pairs ab initio, including multistrand motifs that have been refractory to prior approaches. In a blind challenge, SWM models predicted nucleotide-resolution chemical mapping and compensatory mutagenesis experiments for three in vitro selected tetraloop/receptors with previously unsolved structures (C7.2, C7.10, and R1). As a final test, SWM blindly and correctly predicted all noncanonical pairs of a Zika virus double pseudoknot during a recent community-wide RNA-Puzzle. Stepwise structure formation, as encoded in the SWM method, enables modeling of noncanonical RNA structure in a variety of previously intractable problems.
Arginine methylation promotes translation repression activity of eIF4G-binding protein, Scd6.
Poornima, Gopalakrishna; Shah, Shanaya; Vignesh, Venkadasubramanian; Parker, Roy; Rajyaguru, Purusharth I
2016-11-02
Regulation of translation plays a critical role in determining mRNA fate. A new role was recently reported for a subset of RGG-motif proteins in repressing translation initiation by binding eIF4G1. However, the signaling mechanism(s) that leads to spatial and temporal regulation of repression activity of RGG-motif proteins remains unknown. Here we report the role of arginine methylation in regulation of repression activity of Scd6, a conserved RGG-motif protein. We demonstrate that Scd6 gets arginine methylated at its RGG-motif and Hmt1 plays an important role in its methylation. We identify specific methylated arginine residues in the Scd6 RGG-motif in vivo. We provide evidence that methylation augments Scd6 repression activity. Arginine methylation defective (AMD) mutant of Scd6 rescues the growth defect caused by overexpression of Scd6, a feature of translation repressors in general. Live-cell imaging of the AMD mutant revealed that it is defective in inducing formation of stress granules. Live-cell imaging and pull-down results indicate that it fails to bind eIF4G1 efficiently. Consistent with these results, a strain lacking Hmt1 is also defective in Scd6-eIF4G1 interaction. Our results establish that arginine methylation augments Scd6 repression activity by promoting eIF4G1-binding. We propose that arginine methylation of translation repressors with RGG-motif could be a general modulator of their repression activity. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
G-quadruplex RNA binding and recognition by the lysine-specific histone demethylase-1 enzyme.
Hirschi, Alexander; Martin, William J; Luka, Zigmund; Loukachevitch, Lioudmila V; Reiter, Nicholas J
2016-08-01
Lysine-specific histone demethylase 1 (LSD1) is an essential epigenetic regulator in metazoans and requires the co-repressor element-1 silencing transcription factor (CoREST) to efficiently catalyze the removal of mono- and dimethyl functional groups from histone 3 at lysine positions 4 and 9 (H3K4/9). LSD1 interacts with over 60 regulatory proteins and also associates with lncRNAs (TERRA, HOTAIR), suggesting a regulatory role for RNA in LSD1 function. We report that a stacked, intramolecular G-quadruplex (GQ) forming TERRA RNA (GG[UUAGGG]8UUA) binds tightly to the functional LSD1-CoREST complex (Kd ≈ 96 nM), in contrast to a single GQ RNA unit ([UUAGGG]4U), a GQ DNA ([TTAGGG]4T), or an unstructured single-stranded RNA. Stabilization of a parallel-stranded GQ RNA structure by monovalent potassium ions (K+) is required for high affinity binding to the LSD1-CoREST complex. These data indicate that LSD1 can distinguish between RNA and DNA as well as structured versus unstructured nucleotide motifs. Further, cross-linking mass spectrometry identified the primary location of GQ RNA binding within the SWIRM/amine oxidase domain (AOD) of LSD1. An ssRNA binding region adjacent to this GQ binding site was also identified via X-ray crystallography. This RNA binding interface is consistent with kinetic assays, demonstrating that a GQ-forming RNA can serve as a noncompetitive inhibitor of LSD1-catalyzed demethylation. The identification of a GQ RNA binding site coupled with kinetic data suggests that structured RNAs can function as regulatory molecules in LSD1-mediated mechanisms. © 2016 Hirschi et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
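The bracketed repeat notation used for the oligonucleotides above can be expanded mechanically. The sketch below is only an illustration of the notation, with a hypothetical helper name; counting GGG tracts is a rough proxy, on the common assumption that one intramolecular G-quadruplex uses four G-tracts.

```python
import re

# Matches blocks such as "[UUAGGG]8": a bracketed unit followed by a repeat count.
REPEAT = re.compile(r"\[([A-Z]+)\](\d+)")


def expand_repeats(notation):
    """Expand '[UNIT]n' blocks, e.g. 'GG[UUAGGG]8UUA', into a plain sequence."""
    return REPEAT.sub(lambda m: m.group(1) * int(m.group(2)), notation)


for label, notation in [
    ("stacked GQ TERRA RNA", "GG[UUAGGG]8UUA"),
    ("single GQ RNA unit", "[UUAGGG]4U"),
    ("GQ DNA", "[TTAGGG]4T"),
]:
    seq = expand_repeats(notation)
    print(f"{label}: {len(seq)} nt, {seq.count('GGG')} G-tracts")
```

Under that assumption, eight G-tracts in the longer RNA are enough for two quadruplex units, in line with its description as "stacked", while the two shorter constructs carry four tracts each.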
G-quadruplex RNA binding and recognition by the lysine-specific histone demethylase-1 enzyme
Hirschi, Alexander; Martin, William J.; Luka, Zigmund; Loukachevitch, Lioudmila V.; Reiter, Nicholas J.
2016-01-01
Lysine-specific histone demethylase 1 (LSD1) is an essential epigenetic regulator in metazoans and requires the co-repressor element-1 silencing transcription factor (CoREST) to efficiently catalyze the removal of mono- and dimethyl functional groups from histone 3 at lysine positions 4 and 9 (H3K4/9). LSD1 interacts with over 60 regulatory proteins and also associates with lncRNAs (TERRA, HOTAIR), suggesting a regulatory role for RNA in LSD1 function. We report that a stacked, intramolecular G-quadruplex (GQ) forming TERRA RNA (GG[UUAGGG]8UUA) binds tightly to the functional LSD1–CoREST complex (Kd ≈ 96 nM), in contrast to a single GQ RNA unit ([UUAGGG]4U), a GQ DNA ([TTAGGG]4T), or an unstructured single-stranded RNA. Stabilization of a parallel-stranded GQ RNA structure by monovalent potassium ions (K+) is required for high affinity binding to the LSD1–CoREST complex. These data indicate that LSD1 can distinguish between RNA and DNA as well as structured versus unstructured nucleotide motifs. Further, cross-linking mass spectrometry identified the primary location of GQ RNA binding within the SWIRM/amine oxidase domain (AOD) of LSD1. An ssRNA binding region adjacent to this GQ binding site was also identified via X-ray crystallography. This RNA binding interface is consistent with kinetic assays, demonstrating that a GQ-forming RNA can serve as a noncompetitive inhibitor of LSD1-catalyzed demethylation. The identification of a GQ RNA binding site coupled with kinetic data suggests that structured RNAs can function as regulatory molecules in LSD1-mediated mechanisms. PMID:27277658
Building a stable RNA U-turn with a protonated cytidine.
Gottstein-Schmidtke, Sina R; Duchardt-Ferner, Elke; Groher, Florian; Weigand, Julia E; Gottstein, Daniel; Suess, Beatrix; Wöhnert, Jens
2014-08-01
The U-turn is a classical three-dimensional RNA folding motif first identified in the anticodon and T-loops of tRNAs. It also occurs frequently as a building block in other functional RNA structures in many different sequence and structural contexts. U-turns induce sharp changes in the direction of the RNA backbone and often conform to the 3-nt consensus sequence 5'-UNR-3' (N = any nucleotide, R = purine). The canonical U-turn motif is stabilized by a hydrogen bond between the N3 imino group of the U residue and the 3' phosphate group of the R residue as well as a hydrogen bond between the 2'-hydroxyl group of the uridine and the N7 nitrogen of the R residue. Here, we demonstrate that a protonated cytidine can functionally and structurally replace the uridine at the first position of the canonical U-turn motif in the apical loop of the neomycin riboswitch. Using NMR spectroscopy, we directly show that the N3 imino group of the protonated cytidine forms a hydrogen bond with the backbone phosphate 3' from the third nucleotide of the U-turn analogously to the imino group of the uridine in the canonical motif. In addition, we compare the stability of the hydrogen bonds in the mutant U-turn motif to the wild type and describe the NMR signature of the C+-phosphate interaction. Our results have implications for the prediction of RNA structural motifs and suggest simple approaches for the experimental identification of hydrogen bonds between protonated C-imino groups and the phosphate backbone. © 2014 Gottstein-Schmidtke et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Singh, Vijay Shankar; Dubey, Ashutosh Prakash; Gupta, Ankush; Singh, Sudhir; Singh, Bhupendra Narain
2017-01-01
ABSTRACT Azospirillum brasilense Sp7 uses glycerol as a carbon source for growth and nitrogen fixation. When grown in medium containing glycerol as a source of carbon, it upregulates the expression of a protein which was identified as quinoprotein alcohol dehydrogenase (ExaA). Inactivation of exaA adversely affects the growth of A. brasilense on glycerol. A determination of the transcription start site of exaA revealed an RpoN-dependent −12/−24 promoter consensus. The expression of an exaA::lacZ fusion was induced maximally by glycerol and was dependent on σ54. Bioinformatic analysis of the sequence flanking the −12/−24 promoter revealed a 17-bp sequence motif with a dyad symmetry of 6 nucleotides upstream of the promoter, the disruption of which caused a drastic reduction in promoter activity. The electrophoretic mobility of a DNA fragment containing the 17-bp sequence motif was retarded by purified EraR, a LuxR-type transcription regulator that is transcribed divergently from exaA. EraR also showed a positive interaction with RpoN in two-hybrid and pulldown assays. IMPORTANCE Quinoprotein alcohol dehydrogenase (ExaA) plays an important role in the catabolism of alcohols in bacteria. Although exaA expression is thought to be regulated by a two-component system consisting of EraS and EraR, the mechanism of regulation was not known. This study shows the details of the regulation of expression of the exaA gene in A. brasilense. We have shown here that exaA of A. brasilense is maximally induced by glycerol and harbors a σ54-dependent promoter. The response regulator EraR binds to an inverted repeat located upstream of the exaA promoter. This study shows that a LuxR-type response regulator (EraR) binds upstream of the exaA gene and physically interacts with σ54. 
The unique feature of this regulation is that EraR is a LuxR-type transcription regulator that lacks the GAFTGA motif, a characteristic feature of the enhancer binding proteins that are known to interact with σ54 in other bacteria. PMID:28439037
Singh, Vijay Shankar; Dubey, Ashutosh Prakash; Gupta, Ankush; Singh, Sudhir; Singh, Bhupendra Narain; Tripathi, Anil Kumar
2017-07-01
Azospirillum brasilense Sp7 uses glycerol as a carbon source for growth and nitrogen fixation. When grown in medium containing glycerol as a source of carbon, it upregulates the expression of a protein which was identified as quinoprotein alcohol dehydrogenase (ExaA). Inactivation of exaA adversely affects the growth of A. brasilense on glycerol. A determination of the transcription start site of exaA revealed an RpoN-dependent -12/-24 promoter consensus. The expression of an exaA::lacZ fusion was induced maximally by glycerol and was dependent on σ54. Bioinformatic analysis of the sequence flanking the -12/-24 promoter revealed a 17-bp sequence motif with a dyad symmetry of 6 nucleotides upstream of the promoter, the disruption of which caused a drastic reduction in promoter activity. The electrophoretic mobility of a DNA fragment containing the 17-bp sequence motif was retarded by purified EraR, a LuxR-type transcription regulator that is transcribed divergently from exaA. EraR also showed a positive interaction with RpoN in two-hybrid and pulldown assays. IMPORTANCE Quinoprotein alcohol dehydrogenase (ExaA) plays an important role in the catabolism of alcohols in bacteria. Although exaA expression is thought to be regulated by a two-component system consisting of EraS and EraR, the mechanism of regulation was not known. This study shows the details of the regulation of expression of the exaA gene in A. brasilense. We have shown here that exaA of A. brasilense is maximally induced by glycerol and harbors a σ54-dependent promoter. The response regulator EraR binds to an inverted repeat located upstream of the exaA promoter.
This study shows that a LuxR-type response regulator (EraR) binds upstream of the exaA gene and physically interacts with σ54. The unique feature of this regulation is that EraR is a LuxR-type transcription regulator that lacks the GAFTGA motif, a characteristic feature of the enhancer binding proteins that are known to interact with σ54 in other bacteria. Copyright © 2017 American Society for Microbiology.
Larsen, Svend Arild; Mogensen, Line; Dietz, Rune; Baagøe, Hans Jørgen; Andersen, Mogens; Werge, Thomas; Rasmussen, Henrik Berg
2005-12-01
In this study we have identified and characterized dopamine receptor D4 (DRD4) exon III tandem repeats in 33 publicly available nucleotide sequences from different mammalian species. We found that the tandem repeat in canids could be described in a novel and simple way, namely, as a structure composed of 15- and 12-bp modules. Tandem repeats composed of 18-bp modules were found in sequences from the horse, zebra, onager, and donkey, Asiatic bear, polar bear, common raccoon, dolphin, harbor porpoise, and domestic cat. Several of these sequences have been analyzed previously without a tandem repeat being found. In the domestic cow and gray seal we identified tandem repeats composed of 36-bp modules, each consisting of two closely related 18-bp basic units. A tandem repeat consisting of 9-bp modules was identified in sequences from mink and ferret. In the European otter we detected an 18-bp tandem repeat, while a tandem repeat consisting of 27-bp modules was identified in a sequence from European badger. Both these tandem repeats were composed of 9-bp basic units, which were closely related to the 9-bp repeat modules identified in the mink and ferret. Tandem repeats could not be identified in sequences from rodents. All tandem repeats possessed a high GC content with a strong bias for C. On phylogenetic analysis of the tandem repeats, evolutionarily related species were clustered into the same groups. The degree of conservation of the tandem repeats varied significantly between species. The deduced amino acid sequences of most of the tandem repeats exhibited a high propensity for disorder. This was also the case with an amino acid sequence of the human DRD4 exon III tandem repeat, which was included in the study for comparative purposes. We identified proline-containing motifs for SH3 and WW domain binding proteins, potential phosphorylation sites, PDZ domain binding motifs, and FHA domain binding motifs in the amino acid sequences of the tandem repeats.
The numbers of potential functional sites varied pronouncedly between species. Our observations provide a platform for future studies of the architecture and evolution of the DRD4 exon III tandem repeat, and they suggest that differences in the structure of this tandem repeat contribute to specialization and generation of diversity in receptor function.
Searching for statistically significant regulatory modules.
Bailey, Timothy L; Noble, William Stafford
2003-10-01
The regulatory machinery controlling gene expression is complex, frequently requiring multiple, simultaneous DNA-protein interactions. The rate at which a gene is transcribed may depend upon the presence or absence of a collection of transcription factors bound to the DNA near the gene. Locating transcription factor binding sites in genomic DNA is difficult because the individual sites are small and tend to occur frequently by chance. True binding sites may be identified by their tendency to occur in clusters, sometimes known as regulatory modules. We describe an algorithm for detecting occurrences of regulatory modules in genomic DNA. The algorithm, called mcast, takes as input a DNA database and a collection of binding site motifs that are known to operate in concert. mcast uses a motif-based hidden Markov model with several novel features. The model incorporates motif-specific p-values, thereby allowing scores from motifs of different widths and specificities to be compared directly. The p-value scoring also allows mcast to only accept motif occurrences with significance below a user-specified threshold, while still assigning better scores to motif occurrences with lower p-values. mcast can search long DNA sequences, modeling length distributions between motifs within a regulatory module, but ignoring length distributions between modules. The algorithm produces a list of predicted regulatory modules, ranked by E-value. We validate the algorithm using simulated data as well as real data sets from fruitfly and human. http://meme.sdsc.edu/MCAST/paper
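The idea of accepting only motif occurrences below a p-value threshold and then looking for clusters can be illustrated with a much simpler heuristic than the hidden Markov model the abstract describes. The sketch below is an assumption-laden toy, not mcast: the `Hit` record, the `max_gap` parameter, the default thresholds and the summed -log10(p) score are all hypothetical choices made for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class Hit:
    """One motif occurrence (hypothetical record, not mcast's data model)."""
    start: int
    end: int
    pvalue: float


def cluster_hits(hits, p_threshold=1e-3, max_gap=50):
    """Keep hits with p-value below the threshold, then group neighbours.

    Two consecutive kept hits join the same candidate module when the
    gap between them is at most max_gap bases.
    """
    kept = sorted((h for h in hits if h.pvalue < p_threshold),
                  key=lambda h: h.start)
    modules = []
    for hit in kept:
        if modules and hit.start - modules[-1][-1].end <= max_gap:
            modules[-1].append(hit)
        else:
            modules.append([hit])
    return modules


def module_score(module):
    """Sum of -log10(p): lower p-values contribute higher scores."""
    return sum(-math.log10(h.pvalue) for h in module)
```

Unlike the published algorithm, this greedy grouping has no length model for the spacing between motifs and yields no E-value; it only shows how thresholding and clustering interact.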
Wang, Yaofeng; Kraut, Rachel; Mu, Yuguang
2015-01-01
The Amyloid-β (Aβ)-derived, sphingolipid binding domain (SBD) peptide is a fluorescently tagged probe used to trace the diffusion behavior of sphingolipid-containing microdomains in cell membranes through binding to a constellation of glycosphingolipids, sphingomyelin, and cholesterol. However, the molecular details of the binding mechanism between SBD and plasma membrane domains remain unclear. Here, to investigate how the peptide recognizes the lipid surface at an atomically detailed level, SBD peptides in the environment of raft-like bilayers were examined in microsecond-long molecular dynamics simulations. We found that SBD adopted a coil-helix-coil structural motif, which binds to multiple GT1b gangliosides via salt bridges and CH–π interactions. Our simulation results demonstrate that the CH–π and electrostatic forces between SBD monomers and GT1b ganglioside clusters are the main driving forces in the binding process. The presence of the fluorescent dye and linker molecules does not change the binding mechanism of SBD probes with gangliosides, which involves the helix-turn-helix structural motif that was suggested to constitute a glycolipid binding domain common to some sphingolipid interacting proteins, including HIV gp120, prion, and Aβ. PMID:26540054
Klein-Hessling, Stefan; Schneider, Günter; Heinfling, Annette; Chuvpilo, Sergei; Serfling, Edgar
1996-01-01
HMG I(Y) proteins bind to double-stranded A+T oligonucleotides longer than three base pairs. Such motifs form part of numerous NF-AT-binding sites of lymphokine promoters, including the interleukin 4 (IL-4) promoter. NF-AT factors share short homologous peptide sequences in their DNA-binding domain with NF-κB factors and bind to certain NF-κB sites. It has been shown that HMG I(Y) proteins enhance NF-κB binding to the interferon β promoter and virus-mediated interferon β promoter induction. We show that HMG I(Y) proteins exert an opposite effect on the DNA binding of NF-AT factors and the induction of the IL-4 promoter in T lymphocytes. Introduction of mutations into a high-affinity HMG I(Y)-binding site of the IL-4 promoter, which decreased HMG I(Y)-binding to a NF-AT-binding sequence, the Pu-bB (or P) site, distinctly increased the induction of the IL-4 promoter in Jurkat T leukemia cells. High concentrations of HMG I(Y) proteins are able to displace NF-ATp from its binding to the Pu-bB site. High HMG I(Y) concentrations are typical for Jurkat cells and peripheral blood T lymphocytes, whereas El4 T lymphoma cells and certain T helper type 2 cell clones contain relatively low HMG I(Y) concentrations. Our results indicate that HMG I(Y) proteins do not cooperate, but instead compete with NF-AT factors for the binding to DNA even though NF-AT factors share some DNA-binding properties with NF-kB factors. This competition between HMG I(Y) and NF-AT proteins for DNA binding might be due to common contacts with minor groove nucleotides of DNA and may be one mechanism contributing to the selective IL-4 expression in certain T lymphocyte populations, such as T helper type 2 cells. PMID:8986808
An intracellular motif of GLUT4 regulates fusion of GLUT4-containing vesicles.
Heyward, Catherine A; Pettitt, Trevor R; Leney, Sophie E; Welsh, Gavin I; Tavaré, Jeremy M; Wakelam, Michael J O
2008-05-20
Insulin stimulates glucose uptake by adipocytes through increasing translocation of the glucose transporter GLUT4 from an intracellular compartment to the plasma membrane. Fusion of GLUT4-containing vesicles at the cell surface is thought to involve phospholipase D activity, generating the signalling lipid phosphatidic acid, although the mechanism of action is not yet clear. Here we report the identification of a putative phosphatidic acid-binding motif in a GLUT4 intracellular loop. Mutation of this motif causes a decrease in the insulin-induced exposure of GLUT4 at the cell surface of 3T3-L1 adipocytes via an effect on vesicle fusion. The potential phosphatidic acid-binding motif identified in this study is unique to GLUT4 among the sugar transporters, therefore this motif may provide a unique mechanism for regulating insulin-induced translocation by phospholipase D signalling.
Ca2+-Induced Rigidity Change of the Myosin VIIa IQ Motif-Single α Helix Lever Arm Extension.
Li, Jianchao; Chen, Yiyun; Deng, Yisong; Unarta, Ilona Christy; Lu, Qing; Huang, Xuhui; Zhang, Mingjie
2017-04-04
Several unconventional myosins contain a highly charged single α helix (SAH) immediately following the calmodulin (CaM) binding IQ motifs, functioning to extend lever arms of these myosins. How such SAH is connected to the IQ motifs and whether the conformation of the IQ motifs-SAH segments is regulated by Ca2+ fluctuations are not known. Here, we demonstrate by solving its crystal structure that the predicted SAH of myosin VIIa (Myo7a) forms a stable SAH. The structure of Myo7a IQ5-SAH segment in complex with apo-CaM reveals that the SAH sequence can extend the length of the Myo7a lever arm. Although Ca2+-CaM remains bound to IQ5-SAH, the Ca2+-induced CaM binding mode change softens the conformation of the IQ5-SAH junction, revealing a Ca2+-induced lever arm flexibility change for Myo7a. We further demonstrate that the last IQ motif of several other myosins also binds to both apo- and Ca2+-CaM, suggesting a common Ca2+-induced conformational regulation mechanism. Copyright © 2017 Elsevier Ltd. All rights reserved.
Regulation of the scp Genes in the Cyanobacterium Synechocystis sp. PCC 6803--What is New?
Cheregi, Otilia; Funk, Christiane
2015-08-12
In the cyanobacterium Synechocystis sp. PCC 6803 there are five genes encoding small CAB-like (SCP) proteins, which have been shown to be up-regulated under stress. Analyses of the promoter sequences of the scp genes revealed the existence of an NtcA binding motif in two scp genes, scpB and scpE. Binding of NtcA, the key transcriptional regulator during nitrogen stress, to the promoter regions was shown by electrophoretic mobility shift assay. The metabolite 2-oxoglutarate did not increase the affinity of NtcA for binding to the promoters of scpB and scpE. A second motif, the HIP1 palindrome 5' GGCGATCGCC 3', was detected in the upstream regions of scpB and scpC. The transcription factor encoded by sll1130 has been suggested to recognize this motif to regulate heat-responsive genes. Our data suggest that HIP1 is not a regulatory element within the scp genes. Further, the presence of the high light regulatory (HLR1) motif was confirmed in scpB-E, in accordance to their induced transcriptions in cells exposed to high light. The HLR1 motif was newly discovered in eight additional genes.
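A palindrome in the DNA sense, such as the HIP1 sequence 5' GGCGATCGCC 3', is a sequence that equals its own reverse complement. The short sketch below checks that property and locates occurrences of a motif in an upstream region; the helper names are illustrative, and the example sequence in the usage note is made up rather than taken from the Synechocystis genome.

```python
# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an A/C/G/T string."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def is_palindromic(dna):
    """True when the sequence reads the same on both strands (5' to 3')."""
    return dna.upper() == reverse_complement(dna)


def find_motif(sequence, motif):
    """Return 0-based start positions of every (possibly overlapping) match."""
    sequence, motif = sequence.upper(), motif.upper()
    positions = []
    index = sequence.find(motif)
    while index != -1:
        positions.append(index)
        index = sequence.find(motif, index + 1)
    return positions
```

For instance, `find_motif("AAGGCGATCGCCTT", "GGCGATCGCC")` locates the motif at offset 2 in that invented sequence. Because the motif is palindromic, scanning one strand is sufficient to find sites on both.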
Folio, Christelle; Sierra, Natalia; Dujardin, Marie; Alvarez, Guzman
2017-01-01
Feline immunodeficiency virus (FIV) is a member of the Retroviridae family. It is the causative agent of an acquired immunodeficiency syndrome (AIDS) in cats and wild felines. Its capsid protein (CA) drives the assembly of the viral particle, which is a critical step in the viral replication cycle. Here, the first atomic structure of full-length FIV CA to 1.67 Å resolution is determined. The crystallized protein exhibits an original tetrameric assembly, composed of dimers which are stabilized by an intermolecular disulfide bridge induced by the crystallogenesis conditions. The FIV CA displays a standard α-helical CA topology with two domains, separated by a linker shorter than other retroviral CAs. The β-hairpin motif at its amino terminal end, which interacts with nucleotides in HIV-1, is unusually long in FIV CA. Interestingly, this functional β-motif is formed in this construct in the absence of the conserved N-terminal proline. The FIV CA exhibits a cis Arg–Pro bond in the CypA-binding loop, which is absent in known structures of lentiviral CAs. This structure represents the first tri-dimensional structure of a functional, full-length FIV CA. PMID:29120364
Aloise, P; Kagawa, Y; Coleman, P S
1991-06-05
Three F1 preparations, the beef heart (MF1) and thermophilic bacterium (TF1) holoenzymes, and the alpha 3 beta 3 "core" complex of TF1 reconstituted from individually expressed alpha and beta subunits, were compared as to their kinetic and binding stoichiometric responses to covalent photoaffinity labeling with BzATP and BzADP (+/- Mg2+). Each enzyme displayed an enhanced pseudo-first order rate of photoinhibition and one-third of the sites covalent binding to a catalytic site for full inhibition, plus, but not minus Mg2+. Titration of near stoichiometric [MgBzADP]/[F1] ratios during photolysis disclosed two sequential covalent binding patterns for each enzyme; a high affinity binding corresponding to unistoichiometric covalent association concomitant with enzyme inhibition, followed by a low affinity multisite-saturating covalent association. Thus, in the absence of the structural asymmetry inducing gamma delta epsilon subunits of the holoenzyme, the sequential binding of nucleotide at putative catalytic sites on the alpha 3 beta 3 complex of any F1 appears sufficient to effect binding affinity changes. With MF1, final covalent saturation of BzADP-accessible sites was achieved with 2 mol of BzADP/mol of enzyme, but with TF1 or its alpha 3 beta 3 complex, saturation required 3 mol of BzADP/mol of enzyme. Such differential final labeling stoichiometries could arise because of the endogenous presence of 1 nucleotide already bound to one of the 3 potential catalytic sites on normally prepared MF1, whereas TF1, possessing no endogenous nucleotide, has 3 vacant BzADP-accessible sites. Kinetics measurements revealed that regardless of the incremental extent of inhibition of the TF1 holoenzyme by BzADP during photolysis, the two higher apparent Km values (approximately 1.5 × 10^-4 and approximately 10^-3 M, respectively) of the progressively inactivated incubation are unchanged relative to fully unmodified enzyme.
As reported for BzATP (or BzADP) and MF1 (Ackerman, S.H., Grubmeyer, C., and Coleman, P.S. (1987) J. Biol. Chem. 262, 13765-13772), this supports the fact that the photocovalent inhibition of F1 is a one-hit one-kill phenomenon. Isoelectric focusing gels revealed that [3H]BzADP covalently modifies both TF1 and MF1 exclusively on the beta subunit, whether or not Mg2+ is present. A single 19-residue [3H]BzADP-labeled peptide was resolved from a tryptic digest of MF1, and this peptide corresponded with the one believed to contain at least a portion of the beta subunit catalytic site domain (i.e. beta Ala-338 to beta Arg-356).
PH motifs in PAR1&2 endow breast cancer growth.
Kancharla, A; Maoz, M; Jaber, M; Agranovich, D; Peretz, T; Grisaru-Granovsky, S; Uziely, B; Bar-Shavit, R
2015-11-24
Although emerging roles of protease-activated receptor1&2 (PAR1&2) in cancer are recognized, their underlying signalling events are poorly understood. Here we show signal-binding motifs in PAR1&2 that are critical for breast cancer growth. This occurs via the association of the pleckstrin homology (PH) domain with Akt/PKB as a key signalling event of PARs. Other PH-domain signal-proteins such as Etk/Bmx and Vav3 also associate with PAR1 and PAR2 through their PH domains. PAR1 and PAR2 bind with priority to Etk/Bmx. A point mutation in PAR2, H349A, but not in R352A, abrogates PH-protein association and is sufficient to markedly reduce PAR2-instigated breast tumour growth in vivo and placental extravillous trophoblast (EVT) invasion in vitro. Similarly, the PAR1 mutant hPar1-7A, which is unable to bind the PH domain, reduces mammary tumours and EVT invasion, endowing these motifs with physiological significance and underscoring the importance of these previously unknown PAR1 and PAR2 PH-domain-binding motifs in both pathological and physiological invasion processes.
Boisgerault, F; Khalil, I; Tieng, V; Connan, F; Tabary, T; Cohen, J H; Choppin, J; Charron, D; Toubert, A
1996-01-01
The peptide-binding motif of HLA-A29, the predisposing allele for birdshot retinopathy, was determined after acid-elution of endogenous peptides from purified HLA-A29 molecules. Individual and pooled HPLC fractions were sequenced by Edman degradation. Major anchor residues could be defined as glutamate at the second position of the peptide and as tyrosine at the carboxyl terminus. In vitro binding of polyglycine synthetic peptides to purified HLA-A29 molecules also revealed the need for an auxiliary anchor residue at the third position, preferably phenylalanine. By using this motif, we synthesized six peptides from the retinal soluble antigen, a candidate autoantigen in autoimmune uveoretinitis. Their in vitro binding was tested on HLA-A29 and also on HLA-B44 and HLA-B61, two alleles sharing close peptide-binding motifs. Two peptides derived from the carboxyl-terminal sequence of the human retinal soluble antigen bound efficiently to HLA-A29. This study could contribute to the prediction of T-cell epitopes from retinal autoantigens implicated in birdshot retinopathy. PMID:8622959
Hatakeyama, Tomomitsu; Ishimine, Tomohiro; Baba, Tomohiro; Kimura, Masanari; Unno, Hideaki; Goda, Shuichiro
2013-07-01
CEL-I is a Gal/GalNAc-specific C-type lectin isolated from the sea cucumber Cucumaria echinata. This lectin is composed of two carbohydrate-recognition domains (CRDs) with the carbohydrate-recognition motif QPD (Gln-Pro-Asp), which is generally known to exist in galactose-specific C-type CRDs. In the present study, a mutant CEL-I with EPN (Glu-Pro-Asn) motif, which is thought to be responsible for the carbohydrate-recognition of mannose-specific C-type CRDs, was produced in Escherichia coli, and its effects on the carbohydrate-binding specificity were examined using polyamidoamine dendrimer (PD) conjugated with carbohydrates. Although wild-type CEL-I effectively formed complexes with N-acetylgalactosamine (GalNAc)-PD but not with mannose-PD, the mutant CEL-I showed relatively weak but definite affinity for mannose-PD. These results indicated that the QPD and EPN motifs play a significant role in the carbohydrate-recognition mechanism of CEL-I, especially in the discrimination of galactose and mannose. Additional mutations in the recombinant CEL-I binding site may further increase its specificity for mannose, and should provide insights into designing novel carbohydrate-recognition proteins.
Ryon, J J; Fixman, E D; Houchens, C; Zong, J; Lieberman, P M; Chang, Y N; Hayward, G S; Hayward, S D
1993-01-01
Herpesvirus papio (HVP) is a B-lymphotropic baboon virus with an estimated 40% homology to Epstein-Barr virus (EBV). We have cloned and sequenced ori-Lyt of herpesvirus papio and found a striking degree of nucleotide homology (89%) with ori-Lyt of EBV. Transcriptional elements form an integral part of EBV ori-Lyt. The promoter and enhancer domains of EBV ori-Lyt are conserved in herpesvirus papio. The EBV ori-Lyt promoter contains four binding sites for the EBV lytic cycle transactivator Zta, and the enhancer includes one Zta and two Rta response elements. All five of the Zta response elements and one of the Rta motifs are conserved in HVP ori-Lyt, and the HVP DS-L leftward promoter and the enhancer were activated in transient transfection assays by the EBV Zta and Rta transactivators. The EBV ori-Lyt enhancer contains a palindromic sequence, GGTCAGCTGACC, centered on a PvuII restriction site. This sequence, with a single base change, is also present in the HVP ori-Lyt enhancer. DNase I footprinting demonstrated that the PvuII sequence was bound by a protein present in a Raji nuclear extract. Mobility shift and competition assays using oligonucleotide probes identified this sequence as a binding site for the cellular transcription factor MLTF. Mutagenesis of the binding site indicated that MLTF contributes significantly to the constitutive activity of the ori-Lyt enhancer. The high degree of conservation of cis-acting signal sequences in HVP ori-Lyt was further emphasized by the finding that an HVP ori-Lyt-containing plasmid was replicated in Vero cells by a set of cotransfected EBV replication genes. The central domain of EBV ori-Lyt contains two related AT-rich palindromes, one of which is partially duplicated in the HVP sequence. The AT-rich palindromes are functionally important cis-acting motifs. Deletion of these palindromes severely diminished replication of an ori-Lyt target plasmid. PMID:8389916
In-cell RNA structure probing with SHAPE-MaP.
Smola, Matthew J; Weeks, Kevin M
2018-06-01
This protocol is an extension to: Nat. Protoc. 10, 1643-1669 (2015); doi:10.1038/nprot.2015.103; published online 01 October 2015. RNAs play key roles in many cellular processes. The underlying structure of RNA is an important determinant of how transcripts function, are processed, and interact with RNA-binding proteins and ligands. RNA structure analysis by selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) takes advantage of the reactivity of small electrophilic chemical probes that react with the 2'-hydroxyl group to assess RNA structure at nucleotide resolution. When coupled with mutational profiling (MaP), in which modified nucleotides are detected as internal miscodings during reverse transcription and then read out by massively parallel sequencing, SHAPE yields quantitative per-nucleotide measurements of RNA structure. Here, we provide an extension to our previous in vitro SHAPE-MaP protocol with detailed guidance for undertaking and analyzing SHAPE-MaP probing experiments in live cells. The MaP strategy works for both abundant-transcriptome experiments and for cellular RNAs of low to moderate abundance, which are not well examined by whole-transcriptome methods. In-cell SHAPE-MaP, performed in roughly 3 d, can be applied in cell types ranging from bacteria to cultured mammalian cells and is compatible with a variety of structure-probing reagents. We detail several strategies by which in-cell SHAPE-MaP can inform new biological hypotheses and emphasize downstream analyses that reveal sequence or structure motifs important for RNA interactions in cells.
Liu, Y; Chatterjee, A; Chatterjee, A K
1994-01-01
Our previous genetic analysis (J. W. Willis, J. K. Engwall, and A. K. Chatterjee, Phytopathology 77:1199-1205, 1987) had revealed a tight linkage between pel-3 (pel, pectate lyase gene) and peh-1 (peh, polygalacturonase gene) within the chromosome of Erwinia carotovora subsp. carotovora 71. Nucleotide sequencing, transcript assays, and expression of enzymatic activities in Escherichia coli have now confirmed that a 3,500-bp segment contains the open reading frames (ORFs) for Pel-3 and Peh-1. The 1,041-bp pel-3 ORF and the 1,206-bp peh-1 ORF are separated by a 579-bp sequence. The genes are transcribed divergently from their own promoters. In E. coli and E. carotovora subsp. carotovora 71, peh-1 is better expressed than pel-3. However, plant signals activate the expression of both the genes in E. carotovora subsp. carotovora. A consensus integration host factor (IHF)-binding sequence upstream of pel-3 appears physiologically significant, since pel-3 promoter activity is higher in an E. coli IHF+ strain than in an IHF- strain. While peh-1 has extensive homology with plant and bacterial peh genes, pel-3 appears not to have significant homology with the pel genes belonging to the pelBC, pelADE, or periplasmic pel families. Pel-3 also is unusual in that it is predicted to contain an ATP- and GTP-binding site motif A (P-loop) not found in the other Pels. PMID:8074530
Wang, Guan-Feng; Ji, Jiabing; El-Kasmi, Farid; Dangl, Jeffery L; Johal, Guri; Balint-Kurti, Peter J
2015-02-01
Plant disease resistance is often mediated by nucleotide binding-leucine rich repeat (NLR) proteins which remain auto-inhibited until recognition of specific pathogen-derived molecules causes their activation, triggering a rapid, localized cell death called a hypersensitive response (HR). Three domains are recognized in one of the major classes of NLR proteins: a coiled-coil (CC), a nucleotide binding (NB-ARC) and a leucine rich repeat (LRR) domain. The maize NLR gene Rp1-D21 derives from an intergenic recombination event between two NLR genes, Rp1-D and Rp1-dp2, and confers an autoactive HR. We report systematic structural and functional analyses of Rp1 proteins in maize and N. benthamiana to characterize the molecular mechanism of NLR activation/auto-inhibition. We derive a model comprising the following three main features: Rp1 proteins appear to self-associate to become competent for activity. The CC domain is signaling-competent and is sufficient to induce HR. This can be suppressed by the NB-ARC domain through direct interaction. In autoactive proteins, the interaction of the LRR domain with the NB-ARC domain causes de-repression and thus disrupts the inhibition of HR. Further, we identify specific amino acids and combinations thereof that are important for the auto-inhibition/activity of Rp1 proteins. We also provide evidence for the function of MHD2, a previously uncharacterized, though widely conserved NLR motif. This work reports several novel insights into the precise structural requirement for NLR function and informs efforts towards utilizing these proteins for engineering disease resistance.
Xu, Zhenglei; Yu, Zhichao; Nai, Shumei; Shi, Ruiyue; Tang, Qinhong; Zhang, Haiyang; Ye, Lijuan; Wang, Lisheng; Hong, Yincai
2017-10-01
Spon2 is a proto-oncogene matrix protein that plays an essential role in the tumorigenesis and metastasis of gastric cancer. The protein has recently been found to function as a guanine nucleotide exchange factor through the activation of RhoGTPase. Here, computational modeling and bioinformatics analysis were employed to investigate the molecular mechanism and biological implication underlying Spon2 autoinhibition. It is revealed that the binding of PxxP motif to SH domain can stabilize the intramolecular interaction between the N-terminal helix and DH domain of Spon2, thus shifting the protein into an autoinhibitory state. We therefore proposed releasing Spon2 autoinhibition by targeting SH domain with competitive peptide ligands. To verify this notion, the PxxP sequence was adopted as the starting point to derive an array of efficient SH binders by using a structure-based rational design strategy, which were then substantiated with fluorescence spectroscopy analysis and guanine nucleotide exchange test. Consequently, the obtained peptide ligands were determined to have a moderate or high affinity for SH domain; they can also enhance Spon2 exchange activity by 1.2- to 6.1-fold, exhibiting a significant correlation with their SH-binding affinity (Pearson's coefficient=0.92). In addition, neutral substitution of conserved residues in a high-affinity peptide ligand can largely reduce its Spon2-activating potency, confirming that the designed peptide activates Spon2 by competitively disrupting SH-PxxP interaction. Copyright © 2017 Elsevier Inc. All rights reserved.
Shien, J-H; Wang, Y-S; Chen, C-H; Shieh, H K; Hu, C-C; Chang, P-C
2008-10-01
Live attenuated vaccines have been used for control of the disease caused by goose parvovirus (GPV), but the mechanism involved in attenuation of GPV remains elusive. This report presents the complete nucleotide sequences of two live attenuated strains of GPV (82-0321V and VG32/1) that were independently developed in Taiwan and Europe, together with the parental strain of 82-0321V and a field strain isolated in Taiwan in 2006. Sequence comparisons showed that 82-0321V and VG32/1 had multiple deletions and substitutions in the inverted terminal repeats region when compared with their parental strain or the field virus, but these changes did not affect the formation of the hairpin structure essential for viral replication. Moreover, 82-0321V and VG32/1 had five amino acid changes in the non-structural protein, but these changes were located at positions distant from known functional motifs in the non-structural protein. In contrast, 82-0321V had nine changes and VG32/1 had 11 changes in their capsid proteins (VP1), and the majority of these changes occurred at positions close to the putative receptor binding sites of VP1, as predicted using the structure of adeno-associated virus 2 as the model system. Taken together, the results suggest that changes in sequence near the receptor binding sites of VP1 might be responsible for attenuation of GPV. This is the first report of complete nucleotide sequences of GPV other than the virulent B strain, and suggests a possible mechanism for attenuation of GPV.
Moustafa, Ibrahim M.; Shen, Hujun; Morton, Brandon; Colina, Coray M.; Cameron, Craig E.
2011-01-01
The viral RNA-dependent RNA polymerase (RdRp) is essential for multiplication of all RNA viruses. The sequence diversity of an RNA virus population contributes to its ability to infect the host. This diversity emanates from errors made by the RdRp during RNA synthesis. The physical basis for RdRp fidelity is unclear but is linked to conformational changes occurring during the nucleotide-addition cycle. To understand RdRp dynamics that might influence RdRp function, we have analyzed all-atom molecular dynamics (MD) simulations on the nanosecond timescale of four RdRps from the picornavirus family that exhibit 30–74% sequence identity. Principal component analysis showed that the major motions observed during the simulations derived from conserved structural motifs and regions of known function. Dynamics of residues participating in the same biochemical property, for example RNA binding, nucleotide binding or catalysis, were correlated even when spatially distant on the RdRp structure. The conserved and correlated dynamics of functional, structural elements suggest co-evolution of dynamics with structure and function of the RdRp. Crystal structures of all picornavirus RdRps exhibit a template-nascent RNA duplex channel too small to fully accommodate duplex RNA. Simulations revealed opening and closing motions of the RNA and NTP channels, which might be relevant to NTP entry, PPi exit and translocation. A role for nanosecond timescale dynamics in RdRp fidelity is supported by altered dynamics of the high-fidelity G64S derivative of PV RdRp relative to wild-type enzyme. PMID:21575642
DNA binding site characterization by means of Rényi entropy measures on nucleotide transitions.
Perera, A; Vallverdu, M; Claria, F; Soria, J M; Caminal, P
2008-06-01
In this work, parametric information-theory measures for the characterization of binding sites in DNA are extended with the use of transitional probabilities on the sequence. We propose the use of parametric uncertainty measures such as Rényi entropies obtained from the transition probabilities for the study of the binding sites, in addition to nucleotide frequency-based Rényi measures. Results are reported in this work comparing transition frequencies (i.e., dinucleotides) and base frequencies for Shannon and parametric Rényi entropies for a number of binding sites found in E. coli, lambda and T7 organisms. We observe that the information provided by both approaches is not redundant. Furthermore, in the presence of noise in the binding site matrix we observe overall improved robustness of nucleotide transition-based algorithms when compared with nucleotide frequency-based methods.
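The two kinds of measure compared in this abstract can be illustrated with a minimal sketch. It assumes the standard definition of the Rényi entropy of order alpha, H_alpha = log2(sum_i p_i^alpha) / (1 - alpha), with the Shannon entropy as the alpha = 1 limit; the aligned sites, the function names, and the per-column treatment of dinucleotides are hypothetical choices made for illustration and are not taken from the paper.

```python
import math
from collections import Counter

def renyi_entropy(probs, alpha):
    """Rényi entropy in bits; alpha == 1 falls back to the Shannon entropy."""
    probs = [p for p in probs if p > 0]
    if abs(alpha - 1.0) < 1e-12:
        return -sum(p * math.log2(p) for p in probs)
    return math.log2(sum(p ** alpha for p in probs)) / (1.0 - alpha)

def base_distribution(sites, column):
    """Relative frequencies of single bases in one alignment column."""
    counts = Counter(site[column] for site in sites)
    total = sum(counts.values())
    return [c / total for c in counts.values()]

def transition_distribution(sites, column):
    """Relative frequencies of dinucleotides spanning column and column + 1."""
    counts = Counter(site[column:column + 2] for site in sites)
    total = sum(counts.values())
    return [c / total for c in counts.values()]

# Hypothetical aligned binding sites (toy data, not from the study).
toy_sites = ["ACGT", "ACGA", "ACTT", "AAGT"]

for col in range(3):
    h_base = renyi_entropy(base_distribution(toy_sites, col), alpha=2)
    h_trans = renyi_entropy(transition_distribution(toy_sites, col), alpha=2)
    print(col, round(h_base, 3), round(h_trans, 3))
```

In this toy alignment the first column is perfectly conserved, so its base entropy is zero, while the dinucleotide starting there still carries uncertainty. This is one simple way in which transition-based and frequency-based measures can give non-redundant information.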
Kobayashi, Y M; Alseikhan, B A; Jones, L R
2000-06-09
Triadin is an integral membrane protein of the junctional sarcoplasmic reticulum that binds to the high capacity Ca(2+)-binding protein calsequestrin and anchors it to the ryanodine receptor. The lumenal domain of triadin contains multiple repeats of alternating lysine and glutamic acid residues, which have been defined as KEKE motifs and have been proposed to promote protein associations. Here we identified the specific residues of triadin responsible for binding to calsequestrin by mutational analysis of triadin 1, the major cardiac isoform. A series of deletional fusion proteins of triadin 1 was generated, and by using metabolically labeled calsequestrin in filter-overlay assays, the calsequestrin-binding domain of triadin 1 was localized to a single KEKE motif comprised of 25 amino acids. Alanine mutagenesis within this motif demonstrated that the critical amino acids of triadin binding to calsequestrin are the even-numbered residues Lys(210), Lys(212), Glu(214), Lys(216), Gly(218), Gln(220), Lys(222), and Lys(224). Replacement of the odd-numbered residues within this motif by alanine had no effect on calsequestrin binding to triadin. The results suggest a model in which residues 210-224 of triadin form a beta-strand, with the even-numbered residues in the strand interacting with charged residues of calsequestrin, stabilizing a "polar zipper" that links the two proteins together. This small, highly charged beta-strand of triadin may tether calsequestrin to the junctional face membrane, allowing calsequestrin to sequester Ca(2+) in the vicinity of the ryanodine receptor during Ca(2+) uptake and Ca(2+) release.
Sequence-specific DNA binding by MYC/MAX to low-affinity non-E-box motifs.
Allevato, Michael; Bolotin, Eugene; Grossman, Mark; Mane-Padros, Daniel; Sladek, Frances M; Martinez, Ernest
2017-01-01
The MYC oncoprotein regulates transcription of a large fraction of the genome as an obligatory heterodimer with the transcription factor MAX. The MYC:MAX heterodimer and MAX:MAX homodimer (hereafter MYC/MAX) bind Enhancer box (E-box) DNA elements (CANNTG) and have the greatest affinity for the canonical MYC E-box (CME) CACGTG. However, MYC:MAX also recognizes E-box variants and was reported to bind DNA in a "non-specific" fashion in vitro and in vivo. Here, in order to identify potential additional non-canonical binding sites for MYC/MAX, we employed high throughput in vitro protein-binding microarrays, along with electrophoretic mobility-shift assays and bioinformatic analyses of MYC-bound genomic loci in vivo. We identified all hexameric motifs preferentially bound by MYC/MAX in vitro, which include the low-affinity non-E-box sequence AACGTT, and found that the vast majority (87%) of MYC-bound genomic sites in a human B cell line contain at least one of the top 21 motifs bound by MYC:MAX in vitro. We further show that high MYC/MAX concentrations are needed for specific binding to the low-affinity sequence AACGTT in vitro and that elevated MYC levels in vivo more markedly increase the occupancy of AACGTT sites relative to CME sites, especially at distal intergenic and intragenic loci. Hence, MYC binds diverse DNA motifs with a broad range of affinities in a sequence-specific and dose-dependent manner, suggesting that MYC overexpression has more selective effects on the tumor transcriptome than previously thought.
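The motif classes named in this abstract can be located in a sequence with a short scan. The sketch below is only an illustration: it uses the E-box pattern CANNTG, the canonical CACGTG, and the non-E-box AACGTT as given in the abstract, while the function name, the labels, and the test sequence are hypothetical. It checks one strand only, which suffices here because CACGTG and AACGTT are their own reverse complements and the CANNTG pattern is reverse-complement symmetric.

```python
import re

EBOX_PATTERN = re.compile(r"CA[ACGT]{2}TG")  # generic E-box, CANNTG
CANONICAL_EBOX = "CACGTG"                    # canonical MYC E-box (CME)
NON_EBOX_SITE = "AACGTT"                     # low-affinity non-E-box motif

def classify_hexamers(sequence):
    """Return (position, hexamer, label) for every recognised hexamer."""
    seq = sequence.upper()
    hits = []
    for i in range(len(seq) - 5):
        hexamer = seq[i:i + 6]
        if hexamer == CANONICAL_EBOX:
            hits.append((i, hexamer, "canonical E-box"))
        elif EBOX_PATTERN.fullmatch(hexamer):
            hits.append((i, hexamer, "E-box variant"))
        elif hexamer == NON_EBOX_SITE:
            hits.append((i, hexamer, "non-E-box"))
    return hits

# Hypothetical test sequence containing one site of each class.
toy_sequence = "GGCACGTGTTAACGTTCAGCTG"
for position, hexamer, label in classify_hexamers(toy_sequence):
    print(position, hexamer, label)
```

A scan like this reports only sequence matches; it says nothing about affinity or occupancy, which the study measured experimentally.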
Bosselut, R; Levin, J; Adjadj, E; Ghysdael, J
1993-11-11
Ets proteins form a family of sequence-specific DNA-binding proteins that bind DNA through an 85-amino-acid conserved domain, the Ets domain, whose sequence is unrelated to any other characterized DNA-binding domain. Unlike all other known Ets proteins, which bind specific DNA sequences centered over either GGAA or GGAT core motifs, E74 and Elf1 selectively bind to GGAA core-containing sites. Elf1 and E74 differ from other Ets proteins in three residues located in an otherwise highly conserved region of the Ets domain, referred to as conserved region III (CRIII). We show that a restricted selectivity for GGAA core-containing sites could be conferred to Ets1 upon changing a single lysine residue within CRIII to the threonine found in Elf1 and E74 at this position. Conversely, the reciprocal mutation in Elf1 confers to this protein the ability to bind to GGAT core-containing EBS. This, together with the fact that mutation of two invariant arginine residues in CRIII abolishes DNA binding, indicates that CRIII plays a key role in Ets domain recognition of the GGAA/T core motif and leads us to discuss a model of Ets proteins--core motif interaction.
Ng, Chai Ann; Ke, Ying; Perry, Matthew D.; Tan, Peter S.; Hill, Adam P.; Vandenberg, Jamie I.
2013-01-01
Kv11.1 potassium channels are important for regulation of the normal rhythm of the heartbeat. Reduced activity of Kv11.1 channels causes long QT syndrome type 2, a disorder that increases the risk of cardiac arrhythmias and sudden cardiac arrest. Kv11.1 channels are members of the KCNH subfamily of voltage-gated K+ channels. However, they also share many similarities with the cyclic nucleotide gated ion channel family, including having a cyclic nucleotide-binding homology (cNBH) domain. Kv11.1 channels, however, are not directly regulated by cyclic nucleotides. Recently, crystal structures of the cNBH domain from mEAG and zELK channels, both members of the KCNH family of voltage-gated potassium channels, revealed that a C-terminal β9-strand in the cNBH domain occupied the putative cyclic nucleotide-binding site thereby precluding binding of cyclic nucleotides. Here we show that mutations to residues in the β9-strand affect the stability of the open state relative to the closed state of Kv11.1 channels. We also show that disrupting the structure of the β9-strand reduces the stability of the inactivated state relative to the open state. Clinical mutations located in this β9-strand result in reduced trafficking efficiency, which suggests that binding of the C-terminal β9-strand to the putative cyclic nucleotide-binding pocket is also important for assembly and trafficking of Kv11.1 channels. PMID:24204727
Moreno, Renata; Hernández-Arranz, Sofía; La Rosa, Ruggero; Yuste, Luis; Madhushani, Anjana; Shingler, Victoria; Rojo, Fernando
2015-01-01
The Crc protein is a global regulator that has a key role in catabolite repression and optimization of metabolism in Pseudomonads. Crc inhibits gene expression post-transcriptionally, preventing translation of mRNAs bearing an AAnAAnAA motif [the catabolite activity (CA) motif] close to the translation start site. Although Crc was initially believed to bind RNA by itself, this idea was recently challenged by results suggesting that a protein co-purifying with Crc, presumably the Hfq protein, could account for the detected RNA-binding activity. Hfq is an abundant protein that has a central role in post-transcriptional gene regulation. Herein, we show that the Pseudomonas putida Hfq protein can recognize the CA motifs of RNAs through its distal face and that Crc facilitates formation of a more stable complex at these targets. Crc was unable to bind RNA in the absence of Hfq. However, pull-down assays showed that Crc and Hfq can form a co-complex with RNA containing a CA motif in vitro. Inactivation of the hfq or the crc gene impaired catabolite repression to a similar extent. We propose that Crc and Hfq cooperate in catabolite repression, probably through forming a stable co-complex with RNAs containing CA motifs to result in inhibition of translation initiation. © 2014 Society for Applied Microbiology and John Wiley & Sons Ltd.
Molecular origin of the binding of WWOX tumor suppressor to ErbB4 receptor tyrosine kinase.
Schuchardt, Brett J; Bhat, Vikas; Mikles, David C; McDonald, Caleb B; Sudol, Marius; Farooq, Amjad
2013-12-23
The ability of WWOX tumor suppressor to physically associate with the intracellular domain (ICD) of ErbB4 receptor tyrosine kinase is believed to play a central role in downregulating the transcriptional function of the latter. Herein, using various biophysical methods, we show that while the WW1 domain of WWOX binds to PPXY motifs located within the ICD of ErbB4 in a physiologically relevant manner, the WW2 domain does not. Importantly, while the WW1 domain absolutely requires the integrity of the PPXY consensus sequence, nonconsensus residues within and flanking this motif do not appear to be critical for binding. This strongly suggests that the WW1 domain of WWOX is rather promiscuous toward its cellular partners. We also provide evidence that the lack of binding of the WW2 domain of WWOX to PPXY motifs is due to the replacement of a signature tryptophan, lining the hydrophobic ligand binding groove, with tyrosine (Y85). Consistent with this notion, the Y85W substitution within the WW2 domain exquisitely restores its binding to PPXY motifs in a manner akin to the binding of the WW1 domain of WWOX. Of particular significance is the observation that the WW2 domain augments the binding of the WW1 domain to ErbB4, implying that the former serves as a chaperone within the context of the WW1-WW2 tandem module of WWOX in agreement with our findings reported previously. Altogether, our study sheds new light on the molecular basis of an important WW-ligand interaction involved in mediating a plethora of cellular processes.
Molecular Origin of the Binding of WWOX Tumor Suppressor to ErbB4 Receptor Tyrosine Kinase
Schuchardt, Brett J.; Bhat, Vikas; Mikles, David C.; McDonald, Caleb B.; Sudol, Marius; Farooq, Amjad
2014-01-01
The ability of WWOX tumor suppressor to physically associate with the intracellular domain (ICD) of ErbB4 receptor tyrosine kinase is believed to play a central role in down-regulating the transcriptional function of the latter. Herein, using various biophysical methods, we show that while the WW1 domain of WWOX binds to PPXY motifs located within the ICD of ErbB4 in a physiologically-relevant manner, the WW2 domain does not. Importantly, while the WW1 domain absolutely requires the integrity of the PPXY consensus sequence, non-consensus residues within and flanking this motif do not appear to be critical for binding. This strongly suggests that the WW1 domain of WWOX is rather promiscuous toward its cellular partners. We also provide evidence that the lack of binding of WW2 domain of WWOX to PPXY motifs is due to the replacement of a signature tryptophan, lining the hydrophobic ligand binding groove, with tyrosine (Y85). Consistent with this notion, the Y85W substitution within the WW2 domain exquisitely restores its binding to PPXY motifs in a manner akin to the binding of WW1 domain of WWOX. Of particular significance is the observation that WW2 domain augments the binding of WW1 domain to ErbB4, implying that the former serves as a chaperone within the context of the WW1–WW2 tandem module of WWOX in agreement with our findings reported previously. Taken together, our study sheds new light on the molecular basis of an important WW-ligand interaction involved in mediating a plethora of cellular processes. PMID:24308844
Mohtar, M Aiman; Hernychova, Lenka; O'Neill, J Robert; Lawrence, Melanie L; Murray, Euan; Vojtesek, Borek; Hupp, Ted R
2018-04-01
AGR2 is an oncogenic endoplasmic reticulum (ER)-resident protein disulfide isomerase. AGR2 protein has a relatively unique property for a chaperone in that it can bind sequence-specifically to a specific peptide motif (TTIYY). A synthetic TTIYY-containing peptide column was used to affinity-purify AGR2 from crude lysates highlighting peptide selectivity in complex mixtures. Hydrogen-deuterium exchange mass spectrometry localized the dominant region in AGR2 that interacts with the TTIYY peptide to within a structural loop from amino acids 131-135 (VDPSL). A peptide binding site consensus of Tx[IL][YF][YF] was developed for AGR2 by measuring its activity against a mutant peptide library. Screening the human proteome for proteins harboring this motif revealed an enrichment in transmembrane proteins and we focused on validating EpCAM as a potential AGR2-interacting protein. AGR2 and EpCAM proteins formed a dose-dependent protein-protein interaction in vitro. Proximity ligation assays demonstrated that endogenous AGR2 and EpCAM protein associate in cells. Introducing a single alanine mutation in EpCAM at Tyr251 attenuated its binding to AGR2 in vitro and in cells. Hydrogen-deuterium exchange mass spectrometry was used to identify a stable binding site for AGR2 on EpCAM, adjacent to the TLIYY motif and surrounding EpCAM's detergent binding site. These data define a dominant site on AGR2 that mediates its specific peptide-binding function. EpCAM forms a model client protein for AGR2 to study how an ER-resident chaperone can dock specifically to a peptide motif and regulate the trafficking of a protein destined for the secretory pathway. © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.
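The consensus Tx[IL][YF][YF] reported in this abstract maps directly onto a regular expression, which gives a rough idea of what a proteome-wide motif screen involves. The following is a minimal sketch under stated assumptions: the protein names and sequences are invented toy data, the function name is hypothetical, and nothing here reproduces the authors' actual screening pipeline.

```python
import re

# Consensus from the abstract: T, any residue, I or L, Y or F, Y or F.
# The lookahead wrapper lets overlapping occurrences be reported too.
AGR2_CONSENSUS = re.compile(r"(?=(T.[IL][YF][YF]))")

def find_consensus_sites(protein_sequence):
    """Return (zero-based position, matched pentapeptide) for each site."""
    seq = protein_sequence.upper()
    return [(m.start(), m.group(1)) for m in AGR2_CONSENSUS.finditer(seq)]

# Hypothetical toy sequences (not real proteome entries).
toy_proteome = {
    "toy_client_1": "MKTLIYYAAG",  # contains TLIYY
    "toy_client_2": "MSTTIYYQ",    # contains TTIYY
    "toy_negative": "MSTTAYYQ",    # A at the third position breaks the motif
}

hits = {name: find_consensus_sites(seq) for name, seq in toy_proteome.items()}
for name, sites in hits.items():
    print(name, sites)
```

A sequence match of this kind only nominates candidates; the abstract describes binding assays, proximity ligation, and mutagenesis as the steps that validated EpCAM.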
DOE Office of Scientific and Technical Information (OSTI.GOV)
Han, S.; Tainer, J.A.
2001-08-01
ADP-ribosylation is a widely occurring and biologically critical covalent chemical modification process in pathogenic mechanisms, intracellular signaling systems, DNA repair, and cell division. The reaction is catalyzed by ADP-ribosyltransferases, which transfer the ADP-ribose moiety of NAD to a target protein with nicotinamide release. A family of bacterial toxins and eukaryotic enzymes has been termed the mono-ADP-ribosyltransferases, in distinction to the poly-ADP-ribosyltransferases, which catalyze the addition of multiple ADP-ribose groups to the carboxyl terminus of eukaryotic nucleoproteins. Despite the limited primary sequence homology among the different ADP-ribosyltransferases, a central cleft bearing the NAD-binding pocket, formed by the two perpendicular β-sheet cores, has been remarkably conserved between bacterial toxins and eukaryotic mono- and poly-ADP-ribosyltransferases. The majority of bacterial toxins and eukaryotic mono-ADP-ribosyltransferases are characterized by conserved His and catalytic Glu residues. In contrast, Diphtheria toxin, Pseudomonas exotoxin A, and eukaryotic poly-ADP-ribosyltransferases are characterized by conserved Arg and catalytic Glu residues. The NAD-binding core of a binary toxin and a C3-like toxin family identified an ARTT motif (ADP-ribosylating turn-turn motif) that is implicated in substrate specificity and recognition by structural and mutagenic studies. Here we apply structure-based sequence alignment and comparative structural analyses of all known structures of ADP-ribosyltransferases to suggest that this ARTT motif is functionally important in many ADP-ribosylating enzymes that bear a NAD binding cleft as characterized by conserved Arg and catalytic Glu residues. Overall, structure-based sequence analysis reveals common core structures and conserved active sites of ADP-ribosyltransferases to support similar NAD binding mechanisms but differing mechanisms of target protein binding via sequence variations within the ARTT motif structural framework. Thus, we propose here that the ARTT motif represents an experimentally testable general recognition motif region for many ADP-ribosyltransferases and thereby potentially provides a unified structural understanding of substrate recognition in ADP-ribosylation processes.
The complete nucleotide sequence of RNA 3 of a peach isolate of Prunus necrotic ringspot virus.
Hammond, R W; Crosslin, J M
1995-04-01
The complete nucleotide sequence of RNA 3 of the PE-5 peach isolate of Prunus necrotic ringspot ilarvirus (PNRSV) was obtained from cloned cDNA. The RNA sequence is 1941 nucleotides and contains two open reading frames (ORFs). ORF 1 consisted of 284 amino acids with a calculated molecular weight of 31,729 Da and ORF 2 contained 224 amino acids with a calculated molecular weight of 25,018 Da. ORF 2 corresponds to the coat protein gene. Expression of ORF 2 engineered into a pTrcHis vector in Escherichia coli results in a fusion polypeptide of approximately 28 kDa which cross-reacts with PNRSV polyclonal antiserum. Analysis of the coat protein amino acid sequence reveals a putative "zinc-finger" domain at the amino-terminal portion of the protein. Two tetranucleotide AUGC motifs occur in the 3'-UTR of the RNA and may function in coat protein binding and genome activation. ORF 1 homologies to other ilarviruses and alfalfa mosaic virus are confined to limited regions of conserved amino acids. The translated amino acid sequence of the coat protein gene shows 92% similarity to one isolate of apple mosaic virus, a closely related member of the ilarvirus group of plant viruses, but only 66% similarity to the amino acid sequence of the coat protein gene of a second isolate. These relationships are also reflected at the nucleotide sequence level. These results in one instance confirm the close similarities observed at the biophysical and serological levels between these two viruses, but on the other hand call into question the nomenclature used to describe these viruses.
Castro-Mondragon, Jaime Abraham; Jaeger, Sébastien; Thieffry, Denis; Thomas-Chollier, Morgane; van Helden, Jacques
2017-07-27
Transcription factor (TF) databases contain multitudes of binding motifs (TFBMs) from various sources, from which non-redundant collections are derived by manual curation. The advent of high-throughput methods stimulated the production of novel collections with increasing numbers of motifs. Meta-databases, built by merging these collections, contain redundant versions, because available tools are not suited to automatically identify and explore biologically relevant clusters among thousands of motifs. Motif discovery from genome-scale data sets (e.g. ChIP-seq) also produces redundant motifs, hampering the interpretation of results. We present matrix-clustering, a versatile tool that clusters similar TFBMs into multiple trees, and automatically creates non-redundant TFBM collections. A feature unique to matrix-clustering is its dynamic visualisation of aligned TFBMs, and its capability to simultaneously treat multiple collections from various sources. We demonstrate that matrix-clustering considerably simplifies the interpretation of combined results from multiple motif discovery tools, and highlights biologically relevant variations of similar motifs. We also ran a large-scale application to cluster ∼11 000 motifs from 24 entire databases, showing that matrix-clustering correctly groups motifs belonging to the same TF families, and drastically reduced motif redundancy. matrix-clustering is integrated within the RSAT suite (http://rsat.eu/), accessible through a user-friendly web interface or command-line for its integration in pipelines. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, Jia; Harrison, Rane A.; Li, Lianbo
KRAS G12C, the most common RAS mutation found in non-small-cell lung cancer, has been the subject of multiple recent covalent small-molecule inhibitor campaigns including efforts directed at the guanine nucleotide pocket and separate work focused on an inducible pocket adjacent to the switch motifs. Multiple conformations of switch II have been observed, suggesting that switch II pocket (SIIP) binders may be capable of engaging a range of KRAS conformations. Here we report the use of hydrogen/deuterium-exchange mass spectrometry (HDX MS) to discriminate between conformations of switch II induced by two chemical classes of SIIP binders. We investigated the structural basis for differences in HDX MS using X-ray crystallography and discovered a new SIIP configuration in response to binding of a quinazoline chemotype. These results have implications for structure-guided drug design targeting the RAS SIIP.
NASA Astrophysics Data System (ADS)
Nakamura, Hideki; Lee, Albert A.; Afshar, Ali Sobhi; Watanabe, Shigeki; Rho, Elmer; Razavi, Shiva; Suarez, Allister; Lin, Yu-Chun; Tanigawa, Makoto; Huang, Brian; Derose, Robert; Bobb, Diana; Hong, William; Gabelli, Sandra B.; Goutsias, John; Inoue, Takanari
2018-01-01
Some protein components of intracellular non-membrane-bound entities, such as RNA granules, are known to form hydrogels in vitro. The physico-chemical properties and functional role of these intracellular hydrogels are difficult to study, primarily due to technical challenges in probing these materials in situ. Here, we present iPOLYMER, a strategy for a rapid induction of protein-based hydrogels inside living cells that explores the chemically inducible dimerization paradigm. Biochemical and biophysical characterizations aided by computational modelling show that the polymer network formed in the cytosol resembles a physiological hydrogel-like entity that acts as a size-dependent molecular sieve. We functionalize these polymers with RNA-binding motifs that sequester polyadenine-containing nucleotides to synthetically mimic RNA granules. These results show that iPOLYMER can be used to synthetically reconstitute the nucleation of biologically functional entities, including RNA granules in intact cells.
Dansault, Anouk; David, Gabriel; Schwartz, Claire; Jaliffa, Carolina; Vieira, Véronique; de la Houssaye, Guillaume; Bigot, Karine; Catin, Françise; Tattu, Laurent; Chopin, Catherine; Halimi, Philippe; Roche, Olivier; Van Regemorter, Nicole; Munier, Francis; Schorderet, Daniel; Dufier, Jean-Louis; Marsac, Cécile; Ricquier, Daniel; Menasche, Maurice; Penfornis, Alfred; Abitbol, Marc
2007-04-02
The PAX6 gene was first described as a candidate for human aniridia. However, PAX6 expression is not restricted to the eye and it appears to be crucial for brain development. We studied PAX6 mutations in a large spectrum of patients who presented with aniridia phenotypes, Peters' anomaly, and anterior segment malformations associated or not with neurological anomalies. Patients and related families were ophthalmologically phenotyped, and in some cases neurologically and endocrinologically examined. We screened the PAX6 gene by direct sequencing in three groups of patients: those affected by aniridia; those with diverse ocular manifestations; and those with Peters' anomaly. Two mutations were investigated by generating crystallographic representations of the amino acid changes. Three novel heterozygous mutations affecting three unrelated families were identified: the g.572T>C nucleotide change, located in exon 5, and corresponding to the Leucine 46 Proline amino-acid mutation (L46P); the g.655A>G nucleotide change, located in exon 6, and corresponding to the Serine 74 Glycine amino-acid mutation (S74G); and the nucleotide deletion 579delG, located in exon 6, which induces a frameshift mutation leading to a stop codon (V48fsX53). The L46P mutation was identified in affected patients presenting bilateral microphthalmia, cataracts, and nystagmus. The S74G mutation was found in a large family that had congenital ocular abnormalities, diverse neurological manifestations, and variable cognitive impairments. The 579delG deletion (V48fsX53) caused bilateral aniridia associated with congenital cataract, foveal hypoplasia, and nystagmus in the affected members of the same family. We also detected a novel intronic nucleotide change, IVS2+9G>A (very likely a mutation) in an apparently isolated patient affected by a complex ocular phenotype, characterized primarily by a bilateral microphthalmia. Whether this nucleotide change is indeed pathogenic remains to be demonstrated. 
Two previously known heterozygous mutations of the PAX6 gene sequence were also detected in patients affected by aniridia: a de novo previously known nucleotide change, g.972C>T (Q179X), in exon 8, leading to a stop codon and a heterozygous g.555C>A (C40X) recurrent nonsense mutation in exon 5. No mutations were found in patients with Peters' anomaly. We identified three mutations associated with aniridia phenotypes (Q179X, C40X, and V48fsX53). The three other mutations reported here cause non-aniridia ocular phenotypes associated in some cases with neurological anomalies. The IVS2+9G>A nucleotide change was detected in a patient with a microphthalmia phenotype. The L46P mutation was detected in a family with microphthalmia, cataract, and nystagmus. This mutation is located in the DNA-binding paired-domain and the crystallographic representations of this mutation show that this mutation may affect the helix-turn-helix motif, and as a consequence the DNA-binding properties of the resulting mutated protein. Ser74 is located in the PAX6 PD linker region, essential for DNA recognition and DNA binding, and the side chain of the Ser74 contributes to DNA recognition by the linker domain through direct contacts. Crystallographic representations show that the S74G mutation results in no side chain and therefore perturbs the DNA-binding properties of PAX6. This study highlights the severity and diversity of the consequences of PAX6 mutations that appeared to result from the complexity of the PAX6 gene structure, and the numerous possibilities for DNA binding. This study emphasizes the fact that neurodevelopmental abnormalities may be caused by PAX6 mutations. The neuro-developmental abnormalities caused by PAX6 mutations are probably still overlooked in the current clinical examinations performed throughout the world in patients affected by PAX6 mutations.
Dansault, Anouk; David, Gabriel; Schwartz, Claire; Jaliffa, Carolina; Vieira, Véronique; de la Houssaye, Guillaume; Bigot, Karine; Catin, Françise; Tattu, Laurent; Chopin, Catherine; Halimi, Philippe; Roche, Olivier; Van Regemorter, Nicole; Munier, Francis; Schorderet, Daniel; Dufier, Jean-Louis; Marsac, Cécile; Ricquier, Daniel; Menasche, Maurice; Penfornis, Alfred
2007-01-01
Purpose The PAX6 gene was first described as a candidate for human aniridia. However, PAX6 expression is not restricted to the eye and it appears to be crucial for brain development. We studied PAX6 mutations in a large spectrum of patients who presented with aniridia phenotypes, Peters' anomaly, and anterior segment malformations associated or not with neurological anomalies. Methods Patients and related families were ophthalmologically phenotyped, and in some cases neurologically and endocrinologically examined. We screened the PAX6 gene by direct sequencing in three groups of patients: those affected by aniridia; those with diverse ocular manifestations; and those with Peters' anomaly. Two mutations were investigated by generating crystallographic representations of the amino acid changes. Results Three novel heterozygous mutations affecting three unrelated families were identified: the g.572T>C nucleotide change, located in exon 5, and corresponding to the Leucine 46 Proline amino-acid mutation (L46P); the g.655A>G nucleotide change, located in exon 6, and corresponding to the Serine 74 Glycine amino-acid mutation (S74G); and the nucleotide deletion 579delG, located in exon 6, which induces a frameshift mutation leading to a stop codon (V48fsX53). The L46P mutation was identified in affected patients presenting bilateral microphthalmia, cataracts, and nystagmus. The S74G mutation was found in a large family that had congenital ocular abnormalities, diverse neurological manifestations, and variable cognitive impairments. The 579delG deletion (V48fsX53) caused bilateral aniridia associated with congenital cataract, foveal hypoplasia, and nystagmus in the affected members of the same family. We also detected a novel intronic nucleotide change, IVS2+9G>A (very likely a mutation) in an apparently isolated patient affected by a complex ocular phenotype, characterized primarily by a bilateral microphthalmia. 
Whether this nucleotide change is indeed pathogenic remains to be demonstrated. Two previously known heterozygous mutations of the PAX6 gene sequence were also detected in patients affected by aniridia: a de novo previously known nucleotide change, g.972C>T (Q179X), in exon 8, leading to a stop codon and a heterozygous g.555C>A (C40X) recurrent nonsense mutation in exon 5. No mutations were found in patients with Peters' anomaly. Conclusions We identified three mutations associated with aniridia phenotypes (Q179X, C40X, and V48fsX53). The three other mutations reported here cause non-aniridia ocular phenotypes associated in some cases with neurological anomalies. The IVS2+9G>A nucleotide change was detected in a patient with a microphthalmia phenotype. The L46P mutation was detected in a family with microphthalmia, cataract, and nystagmus. This mutation is located in the DNA-binding paired-domain and the crystallographic representations of this mutation show that this mutation may affect the helix-turn-helix motif, and as a consequence the DNA-binding properties of the resulting mutated protein. Ser74 is located in the PAX6 PD linker region, essential for DNA recognition and DNA binding, and the side chain of the Ser74 contributes to DNA recognition by the linker domain through direct contacts. Crystallographic representations show that the S74G mutation results in no side chain and therefore perturbs the DNA-binding properties of PAX6. This study highlights the severity and diversity of the consequences of PAX6 mutations that appeared to result from the complexity of the PAX6 gene structure, and the numerous possibilities for DNA binding. This study emphasizes the fact that neurodevelopmental abnormalities may be caused by PAX6 mutations. The neuro-developmental abnormalities caused by PAX6 mutations are probably still overlooked in the current clinical examinations performed throughout the world in patients affected by PAX6 mutations. PMID:17417613
A Screen for Novel Phosphoinositide 3-kinase Effector Proteins*
Dixon, Miles J.; Gray, Alexander; Boisvert, François-Michel; Agacan, Mark; Morrice, Nicholas A.; Gourlay, Robert; Leslie, Nicholas R.; Downes, C. Peter; Batty, Ian H.
2011-01-01
Class I phosphoinositide 3-kinases exert important cellular effects through their two primary lipid products, phosphatidylinositol 3,4,5-trisphosphate and phosphatidylinositol 3,4-bisphosphate (PtdIns(3,4)P2). As few molecular targets for PtdIns(3,4)P2 have yet been identified, a screen for PI 3-kinase-responsive proteins that is selective for these is described. This features a tertiary approach incorporating a unique, primary recruitment of target proteins in intact cells to membranes selectively enriched in PtdIns(3,4)P2. A secondary purification of these proteins, optimized using tandem pleckstrin homology domain containing protein-1 (TAPP-1), an established PtdIns(3,4)P2 selective ligand, yields a fraction enriched in proteins of potentially similar lipid binding character that are identified by liquid chromatography-tandem MS. Thirdly, this approach is coupled to stable isotope labeling with amino acids in cell culture using differential isotope labeling of cells stimulated in the absence and presence of the PI 3-kinase inhibitor wortmannin. This provides a ratio-metric readout that distinguishes authentically responsive components from copurifying background proteins. Enriched fractions thus obtained from astrocytoma cells revealed a subset of proteins that exhibited ratios indicative of their initial, cellular responsiveness to PI 3-kinase activation. The inclusion among these of tandem pleckstrin homology domain containing protein-1, three isoforms of Akt, switch associated protein-70, early endosome antigen-1 and of additional proteins expressing recognized lipid binding domains demonstrates the utility of this strategy and lends credibility to the novel candidate proteins identified. The latter encompass a broad set of proteins that include the gene product of TBC1D2A, a putative Rab guanine nucleotide triphosphatase activating protein (GAP) and IQ motif containing GAP1, a potential tumor promoter. 
A sequence comparison of the former protein indicates the presence of a pleckstrin homology domain whose lipid binding character remains to be established. IQ motif containing GAP1 lacks known lipid interacting components and a preliminary analysis here indicates that this may exemplify a novel class of atypical phosphoinositide (aPI) binding domain. PMID:21263009
Verma, Apoorva; Jing-Song, Fan; Finch-Edmondson, Megan L.; Velazquez-Campoy, Adrian; Balasegaran, Shanker; Sudol, Marius; Sivaraman, Jayaraman
2018-01-01
YES-associated protein (YAP) is a major effector protein of the Hippo tumor suppressor pathway, and is phosphorylated by the serine/threonine kinase LATS. Their binding is mediated by the interaction between WW domains of YAP and PPxY motifs of LATS. Their isoforms, YAP2 and LATS1 contain two WW domains and two PPxY motifs respectively. Here, we report the study of the interaction of these domains both in vitro and in human cell lines, to better understand the mechanism of their binding. We show that there is a reciprocal binding preference of YAP2-WW1 with LATS1-PPxY2, and YAP2-WW2 with LATS1-PPxY1. We solved the NMR structures of these complexes and identified several conserved residues that play a critical role in binding. We further created a YAP2 mutant by swapping the WW domains, and found that YAP2 phosphorylation at S127 by LATS1 is not affected by the spatial configuration of its WW domains. This is likely because the region between the PPxY motifs of LATS1 is unstructured, even upon binding with its partner. Based on our observations, we propose possible models for the interaction between YAP2 and LATS1. PMID:29487715
Rational and Modular Design of Potent Ligands Targeting the RNA that Causes Myotonic Dystrophy 2
Lee, Melissa M.; Pushechnikov, Alexei; Disney, Matthew D.
2009-01-01
Most ligands targeting RNA are identified by screening a therapeutic target for binding to members of a ligand library. A potential alternative way to construct RNA binders is through rational design using information about the RNA motifs that ligands prefer to bind. Herein, we describe such an approach to design modularly assembled ligands targeting the RNA that causes myotonic dystrophy type 2 (DM2), a currently untreatable disease. A previous study found that 6′-N-5-hexynoate kanamycin A (1) prefers to bind 2×2 nucleotide, pyrimidine-rich RNA internal loops. Multiple copies of such loops were found in the RNA hairpin that causes DM2. Ligand 1 was then modularly displayed on a peptoid scaffold with varied number and spacing to target several internal loops simultaneously. Modularly assembled ligands were tested for binding to a series of RNAs and for inhibiting the formation of the toxic DM2 RNA-muscleblind protein (MBNL-1) interaction. The most potent ligand displays three 1 modules, each separated by four spacing submonomers, and inhibits the formation of the RNA-protein complex with an IC50 of 25 nM. This ligand binds DM2 RNA with higher affinity and specificity than MBNL-1 does. It binds the DM2 RNA at least 20-fold more tightly than related RNAs and 15-fold more tightly than MBNL-1. A related control peptoid displaying 6′-N-5-hexynoate neamine (2) is >100-fold less potent at inhibiting the RNA-protein interaction and binds to DM2 RNA >125-fold more weakly. Uptake studies in a mouse myoblast cell line also show that the most potent ligand is cell permeable. PMID:19348464
Hovey, Liam; Fowler, C Andrew; Mahling, Ryan; Lin, Zesen; Miller, Mark Stephen; Marx, Dagan C; Yoder, Jesse B; Kim, Elaine H; Tefft, Kristin M; Waite, Brett C; Feldkamp, Michael D; Yu, Liping; Shea, Madeline A
2017-05-01
Several members of the voltage-gated sodium channel family are regulated by calmodulin (CaM) and ionic calcium. The neuronal voltage-gated sodium channel NaV1.2 contains binding sites for both apo (calcium-depleted) and calcium-saturated CaM. We have determined equilibrium dissociation constants for rat NaV1.2 IQ motif [IQRAYRRYLLK] binding to apo CaM (~3 nM) and (Ca2+)4-CaM (~85 nM), showing that apo CaM binding is favored by 30-fold. For both apo and (Ca2+)4-CaM, NMR demonstrated that NaV1.2 IQ motif peptide (NaV1.2IQp) exclusively made contacts with C-domain residues of CaM (CaMC). To understand how calcium triggers conformational change at the CaM-IQ interface, we determined a solution structure (2M5E.pdb) of (Ca2+)2-CaMC bound to NaV1.2IQp. The polarity of (Ca2+)2-CaMC relative to the IQ motif was opposite to that seen in apo CaMC-NaV1.2IQp (2KXW), revealing that CaMC recognizes nested, anti-parallel sites in NaV1.2IQp. Reversal of CaM may require transient release from the IQ motif during calcium binding, and facilitate a re-orientation of CaMN allowing interactions with non-IQ NaV1.2 residues or auxiliary regulatory proteins interacting in the vicinity of the IQ motif. Copyright © 2017 Elsevier B.V. All rights reserved.
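As a rough check on the fold preference quoted above, the two dissociation constants can be converted into a ratio and an approximate free-energy difference using ΔΔG = RT ln(Kd,Ca / Kd,apo). The sketch below is illustrative only: the temperature of 298.15 K is an assumption (the abstract does not state one), and the function names are hypothetical.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)

def fold_preference(kd_weak_nm, kd_tight_nm):
    """Ratio of two dissociation constants (same units)."""
    return kd_weak_nm / kd_tight_nm

def delta_delta_g(kd_weak_nm, kd_tight_nm, temp_k=298.15):
    """Free-energy difference in kcal/mol; temp_k is an assumed value."""
    return R_KCAL * temp_k * math.log(kd_weak_nm / kd_tight_nm)

# Approximate Kd values quoted in the abstract: apo CaM ~3 nM, (Ca2+)4-CaM ~85 nM
fold = fold_preference(85.0, 3.0)
ddg = delta_delta_g(85.0, 3.0)
print(f"fold preference for apo CaM: {fold:.1f}")
print(f"approximate ddG at 298.15 K: {ddg:.2f} kcal/mol")
```

With these rounded inputs the ratio comes out slightly below 30, which is consistent with the "30-fold" figure given that both constants are approximate.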
Solution structure of an ATP-binding RNA aptamer reveals a novel fold.
Dieckmann, T; Suzuki, E; Nakamura, G K; Feigon, J
1996-01-01
In vitro selection has been used to isolate several RNA aptamers that bind specifically to biological cofactors. A well-characterized example is the ATP-binding RNA aptamer family, which contains a conserved 11-base loop opposite a bulged G and flanked by regions of double-stranded RNA. The nucleotides in the consensus sequence provide a binding pocket for ATP (or AMP), which binds with a Kd in the micromolar range. Here we present the three-dimensional solution structure of a 36-nucleotide ATP-binding RNA aptamer complexed with AMP, determined from NMR-derived distance and dihedral angle restraints. The conserved loop and bulged G form a novel compact, folded structure around the AMP. The backbone tracing of the loop nucleotides can be described by a Greek zeta (ζ). Consecutive loop nucleotides G, A, A form a U-turn at the bottom of the zeta, and interact with the AMP to form a structure similar to a GNRA tetraloop, with AMP standing in for the final A. Two asymmetric G·G base pairs close the stems flanking the internal loop. Mutated aptamers support the existence of the tertiary interactions within the consensus nucleotides and with the AMP found in the calculated structures. PMID:8756406
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sekiyama, Naotaka; Arthanari, Haribabu; Papadopoulos, Evangelos
The eIF4E-binding protein (4E-BP) is a phosphorylation-dependent regulator of protein synthesis. The nonphosphorylated or minimally phosphorylated form binds translation initiation factor 4E (eIF4E), preventing binding of eIF4G and the recruitment of the small ribosomal subunit. Signaling events stimulate serial phosphorylation of 4E-BP, primarily by mammalian target of rapamycin complex 1 (mTORC1) at residues T37/T46, followed by T70 and S65. Hyperphosphorylated 4E-BP dissociates from eIF4E, allowing eIF4E to interact with eIF4G and translation initiation to resume. Because overexpression of eIF4E is linked to cellular transformation, 4E-BP is a tumor suppressor, and up-regulation of its activity is a goal of interest for cancer therapy. A recently discovered small molecule, eIF4E/eIF4G interaction inhibitor 1 (4EGI-1), disrupts the eIF4E/eIF4G interaction and promotes binding of 4E-BP1 to eIF4E. Structures of 14- to 16-residue 4E-BP fragments bound to eIF4E contain the eIF4E consensus binding motif, 54YXXXXLΦ60 (motif 1), but lack known phosphorylation sites. We report in this paper a 2.1-Å crystal structure of mouse eIF4E in complex with m7GTP and with a fragment of human 4E-BP1, extended C-terminally from the consensus-binding motif (4E-BP1 50–84). The extension, which includes a proline-turn-helix segment (motif 2) followed by a loop of irregular structure, reveals the location of two phosphorylation sites (S65 and T70). Our major finding is that the C-terminal extension (motif 3) is critical to 4E-BP1–mediated cell cycle arrest and that it partially overlaps with the binding site of 4EGI-1. Finally, the binding of 4E-BP1 and 4EGI-1 to eIF4E is therefore not mutually exclusive, and both ligands contribute to shift the equilibrium toward the inhibition of translation initiation.
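The consensus pattern YXXXXLΦ mentioned above can be expressed as a simple pattern search over a protein sequence. The following is a minimal sketch rather than a method from the paper: the set of residues accepted for Φ (taken here as A, I, L, M, F, V, W), the function name, and the test peptide are all assumptions made for illustration, and the peptide is made up, not a real 4E-BP sequence.

```python
import re

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
HYDROPHOBIC = "AILMFVW"  # assumed definition of the hydrophobic position

# Lookahead so that overlapping candidate sites are all reported.
MOTIF = re.compile(f"(?=(Y[{AMINO_ACIDS}]{{4}}L[{HYDROPHOBIC}]))")

def find_yxxxxl_phi(protein):
    """Return (1-based start position, matched 7-mer) for each candidate site."""
    return [(m.start() + 1, m.group(1)) for m in MOTIF.finditer(protein.upper())]

# Hypothetical peptide, invented for this example only
sites = find_yxxxxl_phi("AAYGSTPLMKK")
print(sites)
```

Positions are reported relative to the supplied fragment, so they only correspond to residue numbers such as 54 to 60 when the full-length sequence is given as input.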
Argo_CUDA: Exhaustive GPU based approach for motif discovery in large DNA datasets.
Vishnevsky, Oleg V; Bocharnikov, Andrey V; Kolchanov, Nikolay A
2018-02-01
The development of chromatin immunoprecipitation sequencing (ChIP-seq) technology has revolutionized the genetic analysis of the basic mechanisms underlying transcription regulation and has led to the accumulation of a huge number of DNA sequences. Many web services are currently available for de novo motif discovery in datasets containing information about DNA/protein binding. The enormous diversity of motifs makes finding them challenging. To avoid these difficulties, researchers use different stochastic approaches. Unfortunately, the efficiency of motif discovery programs declines dramatically as the query set size increases. As a result, only a fraction of the top "peak" ChIP-Seq segments can be analyzed, or the area of analysis must be narrowed. Thus, motif discovery in massive datasets remains a challenging issue. The Argo_CUDA (Compute Unified Device Architecture) web service is designed to process massive DNA data. It is a program for the detection of degenerate oligonucleotide motifs of fixed length written in the 15-letter IUPAC code. Argo_CUDA is a fully exhaustive approach based on high-performance GPU technologies. Compared with the existing motif discovery web services, Argo_CUDA shows good prediction quality on simulated sets. The analysis of ChIP-Seq sequences revealed motifs that correspond to known transcription factor binding sites.
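To make the idea of a degenerate motif in the 15-letter IUPAC code concrete, the sketch below scans a sequence for every position where a fixed-length IUPAC motif matches. It is a plain CPU illustration of the matching step only, not the Argo_CUDA algorithm or its GPU implementation; the function name, the motif, and the test sequence are hypothetical.

```python
# The 15 IUPAC nucleotide symbols and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}

def motif_hits(motif, sequence):
    """Return 0-based start positions where the degenerate motif matches."""
    motif = motif.upper()
    sequence = sequence.upper()
    k = len(motif)
    hits = []
    for start in range(len(sequence) - k + 1):
        window = sequence[start:start + k]
        # Every window base must belong to the set allowed by the motif symbol.
        if all(base in IUPAC[sym] for sym, base in zip(motif, window)):
            hits.append(start)
    return hits

# Illustrative motif and sequence, both invented for this example
hits = motif_hits("WGATAR", "TTAGATAAGCTGATAGC")
print(hits)
```

An exhaustive search in this spirit would repeat such a scan for every candidate motif of the chosen length, which is the kind of workload that benefits from GPU parallelism.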
Brown, Jessica A.; Pack, Lindsey R.; Sherrer, Shanen M.; Kshetry, Ajay K.; Newmister, Sean A.; Fowler, Jason D.; Taylor, John-Stephen; Suo, Zucai
2010-01-01
DNA polymerase λ (Pol λ) is a novel X-family DNA polymerase that shares 34% sequence identity with DNA polymerase β (Pol β). Pre-steady state kinetic studies have shown that the Pol λ•DNA complex binds both correct and incorrect nucleotides 130-fold tighter on average than the Pol β•DNA complex, although the base substitution fidelity of both polymerases is 10^-4 to 10^-5. To better understand Pol λ’s tight nucleotide binding affinity, we created single- and double-substitution mutants of Pol λ to disrupt interactions between active site residues and an incoming nucleotide or a template base. Single-turnover kinetic assays showed that Pol λ binds to an incoming nucleotide via cooperative interactions with active site residues (R386, R420, K422, Y505, F506, A510, and R514). Disrupting protein interactions with an incoming correct or incorrect nucleotide impacted binding with each of the common structural moieties in the following order: triphosphate ≫ base > ribose. In addition, the loss of Watson-Crick hydrogen bonding between the nucleotide and template base led to a moderate increase in the Kd. The fidelity of Pol λ was maintained predominantly by a single residue, R517, which has minor groove interactions with the DNA template. PMID:20851705
Vashisht, Kapil; Verma, Sonia; Gupta, Sunita; Lynn, Andrew M; Dixit, Rajnikant; Mishra, Neelima; Valecha, Neena; Hamblin, Karleigh A; Maytum, Robin; Pandey, Kailash C; van der Giezen, Mark
2017-01-24
Charged, solvent-exposed residues at the entrance to the substrate binding site (gatekeeper residues) produce electrostatic dipole interactions with approaching substrates, and control their access by a novel mechanism called the "electrostatic gatekeeper effect". This proof-of-concept study demonstrates that nucleotide specificity can be engineered by altering the electrostatic properties of the gatekeeper residues outside the binding site. Using Blastocystis succinyl-CoA synthetase (SCS, EC 6.2.1.5), we demonstrated that the gatekeeper mutant (ED) caused the ATP-specific SCS to show high GTP specificity. Moreover, the nucleotide binding site mutant (LF) had no effect on GTP specificity and remained ATP-specific. However, by combining the gatekeeper mutant with the nucleotide binding site mutant (ED+LF), a complete reversal of nucleotide specificity was obtained with GTP, and no detectable activity was obtained with ATP. This striking result of the combined mutant (ED+LF) was due to two changes: negatively charged gatekeeper residues (ED) favored GTP access, and nucleotide binding site residues (LF) altered ATP binding, which was consistent with the hypothesis of the "electrostatic gatekeeper effect". These results were further supported by molecular modeling and simulation studies. Hence, it is imperative to extend the strategy of the gatekeeper effect to a range of other crucial enzymes (synthetases, kinases, and transferases) to engineer substrate specificity for various industrial applications and substrate-based drug design.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wynn, R. Max; Kato, Masato; Chuang, Jacinta L.
2008-10-21
Human pyruvate dehydrogenase complex (PDC) is down-regulated by pyruvate dehydrogenase kinase (PDK) isoforms 1-4. PDK4 is overexpressed in skeletal muscle in type 2 diabetes, resulting in impaired glucose utilization. Here we show that human PDK4 has robust core-free basal activity, which is considerably higher than activity levels of other PDK isoforms stimulated by the PDC core. PDK4 binds the L3 lipoyl domain, but its activity is not significantly stimulated by any individual lipoyl domains or the core of PDC. The 2.0-Å crystal structures of the PDK4 dimer with bound ADP reveal an open conformation with a wider active-site cleft, compared with that in the closed conformation epitomized by the PDK2-ADP structure. The open conformation in PDK4 shows partially ordered C-terminal cross-tails, in which the conserved DW (Asp394-Trp395) motif from one subunit anchors to the N-terminal domain of the other subunit. The open conformation fosters a reduced binding affinity for ADP, facilitating the efficient removal of product inhibition by this nucleotide. Alteration or deletion of the DW-motif disrupts the C-terminal cross-tail anchor, resulting in the closed conformation and the nearly complete inactivation of PDK4. Fluorescence quenching and enzyme activity data suggest that compounds AZD7545 and dichloroacetate lock PDK4 in the open and the closed conformational states, respectively. We propose that PDK4 with bound ADP exists in equilibrium between the open and the closed conformations. The favored metastable open conformation is responsible for the robust basal activity of PDK4 in the absence of the PDC core.
van der Meulen, Sjoerd B; de Jong, Anne; Kok, Jan
2016-01-01
RNA sequencing has revolutionized genome-wide transcriptome analyses, and the identification of non-coding regulatory RNAs in bacteria has thus increased concurrently. Here we reveal the transcriptome map of the lactic acid bacterial paradigm Lactococcus lactis MG1363 by employing differential RNA sequencing (dRNA-seq) and a combination of manual and automated transcriptome mining. This resulted in a high-resolution genome annotation of L. lactis and the identification of 60 cis-encoded antisense RNAs (asRNAs), 186 trans-encoded putative regulatory RNAs (sRNAs) and 134 novel small ORFs. Based on the putative targets of asRNAs, a novel classification is proposed. Several transcription factor DNA binding motifs were identified in the promoter sequences of (a)sRNAs, providing insight in the interplay between lactococcal regulatory RNAs and transcription factors. The presence and lengths of 14 putative sRNAs were experimentally confirmed by differential Northern hybridization, including the abundant RNA 6S that is differentially expressed depending on the available carbon source. For another sRNA, LLMGnc_147, functional analysis revealed that it is involved in carbon uptake and metabolism. L. lactis contains 13% leaderless mRNAs (lmRNAs) that, from an analysis of overrepresentation in GO classes, seem predominantly involved in nucleotide metabolism and DNA/RNA binding. Moreover, an A-rich sequence motif immediately following the start codon was uncovered, which could provide novel insight in the translation of lmRNAs. Altogether, this first experimental genome-wide assessment of the transcriptome landscape of L. lactis and subsequent sRNA studies provide an extensive basis for the investigation of regulatory RNAs in L. lactis and related lactococcal species.
Identification of a Novel Mucin Gene HCG22 Associated With Steroid-Induced Ocular Hypertension
Jeong, Shinwu; Patel, Nitin; Edlund, Christopher K.; Hartiala, Jaana; Hazelett, Dennis J.; Itakura, Tatsuo; Wu, Pei-Chang; Avery, Robert L.; Davis, Janet L.; Flynn, Harry W.; Lalwani, Geeta; Puliafito, Carmen A.; Wafapoor, Hussein; Hijikata, Minako; Keicho, Naoto; Gao, Xiaoyi; Argüeso, Pablo; Allayee, Hooman; Coetzee, Gerhard A.; Pletcher, Mathew T.; Conti, David V.; Schwartz, Stephen G.; Eaton, Alexander M.; Fini, M. Elizabeth
2015-01-01
Purpose. The pathophysiology of ocular hypertension (OH) leading to primary open-angle glaucoma shares many features with a secondary form of OH caused by treatment with glucocorticoids, but also exhibits distinct differences. In this study, a pharmacogenomics approach was taken to discover candidate genes for this disorder. Methods. A genome-wide association study was performed, followed by an independent candidate gene study, using a cohort enrolled from patients treated with off-label intravitreal triamcinolone, and handling change in IOP as a quantitative trait. Results. An intergenic quantitative trait locus (QTL) was identified at chromosome 6p21.33 near the 5′ end of HCG22 that attained the accepted statistical threshold for genome-level significance. The HCG22 transcript, encoding a novel mucin protein, was expressed in trabecular meshwork cells, and expression was stimulated by IL-1, and inhibited by triamcinolone acetate and TGF-β. Bioinformatic analysis defined the QTL as an approximately 4 kilobase (kb) linkage disequilibrium block containing 10 common single nucleotide polymorphisms (SNPs). Four of these SNPs were identified in the National Center for Biotechnology Information (NCBI) GTEx eQTL browser as modifiers of HCG22 expression. Most are predicted to disrupt or improve motifs for transcription factor binding, the most relevant being disruption of the glucocorticoid receptor binding motif. A second QTL was identified within the predicted signal peptide of the HCG22 encoded protein that could affect its secretion. Translation, O-glycosylation, and secretion of the predicted HCG22 protein was verified in cultured trabecular meshwork cells. Conclusions. Identification of two independent QTLs that could affect expression of the HCG22 mucin gene product via two different mechanisms (transcription or secretion) is highly suggestive of a role in steroid-induced OH. PMID:25813999
Adenosine triphosphate (ATP) reduces amyloid-β protein misfolding in vitro.
Coskuner, Orkid; Murray, Ian V J
2014-01-01
Alzheimer's disease (AD) is a devastating disease of aging that initiates decades prior to clinical manifestation and represents an impending epidemic. Two early features of AD are metabolic dysfunction and changes in amyloid-β protein (Aβ) levels. Since levels of ATP decrease over the course of the disease and Aβ is an early biomarker of AD, we sought to uncover novel linkages between the two. First and remarkably, a GxxxG motif is common to both Aβ (oligomerization motif) and nucleotide binding proteins (Rossmann fold). Second, ATP was demonstrated to protect against Aβ-mediated cytotoxicity. Last, there is structural similarity between ATP and amyloid binding/inhibitory compounds such as ThioT, melatonin, and indoles. Thus, we investigated whether ATP alters misfolding of the pathologically relevant Aβ42. To test this hypothesis, we performed computational and biochemical studies. Our computational studies demonstrate that ATP interacts strongly with Tyr10 and Ser26 of Aβ fibrils in solution. Experimentally, both ATP and ADP reduced Aβ misfolding at physiological intracellular concentrations, with thresholds at ~500 μM and 1 mM, respectively. This inhibition of Aβ misfolding is specific: it requires Tyr10 of Aβ and is enhanced by magnesium. Last, cerebrospinal fluid ATP levels are in the nanomolar range and decreased with AD pathology. This initial and novel finding regarding the ATP interaction with Aβ and reduction of Aβ misfolding has potential significance to the AD field. It provides an underlying mechanism for published links between metabolic dysfunction and AD. It also suggests a potential role of ATP in AD pathology, as the occurrence of misfolded extracellular Aβ mirrors lowered extracellular ATP levels. Last, the findings suggest that Aβ conformation change may be a sensor of metabolic dysfunction.
Wynn, R Max; Kato, Masato; Chuang, Jacinta L; Tso, Shih-Chia; Li, Jun; Chuang, David T
2008-09-12
Human pyruvate dehydrogenase complex (PDC) is down-regulated by pyruvate dehydrogenase kinase (PDK) isoforms 1-4. PDK4 is overexpressed in skeletal muscle in type 2 diabetes, resulting in impaired glucose utilization. Here we show that human PDK4 has robust core-free basal activity, which is considerably higher than activity levels of other PDK isoforms stimulated by the PDC core. PDK4 binds the L3 lipoyl domain, but its activity is not significantly stimulated by any individual lipoyl domains or the core of PDC. The 2.0-A crystal structures of the PDK4 dimer with bound ADP reveal an open conformation with a wider active-site cleft, compared with that in the closed conformation epitomized by the PDK2-ADP structure. The open conformation in PDK4 shows partially ordered C-terminal cross-tails, in which the conserved DW (Asp(394)-Trp(395)) motif from one subunit anchors to the N-terminal domain of the other subunit. The open conformation fosters a reduced binding affinity for ADP, facilitating the efficient removal of product inhibition by this nucleotide. Alteration or deletion of the DW-motif disrupts the C-terminal cross-tail anchor, resulting in the closed conformation and the nearly complete inactivation of PDK4. Fluorescence quenching and enzyme activity data suggest that compounds AZD7545 and dichloroacetate lock PDK4 in the open and the closed conformational states, respectively. We propose that PDK4 with bound ADP exists in equilibrium between the open and the closed conformations. The favored metastable open conformation is responsible for the robust basal activity of PDK4 in the absence of the PDC core.
Structural modeling and molecular simulation analysis of HvAP2/EREBP from barley.
Pandey, Bharati; Sharma, Pradeep; Tyagi, Chetna; Goyal, Sukriti; Grover, Abhinav; Sharma, Indu
2016-06-01
AP2/ERF transcription factors play a critical role in plant development and stress adaptation. This study reports the three-dimensional ab initio-based model of AP2/EREBP protein of barley and its interaction with DNA. Full-length coding sequence of the HvAP2/EREBP gene isolated from two Indian barley cultivars, RD 2503 and RD 31, was used to model the protein. Of five protein models obtained, the one with lowest C-score was chosen for further analysis. The N- and C-terminal regions of HvAP2 protein were found to be highly disordered. The dynamic properties of AP2/EREBP and its interaction with DNA were investigated by molecular dynamics simulation. Analysis of trajectories from simulation yielded the equilibrated conformation between 2-10 ns for protein and 7-15 ns for protein-DNA complex. The relationship between DNA containing a GCC box and the DNA-binding domain of HvAP2/EREBP was established by modeling an 11-base-pair-long nucleotide sequence and the HvAP2/EREBP protein using an ab initio method. Analysis of protein-DNA interaction showed that a β-sheet motif comprising amino acid residues THR105, ARG100, ARG93, and ARG83 seems to play an important role in stabilizing the complex, as these residues form strong hydrogen bond interactions with the DNA motif. Taken together, this study provides first-hand comprehensive information detailing structural conformation and interactions of HvAP2/EREBP proteins in barley. The study underscores the role of computational approaches for preliminary examination of unknown proteins in the absence of experimental information. It also provides molecular insight into protein-DNA binding for understanding and enhancing abiotic stress resistance for improving the water use efficiency in crop plants.
Amelio, Antonio L.; McAnany, Peterjon K.; Bloom, David C.
2006-01-01
A previous study demonstrated that the latency-associated transcript (LAT) promoter and the LAT enhancer/reactivation critical region (rcr) are enriched in acetyl histone H3 (K9, K14) during herpes simplex virus type 1 (HSV-1) latency, whereas all lytic genes analyzed (ICP0, UL54, ICP4, and DNA polymerase) are not (N. J. Kubat, R. K. Tran, P. McAnany, and D. C. Bloom, J. Virol. 78:1139-1149, 2004). This suggests that the HSV-1 latent genome is organized into histone H3 (K9, K14) hyperacetylated and hypoacetylated regions corresponding to transcriptionally permissive and transcriptionally repressed chromatin domains, respectively. Such an organization implies that chromatin insulators, similar to those of cellular chromosomes, may separate distinct transcriptional domains of the HSV-1 latent genome. In the present study, we sought to identify cis elements that could partition the HSV-1 genome into distinct chromatin domains. Sequence analysis coupled with chromatin immunoprecipitation and luciferase reporter assays revealed that (i) the long and short repeats and the unique-short region of the HSV-1 genome contain clustered CTCF (CCCTC-binding factor) motifs, (ii) CTCF motif clusters similar to those in HSV-1 are conserved in other alphaherpesviruses, (iii) CTCF binds to these motifs on latent HSV-1 genomes in vivo, and (iv) a 1.5-kb region containing the CTCF motif cluster in the LAT region possesses insulator activities, specifically, enhancer blocking and silencing. The finding that CTCF, a cellular protein associated with chromatin insulators, binds to motifs on the latent genome and insulates the LAT enhancer suggests that CTCF may facilitate the formation of distinct chromatin boundaries during herpesvirus latency. PMID:16474142
Analysis of zinc binding sites in protein crystal structures.
Alberts, I L; Nadassy, K; Wodak, S J
1998-08-01
The geometrical properties of zinc binding sites in a dataset of high quality protein crystal structures deposited in the Protein Data Bank have been examined to identify important differences between zinc sites that are directly involved in catalysis and those that play a structural role. Coordination angles in the zinc primary coordination sphere are compared with ideal values for each coordination geometry, and zinc coordination distances are compared with those in small zinc complexes from the Cambridge Structural Database as a guide of expected trends. We find that distances and angles in the primary coordination sphere are in general close to the expected (or ideal) values. Deviations occur primarily for oxygen coordinating atoms and are found to be mainly due to H-bonding of the oxygen coordinating ligand to protein residues, bidentate binding arrangements, and multi-zinc sites. We find that H-bonding of oxygen containing residues (or water) to zinc bound histidines is almost universal in our dataset and defines the elec-His-Zn motif. Analysis of the stereochemistry shows that carboxyl elec-His-Zn motifs are geometrically rigid, while water elec-His-Zn motifs show the most geometrical variation. As catalytic motifs have a higher proportion of carboxyl elec atoms than structural motifs, they provide a more rigid framework for zinc binding. This is understood biologically, as a small distortion in the zinc position in an enzyme can have serious consequences on the enzymatic reaction. We also analyze the sequence pattern of the zinc ligands and residues that provide elecs, and identify conserved hydrophobic residues in the endopeptidases that also appear to contribute to stabilizing the catalytic zinc site. A zinc binding template in protein crystal structures is derived from these observations.
The glycine-rich motif of Pyrococcus abyssi DNA polymerase D is critical for protein stability.
Castrec, Benoît; Laurent, Sébastien; Henneke, Ghislaine; Flament, Didier; Raffin, Jean-Paul
2010-03-05
A glycine-rich motif described as being involved in human polymerase delta proliferating cell nuclear antigen (PCNA) binding has also been identified in all euryarchaeal DNA polymerase D (Pol D) family members. We redefined the motif as the (G)-PYF box. In the present study, Pol D (G)-PYF box motif mutants from Pyrococcus abyssi were generated to investigate its role in functional interactions with the cognate PCNA. We demonstrated that this motif is not essential for interactions between PabPol D (P. abyssi Pol D) and PCNA, using surface plasmon resonance and primer extension studies. Interestingly, the (G)-PYF box is located in a hydrophobic region close to the active site. The (G)-PYF box mutants exhibited altered DNA binding properties. In addition, the thermal stability of all mutants was reduced compared to that of wild type, and this effect could be attributed to increased exposure of the hydrophobic region. These studies suggest that the (G)-PYF box motif mediates intersubunit interactions and that it may be crucial for the thermostability of PabPol D. (c) 2010 Elsevier Ltd. All rights reserved.
Sidell, Neil; Mathad, Raveendra I.; Shu, Feng-jue; Zhang, Zhenjiang; Kallen, Caleb B.; Yang, Danzhou
2011-01-01
DNA-intercalating molecules can impair DNA replication, DNA repair, and gene transcription. We previously demonstrated that XR5944, a DNA bis-intercalator, specifically blocks binding of estrogen receptor-α (ERα) to the consensus estrogen response element (ERE). The consensus ERE sequence is AGGTCAnnnTGACCT, where nnn is known as the tri-nucleotide spacer. Recent work has shown that the tri-nucleotide spacer can modulate ERα-ERE binding affinity and ligand-mediated transcriptional responses. To further understand the mechanism by which XR5944 inhibits ERα-ERE binding, we tested its ability to interact with consensus EREs with variable tri-nucleotide spacer sequences and with natural but non-consensus ERE sequences using one dimensional nuclear magnetic resonance (1D 1H NMR) titration studies. We found that the tri-nucleotide spacer sequence significantly modulates the binding of XR5944 to EREs. Of the sequences that were tested, EREs with CGG and AGG spacers showed the best binding specificity with XR5944, while those spaced with TTT demonstrated the least specific binding. The binding stoichiometry of XR5944 with EREs was 2:1, which can explain why the spacer influences the drug-DNA interaction; each XR5944 spans four nucleotides (including portions of the spacer) when intercalating with DNA. To validate our NMR results, we conducted functional studies using reporter constructs containing consensus EREs with tri-nucleotide spacers CGG, CTG, and TTT. Results of reporter assays in MCF-7 cells indicated that XR5944 was significantly more potent in inhibiting the activity of CGG- than TTT-spaced EREs, consistent with our NMR results. Taken together, these findings predict that the anti-estrogenic effects of XR5944 will depend not only on ERE half-site composition but also on the tri-nucleotide spacer sequence of EREs located in the promoters of estrogen-responsive genes. PMID:21333738
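The consensus ERE described above (AGGTCAnnnTGACCT) lends itself to a simple pattern scan. The following is a minimal sketch, not the authors' method: it locates consensus EREs in a DNA string and reports the tri-nucleotide spacer of each. The function name `find_consensus_eres` and the example sequence are hypothetical.

```python
import re

# Consensus ERE from the abstract: AGGTCA nnn TGACCT (nnn = tri-nucleotide spacer).
# A lookahead is used so that overlapping sites would also be reported.
ERE_PATTERN = re.compile(r"(?=(AGGTCA([ACGT]{3})TGACCT))")


def find_consensus_eres(sequence):
    """Return a list of (start_index, spacer) for each consensus ERE found."""
    seq = sequence.upper()
    return [(m.start(), m.group(2)) for m in ERE_PATTERN.finditer(seq)]


# Hypothetical test sequence carrying one CGG-spaced and one TTT-spaced ERE.
example = "ttAGGTCACGGTGACCTaaAGGTCATTTTGACCTgg"
for start, spacer in find_consensus_eres(example):
    print(start, spacer)
```

Grouping hits by spacer in this way would be one route to separating, for example, CGG-spaced from TTT-spaced elements before comparing them.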
Ruddock, L. W.; Freedman, R. B.; Klappa, P.
2000-01-01
Using a cross-linking approach, we recently demonstrated that radiolabeled peptides or misfolded proteins specifically interact in vitro with two luminal proteins in crude extracts from pancreas microsomes. The proteins were the folding catalysts protein disulfide isomerase (PDI) and PDIp, a glycosylated, PDI-related protein, expressed exclusively in the pancreas. In this study, we explore the specificity of these proteins in binding peptides and related ligands and show that tyrosine and tryptophan residues in peptides are the recognition motifs for their binding by PDIp. This peptide-binding specificity may reflect the selectivity of PDIp in binding regions of unfolded polypeptide during catalysis of protein folding. PMID:10794419
A Feature-Based Approach to Modeling Protein–DNA Interactions
Segal, Eran
2008-01-01
Transcription factor (TF) binding to its DNA target site is a fundamental regulatory interaction. The most common model used to represent TF binding specificities is a position specific scoring matrix (PSSM), which assumes independence between binding positions. However, in many cases, this simplifying assumption does not hold. Here, we present feature motif models (FMMs), a novel probabilistic method for modeling TF–DNA interactions, based on log-linear models. Our approach uses sequence features to represent TF binding specificities, where each feature may span multiple positions. We develop the mathematical formulation of our model and devise an algorithm for learning its structural features from binding site data. We also developed a discriminative motif finder, which discovers de novo FMMs that are enriched in target sets of sequences compared to background sets. We evaluate our approach on synthetic data and on the widely used TF chromatin immunoprecipitation (ChIP) dataset of Harbison et al. We then apply our algorithm to high-throughput TF ChIP data from mouse and human, reveal sequence features that are present in the binding specificities of mouse and human TFs, and show that FMMs explain TF binding significantly better than PSSMs. Our FMM learning and motif finder software are available at http://genie.weizmann.ac.il/. PMID:18725950
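To make the independence assumption of a PSSM concrete, the sketch below builds a log-odds matrix from a handful of aligned sites and scores a candidate as a plain sum over positions. It illustrates the baseline PSSM only, not the FMM method; the binding sites, the pseudocount and the uniform background are assumptions chosen for illustration.

```python
import math

BASES = "ACGT"


def build_pssm(sites, pseudocount=1.0, background=0.25):
    """Build a log2-odds PSSM from equal-length aligned binding sites."""
    pssm = []
    for pos in range(len(sites[0])):
        # Start every base at the pseudocount so unseen bases get a finite score.
        counts = {b: pseudocount for b in BASES}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        pssm.append({b: math.log2(counts[b] / total / background) for b in BASES})
    return pssm


def score(pssm, candidate):
    """Score a candidate site; each position contributes independently."""
    return sum(column[base] for column, base in zip(pssm, candidate))


# Hypothetical aligned sites (not taken from any dataset in the abstract).
sites = ["TAATTG", "TAATCG", "TAATTA", "TAATGG"]
pssm = build_pssm(sites)
print(score(pssm, "TAATTG"), score(pssm, "GCGCGC"))
```

Because the total is a sum of per-position terms, a model of this form cannot reward or penalize a particular combination of bases at two positions, which is the limitation that features spanning multiple positions are meant to address.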
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.
Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook
2014-11-01
As many structures of protein-DNA complexes have become known in recent years, several computational methods have been developed to predict DNA-binding sites in proteins. However, the inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure of 66.3% and an MCC of 0.324. The other SVM model, which uses both DNA and protein sequences, achieved an accuracy of 69.6%, an F-measure of 69.6%, and an MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and an MCC of 0.329. Both in cross-validation and in independent testing, the second SVM model, which used both DNA and protein sequence data, performed better than the first model, which used DNA sequence data only. To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
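The three figures of merit quoted above (accuracy, F-measure, MCC) all derive from a binary confusion matrix. The sketch below shows the standard definitions; the counts passed in are hypothetical and are not the confusion matrix of the study, which the abstract does not report.

```python
import math


def classification_metrics(tp, fp, tn, fn):
    """Return (accuracy, F-measure, MCC) from binary confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f_measure = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    # MCC uses all four cells, so it stays informative on imbalanced classes.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return accuracy, f_measure, mcc


# Hypothetical, perfectly balanced counts chosen only for illustration.
acc, f1, mcc = classification_metrics(tp=67, fp=33, tn=67, fn=33)
print(round(acc, 3), round(f1, 3), round(mcc, 3))
```

On a balanced set with symmetric errors, as in this toy case, MCC reduces to 2 × accuracy − 1, which is why an accuracy in the high sixties corresponds to an MCC in the low 0.3 range.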
Xu, Jian-Zhong; Yang, Han-Kun; Liu, Li-Ming; Wang, Ying-Yu; Zhang, Wei-Guo
2018-03-25
l-lysine is an important amino acid in animals and humans and NADPH is a vital cofactor for maximizing the efficiency of l-lysine fermentation. Dihydrodipicolinate reductase (DHDPR), an NAD(P)H-dependent enzyme, shows a variance in nucleotide-cofactor affinity in bacteria. In this study, we rationally engineered Corynebacterium glutamicum DHDPR (CgDHDPR) to switch its nucleotide-cofactor specificity resulting in an increase in final titer (from 82.6 to 117.3 g L⁻¹), carbon yield (from 0.35 to 0.44 g [g glucose]⁻¹) and productivity (from 2.07 to 2.93 g L⁻¹ h⁻¹) of l-lysine in JL-6 ΔdapB::Ec-dapB C115G,G116C in fed-batch fermentation. To do this, we comparatively analyzed the characteristics of CgDHDPR and Escherichia coli DHDPR (EcDHDPR), indicating that hetero-expression of NADH-dependent EcDHDPR increased l-lysine production. Subsequently, we rationally modified the conserved structure of cofactor-binding motif, and results indicated that introducing the mutation K11A or R13A in CgDHDPR and introducing the mutation R16A or R39A in EcDHDPR modifies the nucleotide-cofactor affinity of DHDPR. Lastly, the effects of these mutated DHDPRs on l-lysine production were investigated. The highest increase (26.2%) in l-lysine production was observed for JL-6 ΔdapB::Ec-dapB C115G,G116C, followed by JL-6 Cg-dapB C37G,G38C (21.4%) and JL-6 ΔdapB::Ec-dapB C46G,G47C (15.2%). This is the first report of a rational modification of DHDPR that enhances the l-lysine production and yield through the modulation of nucleotide-cofactor specificity. © 2018 Wiley Periodicals, Inc.
Sequence information gain based motif analysis.
Maynou, Joan; Pairó, Erola; Marco, Santiago; Perera, Alexandre
2015-11-09
The detection of regulatory regions in candidate sequences is essential for the understanding of the regulation of a particular gene and the mechanisms involved. This paper proposes a novel methodology based on information theoretic metrics for finding regulatory sequences in promoter regions. This methodology (SIGMA) has been tested on genomic sequence data for Homo sapiens and Mus musculus. SIGMA has been compared with different publicly available alternatives for motif detection, such as MEME/MAST, Biostrings (Bioconductor package), MotifRegressor, and previous work such as Qresiduals projections or information theoretic based detectors. Comparative results, in the form of Receiver Operating Characteristic curves, show that, in 70% of the studied Transcription Factor Binding Sites, the SIGMA detector performs better and behaves more robustly than the methods compared, while having a similar computational time. The performance of SIGMA can be explained by its parametric simplicity in the modelling of the non-linear co-variability in the binding motif positions. Sequence Information Gain based Motif Analysis is a generalisation of a non-linear model of the cis-regulatory sequences detection based on Information Theory. This generalisation allows us to detect transcription factor binding sites with maximum performance disregarding the covariability observed in the positions of the training set of sequences. SIGMA is freely available to the public at http://b2slab.upc.edu.
ATtRACT-a database of RNA-binding proteins and associated motifs.
Giudice, Girolamo; Sánchez-Cabo, Fátima; Torroja, Carlos; Lara-Pezzi, Enrique
2016-01-01
RNA-binding proteins (RBPs) play a crucial role in key cellular processes, including RNA transport, splicing, polyadenylation and stability. Understanding the interaction between RBPs and RNA is key to improving our knowledge of RNA processing, localization and regulation in a global manner. Despite advances in recent years, a unified non-redundant resource that includes information on experimentally validated motifs, RBPs and integrated tools to exploit this information is lacking. Here, we developed a database named ATtRACT (available at http://attract.cnic.es) that compiles information on 370 RBPs and 1583 RBP consensus binding motifs, 192 of which are not present in any other database. To populate ATtRACT we (i) extracted and hand-curated experimentally validated data from the CISBP-RNA, SpliceAid-F and RBPDB databases, (ii) integrated and updated the unavailable ASD database and (iii) extracted information from protein-RNA complexes present in the Protein Data Bank database through computational analyses. ATtRACT also provides efficient algorithms to search for a specific motif and scan one or more RNA sequences at a time. It also allows discovering de novo motifs enriched in a set of related sequences and comparing them with the motifs included in the database. Database URL: http://attract.cnic.es. © The Author(s) 2016. Published by Oxford University Press.
Identification of sequence-structure RNA binding motifs for SELEX-derived aptamers.
Hoinka, Jan; Zotenko, Elena; Friedman, Adam; Sauna, Zuben E; Przytycka, Teresa M
2012-06-15
Systematic Evolution of Ligands by EXponential Enrichment (SELEX) represents a state-of-the-art technology to isolate single-stranded (ribo)nucleic acid fragments, named aptamers, which bind to a molecule (or molecules) of interest via specific structural regions induced by their sequence-dependent fold. This powerful method has applications in designing protein inhibitors, molecular detection systems, therapeutic drugs and antibody replacement among others. However, full understanding and consequently optimal utilization of the process has lagged behind its wide application due to the lack of dedicated computational approaches. At the same time, the combination of SELEX with novel sequencing technologies is beginning to provide the data that will allow the examination of a variety of properties of the selection process. To close this gap, we developed Aptamotif, a computational method for the identification of sequence-structure motifs in SELEX-derived aptamers. To increase the chances of identifying functional motifs, Aptamotif uses an ensemble-based approach. We validated the method using two published aptamer datasets containing experimentally determined motifs of increasing complexity. We were able to recreate the authors' findings to a high degree, thus demonstrating the capability of our approach to identify binding motifs in SELEX data. Additionally, using our new experimental dataset, we illustrate the application of Aptamotif to elucidate several properties of the selection process.
Will, Katrin; Warnecke, Gabriele; Wiesmüller, Lisa; Deppert, Wolfgang
1998-01-01
Mutant, but not wild-type p53 binds with high affinity to a variety of MAR-DNA elements (MARs), suggesting that MAR-binding of mutant p53 relates to the dominant-oncogenic activities proposed for mutant p53. MARs recognized by mutant p53 share AT richness and contain variations of an AATATATTT “DNA-unwinding motif,” which enhances the structural dynamics of chromatin and promotes regional DNA base-unpairing. Mutant p53 specifically interacted with MAR-derived oligonucleotides carrying such unwinding motifs, catalyzing DNA strand separation when this motif was located within a structurally labile sequence environment. Addition of GC-clamps to the respective MAR-oligonucleotides or introducing mutations into the unwinding motif strongly reduced DNA strand separation, but supported the formation of tight complexes between mutant p53 and such oligonucleotides. We conclude that the specific interaction of mutant p53 with regions of MAR-DNA with a high potential for base-unpairing provides the basis for the high-affinity binding of mutant p53 to MAR-DNA. PMID:9811860
Seet, Bruce T; Berry, Donna M; Maltzman, Jonathan S; Shabason, Jacob; Raina, Monica; Koretzky, Gary A; McGlade, C Jane; Pawson, Tony
2007-02-07
The relationship between the binding affinity and specificity of modular interaction domains is potentially important in determining biological signaling responses. In signaling from the T-cell receptor (TCR), the Gads C-terminal SH3 domain binds a core RxxK sequence motif in the SLP-76 scaffold. We show that residues surrounding this motif are largely optimized for binding the Gads C-SH3 domain resulting in a high-affinity interaction (K(D)=8-20 nM) that is essential for efficient TCR signaling in Jurkat T cells, since Gads-mediated signaling declines with decreasing affinity. Furthermore, the SLP-76 RxxK motif has evolved a very high specificity for the Gads C-SH3 domain. However, TCR signaling in Jurkat cells is tolerant of potential SLP-76 crossreactivity, provided that very high-affinity binding to the Gads C-SH3 domain is maintained. These data provide a quantitative argument that the affinity of the Gads C-SH3 domain for SLP-76 is physiologically important and suggest that the integrity of TCR signaling in vivo is sustained both by strong selection of SLP-76 for the Gads C-SH3 domain and by a capacity to buffer intrinsic crossreactivity.
Sacristán-Reviriego, Almudena; Madrid, Marisa; Cansado, José; Martín, Humberto; Molina, María
2014-01-01
Dual-specificity MAPK phosphatases (MKPs) are essential for the negative regulation of MAPK pathways. Similar to other MAPK-interacting proteins, most MKPs bind MAPKs through specific docking domains known as D-motifs. However, we found that the Saccharomyces cerevisiae MKP Msg5 binds the MAPK Slt2 within the cell wall integrity (CWI) pathway through a distinct motif (IYT). Here, we demonstrate that the IYT motif mediates binding of the Msg5 paralogue Sdp1 to Slt2 as well as of the MKP Pmp1 to its CWI MAPK counterpart Pmk1 in the evolutionarily distant yeast Schizosaccharomyces pombe. As a consequence, removal of the IYT site in Msg5, Sdp1 and Pmp1 reduces MAPK trapping caused by the overexpression of catalytically inactive versions of these phosphatases. Accordingly, an intact IYT site is necessary for inactive Sdp1 to prevent nuclear accumulation of Slt2. We also show that both Ile and Tyr but not Thr are essential for the functionality of the IYT motif. These results provide mechanistic insight into MKP-MAPK interplay and stress the relevance of this conserved non-canonical docking site in the regulation of the CWI pathway in fungi. PMID:24465549
NF-Y Binding Site Architecture Defines a C-Fos Targeted Promoter Class
Haubrock, Martin; Hartmann, Fabian; Wingender, Edgar
2016-01-01
ChIP-seq experiments detect the chromatin occupancy of known transcription factors in a genome-wide fashion. The comparisons of several species-specific ChIP-seq libraries done for different transcription factors have revealed a complex combinatorial and context-specific co-localization behavior for the identified binding regions. In this study we have investigated human derived ChIP-seq data to identify common cis-regulatory principles for the human transcription factor c-Fos. We found that in four different cell lines, c-Fos targeted proximal and distal genomic intervals show prevalences for either AP-1 motifs or CCAAT boxes as known binding motifs for the transcription factor NF-Y, and thereby act in a mutually exclusive manner. For proximal regions of co-localized c-Fos and NF-YB binding, we gathered evidence that a characteristic configuration of repeating CCAAT motifs may be responsible for attracting c-Fos, probably provided by a nearby AP-1 bound enhancer. Our results suggest a novel regulatory function of NF-Y in gene-proximal regions. Specific CCAAT dimer repeats bound by the transcription factor NF-Y define this novel cis-regulatory module. Based on this behavior we propose a new enhancer promoter interaction model based on AP-1 motif defined enhancers which interact with CCAAT-box characterized promoter regions. PMID:27517874
Conserved and divergent features of the structure and function of La and La-related proteins (LARPs)
Bayfield, Mark A.; Yang, Ruiqing; Maraia, Richard J.
2010-01-01
Genuine La proteins contain two RNA binding motifs, a La motif (LAM) followed by a RNA recognition motif (RRM), arranged in a unique way to bind RNA. These proteins interact with an extensive variety of cellular RNAs and exhibit activities in two broad categories: i) to promote the metabolism of nascent pol III transcripts, including precursor-tRNAs, by binding to their common, UUU-3’OH containing ends, and ii) to modulate the translation of certain mRNAs involving an unknown binding mechanism. Characterization of several La-RNA crystal structures as well as biochemical studies reveal insight into their unique two-motif domain architecture and how the LAM recognizes UUU-3’OH while the RRM binds other parts of a pre-tRNA. Recent studies of members of distinct families of conserved La-related proteins (LARPs) indicate that some of these harbor activity related to genuine La proteins, suggesting that their UUU-3’OH binding mode has been appropriated for the assembly and regulation of a specific snRNP (e.g., 7SK snRNA assembly by hLARP7/PIP7S). Analyses of other LARP family members (i.e., hLARP4, hLARP6) suggest more diverged RNA binding modes and specialization for cytoplasmic mRNA-related functions. Thus it appears that while genuine La proteins exhibit broad general involvement in both snRNA-related and mRNA-related functions, different LARP families may have evolved specialized activities in either snRNA or mRNA related functions. In this review, we summarize recent progress that has led to greater understanding of the structure and function of La proteins and their roles in tRNA processing and RNP assembly dynamics, as well as progress on the different LARPs. PMID:20138158
Pustovalova, Yulia; Magalhães, Mariana T. Q.; D’Souza, Sanjay; Rizzo, Alessandro A.; Korza, George; Walker, Graham C.; Korzhnev, Dmitry M.
2016-01-01
Translesion synthesis (TLS) is a mutagenic branch of cellular DNA damage tolerance that enables bypass replication over DNA lesions carried out by specialized low-fidelity DNA polymerases. The replicative bypass of most types of DNA damage is performed in a two-step process of Rev1/Polζ-dependent TLS. In the first step, a Y-family TLS enzyme, typically Polη, Polι or Polκ, inserts a nucleotide across DNA lesion. In the second step, a four-subunit B-family DNA polymerase Polζ (Rev3/Rev7/PolD2/PolD3 complex) extends the distorted DNA primer-template. The coordinated action of error-prone TLS enzymes is regulated through their interactions with the two scaffold proteins, the sliding clamp PCNA and the TLS polymerase Rev1. Rev1 interactions with all other TLS enzymes are mediated by its C-terminal domain (Rev1-CT), which can simultaneously bind the Rev7 subunit of Polζ and Rev1-interacting regions (RIRs) from Polη, Polι or Polκ. In this work, we identified a previously unknown RIR motif in the C-terminal part of PolD3 subunit of Polζ whose interaction with the Rev1-CT is among the tightest mediated by RIR motifs. Three-dimensional structure of the Rev1-CT/PolD3-RIR complex determined by NMR spectroscopy revealed a structural basis for the relatively high affinity of this interaction. The unexpected discovery of PolD3-RIR motif suggests a mechanism of 'inserter' to 'extender' DNA polymerase switch upon Rev1/Polζ-dependent TLS, in which the PolD3-RIR binding to the Rev1-CT (i) helps displace the 'inserter' Polη, Polι or Polκ from its complex with Rev1, and (ii) facilitates assembly of the four-subunit 'extender' Polζ through simultaneous interaction of Rev1-CT with Rev7 and PolD3 subunits. PMID:26982350
Guo, Changjiang; Sun, Xiaoguang; Chen, Xiao; Yang, Sihai; Li, Jing; Wang, Long; Zhang, Xiaohui
2016-01-01
Most rice blast resistance genes (R-genes) encode proteins with nucleotide-binding site (NBS) and leucine-rich repeat (LRR) domains. Our previous study has shown that more rice blast R-genes can be cloned in rapidly evolving NBS-LRR gene families. In the present study, two rapidly evolving R-gene families in rice were selected for cloning a subset of genes from their paralogs in three resistant rice lines. A total of eight functional blast R-genes were identified among nine NBS-LRR genes, and some of these showed resistance to three or more blast strains. Evolutionary analysis indicated that high nucleotide diversity of coding regions served as important parameters in the determination of gene resistance. We also observed that amino-acid variants (nonsynonymous mutations, insertions, or deletions) in essential motifs of the NBS domain contribute to the blast resistance capacity of NBS-LRR genes. These results suggested that the NBS regions might also play an important role in resistance specificity determination. On the other hand, different splicing patterns of introns were commonly observed in R-genes. The results of the present study contribute to improving the effectiveness of R-gene identification by using evolutionary analysis method and acquisition of novel blast resistance genes.
The Nucleotide Sequence and Spliced pol mRNA Levels of the Nonprimate Spumavirus Bovine Foamy Virus
Holzschu, Donald L.; Delaney, Mari A.; Renshaw, Randall W.; Casey, James W.
1998-01-01
We have determined the complete nucleotide sequence of a replication-competent clone of bovine foamy virus (BFV) and have quantitated the amount of spliced pol mRNA processed early in infection. The 544-amino-acid Gag protein precursor has little sequence similarity with its primate foamy virus homologs, but the putative nucleocapsid (NC) protein, like the primate NCs, contains the three glycine-arginine-rich regions that are postulated to bind genomic RNA during virion assembly. The BFV gag and pol open reading frames overlap, with pro and pol in the same translational frame. As with the human foamy virus (HFV) and feline foamy virus, we have detected a spliced pol mRNA by PCR. Quantitatively, this mRNA approximates the level of full-length genomic RNA early in infection. The integrase (IN) domain of reverse transcriptase does not contain the canonical HH-CC zinc finger motif present in all characterized retroviral INs, but it does contain a nearby histidine residue that could conceivably participate as a member of the zinc finger. The env gene encodes a protein that is over 40% identical in sequence to the HFV Env. By comparison, the Gag precursor of BFV is predicted to be only 28% identical to the HFV protein. PMID:9499074
Sloma, Michael F; Mathews, David H
2016-12-01
RNA secondary structure prediction is widely used to analyze RNA sequences. In an RNA partition function calculation, free energy nearest neighbor parameters are used in a dynamic programming algorithm to estimate statistical properties of the secondary structure ensemble. Previously, partition functions have largely been used to estimate the probability that a given pair of nucleotides form a base pair, the conditional stacking probability, the accessibility to binding of a continuous stretch of nucleotides, or a representative sample of RNA structures. Here it is demonstrated that an RNA partition function can also be used to calculate the exact probability of formation of hairpin loops, internal loops, bulge loops, or multibranch loops at a given position. This calculation can also be used to estimate the probability of formation of specific helices. Benchmarking on a set of RNA sequences with known secondary structures indicated that loops that were calculated to be more probable were more likely to be present in the known structure than less probable loops. Furthermore, highly probable loops are more likely to be in the known structure than the set of loops predicted in the lowest free energy structures. © 2016 Sloma and Mathews; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
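The idea of a loop probability can be illustrated on a toy ensemble: the probability of a loop is the summed Boltzmann weight of every structure containing it, divided by the partition function. The sketch below enumerates structures explicitly, which is only feasible for toy cases and is not the dynamic programming algorithm described in the abstract. The free energies, loop labels and the 37 °C temperature are assumptions.

```python
import math

# Gas constant (kcal mol^-1 K^-1) times an assumed temperature of 37 degrees C.
RT = 0.0019872 * 310.15


def loop_probability(ensemble, loop):
    """Probability that `loop` occurs, given (free_energy, set_of_loops) pairs."""
    weights = [(math.exp(-dg / RT), loops) for dg, loops in ensemble]
    partition_function = sum(w for w, _ in weights)
    # Sum the Boltzmann weights of all structures that contain the loop.
    containing = sum(w for w, loops in weights if loop in loops)
    return containing / partition_function


# Hypothetical ensemble: free energies in kcal/mol and made-up loop labels.
toy_ensemble = [
    (-3.0, {"hairpin@10"}),
    (-2.5, {"hairpin@10", "bulge@4"}),
    (-1.0, {"internal@6"}),
    (0.0, set()),  # the unfolded state
]

for name in ("hairpin@10", "bulge@4", "internal@6"):
    print(name, round(loop_probability(toy_ensemble, name), 3))
```

Note that a loop shared by several low-energy structures can be highly probable even when no single structure dominates the ensemble, which is consistent with the abstract's observation that highly probable loops outperform loops taken from the lowest free energy structure alone.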
Li de La Sierra-Gallay, Ines; Collinet, Bruno; Graille, Marc; Quevillon-Cheruel, Sophie; Liger, Dominique; Minard, Philippe; Blondeau, Karine; Henckes, Gilles; Aufrère, Robert; Leulliot, Nicolas; Zhou, Cong-Zhao; Sorel, Isabelle; Ferrer, Jean-Luc; Poupon, Anne; Janin, Joël; van Tilbeurgh, Herman
2004-03-01
The protein product of the YGR205w gene of Saccharomyces cerevisiae was targeted as part of our yeast structural genomics project. YGR205w codes for a small (290 amino acids) protein with unknown structure and function. The only recognizable sequence feature is the presence of a Walker A motif (P loop) indicating a possible nucleotide binding/converting function. We determined the three-dimensional crystal structure of Se-methionine substituted protein using multiple anomalous diffraction. The structure revealed a well known mononucleotide fold and strong resemblance to the structure of small metabolite phosphorylating enzymes such as pantothenate kinase and phosphoribulokinase. Biochemical experiments show that YGR205w specifically binds ATP and, less tightly, ADP. The structure also revealed the presence of two bound sulphate ions, occupying opposite niches in a canyon that corresponds to the active site of the protein. One sulphate is bound to the P-loop in a position that corresponds to the position of the beta-phosphate in mononucleotide protein-ATP complexes, suggesting the protein is indeed a kinase. The nature of the phosphate accepting substrate remains to be determined. Copyright 2004 Wiley-Liss, Inc.
Blangy, A; Léopold, P; Vidal, F; Rassoulzadegan, M; Cuzin, F
1991-01-01
We have reported previously (1) two unexpected consequences of the microinjection into fertilized mouse eggs of a recombinant plasmid designated p12B1, carrying a 343 bp insert of non-repetitive mouse DNA. Injected at very low concentrations, this plasmid could be established as an extrachromosomal genetic element. When injected at greater concentration, it caused an early arrest of embryonic development. In the present work, we have studied this toxic effect in more detail by microinjecting short synthetic oligonucleotides with sequences from the mouse insert. Lethality was associated with the nucleotide sequence GTCACATG, identical with the CDEI element of yeast centromeres. Development of injected embryos was arrested between the one-cell and the early morula stages, with abnormal structures and DNA contents. Electrophoretic mobility shift and DNase footprinting assays demonstrated the binding of mouse nuclear protein(s) to the CDEI-like box. Base changes within the CDEI sequence prevented both the toxic effects in embryos and the formation of protein complexes in vitro, suggesting that protein binding at such sites in chromosomal DNA plays an important role in early development. PMID:1766880
Akiyama, Benjamin M.; Loper, John; Najarro, Kevin; Stone, Michael D.
2012-01-01
The unique cellular activity of the telomerase reverse transcriptase ribonucleoprotein (RNP) requires proper assembly of protein and RNA components into a functional complex. In the ciliate model organism Tetrahymena thermophila, the La-domain protein p65 is required for in vivo assembly of telomerase. Single-molecule and biochemical studies have shown that p65 promotes efficient RNA assembly with the telomerase reverse transcriptase (TERT) protein, in part by inducing a bend in the conserved stem IV region of telomerase RNA (TER). The domain architecture of p65 consists of an N-terminal domain, a La-RRM motif, and a C-terminal domain (CTD). Using single-molecule Förster resonance energy transfer (smFRET), we demonstrate the p65CTD is necessary for the RNA remodeling activity of the protein and is sufficient to induce a substantial conformational change in stem IV of TER. Moreover, nuclease protection assays directly map the site of p65CTD interaction to stem IV and reveal that, in addition to bending stem IV, p65 binding reorganizes nucleotides that comprise the low-affinity TERT binding site within stem–loop IV. PMID:22315458
Programmable RNA recognition and cleavage by CRISPR/Cas9.
O'Connell, Mitchell R; Oakes, Benjamin L; Sternberg, Samuel H; East-Seletsky, Alexandra; Kaplan, Matias; Doudna, Jennifer A
2014-12-11
The CRISPR-associated protein Cas9 is an RNA-guided DNA endonuclease that uses RNA-DNA complementarity to identify target sites for sequence-specific double-stranded DNA (dsDNA) cleavage. In its native context, Cas9 acts on DNA substrates exclusively because both binding and catalysis require recognition of a short DNA sequence, known as the protospacer adjacent motif (PAM), next to and on the strand opposite the twenty-nucleotide target site in dsDNA. Cas9 has proven to be a versatile tool for genome engineering and gene regulation in a large range of prokaryotic and eukaryotic cell types, and in whole organisms, but it has been thought to be incapable of targeting RNA. Here we show that Cas9 binds with high affinity to single-stranded RNA (ssRNA) targets matching the Cas9-associated guide RNA sequence when the PAM is presented in trans as a separate DNA oligonucleotide. Furthermore, PAM-presenting oligonucleotides (PAMmers) stimulate site-specific endonucleolytic cleavage of ssRNA targets, similar to PAM-mediated stimulation of Cas9-catalysed DNA cleavage. Using specially designed PAMmers, Cas9 can be specifically directed to bind or cut RNA targets while avoiding corresponding DNA sequences, and we demonstrate that this strategy enables the isolation of a specific endogenous messenger RNA from cells. These results reveal a fundamental connection between PAM binding and substrate selection by Cas9, and highlight the utility of Cas9 for programmable transcript recognition without the need for tags.
The hURAT1 rs559946 polymorphism and the incidence of gout in Han Chinese men.
Li, C; Yu, Q; Han, L; Wang, C; Chu, N; Liu, S
2014-01-01
Our previous study identified rs559946, a human urate transporter 1 (hURAT1) single nucleotide polymorphism (SNP), as being significantly associated with risk of primary hyperuricaemia (HUA) in a Han Chinese population. In the current study we aimed to identify the genetic effects of rs559946 on gout susceptibility in Han Chinese men. A total of 335 patients with gout and 376 healthy controls were recruited for a case-control association study. To examine the functional effect of rs559946, we performed luciferase reporter assays and an electrophoretic mobility shift assay (EMSA). rs559946 was found to be significantly associated with gout susceptibility (p = 0.004), with T-allele carriers showing a decreased risk of gout [odds ratio (OR) 0.70, 95% confidence interval (CI) 0.55-0.89]. Multiple linear regression analysis identified a significant association between rs559946 genotypes and tophi. Luciferase reporter assays show increased transcriptional activity of the hURAT1 promoter with the C allele of rs559946. EMSA detected binding of nuclear proteins to both the T and C alleles, although increased binding was observed with the T allele. Cold competition assays suggest that rs559946 may bind within a glucocorticoid receptor (GR) binding motif. Our study suggests that the rs559946 polymorphism is associated with increased HUA risk and may also contribute to gout development in Han Chinese men. The T to C substitution within rs559946 increased the transcriptional activity, and potentially increases gout susceptibility.
Programmable RNA recognition and cleavage by CRISPR/Cas9
O’Connell, Mitchell R.; Oakes, Benjamin L.; Sternberg, Samuel H.; East-Seletsky, Alexandra; Kaplan, Matias; Doudna, Jennifer A.
2014-01-01
The CRISPR-associated protein Cas9 is an RNA-guided DNA endonuclease that uses RNA:DNA complementarity to identify target sites for sequence-specific double-stranded DNA (dsDNA) cleavage [1-5]. In its native context, Cas9 acts on DNA substrates exclusively because both binding and catalysis require recognition of a short DNA sequence, the protospacer adjacent motif (PAM), next to and on the strand opposite the 20-nucleotide target site in dsDNA [4-7]. Cas9 has proven to be a versatile tool for genome engineering and gene regulation in many cell types and organisms [8], but it has been thought to be incapable of targeting RNA [5]. Here we show that Cas9 binds with high affinity to single-stranded RNA (ssRNA) targets matching the Cas9-associated guide RNA sequence when the PAM is presented in trans as a separate DNA oligonucleotide. Furthermore, PAM-presenting oligonucleotides (PAMmers) stimulate site-specific endonucleolytic cleavage of ssRNA targets, similar to PAM-mediated stimulation of Cas9-catalyzed DNA cleavage [7]. Using specially designed PAMmers, Cas9 can be specifically directed to bind or cut RNA targets while avoiding corresponding DNA sequences, and we demonstrate that this strategy enables the isolation of a specific endogenous mRNA from cells. These results reveal a fundamental connection between PAM binding and substrate selection by Cas9, and highlight the utility of Cas9 for programmable and tagless transcript recognition. PMID:25274302
Identifying novel sequence variants of RNA 3D motifs
Zirbel, Craig L.; Roll, James; Sweeney, Blake A.; Petrov, Anton I.; Pirrung, Meg; Leontis, Neocles B.
2015-01-01
Predicting RNA 3D structure from sequence is a major challenge in biophysics. An important sub-goal is accurately identifying recurrent 3D motifs from RNA internal and hairpin loop sequences extracted from secondary structure (2D) diagrams. We have developed and validated new probabilistic models for 3D motif sequences based on hybrid Stochastic Context-Free Grammars and Markov Random Fields (SCFG/MRF). The SCFG/MRF models are constructed using atomic-resolution RNA 3D structures. To parameterize each model, we use all instances of each motif found in the RNA 3D Motif Atlas and annotations of pairwise nucleotide interactions generated by the FR3D software. Isostericity relations between non-Watson–Crick basepairs are used in scoring sequence variants. SCFG techniques model nested pairs and insertions, while MRF ideas handle crossing interactions and base triples. We use test sets of randomly-generated sequences to set acceptance and rejection thresholds for each motif group and thus control the false positive rate. Validation was carried out by comparing results for four motif groups to RMDetect. The software developed for sequence scoring (JAR3D) is structured to automatically incorporate new motifs as they accumulate in the RNA 3D Motif Atlas when new structures are solved and is available free for download. PMID:26130723
Proteolytic dissection of Zab, the Z-DNA-binding domain of human ADAR1
NASA Technical Reports Server (NTRS)
Schwartz, T.; Lowenhaupt, K.; Kim, Y. G.; Li, L.; Brown, B. A. 2nd; Herbert, A.; Rich, A.
1999-01-01
Zalpha is a peptide motif that binds to Z-DNA with high affinity. This motif binds to alternating dC-dG sequences stabilized in the Z-conformation by means of bromination or supercoiling, but not to B-DNA. Zalpha is part of the N-terminal region of double-stranded RNA adenosine deaminase (ADAR1), a candidate enzyme for nuclear pre-mRNA editing in mammals. Zalpha is conserved in ADAR1 from many species; in each case, there is a second similar motif, Zbeta, separated from Zalpha by a more divergent linker. To investigate the structure-function relationship of Zalpha, its domain structure was studied by limited proteolysis. Proteolytic profiles indicated that Zalpha is part of a domain, Zab, of 229 amino acids (residues 133-361 in human ADAR1). This domain contains both Zalpha and Zbeta as well as a tandem repeat of a 49-amino acid linker module. Prolonged proteolysis revealed a minimal core domain of 77 amino acids (positions 133-209), containing only Zalpha, which is sufficient to bind left-handed Z-DNA; however, the substrate binding is strikingly different from that of Zab. The second motif, Zbeta, retains its structural integrity only in the context of Zab and does not bind Z-DNA as a separate entity. These results suggest that Zalpha and Zbeta act as a single bipartite domain. In the presence of substrate DNA, Zab becomes more resistant to proteases, suggesting that it adopts a more rigid structure when bound to its substrate, possibly with conformational changes in parts of the protein.
Interaction of the Sliding Clamp β-Subunit and Hda, a DnaA-Related Protein
Kurz, Mareike; Dalrymple, Brian; Wijffels, Gene; Kongsuwan, Kritaya
2004-01-01
In Escherichia coli, interactions between the replication initiation protein DnaA, the β subunit of DNA polymerase III (the sliding clamp protein), and Hda, the recently identified DnaA-related protein, are required to convert the active ATP-bound form of DnaA to an inactive ADP-bound form through the accelerated hydrolysis of ATP. This rapid hydrolysis of ATP is proposed to be the main mechanism that blocks multiple initiations during cell cycle and acts as a molecular switch from initiation to replication. However, the biochemical mechanism for this crucial step in DNA synthesis has not been resolved. Using purified Hda and β proteins in a plate binding assay and Ni-nitrilotriacetic acid pulldown analysis, we show for the first time that Hda directly interacts with β in vitro. A new β-binding motif, a hexapeptide with the consensus sequence QL[SP]LPL, related to the previously identified β-binding pentapeptide motif (QL[SD]LF) was found in the amino terminus of the Hda protein. Mutants of Hda with amino acid changes in the hexapeptide motif are severely defective in their ability to bind β. A 10-amino-acid peptide containing the E. coli Hda β-binding motif was shown to compete with Hda for binding to β in an Hda-β interaction assay. These results establish that the interaction of Hda with β is mediated through the hexapeptide sequence. We propose that this interaction may be crucial to the events that lead to the inactivation of DnaA and the prevention of excess initiation of rounds of replication. PMID:15150238
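The two consensus sequences quoted in this abstract, QL[SP]LPL and QL[SD]LF, are already written in a bracket notation that maps directly onto regular expressions. The following is a minimal sketch of how a protein sequence could be scanned for either motif; the function name and the toy sequence are illustrative assumptions and are not taken from the paper, and the toy sequence is not the real Hda sequence.

```python
import re

# Consensus patterns as quoted in the abstract; the variable names are illustrative.
HEXAPEPTIDE = re.compile(r"QL[SP]LPL")   # newly described beta-binding hexapeptide
PENTAPEPTIDE = re.compile(r"QL[SD]LF")   # previously identified pentapeptide


def find_beta_binding_motifs(seq):
    """Return sorted (start, matched_text, motif_name) tuples, with 0-based starts."""
    hits = []
    for name, pattern in (("hexapeptide", HEXAPEPTIDE),
                          ("pentapeptide", PENTAPEPTIDE)):
        for match in pattern.finditer(seq):
            hits.append((match.start(), match.group(), name))
    return sorted(hits)


# Made-up sequence for demonstration only (NOT the real Hda sequence).
toy_sequence = "MAAQLSLPLGGTQLDLFKK"
print(find_beta_binding_motifs(toy_sequence))
```

A match only shows that the consensus is present in the sequence; whether such a site actually binds the clamp would still have to be tested experimentally, as the authors did with their mutants and competing peptide.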
Interaction of the sliding clamp beta-subunit and Hda, a DnaA-related protein.
Kurz, Mareike; Dalrymple, Brian; Wijffels, Gene; Kongsuwan, Kritaya
2004-06-01
In Escherichia coli, interactions between the replication initiation protein DnaA, the beta subunit of DNA polymerase III (the sliding clamp protein), and Hda, the recently identified DnaA-related protein, are required to convert the active ATP-bound form of DnaA to an inactive ADP-bound form through the accelerated hydrolysis of ATP. This rapid hydrolysis of ATP is proposed to be the main mechanism that blocks multiple initiations during cell cycle and acts as a molecular switch from initiation to replication. However, the biochemical mechanism for this crucial step in DNA synthesis has not been resolved. Using purified Hda and beta proteins in a plate binding assay and Ni-nitrilotriacetic acid pulldown analysis, we show for the first time that Hda directly interacts with beta in vitro. A new beta-binding motif, a hexapeptide with the consensus sequence QL[SP]LPL, related to the previously identified beta-binding pentapeptide motif (QL[SD]LF) was found in the amino terminus of the Hda protein. Mutants of Hda with amino acid changes in the hexapeptide motif are severely defective in their ability to bind beta. A 10-amino-acid peptide containing the E. coli Hda beta-binding motif was shown to compete with Hda for binding to beta in an Hda-beta interaction assay. These results establish that the interaction of Hda with beta is mediated through the hexapeptide sequence. We propose that this interaction may be crucial to the events that lead to the inactivation of DnaA and the prevention of excess initiation of rounds of replication.
Beyond Atg8 binding: The role of AIM/LIR motifs in autophagy.
Fracchiolla, Dorotea; Sawa-Makarska, Justyna; Martens, Sascha
2017-05-04
Selective macroautophagy/autophagy mediates the selective delivery of cytoplasmic cargo material via autophagosomes into the lytic compartment for degradation. This selectivity is mediated by cargo receptor molecules that link the cargo to the phagophore (the precursor of the autophagosome) membrane via their simultaneous interaction with the cargo and Atg8 proteins on the membrane. Atg8 proteins are attached to membrane in a conjugation reaction and the cargo receptors bind them via short peptide motifs called Atg8-interacting motifs/LC3-interacting regions (AIMs/LIRs). We have recently shown for the yeast Atg19 cargo receptor that the AIM/LIR motifs also serve to recruit the Atg12-Atg5-Atg16 complex, which stimulates Atg8 conjugation, to the cargo. We could further show in a reconstituted system that the recruitment of the Atg12-Atg5-Atg16 complex is sufficient for cargo-directed Atg8 conjugation. Our results suggest that AIM/LIR motifs could have more general roles in autophagy.
The L7Ae protein binds to two kink-turns in the Pyrococcus furiosus RNase P RNA
Lai, Stella M.; Lai, Lien B.; Foster, Mark P.; Gopalan, Venkat
2014-01-01
The RNA-binding protein L7Ae, known for its role in translation (as part of ribosomes) and RNA modification (as part of sn/oRNPs), has also been identified as a subunit of archaeal RNase P, a ribonucleoprotein complex that employs an RNA catalyst for the Mg2+-dependent 5′ maturation of tRNAs. To better understand the assembly and catalysis of archaeal RNase P, we used a site-specific hydroxyl radical-mediated footprinting strategy to pinpoint the binding sites of Pyrococcus furiosus (Pfu) L7Ae on its cognate RNase P RNA (RPR). L7Ae derivatives with single-Cys substitutions at residues in the predicted RNA-binding interface (K42C/C71V, R46C/C71V, V95C/C71V) were modified with an iron complex of EDTA-2-aminoethyl 2-pyridyl disulfide. Upon addition of hydrogen peroxide and ascorbate, these L7Ae-tethered nucleases were expected to cleave the RPR at nucleotides proximal to the EDTA-Fe–modified residues. Indeed, footprinting experiments with an enzyme assembled with the Pfu RPR and five protein cofactors (POP5, RPP21, RPP29, RPP30 and L7Ae–EDTA-Fe) revealed specific RNA cleavages, localizing the binding sites of L7Ae to the RPR's catalytic and specificity domains. These results support the presence of two kink-turns, the structural motifs recognized by L7Ae, in distinct functional domains of the RPR and suggest testable mechanisms by which L7Ae contributes to RNase P catalysis. PMID:25361963
Cold shock protein YB-1 is involved in hypoxia-dependent gene transcription
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rauen, Thomas; Frye, Bjoern C.; Pneumology, University Medical Center, University of Freiburg, Freiburg
Hypoxia-dependent gene regulation is largely orchestrated by hypoxia-inducible factors (HIFs), which associate with defined nucleotide sequences of hypoxia-responsive elements (HREs). Comparison of the regulatory HRE within the 3′ enhancer of the human erythropoietin (EPO) gene with known binding motifs for cold shock protein Y-box (YB) protein-1 yielded strong similarities within the Y-box element and 3′ adjacent sequences. DNA binding assays confirmed YB-1 binding to both single- and double-stranded HRE templates. Under hypoxia, we observed nuclear shuttling of YB-1, and co-immunoprecipitation assays demonstrated that YB-1 and HIF-1α physically interact with each other. Cellular YB-1 depletion using siRNA significantly induced hypoxia-dependent EPO production at both promoter and mRNA level. Vice versa, overexpressed YB-1 significantly reduced EPO-HRE-dependent gene transcription, whereas this effect was minor under normoxia. HIF-1α overexpression induced hypoxia-dependent gene transcription through the same element and accordingly, co-expression with YB-1 reduced HIF-1α-mediated EPO induction under hypoxic conditions. Taken together, we identified YB-1 as a novel binding factor for HREs that participates in fine-tuning of the hypoxia transcriptome. - Highlights: • Hypoxia drives nuclear translocation of cold shock protein YB-1. • YB-1 physically interacts with hypoxia-inducible factor (HIF)-1α. • YB-1 binds to the hypoxia-responsive element (HRE) within the erythropoietin (EPO) 3′ enhancer. • YB-1 trans-regulates transcription of hypoxia-dependent genes such as EPO and VEGF.
Liu, Gary W; Livesay, Brynn R; Kacherovsky, Nataly A; Cieslewicz, Maryelise; Lutz, Emi; Waalkes, Adam; Jensen, Michael C; Salipante, Stephen J; Pun, Suzie H
2015-08-19
Peptide ligands are used to increase the specificity of drug carriers to their target cells and to facilitate intracellular delivery. One method to identify such peptide ligands, phage display, enables high-throughput screening of peptide libraries for ligands binding to therapeutic targets of interest. However, conventional methods for identifying target binders in a library by Sanger sequencing are low-throughput, labor-intensive, and provide a limited perspective (<0.01%) of the complete sequence space. Moreover, the small sample space can be dominated by nonspecific, preferentially amplifying "parasitic sequences" and plastic-binding sequences, which may lead to the identification of false positives or exclude the identification of target-binding sequences. To overcome these challenges, we employed next-generation Illumina sequencing to couple high-throughput screening and high-throughput sequencing, enabling more comprehensive access to the phage display library sequence space. In this work, we define the hallmarks of binding sequences in next-generation sequencing data, and develop a method that identifies several target-binding phage clones for murine, alternatively activated M2 macrophages with a high (100%) success rate: sequences and binding motifs were reproducibly present across biological replicates; binding motifs were identified across multiple unique sequences; and an unselected, amplified library accurately filtered out parasitic sequences. In addition, we validate the Multiple Em for Motif Elicitation tool as an efficient and principled means of discovering binding sequences.
Recognition of p63 by the E3 ligase ITCH: Effect of an ectodermal dysplasia mutant.
Bellomaria, A; Barbato, Gaetano; Melino, G; Paci, M; Melino, Sonia
2010-09-15
The E3 ubiquitin ligase Itch mediates the degradation of the p63 protein. Itch contains four WW domains which are pivotal for the substrate recognition process. Indeed, this domain is implicated in several signalling complexes crucially involved in human diseases including Muscular Dystrophy, Alzheimer's Disease and Huntington Disease. WW domains are highly compact protein-protein binding modules that interact with short proline-rich sequences. The four WW domains present in Itch belong to the Group I type, which binds polypeptides with a PY motif characterized by a PPxY consensus sequence, where x can be any residue. Accordingly, the Itch-p63 interaction results from a direct binding of Itch-WW2 domain with the PY motif of p63. Here, we report a structural analysis of the Itch-p63 interaction by fluorescence, CD and NMR spectroscopy. Indeed, we studied the in vitro interaction between Itch-WW2 domain and p63(534-551), an 18-mer peptide encompassing a fragment of the p63 protein including the PY motif. In addition, we evaluated the conformation and the interaction with Itch-WW2 of a site specific mutant of p63, I549T, that has been reported in both Hay-Wells syndrome and Rapp-Hodgkin syndrome. Based on our results, we propose an extended PPxY motif for the Itch recognition motif (P-P-P-Y-x(4)-[ST]-[ILV]), which includes these C-terminal residues to the PPxY motif.
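The extended recognition motif proposed in this abstract, P-P-P-Y-x(4)-[ST]-[ILV], is written in a PROSITE-like notation. As a rough sketch, such a pattern can be translated into a regular expression and used to check candidate peptides. The converter below handles only the elements that occur in this pattern (single residues, `x`, bracketed alternatives and a `(n)` repeat count); the function name and the toy peptides are assumptions made for illustration and are not real p63 sequences.

```python
import re


def prosite_to_regex(pattern):
    """Translate a simple PROSITE-style pattern into a regex string.

    Supports only: single residues (A-Z), 'x' for any residue,
    bracketed alternatives such as [ST], and an optional repeat count (n).
    """
    parts = []
    for element in pattern.split("-"):
        m = re.fullmatch(r"(x|[A-Z]|\[[A-Z]+\])(?:\((\d+)\))?", element)
        if m is None:
            raise ValueError(f"unsupported element: {element!r}")
        residue, count = m.groups()
        token = "." if residue == "x" else residue
        parts.append(token + (f"{{{count}}}" if count else ""))
    return "".join(parts)


# Pattern as quoted in the abstract.
extended_py = re.compile(prosite_to_regex("P-P-P-Y-x(4)-[ST]-[ILV]"))

# Toy peptides for demonstration only (hypothetical, not p63 fragments).
matching_peptide = "GSPPPYAAAASIQ"      # final motif position is I
substituted_peptide = "GSPPPYAAAASTQ"   # final motif position changed to T

print(extended_py.pattern)
print(bool(extended_py.search(matching_peptide)))
print(bool(extended_py.search(substituted_peptide)))
```

In this sketch, replacing the residue at the last motif position with threonine is enough to lose the match, which illustrates how a single substitution at a constrained position can take a sequence outside the proposed consensus.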
Sekiyama, Naotaka; Arthanari, Haribabu; Papadopoulos, Evangelos; ...
2015-07-13
The eIF4E-binding protein (4E-BP) is a phosphorylation-dependent regulator of protein synthesis. The nonphosphorylated or minimally phosphorylated form binds translation initiation factor 4E (eIF4E), preventing binding of eIF4G and the recruitment of the small ribosomal subunit. Signaling events stimulate serial phosphorylation of 4E-BP, primarily by mammalian target of rapamycin complex 1 (mTORC1) at residues T37/T46, followed by T70 and S65. Hyperphosphorylated 4E-BP dissociates from eIF4E, allowing eIF4E to interact with eIF4G and translation initiation to resume. Because overexpression of eIF4E is linked to cellular transformation, 4E-BP is a tumor suppressor, and up-regulation of its activity is a goal of interest for cancer therapy. A recently discovered small molecule, eIF4E/eIF4G interaction inhibitor 1 (4EGI-1), disrupts the eIF4E/eIF4G interaction and promotes binding of 4E-BP1 to eIF4E. Structures of 14- to 16-residue 4E-BP fragments bound to eIF4E contain the eIF4E consensus binding motif, 54YXXXXLΦ60 (motif 1) but lack known phosphorylation sites. We report in this paper a 2.1-Å crystal structure of mouse eIF4E in complex with m7GTP and with a fragment of human 4E-BP1, extended C-terminally from the consensus-binding motif (4E-BP1 50–84). The extension, which includes a proline-turn-helix segment (motif 2) followed by a loop of irregular structure, reveals the location of two phosphorylation sites (S65 and T70). Our major finding is that the C-terminal extension (motif 3) is critical to 4E-BP1–mediated cell cycle arrest and that it partially overlaps with the binding site of 4EGI-1. Finally, the binding of 4E-BP1 and 4EGI-1 to eIF4E is therefore not mutually exclusive, and both ligands contribute to shift the equilibrium toward the inhibition of translation initiation.
Boyoglu-Barnum, S; Todd, S O; Meng, J; Barnum, T R; Chirkova, T; Haynes, L M; Jadhao, S J; Tripp, R A; Oomens, A G; Moore, M L; Anderson, L J
2017-05-15
Respiratory syncytial virus (RSV) belongs to the family Paramyxoviridae and is the single most important cause of serious lower respiratory tract infections in young children, yet no highly effective treatment or vaccine is available. Through a CX3C chemokine motif (182CWAIC186) in the G protein, RSV binds to the corresponding chemokine receptor, CX3CR1. Since RSV binding to CX3CR1 contributes to disease pathogenesis, we investigated whether a mutation in the CX3C motif by insertion of an alanine, A186, within the CX3C motif, mutating it to CX4C (182CWAIAC187), which is known to block binding to CX3CR1, might decrease disease. We studied the effect of the CX4C mutation in two strains of RSV (A2 and r19F) in a mouse challenge model. We included RSV r19F because it induces mucus production and airway resistance, two manifestations of RSV infection in humans, in mice. Compared to wild-type (wt) virus, mice infected with CX4C had a 0.7 to 1.2 log10-fold lower virus titer in the lung at 5 days postinfection (p.i.) and had markedly reduced weight loss, pulmonary inflammatory cell infiltration, mucus production, and airway resistance after challenge. This decrease in disease was not dependent on decrease in virus replication but did correspond to a decrease in pulmonary Th2 and inflammatory cytokines. Mice infected with CX4C viruses also had higher antibody titers and a Th1-biased T cell memory response at 75 days p.i. These results suggest that the CX4C mutation in the G protein could improve the safety and efficacy of a live attenuated RSV vaccine. IMPORTANCE RSV binds to the corresponding chemokine receptor, CX3CR1, through a CX3C chemokine motif (182CWAIC186) in the G protein. RSV binding to CX3CR1 contributes to disease pathogenesis; therefore, we investigated whether a mutation in the CX3C motif by insertion of an alanine, A186, within the CX3C motif, mutating it to CX4C (182CWAIAC187), known to block binding to CX3CR1, might decrease disease. 
The effect of this mutation and treatment with the F(ab')2 form of the anti-RSV G 131-2G monoclonal antibody (MAb) show that mutating the CX3C motif to CX4C blocks much of the disease and immune modulation associated with the G protein and should improve the safety and efficacy of a live attenuated RSV vaccine. Copyright © 2017 American Society for Microbiology.
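The CX3C and CX4C motifs named in this abstract differ only in the number of residues between the two cysteines: three in CWAIC, four in CWAIAC after the alanine insertion. A minimal sketch of that spacing check is given below; the function name is a hypothetical choice, and the code looks only at the short segments quoted in the abstract, not at the full G protein sequence.

```python
import re

# Two cysteines separated by exactly three or exactly four residues.
CX3C = re.compile(r"C.{3}C")
CX4C = re.compile(r"C.{4}C")


def classify_cys_spacing(segment):
    """Label a short segment as 'CX3C', 'CX4C' or 'other' by cysteine spacing."""
    if CX3C.fullmatch(segment):
        return "CX3C"
    if CX4C.fullmatch(segment):
        return "CX4C"
    return "other"


# Segments as quoted in the abstract (wild type and alanine-insertion mutant).
print(classify_cys_spacing("CWAIC"))
print(classify_cys_spacing("CWAIAC"))
```

`fullmatch` is used so that the whole segment must fit the spacing; scanning a longer sequence would call for `search` or `finditer` instead.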
Assessment of composite motif discovery methods.
Klepper, Kjetil; Sandve, Geir K; Abul, Osman; Johansen, Jostein; Drablos, Finn
2008-02-26
Computational discovery of regulatory elements is an important area of bioinformatics research and more than a hundred motif discovery methods have been published. Traditionally, most of these methods have addressed the problem of single motif discovery - discovering binding motifs for individual transcription factors. In higher organisms, however, transcription factors usually act in combination with nearby bound factors to induce specific regulatory behaviours. Hence, recent focus has shifted from single motifs to the discovery of sets of motifs bound by multiple cooperating transcription factors, so called composite motifs or cis-regulatory modules. Given the large number and diversity of methods available, independent assessment of methods becomes important. Although there have been several benchmark studies of single motif discovery, no similar studies have previously been conducted concerning composite motif discovery. We have developed a benchmarking framework for composite motif discovery and used it to evaluate the performance of eight published module discovery tools. Benchmark datasets were constructed based on real genomic sequences containing experimentally verified regulatory modules, and the module discovery programs were asked to predict both the locations of these modules and to specify the single motifs involved. To aid the programs in their search, we provided position weight matrices corresponding to the binding motifs of the transcription factors involved. In addition, selections of decoy matrices were mixed with the genuine matrices on one dataset to test the response of programs to varying levels of noise. Although some of the methods tested tended to score somewhat better than others overall, there were still large variations between individual datasets and no single method performed consistently better than the rest in all situations. 
The variation in performance on individual datasets also shows that the new benchmark datasets represent a suitable variety of challenges to most methods for module discovery.
Benard, Emmanuel; Michel, Christian J
2009-08-01
We present here the SEGM web server (Stochastic Evolution of Genetic Motifs) in order to study the evolution of genetic motifs both in the direct evolutionary sense (past-present) and in the inverse evolutionary sense (present-past). The genetic motifs studied can be nucleotides, dinucleotides and trinucleotides. As an example of an application of SEGM and to understand its functionalities, we give an analysis of inverse mutations of splice sites of human genome introns. SEGM is freely accessible at http://lsiit-bioinfo.u-strasbg.fr:8080/webMathematica/SEGM/SEGM.html directly or by the web site http://dpt-info.u-strasbg.fr/~michel/. To our knowledge, this SEGM web server is to date the only computational biology software in this evolutionary approach.
Nucleotide-dependent conformational states of actin
Pfaendtner, Jim; Branduardi, Davide; Parrinello, Michele; Pollard, Thomas D.; Voth, Gregory A.
2009-01-01
The influence of the state of the bound nucleotide (ATP, ADP-Pi, or ADP) on the conformational free-energy landscape of actin is investigated. Nucleotide-dependent folding of the DNase-I binding (DB) loop in monomeric actin and the actin trimer is carried out using all-atom molecular dynamics (MD) calculations accelerated with a multiscale implementation of the metadynamics algorithm. Additionally, an investigation of the opening and closing of the actin nucleotide binding cleft is performed. Nucleotide-dependent free-energy profiles for all of these conformational changes are calculated within the framework of metadynamics. We find that in ADP-bound monomer, the folded and unfolded states of the DB loop have similar relative free-energy. This result helps explain the experimental difficulty in obtaining an ordered crystal structure for this region of monomeric actin. However, we find that in the ADP-bound actin trimer, the folded DB loop is stable and in a free-energy minimum. It is also demonstrated that the nucleotide binding cleft favors a closed conformation for the bound nucleotide in the ATP and ADP-Pi states, whereas the ADP state favors an open conformation, both in the monomer and trimer. These results suggest a mechanism of allosteric interactions between the nucleotide binding cleft and the DB loop. This behavior is confirmed by an additional simulation that shows the folding free-energy as a function of the nucleotide cleft width, which demonstrates that the barrier for folding changes significantly depending on the value of the cleft width. PMID:19620726
Switch II Mutants Reveal Coupling between the Nucleotide- and Actin-Binding Regions in Myosin V
Trivedi, Darshan V.; David, Charles; Jacobs, Donald J.; Yengo, Christopher M.
2012-01-01
Conserved active-site elements in myosins and other P-loop NTPases play critical roles in nucleotide binding and hydrolysis; however, the mechanisms of allosteric communication among these mechanoenzymes remain unresolved. In this work we introduced the E442A mutation, which abrogates a salt-bridge between switch I and switch II, and the G440A mutation, which abolishes a main-chain hydrogen bond associated with the interaction of switch II with the γ phosphate of ATP, into myosin V. We used fluorescence resonance energy transfer between mant-labeled nucleotides or IAEDANS-labeled actin and FlAsH-labeled myosin V to examine the conformation of the nucleotide- and actin-binding regions, respectively. We demonstrate that in the absence of actin, both the G440A and E442A mutants bind ATP with similar affinity and result in only minor alterations in the conformation of the nucleotide-binding pocket (NBP). In the presence of ADP and actin, both switch II mutants disrupt the formation of a closed NBP actomyosin.ADP state. The G440A mutant also prevents ATP-induced opening of the actin-binding cleft. Our results indicate that the switch II region is critical for stabilizing the closed NBP conformation in the presence of actin, and is essential for communication between the active site and actin-binding region. PMID:22713570
The primary structure of L37--a rat ribosomal protein with a zinc finger-like motif.
Chan, Y L; Paz, V; Olvera, J; Wool, I G
1993-04-30
The amino acid sequence of the rat 60S ribosomal subunit protein L37 was deduced from the sequence of nucleotides in a recombinant cDNA. Ribosomal protein L37 has 96 amino acids, the NH2-terminal methionine is removed after translation of the mRNA, and has a molecular weight of 10,939. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type. Hybridization of the cDNA to digests of nuclear DNA suggests that there are 13 or 14 copies of the L37 gene. The mRNA for the protein is about 500 nucleotides in length. Rat L37 is related to Saccharomyces cerevisiae ribosomal protein YL35 and to Caenorhabditis elegans L37. We have identified in the data base a DNA sequence that encodes the chicken homolog of rat L37.