DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Suhkmann; Zhang, Ziming; Upchurch, Sean
2004-04-16
ARID is a homologous family of DNA-binding domains that occur in DNA binding proteins from a wide variety of species, ranging from yeast to nematodes, insects, mammals and plants. SWI1, a member of the SWI/SNF protein complex that is involved in chromatin remodeling during transcription, contains the ARID motif. The ARID domain of human SWI1 (also known as p270) does not select for a specific DNA sequence from a random sequence pool. The lack of sequence specificity shown by the SWI1 ARID domain stands in contrast to the other characterized ARID domains, which recognize specific AT-rich sequences. We have solved the three-dimensional structure of human SWI1 ARID using solution NMR methods. In addition, we have characterized non-specific DNA-binding by the SWI1 ARID domain. Results from this study indicate that a flexible long internal loop in the ARID motif is likely to be important for sequence specific DNA-recognition. The structure of the human SWI1 ARID domain also represents a distinct structural subfamily. Studies of ARID indicate that the boundary of the DNA binding structural and functional domains can extend beyond the sequence homologous region in a homologous family of proteins. Structural studies of homologous domains such as the ARID family of DNA-binding domains should provide information to better predict the boundary of structural and functional domains in structural genomic studies. Key Words: ARID, SWI1, NMR, structural genomics, protein-DNA interaction.
Churchill, M E; Jones, D N; Glaser, T; Hefner, H; Searles, M A; Travers, A A
1995-01-01
The high mobility group (HMG) protein HMG-D from Drosophila melanogaster is a highly abundant chromosomal protein that is closely related to the vertebrate HMG domain proteins HMG1 and HMG2. In general, chromosomal HMG domain proteins lack sequence specificity. However, using both NMR spectroscopy and standard biochemical techniques we show that binding of HMG-D to a single DNA site is sequence selective. The preferred duplex DNA binding site comprises at least 5 bp and contains the deformable dinucleotide TG embedded in A/T-rich sequences. The TG motif constitutes a common core element in the binding sites of the well-characterized sequence-specific HMG domain proteins. We show that a conserved aromatic residue in helix 1 of the HMG domain may be involved in recognition of this core sequence. In common with other HMG domain proteins, HMG-D binds preferentially to DNA sites that are stably bent and underwound; therefore HMG-D can be considered an architecture-specific protein. Finally, we show that HMG-D bends DNA and may confer a superhelical DNA conformation at a natural DNA binding site in the Drosophila fushi tarazu scaffold-associated region. PMID:7720717
APOBEC3G Interacts with ssDNA by Two Modes: AFM Studies
NASA Astrophysics Data System (ADS)
Shlyakhtenko, Luda S.; Dutta, Samrat; Banga, Jaspreet; Li, Ming; Harris, Reuben S.; Lyubchenko, Yuri L.
2015-10-01
APOBEC3G (A3G) protein has antiviral activity against HIV and other pathogenic retroviruses. A3G has two domains: a catalytic C-terminal domain (CTD) that deaminates cytidine, and an N-terminal domain (NTD) that binds to ssDNA. Although abundant information exists about the biological activities of A3G protein, the interplay between sequence specific deaminase activity and A3G binding to ssDNA remains controversial. We used the topographic imaging and force spectroscopy modalities of Atomic Force Microscopy (AFM) to characterize the interaction of A3G protein with deaminase specific and nonspecific ssDNA substrates. AFM imaging demonstrated that A3G has higher affinity for deaminase specific ssDNA than for nonspecific ssDNA. AFM force spectroscopy revealed two distinct binding modes by which A3G interacts with ssDNA. One mode requires sequence specificity, as demonstrated by stronger and more stable complexes with deaminase specific ssDNA than with nonspecific ssDNA. Overall these observations reinforce prior studies suggesting that both domains of A3G contribute to the sequence specific binding of ssDNA.
Munde, Manoj; Poon, Gregory M. K.; Wilson, W. David
2013-01-01
Members of the ETS family of transcription factors regulate a functionally diverse array of genes. All ETS proteins share a structurally-conserved but sequence-divergent DNA-binding domain, known as the ETS domain. Although the structure and thermodynamics of the ETS-DNA complexes are well known, little is known about the kinetics of sequence recognition, a facet that offers potential insight into its molecular mechanism. We have characterized DNA binding by the ETS domain of PU.1 by biosensor-surface plasmon resonance (SPR). SPR analysis revealed a striking kinetic profile for DNA binding by the PU.1 ETS domain. At low salt concentrations, it binds high-affinity cognate DNA with a very slow association rate constant (≤10⁵ M⁻¹ s⁻¹), compensated by a correspondingly small dissociation rate constant. The kinetics are strongly salt-dependent but mutually balance to produce a relatively weak dependence in the equilibrium constant. This profile contrasts sharply with reported data for other ETS domains (e.g., Ets-1, TEL) for which high-affinity binding is driven by rapid association (>10⁷ M⁻¹ s⁻¹). We interpret this difference in terms of the hydration properties of ETS-DNA binding and propose that at least two mechanisms of sequence recognition are employed by this family of DNA-binding domains. Additionally, we use SPR to demonstrate the potential for pharmacological inhibition of sequence-specific ETS-DNA binding, using the minor groove-binding distamycin as a model compound. Our work establishes SPR as a valuable technique for extending our understanding of the molecular mechanisms of ETS-DNA interactions as well as developing potential small-molecule agents for biotechnological and therapeutic purposes. PMID:23416556
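The kinetic compensation described in this abstract can be illustrated with the standard 1:1 binding relation K_D = k_off / k_on. The sketch below is only an illustration: the rate constants are hypothetical values chosen to sit at the bounds quoted above, since the abstract does not report dissociation rate constants, and the function name is an assumption.

```python
def equilibrium_kd(k_on: float, k_off: float) -> float:
    """Equilibrium dissociation constant (M) for simple 1:1 binding.

    k_on  : association rate constant in M^-1 s^-1
    k_off : dissociation rate constant in s^-1
    """
    return k_off / k_on


# Hypothetical rate constants, NOT data from the study.
# Slow association paired with slow dissociation (PU.1-like profile):
kd_slow = equilibrium_kd(k_on=1e5, k_off=1e-4)
# Fast association paired with fast dissociation (Ets-1-like profile):
kd_fast = equilibrium_kd(k_on=1e7, k_off=1e-2)

print(f"slow on/off: K_D = {kd_slow:.1e} M")
print(f"fast on/off: K_D = {kd_fast:.1e} M")
```

With these illustrative numbers both profiles give the same nanomolar equilibrium constant, which is why equilibrium measurements alone cannot distinguish the two recognition mechanisms.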
APOBEC3G Interacts with ssDNA by Two Modes: AFM Studies
Shlyakhtenko, Luda S.; Dutta, Samrat; Banga, Jaspreet; Li, Ming; Harris, Reuben S.; Lyubchenko, Yuri L.
2015-01-01
APOBEC3G (A3G) protein has antiviral activity against HIV and other pathogenic retroviruses. A3G has two domains: a catalytic C-terminal domain (CTD) that deaminates cytidine, and an N-terminal domain (NTD) that binds to ssDNA. Although abundant information exists about the biological activities of A3G protein, the interplay between sequence specific deaminase activity and A3G binding to ssDNA remains controversial. We used the topographic imaging and force spectroscopy modalities of Atomic Force Microscopy (AFM) to characterize the interaction of A3G protein with deaminase specific and nonspecific ssDNA substrates. AFM imaging demonstrated that A3G has higher affinity for deaminase specific ssDNA than for nonspecific ssDNA. AFM force spectroscopy revealed two distinct binding modes by which A3G interacts with ssDNA. One mode requires sequence specificity, as demonstrated by stronger and more stable complexes with deaminase specific ssDNA than with nonspecific ssDNA. Overall these observations reinforce prior studies suggesting that both domains of A3G contribute to the sequence specific binding of ssDNA. PMID:26503602
Bosselut, R; Levin, J; Adjadj, E; Ghysdael, J
1993-11-11
Ets proteins form a family of sequence specific DNA binding proteins which bind DNA through an 85-amino-acid conserved domain, the Ets domain, whose sequence is unrelated to any other characterized DNA binding domain. Unlike all other known Ets proteins, which bind specific DNA sequences centered over either GGAA or GGAT core motifs, E74 and Elf1 selectively bind to GGAA core-containing sites. Elf1 and E74 differ from other Ets proteins in three residues located in an otherwise highly conserved region of the Ets domain, referred to as conserved region III (CRIII). We show that a restricted selectivity for GGAA core-containing sites could be conferred to Ets1 upon changing a single lysine residue within CRIII to the threonine found in Elf1 and E74 at this position. Conversely, the reciprocal mutation in Elf1 confers to this protein the ability to bind to GGAT core-containing EBS. This, together with the fact that mutation of two invariant arginine residues in CRIII abolishes DNA binding, indicates that CRIII plays a key role in Ets domain recognition of the GGAA/T core motif and leads us to discuss a model of Ets proteins--core motif interaction.
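The difference in core selectivity described in this abstract can be sketched as a simple motif scan. The example below is a hedged illustration only: it looks for the 4-bp core on both strands and ignores flanking sequence, which real Ets binding sites also depend on. The function names, pattern constants, and test sequence are all hypothetical.

```python
import re

# Core patterns taken from the abstract; the constant names are illustrative.
ETS1_LIKE_CORE = "GGA[AT]"   # accepts GGAA or GGAT cores
ELF1_LIKE_CORE = "GGAA"      # restricted to GGAA cores

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_cores(seq: str, pattern: str) -> list:
    """Return (strand, start) pairs for core matches on both strands.

    start is the 0-based index, on the forward strand, of the leftmost
    base covered by the match.
    """
    hits = [("+", m.start()) for m in re.finditer(pattern, seq)]
    rc = revcomp(seq)
    for m in re.finditer(pattern, rc):
        width = m.end() - m.start()
        hits.append(("-", len(seq) - m.start() - width))
    return sorted(hits, key=lambda hit: hit[1])


seq = "TTGGAAGTCATCCAA"  # made-up sequence: GGAA on the top strand, GGAT on the bottom
print(find_cores(seq, ETS1_LIKE_CORE))
print(find_cores(seq, ELF1_LIKE_CORE))
```

In this made-up sequence the broader pattern reports both cores while the GGAA-only pattern reports one, mirroring the restricted selectivity attributed to E74 and Elf1.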
Comparison between TRF2 and TRF1 of their telomeric DNA-bound structures and DNA-binding activities
Hanaoka, Shingo; Nagadoi, Aritaka; Nishimura, Yoshifumi
2005-01-01
Mammalian telomeres consist of long tandem arrays of double-stranded telomeric TTAGGG repeats packaged by the telomeric DNA-binding proteins TRF1 and TRF2. Both contain a similar C-terminal Myb domain that mediates sequence-specific binding to telomeric DNA. In a DNA complex of TRF1, only the single Myb-like domain consisting of three helices can bind specifically to double-stranded telomeric DNA. TRF2 also binds to double-stranded telomeric DNA. Although the DNA binding mode of TRF2 is likely identical to that of TRF1, TRF2 plays an important role in the t-loop formation that protects the ends of telomeres. Here, to clarify the details of the double-stranded telomeric DNA-binding modes of TRF1 and TRF2, we determined the solution structure of the DNA-binding domain of human TRF2 bound to telomeric DNA; it consists of three helices, and like TRF1, the third helix recognizes TAGGG sequence in the major groove of DNA with the N-terminal arm locating in the minor groove. However, small but significant differences are observed; in contrast to the minor groove recognition of TRF1, in which an arginine residue recognizes the TT sequence, a lysine residue of TRF2 interacts with the TT part. We examined the telomeric DNA-binding activities of both DNA-binding domains of TRF1 and TRF2 and found that TRF1 binds more strongly than TRF2. Based on the structural differences of both domains, we created several mutants of the DNA-binding domain of TRF2 with stronger binding activities compared to the wild-type TRF2. PMID:15608118
DNA binding specificity of the basic-helix-loop-helix protein MASH-1.
Meierhan, D; el-Ariss, C; Neuenschwander, M; Sieber, M; Stackhouse, J F; Allemann, R K
1995-09-05
Despite the high degree of sequence similarity in their basic-helix-loop-helix (BHLH) domains, MASH-1 and MyoD are involved in different biological processes. In order to define possible differences between the DNA binding specificities of these two proteins, we investigated the DNA binding properties of MASH-1 by circular dichroism spectroscopy and by electrophoretic mobility shift assays (EMSA). Upon binding to DNA, the BHLH domain of MASH-1 underwent a conformational change from a mainly unfolded to a largely alpha-helical form, and surprisingly, this change was independent of the specific DNA sequence. The same conformational transition could be induced by the addition of 20% 2,2,2-trifluoroethanol. The apparent dissociation constants (KD) of the complexes of full-length MASH-1 with various oligonucleotides were determined from half-saturation points in EMSAs. MASH-1 bound as a dimer to DNA sequences containing an E-box with high affinity (KD = 1.4-4.1 x 10(-14) M2). However, the specificity of DNA binding was low. The dissociation constant for the complex between MASH-1 and the highest affinity E-box sequence (KD = 1.4 x 10(-14) M2) was only a factor of 10 smaller than for completely unrelated DNA sequences (KD = approximately 1 x 10(-13) M2). The DNA binding specificity of MASH-1 was not significantly increased by the formation of a heterodimer with the ubiquitous E12 protein. MASH-1 and MyoD displayed similar binding site preferences, suggesting that their different target gene specificities cannot be explained solely by differential DNA binding. An explanation for these findings is provided on the basis of the known crystal structure of the BHLH domain of MyoD.
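The M2 units quoted in this abstract are what one would expect if two protein monomers bind one DNA site in a single cooperative step, so that KD = [P]^2[DNA]/[P2·DNA]. Under that assumption, and assuming DNA is present in trace amounts so free protein is approximately total protein, half-saturation occurs at a protein concentration of sqrt(KD). The sketch below works through the reported numbers; the model and the variable names are assumptions, not the authors' stated analysis.

```python
import math


def half_saturation_conc(kd_m2: float) -> float:
    """Protein concentration (M) at half-saturation for a cooperative
    2 P + DNA <-> P2.DNA model, where KD has units of M^2."""
    return math.sqrt(kd_m2)


kd_best_ebox = 1.4e-14   # M^2, highest-affinity E-box (from the abstract)
kd_unrelated = 1.0e-13   # M^2, unrelated DNA (approximate, from the abstract)

# Ratio of dissociation constants; the abstract rounds this to "a factor of 10".
specificity_ratio = kd_unrelated / kd_best_ebox

print(f"half-saturation, E-box:     {half_saturation_conc(kd_best_ebox):.2e} M")
print(f"half-saturation, unrelated: {half_saturation_conc(kd_unrelated):.2e} M")
print(f"KD ratio: {specificity_ratio:.1f}")
```

Because the half-saturation concentration scales with the square root of KD, a roughly sevenfold difference in KD corresponds to less than a threefold difference in the protein concentration needed to shift half of the probe.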
DOE Office of Scientific and Technical Information (OSTI.GOV)
Adámik, Matej; Bažantová, Pavla; Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, 701 03 Ostrava
Highlights: • DNA binding of p53 family core domains is inhibited by cadmium, cobalt and nickel. • Binding to DNA protects p53 family core domains from metal induced inhibition. • Cadmium, cobalt and nickel induced inhibition was reverted by EDTA in vitro. - Abstract: Site-specific DNA recognition and binding activity belong to common attributes of all three members of tumor suppressor p53 family proteins: p53, p63 and p73. It was previously shown that heavy metals can affect p53 conformation, sequence-specific binding and suppress p53 response to DNA damage. Here we report for the first time that cadmium, nickel and cobalt, which have already been shown to disturb various DNA repair mechanisms, can also influence p63 and p73 sequence-specific DNA binding activity and transactivation of p53 family target genes. Based on results of electrophoretic mobility shift assay and luciferase reporter assay, we conclude that cadmium inhibits sequence-specific binding of all three core domains to p53 consensus sequences and abolishes transactivation of several promoters (e.g. BAX and MDM2) by 50 μM concentrations. In the presence of specific DNA, all p53 family core domains were partially protected against loss of DNA binding activity due to cadmium treatment. Effective cadmium concentration to abolish DNA–protein interactions was about two times higher for p63 and p73 proteins than for p53. Furthermore, we detected partial reversibility of cadmium inhibition for all p53 family members by EDTA. DTT was able to reverse cadmium inhibition only for p53 and p73. Nickel and cobalt abolished DNA–p53 interaction at sub-millimolar concentrations while inhibition of p63 and p73 DNA binding was observed at millimolar concentrations. In summary, cadmium strongly inhibits p53, p63 and p73 DNA binding in vitro and in cells in comparison to nickel and cobalt. The role of cadmium inhibition of p53 tumor suppressor family in carcinogenesis is discussed.
Smaczniak, Cezary; Muiño, Jose M; Chen, Dijun; Angenent, Gerco C; Kaufmann, Kerstin
2017-08-01
Floral organ identities in plants are specified by the combinatorial action of homeotic master regulatory transcription factors. However, how these factors achieve their regulatory specificities is still largely unclear. Genome-wide in vivo DNA binding data show that homeotic MADS domain proteins recognize partly distinct genomic regions, suggesting that DNA binding specificity contributes to functional differences of homeotic protein complexes. We used in vitro systematic evolution of ligands by exponential enrichment followed by high-throughput DNA sequencing (SELEX-seq) on several floral MADS domain protein homo- and heterodimers to measure their DNA binding specificities. We show that specification of reproductive organs is associated with distinct binding preferences of a complex formed by SEPALLATA3 and AGAMOUS. Binding specificity is further modulated by different binding site spacing preferences. Combination of SELEX-seq and genome-wide DNA binding data allows differentiation between targets in specification of reproductive versus perianth organs in the flower. We validate the importance of DNA binding specificity for organ-specific gene regulation by modulating promoter activity through targeted mutagenesis. Our study shows that intrafamily protein interactions affect DNA binding specificity of floral MADS domain proteins. Differential DNA binding of MADS domain protein complexes plays a role in the specificity of target gene regulation. © 2017 American Society of Plant Biologists. All rights reserved.
Das, Devashish; Faridounnia, Maryam; Kovacic, Lidija; Kaptein, Robert; Boelens, Rolf; Folkers, Gert E.
2017-01-01
The nucleotide excision repair protein complex ERCC1-XPF is required for incision of DNA upstream of DNA damage. Functional studies have provided insights into the binding of ERCC1-XPF to various DNA substrates. However, because no structure for the ERCC1-XPF-DNA complex has been determined, the mechanism of substrate recognition remains elusive. Here we biochemically characterize the substrate preferences of the helix-hairpin-helix (HhH) domains of XPF and ERCC1-XPF and show that the binding to single-stranded DNA (ssDNA)/dsDNA junctions is dependent on joint binding to the DNA binding domain of ERCC1 and XPF. We reveal that the homodimeric XPF is able to bind various ssDNA sequences but with a clear preference for guanine-containing substrates. NMR titration experiments and in vitro DNA binding assays also show that, within the heterodimeric ERCC1-XPF complex, XPF specifically recognizes ssDNA. On the other hand, the HhH domain of ERCC1 preferentially binds dsDNA through the hairpin region. The two separate non-overlapping DNA binding domains in the ERCC1-XPF heterodimer jointly bind to an ssDNA/dsDNA substrate and, thereby, at least partially dictate the incision position during damage removal. Based on structural models, NMR titrations, DNA-binding studies, site-directed mutagenesis, charge distribution, and sequence conservation, we propose that the HhH domain of ERCC1 binds to dsDNA upstream of the damage, and XPF binds to the non-damaged strand within a repair bubble. PMID:28028171
Zinc-binding Domain of the Bacteriophage T7 DNA Primase Modulates Binding to the DNA Template
Lee, Seung-Joo; Zhu, Bin; Akabayov, Barak; Richardson, Charles C.
2012-01-01
The zinc-binding domain (ZBD) of prokaryotic DNA primases has been postulated to be crucial for recognition of specific sequences in the single-stranded DNA template. To determine the molecular basis for this role in recognition, we carried out homolog-scanning mutagenesis of the zinc-binding domain of DNA primase of bacteriophage T7 using a bacterial homolog from Geobacillus stearothermophilus. The ability of T7 DNA primase to catalyze template-directed oligoribonucleotide synthesis is eliminated by substitution of any five-amino acid residue-long segment within the ZBD. The most significant defect occurs upon substitution of a region (Pro-16 to Cys-20) spanning two cysteines that coordinate the zinc ion. The role of this region in primase function was further investigated by generating a protein library composed of multiple amino acid substitutions for Pro-16, Asp-18, and Asn-19 followed by genetic screening for functional proteins. Examination of proteins selected from the screening reveals no change in sequence-specific recognition. However, the more positively charged residues in the region facilitate DNA binding, leading to more efficient oligoribonucleotide synthesis on short templates. The results suggest that the zinc-binding mode alone is not responsible for sequence recognition, but rather its interaction with the RNA polymerase domain is critical for DNA binding and for sequence recognition. Consequently, any alteration in the ZBD that disturbs its conformation leads to loss of DNA-dependent oligoribonucleotide synthesis. PMID:23024359
de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas
2014-01-01
The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. PMID:24792163
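The "simple code" mentioned in this abstract maps each repeat's repeat-variable diresidue (RVD) to one DNA base. The abstract does not spell the mapping out, so the table below is an assumption based on the commonly cited canonical TALE code (NI to A, HD to C, NG to T, NN to G or A), and the function name is hypothetical. It is a sketch of the idea, not the authors' design tool.

```python
# Commonly cited canonical RVD-to-base preferences (assumption; not given
# in the abstract). "R" is the IUPAC code for G or A.
RVD_CODE = {
    "NI": "A",
    "HD": "C",
    "NG": "T",
    "NN": "R",
}


def predict_target(rvds: list) -> str:
    """Predict the DNA sequence read by an ordered list of RVDs.

    RVDs missing from the table are reported as "N" (any base) rather
    than guessed.
    """
    return "".join(RVD_CODE.get(rvd, "N") for rvd in rvds)


print(predict_target(["HD", "NI", "NG", "NN"]))
print(predict_target(["NI", "XX", "HD"]))  # "XX" is a deliberately unknown RVD
```

Reprogramming a repeat array then amounts to choosing the RVD order that spells out the desired target, which is the property the abstract reports is shared by the Bat proteins.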
Zhao, A; Guo, A; Liu, Z; Pape, L
1997-01-01
The coding sequences for a Schizosaccharomyces pombe sequence-specific DNA binding protein, Reb1p, have been cloned. The predicted S. pombe Reb1p is 24-29% identical to mouse TTF-1 (transcription termination factor-1) and Saccharomyces cerevisiae REB1 protein, both of which direct termination of RNA polymerase I catalyzed transcripts. The S. pombe Reb1 cDNA encodes a predicted polypeptide of 504 amino acids with a predicted molecular weight of 58.4 kDa. The S. pombe Reb1p is unusual in that the bipartite DNA binding motif identified originally in S. cerevisiae and Kluyveromyces lactis REB1 proteins is uninterrupted and thus S. pombe Reb1p may contain the smallest natural REB1 homologous DNA binding domain. Its genomic coding sequences were shown to be interrupted by two introns. A recombinant histidine-tagged Reb1 protein bearing the rDNA binding domain has two homologous, sequence-specific binding sites in the S. pombe rDNA intergenic spacer, located between 289 and 480 nt downstream of the end of the approximately 25S rRNA coding sequences. Each binding site is 13-14 bp downstream of two of the three proposed in vivo termination sites. The core of this 17 bp site, AGGTAAGGGTAATGCAC, is specifically protected by Reb1p in footprinting analysis. PMID:9016645
Selective DNA demethylation by fusion of TDG with a sequence-specific DNA-binding domain
Gregory, David J.; Mikhaylova, Lyudmila; Fedulov, Alexey V.
2012-01-01
Our ability to selectively manipulate gene expression by epigenetic means is limited, as there is no approach for targeted reactivation of epigenetically silenced genes, in contrast to what is available for selective gene silencing. We aimed to develop a tool for selective transcriptional activation by DNA demethylation. Here we present evidence that direct targeting of thymine-DNA-glycosylase (TDG) to specific sequences in the DNA can result in local DNA demethylation at potential regulatory sequences and lead to enhanced gene induction. When TDG was fused to a well-characterized DNA-binding domain [the Rel-homology domain (RHD) of NFκB], we observed decreased DNA methylation and increased transcriptional response to unrelated stimulus of inducible nitric oxide synthase (NOS2). The effect was not seen for control genes lacking either RHD-binding sites or high levels of methylation, nor in control mock-transduced cells. Specific reactivation of epigenetically silenced genes may thus be achievable by this approach, which provides a broadly useful strategy to further our exploration of biological mechanisms and to improve control over the epigenome. PMID:22419066
DNA Recognition by a σ54 Transcriptional Activator from Aquifex aeolicus
Vidangos, Natasha K.; Heideker, Johanna; Lyubimov, Artem; ...
2014-08-23
Transcription initiation by bacterial σ54-polymerase requires the action of a transcriptional activator protein. Activators bind sequence-specifically upstream of the transcription initiation site via a DNA-binding domain. The structurally characterized DNA-binding domains from activators all belong to the Factor for Inversion Stimulation (Fis) family of helix-turn-helix DNA-binding proteins. We report here structures of the free and DNA-bound forms of the DNA-binding domain of NtrC4 (4DBD) from Aquifex aeolicus, a member of the NtrC family of σ54 activators. Two NtrC4 binding sites were identified upstream (-145 and -85 base pairs) from the start of the lpxC gene, which is responsible for the first committed step in Lipid A biosynthesis. This is the first experimental evidence for σ54 regulation in lpxC expression. 4DBD was crystallized both without DNA and in complex with the -145 binding site. The structures, together with biochemical data, indicate that NtrC4 binds to DNA in a manner that is similar to that of its close homologue, Fis. Ultimately, the greater sequence specificity for the binding of 4DBD relative to Fis seems to arise from a larger number of base specific contacts contributing to affinity than for Fis.
Mouw, M; Pintel, D J
1998-11-10
GST-NS1 purified from Escherichia coli and insect cells binds double-strand DNA in an (ACCA)2-3-dependent fashion under similar ionic conditions, independent of the presence of anti-NS1 antisera or exogenously supplied ATP and interacts with single-strand DNA and RNA in a sequence-independent manner. An amino-terminal domain (amino acids 1-275) of NS1 [GST-NS1(1-275)], representing 41% of the full-length NS1 molecule, includes a domain that binds double-strand DNA in a sequence-specific manner at levels comparable to full-length GST-NS1, as well as single-strand DNA and RNA in a sequence-independent manner. The deletion of 15 additional amino-terminal amino acids yielded a molecule [GST-NS1(16-275)] that maintained (ACCA)2-3-specific double-strand DNA binding; however, this molecule was more sensitive to increasing ionic conditions than full-length GST-NS1 and GST-NS1(1-275) and could not be demonstrated to bind single-strand nucleic acids. A quantitative filter binding assay showed that E. coli- and baculovirus-expressed GST-NS1 and E. coli GST-NS1(1-275) specifically bound double-strand DNA with similar equilibrium kinetics [as measured by their apparent equilibrium DNA binding constants (KD)], whereas GST-NS1(16-275) bound 4- to 8-fold less well. Copyright 1998 Academic Press.
Yoga, Yano M. K.; Traore, Daouda A. K.; Sidiqi, Mahjooba; Szeto, Chris; Pendini, Nicole R.; Barker, Andrew; Leedman, Peter J.; Wilce, Jacqueline A.; Wilce, Matthew C. J.
2012-01-01
Poly-C-binding proteins are triple KH (hnRNP K homology) domain proteins with specificity for single stranded C-rich RNA and DNA. They play diverse roles in the regulation of protein expression at both transcriptional and translational levels. Here, we analyse the contributions of individual αCP1 KH domains to binding C-rich oligonucleotides using biophysical and structural methods. Using surface plasmon resonance (SPR), we demonstrate that KH1 makes the most stable interactions with both RNA and DNA, KH3 binds with intermediate affinity and KH2 only interacts detectably with DNA. The crystal structure of KH1 bound to a 5′-CCCTCCCT-3′ DNA sequence shows a 2:1 protein:DNA stoichiometry and demonstrates a molecular arrangement of KH domains bound to immediately adjacent oligonucleotide target sites. SPR experiments with a series of poly-C sequences reveal that cytosine is preferred at all four positions in the oligonucleotide binding cleft and that a C-tetrad binds KH1 with 10 times higher affinity than a C-triplet. The basis for this high affinity interaction is finally detailed with the structure determination of a KH1.W.C54S mutant bound to 5′-ACCCCA-3′ DNA sequence. Together, these data establish the lead role of KH1 in oligonucleotide binding by αCP1 and reveal the molecular basis of its specificity for a C-rich tetrad. PMID:22344691
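The tenfold affinity difference between the C-tetrad and the C-triplet reported in this abstract can be converted into a binding free-energy difference with ΔΔG = -RT ln(ratio). The sketch below assumes a temperature of 298.15 K, which the abstract does not state, and the function name is hypothetical.

```python
import math

GAS_CONSTANT = 8.314462618  # J / (mol * K)


def ddg_from_affinity_ratio(ratio: float, temp_k: float = 298.15) -> float:
    """Free-energy difference (J/mol) corresponding to an affinity ratio.

    ratio  : K_A(tight) / K_A(weak), equivalently K_D(weak) / K_D(tight)
    temp_k : temperature in kelvin (298.15 K is an assumed default)
    A negative result means the tighter complex is more stable.
    """
    return -GAS_CONSTANT * temp_k * math.log(ratio)


ddg = ddg_from_affinity_ratio(10.0)
print(f"ddG = {ddg / 1000:.2f} kJ/mol ({ddg / 4184:.2f} kcal/mol)")
```

At the assumed temperature a tenfold gain corresponds to a little under 6 kJ/mol, which is in the range usually associated with one or two additional hydrogen bonds, consistent with the fourth cytosine making specific contacts in the binding cleft.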
Watada, Hirotaka; Mirmira, Raghavendra G.; Kalamaras, Julie; German, Michael S.
2000-01-01
The developmentally important homeodomain transcription factors of the NK-2 class contain a highly conserved region, the NK2-specific domain (NK2-SD). The function of this domain, however, remains unknown. The primary structure of the NK2-SD suggests that it might function as an accessory DNA-binding domain or as a protein–protein interaction interface. To assess the possibility that the NK2-SD may contribute to DNA-binding specificity, we used a PCR-based approach to identify a consensus DNA-binding sequence for Nkx2.2, an NK-2 family member involved in pancreas and central nervous system development. The consensus sequence (TCTAAGTGAGCTT) is similar to the known binding sequences for other NK-2 homeodomain proteins, but we show that the NK2-SD does not contribute significantly to specific DNA binding to this sequence. To determine whether the NK2-SD contributes to transactivation, we used GAL4-Nkx2.2 fusion constructs to map a powerful transcriptional activation domain in the C-terminal region beyond the conserved NK2-SD. Interestingly, this C-terminal region functions as a transcriptional activator only in the absence of an intact NK2-SD. The NK2-SD also can mask transactivation from the paired homeodomain transcription factor Pax6, but it has no effect on transcription by itself. These results demonstrate that the NK2-SD functions as an intramolecular regulator of the C-terminal activation domain in Nkx2.2 and support a model in which interactions through the NK2-SD regulate the ability of NK-2-class proteins to activate specific genes during development. PMID:10944215
Toward rules relating zinc finger protein sequences and DNA binding site preferences.
Desjarlais, J R; Berg, J M
1992-08-15
Zinc finger proteins of the Cys2-His2 type consist of tandem arrays of domains, where each domain appears to contact three adjacent base pairs of DNA through three key residues. We have designed and prepared a series of variants of the central zinc finger within the DNA binding domain of Sp1 by using information from an analysis of a large data base of zinc finger protein sequences. Through systematic variations at two of the three contact positions (underlined), relatively specific recognition of sequences of the form 5'-GGGGN(G or T)GGG-3' has been achieved. These results provide the basis for rules that may develop into a code that will allow the design of zinc finger proteins with preselected DNA site specificity.
Aggarwal, Pooja; Das Gupta, Mainak; Joseph, Agnel Praveen; Chatterjee, Nirmalya; Srinivasan, N.; Nath, Utpal
2010-01-01
The TCP transcription factors control multiple developmental traits in diverse plant species. Members of this family share an ∼60-residue-long TCP domain that binds to DNA. The TCP domain is predicted to form a basic helix-loop-helix (bHLH) structure but shares little sequence similarity with the canonical bHLH domain. This classifies the TCP domain as a novel class of DNA binding domain specific to the plant kingdom. Little is known about how the TCP domain interacts with its target DNA. We report biochemical characterization and DNA binding properties of a TCP member in Arabidopsis thaliana, TCP4. We have shown that the 58-residue domain of TCP4 is essential and sufficient for binding to DNA and possesses DNA binding parameters comparable to canonical bHLH proteins. Using a yeast-based random mutagenesis screen and site-directed mutants, we identified the residues important for DNA binding and dimer formation. Mutants defective in binding and dimerization failed to rescue the phenotype of an Arabidopsis line lacking the endogenous TCP4 activity. By combining structure prediction, functional characterization of the mutants, and molecular modeling, we suggest a possible DNA binding mechanism for this class of transcription factors. PMID:20363772
DNA-binding regulates site-specific ubiquitination of IRF-1.
Landré, Vivien; Pion, Emmanuelle; Narayan, Vikram; Xirodimas, Dimitris P; Ball, Kathryn L
2013-02-01
Understanding the determinants for site-specific ubiquitination by E3 ligase components of the ubiquitin machinery is proving to be a challenge. In the present study we investigate the role of an E3 ligase docking site (Mf2 domain) in an intrinsically disordered domain of IRF-1 [IFN (interferon) regulatory factor-1], a short-lived IFNγ-regulated transcription factor, in ubiquitination of the protein. Ubiquitin modification of full-length IRF-1 by E3 ligases such as CHIP [C-terminus of the Hsc (heat-shock cognate) 70-interacting protein] and MDM2 (murine double minute 2), which dock to the Mf2 domain, was specific for lysine residues found predominantly in loop structures that extend from the DNA-binding domain, whereas no modification was detected in the more conformationally flexible C-terminal half of the protein. The E3 docking site was not available when IRF-1 was in its DNA-bound conformation and cognate DNA-binding sequences strongly suppressed ubiquitination, highlighting a strict relationship between ligase binding and site-specific modification at residues in the DNA-binding domain. Hyperubiquitination of a non-DNA-binding mutant supports a mechanism where an active DNA-bound pool of IRF-1 is protected from polyubiquitination and degradation.
Wienk, Hans; Slootweg, Jack C.; Speerstra, Sietske; Kaptein, Robert; Boelens, Rolf; Folkers, Gert E.
2013-01-01
To maintain the integrity of the genome, multiple DNA repair systems exist to repair damaged DNA. Recognition of altered DNA, including bulky adducts, pyrimidine dimers and interstrand crosslinks (ICL), partially depends on proteins containing helix-hairpin-helix (HhH) domains. To understand how ICL is specifically recognized by the Fanconi anemia proteins FANCM and FAAP24, we determined the structure of the HhH domain of FAAP24. Although it resembles other HhH domains, the FAAP24 domain contains a canonical hairpin motif followed by a distorted motif. The HhH domain can bind various DNA substrates; using nuclear magnetic resonance titration experiments, we demonstrate that the canonical HhH motif is required for double-stranded DNA (dsDNA) binding, whereas the unstructured N-terminus can interact with single-stranded DNA. Both DNA binding surfaces are used for binding to ICL-like single/double-strand junction-containing DNA substrates. A structural model for FAAP24 bound to dsDNA has been made based on homology with the translesion polymerase iota. Site-directed mutagenesis, sequence conservation and charge distribution support the dsDNA-binding model. Analogous to other HhH domain-containing proteins, we suggest that multiple FAAP24 regions together contribute to binding to the single/double-strand junction, which could contribute to specificity in ICL DNA recognition. PMID:23661679
Deep-sea vent phage DNA polymerase specifically initiates DNA synthesis in the absence of primers.
Zhu, Bin; Wang, Longfei; Mitsunobu, Hitoshi; Lu, Xueling; Hernandez, Alfredo J; Yoshida-Takashima, Yukari; Nunoura, Takuro; Tabor, Stanley; Richardson, Charles C
2017-03-21
A DNA polymerase is encoded by the deep-sea vent phage NrS-1. NrS-1 has a unique genome organization containing genes that are predicted to encode a helicase and a single-stranded DNA (ssDNA)-binding protein. The gene for an unknown protein shares weak homology with the bifunctional primase-polymerases (prim-pols) from archaeal plasmids but is missing the zinc-binding domain typically found in primases. We show that this gene product has efficient DNA polymerase activity and is processive in DNA synthesis in the presence of the NrS-1 helicase and ssDNA-binding protein. Remarkably, this NrS-1 DNA polymerase initiates DNA synthesis from a specific template DNA sequence in the absence of any primer. The de novo DNA polymerase activity resides in the N-terminal domain of the protein, whereas the C-terminal domain enhances DNA binding.
Engineering and Application of Zinc Finger Proteins and TALEs for Biomedical Research.
Kim, Moon-Soo; Kini, Anu Ganesh
2017-08-01
Engineered DNA-binding domains provide a powerful technology for numerous biomedical studies due to their ability to recognize specific DNA sequences. Zinc fingers (ZF) are one of the most common DNA-binding domains and have been extensively studied for a variety of applications, such as gene regulation, genome engineering and diagnostics. Another novel DNA-binding domain known as a transcriptional activator-like effector (TALE) has been more recently discovered, which has a previously undescribed DNA-binding mode. Due to their modular architecture and flexibility, TALEs have been rapidly developed into artificial gene targeting reagents. Here, we describe the methods used to design these DNA-binding proteins and their key applications in biomedical research.
Cong, Le; Zhou, Ruhong; Kuo, Yu-chi; Cunniff, Margaret; Zhang, Feng
2012-01-01
Transcription activator-like effectors (TALE) are sequence-specific DNA binding proteins that harbor modular, repetitive DNA binding domains. TALEs have enabled the creation of customizable designer transcriptional factors and sequence-specific nucleases for genome engineering. Here we report two improvements of the TALE toolbox for achieving efficient activation and repression of endogenous gene expression in mammalian cells. We show that the naturally occurring repeat variable diresidue (RVD) Asn-His (NH) has high biological activity and specificity for guanine, a highly prevalent base in mammalian genomes. We also report an effective TALE transcriptional repressor architecture for targeted inhibition of transcription in mammalian cells. These findings will improve the precision and effectiveness of genome engineering that can be achieved using TALEs. PMID:22828628
Arthur, A K; Höss, A; Fanning, E
1988-01-01
The genomic coding sequence of the large T antigen of simian virus 40 (SV40) was cloned into an Escherichia coli expression vector by joining new restriction sites, BglII and BamHI, introduced at the intron boundaries of the gene. Full-length large T antigen, as well as deletion and amino acid substitution mutants, were inducibly expressed from the lac promoter of pUC9, albeit with different efficiencies and protein stabilities. Specific interaction with SV40 origin DNA was detected for full-length T antigen and certain mutants. Deletion mutants lacking T-antigen residues 1 to 130 and 260 to 708 retained specific origin-binding activity, demonstrating that the region between residues 131 and 259 must carry the essential binding domain for DNA-binding sites I and II. A sequence between residues 302 and 320 homologous to a metal-binding "finger" motif is therefore not required for origin-specific binding. However, substitution of serine for either of two cysteine residues in this motif caused a dramatic decrease in origin DNA-binding activity. This region, as well as other regions of the full-length protein, may thus be involved in stabilizing the DNA-binding domain and altering its preference for binding to site I or site II DNA. PMID:2835505
Lee, Susan D.; Surtees, Jennifer A.; Alani, Eric
2007-01-01
In eukaryotic mismatch repair (MMR) MSH2-MSH6 initiates the repair of base-base and small insertion/deletion mismatches while MSH2-MSH3 repairs larger insertion/deletion mismatches. In this study we showed that the msh2Δ1 mutation, containing a complete deletion of the conserved mismatch recognition Domain I of MSH2, conferred a separation of function phenotype with respect to MSH2-MSH3 and MSH2-MSH6 functions. Strains bearing the msh2Δ1 mutation were nearly wild-type in MSH2-MSH6-mediated MMR and in suppressing recombination between DNA sequences predicted to form mismatches recognized by MSH2-MSH6. However, these strains were completely defective in MSH2-MSH3-mediated MMR and recombination functions. This information encouraged us to analyze the contributions of Domain I to the mismatch binding specificity of MSH2-MSH3 in genetic and biochemical assays. We found that Domain I in MSH2 contributed a non-specific DNA binding activity while Domain I of MSH3 appeared important for mismatch binding specificity and for suppressing non-specific DNA-binding. These observations reveal distinct requirements for the MSH2 DNA binding Domain I in the repair of DNA mismatches and suggest that the binding of MSH2-MSH3 to mismatch DNA involves protein-DNA contacts that appear very different from those required for MSH2-MSH6 mismatch binding. PMID:17157869
Lee, Susan D; Surtees, Jennifer A; Alani, Eric
2007-02-09
In eukaryotic mismatch repair (MMR) MSH2-MSH6 initiates the repair of base-base and small insertion/deletion mismatches while MSH2-MSH3 repairs larger insertion/deletion mismatches. Here, we show that the msh2Delta1 mutation, containing a complete deletion of the conserved mismatch recognition domain I of MSH2, conferred a separation of function phenotype with respect to MSH2-MSH3 and MSH2-MSH6 functions. Strains bearing the msh2Delta1 mutation were nearly wild-type in MSH2-MSH6-mediated MMR and in suppressing recombination between DNA sequences predicted to form mismatches recognized by MSH2-MSH6. However, these strains were completely defective in MSH2-MSH3-mediated MMR and recombination functions. This information encouraged us to analyze the contributions of domain I to the mismatch binding specificity of MSH2-MSH3 in genetic and biochemical assays. We found that domain I in MSH2 contributed a non-specific DNA binding activity while domain I of MSH3 appeared important for mismatch binding specificity and for suppressing non-specific DNA binding. These observations reveal distinct requirements for the MSH2 DNA binding domain I in the repair of DNA mismatches and suggest that the binding of MSH2-MSH3 to mismatch DNA involves protein-DNA contacts that appear very different from those required for MSH2-MSH6 mismatch binding.
Parrilla-Doblas, Jara Teresa; Ariza, Rafael R.; Roldán-Arjona, Teresa
2017-01-01
DNA methylation is a crucial epigenetic mark associated with gene silencing, and its targeted removal is a major goal of epigenetic editing. In animal cells, DNA demethylation involves iterative 5mC oxidation by TET enzymes followed by replication-dependent dilution and/or replication-independent DNA repair of its oxidized derivatives. In contrast, plants use specific DNA glycosylases that directly excise 5mC and initiate its substitution for unmethylated C in a base excision repair process. In this work, we have fused the catalytic domain of Arabidopsis ROS1 5mC DNA glycosylase (ROS1_CD) to the DNA binding domain of yeast GAL4 (GBD). We show that the resultant GBD-ROS1_CD fusion protein specifically binds a GBD-targeted DNA sequence in vitro. We also found that transient in vivo expression of GBD-ROS1_CD in human cells specifically reactivates transcription of a methylation-silenced reporter gene, and that such reactivation requires both ROS1_CD catalytic activity and GBD binding capacity. Finally, we show that reactivation induced by GBD-ROS1_CD is accompanied by decreased methylation levels at several CpG sites of the targeted promoter. Altogether, these results show that plant 5mC DNA glycosylases can be used for targeted active DNA demethylation in human cells. PMID:28277978
Isalan, M; Klug, A; Choo, Y
2001-07-01
DNA-binding domains with predetermined sequence specificity are engineered by selection of zinc finger modules using phage display, allowing the construction of customized transcription factors. Despite remarkable progress in this field, the available protein-engineering methods are deficient in many respects, thus hampering the applicability of the technique. Here we present a rapid and convenient method that can be used to design zinc finger proteins against a variety of DNA-binding sites. This is based on a pair of pre-made zinc finger phage-display libraries, which are used in parallel to select two DNA-binding domains each of which recognizes given 5 base pair sequences, and whose products are recombined to produce a single protein that recognizes a composite (9 base pair) site of predefined sequence. Engineering using this system can be completed in less than two weeks and yields proteins that bind sequence-specifically to DNA with Kd values in the nanomolar range. To illustrate the technique, we have selected seven different proteins to bind various regions of the human immunodeficiency virus 1 (HIV-1) promoter.
Viola, Ivana L; Uberti Manassero, Nora G; Ripoll, Rodrigo; Gonzalez, Daniel H
2011-04-01
The TCP domain is a DNA-binding domain present in plant transcription factors that modulate different processes. In the present study, we show that Arabidopsis class I TCP proteins are able to interact with a dyad-symmetric sequence composed of two GTGGG half-sites. TCP20 establishes symmetric interactions with the 5' half of each strand, whereas TCP11 interacts mainly with the 3' half. SELEX (systematic evolution of ligands by exponential enrichment) experiments with TCP15 and TCP20 indicated that these proteins have similar, although not identical, DNA-binding preferences and are able to interact with non-palindromic binding sites of the type GTGGGNCCNN. TCP11 shows a different DNA-binding specificity, with a preference for the sequence GTGGGCCNNN. The distinct DNA-binding properties of TCP11 are due to the presence of a threonine residue at position 15 of the TCP domain, a position that is occupied by an arginine residue in most TCP proteins. TCP11 also forms heterodimers with TCP15 that have increased DNA-binding efficiency. The expression in plants of a repressor form of TCP11 demonstrated that this protein is a developmental regulator that influences the growth of leaves, stems and petioles, and pollen development. The results suggest that changes in DNA-binding preferences may be one of the mechanisms through which class I TCP proteins achieve functional specificity.
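The two binding-site types quoted in the abstract above, GTGGGNCCNN and GTGGGCCNNN, differ only in where the degenerate position N falls. As a rough illustration (not the authors' SELEX analysis), the Python sketch below scans a sequence for such degenerate motifs. The function name `find_sites` and both probe sequences are hypothetical and chosen only for the example; only the N code is expanded, since that is all these two motifs need.

```python
import re

# Minimal IUPAC table: only the codes used by the two motifs above.
# N stands for any of the four bases.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "[ACGT]"}

def find_sites(motif, sequence):
    """Return (start, matched_text) for every occurrence of a degenerate
    motif in sequence, including overlapping occurrences."""
    body = "".join(IUPAC[base] for base in motif)
    # A lookahead with a capture group lets finditer report overlaps.
    pattern = re.compile("(?=(" + body + "))")
    return [(m.start(), m.group(1)) for m in pattern.finditer(sequence)]

TCP15_TCP20_SITE = "GTGGGNCCNN"  # site type reported for TCP15 and TCP20
TCP11_SITE = "GTGGGCCNNN"        # preferred site reported for TCP11

# Hypothetical probe sequences, invented for this illustration.
probe_a = "AAGTGGGACCACTT"
probe_b = "TTGTGGGCCCACAA"

for name, probe in (("probe_a", probe_a), ("probe_b", probe_b)):
    print(name, find_sites(TCP15_TCP20_SITE, probe), find_sites(TCP11_SITE, probe))
```

A string match of this kind only says whether a sequence fits the consensus; it carries no information about binding affinity.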
Structural basis of DNA target recognition by the B3 domain of Arabidopsis epigenome reader VAL1
Sasnauskas, Giedrius; Kauneckaitė, Kotryna; Siksnys, Virginijus
2018-01-01
Arabidopsis thaliana requires a prolonged period of cold exposure during winter to initiate flowering in a process termed vernalization. Exposure to cold induces epigenetic silencing of the FLOWERING LOCUS C (FLC) gene by Polycomb group (PcG) proteins. A key role in this epigenetic switch is played by transcriptional repressors VAL1 and VAL2, which specifically recognize Sph/RY DNA sequences within FLC via B3 DNA binding domains, and mediate recruitment of PcG silencing machinery. To understand the structural mechanism of site-specific DNA recognition by VAL1, we have solved the crystal structure of VAL1 B3 domain (VAL1-B3) bound to a 12 bp oligoduplex containing the canonical Sph/RY DNA sequence 5′-CATGCA-3′/5′-TGCATG-3′. We find that VAL1-B3 makes H-bonds and van der Waals contacts to DNA bases of all six positions of the canonical Sph/RY element. In agreement with the structure, in vitro DNA binding studies show that VAL1-B3 does not tolerate substitutions at any position of the 5′-TGCATG-3′ sequence. The VAL1-B3–DNA structure presented here provides a structural model for understanding the specificity of plant B3 domains interacting with the Sph/RY and other DNA sequences. PMID:29660015
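The Sph/RY element in the abstract above is written as a pair of strands, 5′-CATGCA-3′/5′-TGCATG-3′, each the reverse complement of the other. The short Python sketch below checks that relationship; the helper name `reverse_complement` is an illustrative choice and is not taken from the paper.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence,
    read 5' to 3' on the opposite strand."""
    return seq.translate(COMPLEMENT)[::-1]

top_strand = "CATGCA"     # canonical Sph/RY sequence quoted in the abstract
bottom_strand = "TGCATG"  # partner strand quoted in the abstract

print(reverse_complement(top_strand) == bottom_strand)
# The element is not its own reverse complement, so the two strands
# read differently 5' to 3'.
print(reverse_complement(top_strand) == top_strand)
```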
Golovenko, Dmitrij; Manakova, Elena; Zakrys, Linas; Zaremba, Mindaugas; Sasnauskas, Giedrius; Gražulis, Saulius; Siksnys, Virginijus
2014-01-01
The B3 DNA-binding domains (DBDs) of plant transcription factors (TF) and DBDs of EcoRII and BfiI restriction endonucleases (EcoRII-N and BfiI-C) share a common structural fold, classified as the DNA-binding pseudobarrel. The B3 DBDs in the plant TFs recognize a diverse set of target sequences. The only available co-crystal structure of the B3-like DBD is that of EcoRII-N (recognition sequence 5′-CCTGG-3′). In order to understand the structural and molecular mechanisms of specificity of B3 DBDs, we have solved the crystal structure of BfiI-C (recognition sequence 5′-ACTGGG-3′) complexed with 12-bp cognate oligoduplex. Structural comparison of BfiI-C–DNA and EcoRII-N–DNA complexes reveals a conserved DNA-binding mode and a conserved pattern of interactions with the phosphodiester backbone. The determinants of the target specificity are located in the loops that emanate from the conserved structural core. The BfiI-C–DNA structure presented here expands a range of templates for modeling of the DNA-bound complexes of the B3 family of plant TFs. PMID:24423868
Molecular determinants of origin discrimination by Orc1 initiators in archaea.
Dueber, Erin C; Costa, Alessandro; Corn, Jacob E; Bell, Stephen D; Berger, James M
2011-05-01
Unlike bacteria, many eukaryotes initiate DNA replication from genomic sites that lack apparent sequence conservation. These loci are identified and bound by the origin recognition complex (ORC), and subsequently activated by a cascade of events that includes recruitment of an additional factor, Cdc6. Archaeal organisms generally possess one or more Orc1/Cdc6 homologs, belonging to the Initiator clade of ATPases associated with various cellular activities (AAA(+)) superfamily; however, these proteins recognize specific sequences within replication origins. Atomic resolution studies have shown that archaeal Orc1 proteins contact double-stranded DNA through an N-terminal AAA(+) domain and a C-terminal winged-helix domain (WHD), but use remarkably few base-specific contacts. To investigate the biochemical effects of these associations, we mutated the DNA-interacting elements of the Orc1-1 and Orc1-3 paralogs from the archaeon Sulfolobus solfataricus, and tested their effect on origin binding and deformation. We find that the AAA(+) domain has an unpredicted role in controlling the sequence selectivity of DNA binding, despite an absence of base-specific contacts to this region. Our results show that both the WHD and ATPase region influence origin recognition by Orc1/Cdc6, and suggest that not only DNA sequence, but also local DNA structure help define archaeal initiator binding sites.
Ramachandrakurup, Sreelakshmi; Ramakrishnan, Vigneshwar
2017-09-01
Protein-DNA interactions are an important class of biomolecular interactions inside the cell. Delineating the mechanisms of protein-DNA interactions and more specifically, how proteins search and bind to their specific cognate sequences has been the quest of many in the scientific community. Restriction enzymes have served as useful model systems to this end. In this work, we have investigated using molecular dynamics simulations the effect of L43K mutation on NaeI, a type IIE restriction enzyme. NaeI has two domains, the Topo and the Endo domains, each binding to identical strands of DNA sequences (GCCGGC)2. The binding of the DNA to the Topo domain is thought to enhance the binding and cleavage of DNA at the Endo domain. Interestingly, it has been found that the mutation of an amino acid that is distantly-located from the DNA cleavage site (L43K) converts the restriction endonuclease to a topoisomerase. Our investigations reveal that the L43K mutation not only induces local structural changes (as evidenced by changes in hydrogen bond propensities and differences in the percentage of secondary structure assignments of the residues in the ligase-like domain) but also alters the overall protein dynamics and DNA conformation which probably leads to the loss of specific cleavage of the recognition site. In a larger context, our study underscores the importance of considering the role of distantly-located amino acids in understanding protein-DNA interactions.
Chimeric TALE recombinases with programmable DNA sequence specificity.
Mercer, Andrew C; Gaj, Thomas; Fuller, Roberta P; Barbas, Carlos F
2012-11-01
Site-specific recombinases are powerful tools for genome engineering. Hyperactivated variants of the resolvase/invertase family of serine recombinases function without accessory factors, and thus can be re-targeted to sequences of interest by replacing native DNA-binding domains (DBDs) with engineered zinc-finger proteins (ZFPs). However, imperfect modularity with particular domains, lack of high-affinity binding to all DNA triplets, and difficulty in construction has hindered the widespread adoption of ZFPs in unspecialized laboratories. The discovery of a novel type of DBD in transcription activator-like effector (TALE) proteins from Xanthomonas provides an alternative to ZFPs. Here we describe chimeric TALE recombinases (TALERs): engineered fusions between a hyperactivated catalytic domain from the DNA invertase Gin and an optimized TALE architecture. We use a library of incrementally truncated TALE variants to identify TALER fusions that modify DNA with efficiency and specificity comparable to zinc-finger recombinases in bacterial cells. We also show that TALERs recombine DNA in mammalian cells. The TALER architecture described herein provides a platform for insertion of customized TALE domains, thus significantly expanding the targeting capacity of engineered recombinases and their potential applications in biotechnology and medicine.
Recombinant antibody mediated delivery of organelle-specific DNA pH sensors along endocytic pathways
NASA Astrophysics Data System (ADS)
Modi, Souvik; Halder, Saheli; Nizak, Clément; Krishnan, Yamuna
2013-12-01
DNA has been used to build nanomachines with potential in cellulo and in vivo applications. However their different in cellulo applications are limited by the lack of generalizable strategies to deliver them to precise intracellular locations. Here we describe a new molecular design of DNA pH sensors with response times that are nearly 20 fold faster. Further, by changing the sequence of the pH sensitive domain of the DNA sensor, we have been able to tune their pH sensitive regimes and create a family of DNA sensors spanning ranges from pH 4 to 7.6. To enable a generalizable targeting methodology, this new sensor design also incorporates a 'handle' domain. We have identified, using a phage display screen, a set of three recombinant antibodies (scFv) that bind sequence specifically to the handle domain. Sequence analysis of these antibodies revealed several conserved residues that mediate specific interactions with the cognate DNA duplex. We also found that all three scFvs clustered into different branches indicating that their specificity arises from mutations in key residues. When one of these scFvs is fused to a membrane protein (furin) that traffics via the cell surface, the scFv-furin chimera binds the 'handle' and ferries a family of DNA pH sensors along the furin endocytic pathway. Post endocytosis, all DNA nanodevices retain their functionality in cellulo and provide spatiotemporal pH maps of retrogradely trafficking furin inside living cells. This new molecular technology of DNA-scFv-protein chimeras can be used to site-specifically complex DNA nanostructures for bioanalytical applications. Electronic supplementary information (ESI) available: Detailed description of all oligonucleotide sequences used in this study; list of figures that support claims from the main text. Mainly these show sensor sequences, phage display results, scFv purification and binding data, cell images clamped at different pH and co-localization studies with endocytic tracers. See DOI: 10.1039/c3nr03769j
Sequence Discrimination by Alternatively Spliced Isoforms of a DNA Binding Zinc Finger Domain
NASA Astrophysics Data System (ADS)
Gogos, Joseph A.; Hsu, Tien; Bolton, Jesse; Kafatos, Fotis C.
1992-09-01
Two major developmentally regulated isoforms of the Drosophila chorion transcription factor CF2 differ by an extra zinc finger within the DNA binding domain. The preferred DNA binding sites were determined and are distinguished by an internal duplication of TAT in the site recognized by the isoform with the extra finger. The results are consistent with modular interactions between zinc fingers and trinucleotides and also suggest rules for recognition of AT-rich DNA sites by zinc finger proteins. The results show how modular finger interactions with trinucleotides can be used, in conjunction with alternative splicing, to alter the binding specificity and increase the spectrum of sites recognized by a DNA binding domain. Thus, CF2 may potentially regulate distinct sets of target genes during development.
2015-01-01
The protein MeCP2 mediates epigenetic regulation by binding methyl-CpG (mCpG) sites on chromatin. MeCP2 consists of six domains of which one, the methyl binding domain (MBD), binds mCpG sites in duplex DNA. We show that solution conditions with physiological or greater salt concentrations or the presence of nonspecific competitor DNA is necessary for the MBD to discriminate mCpG from CpG with high specificity. The specificity for mCpG over CpG is >100-fold under these solution conditions. In contrast, the MBD does not discriminate hydroxymethyl-CpG from CpG. The MBD is unusual among site-specific DNA binding proteins in that (i) specificity is not conferred by the enhanced affinity for the specific site but rather by suppression of its affinity for generic DNA, (ii) its specific binding to mCpG is highly electrostatic, and (iii) it takes up as well as displaces monovalent cations upon DNA binding. The MBD displays an unusually high affinity for single-stranded DNA independent of modification or sequence. In addition, the MBD forms a discrete dimer on DNA via a noncooperative binding pathway. Because the affinity of the second monomer is 1 order of magnitude greater than that of nonspecific binding, the MBD dimer is a unique molecular complex. The significance of these results in the context of neuronal function and development and MeCP2-related developmental disorders such as Rett syndrome is discussed. PMID:24828757
Evers, R; Smid, A; Rudloff, U; Lottspeich, F; Grummt, I
1995-03-15
Termination of mouse ribosomal gene transcription by RNA polymerase I (Pol I) requires the specific interaction of a DNA binding protein, mTTF-I, with an 18 bp sequence element located downstream of the rRNA coding region. Here we describe the molecular cloning and functional characterization of the cDNA encoding this transcription termination factor. Recombinant mTTF-I binds specifically to the murine terminator elements and terminates Pol I transcription in a reconstituted in vitro system. Deletion analysis has defined a modular structure of mTTF-I comprising a dispensable N-terminal half, a large C-terminal DNA binding region and an internal domain which is required for transcription termination. Significantly, the C-terminal region of mTTF-I reveals striking homology to the DNA binding domains of the proto-oncogene c-Myb and the yeast transcription factor Reb1p. Site-directed mutagenesis of one of the tryptophan residues that is conserved in the homology region of c-Myb, Reb1p and mTTF-I abolishes specific DNA binding, a finding which underscores the functional relevance of these residues in DNA-protein interactions. Images PMID:7720715
Measurements of nonlinear Hall-driven reconnection in the reversed field pinch
NASA Astrophysics Data System (ADS)
Tharp, Timothy D.
Complex organisms are able to develop because of the complex regulatory systems that control their gene expression. The first step in this regulation, transcription initiation, is controlled by transcription factors. Transcription factors are modular proteins composed of two distinct domains, the DNA binding domain and the regulatory domain. These molecules are involved in a plethora of important biological processes including embryogenesis, development, cell health, and cancer. Tissue enriched transcription factors Nkx-2.5 and Gata4 are involved in cardiac development and cardiac health. In this thesis the DNA binding specificity of Nkx-2.5 will be analyzed using a high throughput double stranded DNA platform called Cognate Site Identifier (CSI) arrays (Chapter 2). The full DNA binding specificity of Nkx-2.5 and Nkx-2.5 mutants will be visualized using Sequence Specificity Landscapes (SSLs). In Chapter 3, the definition of binding specificity will be investigated by evaluating a number of different DNA binding folds by CSI and SSLs. CSI and SSLs will also be used to evaluate different pyrrole/imidazole hairpin polyamides in order to better characterize these small molecule DNA binding domains. CSI and SSL data will be applied to the genome in order to explain the biological function of an artificial transcription factor. Chapter 4 will discuss the mechanism of nonspecific DNA binding. The historical means of predicting DNA binding will be challenged by utilizing high throughput experiments. The effect of salt concentration on both specific and nonspecific binding will also be investigated. Finally, in Chapter 5, a generation of Protein DNA Dimerizer (PDD) will be discussed. A PDD that regulates transcription on genomic DNA by binding cooperatively with the cardiac transcription factor Gata4 will be characterized.
These studies provide understanding of, and a means to control, how transcription factors sample the endless sea of DNA in the genome in order to regulate gene expression with such wonderful specificity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bach, Christian; Sherman, William; Pallis, Jani; ...
2014-01-01
Zinc finger nucleases (ZFNs) are associated with cell death and apoptosis by binding at countless undesired locations. This cytotoxicity is associated with the binding ability of engineered zinc finger domains to bind dissimilar DNA sequences with high affinity. In general, binding preferences of transcription factors are associated with significant degenerated diversity and complexity which convolutes the design and engineering of precise DNA binding domains. Evolutionary success of natural zinc finger proteins, however, evinces that nature created specific evolutionary traits and strategies, such as modularity and rank-specific recognition to cope with binding complexity that are critical for creating clinically viable tools to precisely modify the human genome. Our findings indicate preservation of general modularity and significant alteration of the rank-specific binding preferences of the three-finger binding domain of transcription factor SP1 when exchanging amino acids in the 2nd finger.
Peters, R; King, C Y; Ukiyama, E; Falsafi, S; Donahoe, P K; Weiss, M A
1995-04-11
SRY, a genetic "master switch" for male development in mammals, exhibits two biochemical activities: sequence-specific recognition of duplex DNA and sequence-independent binding to the sharp angles of four-way DNA junctions. Here, we distinguish between these activities by analysis of a mutant SRY associated with human sex reversal (46, XY female with pure gonadal dysgenesis). The substitution (I68T in human SRY) alters a nonpolar side chain in the minor-groove DNA recognition alpha-helix of the HMG box [Haqq, C.M., King, C.-Y., Ukiyama, E., Haqq, T.N., Falsafi, S., Donahoe, P.K., & Weiss, M.A. (1994) Science 266, 1494-1500]. The native (but not mutant) side chain inserts between specific base pairs in duplex DNA, interrupting base stacking at a site of induced DNA bending. Isotope-aided 1H-NMR spectroscopy demonstrates that analogous side-chain insertion occurs on binding of SRY to a four-way junction, establishing a shared mechanism of sequence- and structure-specific DNA binding. Although the mutant DNA-binding domain exhibits > 50-fold reduction in sequence-specific DNA recognition, near wild-type affinity for four-way junctions is retained. Our results (i) identify a shared SRY-DNA contact at a site of either induced or intrinsic DNA bending, (ii) demonstrate that this contact is not required to bind an intrinsically bent DNA target, and (iii) rationalize patterns of sequence conservation or diversity among HMG boxes. Clinical association of the I68T mutation with human sex reversal supports the hypothesis that specific DNA recognition by SRY is required for male sex determination.
IFI16 Preferentially Binds to DNA with Quadruplex Structure and Enhances DNA Quadruplex Formation.
Hároníková, Lucia; Coufal, Jan; Kejnovská, Iva; Jagelská, Eva B; Fojta, Miroslav; Dvořáková, Petra; Muller, Petr; Vojtesek, Borivoj; Brázda, Václav
2016-01-01
Interferon-inducible protein 16 (IFI16) is a member of the HIN-200 protein family, containing two HIN domains and one PYRIN domain. IFI16 acts as a sensor of viral and bacterial DNA and is important for innate immune responses. IFI16 binds DNA and binding has been described to be DNA length-dependent, but a preference for supercoiled DNA has also been demonstrated. Here we report a specific preference of IFI16 for binding to quadruplex DNA compared to other DNA structures. IFI16 binds to quadruplex DNA with significantly higher affinity than to the same sequence in double stranded DNA. By circular dichroism (CD) spectroscopy we also demonstrated the ability of IFI16 to stabilize quadruplex structures with quadruplex-forming oligonucleotides derived from human telomere (HTEL) sequences and the MYC promotor. A novel H/D exchange mass spectrometry approach was developed to assess protein interactions with quadruplex DNA. Quadruplex DNA changed the IFI16 deuteration profile in parts of the PYRIN domain (aa 0-80) and in structurally identical parts of both HIN domains (aa 271-302 and aa 586-617) compared to single stranded or double stranded DNAs, supporting the preferential affinity of IFI16 for structured DNA. Our results reveal the importance of quadruplex DNA structure in IFI16 binding and improve our understanding of how IFI16 senses DNA. IFI16 selectivity for quadruplex structure provides a mechanistic framework for IFI16 in immunity and cellular processes including DNA damage responses and cell proliferation.
The GAGA protein of Drosophila is phosphorylated by CK2.
Bonet, Carles; Fernández, Irene; Aran, Xavier; Bernués, Jordi; Giralt, Ernest; Azorín, Fernando
2005-08-19
The GAGA factor of Drosophila is a sequence-specific DNA-binding protein that contributes to multiple processes from the regulation of gene expression to the structural organisation of heterochromatin and chromatin remodelling. GAGA is known to interact with various other proteins (tramtrack, pipsqueak, batman and dSAP18) and protein complexes (PRC1, NURF and FACT). GAGA functions are likely regulated at the level of post-translational modifications. Little is known, however, about its actual pattern of modification. It was proposed that GAGA can be O-glycosylated. Here, we report that GAGA519 isoform is a phosphoprotein that is phosphorylated by CK2 at the region of the DNA-binding domain. Our results indicate that phosphorylation occurs at S388 and, to a lesser extent, at S378. These two residues are located in a region of the DNA-binding domain that makes no direct contact with DNA, being dispensable for sequence-specific recognition. Phosphorylation at these sites does not abolish DNA binding but reduces the affinity of the interaction. These results are discussed in the context of the various functions and interactions that GAGA supports.
Specificity determinants for the abscisic acid response element.
Sarkar, Aditya Kumar; Lahiri, Ansuman
2013-01-01
Abscisic acid (ABA) response elements (ABREs) are a group of cis-acting DNA elements that have been identified from promoter analysis of many ABA-regulated genes in plants. We are interested in understanding the mechanism of binding specificity between ABREs and a class of bZIP transcription factors known as ABRE binding factors (ABFs). In this work, we have modeled the homodimeric structure of the bZIP domain of ABRE binding factor 1 from Arabidopsis thaliana (AtABF1) and studied its interaction with ACGT core motif-containing ABRE sequences. We have also examined the variation in the stability of the protein-DNA complex upon mutating ABRE sequences using the protein design algorithm FoldX. The high throughput free energy calculations successfully predicted the ability of ABF1 to bind to alternative core motifs like GCGT or AAGT and also rationalized the role of the flanking sequences in determining the specificity of the protein-DNA interaction.
Structural Determinants of DNA Binding by a P. falciparum ApiAP2 Transcriptional Regulator
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lindner, Scott E.; De Silva, Erandi K.; Keck, James L.
2010-11-05
Putative transcription factors have only recently been identified in the Plasmodium spp., with the major family of regulators comprising the Apicomplexan Apetala2 (AP2) proteins. To better understand the DNA-binding mechanisms of these transcriptional regulators, we characterized the structure and in vitro function of an AP2 DNA-binding domain from a prototypical Apicomplexan AP2 protein, PF14_0633 from Plasmodium falciparum. The X-ray crystal structure of the PF14_0633 AP2 domain bound to DNA reveals a β-sheet fold that binds the DNA major groove through base-specific and backbone contacts; a prominent α-helix supports the β-sheet structure. Substitution of predicted DNA-binding residues with alanine weakened or eliminated DNA binding in solution. In contrast to plant AP2 domains, the PF14_0633 AP2 domain dimerizes upon binding to DNA through a domain-swapping mechanism in which the α-helices of the AP2 domains pack against the β-sheets of the dimer mates. DNA-induced dimerization of PF14_0633 may be important for tethering two distal DNA loci together in the nucleus and/or for inducing functional rearrangements of its domains to facilitate transcriptional regulation. Consistent with a multisite binding mode, at least two copies of the consensus sequence recognized by PF14_0633 are present upstream of a previously identified group of sporozoite-stage genes. Taken together, these findings illustrate how Plasmodium has adapted the AP2 DNA-binding domain for genome-wide transcriptional regulation.
Sauvé, Simon; Tremblay, Luc; Lavigne, Pierre
2004-09-17
Basic region-helix1-loop-helix2-leucine zipper (b/H(1)LH(2)/LZ) transcription factors bind specific DNA sequence in their target gene promoters as dimers. Max, a b/H(1)LH(2)/LZ transcription factor, is the obligate heterodimeric partner of the related b/H(1)LH(2)/LZ proteins of the Myc and Mad families. These heterodimers specifically bind E-box DNA sequence (CACGTG) to activate (e.g. c-Myc/Max) and repress (e.g. Mad1/Max) transcription. Max can also homodimerize and bind E-box sequences in c-Myc target gene promoters. While the X-ray structure of the Max b/H(1)LH(2)/LZ/DNA complex and that of others have been reported, the precise sequence of events leading to the reversible and specific binding of these important transcription factors is still largely unknown. In order to provide insights into the DNA binding mechanism, we have solved the NMR solution structure of a covalently homodimerized version of a Max b/H(1)LH(2)/LZ protein with two stabilizing mutations in the LZ, and characterized its backbone dynamics from (15)N spin-relaxation measurements in the absence of DNA. Apart from minor differences in the pitch of the LZ, possibly resulting from the mutations in the construct, we observe that the packing of the helices in the H(1)LH(2) domain is almost identical to that of the two crystal structures, indicating that no important conformational change in these helices occurs upon DNA binding. Conversely to the crystal structures of the DNA complexes, the first 14 residues of the basic region are found to be mostly unfolded while the loop is observed to be flexible. This indicates that these domains undergo conformational changes upon DNA binding. On the other hand, we find the last four residues of the basic region form a persistent helical turn contiguous to H(1). In addition, we provide evidence of the existence of internal motions in the backbone of H(1) that are of larger amplitude and longer time-scale (nanoseconds) than the ones in the H(2) and LZ domain. 
Most interestingly, we note that conformers in the ensemble of calculated structures have highly conserved basic residues (located in the persistent helical turn of the basic region and in the loop) known to be important for specific binding in a conformation that matches that of the DNA-bound state. These partially prefolded conformers can directly fit into the major groove of DNA and as such are proposed to lie on the pathway leading to the reversible and specific DNA binding. In these conformers, the conserved basic side-chains form a cluster that elevates the local electrostatic potential and could provide the necessary driving force for the generation of the internal motions localized in H(1), and therefore link structural determinants with the DNA binding function. Overall, our results suggest that the Max homodimeric b/H(1)LH(2)/LZ can rapidly and preferentially bind its DNA sequence through transient and partially prefolded states and subsequently adopt the fully helical bound state in a DNA-assisted or induced-fit mechanism.
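As a small illustration of the E-box recognition described in the abstract above, the following Python sketch scans a DNA string for the canonical E-box (CACGTG) and checks that the motif is its own reverse complement, which is why a symmetric dimer can read it from either strand. The function names and the example sequence are hypothetical and are not taken from the paper.

```python
# Minimal sketch: locate canonical E-box sites (CACGTG) in a DNA string.
# Function names and the example sequence are illustrative assumptions.

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def find_ebox_sites(sequence, motif="CACGTG"):
    """Return 0-based start positions of every occurrence of the motif."""
    seq = sequence.upper()
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions


if __name__ == "__main__":
    example = "ttCACGTGaaCACGTG"  # made-up sequence, not a real promoter
    print(find_ebox_sites(example))
    print(reverse_complement("CACGTG") == "CACGTG")
```

Because the motif is palindromic, scanning one strand is enough; a non-palindromic motif would also need a scan for its reverse complement.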
Fedoseeva, Daria M.; Sosin, Dmitri V.; Grachev, Sergei A.; Serebraykova, Marina V.; Romanenko, Svetlana A.; Vorobieva, Nadezhda V.; Kravatsky, Yuri V.
2013-01-01
Genome instability plays a key role in multiple biological processes and diseases, including cancer. Genome-wide mapping of DNA double-strand breaks (DSBs) is important for understanding both chromosomal architecture and specific chromosomal regions at DSBs. We developed a method for precise genome-wide mapping of blunt-ended DSBs in human chromosomes, and observed non-random fragmentation and DSB hot spots. These hot spots are scattered along chromosomes and delimit protected 50–250 kb DNA domains. We found that about 30% of the domains (denoted forum domains) possess coordinately expressed genes and that PARP1 and HNRNPA2B1 specifically bind DNA sequences at the forum domain termini. Thus, our data suggest a novel type of gene regulation: a coordinated transcription or silencing of gene clusters delimited by DSB hot spots as well as PARP1 and HNRNPA2B1 binding sites. PMID:23593027
Molecular Dynamics Simulations of DNA-Free and DNA-Bound TAL Effectors
Wan, Hua; Hu, Jian-ping; Li, Kang-shun; Tian, Xu-hong; Chang, Shan
2013-01-01
TAL (transcriptional activator-like) effectors (TALEs) are DNA-binding proteins, containing a modular central domain that recognizes specific DNA sequences. Recently, the crystallographic studies of TALEs revealed the structure of DNA-recognition domain. In this article, molecular dynamics (MD) simulations are employed to study two crystal structures of an 11.5-repeat TALE, in the presence and absence of DNA, respectively. The simulated results indicate that the specific binding of RVDs (repeat-variable diresidues) with DNA leads to the markedly reduced fluctuations of tandem repeats, especially at the two ends. In the DNA-bound TALE system, the base-specific interaction is formed mainly by the residue at position 13 within a TAL repeat. Tandem repeats with weak RVDs are unfavorable for the TALE-DNA binding. These observations are consistent with experimental studies. By using principal component analysis (PCA), the dominant motions are open-close movements between the two ends of the superhelical structure in both DNA-free and DNA-bound TALE systems. The open-close movements are found to be critical for the recognition and binding of TALE-DNA based on the analysis of free energy landscape (FEL). The conformational analysis of DNA indicates that the 5′ end of DNA target sequence has more remarkable structural deformability than the other sites. Meanwhile, the conformational change of DNA is likely associated with the specific interaction of TALE-DNA. We further suggest that the arrangement of N-terminal repeats with strong RVDs may help in the design of efficient TALEs. This study provides some new insights into the understanding of the TALE-DNA recognition mechanism. PMID:24130757
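The base-specific reading by the residue pair at positions 12 and 13 of each repeat (the RVD) can be sketched as a lookup table. The Python example below uses the four commonly cited RVD preferences (NI for A, HD for C, NG for T, NN for G, with NN also tolerating A); the function name and the example RVD list are hypothetical, and the sketch ignores RVD strength and the base preceding the target, both of which matter in practice.

```python
# Minimal sketch: predict the preferred DNA target of a TALE repeat array
# from its RVDs. Only the four commonly cited RVDs are included; NN is
# mapped to G here although it is known to accept A as well.
RVD_TO_BASE = {
    "NI": "A",
    "HD": "C",
    "NG": "T",
    "NN": "G",
}


def predict_target(rvds):
    """Translate a list of RVDs (one per repeat) into a DNA string."""
    return "".join(RVD_TO_BASE[rvd] for rvd in rvds)


if __name__ == "__main__":
    # Hypothetical four-repeat array, for illustration only.
    print(predict_target(["NI", "HD", "NG", "NN"]))
```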
The zinc fingers of YY1 bind single-stranded RNA with low sequence specificity.
Wai, Dorothy C C; Shihab, Manar; Low, Jason K K; Mackay, Joel P
2016-11-02
Classical zinc fingers (ZFs) are traditionally considered to act as sequence-specific DNA-binding domains. More recently, classical ZFs have been recognised as potential RNA-binding modules, raising the intriguing possibility that classical-ZF transcription factors are involved in post-transcriptional gene regulation via direct RNA binding. To date, however, only one classical ZF-RNA complex, that involving TFIIIA, has been structurally characterised. Yin Yang-1 (YY1) is a multi-functional transcription factor involved in many regulatory processes, and binds DNA via four classical ZFs. Recent evidence suggests that YY1 also interacts with RNA, but the molecular nature of the interaction remains unknown. In the present work, we directly assess the ability of YY1 to bind RNA using in vitro assays. Systematic Evolution of Ligands by EXponential enrichment (SELEX) was used to identify preferred RNA sequences bound by the YY1 ZFs from a randomised library over multiple rounds of selection. However, a strong motif was not consistently recovered, suggesting that the RNA sequence selectivity of these domains is modest. YY1 ZF residues involved in binding to single-stranded RNA were identified by NMR spectroscopy and found to be largely distinct from the set of residues involved in DNA binding, suggesting that interactions between YY1 and ssRNA constitute a separate mode of nucleic acid binding. Our data are consistent with recent reports that YY1 can bind to RNA in a low-specificity, yet physiologically relevant manner. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Evolutionary and biophysical relationships among the papillomavirus E2 proteins.
Blakaj, Dukagjin M; Fernandez-Fuentes, Narcis; Chen, Zigui; Hegde, Rashmi; Fiser, Andras; Burk, Robert D; Brenowitz, Michael
2009-01-01
Infection by human papillomavirus (HPV) may result in clinical conditions ranging from benign warts to invasive cancer. The HPV E2 protein represses oncoprotein transcription and is required for viral replication. HPV E2 binds to palindromic DNA sequences of highly conserved four base pair sequences flanking an identical length variable 'spacer'. E2 proteins directly contact the conserved but not the spacer DNA. Variation in naturally occurring spacer sequences results in differential protein affinity that is dependent on their sensitivity to the spacer DNA's unique conformational and/or dynamic properties. This article explores the biophysical character of this core viral protein with the goal of identifying characteristics that are associated with risk of virally caused malignancy. The amino acid sequence, 3D structure and electrostatic features of the E2 protein DNA binding domain are highly conserved; specific interactions with DNA binding sites have also been conserved. In contrast, the E2 protein's transactivation domain does not have extensive surfaces of highly conserved residues. Rather, regions of high conservation are localized to small surface patches. Implications for cancer biology are discussed.
Kemme, Catherine A; Esadze, Alexandre; Iwahara, Junji
2015-11-10
Functions of transcription factors require formation of specific complexes at particular sites in cis-regulatory elements of genes. However, chromosomal DNA contains numerous sites that are similar to the target sequences recognized by transcription factors. The influence of such “quasi-specific” sites on functions of the transcription factors is not well understood at present by experimental means. In this work, using fluorescence methods, we have investigated the influence of quasi-specific DNA sites on the efficiency of target location by the zinc finger DNA-binding domain of the inducible transcription factor Egr-1, which recognizes a 9 bp sequence. By stopped-flow assays, we measured the kinetics of Egr-1’s association with a target site on 143 bp DNA in the presence of various competitor DNAs, including nonspecific and quasi-specific sites. The presence of quasi-specific sites on competitor DNA significantly decelerated the target association by the Egr-1 protein. The impact of the quasi-specific sites depended strongly on their affinity, their concentration, and the degree of their binding to the protein. To quantitatively describe the kinetic impact of the quasi-specific sites, we derived an analytical form of the apparent kinetic rate constant for the target association and used it for fitting to the experimental data. Our kinetic data with calf thymus DNA as a competitor suggested that there are millions of high-affinity quasi-specific sites for Egr-1 among the 3 billion bp of genomic DNA. This study quantitatively demonstrates that naturally abundant quasi-specific sites on DNA can considerably impede the target search processes of sequence-specific DNA-binding proteins. PMID:26502071
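The qualitative effect described above (competitor sites slowing target association) can be illustrated with a textbook rapid-equilibrium approximation. The Python sketch below is not the analytical expression derived in the paper, which the abstract does not give. It assumes that competitor sites are in large excess over protein and equilibrate much faster than target binding, so that only the free fraction of protein, 1 / (1 + [C]/Kd), is available to find the target. All names and numbers are hypothetical.

```python
# Minimal sketch under stated assumptions: competitor sites in excess over
# protein and in rapid pre-equilibrium with it. This is a generic
# approximation, not the rate expression derived in the paper.


def free_protein_fraction(competitor_conc, competitor_kd):
    """Fraction of protein not sequestered by competitor sites.

    Both arguments must use the same concentration units.
    """
    return 1.0 / (1.0 + competitor_conc / competitor_kd)


def apparent_association_rate(k_on, competitor_conc, competitor_kd):
    """Target association rate constant scaled by the free protein fraction."""
    return k_on * free_protein_fraction(competitor_conc, competitor_kd)


if __name__ == "__main__":
    # Hypothetical values: intrinsic k_on in M^-1 s^-1, concentrations in M.
    for conc in (0.0, 1e-6, 1e-5):
        print(conc, apparent_association_rate(1e8, conc, 1e-6))
```

In this picture a tighter competitor (smaller Kd) or a higher site concentration both lower the apparent rate, which matches the dependence on affinity and concentration reported in the abstract.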
Characterization of the DNA binding properties of polyomavirus capsid protein
NASA Technical Reports Server (NTRS)
Chang, D.; Cai, X.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)
1993-01-01
The DNA binding properties of the polyomavirus structural proteins VP1, VP2, and VP3 were studied by Southwestern analysis. The major viral structural protein VP1 and host-contributed histone proteins of polyomavirus virions were shown to exhibit DNA binding activity, but the minor capsid proteins VP2 and VP3 failed to bind DNA. The N-terminal first five amino acids (Ala-1 to Lys-5) were identified as the VP1 DNA binding domain by genetic and biochemical approaches. Wild-type VP1 expressed in Escherichia coli (RK1448) exhibited DNA binding activity, but the N-terminal truncated VP1 mutants (lacking Ala-1 to Lys-5 and Ala-1 to Cys-11) failed to bind DNA. The synthetic peptide (Ala-1 to Cys-11) was also shown to have an affinity for DNA. Site-directed mutagenesis of the VP1 gene showed that the point mutations at Pro-2, Lys-3, and Arg-4 on the VP1 molecule did not affect DNA binding properties but that the point mutation at Lys-5 drastically reduced DNA binding affinity. The N-terminal (Ala-1 to Lys-5) region of VP1 was found to be essential and specific for DNA binding, while the binding itself appears to be non-sequence-specific. The DNA binding domain and the nuclear localization signal are located in the same N-terminal region.
BuD, a helix–loop–helix DNA-binding domain for genome modification
Stella, Stefano; Molina, Rafael; López-Méndez, Blanca; Juillerat, Alexandre; Bertonati, Claudia; Daboussi, Fayza; Campos-Olivas, Ramon; Duchateau, Phillippe; Montoya, Guillermo
2014-01-01
DNA editing offers new possibilities in synthetic biology and biomedicine for modulation or modification of cellular functions to organisms. However, inaccuracy in this process may lead to genome damage. To address this important problem, a strategy allowing specific gene modification has been achieved through the addition, removal or exchange of DNA sequences using customized proteins and the endogenous DNA-repair machinery. Therefore, the engineering of specific protein–DNA interactions in protein scaffolds is key to providing ‘toolkits’ for precise genome modification or regulation of gene expression. In a search for putative DNA-binding domains, BurrH, a protein that recognizes a 19 bp DNA target, was identified. Here, its apo and DNA-bound crystal structures are reported, revealing a central region containing 19 repeats of a helix–loop–helix modular domain (BurrH domain; BuD), which identifies the DNA target by a single residue-to-nucleotide code, thus facilitating its redesign for gene targeting. New DNA-binding specificities have been engineered in this template, showing that BuD-derived nucleases (BuDNs) induce high levels of gene targeting in a locus of the human haemoglobin β (HBB) gene close to mutations responsible for sickle-cell anaemia. Hence, the unique combination of high efficiency and specificity of the BuD arrays can push forward diverse genome-modification approaches for cell or organism redesign, opening new avenues for gene editing. PMID:25004980
Rivera-Cancel, Giomar; Motta-Mena, Laura B.; Gardner, Kevin H.
2012-01-01
Light-oxygen-voltage (LOV) domains serve as the photosensory modules for a wide range of plant and bacterial proteins, conferring blue light dependent regulation to effector activities as diverse as enzymes and DNA binding. LOV domains can also be engineered into a variety of exogenous targets, enabling similar regulation for new protein-based reagents. Common to these proteins is the ability of LOV domains to reversibly form a photochemical adduct between an internal flavin chromophore and the surrounding protein, using this to trigger conformational changes that affect output activity. Using the Erythrobacter litoralis protein EL222 model system which links LOV regulation to a helix-turn-helix (HTH) DNA binding domain, we demonstrated that the LOV domain binds and inhibits the HTH domain in the dark, releasing these interactions upon illumination [Nash et al. (2011) Proc. Natl. Acad. Sci. USA 108, 9449–9454]. Here we combine genomic and in vitro selection approaches to identify optimal DNA binding sites for EL222. Within the bacterial host, we observe binding to several genomic sites using a 12 bp sequence consensus that is also found by in vitro selection methods. Sequence-specific alterations in the DNA consensus reduce EL222-binding affinity in a manner consistent with the expected binding mode: a protein dimer binding to two repeats. Finally, we demonstrate the light-dependent activation of transcription of two genes adjacent to an EL222 binding site. Taken together, these results shed light on the native function of EL222 and provide useful reagents for further basic and applications research of this versatile protein. PMID:23205774
DOE Office of Scientific and Technical Information (OSTI.GOV)
Akabayov, B.; Lee, S; Akabayov, S
2009-01-01
Synthesis of oligoribonucleotide primers for lagging-strand DNA synthesis in the DNA replication system of bacteriophage T7 is catalyzed by the primase domain of the gene 4 helicase-primase. The primase consists of a zinc-binding domain (ZBD) and an RNA polymerase (RPD) domain. The ZBD is responsible for recognition of a specific sequence in the ssDNA template whereas catalytic activity resides in the RPD. The ZBD contains a zinc ion coordinated with four cysteine residues. We have examined the ligation state of the zinc ion by X-ray absorption spectroscopy and biochemical analysis of genetically altered primases. The ZBD of primase engaged in catalysis exhibits considerable asymmetry in coordination to zinc, as evidenced by a gradual increase in electron density of the zinc together with elongation of the zinc-sulfur bonds. Both wild-type primase and primase reconstituted from purified ZBD and RPD have a similar electronic change in the level of the zinc ion as well as the configuration of the ZBD. Single amino acid replacements in the ZBD (H33A and C36S) result in the loss of both zinc binding and its structural integrity. Thus the zinc in the ZBD may act as a charge modulation indicator for the surrounding sulfur atoms necessary for recognition of specific DNA sequences.
Common fold in helix–hairpin–helix proteins
Shao, Xuguang; Grishin, Nick V.
2000-01-01
Helix–hairpin–helix (HhH) is a widespread motif involved in non-sequence-specific DNA binding. The majority of HhH motifs function as DNA-binding modules, however, some of them are used to mediate protein–protein interactions or have acquired enzymatic activity by incorporating catalytic residues (DNA glycosylases). From sequence and structural analysis of HhH-containing proteins we conclude that most HhH motifs are integrated as a part of a five-helical domain, termed (HhH)2 domain here. It typically consists of two consecutive HhH motifs that are linked by a connector helix and displays pseudo-2-fold symmetry. (HhH)2 domains show clear structural integrity and a conserved hydrophobic core composed of seven residues, one residue from each α-helix and each hairpin, and deserves recognition as a distinct protein fold. In addition to known HhH in the structures of RuvA, RadA, MutY and DNA-polymerases, we have detected new HhH motifs in sterile alpha motif and barrier-to-autointegration factor domains, the α-subunit of Escherichia coli RNA-polymerase, DNA-helicase PcrA and DNA glycosylases. Statistically significant sequence similarity of HhH motifs and pronounced structural conservation argue for homology between (HhH)2 domains in different protein families. Our analysis helps to clarify how non-symmetric protein motifs bind to the double helix of DNA through the formation of a pseudo-2-fold symmetric (HhH)2 functional unit. PMID:10908318
Gabsalilow, Lilia; Schierling, Benno; Friedhoff, Peter; Pingoud, Alfred; Wende, Wolfgang
2013-04-01
Targeted genome engineering requires nucleases that introduce a highly specific double-strand break in the genome that is either processed by homology-directed repair in the presence of a homologous repair template or by non-homologous end-joining (NHEJ) that usually results in insertions or deletions. The error-prone NHEJ can be efficiently suppressed by 'nickases' that produce a single-strand break rather than a double-strand break. Highly specific nickases have been produced by engineering of homing endonucleases and more recently by modifying zinc finger nucleases (ZFNs) composed of a zinc finger array and the catalytic domain of the restriction endonuclease FokI. These ZF-nickases work as heterodimers in which one subunit has a catalytically inactive FokI domain. We present two different approaches to engineer highly specific nickases; both rely on the sequence-specific nicking activity of the DNA mismatch repair endonuclease MutH which we fused to a DNA-binding module, either a catalytically inactive variant of the homing endonuclease I-SceI or the DNA-binding domain of the TALE protein AvrBs4. The fusion proteins nick strand specifically a bipartite recognition sequence consisting of the MutH and the I-SceI or TALE recognition sequences, respectively, with a more than 1000-fold preference over a stand-alone MutH site. TALE-MutH is a programmable nickase.
Zhang, Lu; Xu, Jinhao; Ma, Jinbiao
2016-07-25
RNA-binding proteins exert important biological functions by specifically recognizing RNA motifs. SELEX (systematic evolution of ligands by exponential enrichment), an in vitro selection method, can obtain consensus motifs with high affinity and specificity for many target molecules from DNA or RNA libraries. Here, we combined SELEX with next-generation sequencing to study protein-RNA interactions in vitro. A pool of RNAs with 20 bp random sequences was transcribed from a T7 promoter, and the target protein was inserted into a plasmid containing an SBP tag, which can be captured by streptavidin beads. After only one cycle, the specific RNA motif can be obtained, which dramatically improves the selection efficiency. Using this method, we found that the human hnRNP A1 RRMs domain (UP1 domain) bound RNA motifs containing AGG and AG sequences. An EMSA experiment indicated that hnRNP A1 RRMs could bind the obtained RNA motif. Taken together, this method provides a rapid and effective way to study the RNA binding specificity of proteins.
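The motif-calling step that follows sequencing in a SELEX experiment can be illustrated with a simple k-mer enrichment count. The sketch below is not the authors' pipeline; the function names, the pseudocount, and the toy reads are assumptions made purely for illustration. It compares how often each k-mer occurs in a selected pool relative to a background (input) pool.

```python
from collections import Counter

def kmer_counts(reads, k):
    """Count every overlapping k-mer across a list of sequences."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts

def kmer_enrichment(selected, background, k, pseudocount=1.0):
    """Return {kmer: frequency in selected pool / frequency in background pool}.

    A pseudocount (an assumption of this sketch) avoids division by zero
    for k-mers that are absent from one of the pools.
    """
    sel = kmer_counts(selected, k)
    bg = kmer_counts(background, k)
    kmers = set(sel) | set(bg)
    sel_total = sum(sel.values()) + pseudocount * len(kmers)
    bg_total = sum(bg.values()) + pseudocount * len(kmers)
    return {
        kmer: ((sel[kmer] + pseudocount) / sel_total)
        / ((bg[kmer] + pseudocount) / bg_total)
        for kmer in kmers
    }

# Hypothetical toy reads, not data from the study.
selected_reads = ["UAGGGA", "CAGGUU", "AAGGCA"]
background_reads = ["UCUCUA", "GACUCA", "CUUGAC"]
scores = kmer_enrichment(selected_reads, background_reads, k=3)
top = max(scores, key=scores.get)
```

With real data the pools would contain many thousands of reads, and a statistical test would normally replace the raw ratio.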
Solution structure of telomere binding domain of AtTRB2 derived from Arabidopsis thaliana
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yun, Ji-Hye; Lee, Won Kyung; Kim, Heeyoun
Highlights: • We have determined solution structure of Myb domain of AtTRB2. • The Myb domain of AtTRB2 is located in the N-terminal region. • The Myb domain of AtTRB2 binds to plant telomeric DNA without fourth helix. • Helix 2 and 3 of the Myb domain of AtTRB2 are involved in DNA recognition. • AtTRB2 is a novel protein distinguished from other known plant TBP. - Abstract: Telomere homeostasis is regulated by telomere-associated proteins, and the Myb domain is well conserved for telomere binding. AtTRB2 is a member of the SMH (Single-Myb-Histone)-like family in Arabidopsis thaliana, having an N-terminal Myb domain, which is responsible for DNA binding. The Myb domain of AtTRB2 contains three α-helices and loops for DNA binding, which is unusual given that other plant telomere-binding proteins have an additional fourth helix that is essential for DNA binding. To understand the structural role for telomeric DNA binding of AtTRB2, we determined the solution structure of the Myb domain of AtTRB2 (AtTRB2(1–64)) using nuclear magnetic resonance (NMR) spectroscopy. In addition, the inter-molecular interaction between AtTRB2(1–64) and telomeric DNA has been characterized by the electrophoretic mobility shift assay (EMSA) and NMR titration analyses for both plant (TTTAGGG)n and human (TTAGGG)n telomere sequences. Data revealed that Trp28, Arg29, and Val47 residues located in Helix 2 and Helix 3 are crucial for DNA binding, which are well conserved among other plant telomere binding proteins. We concluded that although AtTRB2 is devoid of the additional fourth helix in the Myb-extension domain, it is able to bind to plant telomeric repeat sequences as well as human telomeric repeat sequences.
p53 Specifically Binds Triplex DNA In Vitro and in Cells
Brázdová, Marie; Tichý, Vlastimil; Helma, Robert; Bažantová, Pavla; Polášková, Alena; Krejčí, Aneta; Petr, Marek; Navrátilová, Lucie; Tichá, Olga; Nejedlý, Karel; Bennink, Martin L.; Subramaniam, Vinod; Bábková, Zuzana; Martínek, Tomáš; Lexa, Matej; Adámik, Matej
2016-01-01
Triplex DNA is implicated in a wide range of biological activities, including regulation of gene expression and genomic instability leading to cancer. The tumor suppressor p53 is a central regulator of cell fate in response to different types of insults. Sequence- and structure-specific modes of DNA recognition are core attributes of the p53 protein. The focus of this work is the structure-specific binding of p53 to DNA containing triplex-forming sequences in vitro and in cells and the effect on p53-driven transcription. This is the first DNA binding study of full-length p53 and its deletion variants to both intermolecular and intramolecular T.A.T triplexes. We demonstrate that the interaction of p53 with intermolecular T.A.T triplex is comparable to the recognition of CTG-hairpin non-B DNA structure. Using deletion mutants we determined the C-terminal DNA binding domain of p53 to be crucial for triplex recognition. Furthermore, strong p53 recognition of intramolecular T.A.T triplexes (H-DNA), stabilized by negative superhelicity in plasmid DNA, was detected by competition and immunoprecipitation experiments, and visualized by AFM. Moreover, chromatin immunoprecipitation revealed p53 binding to a T.A.T-forming sequence in vivo. Enhanced reporter transactivation by p53 on insertion of a triplex-forming sequence into a plasmid with the p53 consensus sequence was observed by luciferase reporter assays. An in silico scan of human regulatory regions for the simultaneous presence of both the consensus sequence and T.A.T motifs identified a set of candidate p53 target genes, and p53-dependent activation of several of them (ABCG5, ENOX1, INSR, MCC, NFAT5) was confirmed by RT-qPCR. Our results show that the T.A.T triplex comprises a new class of p53 binding sites targeted by p53 in a DNA structure-dependent mode in vitro and in cells. The contribution of p53 DNA structure-dependent binding to the regulation of transcription is discussed. PMID:27907175
Molecular dynamics studies on the DNA-binding process of ERG.
Beuerle, Matthias G; Dufton, Neil P; Randi, Anna M; Gould, Ian R
2016-11-15
The ETS family of transcription factors regulate gene targets by binding to a core GGAA DNA-sequence. The ETS factor ERG is required for homeostasis and lineage-specific functions in endothelial cells, some subset of haemopoietic cells and chondrocytes; its ectopic expression is linked to oncogenesis in multiple tissues. To date details of the DNA-binding process of ERG including DNA-sequence recognition outside the core GGAA-sequence are largely unknown. We combined available structural and experimental data to perform molecular dynamics simulations to study the DNA-binding process of ERG. In particular we were able to reproduce the ERG DNA-complex with a DNA-binding simulation starting in an unbound configuration with a final root-mean-square-deviation (RMSD) of 2.1 Å to the core ETS domain DNA-complex crystal structure. This allowed us to elucidate the relevance of amino acids involved in the formation of the ERG DNA-complex and to identify Arg385 as a novel key residue in the DNA-binding process. Moreover we were able to show that water-mediated hydrogen bonds are present between ERG and DNA in our simulations and that those interactions have the potential to achieve sequence recognition outside the GGAA core DNA-sequence. The methodology employed in this study shows the promising capabilities of modern molecular dynamics simulations in the field of protein DNA-interactions.
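The root-mean-square deviation quoted above is a standard measure of how closely two sets of atomic coordinates agree. As a minimal sketch, and assuming the two structures have already been superimposed and list the same atoms in the same order (the superposition step itself is not shown), the RMSD in Å can be computed as follows. The function name and the toy coordinates are illustrative and are not taken from the study.

```python
import math

def rmsd(coords_a, coords_b):
    """RMSD between two equally long lists of (x, y, z) tuples.

    Assumes the coordinate sets are already superimposed and that atom i
    in coords_a corresponds to atom i in coords_b.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))

# Hypothetical coordinates: every atom displaced by 2 Å along z.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
simulated = [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0)]
value = rmsd(reference, simulated)
```

In practice, trajectory-analysis packages perform an optimal rigid-body fit before evaluating this formula.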
Puranik, Swati; Kumar, Karunesh; Srivastava, Prem S; Prasad, Manoj
2011-10-01
The NAC (NAM/ATAF1,2/CUC2) proteins constitute one of the largest families of plant transcription factors. Its members have been associated with diverse plant processes and intricately regulate the expression of several genes. In spite of this immense progress, knowledge of their DNA-binding properties is still limited. In our recent publication,1 we reported isolation of a membrane-associated NAC domain protein from Setaria italica (SiNAC). Transactivation analysis revealed that it was a functionally active transcription factor as it could stimulate expression of reporter genes in vivo. Truncations of the transmembrane region of the protein led to its nuclear localization. Here we describe expression and purification of the SiNAC DNA-binding domain. We further report identification of a novel DNA-binding site, [C/G][A/T][T/A][G/C]TC[C/G][A/T][C/G][G/C], for SiNAC by electrophoretic mobility shift assay. The SiNAC-GST protein could bind to the NAC recognition sequence in vitro as well as to sequences where some bases had been reshuffled. The results presented here contribute to our understanding of the DNA-binding specificity of the SiNAC protein.
Puranik, Swati; Kumar, Karunesh; Srivastava, Prem S
2011-01-01
The NAC (NAM/ATAF1,2/CUC2) proteins constitute one of the largest families of plant transcription factors. Its members have been associated with diverse plant processes and intricately regulate the expression of several genes. In spite of this immense progress, knowledge of their DNA-binding properties is still limited. In our recent publication,1 we reported isolation of a membrane-associated NAC domain protein from Setaria italica (SiNAC). Transactivation analysis revealed that it was a functionally active transcription factor as it could stimulate expression of reporter genes in vivo. Truncation of the transmembrane region of the protein led to its nuclear localization. Here we describe expression and purification of the SiNAC DNA-binding domain. We further report identification of a novel DNA-binding site, [C/G][A/T][T/A][G/C]TC[C/G][A/T][C/G][G/C], for SiNAC by electrophoretic mobility shift assay. The SiNAC-GST protein could bind to the NAC recognition sequence in vitro as well as to sequences where some bases had been reshuffled. The results presented here contribute to our understanding of the DNA-binding specificity of the SiNAC protein. PMID:21918373
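The degenerate binding site reported for SiNAC is written with bracketed alternatives, which maps almost directly onto a regular expression. The following is a minimal sketch of how a sequence could be scanned for that site; the helper name and the test sequence are hypothetical, and only the strand supplied is searched.

```python
import re

# Site notation as given in the abstract; each [X/Y] means X or Y.
SINAC_SITE = "[C/G][A/T][T/A][G/C]TC[C/G][A/T][C/G][G/C]"

def find_sites(sequence, site=SINAC_SITE):
    """Return (0-based start, matched text) for every site occurrence.

    Removing the slashes turns '[C/G]' into the regex class '[CG]'.
    A lookahead is used so that overlapping matches are also reported.
    """
    pattern = re.compile("(?=(" + site.replace("/", "") + "))")
    return [(m.start(), m.group(1)) for m in pattern.finditer(sequence.upper())]

# Hypothetical probe sequence containing one match.
hits = find_sites("AACATGTCCACGTT")
```

Scanning the reverse complement as well would be needed to cover both DNA strands.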
Dissecting the protein architecture of DNA-binding transcription factors in bacteria and archaea.
Rivera-Gómez, Nancy; Martínez-Núñez, Mario Alberto; Pastor, Nina; Rodriguez-Vazquez, Katya; Perez-Rueda, Ernesto
2017-08-01
Gene regulation at the transcriptional level is a central process in all organisms where DNA-binding transcription factors play a fundamental role. This class of proteins binds specifically at DNA sequences, activating or repressing gene expression as a function of the cell's metabolic status, operator context and ligand-binding status, among other factors, through the DNA-binding domain (DBD). In addition, TFs may contain partner domains (PaDos), which are involved in ligand binding and protein-protein interactions. In this work, we systematically evaluated the distribution, abundance and domain organization of DNA-binding TFs in 799 non-redundant bacterial and archaeal genomes. We found that the distributions of the DBDs and their corresponding PaDos correlated with the size of the genome. We also identified specific combinations between the DBDs and their corresponding PaDos. Within each class of DBDs there are differences in the actual angle formed at the dimerization interface, responding to the presence/absence of ligands and/or crystallization conditions, setting the orientation of the resulting helices and wings facing the DNA. Our results highlight the importance of PaDos as central elements that enhance the diversity of regulatory functions in all bacterial and archaeal organisms, and our results also demonstrate the role of PaDos in sensing diverse signal compounds. The highly specific interactions between DBDs and PaDos observed in this work, together with our structural analysis highlighting the difficulty in predicting both inter-domain geometry and quaternary structure, suggest that these systems appeared once and evolved with diverse duplication events in all the analysed organisms.
Replication of damaged DNA in vitro is blocked by p53
Zhou, Jianmin; Prives, Carol
2003-01-01
The tumor suppressor protein p53 may have other roles and functions in addition to its well-documented ability to serve as a sequence-specific transcriptional activator in response to DNA damage. We showed previously that p53 can block the replication of polyomavirus origin-containing DNA (Py ori-DNA) in vitro when p53 binding sites are present on the late side of the Py ori. Here we have both further extended these observations and have also examined whether p53 might be able to bind directly to and inhibit the replication of damaged DNA. We found that p53 strongly inhibits replication of γ-irradiated Py ori-DNA and such inhibition requires both the central DNA binding domain and the extreme C-terminus of the p53 protein. An endogenous p53 binding site lies within the Py origin and is required for the ability of p53 to block initiation of replication from γ-irradiated Py ori-DNA, suggesting the possibility of DNA looping caused by p53 binding both non-specifically to sites of DNA damage and specifically to the endogenous site in the polyomavirus origin. Our results thus suggest the possibility that under some circumstances p53 might serve as a direct regulator of DNA replication and suggest as well an additional function for cooperation between its two autonomous DNA binding domains. PMID:12853603
Structure of 5-hydroxymethylcytosine-specific restriction enzyme, AbaSI, in complex with DNA.
Horton, John R; Borgaro, Janine G; Griggs, Rose M; Quimby, Aine; Guan, Shengxi; Zhang, Xing; Wilson, Geoffrey G; Zheng, Yu; Zhu, Zhenyu; Cheng, Xiaodong
2014-07-01
AbaSI, a member of the PvuRts1I-family of modification-dependent restriction endonucleases, cleaves deoxyribonucleic acid (DNA) containing 5-hydroxymethylcytosine (5hmC) and glucosylated 5hmC (g5hmC), but not DNA containing unmodified cytosine. AbaSI has been used as a tool for mapping the genomic locations of 5hmC, an important epigenetic modification in the DNA of higher organisms. Here we report the crystal structures of AbaSI in the presence and absence of DNA. These structures provide considerable, although incomplete, insight into how this enzyme acts. AbaSI appears to be mainly a homodimer in solution, but interacts with DNA in our structures as a homotetramer. Each AbaSI subunit comprises an N-terminal, Vsr-like, cleavage domain containing a single catalytic site, and a C-terminal, SRA-like, 5hmC-binding domain. Two N-terminal helices mediate most of the homodimer interface. Dimerization brings together the two catalytic sites required for double-strand cleavage, and separates the 5hmC binding-domains by ∼70 Å, consistent with the known activity of AbaSI which cleaves DNA optimally between symmetrically modified cytosines ∼22 bp apart. The eukaryotic SET and RING-associated (SRA) domains bind to DNA containing 5-methylcytosine (5mC) in the hemi-methylated CpG sequence. They make contacts in both the major and minor DNA grooves, and flip the modified cytosine out of the helix into a conserved binding pocket. In contrast, the SRA-like domain of AbaSI, which has no sequence specificity, contacts only the minor DNA groove, and in our current structures the 5hmC remains intra-helical. A conserved binding pocket is nevertheless present in this domain, suitable for accommodating 5hmC and g5hmC. We consider it likely, therefore, that base-flipping is part of the recognition and cleavage mechanism of AbaSI, but that our structures represent an earlier, pre-flipped stage, prior to actual recognition. © The Author(s) 2014.
Published by Oxford University Press on behalf of Nucleic Acids Research.
Structure of 5-hydroxymethylcytosine-specific restriction enzyme, AbaSI, in complex with DNA
DOE Office of Scientific and Technical Information (OSTI.GOV)
Horton, John R.; Borgaro, Janine G.; Griggs, Rose M.
2014-07-03
AbaSI, a member of the PvuRts1I-family of modification-dependent restriction endonucleases, cleaves DNA containing 5-hydroxymethylcytosine (5hmC) and glucosylated 5hmC (g5hmC), but not DNA containing unmodified cytosine. AbaSI has been used as a tool for mapping the genomic locations of 5hmC, an important epigenetic modification in the DNA of higher organisms. Here we report the crystal structures of AbaSI in the presence and absence of DNA. These structures provide considerable, although incomplete, insight into how this enzyme acts. AbaSI appears to be mainly a homodimer in solution, but interacts with DNA in our structures as a homotetramer. Each AbaSI subunit comprises an N-terminal, Vsr-like, cleavage domain containing a single catalytic site, and a C-terminal, SRA-like, 5hmC-binding domain. Two N-terminal helices mediate most of the homodimer interface. Dimerization brings together the two catalytic sites required for double-strand cleavage, and separates the 5hmC binding-domains by ~70 Å, consistent with the known activity of AbaSI which cleaves DNA optimally between symmetrically modified cytosines ~22 bp apart. The eukaryotic SET and RING-associated (SRA) domains bind to DNA containing 5-methylcytosine (5mC) in the hemi-methylated CpG sequence. They make contacts in both the major and minor DNA grooves, and flip the modified cytosine out of the helix into a conserved binding pocket. In contrast, the SRA-like domain of AbaSI, which has no sequence specificity, contacts only the minor DNA groove, and in our current structures the 5hmC remains intra-helical. A conserved binding pocket is nevertheless present in this domain, suitable for accommodating 5hmC and g5hmC. We consider it likely, therefore, that base-flipping is part of the recognition and cleavage mechanism of AbaSI, but that our structures represent an earlier, pre-flipped stage, prior to actual recognition.
The Replication Focus Targeting Sequence (RFTS) Domain Is a DNA-competitive Inhibitor of Dnmt1
DOE Office of Scientific and Technical Information (OSTI.GOV)
Syeda, Farisa; Fagan, Rebecca L.; Wean, Matthew
Dnmt1 (DNA methyltransferase 1) is the principal enzyme responsible for maintenance of cytosine methylation at CpG dinucleotides in the mammalian genome. The N-terminal replication focus targeting sequence (RFTS) domain of Dnmt1 has been implicated in subcellular localization, protein association, and catalytic function. However, progress in understanding its function has been limited by the lack of assays for and a structure of this domain. Here, we show that the naked DNA- and polynucleosome-binding activities of Dnmt1 are inhibited by the RFTS domain, which functions by virtue of binding the catalytic domain to the exclusion of DNA. Kinetic analysis with a fluorogenic DNA substrate established the RFTS domain as a 600-fold inhibitor of Dnmt1 enzymatic activity. The crystal structure of the RFTS domain reveals a novel fold and supports a mechanism in which an RFTS-targeted Dnmt1-binding protein, such as Uhrf1, may activate Dnmt1 for DNA binding.
Protein Cofactors Are Essential for High-Affinity DNA Binding by the Nuclear Factor κB RelA Subunit.
Mulero, Maria Carmen; Shahabi, Shandy; Ko, Myung Soo; Schiffer, Jamie M; Huang, De-Bin; Wang, Vivien Ya-Fan; Amaro, Rommie E; Huxford, Tom; Ghosh, Gourisankar
2018-05-22
Transcription activator proteins typically contain two functional domains: a DNA binding domain (DBD) that binds to DNA with sequence specificity and an activation domain (AD) whose established function is to recruit RNA polymerase. In this report, we show that purified recombinant nuclear factor κB (NF-κB) RelA dimers bind specific κB DNA sites with an affinity significantly lower than that of the same dimers from nuclear extracts of activated cells, suggesting that additional nuclear cofactors might facilitate DNA binding by the RelA dimers. Additionally, recombinant RelA binds DNA with relatively low affinity at a physiological salt concentration in vitro. The addition of p53 or RPS3 (ribosomal protein S3) increases RelA:DNA binding affinity 2- to >50-fold depending on the protein and ionic conditions. These cofactor proteins do not form stable ternary complexes, suggesting that they stabilize the RelA:DNA complex through dynamic interactions. Surprisingly, the RelA-DBD alone fails to bind DNA under the same solution conditions even in the presence of cofactors, suggesting an important role of the RelA-AD in DNA binding. Reduced RelA:DNA binding at a physiological ionic strength suggests that multiple cofactors might be acting simultaneously to mitigate the electrolyte effect and stabilize the RelA:DNA complex in vivo. Overall, our observations suggest that the RelA-AD and multiple cofactor proteins function cooperatively to prime the RelA-DBD and stabilize the RelA:DNA complex in cells. Our study provides a mechanism for nuclear cofactor proteins in NF-κB-dependent gene regulation.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Agarkar, Vinod B.; Babayeva, Nigar D.; Rizzino, Angie
2010-10-08
Ets proteins are transcription factors that activate or repress the expression of genes that are involved in various biological processes, including cellular proliferation, differentiation, development, transformation and apoptosis. Like other Ets-family members, Elf3 functions as a sequence-specific DNA-binding transcriptional factor. A mouse Elf3 C-terminal fragment (amino-acid residues 269-371) containing the DNA-binding domain has been crystallized in complex with mouse type II TGF-β receptor promoter (TR-II) DNA. The crystals belonged to space group P2₁2₁2₁, with unit-cell parameters a = 42.66, b = 52, c = 99.78 Å, and diffracted to a resolution of 2.2 Å.
The identification of FANCD2 DNA binding domains reveals nuclear localization sequences.
Niraj, Joshi; Caron, Marie-Christine; Drapeau, Karine; Bérubé, Stéphanie; Guitton-Sert, Laure; Coulombe, Yan; Couturier, Anthony M; Masson, Jean-Yves
2017-08-21
Fanconi anemia (FA) is a recessive genetic disorder characterized by congenital abnormalities, progressive bone-marrow failure, and cancer susceptibility. The FA pathway consists of at least 21 FANC genes (FANCA-FANCV), and the encoded protein products interact in a common cellular pathway to gain resistance against DNA interstrand crosslinks. After DNA damage, FANCD2 is monoubiquitinated and accumulates on chromatin. FANCD2 plays a central role in the FA pathway, using as yet unidentified DNA binding regions. By using synthetic peptide mapping and a DNA binding screen by electromobility shift assays, we found that FANCD2 bears two major DNA binding domains predominantly consisting of evolutionarily conserved lysine residues. Furthermore, one domain at the N-terminus of FANCD2 also bears nuclear localization sequences for the protein. Mutations in the bifunctional DNA binding/NLS domain lead to a reduction in FANCD2 monoubiquitination and an increase in mitomycin C sensitivity. Such phenotypes are not fully rescued by fusion with a heterologous NLS, which enables separation of the DNA binding and nuclear import functions within this domain that are necessary for FANCD2 functions. Collectively, our results highlight the importance of DNA binding and NLS residues in FANCD2 for activating an efficient FA pathway. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
The physical size of transcription factors is key to transcriptional regulation in chromatin domains
NASA Astrophysics Data System (ADS)
Maeshima, Kazuhiro; Kaizu, Kazunari; Tamura, Sachiko; Nozaki, Tadasu; Kokubo, Tetsuro; Takahashi, Koichi
2015-02-01
Genetic information, which is stored in the long strand of genomic DNA as chromatin, must be scanned and read out by various transcription factors. First, gene-specific transcription factors, which are relatively small (~50 kDa), scan the genome and bind regulatory elements. Such factors then recruit general transcription factors, Mediators, RNA polymerases, nucleosome remodellers, and histone modifiers, most of which are large protein complexes of 1-3 MDa in size. Here, we propose a new model for the functional significance of the size of transcription factors (or complexes) for gene regulation of chromatin domains. Recent findings suggest that chromatin consists of irregularly folded nucleosome fibres (10 nm fibres) and forms numerous condensed domains (e.g., topologically associating domains). Although the flexibility and dynamics of chromatin allow repositioning of genes within the condensed domains, the size exclusion effect of the domain may limit accessibility of DNA sequences by transcription factors. We used Monte Carlo computer simulations to determine the physical size limit of transcription factors that can enter condensed chromatin domains. Small gene-specific transcription factors can penetrate into the chromatin domains and search their target sequences, whereas large transcription complexes cannot enter the domain. Due to this property, once a large complex binds its target site via gene-specific factors it can act as a ‘buoy’ to keep the target region on the surface of the condensed domain and maintain transcriptional competency. This size-dependent specialization of target-scanning and surface-tethering functions could provide novel insight into the mechanisms of various DNA transactions, such as DNA replication and repair/recombination.
Oda, Masako; Kanoh, Yutaka; Watanabe, Yoshihisa; Masai, Hisao
2012-01-01
Replication timing of metazoan DNA during S-phase may be determined by many factors including chromosome structures, nuclear positioning, patterns of histone modifications, and transcriptional activity. It may be determined by Mb-domain structures, termed "replication domains", and recent findings indicate that replication timing is under developmental and cell type-specific regulation. We examined replication timing on the human 5q23/31 3.5-Mb segment in T cells and non-T cells. We used two independent methods to determine replication timing. One is quantification of nascent replicating DNA in cell cycle-fractionated stage-specific S phase populations. The other is FISH analysis of replication foci. Although the locations of early- and late-replicating domains were common between the two cell lines, the timing transition region (TTR) between early and late domains was offset by 200 kb. We show that Special AT-rich sequence Binding protein 1 (SATB1), specifically expressed in T cells, binds to the early domain immediately adjacent to the TTR and delays the replication timing of the TTR. Measurement of the chromosome copy number along the TTR during synchronized S phase suggests that the fork movement may be slowed down by SATB1. Our results reveal a novel role of SATB1 in cell type-specific regulation of replication timing along the chromosome.
Flexible DNA binding of the BTB/POZ-domain protein FBI-1.
Pessler, Frank; Hernandez, Nouria
2003-08-01
POZ-domain transcription factors are characterized by the presence of a protein-protein interaction domain called the POZ or BTB domain at their N terminus and zinc fingers at their C terminus. Despite the large number of POZ-domain transcription factors that have been identified to date and the significant insights that have been gained into their cellular functions, relatively little is known about their DNA binding properties. FBI-1 is a BTB/POZ-domain protein that has been shown to modulate HIV-1 Tat trans-activation and to repress transcription of some cellular genes. We have used various viral and cellular FBI-1 binding sites to characterize the interaction of a POZ-domain protein with DNA in detail. We find that FBI-1 binds to inverted sequence repeats downstream of the HIV-1 transcription start site. Remarkably, it binds efficiently to probes carrying these repeats in various orientations and spacings with no particular rotational alignment, indicating that its interaction with DNA is highly flexible. Indeed, FBI-1 binding sites in the adenovirus 2 major late promoter, the c-fos gene, and the c-myc P1 and P2 promoters reveal variously spaced direct, inverted, and everted sequence repeats with the consensus sequence G(A/G)GGG(T/C)(C/T)(T/C)(C/T) for each repeat.
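Because the FBI-1 repeats occur in direct, inverted, and everted arrangements, a search for the consensus has to consider both DNA strands. The sketch below shows one way to do this with a regular expression and a reverse complement; the function names and the example sequence are hypothetical, and the consensus is the one quoted in the abstract.

```python
import re

# G(A/G)GGG(T/C)(C/T)(T/C)(C/T) written as a regex; the lookahead
# allows overlapping matches to be reported.
FBI1_REPEAT = re.compile(r"(?=(G[AG]GGG[TC][CT][TC][CT]))")
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]

def find_repeats(seq):
    """Return sorted (start, strand, match) tuples for consensus hits.

    'start' is the 0-based position of the leftmost base of the hit on
    the supplied (top) strand; 'match' is read 5'->3' on the hit strand.
    """
    seq = seq.upper()
    hits = [(m.start(), "+", m.group(1)) for m in FBI1_REPEAT.finditer(seq)]
    for m in FBI1_REPEAT.finditer(reverse_complement(seq)):
        start = len(seq) - m.start() - len(m.group(1))
        hits.append((start, "-", m.group(1)))
    return sorted(hits)

# Hypothetical probe: one repeat on each strand (an inverted arrangement).
probe = "GAGGGTCTC" + "AAAA" + reverse_complement("GGGGGCTCT")
repeats = find_repeats(probe)
```

Two hits on the same strand would correspond to a direct repeat, and hits on opposite strands to an inverted or everted arrangement depending on their relative order.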
The Runt domain of AML1 (RUNX1) binds a sequence-conserved RNA motif that mimics a DNA element.
Fukunaga, Junichi; Nomura, Yusuke; Tanaka, Yoichiro; Amano, Ryo; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Sakamoto, Taiichi; Kozu, Tomoko
2013-07-01
AML1 (RUNX1) is a key transcription factor for hematopoiesis that binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. Aberrations in the AML1 gene are frequently found in human leukemia. To better understand AML1 and its potential utility for diagnosis and therapy, we obtained RNA aptamers that bind specifically to the AML1 Runt domain. Enzymatic probing and NMR analyses revealed that Apt1-S, which is a truncated variant of one of the aptamers, has a CACG tetraloop and two stem regions separated by an internal loop. All the isolated aptamers were found to contain the conserved sequence motif 5'-NNCCAC-3' and 5'-GCGMGN'N'-3' (M:A or C; N and N' form Watson-Crick base pairs). The motif contains one AC mismatch and one base bulged out. Mutational analysis of Apt1-S showed that three guanines of the motif are important for Runt binding as are the three guanines of RDE, which are directly recognized by three arginine residues of the Runt domain. Mutational analyses of the Runt domain revealed that the amino acid residues used for Apt1-S binding were similar to those used for RDE binding. Furthermore, the aptamer competed with RDE for binding to the Runt domain in vitro. These results demonstrated that the Runt domain of the AML1 protein binds to the motif of the aptamer that mimics DNA. Our findings should provide new insights into RNA function and utility in both basic and applied sciences.
The Runt domain of AML1 (RUNX1) binds a sequence-conserved RNA motif that mimics a DNA element
Fukunaga, Junichi; Nomura, Yusuke; Tanaka, Yoichiro; Amano, Ryo; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Sakamoto, Taiichi; Kozu, Tomoko
2013-01-01
AML1 (RUNX1) is a key transcription factor for hematopoiesis that binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. Aberrations in the AML1 gene are frequently found in human leukemia. To better understand AML1 and its potential utility for diagnosis and therapy, we obtained RNA aptamers that bind specifically to the AML1 Runt domain. Enzymatic probing and NMR analyses revealed that Apt1-S, which is a truncated variant of one of the aptamers, has a CACG tetraloop and two stem regions separated by an internal loop. All the isolated aptamers were found to contain the conserved sequence motif 5′-NNCCAC-3′ and 5′-GCGMGN′N′-3′ (M:A or C; N and N′ form Watson–Crick base pairs). The motif contains one AC mismatch and one base bulged out. Mutational analysis of Apt1-S showed that three guanines of the motif are important for Runt binding as are the three guanines of RDE, which are directly recognized by three arginine residues of the Runt domain. Mutational analyses of the Runt domain revealed that the amino acid residues used for Apt1-S binding were similar to those used for RDE binding. Furthermore, the aptamer competed with RDE for binding to the Runt domain in vitro. These results demonstrated that the Runt domain of the AML1 protein binds to the motif of the aptamer that mimics DNA. Our findings should provide new insights into RNA function and utility in both basic and applied sciences. PMID:23709277
Ouaray, Zahra; ElSawy, Karim M; Lane, David P; Essex, Jonathan W; Verma, Chandra
2016-10-01
Most p53 mutations associated with cancer are located in its DNA binding domain (DBD). Many structures (X-ray and NMR) of this domain are available in the protein data bank (PDB) and a vast conformational heterogeneity characterizes the various free and complexed states. The major difference between the apo and the holo-complexed states appears to lie in the L1 loop. In particular, the conformations of this loop appear to depend intimately on the sequence of DNA to which it binds. This conclusion builds upon recent observations that implicate the tetramerization and the C-terminal domains (respectively TD and Cter) in DNA binding specificity. Detailed PCA analysis of the most recent collection of DBD structures from the PDB has been carried out. In contrast to recommendations that small molecules/drugs stabilize the flexible L1 loop to rescue mutant p53, our study highlights a need to retain the flexibility of the p53 DNA binding surface (DBS). It is the adaptability of this region that enables p53 to engage in the diverse interactions responsible for its functionality. Proteins 2016; 84:1443-1461. © 2016 Wiley Periodicals, Inc.
Characterization of monomeric DNA-binding protein Histone H1 in Leishmania braziliensis.
Carmelo, Emma; González, Gloria; Cruz, Teresa; Osuna, Antonio; Hernández, Mariano; Valladares, Basilio
2011-08-01
Histone H1 in Leishmania presents relevant differences compared to higher eukaryote counterparts, such as the lack of a DNA-binding central globular domain. Despite that, it is apparently fully functional since its differential expression levels have been related to changes in chromatin condensation and infectivity, among other features. The localization and the aggregation state of L. braziliensis H1 have been determined by immunolocalization, mass spectrometry, cross-linking and electrophoretic mobility shift assays. Analysis of H1 sequences from the Leishmania Genome Database revealed that our protein is included in a very divergent group of histones H1 that is present only in L. braziliensis. An antibody raised against recombinant L. braziliensis H1 specifically recognized that protein by immunoblot in L. braziliensis extracts, but not in other Leishmania species, a consequence of the sequence divergences observed among Leishmania species. Mass spectrometry analysis and in vitro DNA-binding experiments have also proven that L. braziliensis H1 is monomeric in solution, but oligomerizes upon binding to DNA. Finally, despite the lack of a globular domain, L. braziliensis H1 is able to form complexes with DNA in vitro, with higher affinity for supercoiled compared to linear DNA.
Ha, Sung Chul; Choi, Jongkeun; Hwang, Hye-Yeon; Rich, Alexander; Kim, Yang-Gyun; Kim, Kyeong Kyu
2009-02-01
The Z-DNA conformation preferentially occurs at alternating purine-pyrimidine repeats, and is specifically recognized by Z alpha domains identified in several Z-DNA-binding proteins. The binding of Z alpha to foreign or chromosomal DNA in various sequence contexts is known to influence various biological functions, including the DNA-mediated innate immune response and transcriptional modulation of gene expression. For these reasons, understanding its binding mode and the conformational diversity of Z alpha bound Z-DNAs is of considerable importance. However, structural studies of Z alpha bound Z-DNA have been mostly limited to standard CG-repeat DNAs. Here, we have solved the crystal structures of three representative non-CG repeat DNAs, d(CACGTG)(2), d(CGTACG)(2) and d(CGGCCG)(2) complexed to hZ alpha(ADAR1) and compared those structures with that of hZ alpha(ADAR1)/d(CGCGCG)(2) and the Z alpha-free Z-DNAs. hZ alpha(ADAR1) bound to each of the three Z-DNAs showed a well conserved binding mode with very limited structural deviation irrespective of the DNA sequence, although varying numbers of residues were in contact with Z-DNA. Z-DNAs display less structural alterations in the Z alpha-bound state than in their free form, thereby suggesting that conformational diversities of Z-DNAs are restrained by the binding pocket of Z alpha. These data suggest that Z-DNAs are recognized by Z alpha through common conformational features regardless of the sequence and structural alterations.
In vivo binding of PRDM9 reveals interactions with noncanonical genomic sites
Grey, Corinne; Clément, Julie A.J.; Buard, Jérôme; Leblanc, Benjamin; Gut, Ivo; Gut, Marta; Duret, Laurent
2017-01-01
In mouse and human meiosis, DNA double-strand breaks (DSBs) initiate homologous recombination and occur at specific sites called hotspots. The localization of these sites is determined by the sequence-specific DNA binding domain of the PRDM9 histone methyl transferase. Here, we performed an extensive analysis of PRDM9 binding in mouse spermatocytes. Unexpectedly, we identified a noncanonical recruitment of PRDM9 to sites that lack recombination activity and the PRDM9 binding consensus motif. These sites include gene promoters, where PRDM9 is recruited in a DSB-dependent manner. Another subset reveals DSB-independent interactions between PRDM9 and genomic sites, such as the binding sites for the insulator protein CTCF. We propose that these DSB-independent sites result from interactions between hotspot-bound PRDM9 and genomic sequences located on the chromosome axis. PMID:28336543
Meinhardt, Sarah; Swint-Kruse, Liskin
2008-12-01
In protein families, conserved residues often contribute to a common general function, such as DNA-binding. However, unique attributes for each homolog (e.g. recognition of alternative DNA sequences) must arise from variation in other functionally-important positions. The locations of these "specificity determinant" positions are obscured amongst the background of varied residues that do not make significant contributions to either structure or function. To isolate specificity determinants, a number of bioinformatics algorithms have been developed. When applied to the LacI/GalR family of transcription regulators, several specificity determinants are predicted in the 18 amino acids that link the DNA-binding and regulatory domains. However, results from alternative algorithms are only in partial agreement with each other. Here, we experimentally evaluate these predictions using an engineered repressor comprising the LacI DNA-binding domain, the LacI linker, and the GalR regulatory domain (LLhG). "Wild-type" LLhG has altered DNA specificity and weaker lacO(1) repression compared to LacI or a similar LacI:PurR chimera. Next, predictions of linker specificity determinants were tested, using amino acid substitution and in vivo repression assays to assess functional change. In LLhG, all predicted sites are specificity determinants, as well as three sites not predicted by any algorithm. Strategies are suggested for diminishing the number of false negative predictions. Finally, individual substitutions at LLhG specificity determinants exhibited a broad range of functional changes that are not predicted by bioinformatics algorithms. Results suggest that some variants have altered affinity for DNA, some have altered allosteric response, and some appear to have changed specificity for alternative DNA ligands.
Wang, Qianqian; Li, Lanlan; Wang, Xiaoting; Liu, Huanxiang; Yao, Xiaojun
2014-11-01
The Z-DNA-binding domain of human double-stranded RNA adenosine deaminase I (hZαADAR1) can specifically recognize the left-handed Z-DNA which preferentially occurs at alternating purine-pyrimidine repeats, especially the CG-repeats. The interactions of hZαADAR1 and Z-DNAs in different sequence contexts can affect many important biological functions including gene regulation and chromatin remodeling. Therefore it is of great necessity to fully understand their recognition mechanisms. However, most existing studies are aimed at the standard CG-repeat Z-DNA rather than the non-CG-repeats, and whether the molecular basis of hZαADAR1 binding to various Z-DNAs is identical or not is still unclear on the atomic level. Here, based on the recently determined crystal structures of three representative non-CG-repeat Z-DNAs (d(CACGTG)2, d(CGTACG)2 and d(CGGCCG)2) in complex with hZαADAR1, 40 ns molecular dynamics simulation together with binding free energy calculation were performed for each system. For comparison, the standard CG-repeat Z-DNA (d(CGCGCG)2) complexed with hZαADAR1 was also simulated. The consistent results demonstrate that nonpolar interaction is the driving force during the protein-DNA binding process, and that polar interaction mainly from helix α3 also provides important contributions. Five common hot-spot residues were identified, namely Lys169, Lys170, Asn173, Arg174 and Tyr177. Hydrogen bond analysis coupled with surface charge distribution further reveals the interfacial information between hZαADAR1 and Z-DNA in detail. All of the analyses indicate that the four complexes share common key features and similar binding modes irrespective of Z-DNA sequence, suggesting that Z-DNA recognition by hZαADAR1 is conformation-specific rather than sequence-specific. Additionally, by analyzing the conformational changes of hZαADAR1, we found that the binding of Z-DNA could effectively stabilize hZαADAR1 protein. Our study can provide some valuable information for better understanding the binding mechanism between hZαADAR1 or even other Z-DNA-binding proteins and Z-DNA.
NASA Astrophysics Data System (ADS)
Knight, Jonathan D.; Li, Rong; Botchan, Michael
1991-04-01
The E2 transactivator protein of bovine papillomavirus binds its specific DNA target sequence as a dimer. We have found that E2 dimers, preformed in solution independent of DNA, exhibit substantial cooperativity of DNA binding as detected by both nitrocellulose filter retention and footprint analysis techniques. If the binding sites are widely spaced, E2 forms stable DNA loops visible by electron microscopy. When three widely separated binding sites reside on the DNA, E2 condenses the molecule into a bow-tie structure. This implies that each E2 dimer has at least two independent surfaces for multimerization. Two naturally occurring shorter forms of the protein, E2C and D8/E2, which function in vivo as repressors of transcription, do not form such loops. Thus, the looping function of E2 maps to the 161-amino acid activation domain. These results support the looping model of transcription activation by enhancers.
Cook, W B; Walker, J C
1992-01-01
A cDNA encoding a nuclear-encoded chloroplast nucleic acid-binding protein (NBP) has been isolated from maize. Identified as an in vitro DNA-binding activity, NBP belongs to a family of nuclear-encoded chloroplast proteins which share a common domain structure and are thought to be involved in posttranscriptional regulation of chloroplast gene expression. NBP contains an N-terminal chloroplast transit peptide, a highly acidic domain and a pair of ribonucleoprotein consensus sequence domains. NBP is expressed in a light-dependent, organ-specific manner which is consistent with its involvement in chloroplast biogenesis. The relationship of NBP to the other members of this protein family and their possible regulatory functions are discussed. PMID:1346929
Directing an artificial zinc finger protein to new targets by fusion to a non-DNA-binding domain.
Lim, Wooi F; Burdach, Jon; Funnell, Alister P W; Pearson, Richard C M; Quinlan, Kate G R; Crossley, Merlin
2016-04-20
Transcription factors are often regarded as having two separable components: a DNA-binding domain (DBD) and a functional domain (FD), with the DBD thought to determine target gene recognition. While this holds true for DNA binding in vitro, it appears that in vivo FDs can also influence genomic targeting. We fused the FD from the well-characterized transcription factor Krüppel-like Factor 3 (KLF3) to an artificial zinc finger (AZF) protein originally designed to target the Vascular Endothelial Growth Factor-A (VEGF-A) gene promoter. We compared genome-wide occupancy of the KLF3FD-AZF fusion to that observed with AZF. AZF bound to the VEGF-A promoter as predicted, but was also found to occupy approximately 25,000 other sites, a large number of which contained the expected AZF recognition sequence, GCTGGGGGC. Interestingly, addition of the KLF3 FD re-distributes the fusion protein to new sites, with total DNA occupancy detected at around 50,000 sites. A portion of these sites correspond to known KLF3-bound regions, while others contained sequences similar but not identical to the expected AZF recognition sequence. These results show that FDs can influence and may be useful in directing AZF DNA-binding proteins to specific targets and provide insights into how natural transcription factors operate. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
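The distinction between sites carrying the expected AZF recognition sequence and sites that are "similar but not identical" can be illustrated with a mismatch-tolerant scan. This is only a sketch under stated assumptions: similarity is measured here as Hamming distance on one strand, the threshold of one mismatch is arbitrary, and `near_matches` is a hypothetical helper, not the analysis used in the study.

```python
AZF_SITE = "GCTGGGGGC"  # expected AZF recognition sequence quoted in the abstract


def hamming(a, b):
    """Count mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def near_matches(seq, site=AZF_SITE, max_mismatches=1):
    """Return (start, window, mismatches) for every window within the threshold.

    Only the given strand is scanned; the mismatch threshold is an
    illustrative assumption rather than a value from the source.
    """
    seq = seq.upper()
    results = []
    for start in range(len(seq) - len(site) + 1):
        window = seq[start:start + len(site)]
        d = hamming(window, site)
        if d <= max_mismatches:
            results.append((start, window, d))
    return results


if __name__ == "__main__":
    print(near_matches("AAGCTGGGGGCTT"))
```

Windows returned with zero mismatches correspond to the expected sequence; those with one or more mismatches are candidates for the "similar but not identical" class.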
Chang, P K; Ehrlich, K C; Yu, J; Bhatnagar, D; Cleveland, T E
1995-01-01
The aflR gene from Aspergillus parasiticus and Aspergillus flavus may be involved in the regulation of aflatoxin biosynthesis. The aflR gene product, AFLR, possesses a GAL4-type binuclear zinc finger DNA-binding domain. A transformant, SU1-N3 (pHSP), containing an additional copy of aflR, showed increased transcription of aflR and the aflatoxin pathway structural genes, nor-1, ver-1, and omt-1, when cells were grown in nitrate medium, which normally suppresses aflatoxin production. Electrophoretic mobility shift assays showed that the recombinant protein containing the DNA-binding domain, AFLR1, bound specifically to the palindromic sequence, TTAGGCCTAA, 120 bp upstream of the AFLR translation start site. Expression of aflR thus appears to be autoregulated. Increased expression of aflatoxin biosynthetic genes in the transformant might result from an elevated basal level of AFLR, allowing it to overcome nitrate inhibition and to bind to the aflR promoter region, thereby initiating aflatoxin biosynthesis. Results further suggest that aflR is involved in the regulation of multiple parts of the aflatoxin biosynthetic pathway. PMID:7793958
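A DNA site is palindromic in the molecular-biology sense when it reads the same as its own reverse complement. The short sketch below checks that property for the site quoted in the abstract; the helper names are hypothetical and chosen only for illustration.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_palindromic(site):
    """True if the site equals its own reverse complement."""
    return site.upper() == reverse_complement(site)


if __name__ == "__main__":
    # Site reported for AFLR1 binding in the abstract above.
    print(is_palindromic("TTAGGCCTAA"))  # True
```

By this definition a palindromic site presents the same sequence on both strands, which is consistent with recognition by a symmetric dimer, although the abstract itself does not state the binding stoichiometry.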
The FOXP2 forkhead domain binds to a variety of DNA sequences with different rates and affinities.
Webb, Helen; Steeb, Olga; Blane, Ashleigh; Rotherham, Lia; Aron, Shaun; Machanick, Philip; Dirr, Heini; Fanucchi, Sylvia
2017-07-01
FOXP2 is a member of the P subfamily of FOX transcription factors, the DNA-binding domain of which is the winged helix forkhead domain (FHD). In this work we show that the FOXP2 FHD is able to bind to various DNA sequences, including a novel sequence identified in this work, with different affinities and rates as detected using surface plasmon resonance. Combining the experimental work with molecular docking, we show that high-affinity sequences remain bound to the protein for longer, form a greater number of interactions with the protein and induce a greater structural change in the protein than low-affinity sequences. We propose a binding model for the FOXP2 FHD that involves three types of binding sequence: low affinity sites which allow for rapid scanning of the genome by the protein in a partially unstructured state; moderate affinity sites which serve to locate the protein near target sites and high-affinity sites which secure the protein to the DNA and induce a conformational change necessary for functional binding and the possible initiation of downstream transcriptional events. © The Authors 2017. Published by Oxford University Press on behalf of the Japanese Biochemical Society. All rights reserved.
Lerner, D R; Raikhel, N V
1992-06-05
Chitin-binding proteins are present in a wide range of plant species, including both monocots and dicots, even though these plants contain no chitin. To investigate the relationship between in vitro antifungal and insecticidal activities of chitin-binding proteins and their unknown endogenous functions, the stinging nettle lectin (Urtica dioica agglutinin, UDA) cDNA was cloned using a synthetic gene as the probe. The nettle lectin cDNA clone contained an open reading frame encoding 374 amino acids. Analysis of the deduced amino acid sequence revealed a 21-amino acid putative signal sequence and the 86 amino acids encoding the two chitin-binding domains of nettle lectin. These domains were fused to a 19-amino acid "spacer" domain and a 244-amino acid carboxyl extension with partial identity to a chitinase catalytic domain. The authenticity of the cDNA clone was confirmed by deduced amino acid sequence identity with sequence data obtained from tryptic digests, RNA gel blot, and polymerase chain reaction analyses. RNA gel blot analysis also showed the nettle lectin message was present primarily in rhizomes and inflorescence (with immature seeds) but not in leaves or stems. Chitinase enzymatic activity was found when the chitinase-like domain alone or the chitinase-like domain with the chitin-binding domains were expressed in Escherichia coli. This is the first example of a chitin-binding protein with both a duplication of the 43-amino acid chitin-binding domain and a fusion of the chitin-binding domains to a structurally unrelated domain, the chitinase domain.
Moody, Colleen L; Tretyachenko-Ladokhina, Vira; Laue, Thomas M; Senear, Donald F; Cocco, Melanie J
2011-08-09
The cytidine repressor (CytR) is a member of the LacR family of bacterial repressors with distinct functional features. The Escherichia coli CytR regulon comprises nine operons whose palindromic operators vary in both sequence and, most significantly, spacing between the recognition half-sites. This suggests a strong likelihood that protein folding would be coupled to DNA binding as a mechanism to accommodate the variety of different operator architectures to which CytR is targeted. Such coupling is a common feature of sequence-specific DNA-binding proteins, including the LacR family repressors; however, there are no significant structural rearrangements upon DNA binding within the three-helix DNA-binding domains (DBDs) studied to date. We used nuclear magnetic resonance (NMR) spectroscopy to characterize the CytR DBD free in solution and to determine the high-resolution structure of a CytR DBD monomer bound specifically to one DNA half-site of the uridine phosphorylase (udp) operator. We find that the free DBD populates multiple distinct conformations distinguished by up to four sets of NMR peaks per residue. This structural heterogeneity is previously unknown in the LacR family. These stable structures coalesce into a single, more stable udp-bound form that features a three-helix bundle containing a canonical helix-turn-helix motif. However, this structure differs from all other LacR family members whose structures are known with regard to the packing of the helices and consequently their relative orientations. Aspects of CytR activity are unique among repressors; we identify here structural properties that are also distinct and that might underlie the different functional properties. © 2011 American Chemical Society
Oda, Masako; Kanoh, Yutaka; Watanabe, Yoshihisa; Masai, Hisao
2012-01-01
Background: Replication timing of metazoan DNA during S-phase may be determined by many factors including chromosome structures, nuclear positioning, patterns of histone modifications, and transcriptional activity. It may be determined by Mb-domain structures, termed “replication domains”, and recent findings indicate that replication timing is under developmental and cell type-specific regulation. Methodology/Principal Findings: We examined replication timing on the human 5q23/31 3.5-Mb segment in T cells and non-T cells. We used two independent methods to determine replication timing. One is quantification of nascent replicating DNA in cell cycle-fractionated stage-specific S phase populations. The other is FISH analysis of replication foci. Although the locations of early- and late-replicating domains were common between the two cell lines, the timing transition region (TTR) between early and late domains was offset by 200 kb. We show that Special AT-rich sequence Binding protein 1 (SATB1), specifically expressed in T-cells, binds to the early domain immediately adjacent to the TTR and delays the replication timing of the TTR. Measurement of the chromosome copy number along the TTR during synchronized S phase suggests that the fork movement may be slowed down by SATB1. Conclusions: Our results reveal a novel role of SATB1 in cell type-specific regulation of replication timing along the chromosome. PMID:22879953
Two high-mobility group box domains act together to underwind and kink DNA
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sánchez-Giraldo, R.; Acosta-Reyes, F. J.; Malarkey, C. S.
The crystal structure of HMGB1 box A bound to an unmodified AT-rich DNA fragment is reported at a resolution of 2 Å. A new mode of DNA recognition for HMG box proteins is found in which two box A domains bind in an unusual configuration generating a highly kinked DNA structure. High-mobility group protein 1 (HMGB1) is an essential and ubiquitous DNA architectural factor that influences a myriad of cellular processes. HMGB1 contains two DNA-binding domains, box A and box B, which have little sequence specificity but have remarkable abilities to underwind and bend DNA. Although HMGB1 box A is thought to be responsible for the majority of HMGB1–DNA interactions with pre-bent or kinked DNA, little is known about how it recognizes unmodified DNA. Here, the crystal structure of HMGB1 box A bound to an AT-rich DNA fragment is reported at a resolution of 2 Å. Two box A domains of HMGB1 collaborate in an unusual configuration in which the Phe37 residues of both domains stack together and intercalate the same CG base pair, generating highly kinked DNA. This represents a novel mode of DNA recognition for HMGB proteins and reveals a mechanism by which structure-specific HMG boxes kink linear DNA.
Directed evolution of the TALE N-terminal domain for recognition of all 5' bases.
Lamb, Brian M; Mercer, Andrew C; Barbas, Carlos F
2013-11-01
Transcription activator-like effector (TALE) proteins can be designed to bind virtually any DNA sequence. General guidelines for design of TALE DNA-binding domains suggest that the 5'-most base of the DNA sequence bound by the TALE (the N0 base) should be a thymine. We quantified the N0 requirement by analysis of the activities of TALE transcription factors (TALE-TF), TALE recombinases (TALE-R) and TALE nucleases (TALENs) with each DNA base at this position. In the absence of a 5' T, we observed decreases in TALE activity up to >1000-fold in TALE-TF activity, up to 100-fold in TALE-R activity and up to 10-fold reduction in TALEN activity compared with target sequences containing a 5' T. To develop TALE architectures that recognize all possible N0 bases, we used structure-guided library design coupled with TALE-R activity selections to evolve novel TALE N-terminal domains to accommodate any N0 base. A G-selective domain and broadly reactive domains were isolated and characterized. The engineered TALE domains selected in the TALE-R format demonstrated modularity and were active in TALE-TF and TALEN architectures. Evolved N-terminal domains provide effective and unconstrained TALE-based targeting of any DNA sequence as TALE binding proteins and designer enzymes.
Molecular mechanisms of floral organ specification by MADS domain proteins.
Yan, Wenhao; Chen, Dijun; Kaufmann, Kerstin
2016-02-01
Flower development is a model system to understand organ specification in plants. The identities of different types of floral organs are specified by homeotic MADS transcription factors that interact in a combinatorial fashion. Systematic identification of DNA-binding sites and target genes of these key regulators show that they have shared and unique sets of target genes. DNA binding by MADS proteins is not based on 'simple' recognition of a specific DNA sequence, but depends on DNA structure and combinatorial interactions. Homeotic MADS proteins regulate gene expression via alternative mechanisms, one of which may be to modulate chromatin structure and accessibility in their target gene promoters. Copyright © 2015 Elsevier Ltd. All rights reserved.
Custom-Designed Molecular Scissors for Site-Specific Manipulation of the Plant and Mammalian Genomes
NASA Astrophysics Data System (ADS)
Kandavelou, Karthikeyan; Chandrasegaran, Srinivasan
Zinc finger nucleases (ZFNs) are custom-designed molecular scissors, engineered to cut at specific DNA sequences. ZFNs combine the zinc finger proteins (ZFPs) with the nonspecific cleavage domain of the FokI restriction enzyme. The DNA-binding specificity of ZFNs can be easily altered experimentally. This easy manipulation of the ZFN recognition specificity enables one to deliver a targeted double-strand break (DSB) to a genome. The targeted DSB stimulates local gene targeting by several orders of magnitude at that specific cut site via homologous recombination (HR). Thus, ZFNs have become an important experimental tool to make site-specific and permanent alterations to genomes of not only plants and mammals but also of many other organisms. Engineering of custom ZFNs involves many steps. The first step is to identify a ZFN site at or near the chosen chromosomal target within the genome to which ZFNs will bind and cut. The second step is to design and/or select various ZFP combinations that will bind to the chosen target site with high specificity and affinity. The DNA coding sequences for the designed ZFPs are then assembled by polymerase chain reaction (PCR) using oligonucleotides. The third step is to fuse the ZFP constructs to the FokI cleavage domain. The ZFNs are then expressed as proteins by using the rabbit reticulocyte in vitro transcription/translation system, and the protein products are assayed for their DNA cleavage specificity.
Lin, Jiangguo; Countryman, Preston; Buncher, Noah; Kaur, Parminder; E, Longjiang; Zhang, Yiyun; Gibson, Greg; You, Changjiang; Watkins, Simon C; Piehler, Jacob; Opresko, Patricia L; Kad, Neil M; Wang, Hong
2014-02-01
Human telomeres are maintained by the shelterin protein complex in which TRF1 and TRF2 bind directly to duplex telomeric DNA. How these proteins find telomeric sequences among a genome of billions of base pairs and how they find protein partners to form the shelterin complex remains uncertain. Using single-molecule fluorescence imaging of quantum dot-labeled TRF1 and TRF2, we study how these proteins locate TTAGGG repeats on DNA tightropes. By virtue of its basic domain TRF2 performs an extensive 1D search on nontelomeric DNA, whereas TRF1's 1D search is limited. Unlike the stable and static associations observed for other proteins at specific binding sites, TRF proteins possess reduced binding stability marked by transient binding (∼ 9-17 s) and slow 1D diffusion on specific telomeric regions. These slow diffusion constants yield activation energy barriers to sliding ∼ 2.8-3.6 κ(B)T greater than those for nontelomeric DNA. We propose that the TRF proteins use 1D sliding to find protein partners and assemble the shelterin complex, which in turn stabilizes the interaction with specific telomeric DNA. This 'tag-team proofreading' represents a more general mechanism to ensure a specific set of proteins interact with each other on long repetitive specific DNA sequences without requiring external energy sources.
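To give a feel for what an extra sliding barrier of about 2.8 to 3.6 thermal-energy units means, the sketch below converts a barrier difference into a relative slowdown of 1D diffusion. It assumes a simple Arrhenius-type dependence, D proportional to exp(-E/k_BT), which is a common model for sliding but is an assumption here; the abstract does not spell out the relation used, and `slowdown_factor` is a hypothetical name.

```python
import math


def slowdown_factor(extra_barrier_kbt):
    """Ratio D_nontelomeric / D_telomeric for a barrier difference given in k_B*T.

    Assumes D is proportional to exp(-E / (k_B * T)), so a barrier that is
    higher by x thermal-energy units slows diffusion by a factor of exp(x).
    """
    return math.exp(extra_barrier_kbt)


if __name__ == "__main__":
    for barrier in (2.8, 3.6):
        print(f"{barrier} k_BT -> about {slowdown_factor(barrier):.1f}-fold slower")
```

Under this assumption, the reported range corresponds to diffusion on telomeric regions that is roughly 16- to 37-fold slower than on nontelomeric DNA.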
Abe, Yoshito; Fujisaki, Naoki; Miyoshi, Takanori; Watanabe, Noriko; Katayama, Tsutomu; Ueda, Tadashi
2016-01-01
DnaAcos, a mutant of the initiator DnaA, causes overinitiation of chromosome replication in Escherichia coli, resulting in inhibition of cell division. CedA was found to be a multi-copy suppressor which represses the dnaAcos inhibition of cell division. However, the functional mechanism of CedA remains elusive except for previously indicated possibilities of binding to DNA and RNA polymerase. In this study, we searched for the specific sites of CedA involved in binding to DNA and RNA polymerase and in repression of cell division inhibition. First, the DNA sequence to which CedA preferentially binds was determined. Next, several residues and the β4 region in the CedA C-terminal domain were suggested to specifically interact with the DNA. Moreover, we found that the flexible N-terminal region was required for tight binding to longer DNA as well as interaction with RNA polymerase. Based on these results, several cedA mutants were examined for their ability to repress dnaAcos cell division inhibition. We found that the N-terminal region was dispensable and that Glu32 in the C-terminal domain was required for the repression. These results suggest that CedA has multiple roles and that residues with different functions are positioned in the two regions. PMID:26400504
Morellet, Nelly; Li, Xianghong; Wieninger, Silke A; Taylor, Jennifer L; Bischerour, Julien; Moriau, Séverine; Lescop, Ewen; Bardiaux, Benjamin; Mathy, Nathalie; Assrir, Nadine; Bétermier, Mireille; Nilges, Michael; Hickman, Alison B; Dyda, Fred; Craig, Nancy L; Guittet, Eric
2018-01-01
The piggyBac transposase (PB) is distinguished by its activity and utility in genome engineering, especially in humans where it has highly promising therapeutic potential. Little is known, however, about the structure–function relationships of the different domains of PB. Here, we demonstrate in vitro and in vivo that its C-terminal Cysteine-Rich Domain (CRD) is essential for DNA breakage, joining and transposition and that it binds to specific DNA sequences in the left and right transposon ends, and to an additional unexpectedly internal site at the left end. Using NMR, we show that the CRD adopts the specific fold of the cross-brace zinc finger protein family. We determine the interaction interfaces between the CRD and its target, the 5′-TGCGT-3′/3′-ACGCA-5′ motifs found in the left, left internal and right transposon ends, and use NMR results to propose docking models for the complex, which are consistent with our site-directed mutagenesis data. Our results provide support for a model of the PB/DNA interactions in the context of the transpososome, which will be useful for the rational design of PB mutants with increased activity. PMID:29385532
Paull, T T; Cortez, D; Bowers, B; Elledge, S J; Gellert, M
2001-05-22
The tumor suppressor Brca1 plays an important role in protecting mammalian cells against genomic instability, but little is known about its modes of action. In this work we demonstrate that recombinant human Brca1 protein binds strongly to DNA, an activity conferred by a domain in the center of the Brca1 polypeptide. As a result of this binding, Brca1 inhibits the nucleolytic activities of the Mre11/Rad50/Nbs1 complex, an enzyme implicated in numerous aspects of double-strand break repair. Brca1 displays a preference for branched DNA structures and forms protein-DNA complexes cooperatively between multiple DNA strands, but without DNA sequence specificity. This fundamental property of Brca1 may be an important part of its role in DNA repair and transcription.
Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities
Narasimhan, Kamesh; Lambert, Samuel A; Yang, Ally WH; Riddell, Jeremy; Mnaimneh, Sanie; Zheng, Hong; Albu, Mihai; Najafabadi, Hamed S; Reece-Hoyes, John S; Fuxman Bass, Juan I; Walhout, Albertha JM; Weirauch, Matthew T; Hughes, Timothy R
2015-01-01
Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs. DOI: http://dx.doi.org/10.7554/eLife.06967.001 PMID:25905672
A close relative of the nuclear, chromosomal high-mobility group protein HMG1 in yeast mitochondria.
Diffley, J F; Stillman, B
1991-01-01
ABF2 (ARS-binding factor 2), a small, basic DNA-binding protein that binds specifically to the autonomously replicating sequence ARS1, is located primarily in the mitochondria of the yeast Saccharomyces cerevisiae. The abundance of ABF2 and the phenotype of abf2- null mutants argue that this protein plays a key role in the structure, maintenance, and expression of the yeast mitochondrial genome. The predicted amino acid sequence of ABF2 is closely related to the high-mobility group proteins HMG1 and HMG2 from vertebrate cell nuclei and to several other DNA-binding proteins. Additionally, ABF2 and the other HMG-related proteins are related to a globular domain from the heat shock protein hsp70 family. ABF2 interacts with DNA both nonspecifically and in a specific manner within regulatory regions, suggesting a mechanism whereby it may aid in compacting the mitochondrial genome without interfering with expression. PMID:1881919
TALE-PvuII fusion proteins--novel tools for gene targeting.
Yanik, Mert; Alzubi, Jamal; Lahaye, Thomas; Cathomen, Toni; Pingoud, Alfred; Wende, Wolfgang
2013-01-01
Zinc finger nucleases (ZFNs) consist of zinc fingers as DNA-binding module and the non-specific DNA-cleavage domain of the restriction endonuclease FokI as DNA-cleavage module. This architecture is also used by TALE nucleases (TALENs), in which the DNA-binding modules of the ZFNs have been replaced by DNA-binding domains based on transcription activator like effector (TALE) proteins. Both TALENs and ZFNs are programmable nucleases which rely on the dimerization of FokI to induce double-strand DNA cleavage at the target site after recognition of the target DNA by the respective DNA-binding module. TALENs seem to have an advantage over ZFNs, as the assembly of TALE proteins is easier than that of ZFNs. Here, we present evidence that variant TALENs can be produced by replacing the catalytic domain of FokI with the restriction endonuclease PvuII. These fusion proteins recognize only the composite recognition site consisting of the target site of the TALE protein and the PvuII recognition sequence (addressed site), but not isolated TALE or PvuII recognition sites (unaddressed sites), even at high excess of protein over DNA and long incubation times. In vitro, their preference for an addressed over an unaddressed site is > 34,000-fold. Moreover, TALE-PvuII fusion proteins are active in cellula with minimal cytotoxicity.
Directed evolution of the TALE N-terminal domain for recognition of all 5′ bases
Lamb, Brian M.; Mercer, Andrew C.; Barbas, Carlos F.
2013-01-01
Transcription activator-like effector (TALE) proteins can be designed to bind virtually any DNA sequence. General guidelines for design of TALE DNA-binding domains suggest that the 5′-most base of the DNA sequence bound by the TALE (the N0 base) should be a thymine. We quantified the N0 requirement by analysis of the activities of TALE transcription factors (TALE-TF), TALE recombinases (TALE-R) and TALE nucleases (TALENs) with each DNA base at this position. In the absence of a 5′ T, we observed decreases in TALE activity up to >1000-fold in TALE-TF activity, up to 100-fold in TALE-R activity and up to 10-fold reduction in TALEN activity compared with target sequences containing a 5′ T. To develop TALE architectures that recognize all possible N0 bases, we used structure-guided library design coupled with TALE-R activity selections to evolve novel TALE N-terminal domains to accommodate any N0 base. A G-selective domain and broadly reactive domains were isolated and characterized. The engineered TALE domains selected in the TALE-R format demonstrated modularity and were active in TALE-TF and TALEN architectures. Evolved N-terminal domains provide effective and unconstrained TALE-based targeting of any DNA sequence as TALE binding proteins and designer enzymes. PMID:23980031
Prakash, Aishwarya; Natarajan, Amarnath; Marky, Luis A.; Ouellette, Michel M.; Borgstahl, Gloria E. O.
2011-01-01
Replication protein A (RPA), a key player in DNA metabolism, has 6 single-stranded DNA-(ssDNA-) binding domains (DBDs) A-F. SELEX experiments with the DBDs-C, -D, and -E retrieve a 20-nt G-quadruplex forming sequence. Binding studies show that RPA-DE binds preferentially to the G-quadruplex DNA, a unique preference not observed with other RPA constructs. Circular dichroism experiments show that RPA-CDE-core can unfold the G-quadruplex while RPA-DE stabilizes it. Binding studies show that RPA-C binds pyrimidine- and purine-rich sequences similarly. This difference between RPA-C and RPA-DE binding was also indicated by the inability of RPA-CDE-core to unfold an oligonucleotide containing a TC-region 5′ to the G-quadruplex. Molecular modeling studies of RPA-DE and telomere-binding proteins Pot1 and Stn1 reveal structural similarities between the proteins and illuminate potential DNA-binding sites for RPA-DE and Stn1. These data indicate that DBDs of RPA have different ssDNA recognition properties. PMID:21772997
Specific Inhibition of the transcription factor Ci by a Cobalt(III)-Schiff base-DNA conjugate
Hurtado, Ryan R.; Harney, Allison S.; Heffern, Marie C.; Holbrook, Robert J.; Holmgren, Robert A.; Meade, Thomas J.
2012-01-01
We describe the use of Co(III) Schiff base-DNA conjugates, a versatile class of research tools that target C2H2 transcription factors, to inhibit the Hedgehog (Hh) pathway. In developing mammalian embryos, Hh signaling is critical for the formation and development of many tissues and organs. Inappropriate activation of the Hedgehog (Hh) pathway has been implicated in a variety of cancers including medulloblastomas and basal cell carcinomas. It is well known that Hh regulates the activity of the Gli family of C2H2 zinc finger transcription factors in mammals. In Drosophila the function of the Gli proteins is performed by a single transcription factor with an identical DNA binding consensus sequence, Cubitus Interruptus (Ci). We have demonstrated previously that conjugation of a specific 17 base-pair oligonucleotide to a Co(III) Schiff base complex results in a targeted inhibitor of the Snail family C2H2 zinc finger transcription factors. Modification of the oligonucleotide sequence in the Co(III) Schiff base-DNA conjugate to that of Ci’s consensus sequence (Co(III)-Ci) generates an equally selective inhibitor of Ci. Co(III)-Ci irreversibly binds the Ci zinc finger domain and prevents it from binding DNA in vitro. In a Ci responsive tissue culture reporter gene assay, Co(III)-Ci reduces the transcriptional activity of Ci in a concentration dependent manner. In addition, injection of wild-type Drosophila embryos with Co(III)-Ci phenocopies a Ci loss of function phenotype, demonstrating effectiveness in vivo. This study provides evidence that Co(III) Schiff base-DNA conjugates are a versatile class of specific and potent tools for studying zinc finger domain proteins and have potential applications as customizable anti-cancer therapeutics. PMID:22214326
2000-08-01
4). Sequence recognition of all four DNA bases is achieved by positioning an N-methylimidazole opposite guanine or N-methylpyrrole opposite...unique sequences of DNA based upon selective binding motifs to all four DNA bases, although relatively little is known about the ability of these agents to
Mitsuda, Nobutaka; Hisabori, Toru; Takeyasu, Kunio; Sato, Masa H
2004-07-01
A 38-bp pollen-specific cis-acting region of the AVP1 gene is involved in the expression of the Arabidopsis thaliana V-PPase during pollen development. Here, we report the isolation and structural characterization of AtVOZ1 and AtVOZ2, novel transcription factors that bind to the 38-bp cis-acting region of the A. thaliana V-PPase gene, AVP1. AtVOZ1 and AtVOZ2 show 53% amino acid sequence similarity. Homologs of AtVOZ1 and AtVOZ2 are found in various vascular plants as well as a moss, Physcomitrella patens. Promoter-beta-glucuronidase reporter analysis shows that AtVOZ1 is specifically expressed in the phloem tissue and AtVOZ2 is strongly expressed in the root. In vivo transient effector-reporter analysis in A. thaliana suspension-cultured cells demonstrates that AtVOZ1 and AtVOZ2 function as transcriptional activators in the Arabidopsis cell. Two conserved regions termed Domain-A and Domain-B were identified from an alignment of AtVOZ proteins and their homologs from O. sativa and P. patens. AtVOZ2 binds as a dimer to the specific palindromic sequence, GCGTNx7ACGC, with Domain-B, which is comprised of a functional novel zinc coordinating motif and a conserved basic region. Domain-B is shown to function as both the DNA-binding and the dimerization domain of AtVOZ2. From its highly conserved nature among all identified VOZ proteins, we conclude that Domain-B is responsible for the DNA binding and dimerization of all VOZ-family proteins and designate it as the VOZ-domain.
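The palindromic site GCGTNx7ACGC can be read as the half-site GCGT, seven unspecified bases, and the half-site ACGC. The following Python sketch shows one way such sites could be located in a sequence. It assumes the spacer may be any of A, C, G or T; the helper names `find_voz_sites` and `reverse_complement` and the example sequence are hypothetical and are not taken from the study.

```python
import re

# GCGT + 7 arbitrary bases + ACGC. The lookahead lets overlapping sites be reported.
VOZ_SITE = re.compile(r"(?=(GCGT[ACGT]{7}ACGC))")


def find_voz_sites(seq):
    """Return (0-based start, matched site) for each GCGTN7ACGC occurrence."""
    seq = seq.upper()
    return [(m.start(), m.group(1)) for m in VOZ_SITE.finditer(seq)]


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


if __name__ == "__main__":
    example = "ttGCGTAAATTTAACGCgg"  # made-up sequence for illustration
    print(find_voz_sites(example))
    # The two half-sites are reverse complements of each other,
    # so a site on one strand is also a site on the other strand.
    print(reverse_complement("GCGT"))
```

Because GCGT and ACGC are reverse complements, scanning one strand is sufficient under these assumptions.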
Park, Chin-Ju; Lee, Joon-Hwa; Choi, Byong-Seok
2005-01-01
Replication protein A (RPA) is a three-subunit complex with multiple roles in DNA metabolism. DNA-binding domain A in the large subunit of human RPA (hRPA70A) binds to single-stranded DNA (ssDNA) and is responsible for the species-specific RPA–T antigen (T-ag) interaction required for Simian virus 40 replication. Although Saccharomyces cerevisiae RPA70A (scRPA70A) shares high sequence homology with hRPA70A, the two are not functionally equivalent. To elucidate the similarities and differences between these two homologous proteins, we determined the solution structure of scRPA70A, which closely resembled the structure of hRPA70A. The structure of ssDNA-bound scRPA70A, as simulated by residual dipolar coupling-based homology modeling, suggested that the positioning of the ssDNA is the same for scRPA70A and hRPA70A, although the conformational changes that occur in the two proteins upon ssDNA binding are not identical. NMR titrations of hRPA70A with T-ag showed that the T-ag binding surface is separate from the ssDNA-binding region and is more neutral than the corresponding part of scRPA70A. These differences might account for the species-specific nature of the hRPA70A–T-ag interaction. Our results provide insight into how these two homologous RPA proteins can exhibit functional differences, but still both retain their ability to bind ssDNA. PMID:16043636
A conserved mechanism for replication origin recognition and binding in archaea.
Majerník, Alan I; Chong, James P J
2008-01-15
To date, methanogens are the only group within the archaea where firing DNA replication origins have not been demonstrated in vivo. In the present study we show that a previously identified cluster of ORB (origin recognition box) sequences do indeed function as an origin of replication in vivo in the archaeon Methanothermobacter thermautotrophicus. Although the consensus sequence of ORBs in M. thermautotrophicus is somewhat conserved when compared with ORB sequences in other archaea, the Cdc6-1 protein from M. thermautotrophicus (termed MthCdc6-1) displays sequence-specific binding that is selective for the MthORB sequence and does not recognize ORBs from other archaeal species. Stabilization of in vitro MthORB DNA binding by MthCdc6-1 requires additional conserved sequences 3' to those originally described for M. thermautotrophicus. By testing synthetic sequences bearing mutations in the MthORB consensus sequence, we show that Cdc6/ORB binding is critically dependent on the presence of an invariant guanine found in all archaeal ORB sequences. Mutation of a universally conserved arginine residue in the recognition helix of the winged helix domain of archaeal Cdc6-1 shows that specific origin sequence recognition is dependent on the interaction of this arginine residue with the invariant guanine. Recognition of a mutated origin sequence can be achieved by mutation of the conserved arginine residue to a lysine or glutamine residue. Thus despite a number of differences in protein and DNA sequences between species, the mechanism of origin recognition and binding appears to be conserved throughout the archaea.
Mukherjee, Koel; Pandey, Dev Mani; Vidyarthi, Ambarish Saran
2015-09-01
Access to sequence and structure information for telomere-binding proteins helps in understanding the essential biological processes involved in conserved sequence-specific interactions between DNA and these proteins. Rice telomere-binding protein (RTBP1) and Nicotiana glutinosa telomere repeat binding factor (NgTRF1) are helix-turn-helix motif proteins that play a role in telomeric DNA protection and length regulation. Both proteins share the same type of domain, but until now there have been very few reports of in silico studies of the complete proteins. Here we present a comparative study of the two proteins through modeling of the complete proteins, physicochemical characterization, MD simulation and DNA-protein docking. I-TASSER and CLC Protein Workbench were used to obtain the 3D structures of the proteins as well as the parameters used to characterize them. MD simulation was performed with the GROMOS force field in GROMACS for 10 ns. The simulated 3D structures were docked with template DNA (3D DNA modeled with 3D-DART) containing the conserved TTTAGGG sequence motif using the HADDOCK web server. The analysis revealed that around 120 amino acids in the tail region show good sequence similarity between the proteins. Molecular modeling, sequence characterization and secondary structure prediction also indicate similarity between the proteins' structures and sequences. The MD simulation results, covering RMSD, RMSF, Rg, PCA and energy plots, likewise indicate similar motional behavior. The best complex for each protein in the docking results also points to the first interaction site, which lies mainly in the helix 3 region of the DNA-binding domain. The overall computational analysis reveals that RTBP1 and NgTRF1 display considerable similarity in their physicochemical properties, structure, dynamics and binding mode.
Marzo, Mar; Liu, Danxu; Ruiz, Alfredo; Chalmers, Ronald
2013-01-01
Galileo is a DNA transposon responsible for the generation of several chromosomal inversions in Drosophila. In contrast to other members of the P-element superfamily, it has unusually long terminal inverted-repeats (TIRs) that resemble those of Foldback elements. To investigate the function of the long TIRs we derived consensus and ancestral sequences for the Galileo transposase in three species of Drosophilids. Following gene synthesis, we expressed and purified their constituent THAP domains and tested their binding activity towards the respective Galileo TIRs. DNase I footprinting located the most proximal DNA binding site about 70 bp from the transposon end. Using this sequence we identified further binding sites in the tandem repeats that are found within the long TIRs. This suggests that the synaptic complex between Galileo ends may be a complicated structure containing higher-order multimers of the transposase. We also attempted to reconstitute Galileo transposition in Drosophila embryos but no events were detected. Thus, although the limited numbers of Galileo copies in each genome were sufficient to provide functional consensus sequences for the THAP domains, they do not specify a fully active transposase. Since the THAP recognition sequence is short, and will occur many times in a large genome, it seems likely that the multiple binding sites within the long, internally repetitive, TIRs of Galileo and other Foldback-like elements may provide the transposase with its binding specificity. PMID:23648487
Ciolkowski, Ingo; Wanke, Dierk; Birkenbihl, Rainer P; Somssich, Imre E
2008-09-01
WRKY transcription factors have been shown to play a major role in regulating, both positively and negatively, the plant defense transcriptome. Nearly all studied WRKY factors appear to have a stereotypic binding preference to one DNA element termed the W-box. How specificity for certain promoters is accomplished therefore remains completely unknown. In this study, we tested five distinct Arabidopsis WRKY transcription factor subfamily members for their DNA binding selectivity towards variants of the W-box embedded in neighboring DNA sequences. These studies revealed for the first time differences in their binding site preferences, which are partly dependent on additional adjacent DNA sequences outside of the TTGACY-core motif. A consensus WRKY binding site derived from these studies was used for in silico analysis to identify potential target genes within the Arabidopsis genome. Furthermore, we show that even subtle amino acid substitutions within the DNA binding region of AtWRKY11 strongly impinge on its binding activity. Additionally, all five factors were found localized exclusively to the plant cell nucleus and to be capable of trans-activating expression of a reporter gene construct in vivo.
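The TTGACY core motif uses the IUPAC code Y for a pyrimidine (C or T), so it covers TTGACC and TTGACT. As a rough illustration of how such core motifs could be located on both strands of a promoter sequence, the Python sketch below scans a sequence and its reverse complement. The function names and example sequences are hypothetical, and the sketch ignores the flanking-sequence preferences that the study reports.

```python
import re

# TTGACY with Y = C or T. The lookahead allows overlapping hits.
W_BOX = re.compile(r"(?=(TTGAC[CT]))")
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_w_boxes(seq):
    """Return sorted (start, strand, site) tuples.

    start is the 0-based leftmost position of the hit on the forward strand.
    """
    seq = seq.upper()
    n = len(seq)
    hits = [(m.start(), "+", m.group(1)) for m in W_BOX.finditer(seq)]
    for m in W_BOX.finditer(revcomp(seq)):
        # Convert the position on the reverse strand to forward coordinates.
        hits.append((n - m.start() - 6, "-", m.group(1)))
    return sorted(hits)


if __name__ == "__main__":
    print(find_w_boxes("AATTGACCGG"))  # core on the forward strand
    print(find_w_boxes("GGTCAA"))      # core on the reverse strand
```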
Chu, Chien-Hsin; Chang, Lung-Chun; Hsu, Hong-Ming; Wei, Shu-Yi; Liu, Hsing-Wei; Lee, Yu; Kuo, Chung-Chi; Indra, Dharmu; Chen, Chinpan; Ong, Shiou-Jeng; Tai, Jung-Hsiang
2011-01-01
Nuclear proteins usually contain specific peptide sequences, referred to as nuclear localization signals (NLSs), for nuclear import. These signals remain unexplored in the protozoan pathogen, Trichomonas vaginalis. The nuclear import of a Myb2 transcription factor was studied here using immunodetection of a hemagglutinin-tagged Myb2 overexpressed in the parasite. The tagged Myb2 was localized to the nucleus as punctate signals. With mutations of its polybasic sequences, 48KKQK51 and 61KR62, Myb2 was localized to the nucleus, but the signal was diffuse. When fused to a C-terminal non-nuclear protein, the Myb2 sequence spanning amino acid (aa) residues 48 to 143, which is embedded within the R2R3 DNA-binding domain (aa 40 to 156), was essential and sufficient for efficient nuclear import of a bacterial tetracycline repressor (TetR), and yet the transport efficiency was reduced with an additional fusion of a firefly luciferase to TetR, while classical NLSs from the simian virus 40 T-antigen had no function in this assay system. Myb2 nuclear import and DNA-binding activity were substantially perturbed with mutation of a conserved isoleucine (I74) in helix 2 to proline that altered secondary structure and tertiary folding of the R2R3 domain. Disruption of DNA-binding activity alone by point mutation of a lysine residue, K51, preceding the structural domain had little effect on Myb2 nuclear localization, suggesting that nuclear translocation of Myb2, which requires an ordered structural domain, is independent of its DNA binding activity. These findings provide useful information for testing whether myriad Mybs in the parasite use a common module to regulate nuclear import. PMID:22021237
Yan, Qin; Gong, Lili; Deng, Mi; Zhang, Lan; Sun, Shuming; Liu, Jiao; Ma, Haili; Yuan, Dan; Chen, Pei-Chao; Hu, Xiaohui; Liu, Jinping; Qin, Jichao; Xiao, Ling; Huang, Xiao-Qin; Zhang, Jian; Wan-Cheng Li, David
2010-01-01
Pax-6 is an evolutionarily conserved transcription factor regulating brain and eye development. Four Pax-6 isoforms have been reported previously. Although the longer Pax-6 isoforms (p46 and p48) bear two DNA-binding domains, the paired domain (PD) and the homeodomain (HD), the shorter Pax-6 isoform p32 contains only the HD for DNA binding. Although a third domain, the proline-, serine- and threonine-enriched activation (PST) domain, in the C termini of all Pax-6 isoforms mediates their transcriptional modulation via phosphorylation, how p32 Pax-6 could regulate target genes remains to be elucidated. In the present study, we show that sumoylation at K91 is required for p32 Pax-6 to bind to a HD-specific site and regulate expression of target genes. First, in vitro-synthesized p32 Pax-6 alone cannot bind the P3 sequence, which contains the HD recognition site, unless it is preincubated with nuclear extracts precleared by anti–Pax-6 but not by anti-small ubiquitin-related modifier 1 (anti-SUMO1) antibody. Second, in vitro-synthesized p32 Pax-6 can be sumoylated by SUMO1, and the sumoylated p32 Pax-6 then can bind to the P3 sequence. Third, Pax-6 and SUMO1 are colocalized in the embryonic optic and lens vesicles and can be coimmunoprecipitated. Finally, SUMO1-conjugated p32 Pax-6 exists in both the nucleus and cytoplasm, and sumoylation significantly enhances the DNA-binding ability of p32 Pax-6 and positively regulates gene expression. Together, our results demonstrate that sumoylation activates p32 Pax-6 in both DNA-binding and transcriptional activities. In addition, our studies demonstrate that p32 and p46 Pax-6 possess differential DNA-binding and regulatory activities. PMID:21084637
Isolation and characterization of target sequences of the chicken CdxA homeobox gene.
Margalit, Y; Yarus, S; Shapira, E; Gruenbaum, Y; Fainsod, A
1993-01-01
The DNA binding specificity of the chicken homeodomain protein CDXA was studied. Using a CDXA-glutathione-S-transferase fusion protein, DNA fragments containing the binding site for this protein were isolated. The sources of DNA were oligonucleotides with random sequence and chicken genomic DNA. The DNA fragments isolated were sequenced and tested in DNA binding assays. Sequencing revealed that most DNA fragments are AT rich, which is a common feature of homeodomain binding sites. By electrophoretic mobility shift assays it was shown that the different target sequences isolated bind to the CDXA protein with different affinities. The specific sequences bound by the CDXA protein in the genomic fragments isolated were determined by DNase I footprinting. From the footprinted sequences, the CDXA consensus binding site was determined. The CDXA protein binds the consensus sequence A, A/T, T, A/T, A, T, A/G. The CAUDAL binding site in the ftz promoter is also included in this consensus sequence. When tested, some of the genomic target sequences were capable of enhancing the transcriptional activity of reporter plasmids when introduced into CDXA expressing cells. This study determined the DNA sequence specificity of the CDXA protein and it also shows that this protein can further activate transcription in cells in culture. PMID:7909943
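The consensus A, A/T, T, A/T, A, T, A/G has three two-way ambiguous positions, so it describes 2 × 2 × 2 = 8 distinct heptamers. The short Python sketch below makes this concrete by expanding the consensus and converting it to a regular expression. The list representation and helper names are assumptions made for illustration and do not come from the original study.

```python
import re
from itertools import product

# One entry per position; multi-letter entries list the allowed alternatives.
CDXA_CONSENSUS = ["A", "AT", "T", "AT", "A", "T", "AG"]


def consensus_regex(positions):
    """Build a compiled regex such as A[AT]T[AT]AT[AG] from the position list."""
    parts = [p if len(p) == 1 else f"[{p}]" for p in positions]
    return re.compile("".join(parts))


def enumerate_sites(positions):
    """List every concrete sequence allowed by the consensus."""
    return ["".join(choice) for choice in product(*positions)]


if __name__ == "__main__":
    pattern = consensus_regex(CDXA_CONSENSUS)
    print(pattern.pattern)
    for site in enumerate_sites(CDXA_CONSENSUS):
        print(site)
```

A strict regular expression treats every allowed base as equally good. The abstract notes that isolated targets bind with different affinities, so a weighted model would be needed to capture that.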
Genomic structure of the human D-site binding protein (DBP) gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shutler, G.; Glassco, T.; Kang, Xiaolin
1996-06-15
The human gene for the D-Site Binding Protein (DBP) has been sequenced and characterized. This gene is a member of the b/ZIP family of transcription factors and is one of three genes forming the PAR sub-family. DBP has been implicated in the diurnal regulation of a variety of liver-specific genes. Examination of the genomic structure of DBP reveals that the gene is divided into four exons and is contained within a relatively compact region of approximately 6 kb. These exons appear to correspond to functional divisions of the DBP protein. Exon 1 contains a long 5′ UTR, and conservation between the rat and the human genes of the presence of small open reading frames within this region suggests that it may play a role in translational control. Exon 2 contains a limited region of similarity to the other PAR domain genes, which may be part of a potential activation domain. Exon 3 contains the PAR domain and differs by only 1 of 71 amino acids between rat and human. Exon 4, containing both the basic and the leucine zipper domains, is likewise highly conserved. The overall degree of homology between the rat and the human cDNA sequences is 82% for the nucleic acid sequence and 92% for the protein sequence. Comparison of the rat and human proximal promoters reveals extensive sequence conservation, with two previously characterized DNA binding sites being conserved at the functional and sequence levels. 31 refs., 4 figs.
Electrostatic control of DNA intersegmental translocation by the ETS transcription factor ETV6.
Vo, Tam; Wang, Shuo; Poon, Gregory M K; Wilson, W David
2017-08-11
To find their DNA target sites in complex solution environments containing excess heterogeneous DNA, sequence-specific DNA-binding proteins execute various translocation mechanisms known collectively as facilitated diffusion. For proteins harboring a single DNA contact surface, long-range translocation occurs by jumping between widely spaced DNA segments. We have configured biosensor-based surface plasmon resonance to directly measure the affinity and kinetics of this intersegmental jumping by the ETS-family transcription factor ETS variant 6 (ETV6). To isolate intersegmental target binding in a functionally defined manner, we pre-equilibrated ETV6 with excess salmon sperm DNA, a heterogeneous polymer, before exposing the nonspecifically bound protein to immobilized oligomeric DNA harboring a high-affinity ETV6 site. In this way, the mechanism of ETV6-target association could be toggled electrostatically through varying NaCl concentration in the bulk solution. Direct measurements of association and dissociation kinetics of the site-specific complex indicated that 1) freely diffusive binding by ETV6 proceeds through a nonspecific-like intermediate, 2) intersegmental jumping is rate-limited by dissociation from the nonspecific polymer, and 3) dissociation of the specific complex is independent of the history of complex formation. These results show that target searches by proteins with an ETS domain, such as ETV6, whose single DNA-binding domain cannot contact both source and destination sites simultaneously, are nonetheless strongly modulated by intersegmental jumping in heterogeneous site environments. Our findings establish biosensors as a general technique for directly and specifically measuring target site search by DNA-binding proteins via intersegmental translocation. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Pérez-Quintero, Alvaro L.; Rodriguez-R, Luis M.; Dereeper, Alexis; López, Camilo; Koebnik, Ralf; Szurek, Boris; Cunnac, Sebastien
2013-01-01
Transcription Activator-Like Effectors (TALEs) belong to a family of virulence proteins from the Xanthomonas genus of bacterial plant pathogens that are translocated into the plant cell. In the nucleus, TALEs act as transcription factors inducing the expression of susceptibility genes. A code for TALE-DNA binding specificity and high-resolution three-dimensional structures of TALE-DNA complexes were recently reported. Accurate prediction of TAL Effector Binding Elements (EBEs) is essential to elucidate the biological functions of the many sequenced TALEs as well as for robust design of artificial TALE DNA-binding domains in biotechnological applications. In this work a program with improved EBE prediction performance was developed using an updated specificity matrix and a position weight correction function to account for the matching pattern observed in a validation set of TALE-DNA interactions. To gain a systems perspective on the large TALE repertoires from X. oryzae strains, this program was used to predict rice gene targets for 99 sequenced family members. Integrating predictions and available expression data in a TALE-gene network revealed multiple candidate transcriptional targets for many TALEs as well as several possible instances of functional convergence among TALEs. PMID:23869221
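The idea of scoring candidate EBEs with a specificity matrix and a position weight correction can be sketched as follows. This is a minimal illustration, not the published program: the probabilities in `RVD_PREFS` are made-up values that only follow the general repeat-variable-diresidue preferences (NI for A, HD for C, NG for T, NN for G or A), and the linear `position_weight` function is a hypothetical stand-in for the correction function mentioned in the abstract.

```python
import math

# Illustrative preferences per repeat-variable diresidue (RVD).
# These numbers are assumptions for the sketch, not the paper's matrix.
RVD_PREFS = {
    "NI": {"A": 0.85, "C": 0.05, "G": 0.05, "T": 0.05},
    "HD": {"A": 0.05, "C": 0.85, "G": 0.05, "T": 0.05},
    "NG": {"A": 0.05, "C": 0.05, "G": 0.05, "T": 0.85},
    "NN": {"A": 0.30, "C": 0.05, "G": 0.60, "T": 0.05},
}


def position_weight(i, n, floor=0.5):
    """Hypothetical correction: weight 1.0 at the first repeat, falling linearly to `floor`."""
    if n == 1:
        return 1.0
    return 1.0 - (1.0 - floor) * i / (n - 1)


def score_site(rvds, site):
    """Weighted log-likelihood of a DNA site for a given RVD array (higher is better)."""
    if len(rvds) != len(site):
        raise ValueError("site length must equal the number of RVDs")
    n = len(rvds)
    return sum(
        position_weight(i, n) * math.log(RVD_PREFS[rvd][base])
        for i, (rvd, base) in enumerate(zip(rvds, site))
    )


def best_site(rvds, sequence):
    """Scan every window of the sequence and return (score, start) of the best match."""
    n = len(rvds)
    windows = [(score_site(rvds, sequence[i:i + n]), i)
               for i in range(len(sequence) - n + 1)]
    return max(windows)


if __name__ == "__main__":
    rvds = ["NI", "HD", "NG", "NN"]
    print(best_site(rvds, "TTACTGTT"))
```

With this weighting, a mismatch under an early repeat costs more than the same mismatch under a late repeat, which is the kind of positional effect a weight correction is meant to capture.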
SRY, like HMG1, recognizes sharp angles in DNA.
Ferrari, S; Harley, V R; Pontiggia, A; Goodfellow, P N; Lovell-Badge, R; Bianchi, M E
1992-01-01
HMG boxes are DNA binding domains present in chromatin proteins, general transcription factors for nucleolar and mitochondrial RNA polymerases, and gene- and tissue-specific transcriptional regulators. The HMG boxes of HMG1, an abundant component of chromatin, interact specifically with four-way junctions, DNA structures that are cross-shaped and contain angles of approximately 60 and 120 degrees between their arms. We show here also that the HMG box of SRY, the protein that determines the expression of male-specific genes in humans, recognizes four-way junction DNAs irrespective of their sequence. In addition, when SRY binds to linear duplex DNA containing its specific target AACAAAG, it produces a sharp bend. Therefore, the interaction between HMG boxes and DNA appears to be predominantly structure-specific. The production or the recognition of a kink in DNA can serve several distinct functions, such as the repair of DNA lesions, the folding of DNA segments with bound transcriptional factors into productive complexes or the wrapping of DNA in chromatin. PMID:1425584
Foggetti, Giorgia; Raimondi, Ivan; Campomenosi, Paola; Menichini, Paola
2014-01-01
TP63 is a member of the TP53 gene family that encodes for up to ten different TA and ΔN isoforms through alternative promoter usage and alternative splicing. Besides being a master regulator of gene expression for squamous epithelial proliferation, differentiation and maintenance, P63, through differential expression of its isoforms, plays important roles in tumorigenesis. All P63 isoforms share an immunoglobulin-like folded DNA binding domain responsible for binding to sequence-specific response elements (REs), whose overall consensus sequence is similar to that of the canonical p53 RE. Using a defined assay in yeast, where P63 isoforms and RE sequences are the only variables, and gene expression assays in human cell lines, we demonstrated that human TA- and ΔN-P63α proteins exhibited differences in transactivation specificity not observed with the corresponding P73 or P53 protein isoforms. These differences 1) were dependent on specific features of the RE sequence, 2) could be related to intrinsic differences in their oligomeric state and cooperative DNA binding, and 3) appeared to be conserved in evolution. Since genotoxic stress can change relative ratio of TA- and ΔN-P63α protein levels, the different transactivation specificity of each P63 isoform could potentially influence cellular responses to specific stresses. PMID:24926492
Initial Characterization of the Pf-Int Recombinase from the Malaria Parasite Plasmodium falciparum
Ghorbal, Mehdi; Scheidig-Benatar, Christine; Bouizem, Salma; Thomas, Christophe; Paisley, Genevieve; Faltermeier, Claire; Liu, Melanie; Scherf, Artur; Lopez-Rubio, Jose-Juan; Gopaul, Deshmukh N.
2012-01-01
Background Genetic variation is an essential means of evolution and adaptation in many organisms in response to environmental change. Certain DNA alterations can be carried out by site-specific recombinases (SSRs) that fall into two families: the serine and the tyrosine recombinases. SSRs are seldom found in eukaryotes. A gene homologous to a tyrosine site-specific recombinase has been identified in the genome of Plasmodium falciparum. The sequence is highly conserved among five other members of Plasmodia. Methodology/Principal Findings The predicted open reading frame encodes for a ∼57 kDa protein containing a C-terminal domain including the putative tyrosine recombinase conserved active site residues R-H-R-(H/W)-Y. The N-terminus has the typical alpha-helical bundle and potentially a mixed alpha-beta domain resembling that of λ-Int. Pf-Int mRNA is expressed differentially during the P. falciparum erythrocytic life stages, peaking in the schizont stage. Recombinant Pf-Int and affinity chromatography of DNA from genomic or synthetic origin were used to identify potential DNA targets after sequencing or micro-array hybridization. Interestingly, the sequences captured also included highly variable subtelomeric genes such as var, rif, and stevor sequences. Electrophoretic mobility shift assays with DNA were carried out to verify Pf-Int/DNA binding. Finally, Pf-Int knock-out parasites were created in order to investigate the biological role of Pf-Int. Conclusions/Significance Our data identify for the first time a malaria parasite gene with structural and functional features of recombinases. Pf-Int may bind to and alter DNA, either in a sequence specific or in a non-specific fashion, and may contribute to programmed or random DNA rearrangements. Pf-Int is the first molecular player identified with a potential role in genome plasticity in this pathogen. 
Finally, Pf-Int knock-out parasite is viable showing no detectable impact on blood stage development, which is compatible with such function. PMID:23056326
Abe, Yoshito; Fujisaki, Naoki; Miyoshi, Takanori; Watanabe, Noriko; Katayama, Tsutomu; Ueda, Tadashi
2016-02-01
DnaAcos, a mutant of the initiator DnaA, causes overinitiation of chromosome replication in Escherichia coli, resulting in inhibition of cell division. CedA was found to be a multi-copy suppressor which represses the dnaAcos inhibition of cell division. However, the functional mechanism of CedA remains elusive, apart from previously indicated possibilities of binding to DNA and RNA polymerase. In this study, we searched for the specific sites of CedA involved in binding DNA and RNA polymerase and in repressing cell division inhibition. First, the DNA sequence to which CedA preferentially binds was determined. Next, several residues and the β4 region in the CedA C-terminal domain were suggested to interact specifically with the DNA. Moreover, we found that the flexible N-terminal region was required for tight binding to longer DNA as well as interaction with RNA polymerase. Based on these results, several cedA mutants were examined for their ability to repress dnaAcos cell division inhibition. We found that the N-terminal region was dispensable and that Glu32 in the C-terminal domain was required for the repression. These results suggest that CedA has multiple roles and that residues with different functions are positioned in the two regions. © The Authors 2015. Published by Oxford University Press on behalf of the Japanese Biochemical Society. All rights reserved.
Peixoto, Paul; Liu, Yang; Depauw, Sabine; Hildebrand, Marie-Paule; Boykin, David W; Bailly, Christian; Wilson, W David; David-Cordonnier, Marie-Hélène
2008-06-01
The development of small molecules to control gene expression could be the spearhead of future-targeted therapeutic approaches in multiple pathologies. Among heterocyclic dications developed with this aim, a phenyl-furan-benzimidazole dication DB293 binds AT-rich sites as a monomer and 5'-ATGA sequence as a stacked dimer, both in the minor groove. Here, we used a protein/DNA array approach to evaluate the ability of DB293 to specifically inhibit transcription factors DNA-binding in a single-step, competitive mode. DB293 inhibits two POU-domain transcription factors Pit-1 and Brn-3 but not IRF-1, despite the presence of an ATGA and AT-rich sites within all three consensus sequences. EMSA, DNase I footprinting and surface-plasmon-resonance experiments determined the precise binding site, affinity and stoichiometry of DB293 interaction to the consensus targets. Binding of DB293 occurred as a cooperative dimer on the ATGA part of Brn-3 site but as two monomers on AT-rich sites of IRF-1 sequence. For Pit-1 site, ATGA or AT-rich mutated sequences identified the contribution of both sites for DB293 recognition. In conclusion, DB293 is a strong inhibitor of two POU-domain transcription factors through a cooperative binding to ATGA. These findings are the first to show that heterocyclic dications can inhibit major groove transcription factors and they open the door to the control of transcription factors activity by those compounds.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, Xun; Guanga, Gerald P; Wan, Cheng
2012-11-13
MafA is a proto-oncoprotein and is critical for insulin gene expression in pancreatic β-cells. Maf proteins belong to the AP1 superfamily of basic region-leucine zipper (bZIP) transcription factors. Residues in the basic helix and an ancillary N-terminal domain, the Extended Homology Region (EHR), endow maf proteins with unique DNA binding properties: binding a 13 bp consensus site consisting of a core AP1 site (TGACTCA) flanked by TGC sequences and binding DNA stably as monomers. To further characterize maf DNA binding, we determined the structure of a MafA–DNA complex. MafA forms base-specific hydrogen bonds with the flanking G–5C–4 and central C0/G0 bases, but not with the core-TGA bases. However, in vitro binding studies utilizing a pulse–chase electrophoretic mobility shift assay protocol revealed that mutating either the core-TGA or flanking-TGC bases dramatically increases the binding off rate. Comparing the known maf structures, we propose that DNA binding specificity results from positioning the basic helix through unique phosphate contacts. The EHR does not contact DNA directly but stabilizes DNA binding by contacting the basic helix. Collectively, these results suggest a novel multistep DNA binding process involving a conformational change from contacting the core-TGA to contacting the flanking-TGC bases.
DOE Office of Scientific and Technical Information (OSTI.GOV)
S Menon; S Wang
The PhoP protein from Mycobacterium tuberculosis is a response regulator of the OmpR/PhoB subfamily, whose structure consists of an N-terminal receiver domain and a C-terminal DNA-binding domain. How the DNA-binding activities are regulated by phosphorylation of the receiver domain remains unclear due to a lack of structural information on the full-length proteins. Here we report the crystal structure of the full-length PhoP of M. tuberculosis. Unlike other known structures of full-length proteins of the same subfamily, PhoP forms a dimer through its receiver domain with the dimer interface involving α4-β5-α5, a common interface for activated receiver domain dimers. However, the switch residues, Thr99 and Tyr118, are in a conformation resembling those of nonactivated receiver domains. The Tyr118 side chain is involved in the dimer interface interactions. The receiver domain is tethered to the DNA-binding domain through a flexible linker and does not impose structural constraints on the DNA-binding domain. This structure suggests that phosphorylation likely facilitates/stabilizes receiver domain dimerization, bringing the DNA-binding domains to close proximity, thereby increasing their binding affinity for direct repeat DNA sequences.
TFBSshape: a motif database for DNA shape features of transcription factor binding sites
Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W.; Gordân, Raluca; Rohs, Remo
2014-01-01
Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein–DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone. PMID:24214955
TALE-PvuII Fusion Proteins – Novel Tools for Gene Targeting
Yanik, Mert; Alzubi, Jamal; Lahaye, Thomas; Cathomen, Toni; Pingoud, Alfred; Wende, Wolfgang
2013-01-01
Zinc finger nucleases (ZFNs) consist of zinc fingers as DNA-binding module and the non-specific DNA-cleavage domain of the restriction endonuclease FokI as DNA-cleavage module. This architecture is also used by TALE nucleases (TALENs), in which the DNA-binding modules of the ZFNs have been replaced by DNA-binding domains based on transcription activator like effector (TALE) proteins. Both TALENs and ZFNs are programmable nucleases which rely on the dimerization of FokI to induce double-strand DNA cleavage at the target site after recognition of the target DNA by the respective DNA-binding module. TALENs seem to have an advantage over ZFNs, as the assembly of TALE proteins is easier than that of ZFNs. Here, we present evidence that variant TALENs can be produced by replacing the catalytic domain of FokI with the restriction endonuclease PvuII. These fusion proteins recognize only the composite recognition site consisting of the target site of the TALE protein and the PvuII recognition sequence (addressed site), but not isolated TALE or PvuII recognition sites (unaddressed sites), even at high excess of protein over DNA and long incubation times. In vitro, their preference for an addressed over an unaddressed site is > 34,000-fold. Moreover, TALE-PvuII fusion proteins are active in cellula with minimal cytotoxicity. PMID:24349308
Specific minor groove solvation is a crucial determinant of DNA binding site recognition
Harris, Lydia-Ann; Williams, Loren Dean; Koudelka, Gerald B.
2014-01-01
The DNA sequence preferences of nearly all sequence specific DNA binding proteins are influenced by the identities of bases that are not directly contacted by protein. Discrimination between non-contacted base sequences is commonly based on the differential abilities of DNA sequences to allow narrowing of the DNA minor groove. However, the factors that govern the propensity of minor groove narrowing are not completely understood. Here we show that the differential abilities of various DNA sequences to support formation of a highly ordered and stable minor groove solvation network are a key determinant of non-contacted base recognition by a sequence-specific binding protein. In addition, disrupting the solvent network in the non-contacted region of the binding site alters the protein's ability to recognize contacted base sequences at positions 5–6 bases away. This observation suggests that DNA solvent interactions link contacted and non-contacted base recognition by the protein. PMID:25429976
A DNA sequence obtained by replacement of the dopamine RNA aptamer bases is not an aptamer.
Álvarez-Martos, Isabel; Ferapontova, Elena E
2017-08-05
A unique specificity of the aptamer-ligand biorecognition and binding facilitates bioanalysis and biosensor development, contributing to discrimination of structurally related molecules, such as dopamine and other catecholamine neurotransmitters. The aptamer sequence capable of specific binding of dopamine is a 57-nucleotide-long RNA sequence reported in 1997 (Biochemistry, 1997, 36, 9726). Later, it was suggested that the DNA homologue of the RNA aptamer retains the specificity of dopamine binding (Biochem. Biophys. Res. Commun., 2009, 388, 732). Here, we show that the DNA sequence obtained by the replacement of the RNA aptamer bases with their DNA analogues is not capable of specific biorecognition of dopamine, in contrast to the original RNA aptamer sequence. This DNA sequence binds dopamine and structurally related catecholamine neurotransmitters non-specifically, as any DNA sequence does, and thus is not an aptamer and cannot be used for either in vivo or in situ analysis of dopamine in the presence of structurally related neurotransmitters. Copyright © 2017 Elsevier Inc. All rights reserved.
Stepanchick, Ann; Zhi, Huijun; Cavanaugh, Alice H.; Rothblum, Katrina; Schneider, David A.; Rothblum, Lawrence I.
2013-01-01
The human homologue of yeast Rrn3 is an RNA polymerase I-associated transcription factor that is essential for ribosomal DNA (rDNA) transcription. The generally accepted model is that Rrn3 functions as a bridge between RNA polymerase I and the transcription factors bound to the committed template. In this model Rrn3 would mediate an interaction between the mammalian Rrn3-polymerase I complex and SL1, the rDNA transcription factor that binds to the core promoter element of the rDNA. In the course of studying the role of Rrn3 in recruitment, we found that Rrn3 was in fact a DNA-binding protein. Analysis of the sequence of Rrn3 identified a domain with sequence similarity to the DNA binding domain of heat shock transcription factor 2. Randomization, or deletion, of the amino acids in this region in Rrn3, amino acids 382–400, abrogated its ability to bind DNA, indicating that this domain was an important contributor to DNA binding by Rrn3. Control experiments demonstrated that these mutant Rrn3 constructs were capable of interacting with both rpa43 and SL1, two other activities demonstrated to be essential for Rrn3 function. However, neither of these Rrn3 mutants was capable of functioning in transcription in vitro. Moreover, although wild-type human Rrn3 complemented a yeast rrn3-ts mutant, the DNA-binding site mutant did not. These results demonstrate that DNA binding by Rrn3 is essential for transcription by RNA polymerase I. PMID:23393135
Non-B-DNA structures on the interferon-beta promoter?
Robbe, K; Bonnefoy, E
1998-01-01
The high mobility group (HMG) I protein intervenes as an essential factor during the virus-induced expression of the interferon-beta (IFN-beta) gene. It is a non-histone chromatin-associated protein that has the dual capacity of binding to a non-B-DNA structure such as cruciform-DNA as well as to AT-rich B-DNA sequences. In this work we compare the binding affinity of HMGI for a synthetic cruciform-DNA to its binding affinity for the HMGI-binding-site present in the positive regulatory domain II (PRDII) of the IFN-beta promoter. Using gel retardation experiments, we show that HMGI protein binds with at least ten times more affinity to the synthetic cruciform-DNA structure than to the PRDII B-DNA sequence. DNA hairpin sequences are present in both the human and the murine PRDII-DNAs. We discuss in this work the presence of, yet putative, non-B-DNA structures in the IFN-beta promoter.
Structural Basis for Sequence-specific DNA Recognition by an Arabidopsis WRKY Transcription Factor
Yamasaki, Kazuhiko; Kigawa, Takanori; Watanabe, Satoru; Inoue, Makoto; Yamasaki, Tomoko; Seki, Motoaki; Shinozaki, Kazuo; Yokoyama, Shigeyuki
2012-01-01
The WRKY family transcription factors regulate plant-specific reactions that are mostly related to biotic and abiotic stresses. They share the WRKY domain, which recognizes a DNA element (TTGAC(C/T)) termed the W-box, in target genes. Here, we determined the solution structure of the C-terminal WRKY domain of Arabidopsis WRKY4 in complex with the W-box DNA by NMR. A four-stranded β-sheet enters the major groove of DNA in an atypical mode termed the β-wedge, where the sheet is nearly perpendicular to the DNA helical axis. Residues in the conserved WRKYGQK motif contact DNA bases mainly through extensive apolar contacts with thymine methyl groups. The importance of these contacts was verified by substituting the relevant T bases with U and by surface plasmon resonance analyses of DNA binding. PMID:22219184
2011-01-01
Background Existing methods of predicting DNA-binding proteins used valuable features of physicochemical properties to design support vector machine (SVM) based classifiers. Generally, selection of physicochemical properties and determination of their corresponding feature vectors rely mainly on known properties of binding mechanism and experience of designers. However, there exists a troublesome problem for designers that some different physicochemical properties have similar vectors of representing 20 amino acids and some closely related physicochemical properties have dissimilar vectors. Results This study proposes a systematic approach (named Auto-IDPCPs) to automatically identify a set of physicochemical and biochemical properties in the AAindex database to design SVM-based classifiers for predicting and analyzing DNA-binding domains/proteins. Auto-IDPCPs consists of 1) clustering 531 amino acid indices in AAindex into 20 clusters using a fuzzy c-means algorithm, 2) utilizing an efficient genetic algorithm based optimization method IBCGA to select an informative feature set of size m to represent sequences, and 3) analyzing the selected features to identify related physicochemical properties which may affect the binding mechanism of DNA-binding domains/proteins. The proposed Auto-IDPCPs identified m=22 features of properties belonging to five clusters for predicting DNA-binding domains with a five-fold cross-validation accuracy of 87.12%, which is promising compared with the accuracy of 86.62% of the existing method PSSM-400. For predicting DNA-binding sequences, the accuracy of 75.50% was obtained using m=28 features, where PSSM-400 has an accuracy of 74.22%. Auto-IDPCPs and PSSM-400 have accuracies of 80.73% and 82.81%, respectively, applied to an independent test data set of DNA-binding domains. 
Some typical physicochemical properties discovered are hydrophobicity, secondary structure, charge, solvent accessibility, polarity, flexibility, normalized Van Der Waals volume, pK (pK-C, pK-N, pK-COOH and pK-a(RCOOH)), etc. Conclusions The proposed approach Auto-IDPCPs would help designers to investigate informative physicochemical and biochemical properties by considering both prediction accuracy and analysis of binding mechanism simultaneously. The approach Auto-IDPCPs can be also applicable to predict and analyze other protein functions from sequences. PMID:21342579
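To make the notion of representing a sequence by a small set of amino acid indices concrete, the sketch below encodes a protein sequence as one number per selected index. Averaging each index over the sequence is an assumption made for illustration; the abstract does not specify how Auto-IDPCPs builds its feature vectors. The hydropathy values are the Kyte-Doolittle scale, and the `CHARGE` index is a simplified stand-in defined here, not an AAindex entry.

```python
# Kyte-Doolittle hydropathy scale.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Simplified side-chain charge near neutral pH (illustrative only).
CHARGE = {aa: 0.0 for aa in HYDROPATHY}
CHARGE.update({"K": 1.0, "R": 1.0, "D": -1.0, "E": -1.0})

# The "selected feature set": each entry maps a name to a 20-residue index.
SELECTED_INDICES = {"hydropathy": HYDROPATHY, "charge": CHARGE}


def encode(sequence, indices=SELECTED_INDICES):
    """Return one feature per index: the mean index value over the sequence."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    features = []
    for table in indices.values():
        features.append(sum(table[aa] for aa in sequence) / len(sequence))
    return features


if __name__ == "__main__":
    # A lysine/arginine-rich fragment gives a positive mean charge feature.
    print(encode("KRKRAG"))
```

A vector produced this way could be passed to any classifier; the choice of which indices enter `SELECTED_INDICES` is the part that a feature selection method would optimize.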
Kshirsagar, Rucha; Khan, Krishnendu; Joshi, Mamata V; Hosur, Ramakrishna V; Muniyappa, K
2017-05-23
A plethora of evidence suggests that different types of DNA quadruplexes are widely present in the genome of all organisms. The existence of a growing number of proteins that selectively bind and/or process these structures underscores their biological relevance. Moreover, G-quadruplex DNA has been implicated in the alignment of four sister chromatids by forming parallel guanine quadruplexes during meiosis; however, the underlying mechanism is not well defined. Here we show that a G/C-rich motif associated with a meiosis-specific DNA double-strand break (DSB) in Saccharomyces cerevisiae folds into G-quadruplex, and the C-rich sequence complementary to the G-rich sequence forms an i-motif. The presence of G-quadruplex or i-motif structures upstream of the green fluorescent protein-coding sequence markedly reduces the levels of gfp mRNA expression in S. cerevisiae cells, with a concomitant decrease in green fluorescent protein abundance, and blocks primer extension by DNA polymerase, thereby demonstrating the functional significance of these structures. Surprisingly, although S. cerevisiae Hop1, a component of synaptonemal complex axial/lateral elements, exhibits strong affinity to G-quadruplex DNA, it displays a much weaker affinity for the i-motif structure. However, the Hop1 C-terminal but not the N-terminal domain possesses strong i-motif binding activity, implying that the C-terminal domain has a distinct substrate specificity. Additionally, we found that Hop1 promotes intermolecular pairing between G/C-rich DNA segments associated with a meiosis-specific DSB site. Our results support the idea that the G/C-rich motifs associated with meiosis-specific DSBs fold into intramolecular G-quadruplex and i-motif structures, both in vitro and in vivo, thus revealing an important link between non-B form DNA structures and Hop1 in meiotic chromosome synapsis and recombination. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Bidlingmaier, Scott; Ha, Kevin; Lee, Nam-Kyung; Su, Yang; Liu, Bin
2016-04-01
Although the bioactive sphingolipid ceramide is an important cell signaling molecule, relatively few direct ceramide-interacting proteins are known. We used an approach combining yeast surface cDNA display and deep sequencing technology to identify novel proteins binding directly to ceramide. We identified 234 candidate ceramide-binding protein fragments and validated binding for 20. Most (17) bound selectively to ceramide, although a few (3) bound to other lipids as well. Several novel ceramide-binding domains were discovered, including the EF-hand calcium-binding motif, the heat shock chaperonin-binding motif STI1, the SCP2 sterol-binding domain, and the tetratricopeptide repeat region motif. Interestingly, four of the verified ceramide-binding proteins (HPCA, HPCAL1, NCS1, and VSNL1) and an additional three candidate ceramide-binding proteins (NCALD, HPCAL4, and KCNIP3) belong to the neuronal calcium sensor family of EF hand-containing proteins. We used mutagenesis to map the ceramide-binding site in HPCA and to create a mutant HPCA that does not bind to ceramide. We demonstrated selective binding to ceramide by mammalian cell-produced wild type but not mutant HPCA. Intriguingly, we also identified a fragment from prostaglandin D2 synthase that binds preferentially to ceramide 1-phosphate. The wide variety of proteins and domains capable of binding to ceramide suggests that many of the signaling functions of ceramide may be regulated by direct binding to these proteins. Based on the deep sequencing data, we estimate that our yeast surface cDNA display library covers ∼60% of the human proteome and our selection/deep sequencing protocol can identify target-interacting protein fragments that are present at extremely low frequency in the starting library. Thus, the yeast surface cDNA display/deep sequencing approach is a rapid, comprehensive, and flexible method for the analysis of protein-ligand interactions, particularly for the study of non-protein ligands. 
© 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Koentjoro, Maharani Pertiwi; Adachi, Naruhiko; Senda, Miki; Ogawa, Naoto; Senda, Toshiya
2018-03-01
LysR-type transcriptional regulators (LTTRs) are among the most abundant transcriptional regulators in bacteria. CbnR is an LTTR derived from Cupriavidus necator (formerly Alcaligenes eutrophus or Ralstonia eutropha) NH9 and is involved in transcriptional activation of the cbnABCD genes encoding chlorocatechol degradative enzymes. CbnR interacts with a cbnA promoter region of approximately 60 bp in length that contains the recognition-binding site (RBS) and activation-binding site (ABS). Upon inducer binding, CbnR seems to undergo conformational changes, leading to the activation of the transcription. Since the interaction of an LTTR with RBS is considered to be the first step of the transcriptional activation, the CbnR-RBS interaction is responsible for the selectivity of the promoter to be activated. To understand the sequence selectivity of CbnR, we determined the crystal structure of the DNA-binding domain of CbnR in complex with RBS of the cbnA promoter at 2.55 Å resolution. The crystal structure revealed details of the interactions between the DNA-binding domain and the promoter DNA. A comparison with the previously reported crystal structure of the DNA-binding domain of BenM in complex with its cognate RBS showed several differences in the DNA interactions, despite the structural similarity between CbnR and BenM. These differences explain the observed promoter sequence selectivity between CbnR and BenM. Particularly, the difference between Thr33 in CbnR and Ser33 in BenM appears to affect the conformations of neighboring residues, leading to the selective interactions with DNA. Atomic coordinates and structure factors for the DNA-binding domain of Cupriavidus necator NH9 CbnR in complex with RBS are available in the Protein Data Bank under the accession code 5XXP. © 2018 Federation of European Biochemical Societies.
Structure of apo-CAP reveals that large conformational changes are necessary for DNA binding
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sharma, Hitesh; Yu, Shaoning; Kong, Jilie
2009-10-21
The binding of cAMP to the Escherichia coli catabolite gene activator protein (CAP) produces a conformational change that enables it to bind specific DNA sequences and regulate transcription, which it cannot do in the absence of the nucleotide. The crystal structures of the unliganded CAP containing a D138L mutation and the unliganded WT CAP were determined at 2.3 and 3.6 Å resolution, respectively, and reveal that the two DNA binding domains have dimerized into one rigid body and their two DNA recognition helices become buried. The WT structure shows multiple orientations of this rigid body relative to the nucleotide binding domain supporting earlier biochemical data suggesting that the inactive form exists in an equilibrium among different conformations. Comparison of the structures of the liganded and unliganded CAP suggests that cAMP stabilizes the active DNA binding conformation of CAP through the interactions that the N⁶ of the adenosine makes with the C-helices. These interactions are associated with the reorientation and elongation of the C-helices that precludes the formation of the inactive structure.
Hutchens, T W; Allen, M H; Li, C M; Yip, T T
1992-09-07
The metal ion specificity of most 'zinc-finger' metal binding domains is unknown. The human estrogen receptor protein contains two different C2-C2 type 'zinc-finger' sequences within its DNA-binding domain (ERDBD). Copper inhibits the function of this protein by mechanisms which remain unclear. We have used electrospray ionization mass spectrometry to evaluate directly the 71-residue ERDBD (K180-M250) in the absence and presence of Cu(II) ions. The ERDBD showed a high affinity for Cu and was completely occupied with 4 Cu bound; each Cu ion was evidently bound to only two ligand residues (net loss of only 2 Da per bound Cu). The Cu binding stoichiometry was confirmed by atomic absorption. These results (i) provide the first direct physical evidence for the ability of the estrogen receptor DNA-binding domain to bind Cu and (ii) document a twofold difference in the Zn- and Cu-binding capacity. Differences in the ERDBD domain structure with bound Zn and Cu are predicted. Given the relative intracellular contents of Zn and Cu, our findings demonstrate the need to investigate further the Cu occupancy of this and other zinc-finger domains both in vitro and in vivo.
Foti, M; Omichinski, J G; Stahl, S; Maloney, D; West, J; Schweitzer, B I
1999-02-05
We investigate here the effects of the incorporation of the nucleoside analogs araC (1-beta-D-arabinofuranosylcytosine) and ganciclovir (9-[(1,3-dihydroxy-2-propoxy)methyl] guanine) into the DNA binding recognition sequence for the GATA-1 erythroid transcription factor. A 10-fold decrease in binding affinity was observed for the ganciclovir-substituted DNA complex in comparison to an unmodified DNA of the same sequence composition. AraC substitution did not result in any changes in binding affinity. 1H-15N HSQC and NOESY NMR experiments revealed a number of chemical shift changes in both DNA and protein in the ganciclovir-modified DNA-protein complex when compared to the unmodified DNA-protein complex. These changes in chemical shift and binding affinity suggest a change in the binding mode of the complex when ganciclovir is incorporated into the GATA DNA binding site.
The Agrobacterium tumefaciens Transcription Factor BlcR Is Regulated via Oligomerization
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pan, Yi; Fiscus, Valena; Meng, Wuyi
2012-02-08
The Agrobacterium tumefaciens BlcR is a member of the emerging isocitrate lyase transcription regulators that negatively regulates metabolism of γ-butyrolactone, and its repressing function is relieved by succinate semialdehyde (SSA). Our crystal structure showed that BlcR folded into the DNA- and SSA-binding domains and dimerized via the DNA-binding domains. Mutational analysis identified residues, including Phe¹⁴⁷, that are important for SSA association; BlcR(F147A) existed as a tetramer. Two BlcR dimers bound to target DNA in a cooperative manner, and the distance between the two BlcR-binding sequences in DNA was critical for BlcR-DNA association. Tetrameric BlcR(F147A) retained DNA binding activity, and importantly, this activity was not affected by the distance separating the BlcR-binding sequences in DNA. SSA did not dissociate tetrameric BlcR(F147A) or BlcR(F147A)-DNA. As well as in the SSA-binding site, Phe¹⁴⁷ is located in a structurally flexible loop that may be involved in BlcR oligomerization. We propose that SSA regulates BlcR DNA-binding function via oligomerization.
Comparative Analysis of Transcription Factors Families across Fungal Tree of Life
DOE Office of Scientific and Technical Information (OSTI.GOV)
Salamov, Asaf; Grigoriev, Igor
2015-03-19
Transcription factors (TFs) are proteins that regulate the transcription of genes by binding to specific DNA sequences. Based on the literature (Shelest, 2008; Weirauch and Hughes, 2011), we collected and manually curated a list of DBD Pfam domains (62 DBD domains in total). We looked for the distribution of TFs in 395 fungal genomes and additionally in plant genomes (Phytozome), prokaryotes (IMG), and some animal/metazoan and protist genomes.
MFP1 is a thylakoid-associated, nucleoid-binding protein with a coiled-coil structure
Jeong, Sun Yong; Rose, Annkatrin; Meier, Iris
2003-01-01
Plastid DNA, like bacterial and mitochondrial DNA, is organized into protein–DNA complexes called nucleoids. Plastid nucleoids are believed to be associated with the inner envelope in developing plastids and the thylakoid membranes in mature chloroplasts, but the mechanism for this re-localization is unknown. Here, we present the further characterization of the coiled-coil DNA-binding protein MFP1 as a protein associated with nucleoids and with the thylakoid membranes in mature chloroplasts. MFP1 is located in plastids in both suspension culture cells and leaves and is attached to the thylakoid membranes with its C-terminal DNA-binding domain oriented towards the stroma. It has a major DNA-binding activity in mature Arabidopsis chloroplasts and binds to all tested chloroplast DNA fragments without detectable sequence specificity. Its expression is tightly correlated with the accumulation of thylakoid membranes. Importantly, it is associated in vivo with nucleoids, suggesting a function for MFP1 at the interface between chloroplast nucleoids and the developing thylakoid membrane system. PMID:12930969
Diversity, expansion, and evolutionary novelty of plant DNA-binding transcription factor families.
Lehti-Shiu, Melissa D; Panchy, Nicholas; Wang, Peipei; Uygun, Sahra; Shiu, Shin-Han
2017-01-01
Plant transcription factors (TFs) that interact with specific sequences via DNA-binding domains are crucial for regulating transcriptional initiation and are fundamental to plant development and environmental response. In addition, expansion of TF families has allowed functional divergence of duplicate copies, which has contributed to novel, and in some cases adaptive, traits in plants. Thus, TFs are central to the generation of the diverse plant species that we see today. Major plant agronomic traits, including those relevant to domestication, have also frequently arisen through changes in TF coding sequence or expression patterns. Here our goal is to provide an overview of plant TF evolution by first comparing the diversity of DNA-binding domains and the sizes of these domain families in plants and other eukaryotes. Because TFs are among the most highly expanded gene families in plants, the birth and death process of TFs as well as the mechanisms contributing to their retention are discussed. We also provide recent examples of how TFs have contributed to novel traits that are important in plant evolution and in agriculture. This article is part of a Special Issue entitled: Plant Gene Regulatory Mechanisms and Networks, edited by Dr. Erich Grotewold and Dr. Nathan Springer. Copyright © 2016 Elsevier B.V. All rights reserved.
Nelson, Christopher S.; Fuller, Chris K.; Fordyce, Polly M.; Greninger, Alexander L.; Li, Hao; DeRisi, Joseph L.
2013-01-01
The transcription factor forkhead box P2 (FOXP2) is believed to be important in the evolution of human speech. A mutation in its DNA-binding domain causes severe speech impairment. Humans have acquired two coding changes relative to the conserved mammalian sequence. Despite intense interest in FOXP2, it has remained an open question whether the human protein’s DNA-binding specificity and chromatin localization are conserved. Previous in vitro and ChIP-chip studies have provided conflicting consensus sequences for the FOXP2-binding site. Using MITOMI 2.0 microfluidic affinity assays, we describe the binding site of FOXP2 and its affinity profile in base-specific detail for all substitutions of the strongest binding site. We find that human and chimp FOXP2 have similar binding sites that are distinct from previously suggested consensus binding sites. Additionally, through analysis of FOXP2 ChIP-seq data from cultured neurons, we find strong overrepresentation of a motif that matches our in vitro results and identifies a set of genes with FOXP2 binding sites. The FOXP2-binding sites tend to be conserved, yet we identified 38 instances of evolutionarily novel sites in humans. Combined, these data present a comprehensive portrait of FOXP2’s binding properties and imply that although its sequence specificity has been conserved, some of its genomic binding sites are newly evolved. PMID:23625967
The binding of TIA-1 to RNA C-rich sequences is driven by its C-terminal RRM domain
Cruz-Gallardo, Isabel; Aroca, Ángeles; Gunzburg, Menachem J; Sivakumaran, Andrew; Yoon, Je-Hyun; Angulo, Jesús; Persson, Cecilia; Gorospe, Myriam; Karlsson, B Göran; Wilce, Jacqueline A; Díaz-Moreno, Irene
2014-01-01
T-cell intracellular antigen-1 (TIA-1) is a key DNA/RNA binding protein that regulates translation by sequestering target mRNAs in stress granules (SG) in response to stress conditions. TIA-1 possesses three RNA recognition motifs (RRM) along with a glutamine-rich domain, with the central domains (RRM2 and RRM3) acting as RNA binding platforms. While the RRM2 domain, which displays high affinity for U-rich RNA sequences, is primarily responsible for interaction with RNA, the contribution of RRM3 to bind RNA as well as the target RNA sequences that it binds preferentially are still unknown. Here we combined nuclear magnetic resonance (NMR) and surface plasmon resonance (SPR) techniques to elucidate the sequence specificity of TIA-1 RRM3. With a novel approach using saturation transfer difference NMR (STD-NMR) to quantify protein–nucleic acids interactions, we demonstrate that isolated RRM3 binds to both C- and U-rich stretches with micromolar affinity. In combination with RRM2 and in the context of full-length TIA-1, RRM3 significantly enhanced the binding to RNA, particularly to cytosine-rich RNA oligos, as assessed by biotinylated RNA pull-down analysis. Our findings provide new insight into the role of RRM3 in regulating TIA-1 binding to C-rich stretches, that are abundant at the 5′ TOPs (5′ terminal oligopyrimidine tracts) of mRNAs whose translation is repressed under stress situations. PMID:24824036
The consequences of sequence erosion in the evolution of recombination hotspots.
Tiemann-Boege, Irene; Schwarz, Theresa; Striedner, Yasmin; Heissl, Angelika
2017-12-19
Meiosis is initiated by a double-strand break (DSB) introduced in the DNA by a highly controlled process that is repaired by recombination. In many organisms, recombination occurs at specific and narrow regions of the genome, known as recombination hotspots, which overlap with regions enriched for DSBs. In recent years, it has been demonstrated that conversions and mutations resulting from the repair of DSBs lead to a rapid sequence evolution at recombination hotspots eroding target sites for DSBs. We still do not fully understand the effect of this erosion in the recombination activity, but evidence has shown that the binding of trans-acting factors like PRDM9 is affected. PRDM9 is a meiosis-specific, multi-domain protein that recognizes DNA target motifs by its zinc finger domain and directs DSBs to these target sites. Here we discuss the changes in affinity of PRDM9 to eroded recognition sequences, and explain how these changes in affinity of PRDM9 can affect recombination, leading sometimes to sterility in the context of hybrid crosses. We also present experimental data showing that DNA methylation reduces PRDM9 binding in vitro. Finally, we discuss PRDM9-independent hotspots, posing the question how these hotspots evolve and change with sequence erosion. This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'. © 2017 The Authors.
The consequences of sequence erosion in the evolution of recombination hotspots
Schwarz, Theresa; Heissl, Angelika
2017-01-01
Meiosis is initiated by a double-strand break (DSB) introduced in the DNA by a highly controlled process that is repaired by recombination. In many organisms, recombination occurs at specific and narrow regions of the genome, known as recombination hotspots, which overlap with regions enriched for DSBs. In recent years, it has been demonstrated that conversions and mutations resulting from the repair of DSBs lead to a rapid sequence evolution at recombination hotspots eroding target sites for DSBs. We still do not fully understand the effect of this erosion in the recombination activity, but evidence has shown that the binding of trans-acting factors like PRDM9 is affected. PRDM9 is a meiosis-specific, multi-domain protein that recognizes DNA target motifs by its zinc finger domain and directs DSBs to these target sites. Here we discuss the changes in affinity of PRDM9 to eroded recognition sequences, and explain how these changes in affinity of PRDM9 can affect recombination, leading sometimes to sterility in the context of hybrid crosses. We also present experimental data showing that DNA methylation reduces PRDM9 binding in vitro. Finally, we discuss PRDM9-independent hotspots, posing the question how these hotspots evolve and change with sequence erosion. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’. PMID:29109225
2017-01-01
Target search as performed by DNA-binding proteins is a complex process, in which multiple factors contribute to both thermodynamic discrimination of the target sequence from overwhelmingly abundant off-target sites and kinetic acceleration of dynamic sequence interrogation. TRF1, the protein that binds to telomeric tandem repeats, faces an intriguing variant of the search problem where target sites are clustered within short fragments of chromosomal DNA. In this study, we use extensive (>0.5 ms in total) MD simulations to study the dynamical aspects of sequence-specific binding of TRF1 at both telomeric and non-cognate DNA. For the first time, we describe the spontaneous formation of a sequence-specific native protein–DNA complex in atomistic detail, and study the mechanism by which proteins avoid off-target binding while retaining high affinity for target sites. Our calculated free energy landscapes reproduce the thermodynamics of sequence-specific binding, while statistical approaches allow for a comprehensive description of intermediate stages of complex formation. PMID:28633355
Lenzmeier, B A; Giebler, H A; Nyborg, J K
1998-02-01
Efficient human T-cell leukemia virus type 1 (HTLV-1) replication and viral gene expression are dependent upon the virally encoded oncoprotein Tax. To activate HTLV-1 transcription, Tax interacts with the cellular DNA binding protein cyclic AMP-responsive element binding protein (CREB) and recruits the coactivator CREB binding protein (CBP), forming a nucleoprotein complex on the three viral cyclic AMP-responsive elements (CREs) in the HTLV-1 promoter. Short stretches of dG-dC-rich (GC-rich) DNA, immediately flanking each of the viral CREs, are essential for Tax recruitment of CBP in vitro and Tax transactivation in vivo. Although the importance of the viral CRE-flanking sequences is well established, several studies have failed to identify an interaction between Tax and the DNA. The mechanistic role of the viral CRE-flanking sequences has therefore remained enigmatic. In this study, we used high resolution methidiumpropyl-EDTA iron(II) footprinting to show that Tax extended the CREB footprint into the GC-rich DNA flanking sequences of the viral CRE. The Tax-CREB footprint was enhanced but not extended by the KIX domain of CBP, suggesting that the coactivator increased the stability of the nucleoprotein complex. Conversely, the footprint pattern of CREB on a cellular CRE lacking GC-rich flanking sequences did not change in the presence of Tax or Tax plus KIX. The minor-groove DNA binding drug chromomycin A3 bound to the GC-rich flanking sequences and inhibited the association of Tax and the Tax-CBP complex without affecting CREB binding. Tax specifically cross-linked to the viral CRE in the 5'-flanking sequence, and this cross-link was blocked by chromomycin A3. Together, these data support a model where Tax interacts directly with both CREB and the minor-groove viral CRE-flanking sequences to form a high-affinity binding site for the recruitment of CBP to the HTLV-1 promoter.
Fisher, R P; Topper, J N; Clayton, D A
1987-07-17
Selective transcription of human mitochondrial DNA requires a transcription factor (mtTF) in addition to an essentially nonselective RNA polymerase. Partially purified mtTF is able to sequester promoter-containing DNA in preinitiation complexes in the absence of mitochondrial RNA polymerase, suggesting a DNA-binding mechanism for factor activity. Functional domains, required for positive transcriptional regulation by mtTF, are identified within both major promoters of human mtDNA through transcription of mutant promoter templates in a reconstituted in vitro system. These domains are essentially coextensive with DNA sequences protected from nuclease digestion by mtTF-binding. Comparison of the sequences of the two mtTF-responsive elements reveals significant homology only when one sequence is inverted; the binding sites are in opposite orientations with respect to the predominant direction of transcription. Thus mtTF may function bidirectionally, requiring additional protein-DNA interactions to dictate transcriptional polarity. The mtTF-responsive elements are arrayed as direct repeats, separated by approximately 80 bp within the displacement-loop region of human mitochondrial DNA; this arrangement may reflect duplication of an ancestral bidirectional promoter, giving rise to separate, unidirectional promoters for each strand.
Nagano, Yukio; Furuhashi, Hirofumi; Inaba, Takehito; Sasaki, Yukiko
2001-01-01
Complementary DNA encoding a DNA-binding protein, designated PLATZ1 (plant AT-rich sequence- and zinc-binding protein 1), was isolated from peas. The amino acid sequence of the protein is similar to those of other uncharacterized proteins predicted from the genome sequences of higher plants. However, no paralogous sequences have been found outside the plant kingdom. Multiple alignments among these paralogous proteins show that several cysteine and histidine residues are invariant, suggesting that these proteins are a novel class of zinc-dependent DNA-binding proteins with two distantly located regions, C-x2-H-x11-C-x2-C-x(4–5)-C-x2-C-x(3–7)-H-x2-H and C-x2-C-x(10–11)-C-x3-C. In an electrophoretic mobility shift assay, the zinc chelator 1,10-o-phenanthroline inhibited DNA binding, and two distant zinc-binding regions were required for DNA binding. A protein blot with 65ZnCl2 showed that both regions are required for zinc-binding activity. The PLATZ1 protein non-specifically binds to A/T-rich sequences, including the upstream region of the pea GTPase pra2 and plastocyanin petE genes. Expression of the PLATZ1 repressed those of the reporter constructs containing the coding sequence of luciferase gene driven by the cauliflower mosaic virus (CaMV) 35S90 promoter fused to the tandem repeat of the A/T-rich sequences. These results indicate that PLATZ1 is a novel class of plant-specific zinc-dependent DNA-binding protein responsible for A/T-rich sequence-mediated transcriptional repression. PMID:11600698
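The two zinc-binding signatures quoted in this abstract can be turned into a simple pattern search. The following is a minimal sketch, assuming the usual reading of the notation ("x2" is any two residues, "x(4–5)" is any four to five residues); the function name `find_zinc_regions` and the toy sequence are hypothetical and are not taken from the paper.

```python
import re

# Signatures as written in the abstract, translated to regular expressions.
# "." stands for any residue; {m,n} gives the allowed spacer length.
REGION_1 = re.compile(r"C.{2}H.{11}C.{2}C.{4,5}C.{2}C.{3,7}H.{2}H")
REGION_2 = re.compile(r"C.{2}C.{10,11}C.{3}C")


def find_zinc_regions(seq):
    """Return the (start, end) span of the first match of each signature.

    A value of None means the signature was not found in the sequence.
    """
    seq = seq.upper()
    m1 = REGION_1.search(seq)
    m2 = REGION_2.search(seq)
    return {
        "region_1": m1.span() if m1 else None,
        "region_2": m2.span() if m2 else None,
    }


# Hypothetical toy sequence, built only so that both signatures occur once.
# It is NOT the PLATZ1 sequence.
toy_region_1 = "C" + "AA" + "H" + "A" * 11 + "C" + "AA" + "C" + "A" * 4 \
    + "C" + "AA" + "C" + "A" * 3 + "H" + "AA" + "H"
toy_region_2 = "C" + "AA" + "C" + "A" * 10 + "C" + "AAA" + "C"
toy = "MA" + toy_region_1 + "G" * 20 + toy_region_2 + "GG"

if __name__ == "__main__":
    print(find_zinc_regions(toy))
```

A match only shows that the cysteine/histidine spacing is present; it says nothing about whether zinc is actually bound, which the abstract addresses experimentally.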
Molecular basis of splotch and Waardenburg Pax-3 mutations.
Chalepakis, G; Goulding, M; Read, A; Strachan, T; Gruss, P
1994-01-01
Pax genes control certain aspects of development, as mutations result in (semi)dominant defects apparent during embryogenesis. Pax-3 has been associated with the mouse mutant splotch (Sp) and the human Waardenburg syndrome type 1 (WS1). We have examined the molecular basis of splotch and WS1 by studying the effect of mutations on DNA binding, using a defined target sequence. Pax-3 contains two different types of functional DNA-binding domains, a paired domain and a homeodomain. Mutational analysis of Pax-3 reveals different modes of DNA binding depending on the presence of these domains. A segment of Pax-3 located between the two DNA-binding domains, including a conserved octapeptide, participates in protein homodimerization. Pax-3 mutations found in splotch alleles and WS1 individuals change DNA binding and, in the case of a protein product of the Sp allele, dimerization. These findings were taken as a basis to define the molecular nature of the mutants. PMID:7909605
Expression regulation by a methyl-CpG binding domain in an E. coli based, cell-free TX-TL system
NASA Astrophysics Data System (ADS)
Schenkelberger, M.; Shanak, S.; Finkler, M.; Worst, E. G.; Noireaux, V.; Helms, V.; Ott, A.
2017-04-01
Cytosine methylation plays an important role in the epigenetic regulation of eukaryotic gene expression. The methyl-CpG binding domain (MBD) is common to a family of eukaryotic transcriptional regulators. How MBD, a stretch of about 80 amino acids, recognizes CpGs in a methylation dependent manner, and as a function of sequence, is only partly understood. Here we show, using an Escherichia coli cell-free expression system, that MBD from the human transcriptional regulator MeCP2 performs as a specific, methylation-dependent repressor in conjunction with the BDNF (brain-derived neurotrophic factor) promoter sequence. Mutation of either base flanking the central CpG pair changes the expression level of the target gene. However, the relative degree of repression as a function of MBD concentration remains unaltered. Molecular dynamics simulations that address the DNA B fiber ratio and the handedness reveal cooperative transitions in the promoter DNA upon MBD binding that correlate well with our experimental observations. We suggest that not only steric hindrance, but also conformational changes of the BDNF promoter as a result of MBD binding are required for MBD to act as a specific inhibitory element. Our work demonstrates that the prokaryotic transcription machinery can reproduce features of epigenetic mammalian transcriptional regulatory elements.
In vitro selection of zinc fingers with altered DNA-binding specificity.
Jamieson, A C; Kim, S H; Wells, J A
1994-05-17
We have used random mutagenesis and phage display to alter the DNA-binding specificity of Zif268, a transcription factor that contains three zinc finger domains. Four residues in the helix of finger 1 of Zif268 that potentially mediate DNA binding were identified from an X-ray structure of the Zif268-DNA complex. A library was constructed in which these residues were randomly mutated and the Zif268 variants were fused to a truncated version of the gene III coat protein on the surface of M13 filamentous phage particles. The phage displayed the mutant proteins in a monovalent fashion and were sorted by repeated binding and elution from affinity matrices containing different DNA sequences. When the matrix contained the natural nine base pair operator sequence 5'-GCG-TGG-GCG-3', native-like zinc fingers were isolated. New finger 1 variants were found by sorting with two different operators in which the singly modified triplets, GTG and TCG, replaced the native finger 1 triplet, GCG. Overall, the selected finger 1 variants contained a preponderance of polar residues at the four sites. Interestingly, the net charge of the four residues in any selected finger never deviated more than one unit from neutrality despite the fact that about half the variants contained three or four charged residues over the four sites. Measurements of the dissociation constants for two of these purified finger 1 variants by gel-shift assay showed their specificities to vary over a 10-fold range, with the greatest affinity being for the DNA binding site for which they were sorted. (ABSTRACT TRUNCATED AT 250 WORDS)
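The operator variants described in this abstract can be written out explicitly. This is a small illustrative sketch, not the authors' procedure: the helper names are hypothetical, and the position of the finger 1 triplet is an assumption (the abstract does not say which of the two GCG triplets finger 1 reads; the 3'-most triplet is assumed here).

```python
NATIVE_OPERATOR = "GCGTGGGCG"  # 5'-GCG-TGG-GCG-3', as given in the abstract


def triplets(site):
    """Split a binding site into consecutive 3-bp subsites."""
    if len(site) % 3 != 0:
        raise ValueError("site length must be a multiple of 3")
    return [site[i:i + 3] for i in range(0, len(site), 3)]


def swap_triplet(site, index, new_triplet):
    """Return the site with the triplet at `index` replaced."""
    if len(new_triplet) != 3:
        raise ValueError("replacement must be exactly 3 bases")
    parts = triplets(site)
    parts[index] = new_triplet
    return "".join(parts)


# ASSUMPTION: finger 1 contacts the 3'-most triplet (index 2).
FINGER1_INDEX = 2

# The two singly modified triplets named in the abstract.
variants = {
    t: swap_triplet(NATIVE_OPERATOR, FINGER1_INDEX, t) for t in ("GTG", "TCG")
}

if __name__ == "__main__":
    for triplet, operator in variants.items():
        print(triplet, "-".join(triplets(operator)))
```

If finger 1 were instead taken to read the 5'-most triplet, only `FINGER1_INDEX` would need to change.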
Molecular Cloning of Drebrin: Progress and Perspectives.
Kojima, Nobuhiko
2017-01-01
Chicken drebrin isoforms were first identified in the optic tectum of developing brain. Although the time course of protein expression was different in each drebrin isoform, the similarity between their protein structures was suggested by biochemical analysis of purified protein. To determine their protein structures, the cloning of drebrin cDNAs was conducted. Comparison between the cDNA sequences shows that all drebrin cDNAs are identical except that the internal insertion sequences are present or absent in their sequences. Chicken drebrins are now classified into three isoforms, namely, drebrins E1, E2, and A. Genomic cloning demonstrated that the three isoforms are generated by alternative splicing of individual exons encoding the insertion sequences from a single drebrin gene. The mechanism should be precisely regulated in a cell-type-specific and developmental stage-specific fashion. Drebrin protein is well conserved in various vertebrate species, although mammalian drebrin has only two isoforms, namely, drebrin E and drebrin A, unlike chicken drebrin, which has three. Drebrin belongs to an actin-depolymerizing factor homology (ADF-H) domain protein family. Besides the ADF-H domain, drebrin has other domains, including the actin-binding domain and Homer-binding motifs. Diversity of protein isoform and multiple domains of drebrin could interact differentially with the actin cytoskeleton and other intracellular proteins and regulate diverse cellular processes.
Roy, A; Roy Chattopadhyay, N
2013-07-01
Cancer involves various sets of altered gene functions which embrace all the three basic mechanisms of regulation of gene expression. However, no common mechanism has been inferred to date for this versatile disease and thus no foolproof remedy can be offered. Here we show that the basic mechanisms are interlinked and point towards one of those mechanisms, the methylation of cytosines in specific DNA sequences, as the dominant one for the initiation and maintenance of carcinogenesis. The analyses of the previous reports and the nucleotide sequences of the DNA methyltransferases strongly support the assumption that the mutation(s) in the DNA-binding site(s) of DNA-methyltransferases acts as a master regulator; though it continues the cycle from mutation to repair to methylation. We anticipate that our hypothesis will start a line of study for the proposal of a treatment regime for cancers by introducing wild type methyltransferases in the diseased cells and/or germ cells, and/or by targeting ligands to the altered binding domain(s) where a mutation in the concerned enzyme(s) is seen. Copyright © 2013. Published by Elsevier Ltd.
Sun, Han; Zeng, Jun; Cao, Zhendong; Li, Yan; Qian, Weiqiang
2015-01-01
Active DNA demethylation plays crucial roles in the regulation of gene expression in both plants and animals. In Arabidopsis thaliana, active DNA demethylation is initiated by the ROS1 subfamily of 5-methylcytosine-specific DNA glycosylases via a base excision repair mechanism. Recently, IDM1 and IDM2 were shown to be required for the recruitment of ROS1 to some of its target loci. However, the mechanism(s) by which IDM1 is targeted to specific genomic loci remains to be determined. Affinity purification of IDM1- and IDM2-associating proteins demonstrated that IDM1 and IDM2 copurify together with two novel components, methyl-CpG-binding domain protein 7 (MBD7) and IDM2-like protein 1 (IDL1). IDL1 encodes an α-crystallin domain protein that shows high sequence similarity with IDM2. MBD7 interacts with IDM2 and IDL1 in vitro and in vivo and they form a protein complex associating with IDM1 in vivo. MBD7 directly binds to the target loci and is required for the H3K18 and H3K23 acetylation in planta. MBD7 dysfunction causes DNA hypermethylation and silencing of reporter genes and a subset of endogenous genes. Our results suggest that a histone acetyltransferase complex functions in active DNA demethylation and in suppression of gene silencing at some loci in Arabidopsis. PMID:25933434
Zinc finger nuclease technology: advances and obstacles in modelling and treating genetic disorders.
Jabalameli, Hamid Reza; Zahednasab, Hamid; Karimi-Moghaddam, Amin; Jabalameli, Mohammad Reza
2015-03-01
Zinc finger nucleases (ZFNs) are engineered restriction enzymes designed to target specific DNA sequences within the genome. Fusing a zinc finger DNA-binding domain to a DNA-cleavage domain enables the enzyme to target a unique locus in the genome and invoke endogenous DNA repair mechanisms. This machinery offers a versatile approach to allele editing and gene therapy. Here we discuss the architecture of ZFNs and strategies for generating targeted modifications within the genome. We review advances in gene therapy and disease modelling using these enzymes and, finally, discuss the practical obstacles to using this technology. Copyright © 2014 Elsevier B.V. All rights reserved.
Liu, Ying; Matthews, Kathleen S.; Bondos, Sarah E.
2008-01-01
During animal development, distinct tissues, organs, and appendages are specified through differential gene transcription by Hox transcription factors. However, the conserved Hox homeodomains bind DNA with high affinity yet low specificity. We have therefore explored the structure of the Drosophila melanogaster Hox protein Ultrabithorax and the impact of its nonhomeodomain regions on DNA binding properties. Computational and experimental approaches identified several conserved, intrinsically disordered regions outside the homeodomain of Ultrabithorax that impact DNA binding by the homeodomain. Full-length Ultrabithorax bound to target DNA 2.5-fold weaker than its isolated homeodomain. Using N-terminal and C-terminal deletion mutants, we demonstrate that the YPWM region and the disordered microexons (termed the I1 region) inhibit DNA binding ∼2-fold, whereas the disordered I2 region inhibits homeodomain-DNA interaction a further ∼40-fold. Binding is restored almost to homeodomain affinity by the mostly disordered N-terminal 174 amino acids (R region) in a length-dependent manner. Both the I2 and R regions contain portions of the activation domain, functionally linking DNA binding and transcription regulation. Given that (i) the I1 region and a portion of the R region alter homeodomain-DNA binding as a function of pH and (ii) an internal deletion within I1 increases Ultrabithorax-DNA affinity, I1 must directly impact homeodomain-DNA interaction energetics. However, I2 appears to indirectly affect DNA binding in a manner countered by the N terminus. The amino acid sequences of I2 and much of the I1 and R regions vary significantly among Ultrabithorax orthologues, potentially diversifying Hox-DNA interactions. PMID:18508761
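The fold changes quoted in the Ultrabithorax abstract above can be tallied with a short calculation. This is only a sketch: it assumes the inhibitory effects combine multiplicatively, which the abstract does not state, and all variable names are hypothetical.

```python
# Approximate fold changes quoted in the abstract (all values are "~" figures).
I1_INHIBITION = 2.0        # YPWM region + I1 microexons weaken binding ~2-fold
I2_INHIBITION = 40.0       # I2 region weakens binding a further ~40-fold
FULL_LENGTH_DEFICIT = 2.5  # full-length protein binds 2.5-fold weaker than the homeodomain


def combined_fold(*folds):
    """Multiply fold changes together (assumes independent, multiplicative effects)."""
    product = 1.0
    for fold in folds:
        product *= fold
    return product


# Net inhibition expected if only I1 and I2 acted on the homeodomain.
inhibited = combined_fold(I1_INHIBITION, I2_INHIBITION)

# Rescue the R region would need to supply to reach the observed 2.5-fold deficit.
implied_r_rescue = inhibited / FULL_LENGTH_DEFICIT

print(f"I1 + I2 inhibition: ~{inhibited:.0f}-fold")
print(f"Implied rescue by R region: ~{implied_r_rescue:.0f}-fold")
```

Under that assumption, the inhibitory regions alone would weaken binding roughly 80-fold, so the N-terminal R region would have to recover roughly 32-fold to leave the reported 2.5-fold net difference.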
MorTAL Kombat: the story of defense against TAL effectors through loss-of-susceptibility
Hutin, Mathilde; Pérez-Quintero, Alvaro L.; Lopez, Camilo; Szurek, Boris
2015-01-01
Many plant-pathogenic xanthomonads rely on Transcription Activator-Like (TAL) effectors to colonize their host. This particular family of type III effectors functions as specific plant transcription factors via a programmable DNA-binding domain. Upon binding to the promoters of plant disease susceptibility genes in a sequence-specific manner, the expression of these host genes is induced. However, plants have evolved specific strategies to counter the action of TAL effectors and confer resistance. One mechanism is to avoid the binding of TAL effectors by mutations of their DNA binding sites, resulting in resistance by loss-of-susceptibility. This article reviews our current knowledge of the susceptibility hubs targeted by Xanthomonas TAL effectors, possible evolutionary scenarios for plants to combat the pathogen with loss-of-function alleles, and how this knowledge can be used overall to develop new pathogen-informed breeding strategies and improve crop resistance. PMID:26236326
McLaughlin, Paul J; Keegan, Liam P
2014-08-01
Nearly 150 different enzymatically modified forms of the four canonical residues in RNA have been identified. For instance, enzymes of the ADAR (adenosine deaminase acting on RNA) family convert adenosine residues into inosine in cellular dsRNAs. Recent findings show that DNA endonuclease V enzymes have undergone an evolutionary transition from cleaving 3' to deoxyinosine in DNA and ssDNA to cleaving 3' to inosine in dsRNA and ssRNA in humans. Recent work on dsRNA-binding domains of ADARs and other proteins also shows that a degree of sequence specificity is achieved by direct readout in the minor groove. However, the level of sequence specificity observed is much less than that of DNA major groove-binding helix-turn-helix proteins. We suggest that the evolution of DNA-binding proteins following the RNA to DNA genome transition represents the major advantage that DNA genomes have over RNA genomes. We propose that a hypothetical RNA modification, a RRAR (ribose reductase acting on genomic dsRNA) produced the first stretches of DNA in RNA genomes. We discuss why this is the most satisfactory explanation for the origin of DNA. The evolution of this RNA modification and later steps to DNA genomes are likely to have been driven by cellular genome co-evolution with viruses and intragenomic parasites. RNA modifications continue to be involved in host-virus conflicts; in vertebrates, edited cellular dsRNAs with inosine-uracil base pairs appear to be recognized as self RNA and to suppress activation of innate immune sensors that detect viral dsRNA.
Luo, Si-Wei; Liang, Zhi; Wu, Jia-Rui
2017-01-01
Quantitatively detecting correlations of multiple protein-protein interactions (PPIs) in vivo is a big challenge. Here we introduce a novel method, termed Protein-interactome Footprinting (PiF), to simultaneously measure multiple PPIs in one cell. The principle of PiF is that each target physical PPI in the interactome is simultaneously transcoded into a specific DNA sequence based on dimerization of the target proteins fused with DNA-binding domains. The interaction intensity of each target protein is quantified as the copy number of the specific DNA sequences bound by each fusion protein dimer. Using PiF, we quantitatively reveal dynamic patterns of PPIs and their correlation network in E. coli two-component systems. PMID:28338015
Sequence analysis of DBL2β domain of var gene of Indonesian Plasmodium falciparum
NASA Astrophysics Data System (ADS)
Sulistyaningsih, E.; Romadhon, B. D.; Palupi, I.; Hidayah, F.; Dewi, R.; Prasetyo, A.
2018-03-01
Malaria is a major health problem in tropical countries including Indonesia. The most deadly agent is Plasmodium falciparum. In P. falciparum infection, PfEMP1 is thought to play an important role in the pathogenesis of malaria. PfEMP1 is encoded by the var gene family; it is a polymorphic protein whose extracellular portion contains three distinct binding domains: Duffy binding-like (DBL), cysteine-rich interdomain region (CIDR) and C2. PfEMP1 varies in domain composition and binding specificity. This study explored the characteristics of Indonesian DBL2β var genes and investigated their role in malaria outcome. Twenty blood samples from patients with clinically mild to severe malaria in Jember, East Java were collected for DNA extraction. Diagnosis was confirmed by Giemsa-stained thick blood smear. PCR with specific primers targeting the full-length DBL2β yielded a single band of approximately 1.7 kb in one sample. This band was observed only in a severe malaria sample. Sequence analysis performed directly on the PCR product showed 74-99% similarity with previous sequences in GenBank. In conclusion, the DBL2β domain of the var gene of Indonesian isolates was 1603 nucleotides in length, and there was a possible association between the presence of the DBL2β domain and the severity of malaria outcome.
Radiation-induced oxidative damage to the DNA-binding domain of the lactose repressor
Gillard, Nathalie; Goffinont, Stephane; Buré, Corinne; Davidkova, Marie; Maurizot, Jean-Claude; Cadene, Martine; Spotheim-Maurizot, Melanie
2007-01-01
Understanding the cellular effects of radiation-induced oxidation requires the unravelling of key molecular events, particularly damage to proteins with important cellular functions. The Escherichia coli lactose operon is a classical model of gene regulation systems. Its functional mechanism involves the specific binding of a protein, the repressor, to a specific DNA sequence, the operator. We have shown previously that upon irradiation with γ-rays in solution, the repressor loses its ability to bind the operator. Water radiolysis generates hydroxyl radicals (OH· radicals) which attack the protein. Damage to the repressor DNA-binding domain, called the headpiece, is most likely to be responsible for this loss of function. Using CD, fluorescence spectroscopy and a combination of proteolytic cleavage with MS, we have examined the state of the irradiated headpiece. CD measurements revealed a dose-dependent conformational change involving metastable intermediate states. Fluorescence measurements showed a gradual degradation of tyrosine residues. MS was used to count the number of oxidations in different regions of the headpiece and to narrow down the parts of the sequence bearing oxidized residues. By calculating the relative probabilities of reaction of each amino acid with OH· radicals, we can predict the most probable oxidation targets. By comparing the experimental results with the predictions we conclude that Tyr7, Tyr12, Tyr17, Met42 and Tyr47 are the most likely hotspots of oxidation. The loss of repressor function is thus correlated with chemical modifications and conformational changes of the headpiece. PMID:17263689
Jun, S; Wallen, R V; Goriely, A; Kalionis, B; Desplan, C
1998-11-10
Pax proteins, characterized by the presence of a paired domain, play key regulatory roles during development. The paired domain is a bipartite DNA-binding domain that contains two helix-turn-helix domains joined by a linker region. Each of the subdomains, the PAI and RED domains, has been shown to be a distinct DNA-binding domain. The PAI domain is the most critical, but in specific circumstances, the RED domain is involved in DNA recognition. We describe a Pax protein, originally called Lune, that is the product of the Drosophila eye gone gene (eyg). It is unique among Pax proteins, because it contains only the RED domain. eyg seems to play a role both in the organogenesis of the salivary gland during embryogenesis and in the development of the eye. A high-affinity binding site for the Eyg RED domain was identified by using systematic evolution of ligands by exponential enrichment techniques. This binding site is related to a binding site previously identified for the RED domain of the Pax-6 5a isoform. Eyg also contains another DNA-binding domain, a Prd-class homeodomain (HD), whose palindromic binding site is similar to other Prd-class HDs. The ability of Pax proteins to use the PAI, RED, and HD, or combinations thereof, may be one mechanism that allows them to be used at different stages of development to regulate various developmental processes through the activation of specific target genes.
Jun, Susie; Wallen, Robert V.; Goriely, Anne; Kalionis, Bill; Desplan, Claude
1998-01-01
Pax proteins, characterized by the presence of a paired domain, play key regulatory roles during development. The paired domain is a bipartite DNA-binding domain that contains two helix–turn–helix domains joined by a linker region. Each of the subdomains, the PAI and RED domains, has been shown to be a distinct DNA-binding domain. The PAI domain is the most critical, but in specific circumstances, the RED domain is involved in DNA recognition. We describe a Pax protein, originally called Lune, that is the product of the Drosophila eye gone gene (eyg). It is unique among Pax proteins, because it contains only the RED domain. eyg seems to play a role both in the organogenesis of the salivary gland during embryogenesis and in the development of the eye. A high-affinity binding site for the Eyg RED domain was identified by using systematic evolution of ligands by exponential enrichment techniques. This binding site is related to a binding site previously identified for the RED domain of the Pax-6 5a isoform. Eyg also contains another DNA-binding domain, a Prd-class homeodomain (HD), whose palindromic binding site is similar to other Prd-class HDs. The ability of Pax proteins to use the PAI, RED, and HD, or combinations thereof, may be one mechanism that allows them to be used at different stages of development to regulate various developmental processes through the activation of specific target genes. PMID:9811867
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fradkin, L.G.; Yoshinaga, S.K.; Berk, A.J.
1987-11-01
The inhibition of transcription by RNA polymerase III in poliovirus-infected cells was studied. Experiments utilizing two different cell lines showed that the initiation step of transcription by RNA polymerase III was impaired by infection of these cells with the virus. The observed inhibition of transcription was not due to shut-off of host cell protein synthesis by poliovirus. Among four distinct components required for accurate transcription in vitro from cloned DNA templates, activities of RNA polymerase III and transcription factor TFIIIA were not significantly affected by virus infection. The activity of transcription factor TFIIIC, the limiting component required for transcription of RNA polymerase III genes, was severely inhibited in infected cells, whereas that of transcription factor TFIIIB was inhibited to a lesser extent. The sequence-specific DNA binding of TFIIIC to the adenovirus VA1 gene internal promoter, however, was not altered by infection of cells with the virus. The authors conclude that (i) at least two transcription factors, TFIIIB and TFIIIC, are inhibited by infection of cells with poliovirus, (ii) inactivation of TFIIIC does not involve destruction of its DNA-binding domain, and (iii) sequence-specific DNA binding by TFIIIC may be necessary but is not sufficient for the formation of productive transcription complexes.
DNA binding of the p21 repressor ZBTB2 is inhibited by cytosine hydroxymethylation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lafaye, Céline; Barbier, Ewa; Miscioscia, Audrey
2014-03-28
Highlights: • 5-hmC epigenetic modification is measurable in HeLa, SH-SY5Y and UT7-MPL cell lines. • ZBTB2 binds to DNA probes containing 5-mC but not to sequences containing 5-hmC. • This differential binding is verified with DNA sequences involved in p21 regulation. - Abstract: Recent studies have demonstrated that the modified base 5-hydroxymethylcytosine (5-hmC) is detectable at various rates in DNA extracted from human tissues. This oxidative product of 5-methylcytosine (5-mC) constitutes a new and important actor of epigenetic mechanisms. We designed a DNA pull down assay to trap and identify nuclear proteins bound to 5-hmC and/or 5-mC. We applied this strategy to three cancerous cell lines (HeLa, SH-SY5Y and UT7-MPL) in which we also measured 5-mC and 5-hmC levels by HPLC-MS/MS. We found that the putative oncoprotein Zinc finger and BTB domain-containing protein 2 (ZBTB2) is associated with methylated DNA sequences and that this interaction is inhibited by the presence of 5-hmC replacing 5-mC. As published data mention ZBTB2 recognition of p21 regulating sequences, we verified that this sequence specific binding was also alleviated by 5-hmC. ZBTB2 being considered as a multifunctional cell proliferation activator, notably through p21 repression, this work points out new epigenetic processes potentially involved in carcinogenesis.
Extended HSR/CARD domain mediates AIRE binding to DNA
DOE Office of Scientific and Technical Information (OSTI.GOV)
Maslovskaja, Julia, E-mail: julia.maslovskaja@ut.ee; Saare, Mario; Liiv, Ingrid
Autoimmune regulator (AIRE) activates the transcription of many genes in an unusual promiscuous and stochastic manner. The mechanism by which AIRE binds to the chromatin and DNA is not fully understood, and the regulatory elements that AIRE target genes possess are not delineated. In the current study, we demonstrate that AIRE activates the expression of transiently transfected luciferase reporters that lack defined promoter regions, as well as intron and poly(A) signal sequences. Our protein-DNA interaction experiments with mutated AIRE reveal that the intact homogeneously staining region/caspase recruitment domain (HSR/CARD) and amino acids R113 and K114 are key elements involved in AIRE binding to DNA. - Highlights: • Promoter and mRNA processing elements are not important for AIRE to activate gene expression from reporter plasmids. • AIRE protein fragment aa 1–138 mediates direct binding to DNA. • Integrity of the HSR/CARD domain is needed for AIRE binding to DNA.
Heterogeneous RNA-binding protein M4 is a receptor for carcinoembryonic antigen in Kupffer cells.
Bajenova, O V; Zimmer, R; Stolper, E; Salisbury-Rowswell, J; Nanji, A; Thomas, P
2001-08-17
Here we report the isolation of the recombinant cDNA clone from rat macrophages, Kupffer cells (KC), that encodes a protein interacting with carcinoembryonic antigen (CEA). To isolate and identify the CEA receptor gene we used two approaches: screening of a KC cDNA library with a specific antibody and the yeast two-hybrid system for protein interaction using as a bait the N-terminal part of the CEA encoding the binding site. Both techniques resulted in the identification of the rat heterogeneous RNA-binding protein (hnRNP) M4 gene. The rat ortholog cDNA sequence has not been previously described. The open reading frame for this gene contains a 2351-base pair sequence with the polyadenylation signal AATAAA and a termination poly(A) tail. The mRNA shows ubiquitous tissue expression as a 2.4-kilobase transcript. The deduced amino acid sequence comprised a 78-kDa membrane protein with 3 putative RNA-binding domains, an arginine/methionine/glutamine-rich C terminus and 3 potential membrane spanning regions. When hnRNP M4 protein is expressed in the pGEX4T-3 vector system in Escherichia coli it binds ¹²⁵I-labeled CEA in a Ca²⁺-dependent fashion. Transfection of rat hnRNP M4 cDNA into a non-CEA binding mouse macrophage cell line p388D1 resulted in CEA binding. These data provide evidence for a new function of hnRNP M4 protein as a CEA-binding protein in Kupffer cells.
Duan, Ming-Rui; Nan, Jie; Liang, Yu-He; Mao, Peng; Lu, Lu; Li, Lanfen; Wei, Chunhong; Lai, Luhua; Li, Yi; Su, Xiao-Dong
2007-01-01
WRKY proteins, defined by the conserved WRKYGQK sequence, are comprised of a large superfamily of transcription factors identified specifically from the plant kingdom. This superfamily plays important roles in plant disease resistance, abiotic stress, senescence as well as in some developmental processes. In this study, the Arabidopsis WRKY1 was shown to be involved in the salicylic acid signaling pathway and partially dependent on NPR1; a C-terminal domain of WRKY1, AtWRKY1-C, was constructed for structural studies. Previous investigations showed that DNA binding of the WRKY proteins was localized at the WRKY domains and these domains may define novel zinc-binding motifs. The crystal structure of the AtWRKY1-C determined at 1.6 Å resolution has revealed that this domain is composed of a globular structure with five β strands, forming an antiparallel β-sheet. A novel zinc-binding site is situated at one end of the β-sheet, between strands β4 and β5. Based on this high-resolution crystal structure and site-directed mutagenesis, we have defined and confirmed that the DNA-binding residues of AtWRKY1-C are located at β2 and β3 strands. These results provided us with structural information to understand the mechanism of transcriptional control and signal transduction events of the WRKY proteins. PMID:17264121
Generation of TALE-Based Designer Epigenome Modifiers.
Nitsch, Sandra; Mussolino, Claudio
2018-01-01
Manipulation of gene expression can be facilitated by editing the genome or the epigenome. Precise genome editing is traditionally achieved by using designer nucleases which are generally exploited to eliminate a specific gene product. Upon the introduction of a site-specific DNA double-strand break (DSB) by the nuclease, endogenous DSB repair mechanisms are in turn harnessed to induce DNA sequence changes that can result in target gene inactivation. Minimal off-target effects can be obtained by endowing designer nucleases with the highly specific DNA-binding domain (DBD) derived from transcription activator-like effectors (TALEs). In contrast, epigenome editing allows gene expression control without inducing changes in the DNA sequence by specifically altering epigenetic marks, as histone tails modifications or DNA methylation patterns within promoter or enhancer regions. Importantly, this approach allows both up- and downregulation of the target gene expression, and the effect is generally reversible. TALE-based designer epigenome modifiers combine the high specificity of TALE-derived DBDs with the power of epigenetic modifier domains to induce fast and long-lasting changes in the epigenetic landscape of a target gene and control its expression. Here we provide a detailed description for the generation of TALE-based designer epigenome modifiers and of a suitable reporter cell line to easily monitor their activity.
Pandey, Bharati; Grover, Abhinav; Sharma, Pradeep
2018-02-12
The WRKY transcription factors are a class of DNA-binding proteins that are involved in diverse plant processes and play critical roles in response to abiotic and biotic stresses. Genome-wide divergence analysis of the WRKY gene family in Hordeum vulgare provided a framework for molecular evolution and functional roles. So far, the crystal structure of WRKY from barley has not been resolved; moreover, knowledge of the three-dimensional structure of the WRKY domain is a prerequisite for exploring the protein-DNA recognition mechanisms. A homology modelling based approach was used to generate structures for the WRKY DNA binding domain (DBD) and its variants using AtWRKY1 as a template. Finally, the stability and conformational changes of the generated model in unbound and bound form were examined through atomistic molecular dynamics (MD) simulations for a 100 ns time period. In this study, we investigated the comparative binding pattern of the WRKY domain and its variants with the W-box cis-regulatory element using molecular docking and molecular dynamics (MD) simulations. The atomic insight into the WRKY domain exhibited significant variation in the intermolecular hydrogen bonding pattern, leading to structural anomalies in the variant type and differences in the DNA-binding specificities. Based on the MD analysis, residual contribution and interaction contour, wild-type WRKY (HvWRKY46) was found to interact with DNA through the highly conserved heptapeptide in the pre- and post-MD simulated complexes, whereas heptapeptide interaction with DNA was missing in variants (I and II) in post-MD complexes. Consequently, through principal component analysis, wild-type WRKY was also found to be more stable, occupying a reduced conformational space compared with variant I (HvWRKY34). Lastly, high binding free energy for wild-type and variant II allowed us to conclude that the wild-type WRKY-DNA complex was more stable relative to variant I.
The results of our study revealed complete dynamic and structural information about WRKY domain-DNA interactions. However, no structure-based information has been reported to date for WRKY variants and their mechanism of interaction with DNA. Our findings highlight the importance of selecting a sequence to generate newer transgenic plants that would be increasingly tolerant to stress conditions.
Two new insulator proteins, Pita and ZIPIC, target CP190 to chromatin
Maksimenko, Oksana; Bartkuhn, Marek; Stakhov, Viacheslav; Herold, Martin; Zolotarev, Nickolay; Jox, Theresa; Buxa, Melanie K.; Kirsch, Ramona; Bonchuk, Artem; Fedotova, Anna; Kyrchanova, Olga
2015-01-01
Insulators are multiprotein–DNA complexes that regulate the nuclear architecture. The Drosophila CP190 protein is a cofactor for the DNA-binding insulator proteins Su(Hw), CTCF, and BEAF-32. The fact that CP190 has been found at genomic sites devoid of either of the known insulator factors has until now been unexplained. We have identified two DNA-binding zinc-finger proteins, Pita, and a new factor named ZIPIC, that interact with CP190 in vivo and in vitro at specific interaction domains. Genomic binding sites for these proteins are clustered with CP190 as well as with CTCF and BEAF-32. Model binding sites for Pita or ZIPIC demonstrate a partial enhancer-blocking activity and protect gene expression from PRE-mediated silencing. The function of the CTCF-bound MCP insulator sequence requires binding of Pita. These results identify two new insulator proteins and emphasize the unifying function of CP190, which can be recruited by many DNA-binding insulator proteins. PMID:25342723
Generalized theory on the mechanism of site-specific DNA-protein interactions
NASA Astrophysics Data System (ADS)
Niranjani, G.; Murugan, R.
2016-05-01
We develop a generalized theoretical framework on the binding of transcription factor proteins (TFs) with specific sites on DNA that takes into account the interplay of various factors regarding overall electrostatic potential at the DNA-protein interface, occurrence of kinetic traps along the DNA sequence, presence of other roadblock protein molecules along DNA and crowded environment, conformational fluctuations in the DNA binding domains (DBDs) of TFs, and the conformational state of the DNA. Starting from a Smoluchowski type theoretical framework on site-specific binding of TFs we logically build our model by adding the effects of these factors one by one. Our generalized two-step model suggests that the electrostatic attractive forces present between the positively charged DBDs of TFs and the negatively charged phosphate backbone of DNA, along with the counteracting shielding effects of solvent ions, is the core factor that creates a fluidic type environment at the DNA-protein interface. This in turn facilitates various one-dimensional diffusion (1Dd) processes such as sliding, hopping and intersegmental transfers. These facilitating processes as well as flipping dynamics of conformational states of DBDs of TFs between stationary and mobile states can enhance the 1Dd coefficient on a par with three-dimensional diffusion (3Dd). The random coil conformation of DNA also plays critical roles in enhancing the site-specific association rate. The extent of enhancement over the 3Dd controlled rate seems to be directly proportional to the maximum possible 1Dd length. We show that the overall site-specific binding rate scales with the length of DNA in an asymptotic way. For relaxed DNA, the specific binding rate will be independent of the length of DNA as length increases towards infinity. For condensed DNA as in in vivo conditions, the specific binding rate depends on the length of DNA in a turnover way with a maximum.
This maximum rate seems to scale with the maximum possible 1Dd length of TFs in a square root manner. Results suggest that 1Dd processes contribute much less to the enhancement of specific binding rate under in vivo conditions for condensed DNA. There exists a critical length of binding stretch of TFs beyond which the probability associated with the random occurrence of similar specific binding sites will be close to zero. TFs in natural systems from prokaryotes to eukaryotes seem to handle sequence-mediated kinetic traps via increasing the length of their recognition stretch or combinatorial binding. TFs overcome the hurdles of roadblocks via switching efficiently between sliding, hopping and intersegmental transfer modes. The site-specific binding rate as well as the maximum possible 1Dd length seem to be directly proportional to the square root of the probability (p_R) of finding a nonspecific binding site to be free from dynamic roadblocks. Here p_R seems to be a function of the number of nonspecific binding sites (nsbs) available per DNA binding protein (ϕ) inside the living cell. It seems that p_R > 0.8 when ϕ > 10, which is true for the Escherichia coli cell system.
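The scaling statements in this abstract can be collected in compact notation. The symbols $k_s$ (site-specific binding rate), $k_{\max}$ (its maximum for condensed DNA) and $L_{1D}$ (maximum possible 1Dd length) are assumed here for illustration only; the abstract itself names just the roadblock-free probability $p_R$ and the ratio $\phi$, and gives no proportionality constants.

```latex
\begin{aligned}
  k_{\max} &\propto \sqrt{L_{1D}}
    && \text{(maximum rate for condensed DNA)} \\
  k_s &\propto \sqrt{p_R}, \qquad L_{1D} \propto \sqrt{p_R}
    && \text{(effect of dynamic roadblocks)} \\
  p_R &= p_R(\phi), \qquad p_R > 0.8 \ \text{when} \ \phi > 10
    && \text{(reported to hold for \textit{E. coli})}
\end{aligned}
```

Read this way, roadblocks enter the rate only through the square root of $p_R$, so the reported bound $p_R > 0.8$ would correspond to a factor above $\sqrt{0.8} \approx 0.89$ in both the rate and the 1Dd length.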
Small molecule and peptide-mediated inhibition of Epstein-Barr virus nuclear antigen 1 dimerization
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Sun Young; Song, Kyung-A; Samsung Biomedical Research Institute
Highlights: • Evidence that targeting EBNA1 dimer, an EBV onco-antigen, can be achievable. • A small molecule and a peptide as EBNA1 dimerization inhibitors identified. • Both inhibitors associated with EBNA1 and blocked EBNA1 DNA binding activity. • Also, prevented its dimerization, and repressed viral gene transcription. -- Abstract: Latent Epstein-Barr virus (EBV) infection is associated with human B cell lymphomas and certain carcinomas. EBV episome persistence, replication, and gene expression are dependent on EBV-encoded nuclear antigen 1 (EBNA1)'s DNA binding domain (DBD)/dimerization domain (DD)-mediated sequence-specific DNA binding activity. Homodimerization of EBNA1 is essential for EBNA1 DNA binding and transactivation. In this study, we characterized a novel small molecule EBNA1 inhibitor EiK1, screened from the previous high throughput screening (HTS). The EiK1 compound specifically inhibited the EBNA1-dependent, OriP-enhanced transcription, but not EBNA1-independent transcription. A Surface Plasmon Resonance Biacore assay revealed that EiK1 associates with EBNA1 amino acid 459-607 DBD/DD. Consistent with the SPR data, in vitro gel shift assays showed that EiK1 suppressed the activity of EBNA1 binding to the cognate familial repeats (FR) sequence, but not control RBP-Jκ binding to the Jκ site. Subsequently, a cross-linker-mediated in vitro multimerization assay and EBNA1 homodimerization-dependent yeast two-hybrid assay showed that EiK1 significantly inhibited EBNA1 dimerization. In an attempt to identify more highly specific peptide inhibitors, small peptides encompassing the EBNA1 DBD/DD were screened for inhibition of EBNA1 DBD-mediated DNA binding function. The small peptide P85, covering EBNA1 a.a. 560-574, significantly blocked EBNA1 DNA binding activity in vitro, prevented dimerization in vitro and in vivo, associated with EBNA1 in vitro, and repressed EBNA1-dependent transcription in vivo. Collectively, this study describes two novel inhibitors of EBNA1 dimerization. This study demonstrates that EBNA1 homodimerization can be effectively targeted by a small molecule or peptide.
Discrimination against RNA Backbones by a ssDNA Binding Protein.
Lloyd, Neil R; Wuttke, Deborah S
2018-05-01
Pot1 is the shelterin component responsible for the protection of the single-stranded DNA (ssDNA) overhang at telomeres in nearly all eukaryotic organisms. The C-terminal domain of the DNA-binding domain, Pot1pC, exhibits non-specific ssDNA recognition, achieved through thermodynamically equivalent alternative binding conformations. Given this flexibility, it is unclear how specificity for ssDNA over RNA, an activity required for biological function, is achieved. Examination of the ribose-position specificity of Pot1pC shows that ssDNA specificity is additive but not uniformly distributed across the ligand. High-resolution structures of several Pot1pC complexes with RNA-DNA chimeric ligands reveal Pot1pC discriminates against RNA by utilizing non-compensatory binding modes that feature significant rearrangement of the binding interface. These alternative conformations, accessed through both ligand and protein flexibility, recover much, but not all, of the binding energy, leading to the observed reduction in affinities. These findings suggest that intermolecular interfaces are remarkably sophisticated in their tuning of specificity toward flexible ligands. Copyright © 2018 Elsevier Ltd. All rights reserved.
TALE: a tale of genome editing.
Zhang, Mingjie; Wang, Feng; Li, Shifei; Wang, Yan; Bai, Yun; Xu, Xueqing
2014-01-01
Transcription activator-like effectors (TALEs), first identified in Xanthomonas bacteria, are naturally occurring or artificially designed proteins that modulate gene transcription. These proteins recognize and bind DNA sequences based on a variable number of tandem repeats. Each repeat is comprised of a set of ∼34 conserved amino acids; within this conserved domain, there are usually two amino acids that distinguish one TALE from another. Interestingly, TALEs have revealed a simple cipher for the one-to-one recognition of DNA bases by proteins. Synthetic TALEs have been used to successfully target genes in a variety of species, including humans. Depending on the type of functional domain that is fused to the TALE of interest, these proteins can have diverse biological effects. For example, after binding DNA, TALEs fused to transcriptional activation domains can function as robust transcription factors (TALE-TFs), while those fused to restriction endonucleases (TALENs) can cut DNA. Targeted genome editing, in theory, is capable of modifying any endogenous gene sequence of interest; this can be performed in cells or organisms, and may be applied to clinical gene-based therapies in the future. With current technologies, highly accurate, specific, and reliable gene editing cannot be achieved. Thus, recognition and binding mechanisms governing TALE biology are currently hot research areas. In this review, we summarize the major advances in TALE technology over the past several years with a focus on the interaction between TALEs and DNA, TALE design and construction, potential applications for this technology, and unique characteristics that make TALEs superior to zinc finger endonucleases. Copyright © 2013 Elsevier Ltd. All rights reserved.
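The "simple cipher" mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not list the cipher itself; the mapping below uses the commonly cited repeat-variable diresidue (RVD) assignments (NI, HD, NG, NN), and the function names are hypothetical. It ignores known degeneracy (for example, NN is also reported to tolerate A) and any context effects.

```python
# Commonly cited RVD-to-base assignments (an assumption; not stated in the abstract).
RVD_TO_BASE = {
    "NI": "A",
    "HD": "C",
    "NG": "T",
    "NN": "G",  # simplified: NN is also reported to tolerate A
}


def predicted_target(rvds):
    """Return the DNA sequence that a TALE repeat array is predicted to bind."""
    return "".join(RVD_TO_BASE[rvd] for rvd in rvds)


def design_rvds(target):
    """Inverse lookup: pick one RVD per base of the desired target sequence."""
    base_to_rvd = {base: rvd for rvd, base in RVD_TO_BASE.items()}
    return [base_to_rvd[base] for base in target.upper()]


example = ["NI", "HD", "NG", "NN", "NG"]
print(predicted_target(example))  # ACTGT
print(design_rvds("ACTGT"))
```

A one-to-one lookup like this is only a first approximation of binding specificity.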
Molecular Control of Polyene Macrolide Biosynthesis
Santos-Aberturas, Javier; Vicente, Cláudia M.; Guerra, Susana M.; Payero, Tamara D.; Martín, Juan F.; Aparicio, Jesús F.
2011-01-01
Control of polyene macrolide production in Streptomyces natalensis is mediated by the transcriptional activator PimM. This regulator, which combines an N-terminal PAS domain with a C-terminal helix-turn-helix motif, is highly conserved among polyene biosynthetic gene clusters. PimM, truncated forms of the protein without the PAS domain (PimMΔPAS), and forms containing just the DNA-binding domain (DBD) (PimMDBD) were overexpressed in Escherichia coli as GST-fused proteins. GST-PimM binds directly to eight promoters of the pimaricin cluster, as demonstrated by electrophoretic mobility shift assays. Assays with truncated forms of the protein revealed that the PAS domain does not mediate specificity or the distinct recognition of target genes, which rely on the DBD domain, but significantly reduces binding affinity up to 500-fold. Transcription start points were identified by 5′-rapid amplification of cDNA ends, and the binding regions of PimMDBD were investigated by DNase I protection studies. In all cases, binding took place covering the −35 hexamer box of each promoter, suggesting an interaction of PimM and RNA polymerase to cause transcription activation. Information content analysis of the 16 sequences protected in target promoters was used to deduce the structure of the PimM-binding site. This site displays dyad symmetry, spans 14 nucleotides, and adjusts to the consensus TVGGGAWWTCCCBA. Experimental validation of this binding site was performed by using synthetic DNA duplexes. Binding of PimM to the promoter region of one of the polyketide synthase genes from the Streptomyces nodosus amphotericin cluster containing the consensus binding site was also observed, thus proving the applicability of the findings reported here to other antifungal polyketides. PMID:21187288
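The consensus TVGGGAWWTCCCBA is written in IUPAC nucleotide codes (V = A/C/G, W = A/T, B = C/G/T). As a rough illustration, the sketch below expands such a consensus into a regular expression and checks the dyad symmetry described in the abstract by testing whether the consensus equals its own reverse complement. The helper names and the example site are hypothetical and are not taken from the paper.

```python
import re

# Standard IUPAC nucleotide codes and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "W": "AT", "S": "CG", "R": "AG", "Y": "CT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def iupac_to_regex(consensus):
    """Expand an IUPAC consensus into a regex over A/C/G/T."""
    parts = []
    for code in consensus:
        bases = IUPAC[code]
        parts.append(bases if len(bases) == 1 else "[" + bases + "]")
    return "".join(parts)


def reverse_complement_iupac(consensus):
    """Reverse-complement a consensus, mapping each code to its complement code."""
    code_for = {frozenset(bases): code for code, bases in IUPAC.items()}
    out = []
    for code in reversed(consensus):
        complemented = frozenset(COMPLEMENT[b] for b in IUPAC[code])
        out.append(code_for[complemented])
    return "".join(out)


CONSENSUS = "TVGGGAWWTCCCBA"
pattern = re.compile(iupac_to_regex(CONSENSUS))

print(pattern.pattern)                                  # T[ACG]GGGA[AT][AT]TCCC[CGT]A
print(reverse_complement_iupac(CONSENSUS) == CONSENSUS)  # True
print(bool(pattern.fullmatch("TAGGGAATTCCCTA")))        # True (hypothetical site)
```

Because the consensus is its own reverse complement, a scan of one strand with this pattern also covers the opposite strand.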
Gustafsson, Jan-Ake
2005-06-01
Our interest in nuclear receptors (NRs) originated from early studies on hepatic steroid metabolism. We discovered a new hypothalamo-pituitary-liver axis, imprinted neonatally by androgens and operating through sexually differentiated GH secretory patterns. Male and female patterns have opposite effects on sexually differentiated hepatic genes, explaining sexually dimorphic liver patterns. To further understand steroid action, we purified the glucocorticoid receptor (GR) leading to our discovery of the NR three-domain structure, with separable DNA binding domain and ligand binding domains and a third domain now known to have transcriptional regulatory properties. Knowledge of this domain structure has been immensely important for deciphering NR actions. Using this first purified NR, we collaborated with Keith Yamamoto and first demonstrated specific NR binding to DNA. This also was the first demonstration of a mammalian transcription factor, a breakthrough that led to discovery of NR response elements. In further collaboration with Yamamoto, we cloned the first NR cDNA sequences, leading to cloning of the superfamily of NR genes. With Yamamoto and Kaptein, we determined the first three-dimensional NR structure, that of DNA binding domain. Later work on orphan receptors resulted in the first discovery of: 1) endogenous ligands for an orphan receptor (fatty acids as activators of peroxisomal proliferator-activated receptor alpha); 2) liver X receptor beta (OR-1) and its role in central nervous system cholesterol homeostasis; and 3) estrogen receptor beta, leading to a paradigm shift in understanding of estrogen signaling, of importance in endocrinology, immunology, and oncology and to development of estrogen receptor beta agonists for treatment of autoimmune diseases, prostate disease, depression, and ovulatory dysfunction.
Evers, R; Grummt, I
1995-01-01
Both the DNA elements and the nuclear factors that direct termination of ribosomal gene transcription exhibit species-specific differences. Even between mammals--e.g., human and mouse--the termination signals are not identical and the respective transcription termination factors (TTFs) which bind to the terminator sequence are not fully interchangeable. To elucidate the molecular basis for this species-specificity, we have cloned TTF-I from human and mouse cells and compared their structural and functional properties. Recombinant TTF-I exhibits species-specific DNA binding and terminates transcription both in cell-free transcription assays and in transfection experiments. Chimeric constructs of mouse TTF-I and human TTF-I reveal that the major determinant for species-specific DNA binding resides within the C terminus of TTF-I. Replacing 31 C-terminal amino acids of mouse TTF-I with the homologous human sequences relaxes the DNA-binding specificity and, as a consequence, allows the chimeric factor to bind the human terminator sequence and to specifically stop rDNA transcription. PMID:7597036
Structural and Thermodynamic Signatures of DNA Recognition by Mycobacterium tuberculosis DnaA
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tsodikov, Oleg V.; Biswas, Tapan
An essential protein, DnaA, binds to 9-bp DNA sites within the origin of replication oriC. These binding events are prerequisite to forming an enigmatic nucleoprotein scaffold that initiates replication. The number, sequences, positions, and orientations of these short DNA sites, or DnaA boxes, within the oriCs of different bacteria vary considerably. To investigate features of DnaA boxes that are important for binding Mycobacterium tuberculosis DnaA (MtDnaA), we have determined the crystal structures of the DNA binding domain (DBD) of MtDnaA bound to a cognate MtDnaA-box (at 2.0 Å resolution) and to a consensus Escherichia coli DnaA-box (at 2.3 Å). These structures, complemented by calorimetric equilibrium binding studies of MtDnaA DBD in a series of DnaA-box variants, reveal the main determinants of DNA recognition and establish the [T/C][T/A][G/A]TCCACA sequence as a high-affinity MtDnaA-box. Bioinformatic and calorimetric analyses indicate that DnaA-box sequences in mycobacterial oriCs generally differ from the optimal binding sequence. This sequence variation occurs commonly at the first 2 bp, making an in vivo mycobacterial DnaA-box effectively a 7-mer and not a 9-mer. We demonstrate that the decrease in the affinity of these MtDnaA-box variants for MtDnaA DBD relative to that of the highest-affinity box TTGTCCACA is less than 10-fold. The understanding of DnaA-box recognition by MtDnaA and E. coli DnaA enables one to map DnaA-box sequences in the genomes of M. tuberculosis and other eubacteria.
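The mapping of DnaA boxes mentioned at the end of this abstract can be sketched as a simple two-strand motif scan. The code below is an illustration only, not the authors' procedure: it assumes the bracketed consensus [T/C][T/A][G/A]TCCACA can be treated as a regular expression, takes the last seven positions as the relaxed 7-mer, and uses hypothetical function names and a made-up test sequence.

```python
import re

# 9-mer high-affinity box from the abstract; lookahead allows overlapping hits.
FULL_BOX = re.compile(r"(?=([TC][TA][GA]TCCACA))")
# Relaxed 7-mer: the same box with the first two variable positions dropped (assumption).
CORE_7MER = re.compile(r"(?=([GA]TCCACA))")


def reverse_complement(seq):
    """Reverse-complement an upper-case A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_boxes(seq, pattern=FULL_BOX):
    """Return (start, strand, site) tuples; start is 0-based on the given strand."""
    seq = seq.upper()
    hits = []
    for strand, text in (("+", seq), ("-", reverse_complement(seq))):
        for match in pattern.finditer(text):
            site = match.group(1)
            start = match.start(1)
            if strand == "-":
                # Convert the coordinate back to the forward strand.
                start = len(seq) - start - len(site)
            hits.append((start, strand, site))
    return sorted(hits)


toy = "GGTTGTCCACAGG"  # made-up sequence containing the TTGTCCACA box
print(find_boxes(toy))             # [(2, '+', 'TTGTCCACA')]
print(find_boxes(toy, CORE_7MER))  # [(4, '+', 'GTCCACA')]
```

A pattern match says nothing about affinity; the abstract reports that variants at the first two positions bind less than 10-fold more weakly than TTGTCCACA, which is the rationale for offering the relaxed 7-mer scan.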
Soyk, Sebastian; Simková, Klára; Zürcher, Evelyne; Luginbühl, Leonie; Brand, Luise H; Vaughan, Cara K; Wanke, Dierk; Zeeman, Samuel C
2014-04-01
Plant BZR1-BAM transcription factors contain a β-amylase (BAM)-like domain, characteristic of proteins involved in starch breakdown. The enzyme-derived domains appear to be noncatalytic, but they determine the function of the two Arabidopsis thaliana BZR1-BAM isoforms (BAM7 and BAM8) during transcriptional initiation. Removal or swapping of the BAM domains demonstrates that the BAM7 BAM domain restricts DNA binding and transcriptional activation, while the BAM8 BAM domain allows both activities. Furthermore, we demonstrate that BAM7 and BAM8 interact on the protein level and cooperate during transcriptional regulation. Site-directed mutagenesis of residues in the BAM domain of BAM8 shows that its function as a transcriptional activator is independent of catalysis but requires an intact substrate binding site, suggesting it may bind a ligand. Microarray experiments with plants overexpressing truncated versions lacking the BAM domain indicate that the pseudo-enzymatic domain increases selectivity for the preferred cis-regulatory element BBRE (BZR1-BAM Responsive Element). Side specificity toward the G-box may allow crosstalk to other signaling networks. This work highlights the importance of the enzyme-derived domain of BZR1-BAMs, supporting their potential role as metabolic sensors. © 2014 American Society of Plant Biologists. All rights reserved.
Context influences on TALE–DNA binding revealed by quantitative profiling
Rogers, Julia M.; Barrera, Luis A.; Reyon, Deepak; Sander, Jeffry D.; Kellis, Manolis; Joung, J Keith; Bulyk, Martha L.
2015-01-01
Transcription activator-like effector (TALE) proteins recognize DNA using a seemingly simple DNA-binding code, which makes them attractive for use in genome engineering technologies that require precise targeting. Although this code is used successfully to design TALEs to target specific sequences, off-target binding has been observed and is difficult to predict. Here we explore TALE–DNA interactions comprehensively by quantitatively assaying the DNA-binding specificities of 21 representative TALEs to ∼5,000–20,000 unique DNA sequences per protein using custom-designed protein-binding microarrays (PBMs). We find that protein context features exert significant influences on binding. Thus, the canonical recognition code does not fully capture the complexity of TALE–DNA binding. We used the PBM data to develop a computational model, Specificity Inference For TAL-Effector Design (SIFTED), to predict the DNA-binding specificity of any TALE. We provide SIFTED as a publicly available web tool that predicts potential genomic off-target sites for improved TALE design. PMID:26067805
Siponen, Marina I.; Wisniewska, Magdalena; Lehtiö, Lari; Johansson, Ida; Svensson, Linda; Raszewski, Grzegorz; Nilsson, Lennart; Sigvardsson, Mikael; Berglund, Helena
2010-01-01
The early B-cell factor (EBF) transcription factors are central regulators of development in several organs and tissues. This protein family shows low sequence similarity to other protein families, which is why structural information for the functional domains of these proteins is crucial to understand their biochemical features. We have used a modular approach to determine the crystal structures of the structured domains in the EBF family. The DNA binding domain reveals a striking resemblance to the DNA binding domains of the Rel homology superfamily of transcription factors but contains a unique zinc binding structure, termed zinc knuckle. Further the EBF proteins contain an IPT/TIG domain and an atypical helix-loop-helix domain with a novel type of dimerization motif. The data presented here provide insights into unique structural features of the EBF proteins and open possibilities for detailed molecular investigations of this important transcription factor family. PMID:20592035
Klug, Aaron
2010-02-01
A long-standing goal of molecular biologists has been to construct DNA-binding proteins for the control of gene expression. The classical Cys2His2 (C2H2) zinc finger design is ideally suited for such purposes. Discriminating between closely related DNA sequences both in vitro and in vivo, this naturally occurring design was adopted for engineering zinc finger proteins (ZFPs) to target genes specifically. Zinc fingers were discovered in 1985, arising from the interpretation of our biochemical studies on the interaction of the Xenopus protein transcription factor IIIA (TFIIIA) with 5S RNA. Subsequent structural studies revealed its three-dimensional structure and its interaction with DNA. Each finger constitutes a self-contained domain stabilized by a zinc (Zn) ion ligated to a pair of cysteines and a pair of histidines and also by an inner structural hydrophobic core. This discovery showed not only a new protein fold but also a novel principle of DNA recognition. Whereas other DNA-binding proteins generally make use of the 2-fold symmetry of the double helix, functioning as homo- or heterodimers, zinc fingers can be linked linearly in tandem to recognize nucleic acid sequences of varying lengths. This modular design offers a large number of combinatorial possibilities for the specific recognition of DNA (or RNA). It is therefore not surprising that the zinc finger is found widespread in nature, including 3% of the genes of the human genome. The zinc finger design can be used to construct DNA-binding proteins for specific intervention in gene expression. By fusing selected zinc finger peptides to repression or activation domains, genes can be selectively switched off or on by targeting the peptide to the desired gene target. It was also suggested that by combining an appropriate zinc finger peptide with other effector or functional domains, e.g. from nucleases or integrases to form chimaeric proteins, genomes could be modified or manipulated. 
The first example of the power of the method was published in 1994 when a three-finger protein was constructed to block the expression of a human oncogene transformed into a mouse cell line. The same paper also described how a reporter gene was activated by targeting an inserted 9-base pair (bp) sequence, which acts as the promoter. Thus, by fusing zinc finger peptides to repression or activation domains, genes can be selectively switched off or on. It was also suggested that, by combining zinc fingers with other effector or functional domains, e.g. from nucleases or integrases, to form chimaeric proteins, genomes could be manipulated or modified. Several applications of such engineered ZFPs are described here, including some of therapeutic importance, and also their adaptation for breeding improved crop plants.
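The combinatorial argument behind tandem zinc fingers can be made concrete with a little arithmetic. The sketch below assumes three base pairs per finger (consistent with the three-finger protein and 9-bp site mentioned above), a genome of roughly 3 × 10^9 bp, and a uniformly random sequence; all three are simplifying assumptions, and the function names are hypothetical.

```python
def distinct_sites(n_fingers, bp_per_finger=3):
    """Number of distinct DNA sequences of the length read by n tandem fingers."""
    return 4 ** (bp_per_finger * n_fingers)


def expected_random_occurrences(n_fingers, genome_bp=3_000_000_000):
    """Expected count of one given site per strand in a random genome of this size."""
    return genome_bp / distinct_sites(n_fingers)


for n in (3, 6):
    print(n, distinct_sites(n), round(expected_random_occurrences(n), 3))
```

Under these assumptions a 9-bp site is expected to recur thousands of times by chance in a genome of that size, whereas an 18-bp site read by six fingers is expected less than once, which is one way to see why longer finger arrays are attractive for targeting single genes.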
Takai, T; Nishita, Y; Iguchi-Ariga, S M; Ariga, H
1994-01-01
We have previously reported the human cDNA encoding MSSP-1, a sequence-specific double- and single-stranded DNA binding protein [Negishi, Nishita, Saëgusa, Kakizaki, Galli, Kihara, Tamai, Miyajima, Iguchi-Ariga and Ariga (1994) Oncogene, 9, 1133-1143]. MSSP-1 binds to a DNA replication origin/transcriptional enhancer of the human c-myc gene and has turned out to be identical with Scr2, a human protein which complements the defect of cdc2 kinase in S. pombe [Kataoka and Nojima (1994) Nucleic Acid Res., 22, 2687-2693]. We have cloned the cDNA for MSSP-2, another member of the MSSP family of proteins. The MSSP-2 cDNA shares highly homologous sequences with MSSP-1 cDNA, except for the insertion of 48 bp coding 16 amino acids near the C-terminus. Like MSSP-1, MSSP-2 has RNP-1 consensus sequences. The results of the experiments using bacterially expressed MSSP-2, and its deletion mutants, as histidine fusion proteins suggested that the binding specificity of MSSP-2 to double- and single-stranded DNA is the same as that of MSSP-1, and that the RNP consensus sequences are required for the DNA binding of the protein. MSSP-2 stimulated the DNA replication of an SV40-derived plasmid containing the binding sequence for MSSP-1 or -2. MSSP-2 is hence suggested to play an important role in regulation of DNA replication. PMID:7838710
Boer, D. Roeland; Ruiz-Masó, José Angel; Rueda, Manuel; Petoukhov, Maxim V.; Machón, Cristina; Svergun, Dmitri I.; Orozco, Modesto; del Solar, Gloria; Coll, Miquel
2016-01-01
DNA replication initiation is a vital and tightly regulated step in all replicons and requires an initiator factor that specifically recognizes the DNA replication origin and starts replication. RepB from the promiscuous streptococcal plasmid pMV158 is a hexameric ring protein evolutionarily related to viral initiators. Here we explore the conformational plasticity of the RepB hexamer by i) SAXS, ii) sedimentation experiments, iii) molecular simulations and iv) X-ray crystallography. Combining these techniques, we derive an estimate of the conformational ensemble in solution showing that the C-terminal oligomerisation domains of the protein form a rigid cylindrical scaffold to which the N-terminal DNA-binding/catalytic domains are attached as highly flexible appendages, featuring multiple orientations. In addition, we show that the hinge region connecting both domains plays a pivotal role in the observed plasticity. Sequence comparisons and a literature survey show that this hinge region could exist in other initiators, suggesting that it is a common, crucial structural element for DNA binding and manipulation. PMID:26875695
Deciphering the genomic targets of alkylating polyamide conjugates using high-throughput sequencing
Chandran, Anandhakumar; Syed, Junetha; Taylor, Rhys D.; Kashiwazaki, Gengo; Sato, Shinsuke; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi
2016-01-01
Chemically engineered small molecules targeting specific genomic sequences play an important role in drug development research. Pyrrole-imidazole polyamides (PIPs) are a group of molecules that can bind to the DNA minor-groove and can be engineered to target specific sequences. Their biological effects rely primarily on their selective DNA binding. However, the binding mechanism of PIPs at the chromatinized genome level is poorly understood. Herein, we report a method using high-throughput sequencing to identify the DNA-alkylating sites of PIP-indole-seco-CBI conjugates. High-throughput sequencing analysis of conjugate 2 showed highly similar DNA-alkylating sites on synthetic oligos (histone-free DNA) and on human genomes (chromatinized DNA context). To our knowledge, this is the first report identifying alkylation sites across genomic DNA by alkylating PIP conjugates using high-throughput sequencing. PMID:27098039
Lighting Up the Thioflavin T by Parallel-Stranded TG(GA)n DNA Homoduplexes.
Zhu, Jinbo; Yan, Zhiqiang; Zhou, Weijun; Liu, Chuanbo; Wang, Jin; Wang, Erkang
2018-06-22
Thioflavin T (ThT) was once regarded as a specific fluorescent probe for the human telomeric G-quadruplex, but in recent years other kinds of DNA have also been found to bind ThT. Herein, we focus on G-rich parallel-stranded DNA and utilize fluorescence, absorbance, circular dichroism, and surface plasmon resonance spectroscopy to investigate its interaction with ThT. Pyrene labeling and molecular modeling are applied to unveil the binding mechanism. We find that a new class of non-G-quadruplex G-rich parallel-stranded (ps) DNA with the sequence TG(GA)n can bind ThT and increase its fluorescence, with an enhancement ability superior to that of the G-quadruplex. The optimal binding specificity for ThT is conferred by two parts. The first part is composed of the two bases TG at the 5' end, which form a critical domain and play an important role in the formation of the binding site for ThT. The second part is the remaining alternating d(GA) bases, which form the ps homoduplex and cooperate with the TG bases at the 5' end to bind ThT.
Fenstermacher, Katherine J; Achuthan, Vasudevan; Schneider, Thomas D; DeStefano, Jeffrey J
2018-01-16
DNA polymerases (DNAPs) recognize 3' recessed termini on duplex DNA and carry out nucleotide catalysis. Unlike promoter-specific RNA polymerases (RNAPs), no sequence specificity is required for binding or initiation of catalysis. Despite this, previous results indicate that viral reverse transcriptases bind much more tightly to DNA primers that mimic the polypurine tract. In the current report, primer sequences that bind with high affinity to Taq and Klenow polymerases were identified using a modified Selective Evolution of Ligands by Exponential Enrichment (SELEX) approach. Two Taq-specific primers that bound ∼10 (Taq1) and over 100 (Taq2) times more stably than controls to Taq were identified. Taq1 contained 8 nucleotides (5'-CACTAAAG-3') that matched the phage T3 RNAP "core" promoter. Both primers dramatically outcompeted primers with similar binding thermodynamics in PCR reactions. Similarly, exonuclease minus Klenow polymerase also selected a high affinity primer that contained a related core promoter sequence from phage T7 RNAP (5'-ACTATAG-3'). For both Taq and Klenow, even small modifications to the sequence resulted in large losses in binding affinity, suggesting that binding was highly sequence-specific. The results are discussed in the context of possible effects on multi-primer (multiplex) PCR assays, molecular information theory, and the evolution of RNAPs and DNAPs. Importance: This work further demonstrates that primer-dependent DNA polymerases can have strong sequence biases leading to dramatically tighter binding to specific sequences. These may be related to biological function, or be a consequence of the structural architecture of the enzyme. New sequence specificities for Taq and Klenow polymerases were uncovered, and among them were sequences that contained the core promoter elements from T3 and T7 phage RNA polymerase promoters. This suggests the intriguing possibility that phage RNA polymerases exploited intrinsic binding affinities of ancestral DNA polymerases to develop their promoters. Conversely, DNA polymerases could have evolved from related RNA polymerases and retained the intrinsic binding preference despite there being no clear function for such a preference in DNA biology. Copyright © 2018 American Society for Microbiology.
Siaud, Nicolas; Lam, Isabel; Christ, Nicole; Schlacher, Katharina; Xia, Bing; Jasin, Maria
2011-01-01
The breast cancer suppressor BRCA2 is essential for the maintenance of genomic integrity in mammalian cells through its role in DNA repair by homologous recombination (HR). Human BRCA2 is 3,418 amino acids and is comprised of multiple domains that interact with the RAD51 recombinase and other proteins as well as with DNA. To gain insight into the cellular function of BRCA2 in HR, we created fusions consisting of various BRCA2 domains and also introduced mutations into these domains to disrupt specific protein and DNA interactions. We find that a BRCA2 fusion peptide deleted for the DNA binding domain and active in HR is completely dependent on interaction with the PALB2 tumor suppressor for activity. Conversely, a BRCA2 fusion peptide deleted for the PALB2 binding domain is dependent on an intact DNA binding domain, providing a role for this conserved domain in vivo; mutagenesis suggests that both single-stranded and double-stranded DNA binding activities in the DNA binding domain are required for its activity. Given that PALB2 itself binds DNA, these results suggest alternative mechanisms to deliver RAD51 to DNA. In addition, the BRCA2 C terminus contains both RAD51-dependent and -independent activities which are essential to HR in some contexts. Finally, binding the small peptide DSS1 is essential for activity when its binding domain is present, but not when it is absent. Our results reveal functional redundancy within the BRCA2 protein and emphasize the plasticity of this large protein built for optimal HR function in mammalian cells. The occurrence of disease-causing mutations throughout BRCA2 suggests sub-optimal HR from a variety of domain modulations. PMID:22194698
Pastor, N; Pardo, L; Weinstein, H
1997-01-01
The binding of the TATA box-binding protein (TBP) to a TATA sequence in DNA is essential for eukaryotic basal transcription. TBP binds in the minor groove of DNA, causing a large distortion of the DNA helix. Given the apparent stereochemical equivalence of AT and TA basepairs in the minor groove, DNA deformability must play a significant role in binding site selection, because not all AT-rich sequences are bound effectively by TBP. To gain insight into the precise role that the properties of the TATA sequence have in determining the specificity of the DNA substrates of TBP, the solution structure and dynamics of seven DNA dodecamers have been studied by using molecular dynamics simulations. The analysis of the structural properties of basepair steps in these TATA sequences suggests a reason for the preference for alternating pyrimidine-purine (YR) sequences, but indicates that these properties cannot be the sole determinant of the sequence specificity of TBP. Rather, recognition depends on the interplay between the inherent deformability of the DNA and steric complementarity at the molecular interface. PMID:9251783
Perez-Rueda, Ernesto; Hernandez-Guerrero, Rafael; Martinez-Nuñez, Mario Alberto; Armenta-Medina, Dagoberto; Sanchez, Israel; Ibarra, J Antonio
2018-01-01
Gene regulation at the transcriptional level is a central process in all organisms, and DNA-binding transcription factors, known as TFs, play a fundamental role. This class of proteins usually binds at specific DNA sequences, activating or repressing gene expression. In general, TFs are composed of two domains: the DNA-binding domain (DBD) and an extra domain, which in this work we have named "companion domain" (CD). The latter could be involved in one or more functions, such as ligand binding, protein-protein interactions, or even enzymatic activity. In contrast to DBDs, which have been widely characterized both experimentally and bioinformatically, information on the abundance, distribution, variability and possible role of the CDs is scarce. Here, we investigated these issues associated with the domain architectures of TFs in prokaryotic genomes. To this end, 19 families of TFs in 761 non-redundant bacterial and archaeal genomes were evaluated. We found four main groups based on the abundance and distribution in the analyzed genomes: i) LysR and TetR/AcrR; ii) AraC/XylS, SinR, and others; iii) Lrp, Fis, ArsR, and others; and iv) a group that included only two families, ArgR and BirA. Based on a classification of the organisms according to their life-styles, a higher abundance of regulatory families was identified in free-living organisms than in pathogenic, extremophilic or intracellular organisms. Finally, the protein architecture diversity associated with the 19 families, assessed with a weight score for domain promiscuity, showed which regulatory families were characterized by either a large diversity of CDs, here named "promiscuous" families given the elevated number of variable domains found in those TFs, or a low diversity of CDs. Altogether, this information helped us to understand the diversity and distribution of the 19 prokaryotic TF families. Moreover, initial steps were taken to comprehend the variability of the extra domain in those TFs, which eventually might assist in evolutionary and functional studies.
Kristie, T M; LeBowitz, J H; Sharp, P A
1989-01-01
The herpes simplex virus transactivator, alpha TIF, stimulates transcription of the alpha/immediate early genes via a cis-acting site containing an octamer element and a conserved flanking sequence. The alpha TIF protein, produced in a baculovirus expression system, nucleates the formation of at least two DNA--protein complexes on this regulatory element. Both of these complexes contain the ubiquitous Oct-1 protein, whose POU domain alone is sufficient to allow assembly of the alpha TIF-dependent complexes. A second member of the POU domain family, the lymphoid specific Oct-2 protein, can also be assembled into similar complexes at high concentrations of alpha TIF protein. These complexes contain at least two cellular proteins in addition to Oct-1. One of these proteins is present in both insect and HeLa cells and probably recognizes sequences in the cis element. The second cellular protein, only present in HeLa cells, probably binds by protein-protein interactions. PMID:2556266
DOE Office of Scientific and Technical Information (OSTI.GOV)
MacArthur, Stewart; Li, Xiao-Yong; Li, Jingyi
2009-05-15
BACKGROUND: We previously established that six sequence-specific transcription factors that initiate anterior/posterior patterning in Drosophila bind to overlapping sets of thousands of genomic regions in blastoderm embryos. While regions bound at high levels include known and probable functional targets, more poorly bound regions are preferentially associated with housekeeping genes and/or genes not transcribed in the blastoderm, and are frequently found in protein coding sequences or in less conserved non-coding DNA, suggesting that many are likely non-functional. RESULTS: Here we show that an additional 15 transcription factors that regulate other aspects of embryo patterning show a similar quantitative continuum of function and binding to thousands of genomic regions in vivo. Collectively, the 21 regulators show a surprisingly high overlap in the regions they bind given that they belong to 11 DNA binding domain families, specify distinct developmental fates, and can act via different cis-regulatory modules. We demonstrate, however, that quantitative differences in relative levels of binding to shared targets correlate with the known biological and transcriptional regulatory specificities of these factors. CONCLUSIONS: It is likely that the overlap in binding of biochemically and functionally unrelated transcription factors arises from the high concentrations of these proteins in nuclei, which, coupled with their broad DNA binding specificities, directs them to regions of open chromatin. We suggest that most animal transcription factors will be found to show a similar broad overlapping pattern of binding in vivo, with specificity achieved by modulating the amount, rather than the identity, of bound factor.
Sequence-specific DNA binding Pyrrole-imidazole polyamides and their applications.
Kawamoto, Yusuke; Bando, Toshikazu; Sugiyama, Hiroshi
2018-05-01
Pyrrole-imidazole polyamides (Py-Im polyamides) are cell-permeable compounds that bind to the minor groove of double-stranded DNA in a sequence-specific manner without causing denaturation of the DNA. These compounds can be used to control gene expression and to stain specific sequences in cells. Here, we review the history, structural variations, and functional investigations of Py-Im polyamides.
Hwang, Dae-Sik; Lee, Bo-Young; Kim, Hui-Su; Lee, Min Chul; Kyung, Do-Hyun; Om, Ae-Son; Rhee, Jae-Sung; Lee, Jae-Seong
2014-11-18
Nuclear receptors (NRs) are a large superfamily of proteins defined by a DNA-binding domain (DBD) and a ligand-binding domain (LBD). They function as transcriptional regulators to control expression of genes involved in development, homeostasis, and metabolism. The number of NRs differs from species to species, because of gene duplications and/or lineage-specific gene losses during metazoan evolution. Many NRs in arthropods interact with the ecdysteroid hormone and are involved in ecdysone-mediated signaling. The nuclear receptor superfamily complement has been reported in several arthropods, including crustaceans, but not in copepods. We identified the entire NR repertoire of the copepod Tigriopus japonicus, which is an important marine model species for ecotoxicology and environmental genomics. Using whole genome and transcriptome sequences, we identified a total of 31 nuclear receptors in the genome of T. japonicus. Nomenclature of the nuclear receptors was determined based on the sequence similarities of the DBD and LBD. The 7 subfamilies of NRs separate into five major clades (subfamilies NR1, NR2, NR3, NR4, and NR5/6). Although the repertoire of NR members in T. japonicus was similar to that reported for other arthropods, there was an expansion of the NR1 subfamily in T. japonicus. The twelve unique nuclear receptors identified in T. japonicus are members of NR1L. This expansion may be a unique lineage-specific feature of crustaceans. Interestingly, E78 and HR83, which are present in other arthropods, were absent from the genomes of T. japonicus and two congeneric copepod species (T. japonicus and Tigriopus californicus), suggesting copepod lineage-specific gene loss. We identified all NR receptors present in the copepod T. japonicus. Knowledge of the copepod nuclear receptor repertoire will contribute to a better understanding of copepod- and crustacean-specific NR evolution.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Safo, Martin K., E-mail: msafo@vcu.edu; Ko, Tzu-Ping; Musayev, Faik N.
The up-and-down binding of dimeric MecI to mecA dyad DNA may account for the cooperative effect of the repressor. The dimeric repressor MecI regulates the mecA gene that encodes the penicillin-binding protein PBP-2a in methicillin-resistant Staphylococcus aureus (MRSA). MecI is similar to BlaI, the repressor for the blaZ gene of β-lactamase. MecI and BlaI can bind to both operator DNA sequences. The crystal structure of MecI in complex with the 32 base-pair cognate DNA of mec was determined to 3.8 Å resolution. MecI is a homodimer and each monomer consists of a compact N-terminal winged-helix domain, which binds to DNA, and a loosely packed C-terminal helical domain, which intertwines with its counter-monomer. The crystal contains horizontal layers of virtual DNA double helices extending in three directions, which are separated by perpendicular DNA segments. Each DNA segment is bound to two MecI dimers. Similar to the BlaI–mec complex, but unlike the MecI–bla complex, the MecI repressors bind to both sides of the mec DNA dyad that contains four conserved sequences of TACA/TGTA. The results confirm the up-and-down binding to the mec operator, which may account for the cooperative effect of the repressor.
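The conserved TACA/TGTA elements in the record above form a reverse-complement pair: TACA on one strand reads as TGTA on the opposite strand, which is what a dyad-symmetric operator requires. The check can be sketched in a few lines of Python. This is a minimal illustration only; the function name is hypothetical and nothing here is taken from the study itself.

```python
# Watson-Crick complements for the four unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


print(reverse_complement("TACA"))  # prints TGTA
```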
Kong, Daochun; Coleman, Thomas R.; DePamphilis, Melvin L.
2003-01-01
Budding yeast (Saccharomyces cerevisiae) origin recognition complex (ORC) requires ATP to bind specific DNA sequences, whereas fission yeast (Schizosaccharomyces pombe) ORC binds to specific, asymmetric A:T-rich sites within replication origins, independently of ATP, and frog (Xenopus laevis) ORC seems to bind DNA non-specifically. Here we show that despite these differences, ORCs are functionally conserved. Firstly, SpOrc1, SpOrc4 and SpOrc5, like those from other eukaryotes, bound ATP and exhibited ATPase activity, suggesting that ATP is required for pre-replication complex (pre-RC) assembly rather than origin specificity. Secondly, SpOrc4, which is solely responsible for binding SpORC to DNA, inhibited up to 70% of XlORC-dependent DNA replication in Xenopus egg extract by preventing XlORC from binding to chromatin and assembling pre-RCs. Chromatin-bound SpOrc4 was located at AT-rich sequences. XlORC in egg extract bound preferentially to asymmetric A:T-sequences in either bare DNA or in sperm chromatin, and it recruited XlCdc6 and XlMcm proteins to these sequences. These results reveal that XlORC initiates DNA replication preferentially at the same or similar sites to those targeted in S. pombe. PMID:12840006
Solution structure and DNA-binding properties of the C-terminal domain of UvrC from E. coli
Singh, S.; Folkers, G.E.; Bonvin, A.M.J.J.; Boelens, R.; Wechselberger, R.; Niztayev, A.; Kaptein, R.
2002-01-01
The C-terminal domain of the UvrC protein (UvrC CTD) is essential for 5′ incision in the prokaryotic nucleotide excision repair process. We have determined the three-dimensional structure of the UvrC CTD using heteronuclear NMR techniques. The structure shows two helix–hairpin–helix (HhH) motifs connected by a small connector helix. The UvrC CTD is shown to mediate structure-specific DNA binding. The domain binds to a single-stranded–double-stranded junction DNA, with a strong specificity towards looped duplex DNA that contains at least six unpaired bases per loop (‘bubble DNA’). Using chemical shift perturbation experiments, the DNA-binding surface is mapped to the first hairpin region encompassing the conserved glycine–valine–glycine residues followed by lysine–arginine–arginine, a positively charged surface patch and the second hairpin region consisting of glycine–isoleucine–serine. A model for the protein–DNA complex is proposed that accounts for this specificity. PMID:12426397
Eukaryotic DNA Ligases: Structural and Functional Insights
Ellenberger, Tom; Tomkinson, Alan E.
2010-01-01
DNA ligases are required for DNA replication, repair, and recombination. In eukaryotes, there are three families of ATP-dependent DNA ligases. Members of the DNA ligase I and IV families are found in all eukaryotes, whereas DNA ligase III family members are restricted to vertebrates. These enzymes share a common catalytic region comprising a DNA-binding domain, a nucleotidyltransferase (NTase) domain, and an oligonucleotide/oligosaccharide binding (OB)-fold domain. The catalytic region encircles nicked DNA with each of the domains contacting the DNA duplex. The unique segments adjacent to the catalytic region of eukaryotic DNA ligases are involved in specific protein-protein interactions with a growing number of DNA replication and repair proteins. These interactions determine the specific cellular functions of the DNA ligase isozymes. In mammals, defects in DNA ligation have been linked with an increased incidence of cancer and neurodegeneration. PMID:18518823
Functional specificity of a Hox protein mediated by the recognition of minor groove structure.
Joshi, Rohit; Passner, Jonathan M; Rohs, Remo; Jain, Rinku; Sosinsky, Alona; Crickmore, Michael A; Jacob, Vinitha; Aggarwal, Aneel K; Honig, Barry; Mann, Richard S
2007-11-02
The recognition of specific DNA-binding sites by transcription factors is a critical yet poorly understood step in the control of gene expression. Members of the Hox family of transcription factors bind DNA by making nearly identical major groove contacts via the recognition helices of their homeodomains. In vivo specificity, however, often depends on extended and unstructured regions that link Hox homeodomains to a DNA-bound cofactor, Extradenticle (Exd). Using a combination of structure determination, computational analysis, and in vitro and in vivo assays, we show that Hox proteins recognize specific Hox-Exd binding sites via residues located in these extended regions that insert into the minor groove but only when presented with the correct DNA sequence. Our results suggest that these residues, which are conserved in a paralog-specific manner, confer specificity by recognizing a sequence-dependent DNA structure instead of directly reading a specific DNA sequence.
Sequence specificity of single-stranded DNA-binding proteins: a novel DNA microarray approach
Morgan, Hugh P.; Estibeiro, Peter; Wear, Martin A.; Max, Klaas E.A.; Heinemann, Udo; Cubeddu, Liza; Gallagher, Maurice P.; Sadler, Peter J.; Walkinshaw, Malcolm D.
2007-01-01
We have developed a novel DNA microarray-based approach for identification of the sequence-specificity of single-stranded nucleic-acid-binding proteins (SNABPs). For verification, we have shown that the major cold shock protein (CspB) from Bacillus subtilis binds with high affinity to pyrimidine-rich sequences, with a binding preference for the consensus sequence, 5′-GTCTTTG/T-3′. The sequence was modelled onto the known structure of CspB and a cytosine-binding pocket was identified, which explains the strong preference for a cytosine base at position 3. This microarray method offers a rapid high-throughput approach for determining the specificity and strength of ss DNA–protein interactions. Further screening of this newly emerging family of transcription factors will help provide an insight into their cellular function. PMID:17488853
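A consensus such as 5′-GTCTTTG/T-3′ can be turned into a simple pattern search over a single-stranded sequence. The sketch below is illustrative only and is not the microarray analysis used in the study. It assumes that "G/T" means the final position may be either G or T, and the test oligonucleotide is invented for the example.

```python
import re

# Assumed reading of the consensus 5'-GTCTTTG/T-3': last position is G or T.
# The lookahead lets overlapping hits be reported as well.
CSPB_CONSENSUS = re.compile(r"(?=(GTCTTT[GT]))")


def find_consensus_sites(ssdna: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched heptamer) for each consensus hit."""
    seq = ssdna.upper()
    return [(m.start(), m.group(1)) for m in CSPB_CONSENSUS.finditer(seq)]


# Hypothetical oligonucleotide, made up for illustration.
example = "AAGTCTTTGCCGTCTTTTAA"
print(find_consensus_sites(example))  # prints [(2, 'GTCTTTG'), (11, 'GTCTTTT')]
```

An exact-match scan like this only flags candidate sites; it says nothing about binding affinity, which the abstract reports as a measured quantity.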
DOE Office of Scientific and Technical Information (OSTI.GOV)
Safo, M.; Ko, T.; Musayev, F.
The dimeric repressor MecI regulates the mecA gene that encodes the penicillin-binding protein PBP-2a in methicillin-resistant Staphylococcus aureus (MRSA). MecI is similar to BlaI, the repressor for the blaZ gene of β-lactamase. MecI and BlaI can bind to both operator DNA sequences. The crystal structure of MecI in complex with the 32 base-pair cognate DNA of mec was determined to 3.8 Å resolution. MecI is a homodimer and each monomer consists of a compact N-terminal winged-helix domain, which binds to DNA, and a loosely packed C-terminal helical domain, which intertwines with its counter-monomer. The crystal contains horizontal layers of virtual DNA double helices extending in three directions, which are separated by perpendicular DNA segments. Each DNA segment is bound to two MecI dimers. Similar to the BlaI-mec complex, but unlike the MecI-bla complex, the MecI repressors bind to both sides of the mec DNA dyad that contains four conserved sequences of TACA/TGTA. The results confirm the up-and-down binding to the mec operator, which may account for the cooperative effect of the repressor.
Bhat, Abhay Prasad; Shin, Minsang; Choy, Hyon E
2014-07-01
Histone-like nucleoid structuring protein (H-NS) is a small but abundant protein present in enteric bacteria and is involved in compaction of the DNA and regulation of transcription. Recent reports have suggested that H-NS binds to a specific AT-rich DNA sequence rather than to intrinsically curved DNA in a sequence-independent manner. We detected two high-specificity H-NS binding sites in the LEE5 promoter of EPEC, centered at -110 and -138, which were close to the proposed consensus H-NS binding motif. To identify the H-NS binding sequence in the LEE5 promoter, we took a random mutagenesis approach and found that mutations at around -138 were specifically defective in regulation by H-NS. It was concluded that H-NS exerts maximum repression via the specific sequence at around -138 and subsequently contacts a subunit of RNAP through oligomerization.
Cloning and characterization of a novel human STAR domain containing cDNA KHDRBS2.
Wang, Liu; Xu, Jian; Zeng, Li; Ye, Xin; Wu, Qihan; Dai, Jianfeng; Ji, Chaoneng; Gu, Shaohua; Zhao, Chunhua; Xie, Yi; Mao, Yumin
2002-12-01
KHDRBS2, KH domain containing, RNA binding, signal transduction associated 2, is an RNA-binding protein that is tyrosine phosphorylated by Src during mitosis. It contains a KH domain, which is embedded in a larger conserved domain called the STAR domain. This protein has a 99% sequence identity with rat SLM-1 (the Sam68-like mammalian protein 1) and 98% sequence identity with mouse SLM-1 in its STAR domain. KHDRBS2 has the characteristic Sam68 SH2 and SH3 domain binding sites. RT-PCR analysis showed its transcript is ubiquitously expressed. The characterization of KHDRBS2 indicates it may link tyrosine kinase signaling cascades with some aspect of RNA metabolism.
Presence of an SH2 domain in the actin-binding protein tensin.
Davis, S; Lu, M L; Lo, S H; Lin, S; Butler, J A; Druker, B J; Roberts, T M; An, Q; Chen, L B
1991-05-03
The molecular cloning of the complementary DNA coding for a 90-kilodalton fragment of tensin, an actin-binding component of focal contacts and other submembraneous cytoskeletal structures, is reported. The derived amino acid sequence revealed the presence of a Src homology 2 (SH2) domain. This domain is shared by a number of signal transduction proteins including nonreceptor tyrosine kinases such as Abl, Fps, Src, and Src family members, the transforming protein Crk, phospholipase C-gamma 1, PI-3 (phosphatidylinositol) kinase, and guanosine triphosphatase-activating protein (GAP). Like the SH2 domain found in Src, Crk, and Abl, the SH2 domain of tensin bound specifically to a number of phosphotyrosine-containing proteins from v-src-transformed cells. Tensin was also found to be phosphorylated on tyrosine residues. These findings suggest that by possessing both actin-binding and phosphotyrosine-binding activities and being itself a target for tyrosine kinases, tensin may link signal transduction pathways with the cytoskeleton.
Rangachari, Vijayaraghavan; Marin, Vedrana; Bienkiewicz, Ewa A; Semavina, Maria; Guerrero, Luis; Love, John F; Murphy, John R; Logan, Timothy M
2005-04-19
The diphtheria toxin repressor (DtxR) is an Fe(II)-activated transcriptional regulator of iron homeostatic and virulence genes in Corynebacterium diphtheriae. DtxR is a two-domain protein that contains two structurally and functionally distinct metal binding sites. Here, we investigate the molecular steps associated with activation by Ni(II)Cl(2) and Cd(II)Cl(2). Equilibrium binding energetics for Ni(II) were obtained from isothermal titration calorimetry, indicating apparent metal dissociation constants of 0.2 and 1.7 μM for two independent sites. The binding isotherms for Ni(II) and Cd(II) exhibited a characteristic exothermic-endothermic pattern that was used to infer the metal binding sequence by comparing the wild-type isotherm with those of several binding site mutants. These data were complemented by measuring the distance between specific backbone amide nitrogens and the first equivalent of metal through heteronuclear NMR relaxation measurements. Previous studies indicated that metal binding affects a disordered to ordered transition in the metal binding domain. The coupling between metal binding and structure change was investigated using near-UV circular dichroism spectroscopy. Together, the data show that the first equivalent of metal is bound by the primary metal binding site. This binding orients the DNA binding helices and begins to fold the N-terminal domain. Subsequent binding at the ancillary site completes the folding of this domain and formation of the dimer interface. This model is used to explain the behavior of several mutants.
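To give a feel for what two apparent dissociation constants of 0.2 and 1.7 μM mean in practice, the sketch below computes the fractional occupancy of each site from the standard single-site relation, occupancy = [M] / ([M] + Kd). It is a simplified illustration and not the calorimetric fitting model of the study. It assumes the two sites are independent (as the abstract states) and that the free metal concentration is known. The abstract does not say which constant belongs to the primary site and which to the ancillary site, so the sites are indexed only by their Kd, and the function names are hypothetical.

```python
# Apparent Kd values (micromolar) reported for the two independent sites.
KD_SITES_UM = (0.2, 1.7)


def site_occupancies(free_metal_um: float, kds=KD_SITES_UM) -> tuple:
    """Fractional occupancy of each independent site: [M] / ([M] + Kd)."""
    return tuple(free_metal_um / (free_metal_um + kd) for kd in kds)


def mean_metal_bound(free_metal_um: float) -> float:
    """Average number of metal ions bound per protein monomer (0 to 2)."""
    return sum(site_occupancies(free_metal_um))


for conc in (0.2, 1.7, 10.0):
    print(conc, site_occupancies(conc), mean_metal_bound(conc))
```

Each site is half-occupied when the free metal concentration equals its Kd, so under these assumptions the tighter site fills well before the weaker one.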
Ranganathan, Sridevi; Cheung, Jonah; Cassidy, Michael; Ginter, Christopher; Pata, Janice D; McDonough, Kathleen A
2018-01-09
Mycobacterium tuberculosis (Mtb) encodes two CRP/FNR family transcription factors (TF) that contribute to virulence, Cmr (Rv1675c) and CRPMt (Rv3676). Prior studies identified distinct chromosomal binding profiles for each TF despite their recognizing overlapping DNA motifs. The present study shows that Cmr binding specificity is determined by discriminator nucleotides at motif positions 4 and 13. X-ray crystallography and targeted mutational analyses identified an arginine-rich loop that expands Cmr's DNA interactions beyond the classical helix-turn-helix contacts common to all CRP/FNR family members and facilitates binding to imperfect DNA sequences. Cmr binding to DNA results in a pronounced asymmetric bending of the DNA and its high level of cooperativity is consistent with DNA-facilitated dimerization. A unique N-terminal extension inserts between the DNA binding and dimerization domains, partially occluding the site where the canonical cAMP binding pocket is found. However, an unstructured region of this N-terminus may help modulate Cmr activity in response to cellular signals. Cmr's multiple levels of DNA interaction likely enhance its ability to integrate diverse gene regulatory signals, while its novel structural features establish Cmr as an atypical CRP/FNR family member.
Specific and non-specific interactions of ParB with DNA: implications for chromosome segregation
Taylor, James A.; Pastrana, Cesar L.; Butterer, Annika; Pernstich, Christian; Gwynn, Emma J.; Sobott, Frank; Moreno-Herrero, Fernando; Dillingham, Mark S.
2015-01-01
The segregation of many bacterial chromosomes is dependent on the interactions of ParB proteins with centromere-like DNA sequences called parS that are located close to the origin of replication. In this work, we have investigated the binding of Bacillus subtilis ParB to DNA in vitro using a variety of biochemical and biophysical techniques. We observe tight and specific binding of a ParB homodimer to the parS sequence. Binding of ParB to non-specific DNA is more complex and displays apparent positive co-operativity that is associated with the formation of larger, poorly defined, nucleoprotein complexes. Experiments with magnetic tweezers demonstrate that non-specific binding leads to DNA condensation that is reversible by protein unbinding or force. The condensed DNA structure is not well ordered and we infer that it is formed by many looping interactions between neighbouring DNA segments. Consistent with this view, ParB is also able to stabilize writhe in single supercoiled DNA molecules and to bridge segments from two different DNA molecules in trans. The experiments provide no evidence for the promotion of non-specific DNA binding and/or condensation events by the presence of parS sequences. The implications of these observations for chromosome segregation are discussed. PMID:25572315
A divergent Pumilio repeat protein family for pre-rRNA processing and mRNA localization
Qiu, Chen; McCann, Kathleen L.; Wine, Robert N.; ...
2014-12-15
Pumilio/feminization of XX and XO animals (fem)-3 mRNA-binding factor (PUF) proteins bind sequence specifically to mRNA targets using a single-stranded RNA-binding domain comprising eight Pumilio (PUM) repeats. PUM repeats have now been identified in proteins that function in pre-rRNA processing, including human Puf-A and yeast Puf6. This is a role not previously ascribed to PUF proteins. In this paper we present crystal structures of human Puf-A that reveal a class of nucleic acid-binding proteins with 11 PUM repeats arranged in an “L”-like shape. In contrast to classical PUF proteins, Puf-A forms sequence-independent interactions with DNA or RNA, mediated by conserved basic residues. We demonstrate that equivalent basic residues in yeast Puf6 are important for RNA binding, pre-rRNA processing, and mRNA localization. Finally, PUM repeats can be assembled into alternative folds that bind to structured nucleic acids in addition to forming canonical eight-repeat crescent-shaped RNA-binding domains found in classical PUF proteins.
Dissection of the methyl-CpG binding domain from the chromosomal protein MeCP2.
Nan, X; Meehan, R R; Bird, A
1993-01-01
MeCP2 is a chromosomal protein which binds to DNA that is methylated at CpG. In situ immunofluorescence in mouse cells has shown that the protein is most concentrated in pericentromeric heterochromatin, suggesting that MeCP2 may play a role in the formation of inert chromatin. Here we have isolated a minimal methyl-CpG binding domain (MBD) from MeCP2. MBD is 85 amino acids in length, and binds exclusively to DNA that contains one or more symmetrically methylated CpGs. MBD has negligible non-specific affinity for DNA, confirming that non-specific and methyl-CpG specific binding domains of MeCP2 are distinct. In vitro footprinting indicates that MBD binding can protect a 12 nucleotide region surrounding a methyl-CpG pair, with an approximate dissociation constant of 10⁻⁹ M. PMID:8177735
DNA residence time is a regulatory factor of transcription repression
Clauß, Karen; Popp, Achim P.; Schulze, Lena; Hettich, Johannes; Reisser, Matthias; Escoter Torres, Laura; Uhlenhaut, N. Henriette
2017-01-01
Transcription comprises a highly regulated sequence of intrinsically stochastic processes, resulting in bursts of transcription intermitted by quiescence. In transcription activation or repression, a transcription factor binds dynamically to DNA, with a residence time unique to each factor. Whether the DNA residence time is important in the transcription process is unclear. Here, we designed a series of transcription repressors differing in their DNA residence time by utilizing the modular DNA binding domain of transcription activator-like effectors (TALEs) and varying the number of nucleotide-recognizing repeat domains. We characterized the DNA residence times of our repressors in living cells using single molecule tracking. The residence times depended non-linearly on the number of repeat domains and differed by more than a factor of six. The factors provoked a residence time-dependent decrease in transcript level of the glucocorticoid receptor-activated gene SGK1. Down regulation of transcription was due to a lower burst frequency in the presence of long-binding repressors and is in accordance with a model of competitive inhibition of endogenous activator binding. Our single molecule experiments reveal transcription factor DNA residence time as a regulatory factor controlling transcription repression and establish TALE-DNA binding domains as tools for the temporal dissection of transcription regulation. PMID:28977492
Nanopore sensing of individual transcription factors bound to DNA
Squires, Allison; Atas, Evrim; Meller, Amit
2015-01-01
Transcription factor (TF)-DNA interactions are the primary control point in regulation of gene expression. Characterization of these interactions is essential for understanding genetic regulation of biological systems and developing novel therapies to treat cellular malfunctions. Solid-state nanopores are a highly versatile class of single-molecule sensors that can provide rich information about local properties of long charged biopolymers using the current blockage patterns generated during analyte translocation, and provide a novel platform for characterization of TF-DNA interactions. The DNA-binding domain of the TF Early Growth Response Protein 1 (EGR1), a prototypical zinc finger protein known as zif268, is used as a model system for this study. zif268 adopts two distinct bound conformations corresponding to specific and nonspecific binding, according to the local DNA sequence. Here we implement a solid-state nanopore platform for direct, label- and tether-free single-molecule detection of zif268 bound to DNA. We demonstrate detection of single zif268 TFs bound to DNA according to current blockage sublevels and duration of translocation through the nanopore. We further show that the nanopore can detect and discriminate both specific and nonspecific binding conformations of zif268 on DNA via the distinct current blockage patterns corresponding to each of these two known binding modes. PMID:26109509
Detecting Coevolution in and among Protein Domains
Yeang, Chen-Hsiang; Haussler, David
2007-01-01
Correlated changes of nucleic or amino acids have provided strong information about the structures and interactions of molecules. Despite the rich literature in coevolutionary sequence analysis, previous methods often have to trade off between generality, simplicity, phylogenetic information, and specific knowledge about interactions. Furthermore, despite the evidence of coevolution in selected protein families, a comprehensive screening of coevolution among all protein domains is still lacking. We propose an augmented continuous-time Markov process model for sequence coevolution. The model can handle different types of interactions, incorporate phylogenetic information and sequence substitution, has only one extra free parameter, and requires no knowledge about interaction rules. We employ this model to large-scale screenings on the entire protein domain database (Pfam). Strikingly, with 0.1 trillion tests executed, the majority of the inferred coevolving protein domains are functionally related, and the coevolving amino acid residues are spatially coupled. Moreover, many of the coevolving positions are located at functionally important sites of proteins/protein complexes, such as the subunit linkers of superoxide dismutase, the tRNA binding sites of ribosomes, the DNA binding region of RNA polymerase, and the active and ligand binding sites of various enzymes. The results suggest sequence coevolution manifests structural and functional constraints of proteins. The intricate relations between sequence coevolution and various selective constraints are worth pursuing at a deeper level. PMID:17983264
Specification of anteroposterior cell fates in Caenorhabditis elegans by Drosophila Hox proteins.
Hunter, C P; Kenyon, C
1995-09-21
Antennapedia class homeobox (Hox) genes specify cell fates in successive anteroposterior body domains in vertebrates, insects and nematodes. The DNA-binding homeodomain sequences are very similar between vertebrate and Drosophila Hox proteins, and this similarity allows vertebrate Hox proteins to function in Drosophila. In contrast, the Caenorhabditis elegans homeodomains are substantially divergent. Further, C. elegans differs from both insects and vertebrates in having a non-segmented body as well as a distinctive mode of development that involves asymmetric early cleavages and invariant cell lineages. Here we report that, despite these differences, Drosophila Hox proteins expressed in C. elegans can substitute for C. elegans Hox proteins in the control of three different cell-fate decisions: the regulation of cell migration, the specification of serotonergic neurons, and the specification of a sensory structure. We also show that the specificity of one C. elegans Hox protein is partly determined by two amino acids that have been implicated in sequence-specific DNA binding. Together these findings suggest that factors important for target recognition by specific Hox proteins have been conserved throughout much of the animal kingdom.
Wnt-Mediated Repression via Bipartite DNA Recognition by TCF in the Drosophila Hematopoietic System
Zhang, Chen U.; Blauwkamp, Timothy A.; Burby, Peter E.; Cadigan, Ken M.
2014-01-01
The Wnt/β-catenin signaling pathway plays many important roles in animal development, tissue homeostasis and human disease. Transcription factors of the TCF family mediate many Wnt transcriptional responses, promoting signal-dependent activation or repression of target gene expression. The mechanism of this specificity is poorly understood. Previously, we demonstrated that for activated targets in Drosophila, TCF/Pangolin (the fly TCF) recognizes regulatory DNA through two DNA binding domains, with the High Mobility Group (HMG) domain binding HMG sites and the adjacent C-clamp domain binding Helper sites. Here, we report that TCF/Pangolin utilizes a similar bipartite mechanism to recognize and regulate several Wnt-repressed targets, but through HMG and Helper sites whose sequences are distinct from those found in activated targets. The type of HMG and Helper sites is sufficient to direct activation or repression of Wnt regulated cis-regulatory modules, and protease digestion studies suggest that TCF/Pangolin adopts distinct conformations when bound to either HMG-Helper site pair. This repressive mechanism occurs in the fly lymph gland, the larval hematopoietic organ, where Wnt/β-catenin signaling controls prohemocytic differentiation. Our study provides a paradigm for direct repression of target gene expression by Wnt/β-catenin signaling and allosteric regulation of a transcription factor by DNA. PMID:25144371
NASA Astrophysics Data System (ADS)
Bauer, William Joseph, Jr.
The fate of an individual cell, or even an entire organism, is often determined by minute, yet very specific differences in the conformation of a single protein species. Very often, proteins take on alternate folds or even side chain conformations to deal with different situations present within the cell. These differences can be as large as a whole domain or as subtle as the alteration of a single amino acid side chain. Yet, even these seemingly minor side chain conformational differences can determine the development of a cell type during differentiation or even dictate whether a cell will live or die. Two examples of situations where minor conformational differences within a specific protein could lead to major differences in the life cycle of a cell are described herein. The first example describes the variations seen in DNA conformations which can lead to slightly different Hox protein binding conformations responsible for recognizing biologically relevant regulatory sites. These specific differences occur in the minor groove of the bound DNA and are limited to the conformation of only two side chains. The conformation of the bound DNA, however, is not solely determined by the sequence of the DNA, as multiple sequences can result in the same DNA conformation. The second example takes place in the context of a yeast prion protein which contains a mutation that decreases the frequency at which fibrils form. While the specific interactions leading to this physiological change were not directly detected, it can be ascertained from the crystal structure that the structural changes are subtle and most likely involve another binding partner. In both cases, these conformational changes are very slight but have a profound effect on the downstream processes.
NASA Astrophysics Data System (ADS)
Oiwa, Nestor; Cordeiro, Claudette; Heermann, Dieter
2016-05-01
Instead of ATCG letter alignments, typically used in bioinformatics, we propose a new alignment method using the probability distribution function of the bottom of the occupied molecular orbital (BOMO), highest occupied molecular orbital (HOMO) and lowest unoccupied molecular orbital (LUMO). We apply the technique to transcription factors with Cys2His2 zinc fingers. These transcription factors search for binding sites, probing for the electronic patterns at the minor and major DNA grooves. The eukaryotic Cys2His2 zinc finger proteins bind to DNA ubiquitously at highly conserved domains. They are responsible for gene regulation and the spatial organization of DNA. To study and understand these zinc finger DNA-protein interactions, we use the extended ladder in the DNA model proposed by Zhu, Rasmussen, Balatsky & Bishop (2007). Considering one single spinless electron in each nucleotide π-orbital along a double DNA chain (dDNA), we find a typical pattern for the bottom of BOMO, HOMO and LUMO along the binding sites. We specifically looked at two members of the zinc finger protein family: the specificity protein 1 (SP1) and early growth response 1 (EGR1) transcription factors. When the valence band is filled, we find electrons in the purines along the nucleotide sequence, compatible with the electric charges of the binding amino acids in the SP1 and EGR1 zinc fingers.
Sequence-specific binding of counterions to B-DNA
Denisov, Vladimir P.; Halle, Bertil
2000-01-01
Recent studies by x-ray crystallography, NMR, and molecular simulations have suggested that monovalent counterions can penetrate deeply into the minor groove of B form DNA. Such groove-bound ions potentially could play an important role in AT-tract bending and groove narrowing, thereby modulating DNA function in vivo. To address this issue, we report here 23Na magnetic relaxation dispersion measurements on oligonucleotides, including difference experiments with the groove-binding drug netropsin. The exquisite sensitivity of this method to ions in long-lived and intimate association with DNA allows us to detect sequence-specific sodium ion binding in the minor groove AT tract of three B-DNA dodecamers. The sodium ion occupancy is only a few percent, however, and therefore is not likely to contribute importantly to the ensemble of B-DNA structures. We also report results of ion competition experiments, indicating that potassium, rubidium, and cesium ions bind to the minor groove with similarly weak affinity as sodium ions, whereas ammonium ion binding is somewhat stronger. The present findings are discussed in the light of previous NMR and diffraction studies of sequence-specific counterion binding to DNA. PMID:10639130
Theory on the mechanism of site-specific DNA-protein interactions in the presence of traps
NASA Astrophysics Data System (ADS)
Niranjani, G.; Murugan, R.
2016-08-01
The speed of site-specific binding of transcription factor (TF) proteins with genomic DNA seems to be strongly retarded by randomly occurring sequence traps. Traps are DNA sequences that share significant similarity with the original specific binding sites (SBSs). It is an intriguing question how the naturally occurring TFs and their SBSs are designed to manage the retarding effects of such randomly occurring traps. We develop a simple random walk model of the site-specific binding of TFs with genomic DNA in the presence of sequence traps. Our dynamical model predicts that (a) the retarding effects of traps will be minimal when the traps are arranged around the SBS such that there is a negative correlation between the binding strength of TFs with traps and the distance of traps from the SBS and (b) the retarding effects of sequence traps can be appeased by the condensed conformational state of DNA. Our computational analysis of the distribution of sequence traps around the putative binding sites of various TFs in the mouse and human genomes agrees well with the theoretical predictions. We propose that the distribution of traps can be used as an additional metric to efficiently identify the SBSs of TFs on genomic DNA.
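A minimal sketch of the kind of lattice random walk this abstract describes may help fix ideas. It is not the authors' model: the one-dimensional lattice, the reflecting boundaries, the fixed per-visit dwell penalty, and the names `search_time` and `trap_dwell` are all assumptions made for illustration.

```python
import random


def search_time(n_sites, start, target, trap_dwell, seed=0):
    """Time steps an unbiased 1D random walker needs to reach `target`.

    `trap_dwell` maps a lattice site to the extra time steps lost on each
    visit to that site (a crude, hypothetical stand-in for how strongly a
    trap sequence holds the transcription factor).
    """
    rng = random.Random(seed)
    pos, elapsed = start, 0
    while pos != target:
        step = rng.choice((-1, 1))
        # Reflecting boundaries keep the walker on the lattice.
        pos = min(max(pos + step, 0), n_sites - 1)
        elapsed += 1
        if pos != target:
            elapsed += trap_dwell.get(pos, 0)
    return elapsed


if __name__ == "__main__":
    seeds = range(200)
    no_traps = [search_time(50, 10, 25, {}, seed=s) for s in seeds]
    one_trap = [search_time(50, 10, 25, {20: 5}, seed=s) for s in seeds]
    print("mean search time without traps:", sum(no_traps) / len(no_traps))
    print("mean search time with one trap:", sum(one_trap) / len(one_trap))
```

Because the dwell penalty consumes no random numbers, the walker follows the same path for a given seed with or without traps, so the difference between the two runs isolates the delay caused by the traps. Varying the positions and dwell values in `trap_dwell` is one way to explore how trap arrangement changes the delay.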
Lee, Mei-Ling Ting; Bulyk, Martha L; Whitmore, G A; Church, George M
2002-12-01
There is considerable scientific interest in knowing the probability that a site-specific transcription factor will bind to a given DNA sequence. Microarray methods provide an effective means for assessing the binding affinities of a large number of DNA sequences as demonstrated by Bulyk et al. (2001, Proceedings of the National Academy of Sciences, USA 98, 7158-7163) in their study of the DNA-binding specificities of Zif268 zinc fingers using microarray technology. In a follow-up investigation, Bulyk, Johnson, and Church (2002, Nucleic Acids Research 30, 1255-1261) studied the interdependence of nucleotides on the binding affinities of transcription proteins. Our article is motivated by this pair of studies. We present a general statistical methodology for analyzing microarray intensity measurements reflecting DNA-protein interactions. The log probability of a protein binding to a DNA sequence on an array is modeled using a linear ANOVA model. This model is convenient because it employs familiar statistical concepts and procedures and also because it is effective for investigating the probability structure of the binding mechanism.
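The idea of a linear ANOVA model for log binding signal can be illustrated with a small main-effects sketch. This is an assumption-laden simplification, not the model from the article: it uses synthetic data, a balanced design containing every 3-bp sequence, and no interaction terms, and the names `fit_main_effects` and `predict` are hypothetical.

```python
from itertools import product

BASES = "ACGT"


def fit_main_effects(log_signal):
    """Estimate the grand mean and per-position base effects.

    `log_signal` maps every sequence of one fixed length to a log intensity.
    In a balanced design, each effect is the marginal mean minus the grand mean.
    """
    seqs = list(log_signal)
    length = len(seqs[0])
    mu = sum(log_signal.values()) / len(seqs)
    effects = []
    for i in range(length):
        position_effects = {}
        for base in BASES:
            values = [log_signal[s] for s in seqs if s[i] == base]
            position_effects[base] = sum(values) / len(values) - mu
        effects.append(position_effects)
    return mu, effects


def predict(mu, effects, seq):
    """Additive prediction: grand mean plus one effect per position."""
    return mu + sum(effects[i][base] for i, base in enumerate(seq))


# Synthetic additive data; effects sum to zero at each position.
true_mu = -2.0
true_effects = [
    {"A": 0.5, "C": -0.5, "G": 0.25, "T": -0.25},
    {"A": -1.0, "C": 1.0, "G": 0.0, "T": 0.0},
    {"A": 0.0, "C": 0.0, "G": 0.75, "T": -0.75},
]
data = {
    "".join(s): true_mu + sum(true_effects[i][b] for i, b in enumerate(s))
    for s in product(BASES, repeat=3)
}
mu, effects = fit_main_effects(data)
```

Studying the interdependence of nucleotides would require adding interaction terms between positions, which this main-effects sketch deliberately omits.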
Molecular Dynamics Simulation of Rap1 Myb-type domain in Saccharomyces cerevisiae
Mukherjee, Koel; Pandey, Dev Mani; Vidyarthi, Ambarish Saran
2012-01-01
The telomere is a nucleoprotein complex that plays an important role in stability and maintenance and consists of random repeats of species-specific motifs. In the budding yeast Saccharomyces cerevisiae, Repressor Activator Protein 1 (Rap1) is a sequence-specific protein that is involved in transcriptional regulation. Rap1 consists of three active domains: an N-terminal BRCT domain, a DNA-binding domain, and a C-terminal RCT domain. In this study, the unknown 3D structure of the Myb-type domain (61 residues) within the DNA-binding domain was modeled with Modeller7 and verified using several online bioinformatics tools (ProCheck, WhatIf, Verify3D). The dynamics of the Myb-type domain of Rap1 were studied by simulation using GROMACS software. Time-dependent interactions among the molecules were analyzed with Root Mean Square Deviation (RMSD), Radius of Gyration (Rg), and Root Mean Square Fluctuation (RMSF) plots. Motional properties in reduced dimension were also examined by Principal Component Analysis (PCA). The results indicated that Rap1 interacts with the DNA major groove through its helix-turn-helix motifs. Helix 3 was rigid and showed little fluctuation, as it interacts with the DNA major groove. Helix 2 and the N-terminus showed considerable fluctuation over the simulation time scale. PMID:23144544
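For readers unfamiliar with the three trajectory metrics named in this abstract, the following sketch shows their basic definitions. It assumes equal atomic masses and frames that are already superimposed, which is a simplification; simulation packages normally apply fitting and mass weighting, and the function names here are hypothetical.

```python
import math


def rmsd(frame_a, frame_b):
    """Root mean square deviation between two pre-aligned frames."""
    squared = sum(
        (a - b) ** 2
        for atom_a, atom_b in zip(frame_a, frame_b)
        for a, b in zip(atom_a, atom_b)
    )
    return math.sqrt(squared / len(frame_a))


def radius_of_gyration(frame):
    """Unweighted radius of gyration about the geometric center."""
    n = len(frame)
    center = [sum(atom[d] for atom in frame) / n for d in range(3)]
    squared = sum((atom[d] - center[d]) ** 2 for atom in frame for d in range(3))
    return math.sqrt(squared / n)


def rmsf(trajectory):
    """Per-atom root mean square fluctuation about the time-averaged position."""
    n_frames = len(trajectory)
    n_atoms = len(trajectory[0])
    result = []
    for i in range(n_atoms):
        mean = [sum(frame[i][d] for frame in trajectory) / n_frames for d in range(3)]
        squared = sum(
            (frame[i][d] - mean[d]) ** 2 for frame in trajectory for d in range(3)
        )
        result.append(math.sqrt(squared / n_frames))
    return result
```

RMSD and Rg give one number per frame and are usually plotted against time, whereas RMSF gives one number per atom or residue, which is how a rigid helix can be distinguished from a flexible terminus.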
The Multi-zinc Finger Protein ZNF217 Contacts DNA through a Two-finger Domain
Nunez, Noelia; Clifton, Molly M. K.; Funnell, Alister P. W.; Artuz, Crisbel; Hallal, Samantha; Quinlan, Kate G. R.; Font, Josep; Vandevenne, Marylène; Setiyaputra, Surya; Pearson, Richard C. M.; Mackay, Joel P.; Crossley, Merlin
2011-01-01
Classical C2H2 zinc finger proteins are among the most abundant transcription factors found in eukaryotes, and the mechanisms through which they recognize their target genes have been extensively investigated. In general, a tandem array of three fingers separated by characteristic TGERP links is required for sequence-specific DNA recognition. Nevertheless, a significant number of zinc finger proteins do not contain a hallmark three-finger array of this type, raising the question of whether and how they contact DNA. We have examined the multi-finger protein ZNF217, which contains eight classical zinc fingers. ZNF217 is implicated as an oncogene and in repressing the E-cadherin gene. We show that two of its zinc fingers, 6 and 7, can mediate contacts with DNA. We examine its putative recognition site in the E-cadherin promoter and demonstrate that this is a suboptimal site. NMR analysis and mutagenesis is used to define the DNA binding surface of ZNF217, and we examine the specificity of the DNA binding activity using fluorescence anisotropy titrations. Finally, sequence analysis reveals that a variety of multi-finger proteins also contain two-finger units, and our data support the idea that these may constitute a distinct subclass of DNA recognition motif. PMID:21908891
2011-01-01
Background Transcription factors (TFs) play a central role in regulating gene expression by interacting with cis-regulatory DNA elements associated with their target genes. Recent surveys have examined the DNA binding specificities of most Saccharomyces cerevisiae TFs, but a comprehensive evaluation of their data has been lacking. Results We analyzed in vitro and in vivo TF-DNA binding data reported in previous large-scale studies to generate a comprehensive, curated resource of DNA binding specificity data for all characterized S. cerevisiae TFs. Our collection comprises DNA binding site motifs and comprehensive in vitro DNA binding specificity data for all possible 8-bp sequences. Investigation of the DNA binding specificities within the basic leucine zipper (bZIP) and VHT1 regulator (VHR) TF families revealed unexpected plasticity in TF-DNA recognition: intriguingly, the VHR TFs, newly characterized by protein binding microarrays in this study, recognize bZIP-like DNA motifs, while the bZIP TF Hac1 recognizes a motif highly similar to the canonical E-box motif of basic helix-loop-helix (bHLH) TFs. We identified several TFs with distinct primary and secondary motifs, which might be associated with different regulatory functions. Finally, integrated analysis of in vivo TF binding data with protein binding microarray data lends further support for indirect DNA binding in vivo by sequence-specific TFs. Conclusions The comprehensive data in this curated collection allow for more accurate analyses of regulatory TF-DNA interactions, in-depth structural studies of TF-DNA specificity determinants, and future experimental investigations of the TFs' predicted target genes and regulatory roles. PMID:22189060
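The phrase "all possible 8-bp sequences" can be made concrete with a short enumeration. This sketch assumes that a double-stranded site and its reverse complement are counted as the same sequence; that convention and the function names are illustrative assumptions, not details taken from the abstract.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def canonical_kmers(k):
    """Return each k-mer once, merged with its reverse complement."""
    seen = set()
    for bases in product("ACGT", repeat=k):
        kmer = "".join(bases)
        # Keep the alphabetically smaller of the two strands as the key.
        seen.add(min(kmer, reverse_complement(kmer)))
    return sorted(seen)


kmers_8 = canonical_kmers(8)
print(len(kmers_8))  # 32896 = (4**8 + 4**4) / 2
```

The count follows from the 65,536 possible 8-mers, of which 256 are their own reverse complement and the rest pair up.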
Structural and Histone Binding Ability Characterizations of Human PWWP Domains
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, Hong; Zeng, Hong; Lam, Robert
2013-09-25
The PWWP domain was first identified as a structural motif of 100-130 amino acids in the WHSC1 protein and predicted to be a protein-protein interaction domain. It belongs to the Tudor domain 'Royal Family', which consists of Tudor, chromodomain, MBT and PWWP domains. While Tudor, chromodomain and MBT domains have long been known to bind methylated histones, PWWP was shown to exhibit histone binding ability only recently. The PWWP domain has been shown to be a DNA binding domain, but sequence analysis and previous structural studies show that the PWWP domain exhibits significant similarity to other 'Royal Family' members, implying that the PWWP domain has the potential to bind histones. In order to further explore the function of the PWWP domain, we used the protein family approach to determine the crystal structures of the PWWP domains from seven different human proteins. Our fluorescence polarization binding studies show that PWWP domains have weak histone binding ability, which is also confirmed by our NMR titration experiments. Furthermore, we determined the crystal structures of the BRPF1 PWWP domain in complex with H3K36me3, and HDGF2 PWWP domain in complex with H3K79me3 and H4K20me3. PWWP proteins constitute a new family of methyl lysine histone binders. The PWWP domain consists of three motifs: a canonical β-barrel core, an insertion motif between the second and third β-strands and a C-terminal α-helix bundle. Both the canonical β-barrel core and the insertion motif are directly involved in histone binding. The PWWP domain has been previously shown to be a DNA binding domain. Therefore, the PWWP domain exhibits dual functions: binding both DNA and methyllysine histones.
Modeling the interactions of the nucleotide excision repair UvrA(2) dimer with DNA.
Gantchev, Tsvetan G; Hunting, Darel J
2010-12-28
The UvrA protein initiates the DNA damage recognition process by the bacterial nucleotide excision repair (NER) system. Recently, crystallographic structures of holo-UvrA(2) dimers from two different microorganisms have been released (Protein Data Bank entries 2r6f, 2vf7, and 2vf8). However, the details of the DNA binding by UvrA(2) and other peculiarities involved in the damage recognition process remain unknown. We have undertaken a molecular modeling approach to appraise the possible modes of DNA-UvrA(2) interaction using molecular docking and short-scale guided molecular dynamics [continuum field, constrained, and/or unrestricted simulated annealing (SA)], taking into account the three-dimensional location of a series of mutation-identified UvrA residues implicated in DNA binding. The molecular docking was based on the assumptions that the UvrA(2) dimer is preformed prior to DNA binding and that no major protein conformational rearrangements, except moderate domain reorientations, are required for binding of undamaged DNA. As a first approximation, DNA was treated as a rigid ligand. From the electrostatic relief of the ventral surface of UvrA(2), we initially identified three noncollinear DNA binding paths. Each of the three resulting nucleoprotein complexes (C1, C2, and C3) was analyzed separately, including calculation of binding energies, the number and type of interaction residues (including mutated ones), and the predominant mode of translational and rotational motion of specific protein domains after SA to ensure improved DNA binding. The UvrA(2) dimer can accommodate DNA in all three orientations, albeit with different binding strengths. One of the UvrA(2)-DNA complexes (C1) fulfilled most of the requirements (high interaction energy, proximity of DNA to mutated residues, etc.) expected for a natural, high-affinity DNA binding site. This nucleoprotein presents a structural organization that is designed to clamp and bend double-stranded DNA.
We examined the binding site in more detail by docking DNAs of significantly different (AT- vs CG-enriched) sequences and by submitting the complexes to DNA-unrestricted SA. It was found that in a manner independent of the DNA sequence and applied MD protocols, UvrA(2) favors binding of a bent and unwound undamaged DNA, with a kink positioned in the proximity of the Zn3 hairpins, anticollinearly aligned at the bottom of the ventral protein surface. It is further hypothesized that the Zn3 modules play an essential role in the damage recognition process and that the apparent existence of a family of DNA binding sites might be biologically relevant. Our data should prove to be useful in rational (structure-based) mutation studies.
Regulatory Phosphorylation of Ikaros by Bruton's Tyrosine Kinase
Zhang, Jian; Ishkhanian, Rita; Uckun, Fatih M.
2013-01-01
Diminished Ikaros function has been implicated in the pathogenesis of acute lymphoblastic leukemia (ALL), the most common form of childhood cancer. Therefore, a stringent regulation of Ikaros is of paramount importance for normal lymphocyte ontogeny. Here we provide genetic and biochemical evidence for a previously unknown function of Bruton's tyrosine kinase (BTK) as a partner and posttranslational regulator of Ikaros, a zinc finger-containing DNA-binding protein that plays a pivotal role in immune homeostasis. We demonstrate that BTK phosphorylates Ikaros at unique phosphorylation sites S214 and S215 in the close vicinity of its zinc finger 4 (ZF4) within the DNA binding domain, thereby augmenting its nuclear localization and sequence-specific DNA binding activity. Our results further demonstrate that BTK-induced activating phosphorylation is critical for the optimal transcription factor function of Ikaros. PMID:23977012
He, Xiaoyuan; Wang, Liqin; Wang, Shuishu
2016-04-15
The transcriptional regulator PhoP is an essential virulence factor in Mycobacterium tuberculosis, and it presents a target for the development of new anti-tuberculosis drugs and attenuated tuberculosis vaccine strains. PhoP binds to DNA as a highly cooperative dimer by recognizing direct repeats of 7-bp motifs with a 4-bp spacer. To elucidate the PhoP-DNA binding mechanism, we determined the crystal structure of the PhoP-DNA complex. The structure revealed a tandem PhoP dimer that bound to the direct repeat. The surprising tandem arrangement of the receiver domains allowed the four domains of the PhoP dimer to form a compact structure, accounting for the strict requirement of a 4-bp spacer and the highly cooperative binding of the dimer. The PhoP-DNA interactions exclusively involved the effector domain. The sequence-recognition helix made contact with the bases of the 7-bp motif in the major groove, and the wing interacted with the adjacent minor groove. The structure provides a starting point for the elucidation of the mechanism by which PhoP regulates the virulence of M. tuberculosis and guides the design of screening platforms for PhoP inhibitors.
Two new insulator proteins, Pita and ZIPIC, target CP190 to chromatin.
Maksimenko, Oksana; Bartkuhn, Marek; Stakhov, Viacheslav; Herold, Martin; Zolotarev, Nickolay; Jox, Theresa; Buxa, Melanie K; Kirsch, Ramona; Bonchuk, Artem; Fedotova, Anna; Kyrchanova, Olga; Renkawitz, Rainer; Georgiev, Pavel
2015-01-01
Insulators are multiprotein-DNA complexes that regulate the nuclear architecture. The Drosophila CP190 protein is a cofactor for the DNA-binding insulator proteins Su(Hw), CTCF, and BEAF-32. The fact that CP190 has been found at genomic sites devoid of either of the known insulator factors has until now been unexplained. We have identified two DNA-binding zinc-finger proteins, Pita, and a new factor named ZIPIC, that interact with CP190 in vivo and in vitro at specific interaction domains. Genomic binding sites for these proteins are clustered with CP190 as well as with CTCF and BEAF-32. Model binding sites for Pita or ZIPIC demonstrate a partial enhancer-blocking activity and protect gene expression from PRE-mediated silencing. The function of the CTCF-bound MCP insulator sequence requires binding of Pita. These results identify two new insulator proteins and emphasize the unifying function of CP190, which can be recruited by many DNA-binding insulator proteins. © 2015 Maksimenko et al.; Published by Cold Spring Harbor Laboratory Press.
Glinsky, Gennadi V.
2016-01-01
Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique-to-human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8–10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P < 0.0001 in all instances).
Present analysis supports the idea that phenotypic divergence of Homo sapiens is driven by the evolution of human-specific genomic regulatory networks via at least two mechanistically distinct pathways of creation of divergent sequences of regulatory DNA: (i) recombination-associated exaptation of the highly conserved ancestral regulatory DNA segments; (ii) human-specific insertions of transposable elements. PMID:27503290
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bianchetti, Christopher M.; Bingman, Craig A.; Phillips, Jr., George N.
The thanatos (the Greek god of death)-associated protein (THAP) domain is a sequence-specific DNA-binding domain that contains a C2-CH (Cys-Xaa(2-4)-Cys-Xaa(35-50)-Cys-Xaa(2)-His) zinc finger that is similar to the DNA domain of the P element transposase from Drosophila. THAP-containing proteins have been observed in the proteome of humans, pigs, cows, chickens, zebrafish, Drosophila, C. elegans, and Xenopus. To date, there are no known THAP domain proteins in plants, yeast, or bacteria. There are 12 identified human THAP domain-containing proteins (THAP0-11). In all human THAP proteins, the THAP domain is located at the N-terminus and is ~90 residues in length. Although all of the human THAP-containing proteins have a homologous N-terminus, there is extensive variation in both the predicted structure and length of the remaining protein. Even though the exact function of these THAP proteins is not well defined, there is evidence that they play a role in cell proliferation, apoptosis, cell cycle modulation, chromatin modification, and transcriptional regulation. THAP-containing proteins have also been implicated in a number of human disease states including heart disease, neurological defects, and several types of cancers. Human THAP4 is a 577-residue protein of unknown function that is proposed to bind DNA in a sequence-specific manner similar to THAP1 and has been found to be upregulated in response to heat shock. THAP4 is expressed in a relatively uniform manner in a broad range of tissues and appears to be upregulated in lymphoma cells and highly expressed in heart cells. The C-terminal domain of THAP4 (residues 415-577), designated here as cTHAP4, is evolutionarily conserved and is observed in all known THAP4 orthologs. Several single-domain proteins lacking a THAP domain are found in plants and bacteria and show significant levels of homology to cTHAP4.
It appears that cTHAP4 belongs to a large class of proteins that have yet to be fully functionally characterized. On the basis of prior work, we predicted that cTHAP4 is composed of a heme-binding nitrobindin domain, making THAP4 the only human THAP protein predicted to bind a cofactor. Nitrobindin, a recently characterized protein from Arabidopsis thaliana, is structurally similar and exhibits nitric oxide (NO)-binding properties that resemble the heme-binding nitrophorins. Nitrophorins use a heme moiety to store, transport, and release NO in a pH-specific manner. Although the exact function of nitrobindin is not fully known, the similarities between the well-characterized nitrophorins imply a role in NO transport, sensing, or metabolism. To better elucidate the possible function of THAP4, we solved the heme-bound structure of cTHAP4 to a resolution of 1.79 Å.
Molecular basis of CENP-C association with the CENP-A nucleosome at yeast centromeres
Xiao, Hua; Wang, Feng; Wisniewski, Jan; Shaytan, Alexey K.; Ghirlando, Rodolfo; FitzGerald, Peter C.; Huang, Yingzi; Wei, Debbie; Li, Shipeng; Landsman, David; Panchenko, Anna R.; Wu, Carl
2017-01-01
Histone CENP-A-containing nucleosomes play an important role in nucleating kinetochores at centromeres for chromosome segregation. However, the molecular mechanisms by which CENP-A nucleosomes engage with kinetochore proteins are not well understood. Here, we report the finding of a new function for the budding yeast Cse4/CENP-A histone-fold domain interacting with inner kinetochore protein Mif2/CENP-C. Strikingly, we also discovered that AT-rich centromere DNA has an important role for Mif2 recruitment. Mif2 contacts one side of the nucleosome dyad, engaging with both Cse4 residues and AT-rich nucleosomal DNA. Both interactions are directed by a contiguous DNA- and histone-binding domain (DHBD) harboring the conserved CENP-C motif, an AT hook, and RK clusters (clusters enriched for arginine–lysine residues). Human CENP-C has two related DHBDs that bind preferentially to DNA sequences of higher AT content. Our findings suggest that a DNA composition-based mechanism together with residues characteristic for the CENP-A histone variant contribute to the specification of centromere identity. PMID:29074736
Leavitt, Justin C.; Gilcrease, Eddie B.; Wilson, Kassandra; Casjens, Sherwood R.
2013-01-01
Bacteriophage Sf6 DNA packaging series initiate at many locations across a 2 kbp region. Our in vivo studies show that the Sf6 small terminase subunit (TerS) protein recognizes a specific packaging (pac) site near the center of this region, that this site lies within the portion of the Sf6 gene that encodes the DNA-binding domain of TerS protein, that this domain of the TerS protein is responsible for the imprecision in Sf6 packaging initiation, and that the DNA-binding domain of TerS must be covalently attached to the domain that interacts with the rest of the packaging motor. The TerS DNA-binding domain is self-contained in that it apparently does not interact closely with the rest of the motor and it binds to a recognition site that lies within the DNA that encodes the domain. This arrangement has allowed the horizontal exchange of terS genes among phages to be very successful. PMID:23562538
Subrahmanyam, S; Cronan, J E
1999-01-21
We report an efficient and flexible in vitro method for the isolation of genomic DNA sequences that are the binding targets of a given DNA binding protein. This method takes advantage of the fact that binding of a protein to a DNA molecule generally increases the rate of migration of the protein in nondenaturing gel electrophoresis. By the use of a radioactively labeled DNA-binding protein and nonradioactive DNA coupled with PCR amplification from gel slices, we show that specific binding sites can be isolated from Escherichia coli genomic DNA. We have applied this method to isolate a binding site for FadR, a global regulator of fatty acid metabolism in E. coli. We have also isolated a second binding site for BirA, the biotin operon repressor/biotin ligase, from the E. coli genome that has a very low binding efficiency compared with the bio operator region.
A Unique HMG-Box Domain of Mouse Maelstrom Binds Structured RNA but Not Double Stranded DNA
Genzor, Pavol; Bortvin, Alex
2015-01-01
Piwi-interacting piRNAs are a major and essential class of small RNAs in the animal germ cells with a prominent role in transposon control. Efficient piRNA biogenesis and function require a cohort of proteins conserved throughout the animal kingdom. Here we studied Maelstrom (MAEL), which is essential for piRNA biogenesis and germ cell differentiation in flies and mice. MAEL contains a high mobility group (HMG)-box domain and a Maelstrom-specific domain with a presumptive RNase H-fold. We employed a combination of sequence analyses, structural and biochemical approaches to evaluate and compare nucleic acid binding of mouse MAEL HMG-box to that of canonical HMG-box domain proteins (SRY and HMGB1a). MAEL HMG-box failed to bind double-stranded (ds)DNA but bound to structured RNA. We also identified important roles of a novel cluster of arginine residues in MAEL HMG-box in these interactions. Cumulatively, our results suggest that the MAEL HMG-box domain may contribute to MAEL function in selective processing of retrotransposon RNA into piRNAs. In this regard, a cellular role of MAEL HMG-box domain is reminiscent of that of HMGB1 as a sentinel of immunogenic nucleic acids in the innate immune response. PMID:25807393
Autoinhibition of ETV6 DNA Binding Is Established by the Stability of Its Inhibitory Helix
De, Soumya; Okon, Mark; Graves, Barbara J.; McIntosh, Lawrence P.
2017-01-01
The ETS transcriptional repressor ETV6 (or TEL) is autoinhibited by an α-helix that sterically blocks its DNA-binding ETS domain. The inhibitory helix is marginally stable and unfolds when ETV6 binds to either specific or non-specific DNA. Using NMR spectroscopy, we show that folding of the inhibitory helix requires a buried charge–dipole interaction with helix H1 of the ETS domain. This interaction also contributes directly to autoinhibition by precluding a highly conserved dipole-enhanced hydrogen bond between the phosphodiester backbone of bound DNA and the N terminus of helix H1. To probe further the thermodynamic basis of autoinhibition, ETV6 variants were generated with amino acid substitutions introduced along the solvent exposed surface of the inhibitory helix. These changes were designed to increase the intrinsic helical propensity of the inhibitory helix without perturbing its packing interactions with the ETS domain. NMR-monitored amide hydrogen exchange measurements confirmed that the stability of the folded inhibitory helix increases progressively with added helix-promoting substitutions. This also results in progressively reinforced autoinhibition and decreased DNA-binding affinity. Surprisingly, locking the inhibitory helix onto the ETS domain by a disulfide bridge severely impairs, but does not abolish DNA binding. Weak interactions still occur via an interface displaced from the canonical ETS domain DNA-binding surface. Collectively, these studies establish a direct thermodynamic linkage between inhibitory helix stability and ETV6 autoinhibition, and demonstrate that helix unfolding does not strictly precede DNA binding. Modulating inhibitory helix stability provides a potential route for the in vivo regulation of ETV6 activity. PMID:26920109
Chen, Dana; Orenstein, Yaron; Golodnitsky, Rada; Pellach, Michal; Avrahami, Dorit; Wachtel, Chaim; Ovadia-Shochat, Avital; Shir-Shapira, Hila; Kedmi, Adi; Juven-Gershon, Tamar; Shamir, Ron; Gerber, Doron
2016-01-01
Transcription factors (TFs) alter gene expression in response to changes in the environment through sequence-specific interactions with the DNA. These interactions are best portrayed as a landscape of TF binding affinities. Current methods to study sequence-specific binding preferences suffer from limited dynamic range, sequence bias, lack of specificity and limited throughput. We have developed a microfluidic-based device for SELEX Affinity Landscape MAPping (SELMAP) of TF binding, which allows high-throughput measurement of 16 proteins in parallel. We used it to measure the relative affinities of Pho4, AtERF2 and Btd full-length proteins to millions of different DNA binding sites, and detected both high and low-affinity interactions in equilibrium conditions, generating a comprehensive landscape of the relative TF affinities to all possible DNA 6-mers, and even DNA 10-mers with increased sequencing depth. Low quantities of both the TFs and DNA oligomers were sufficient for obtaining high-quality results, significantly reducing experimental costs. SELMAP allows in-depth screening of hundreds of TFs, and provides a means for better understanding of the regulatory processes that govern gene expression. PMID:27628341
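To make the idea of a landscape over all possible DNA 6-mers concrete, the following is a minimal sketch of one generic way to score k-mers from sequencing reads: a log2 ratio of k-mer frequencies in bound reads versus input reads. It is not the SELMAP analysis pipeline, which the abstract does not describe; the function names, the pseudocount, and the toy reads are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product
from math import log2


def count_kmers(reads, k=6):
    """Count every k-mer (A/C/G/T only) across a list of reads."""
    counts = Counter()
    for read in reads:
        read = read.upper()
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            if set(kmer) <= set("ACGT"):
                counts[kmer] += 1
    return counts


def kmer_enrichment(bound_reads, input_reads, k=6, pseudocount=1.0):
    """Return {kmer: log2(bound frequency / input frequency)} for all 4**k k-mers.

    The pseudocount (an assumption of this sketch) keeps unseen k-mers finite.
    """
    bound = count_kmers(bound_reads, k)
    background = count_kmers(input_reads, k)
    all_kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    bound_total = sum(bound.values()) + pseudocount * len(all_kmers)
    bg_total = sum(background.values()) + pseudocount * len(all_kmers)
    return {
        kmer: log2(((bound[kmer] + pseudocount) / bound_total)
                   / ((background[kmer] + pseudocount) / bg_total))
        for kmer in all_kmers
    }


# Toy data (hypothetical reads, not from the study).
scores = kmer_enrichment(["CACGTG"] * 5, ["CACGTG", "AAAAAA"])
```

With k=6 the dictionary has 4**6 = 4096 entries, one per possible 6-mer; k-mers over-represented in the bound pool score above zero and depleted ones below.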
Tu, Chao; Tan, Yu-Hong; Shaw, Gary; Zhou, Zheng; Bai, Yawen; Luo, Ray; Ji, Xinhua
2008-01-01
Tumor suppressor p53 is a sequence-specific DNA-binding protein and its central DNA-binding domain (DBD) harbors six hotspots (Arg175, Gly245, Arg248, Arg249, Arg273 and Arg282) for human cancers. Here, the crystal structure of a low-frequency hotspot mutant, p53DBD(R282Q), is reported at 1.54 Å resolution together with the results of molecular-dynamics simulations on the basis of the structure. In addition to eliminating a salt bridge, the R282Q mutation has a significant impact on the properties of two DNA-binding loops (L1 and L3). The L1 loop is flexible in the wild type, but it is not flexible in the mutant. The L3 loop of the wild type is not flexible, whereas it assumes two conformations in the mutant. Molecular-dynamics simulations indicated that both conformations of the L3 loop are accessible under biological conditions. It is predicted that the elimination of the salt bridge and the inversion of the flexibility of L1 and L3 are directly or indirectly responsible for deactivating the tumor suppressor p53. PMID:18453682
Crystal structure of the Msx-1 homeodomain/DNA complex.
Hovde, S; Abate-Shen, C; Geiger, J H
2001-10-09
The Msx-1 homeodomain protein plays a crucial role in craniofacial, limb, and nervous system development. Homeodomain DNA-binding domains are comprised of 60 amino acids that show a high degree of evolutionary conservation. We have determined the structure of the Msx-1 homeodomain complexed to DNA at 2.2 Å resolution. The structure has an unusually well-ordered N-terminal arm with a unique trajectory across the minor groove of the DNA. DNA specificity conferred by bases flanking the core TAAT sequence is explained by well-ordered water-mediated interactions at Q50. Most interactions seen at the TAAT sequence are typical of the interactions seen in other homeodomain structures. Comparison of the Msx-1-HD structure to all other high-resolution HD-DNA complex structures indicates a remarkably well-conserved sphere of hydration between the DNA and protein in these complexes.
Human mRNA polyadenylate binding protein: evolutionary conservation of a nucleic acid binding motif.
Grange, T; de Sa, C M; Oddos, J; Pictet, R
1987-01-01
We have isolated a full-length cDNA coding for the human poly(A) binding protein. The cDNA-derived 73 kd basic translation product has the same Mr, isoelectric point and peptidic map as the poly(A) binding protein. DNA sequence analysis reveals a 70,244 dalton protein. The N-terminal part, highly homologous to the yeast poly(A) binding protein, is sufficient for poly(A) binding activity. This domain consists of a four-fold repeated unit of approximately 80 amino acids present in other nucleic acid binding proteins. In the C-terminal part there is, as in the yeast protein, a sequence of approximately 150 amino acids, rich in proline, alanine and glutamine which together account for 48% of the residues. A 2.9 kb mRNA corresponding to this cDNA has been detected in several vertebrate cell types and in Drosophila melanogaster at every developmental stage including oogenesis. PMID:2885805
Computational characterization of DNA/peptide/nanotube self assembly for bioenergy applications
NASA Astrophysics Data System (ADS)
Ortiz, Vanessa; Araki, Ruriko; Collier, Galen
2012-02-01
Multi-enzyme pathways have become a subject of increasing interest for their role in the engineering of biomimetic systems for applications including biosensors, bioelectronics, and bioenergy. The efficiencies found in natural metabolic pathways partially arise from biomolecular self-assembly of the component enzymes in an effort to avoid transport limitations. The ultimate goal of this effort is to design and build biofuel cells with efficiencies similar to those of native systems by introducing biomimetic structures that immobilize multiple enzymes in specific orientations on a bioelectrode. To achieve site-specific immobilization, the specificity of DNA-binding domains is exploited with an approach that allows any redox enzyme to be modified to site-specifically bind to double stranded (ds) DNA while retaining activity. Because of its many desirable properties, the bioelectrode of choice is single-wall carbon nanotubes (SWNTs), but little is known about dsDNA/SWNT assembly and how this might affect the activity of the DNA-binding domains. Here we evaluate the feasibility of the proposed assembly by performing atomistic molecular dynamics simulations to look at the stability and conformations adopted by dsDNA when bound to a SWNT. We also evaluate the effects of the presence of a SWNT on the stability of the complex formed by a DNA-binding domain and DNA.
A novel paired domain DNA recognition motif can mediate Pax2 repression of gene transcription.
Håvik, B; Ragnhildstveit, E; Lorens, J B; Saelemyr, K; Fauske, O; Knudsen, L K; Fjose, A
1999-12-20
The paired domain (PD) is an evolutionarily conserved DNA-binding domain encoded by the Pax gene family of developmental regulators. The Pax proteins are transcription factors and are involved in a variety of processes such as brain development, patterning of the central nervous system (CNS), and B-cell development. In this report we demonstrate that the zebrafish Pax2 PD can interact with a novel type of DNA sequence in vitro, the triple-A motif, consisting of a heptameric nucleotide sequence G/CAAACA/TC with an invariant core of three adjacent adenosines. This recognition sequence was found to be conserved in known natural Pax5 repressor elements involved in controlling the expression of the p53 and J-chain genes. By identifying similar high affinity binding sites in potential target genes of the Pax2 protein, including the pax2 gene itself, we obtained further evidence that the triple-A sites are biologically significant. The putative natural target sites also provide a basis for defining an extended consensus recognition sequence. In addition, we observed in transformation assays a direct correlation between Pax2 repressor activity and the presence of triple-A sites. The results suggest that a transcriptional regulatory function of Pax proteins can be modulated by PD binding to different categories of target sequences. Copyright 1999 Academic Press.
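The heptamer G/CAAACA/TC quoted above can be read as the pattern [G or C]-A-A-A-C-[A or T]-C. As a minimal sketch of how candidate triple-A sites might be located in a sequence, the snippet below scans both strands with a regular expression. This is only an illustration of the written consensus, not the authors' search procedure, and the function names and example sequences are hypothetical.

```python
import re

# Heptamer G/C-A-A-A-C-A/T-C; the lookahead lets overlapping hits be reported.
TRIPLE_A = re.compile(r"(?=([GC]AAAC[AT]C))")
MOTIF_LEN = 7
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_triple_a_sites(seq):
    """Return sorted (start, strand, match) tuples.

    start is the 0-based position of the site on the forward strand,
    regardless of which strand carries the motif.
    """
    seq = seq.upper()
    hits = [(m.start(), "+", m.group(1)) for m in TRIPLE_A.finditer(seq)]
    rc = reverse_complement(seq)
    for m in TRIPLE_A.finditer(rc):
        # Map the reverse-strand coordinate back onto the forward strand.
        start = len(seq) - (m.start() + MOTIF_LEN)
        hits.append((start, "-", m.group(1)))
    return sorted(hits)


forward_hits = find_triple_a_sites("TTGAAACACGG")
reverse_hits = find_triple_a_sites("AAGTGTTTCAA")
```

A plain pattern match of this kind says nothing about binding affinity; it only flags positions that fit the written consensus.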
Structural basis of Bloom syndrome (BS) causing mutations in the BLM helicase domain.
Rong, S. B.; Väliaho, J.; Vihinen, M.
2000-01-01
BACKGROUND: Bloom syndrome (BS) is characterized by mutations within the BLM gene. The Bloom syndrome protein (BLM) has similarity to the RecQ subfamily of DNA helicases, which contain seven conserved helicase domains and share significant sequence and structural similarity with the Rep and PcrA DNA helicases. We modeled the three-dimensional structure of the BLM helicase domain to analyze the structural basis of BS-causing mutations. MATERIALS AND METHODS: The sequence alignment was performed for RecQ DNA helicases and Rep and PcrA helicases. The crystal structure of PcrA helicase (PDB entry 3PJR) was used as the template for modeling the BLM helicase domain. The model was used to infer the function of BLM and to analyze the effect of the mutations. RESULTS: The structural model with good stereochemistry of the BLM helicase domain contains two subdomains, 1A and 2A. The electrostatic potential of the model is highly negative over most of the surface, except for the cleft between subdomains 1A and 2A which is similar to the template protein. The ATP-binding site is located inside the model between subdomains 1A and 2A; whereas, the DNA-binding region is situated at the surface cleft, with positive potential between 1A and 2A. CONCLUSIONS: The three-dimensional structure of the BLM helicase domain was modeled and applied to interpret BS-causing mutations. The mutation I841T is likely to weaken DNA binding, while the mutations C891R, C901Y, and Q672R presumably disturb the ATP binding. In addition, other critical positions are discussed. PMID:10965492
Alexandrov, Boian S; Fukuyo, Yayoi; Lange, Martin; Horikoshi, Nobuo; Gelev, Vladimir; Rasmussen, Kim Ø; Bishop, Alan R; Usheva, Anny
2012-11-01
The genome-wide mapping of the major gene expression regulators, the transcription factors (TFs) and their DNA binding sites, is of great importance for describing cellular behavior and phenotypic diversity. Presently, the methods for prediction of genomic TF binding produce a large number of false positives, most likely due to insufficient description of the physicochemical mechanisms of protein-DNA binding. Growing evidence suggests that, in the cell, the double-stranded DNA (dsDNA) is subject to local transient strand separations (breathing) that contribute to genomic functions. By using site-specific chromatin immunoprecipitations, gel shifts, BIOBASE data, and our model that accurately describes the melting behavior and breathing dynamics of dsDNA, we report a specific DNA breathing profile found at YY1 binding sites in cells. We find that the genomic flanking sequence variations and SNPs may exert long-range effects on DNA dynamics and predetermine YY1 binding. The ubiquitous TF YY1 has a fundamental role in essential biological processes by activating, initiating or repressing transcription depending upon the sequence context it binds. We anticipate that consensus binding sequences together with the related DNA dynamics profile may significantly improve the accuracy of prediction of genomic TF binding sites and TF binding-related functional SNPs.
Non-B-Form DNA Is Enriched at Centromeres
Henikoff, Steven
2018-01-01
Animal and plant centromeres are embedded in repetitive “satellite” DNA, but are thought to be epigenetically specified. To define genetic characteristics of centromeres, we surveyed satellite DNA from diverse eukaryotes and identified variation in <10-bp dyad symmetries predicted to adopt non-B-form conformations. Organisms lacking centromeric dyad symmetries had binding sites for sequence-specific DNA-binding proteins with DNA-bending activity. For example, human and mouse centromeres are depleted for dyad symmetries, but are enriched for non-B-form DNA and are associated with binding sites for the conserved DNA-binding protein CENP-B, which is required for artificial centromere function but is paradoxically nonessential. We also detected dyad symmetries and predicted non-B-form DNA structures at neocentromeres, which form at ectopic loci. We propose that centromeres form at non-B-form DNA because of dyad symmetries or are strengthened by sequence-specific DNA binding proteins. This may resolve the CENP-B paradox and provide a general basis for centromere specification. PMID:29365169
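As a rough illustration of what a short dyad symmetry is, the sketch below reports windows that are identical to their own reverse complement, the simplest case (a perfect inverted repeat with no spacer). The abstract does not state how dyad symmetries were scored in the study, so the perfect-match criterion, the 4 to 8 bp window range, and the function names are assumptions of this example.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case A/C/G/T sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_dyad_symmetries(seq, min_len=4, max_len=8):
    """Return (start, length, window) for each window equal to its reverse complement.

    Only even lengths are scanned: an odd-length window cannot match its own
    reverse complement, because the central base would have to pair with itself.
    """
    seq = seq.upper()
    hits = []
    for length in range(min_len, max_len + 1, 2):
        for start in range(len(seq) - length + 1):
            window = seq[start:start + length]
            if set(window) <= set("ACGT") and window == reverse_complement(window):
                hits.append((start, length, window))
    return hits


# Hypothetical test sequence containing nested perfect dyads.
dyads = find_dyad_symmetries("TTGAATTCAA")
```

For the test sequence, the nested hits are AATT, GAATTC and TGAATTCA, all centred on the same axis of symmetry.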
Leonard, D A; Rajaram, N; Kerppola, T K
1997-05-13
Interactions among transcription factors that bind to separate sequence elements require bending of the intervening DNA and juxtaposition of interacting molecular surfaces in an appropriate orientation. Here, we examine the effects of single amino acid substitutions adjacent to the basic regions of Fos and Jun as well as changes in sequences flanking the AP-1 site on DNA bending. Substitution of charged amino acid residues at positions adjacent to the basic DNA-binding domains of Fos and Jun altered DNA bending. The change in DNA bending was directly proportional to the change in net charge for all heterodimeric combinations between these proteins. Fos and Jun induced distinct DNA bends at different binding sites. Exchange of a single base pair outside of the region contacted in the x-ray crystal structure altered DNA bending. Substitution of base pairs flanking the AP-1 site had converse effects on the opposite directions of DNA bending induced by homodimers and heterodimers. These results suggest that Fos and Jun induce DNA bending in part through electrostatic interactions between amino acid residues adjacent to the basic region and base pairs flanking the AP-1 site. DNA bending by Fos and Jun at inverted binding sites indicated that heterodimers bind to the AP-1 site in a preferred orientation. Mutation of a conserved arginine within the basic regions of Fos and transversion of the central C:G base pair in the AP-1 site to G:C had complementary effects on the orientation of heterodimer binding and DNA bending. The conformational variability of the Fos-Jun-AP-1 complex may contribute to its functional versatility at different promoters.
Robinson, Clifford R.; Sligar, Stephen G.
1998-01-01
Restriction endonucleases such as EcoRI bind and cleave DNA with great specificity and represent a paradigm for protein–DNA interactions and molecular recognition. Using osmotic pressure to induce water release, we demonstrate the participation of bound waters in the sequence discrimination of substrate DNA by EcoRI. Changes in solvation can play a critical role in directing sequence-specific DNA binding by EcoRI and are also crucial in assisting site discrimination during catalysis. By measuring the volume change for complex formation, we show that at the cognate sequence (GAATTC) EcoRI binding releases about 70 fewer water molecules than binding at an alternate DNA sequence (TAATTC), which differs by a single base pair. EcoRI complexation with nonspecific DNA releases substantially less water than either of these specific complexes. In cognate substrates (GAATTC) kcat decreases as osmotic pressure is increased, indicating the binding of about 30 water molecules accompanies the cleavage reaction. For the alternate substrate (TAATTC), release of about 40 water molecules accompanies the reaction, indicated by a dramatic acceleration of the rate when osmotic pressure is raised. These large differences in solvation effects demonstrate that water molecules can be key players in the molecular recognition process during both association and catalytic phases of the EcoRI reaction, acting to change the specificity of the enzyme. For both the protein–DNA complex and the transition state, there may be substantial conformational differences between cognate and alternate sites, accompanied by significant alterations in hydration and solvent accessibility. PMID:9482860
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio
2016-03-09
The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequences in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.
2015-01-01
We report a dual illumination, single-molecule imaging strategy to dissect directly and in real-time the correlation between nanometer-scale domain motion of a DNA repair protein and its interaction with individual DNA substrates. The strategy was applied to XPD, an FeS cluster-containing DNA repair helicase. Conformational dynamics was assessed via FeS-mediated quenching of a fluorophore site-specifically incorporated into XPD. Simultaneously, binding of DNA molecules labeled with a spectrally distinct fluorophore was detected by colocalization of the DNA- and protein-derived signals. We show that XPD undergoes thermally driven conformational transitions that manifest in spatial separation of its two auxiliary domains. DNA binding does not strictly enforce a specific conformation. Interaction with a cognate DNA damage, however, stabilizes the compact conformation of XPD by increasing the weighted average lifetime of this state by 140% relative to an undamaged DNA. Our imaging strategy will be a valuable tool to study other FeS-containing nucleic acid processing enzymes. PMID:25204359
Proteolytic dissection of Zab, the Z-DNA-binding domain of human ADAR1
NASA Technical Reports Server (NTRS)
Schwartz, T.; Lowenhaupt, K.; Kim, Y. G.; Li, L.; Brown, B. A. 2nd; Herbert, A.; Rich, A.
1999-01-01
Zalpha is a peptide motif that binds to Z-DNA with high affinity. This motif binds to alternating dC-dG sequences stabilized in the Z-conformation by means of bromination or supercoiling, but not to B-DNA. Zalpha is part of the N-terminal region of double-stranded RNA adenosine deaminase (ADAR1), a candidate enzyme for nuclear pre-mRNA editing in mammals. Zalpha is conserved in ADAR1 from many species; in each case, there is a second similar motif, Zbeta, separated from Zalpha by a more divergent linker. To investigate the structure-function relationship of Zalpha, its domain structure was studied by limited proteolysis. Proteolytic profiles indicated that Zalpha is part of a domain, Zab, of 229 amino acids (residues 133-361 in human ADAR1). This domain contains both Zalpha and Zbeta as well as a tandem repeat of a 49-amino acid linker module. Prolonged proteolysis revealed a minimal core domain of 77 amino acids (positions 133-209), containing only Zalpha, which is sufficient to bind left-handed Z-DNA; however, the substrate binding is strikingly different from that of Zab. The second motif, Zbeta, retains its structural integrity only in the context of Zab and does not bind Z-DNA as a separate entity. These results suggest that Zalpha and Zbeta act as a single bipartite domain. In the presence of substrate DNA, Zab becomes more resistant to proteases, suggesting that it adopts a more rigid structure when bound to its substrate, possibly with conformational changes in parts of the protein.
Rajasekar, Karthik V.; Lovering, Andrew L.; Dancea, Felician; Scott, David J.; Harris, Sarah A.; Bingle, Lewis E.H.; Roessle, Manfred; Thomas, Christopher M.; Hyde, Eva I.; White, Scott A.
2016-01-01
The IncP (Incompatibility group P) plasmids are important carriers in the spread of antibiotic resistance across Gram-negative bacteria. Gene expression in the IncP-1 plasmids is stringently controlled by a network of four global repressors, KorA, KorB, TrbA and KorC, interacting cooperatively. Intriguingly, KorA and KorB can act as co-repressors at varying distances between their operators, even when they are moved to be on opposite sides of the DNA. KorA is a homodimer of 101-amino acid subunits, each folding into an N-terminal DNA-binding domain and a C-terminal dimerization domain. In this study, we have determined the structures of the free KorA repressor and two complexes each bound to a 20-bp palindromic DNA duplex containing its consensus operator sequence. Using a combination of X-ray crystallography, nuclear magnetic resonance spectroscopy, SAXS and molecular dynamics calculations, we show that the linker between the two domains is very flexible and the protein remains highly mobile in the presence of DNA. This flexibility allows the DNA-binding domains of the dimer to straddle the operator DNA on binding and is likely to be important in cooperative binding to KorB. Unexpectedly, the C-terminal domain of KorA is structurally similar to the dimerization domain of the tumour suppressor p53. PMID:27016739
Cloning and analysis of DnaJ family members in the silkworm, Bombyx mori.
Li, Yinü; Bu, Cuiyu; Li, Tiantian; Wang, Shibao; Jiang, Feng; Yi, Yongzhu; Yang, Huipeng; Zhang, Zhifang
2016-01-15
Heat shock proteins (Hsps) are involved in a variety of critical biological functions, including protein folding, degradation, and translocation and macromolecule assembly, and act as molecular chaperones during periods of stress by binding to other proteins. Using expressed sequence tag (EST) and silkworm (Bombyx mori) transcriptome databases, we identified 27 cDNA sequences encoding the conserved J domain, which is found in DnaJ-type Hsps. Of the 27 J domain-containing sequences, 25 were complete cDNA sequences. We divided them into three types according to the number and presence of conserved domains. By analyzing the gene structures, intron numbers, and conserved domains and constructing a phylogenetic tree, we found that the DnaJ family had undergone convergent evolution, obtaining new domains to expand the diversity of its family members. The acquisition of the new DnaJ domains most likely occurred prior to the evolutionary divergence of prokaryotes and eukaryotes. The expression of DnaJ genes in the silkworm was generally higher in the fat body. The tissue distribution of DnaJ1 proteins was detected by western blotting, demonstrating that in the fifth-instar larvae, the DnaJ1 proteins were expressed at their highest levels in hemocytes, followed by the fat body and head. We also found that the DnaJ1 transcripts were likely differentially translated in different tissues. Using immunofluorescence cytochemistry, we revealed that in the blood cells, DnaJ1 was mainly localized in the cytoplasm. Copyright © 2015 Elsevier B.V. All rights reserved.
Multiple structure-intrinsic disorder interactions regulate and coordinate Hox protein function
NASA Astrophysics Data System (ADS)
Bondos, Sarah
During animal development, Hox transcription factors determine the fate of developing tissues to generate diverse organs and appendages. Hox proteins are famous for their bizarre mutant phenotypes, such as replacing antennae with legs. Clearly, the functions of individual Hox proteins must be distinct and reliable in vivo, or the organism risks malformation or death. However, within the Hox protein family, the DNA-binding homeodomains are highly conserved and the amino acids that contact DNA are nearly invariant. These observations raise the question: How do different Hox proteins correctly identify their distinct target genes using a common DNA binding domain? One possible means to modulate DNA binding is through the influence of the non-homeodomain protein regions, which differ significantly among Hox proteins. However, genetic approaches never detected intra-protein interactions, and early biochemical attempts were hindered because the special features of "intrinsically disordered" sequences were not appreciated. We propose the first-ever structural model of a Hox protein to explain how specific contacts between distant, intrinsically disordered regions of the protein and the homeodomain regulate DNA binding and coordinate this activity with other Hox molecular functions.
Han, Le; Pandian, Ganesh N; Chandran, Anandhakumar; Sato, Shinsuke; Taniguchi, Junichi; Kashiwazaki, Gengo; Sawatani, Yoshito; Hashiya, Kaori; Bando, Toshikazu; Xu, Yufang; Qian, Xuhong; Sugiyama, Hiroshi
2015-07-20
Synthetic dual-function ligands targeting specific DNA sequences and histone-modifying enzymes were applied to achieve regulatory control over multi-gene networks in living cells. Unlike the broad array of targeting small molecules for histone deacetylases (HDACs), few modulators are known for histone acetyltransferases (HATs), which play a central role in transcriptional control. As a novel chemical approach to induce selective HAT-regulated genes, we conjugated a DNA-binding domain (DBD) "I" to N-(4-chloro-3-trifluoromethyl-phenyl)-2-ethoxy-benzamide (CTB), an artificial HAT activator. In vitro enzyme activity assays and microarray studies were used to demonstrate that distinct functional small molecules could be transformed to have identical bioactivity when conjugated with a targeting DBD. This proof-of-concept synthetic strategy validates the switchable functions of HDACs and HATs in gene regulation and provides a molecular basis for developing versatile bioactive ligands. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stella, Stefano; University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen; Molina, Rafael
Crystal structures of BurrH and the BurrH–DNA complex are reported. DNA editing offers new possibilities in synthetic biology and biomedicine for modulation or modification of cellular functions to organisms. However, inaccuracy in this process may lead to genome damage. To address this important problem, a strategy allowing specific gene modification has been achieved through the addition, removal or exchange of DNA sequences using customized proteins and the endogenous DNA-repair machinery. Therefore, the engineering of specific protein–DNA interactions in protein scaffolds is key to providing ‘toolkits’ for precise genome modification or regulation of gene expression. In a search for putative DNA-binding domains, BurrH, a protein that recognizes a 19 bp DNA target, was identified. Here, its apo and DNA-bound crystal structures are reported, revealing a central region containing 19 repeats of a helix–loop–helix modular domain (BurrH domain; BuD), which identifies the DNA target by a single residue-to-nucleotide code, thus facilitating its redesign for gene targeting. New DNA-binding specificities have been engineered in this template, showing that BuD-derived nucleases (BuDNs) induce high levels of gene targeting in a locus of the human haemoglobin β (HBB) gene close to mutations responsible for sickle-cell anaemia. Hence, the unique combination of high efficiency and specificity of the BuD arrays can push forward diverse genome-modification approaches for cell or organism redesign, opening new avenues for gene editing.
USDA-ARS's Scientific Manuscript database
Antibody engineering requires the identification of antigen binding domains or variable regions (VR) unique to each antibody. It is the VR that define the unique antigen binding properties and proper sequence identification is essential for functional evaluation and performance of recombinant antibo...
Hurst, Sarah J; Han, Min Su; Lytton-Jean, Abigail K R; Mirkin, Chad A
2007-09-15
We have developed a novel competition assay that uses a gold nanoparticle (Au NP)-based, high-throughput colorimetric approach to screen the sequence selectivity of DNA-binding molecules. This assay hinges on the observation that the melting behavior of DNA-functionalized Au NP aggregates is sensitive to the concentration of the DNA-binding molecule in solution. When short, oligomeric hairpin DNA sequences were added to a reaction solution consisting of DNA-functionalized Au NP aggregates and DNA-binding molecules, these molecules may either bind to the Au NP aggregate interconnects or the hairpin stems based on their relative affinity for each. This relative affinity can be measured as a change in the melting temperature (Tm) of the DNA-modified Au NP aggregates in solution. As a proof of concept, we evaluated the selectivity of 4',6-diamidino-2-phenylindole (an AT-specific binder), ethidium bromide (a nonspecific binder), and chromomycin A (a GC-specific binder) for six sequences of hairpin DNA having different numbers of AT pairs in a five-base pair variable stem region. Our assay accurately and easily confirmed the known trends in selectivity for the DNA binders in question without the use of complicated instrumentation. This novel assay will be useful in assessing large libraries of potential drug candidates that work by binding DNA to form a drug/DNA complex.
Nag, Ronita; Maity, Manas Kanti; Dasgupta, Maitrayee
2005-11-01
The ABA responsive ABI3 and the auxin responsive ARF family of transcription factors bind the CATGCATG (Sph) and TGTCTC core motifs in ABA and auxin response elements (ABRE and AuxRE), respectively. Several lines of evidence indicate that ABI3s also act downstream of auxin. Because the DNA-binding domain of ABI3s shows significant overlap with ARFs, we enquired whether auxin responsiveness through ABI3s could be mediated by their binding to canonical AuxREs. Investigations were undertaken through in vitro gel mobility shift assays (GMSA) using the DNA binding domain B3 of PvAlf (Phaseolus vulgaris ABI3 like factor) and upstream regions of auxin responsive gene GH3 (-267 to -141) and ABA responsive gene Em (-316 to -146) harboring AuxRE and ABRE, respectively. We demonstrate that the B3 domain of PvAlf could bind AuxRE only when B3 was associated with its flanking domain B2 (B2B3). Such a strict requirement for the B2 domain was not observed with ABRE, where B3 could bind with or without being associated with B2. This dual specificity in DNA binding of ABI3s was also demonstrated with nuclear extracts of cultured cells of Arachis hypogaea. Supershift analysis of ABRE and AuxRE bound nuclear proteins with antibodies raised against B2B3 domains of PvAlf revealed that ABI3 associated complexes were detectable in association with both cis elements. Competition GMSA confirmed that the same complexes bind ABRE and AuxRE. This dual specificity of ABI3 like factors in DNA binding targeted to natural promoters responsive to ABA and auxin suggests that they have a potential role in conferring crosstalk between these two phytohormones.
Suzuki, Toru; Muto, Shinsuke; Miyamoto, Saku; Aizawa, Kenichi; Horikoshi, Masami; Nagai, Ryozo
2003-08-01
Transcription involves molecular interactions between general and regulatory transcription factors with further regulation by protein-protein interactions (e.g. transcriptional cofactors). Here we describe functional interaction between DNA-binding transcription factor and histone chaperone. Affinity purification of factors interacting with the DNA-binding domain of the transcription factor Sp1 showed Sp1 to interact with the histone chaperone TAF-I, both alpha and beta isoforms. This interaction was specific as Sp1 did not interact with another histone chaperone CIA nor did other tested DNA-binding regulatory factors (MyoD, NFkappaB, p53) interact with TAF-I. Interaction of Sp1 and TAF-I occurs both in vitro and in vivo. Interaction with TAF-I results in inhibition of DNA-binding, and also likely as a result of such, inhibition of promoter activation by Sp1. Collectively, we describe interaction between DNA-binding transcription factor and histone chaperone which results in negative regulation of the former. This novel regulatory interaction advances our understanding of the mechanisms of eukaryotic transcription through DNA-binding regulatory transcription factors by protein-protein interactions, and also shows the DNA-binding domain to mediate important regulatory interactions.
Martín-Blanco, E; Kornberg, T B
1993-11-16
Degenerate oligodeoxyribonucleotides were designed for both ends of the DNA-binding domain of members of the nuclear receptor superfamily. PCR amplified Drosophila melanogaster DNA was purified and cloned (DR plasmids). Genomic lambda DASH clones were identified at high stringency with an amplified DR-78 plasmid DNA and isolated. The partial sequence shows a very probable open reading frame which would encode a peptide highly homologous to members of the thyroid hormone-retinoic acid-vitamin D receptor subfamily. The fragment corresponds to a single copy gene and was mapped at position 78D of chromosome three by in situ hybridization.
Toward a General Approach for RNA-Templated Hierarchical Assembly of Split-Proteins
Furman, Jennifer L.; Badran, Ahmed H.; Ajulo, Oluyomi; Porter, Jason R.; Stains, Cliff I.; Segal, David J.; Ghosh, Indraneel
2010-01-01
The ability to conditionally turn on a signal or induce a function in the presence of a user-defined RNA target has potential applications in medicine and synthetic biology. Although sequence-specific pumilio repeat proteins can target a limited set of ssRNA sequences, there are no general methods for targeting ssRNA with designed proteins. As a first step toward RNA recognition, we utilized the RNA binding domain of argonaute, implicated in RNA interference, for specifically targeting generic 2-nucleotide, 3' overhangs of any dsRNA. We tested the reassembly of a split-luciferase enzyme guided by argonaute-mediated recognition of newly generated nucleotide overhangs when ssRNA is targeted by a designed complementary guide sequence. This approach was successful when argonaute was utilized in conjunction with a pumilio repeat and expanded the scope of potential ssRNA targets. However, targeting any desired ssRNA remained elusive as two argonaute domains provided minimal reassembled split-luciferase. We next designed and tested a second hierarchical assembly, wherein ssDNA guides are appended to DNA hairpins that serve as a scaffold for high affinity zinc fingers attached to split-luciferase. In the presence of a ssRNA target containing adjacent sequences complementary to the guides, the hairpins are brought into proximity, allowing for zinc finger binding and concomitant reassembly of the fragmented luciferase. The scope of this new approach was validated by specifically targeting RNA encoding VEGF, hDM2, and HER2. These approaches provide potentially general design paradigms for the conditional reassembly of fragmented proteins in the presence of any desired ssRNA target. PMID:20681585
MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.
Ozaki, Haruka; Iwasaki, Wataru
2016-08-01
As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.
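The idea of describing every k-mer in peak-centered ChIP-Seq sequences can be illustrated with a small sketch. This is not the MOCCS implementation (which is in Perl and R); it is a simplified Python illustration, and the function names, the `min_count` parameter, and the "mean distance to the sequence center" score are assumptions chosen only for the example.

```python
from collections import defaultdict


def kmer_center_offsets(sequences, k):
    """Map each k-mer to the distances between its midpoint and the sequence center."""
    offsets = defaultdict(list)
    for seq in sequences:
        seq = seq.upper()
        center = (len(seq) - 1) / 2
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            # Skip windows containing ambiguous bases such as N.
            if set(kmer) <= set("ACGT"):
                midpoint = i + (k - 1) / 2
                offsets[kmer].append(abs(midpoint - center))
    return offsets


def rank_by_centrality(sequences, k, min_count=2):
    """Return (k-mer, mean distance to center, count), most central first."""
    offsets = kmer_center_offsets(sequences, k)
    scored = [
        (kmer, sum(dists) / len(dists), len(dists))
        for kmer, dists in offsets.items()
        if len(dists) >= min_count
    ]
    return sorted(scored, key=lambda item: (item[1], -item[2], item[0]))


if __name__ == "__main__":
    # Hypothetical peak-centered sequences, invented for illustration.
    peaks = ["AACATGTT", "GGCATGCC", "TTCATGAA"]
    for kmer, mean_dist, count in rank_by_centrality(peaks, k=4):
        print(kmer, mean_dist, count)
```

A k-mer that recurs near the center of many peak sequences ranks first here; real ChIP-Seq analyses would also need to handle reverse complements and statistical significance, which this sketch omits.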
NASA Technical Reports Server (NTRS)
Hsieh, H. L.; Tong, C. G.; Thomas, C.; Roux, S. J.
1996-01-01
A cDNA encoding a 47 kDa nucleoside triphosphatase (NTPase) that is associated with the chromatin of pea nuclei has been cloned and sequenced. The translated sequence of the cDNA includes several domains predicted by known biochemical properties of the enzyme, including five motifs characteristic of the ATP-binding domain of many proteins, several potential casein kinase II phosphorylation sites, a helix-turn-helix region characteristic of DNA-binding proteins, and a potential calmodulin-binding domain. The deduced primary structure also includes an N-terminal sequence that is a predicted signal peptide and an internal sequence that could serve as a bipartite-type nuclear localization signal. Both in situ immunocytochemistry of pea plumules and immunoblots of purified cell fractions indicate that most of the immunodetectable NTPase is within the nucleus, a compartment proteins typically reach through nuclear pores rather than through the endoplasmic reticulum pathway. The translated sequence has some similarity to that of human lamin C, but not high enough to account for the earlier observation that IgG against human lamin C binds to the NTPase in immunoblots. Northern blot analysis shows that the NTPase mRNA is strongly expressed in etiolated plumules, but only poorly or not at all in the leaf and stem tissues of light-grown plants. Accumulation of NTPase mRNA in etiolated seedlings is stimulated by brief treatments with both red and far-red light, as is characteristic of very low-fluence phytochrome responses. Southern blotting with pea genomic DNA indicates the NTPase is likely to be encoded by a single gene.
Maira, S M; Wurtz, J M; Wasylyk, B
1996-01-01
The three ternary complex factors (TCFs), Net (ERP/SAP-2), ELK-1 and SAP-1, are highly related ets oncogene family members that participate in the response of the cell to Ras and growth signals. Understanding the different roles of these factors will provide insights into how the signals result in coordinate regulation of the cell. We show that Net inhibits transcription under basal conditions, in which SAP-1a is inactive and ELK-1 stimulates. Repression is mediated by the NID, the Net Inhibitory Domain of about 50 amino acids, which autoregulates the Net protein and also inhibits when it is isolated in a heterologous fusion protein. Net is particularly sensitive to Ras activation. Ras activates Net through the C-domain, which is conserved between the three TCFs, and the NID is an efficient inhibitor of Ras activation. The NID, as well as more C-terminal sequences, inhibit DNA binding. Net is more refractory to DNA binding than the other TCFs, possibly due to the presence of multiple inhibitory elements. The NID may adopt a helix-loop-helix (HLH) structure, as evidenced by homology to other HLH motifs, structure predictions, model building and mutagenesis of critical residues. The sequence resemblance with myogenic factors suggested that Net may form complexes with the same partners. Indeed, we found that Net can interact in vivo with the basic HLH factor, E47. We propose that Net is regulated at the level of its latent DNA-binding activity by protein interactions and/or phosphorylation. Net may form complexes with HLH proteins as well as SRF on specific promoter sequences. The identification of the novel inhibitory domain provides a new inroad into exploring the different roles of the ternary complex factors in growth control and transformation. PMID:8918463
Reddy, G; Nanduri, V B; Basu, A; Modak, M J
1991-08-20
Treatment of murine leukemia virus reverse transcriptase (MuLV RT) with potassium ferrate, an oxidizing agent known to oxidize amino acids involved in phosphate binding domains of proteins, results in the irreversible inactivation of both the DNA polymerase and the RNase H activities. Significant protection from ferrate-mediated inactivation is observed in the presence of template-primer but not in the presence of substrate deoxynucleoside triphosphates. Furthermore, ferrate-treated enzyme loses template-primer binding activity as judged by UV-mediated cross-linking of radiolabeled DNA. Comparative tryptic peptide mapping by reverse-phase HPLC of native and ferrate-oxidized enzyme indicated the presence of two new peptides eluting at 38 and 57 min and a significant loss of a peptide eluting at 74 min. Purification, amino acid composition, and sequencing of these affected peptides revealed that they correspond to amino acid residues 285-295, 630-640, and 586-599, respectively, in the primary amino acid sequence of MuLV RT. These results indicate that the domains constituted by the above peptides are important for the template-primer binding function in MuLV RT. Peptide I is located in the polymerase domain whereas peptides II and III are located in the RNase H domain. Amino acid sequence analysis of peptides I and II suggested Lys-285 and Cys-635 as the probable sites of ferrate action.
Rogers, Julia M; Bulyk, Martha L
2018-04-25
Sequence-specific transcription factors (TFs) bind short DNA sequences in the genome to regulate the expression of target genes. In the last decade, numerous technical advances have enabled the determination of the DNA-binding specificities of many of these factors. Large-scale screens of many TFs enabled the creation of databases of TF DNA-binding specificities, typically represented as position weight matrices (PWMs). Although great progress has been made in determining and predicting binding specificities systematically, there are still many surprises to be found when studying a particular TF's interactions with DNA in detail. Paralogous TFs' binding specificities can differ in subtle ways, in a manner that is not immediately apparent from looking at their PWMs. These differences affect gene regulatory outputs and enable TFs to rewire transcriptional networks over evolutionary time. This review discusses recent observations made in the study of TF-DNA interactions that highlight the importance of continued in-depth analysis of TF-DNA interactions and their inherent complexity. This article is categorized under: Biological Mechanisms > Regulatory Biology. © 2018 Wiley Periodicals, Inc.
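Since the review refers to position weight matrices (PWMs) as the usual representation of binding specificity, a minimal sketch of how a log-odds PWM can be built and used to score a sequence may help. The aligned sites, the pseudocount, and the uniform 0.25 background are assumptions for illustration only; they are not taken from the article.

```python
import math


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Build a log2-odds PWM from equal-length aligned binding sites."""
    width = len(sites[0])
    total = len(sites) + 4 * pseudocount
    pwm = []
    for pos in range(width):
        column = [site[pos] for site in sites]
        pwm.append({
            base: math.log2(((column.count(base) + pseudocount) / total) / background)
            for base in "ACGT"
        })
    return pwm


def score(pwm, seq):
    """Sum per-position log-odds scores; positions are treated as independent."""
    return sum(column[base] for column, base in zip(pwm, seq))


def best_hit(pwm, seq):
    """Return (score, start index) of the highest-scoring window in seq."""
    width = len(pwm)
    return max((score(pwm, seq[i:i + width]), i) for i in range(len(seq) - width + 1))


if __name__ == "__main__":
    # Hypothetical aligned sites, invented for illustration.
    pwm = build_pwm(["ACGT", "ACGT", "ACGA"])
    print(best_hit(pwm, "GGACGTGG"))
```

Because each position contributes independently to the score, a PWM cannot express dependencies between positions, which is one reason two paralogous TFs with near-identical PWMs may still differ in the sequences they prefer.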
Interactions between the R2R3-MYB Transcription Factor, AtMYB61, and Target DNA Binding Sites
Prouse, Michael B.; Campbell, Malcolm M.
2013-01-01
Despite the prominent roles played by R2R3-MYB transcription factors in the regulation of plant gene expression, little is known about the details of how these proteins interact with their DNA targets. For example, while Arabidopsis thaliana R2R3-MYB protein AtMYB61 is known to alter transcript abundance of a specific set of target genes, little is known about the specific DNA sequences to which AtMYB61 binds. To address this gap in knowledge, DNA sequences bound by AtMYB61 were identified using cyclic amplification and selection of targets (CASTing). The DNA targets identified using this approach corresponded to AC elements, sequences enriched in adenosine and cytosine nucleotides. The preferred target sequence that bound with the greatest affinity to AtMYB61 recombinant protein was ACCTAC, the AC-I element. Mutational analyses based on the AC-I element showed that ACC nucleotides in the AC-I element served as the core recognition motif, critical for AtMYB61 binding. Molecular modelling predicted interactions between AtMYB61 amino acid residues and corresponding nucleotides in the DNA targets. The affinity between AtMYB61 and specific target DNA sequences did not correlate with AtMYB61-driven transcriptional activation with each of the target sequences. CASTing-selected motifs were found in the regulatory regions of genes previously shown to be regulated by AtMYB61. Taken together, these findings are consistent with the hypothesis that AtMYB61 regulates transcription from specific cis-acting AC elements in vivo. The results shed light on the specifics of DNA binding by an important family of plant-specific transcriptional regulators. PMID:23741471
Inhibition of HMGA2 binding to DNA by netropsin
Miao, Yi; Cui, Tengjiao; Leng, Fenfei; Wilson, W. David
2008-01-01
The design of small synthetic molecules that can be used to affect gene expression is an area of active interest for development of agents in therapeutic and biotechnology applications. Many compounds that target the minor groove in AT sequences in DNA are well characterized and are promising reagents for use as modulators of protein-DNA complexes. The mammalian high mobility group transcriptional factor, HMGA2, also targets the DNA minor groove and plays critical roles in disease processes from cancer to obesity. Biosensor-surface plasmon resonance methods were used to monitor HMGA2 binding to target sites on immobilized DNA and a competition assay for inhibition of the HMGA2-DNA complex was designed. HMGA2 binds strongly to the DNA through AT hook domains with KD values of 20 - 30 nM depending on the DNA sequence. The well-characterized minor groove binder, netropsin, was used to develop and test the assay. The compound has two binding sites in the protein-DNA interaction sequence and this provides an advantage for inhibition. An equation for analysis of results when the inhibitor has two binding sites in the biopolymer recognition surface is presented with the results. The assay provides a platform for discovery of HMGA2 inhibitors. PMID:18023407
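To give a feel for what KD values of 20 - 30 nM imply, the sketch below evaluates the textbook single-site binding isotherm, fraction bound = [P] / (KD + [P]). This is not the two-site inhibition equation that the abstract says is presented in the article, which is not reproduced in this listing; the function name and the protein concentrations are assumptions for illustration.

```python
def fraction_bound(protein_nM, kd_nM):
    """Fraction of DNA sites occupied under a simple 1:1 binding model.

    Assumes free protein concentration is approximately the total concentration.
    """
    return protein_nM / (kd_nM + protein_nM)


if __name__ == "__main__":
    for kd in (20.0, 30.0):
        for protein in (10.0, 30.0, 100.0):
            occupancy = fraction_bound(protein, kd)
            print(f"KD={kd:.0f} nM, [HMGA2]={protein:.0f} nM -> {occupancy:.2f}")
```

Under this simple model, half of the sites are occupied when the protein concentration equals KD.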
DNA Binding of Centromere Protein C (CENPC) Is Stabilized by Single-Stranded RNA
Du, Yaqing; Topp, Christopher N.; Dawe, R. Kelly
2010-01-01
Centromeres are the attachment points between the genome and the cytoskeleton: centromeres bind to kinetochores, which in turn bind to spindles and move chromosomes. Paradoxically, the DNA sequence of centromeres has little or no role in perpetuating kinetochores. As such they are striking examples of genetic information being transmitted in a manner that is independent of DNA sequence (epigenetically). It has been found that RNA transcribed from centromeres remains bound within the kinetochore region, and this local population of RNA is thought to be part of the epigenetic marking system. Here we carried out a genetic and biochemical study of maize CENPC, a key inner kinetochore protein. We show that DNA binding is conferred by a localized region 122 amino acids long, and that the DNA-binding reaction is exquisitely sensitive to single-stranded RNA. Long, single-stranded nucleic acids strongly promote the binding of CENPC to DNA, and the types of RNAs that stabilize DNA binding match in size and character the RNAs present on kinetochores in vivo. Removal or replacement of the binding module with HIV integrase binding domain causes a partial delocalization of CENPC in vivo. The data suggest that centromeric RNA helps to recruit CENPC to the inner kinetochore by altering its DNA binding characteristics. PMID:20140237
Reddy, M K; Nair, S; Singh, B N; Mudgil, Y; Tewari, K K; Sopory, S K
2001-01-24
We report the cloning and sequencing of both cDNA and genomic DNA of a 33 kDa chloroplast ribonucleoprotein (33RNP) from pea. The analysis of the predicted amino acid sequence of the cDNA clone revealed that the encoded protein contains two RNA binding domains, including the conserved consensus ribonucleoprotein sequences CS-RNP1 and CS-RNP2, on the C-terminus half and the presence of a putative transit peptide sequence in the N-terminus region. The phylogenetic and multiple sequence alignment analysis of pea chloroplast RNP along with RNPs reported from the other plant sources revealed that the pea 33RNP is very closely related to Nicotiana sylvestris 31RNP and 28RNP and also to 31RNP and 28RNP of Arabidopsis and spinach, respectively. The pea 33RNP was expressed in Escherichia coli and purified to homogeneity. The in vitro import of precursor protein into chloroplasts confirmed that the N-terminus putative transit peptide is a bona fide transit peptide and 33RNP is localized in the chloroplast. The nucleic acid-binding properties of the recombinant protein, as revealed by South-Western analysis, showed that 33RNP has higher binding affinity for poly (U) and oligo dT than for ssDNA and dsDNA. The steady state transcript level was higher in leaves than in roots and the expression of this gene is light stimulated. Sequence analysis of the genomic clone revealed that the gene contains four exons and three introns. We have also isolated and analyzed the 5' flanking region of the pea 33RNP gene.
Mechanistic insights into phosphoprotein-binding FHA domains.
Liang, Xiangyang; Van Doren, Steven R
2008-08-01
FHA domains are protein modules that switch signals in diverse biological pathways by monitoring the phosphorylation of threonine residues of target proteins. As part of the effort to gain insight into cellular avoidance of cancer, FHA domains involved in the cellular response to DNA damage have been especially well-characterized. The complete protein where the FHA domain resides and the interaction partners determine the nature of the signaling. Thus, a key biochemical question is how do FHA domains pick out their partners from among thousands of alternatives in the cell? This Account discusses the structure, affinity, and specificity of FHA domains and the formation of their functional structure. Although FHA domains share sequence identity at only five loop residues, they all fold into a beta-sandwich of two beta-sheets. The conserved arginine and serine of the recognition loops recognize the phosphorylation of the threonine targeted. Side chains emanating from loops that join beta-strand 4 with 5, 6 with 7, or 10 with 11 make specific contacts with amino acids of the ligand that tailor sequence preferences. Many FHA domains choose a partner in extended conformation, somewhat according to the residue three after the phosphothreonine in sequence (pT + 3 position). One group of FHA domains chooses a short carboxylate-containing side chain at pT + 3. Another group chooses a long, branched aliphatic side chain. A third group prefers other hydrophobic or uncharged polar side chains at pT + 3. However, another FHA domain instead chooses on the basis of pT - 2, pT - 3, and pT + 1 positions. An FHA domain from a marker of human cancer instead chooses a much longer protein fragment that adds a beta-strand to its beta-sheet and that presents hydrophobic residues from a novel helix to the usual recognition surface.
This novel recognition site and more remote sites for the binding of other types of protein partners were predicted for the entire family of FHA domains by a bioinformatics approach. The phosphopeptide-dependent dynamics of an FHA domain, SH2 domain, and PTB domain suggest a common theme: rigid, preformed binding surfaces support van der Waals contacts that provide favorable binding enthalpy. Despite the lack of pronounced conformational changes in FHA domains linked to binding events, more subtle adjustments may be possible. In the one FHA domain tested, phosphothreonine peptide binding is accompanied by increased flexibility just outside the binding site and increased rigidity across the beta-sandwich. The folding of the same FHA domain progresses through near-native intermediates that stabilize the recognition loops in the center of the phosphoprotein-binding surface; this may promote rigidity in the interface and affinity for targets phosphorylated on threonine.
Jain, Deepti
2015-07-01
The GntR family of transcription regulators constitutes one of the most abundant families of transcription factors. These modulators are involved in a variety of mechanisms controlling various metabolic processes. GntR family members are typically two-domain proteins with a smaller N-terminus domain (NTD) with conserved architecture of winged-helix-turn-helix (wHTH) for DNA binding and a larger C-terminus domain (CTD) or the effector binding domain which is also involved in oligomerization. Interestingly, the CTD shows structural heterogeneity depending upon the type of effector molecule that it binds and displays structural homology to various classes of proteins. Binding of the effector molecule to the CTD brings about a conformational change in the transcription factor such that its affinity for its cognate DNA sequence is altered. This review summarizes the structural information available on the members of GntR family and discusses the common features of the DNA binding and operator recognition within the family. The variation in the allosteric mechanism employed by the members of this family is also discussed. © 2015 International Union of Biochemistry and Molecular Biology.
Sun, D; Leung, C L; Liem, R K
2001-01-01
MACF (microtubule actin cross-linking factor) is a large, 608-kDa protein that can associate with both actin microfilaments and microtubules (MTs). Structurally, MACF can be divided into 3 domains: an N-terminal domain that contains both a calponin type actin-binding domain and a plakin domain; a rod domain that is composed of 23 dystrophin-like spectrin repeats; and a C-terminal domain that includes two EF-hand calcium-binding motifs, as well as a region that is homologous to two related proteins, GAR22 and Gas2. We have previously demonstrated that the C-terminal domain of MACF binds to MTs, although no homology was observed between this domain and other known microtubule-binding proteins. In this report, we describe the characterization of this microtubule-binding domain of MACF by transient transfection studies and in vitro binding assays. We found that the C-terminus of MACF contains at least two microtubule-binding regions, a GAR domain and a domain containing glycine-serine-arginine (GSR) repeats. In transfected cells, the GAR domain bound to and partially stabilized MTs to depolymerization by nocodazole. The GSR-containing domain caused MTs to form bundles that are still sensitive to nocodazole-induced depolymerization. When present together, these two domains acted in concert to bundle MTs and render them stable to nocodazole treatment. Recently, a study has shown that the N-terminal half of the plakin domain (called the M1 domain) of MACF also binds MTs. We therefore examined the microtubule binding ability of the M1 domain in the context of the entire plakin domain with and without the remaining N-terminal regions of two different MACF isoforms. Interestingly, in the presence of the surrounding sequences, the M1 domain did not bind MTs. 
In addition to MACF, cDNA sequences encoding the GAR and GSR-containing domains are also found in the partial human EST clone KIAA0728, which has high sequence homology to the 3' end of the MACF cDNA; hence, we refer to it as MACF2. The C-terminal domain of mouse MACF2 was cloned and characterized. The microtubule-binding properties of the MACF2 C-terminal domain are similar to those of MACF. The GAR domain was originally found in Gas2 protein and here we show that it can associate with MTs in transfected cells. Plectin and desmoplakin have GSR-containing domains at their C-termini and we further demonstrate that the GSR-containing domain of plectin, but not desmoplakin, can bind to MTs in vivo.
Structure, organization and expression of common carp (Cyprinus carpio L.) SLP-76 gene.
Huang, Rong; Sun, Xiao-Feng; Hu, Wei; Wang, Ya-Ping; Guo, Qiong-Lin
2008-05-01
SLP-76 is an important member of the SLP-76 family of adapters, and it plays a key role in TCR signaling and T cell function. A partial cDNA sequence of SLP-76 of common carp (Cyprinus carpio L.) was isolated from a thymus cDNA library by the method of suppression subtractive hybridization (SSH). Subsequently, the full length cDNA of carp SLP-76 was obtained by means of 3' RACE and 5' RACE, respectively. The full length cDNA of carp SLP-76 was 2007 bp, consisting of a 5'-terminal untranslated region (UTR) of 285 bp, a 3'-terminal UTR of 240 bp, and an open reading frame of 1482 bp. Sequence comparison showed that the deduced amino acid sequence of carp SLP-76 had an overall similarity of 34-73% to that of homologues from other species, and it was composed of an NH2-terminal domain, a central proline-rich domain, and a C-terminal SH2 domain. Amino acid sequence analysis indicated the existence of a Gads binding site R-X-X-K, a 10-aa-long sequence which binds to the SH3 domain of LCK in vitro, and three conserved tyrosine-containing sequences in the NH2-terminal domain. Then we used PCR to obtain a genomic DNA which covers the entire coding region of carp SLP-76. In the 9.2 kb-long genomic sequence, twenty-one exons and twenty introns were identified. RT-PCR results showed that carp SLP-76 was expressed predominantly in hematopoietic tissues, and was upregulated in thymus tissue of four-month-old carp compared to one-year-old carp. RT-PCR and virtual northern hybridization results showed that carp SLP-76 was also upregulated in thymus tissue of GH transgenic carp at the age of four months. These results suggest that the expression level of SLP-76 gene may be related to thymocyte development in teleosts.
Nomura, Yusuke; Tanaka, Yoichiro; Fukunaga, Jun-ichi; Fujiwara, Kazuya; Chiba, Manabu; Iibuchi, Hiroaki; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Kozu, Tomoko; Sakamoto, Taiichi
2013-12-01
AML1/RUNX1 is an essential transcription factor involved in the differentiation of hematopoietic cells. AML1 binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. In a previous study, we obtained RNA aptamers against the AML1 Runt domain by systematic evolution of ligands by exponential enrichment and revealed that RNA aptamers exhibit higher affinity for the Runt domain than that for RDE and possess the 5'-GCGMGNN-3' and 5'-N'N'CCAC-3' conserved motif (M: A or C; N and N' form Watson-Crick base pairs) that is important for Runt domain binding. In this study, to understand the structural basis of recognition of the Runt domain by the aptamer motif, the solution structure of a 22-mer RNA was determined using nuclear magnetic resonance. The motif contains the AH(+)-C mismatch and base triple and adopts an unusual backbone structure. Structural analysis of the aptamer motif indicated that the aptamer binds to the Runt domain by mimicking the RDE sequence and structure. Our data should enhance the understanding of the structural basis of DNA mimicry by RNA molecules.
Roux-Rouquie, M; Marilley, M
2000-09-15
We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, bring circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X. laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed.
Kachhap, Sangita; Singh, Balvinder
2015-01-01
In most homeodomain-DNA complexes, glutamine or lysine is present at the 50th position and interacts with the 5th and 6th nucleotides of the core recognition region. Molecular dynamics simulations of the Msx-1-DNA complex (Q50-TG) and its variant complexes, namely the specific complex (Q50K-CC) and the nonspecific complexes with a mutation in the DNA (Q50-CC) or in the protein (Q50K-TG), have been carried out. Analysis of protein-DNA interactions and of DNA structure in the specific and nonspecific complexes shows that amino acid residues use the sequence-dependent shape of DNA to interact. The binding free energies of all four complexes were analysed to define the role of the amino acid residue at the 50th position in binding strength, considering the effect of variation in DNA on the stability of protein-DNA complexes. The order of stability of the protein-DNA complexes shows that specific complexes are more stable than nonspecific ones. Decomposition analysis shows that N-terminal amino acid residues contribute maximally to the binding free energy of protein-DNA complexes. Among the specific protein-DNA complexes, K50 contributes more than Q50 towards the binding free energy in the respective complexes. The sequence dependence of the local conformation of DNA enables Q50/Q50K to make hydrogen bonds with nucleotide(s) of DNA. The changes in the amino acid sequence of the protein are accommodated and stabilized around the TAAT core region of DNA having variation in nucleotides.
Autoinhibitory mechanisms of ERG studied by molecular dynamics simulations
NASA Astrophysics Data System (ADS)
Lu, Yan; Salsbury, Freddie R.
2015-01-01
ERG, an ETS-family transcription factor, acts as a regulator of differentiation of early hematopoietic cells. It contains an autoinhibitory domain, which negatively regulates DNA-binding. The mechanism of autoinhibition is still elusive. To understand the mechanism, we study the dynamical properties of ERG protein by molecular dynamics simulations. These simulations suggest that DNA binding autoinhibition associates with the internal dynamics of ERG. Specifically, we find that (1) the N-C terminal correlation in the inhibited ERG is larger than that in uninhibited ERG, which contributes to the autoinhibition of DNA-binding; (2) DNA-binding changes the property of the N-C terminal correlation from being anti-correlated to correlated, that is, it changes the relative direction of the correlated motions; and (3) for the Ets-domain specifically, the inhibited and uninhibited forms exhibit essentially the same dynamics, but the binding of the DNA decreases the fluctuation of the Ets-domain. We also find from PCA analysis that the three systems, even with quite different dynamics, do have highly similar free energy surfaces, indicating that they share similar conformations.
Recombinant soluble adenovirus receptor
Freimuth, Paul I.
2002-01-01
Disclosed are isolated polypeptides from human CAR (coxsackievirus and adenovirus receptor) protein which bind adenovirus. Specifically disclosed are amino acid sequences which correspond to adenovirus binding domain D1 and the entire extracellular domain of human CAR protein comprising D1 and D2. In other aspects, the disclosure relates to nucleic acid sequences encoding these domains as well as expression vectors which encode the domains and bacterial cells containing such vectors. Also disclosed is an isolated fusion protein comprised of the D1 polypeptide sequence fused to a polypeptide sequence which facilitates folding of D1 into a functional, soluble domain when expressed in bacteria. The functional D1 domain finds application, for example, in a therapeutic method for treating a patient infected with a virus which binds to D1, and also in a method for identifying an antiviral compound which interferes with viral attachment. Also included is a method for specifically targeting a cell for infection by a virus which binds to D1.
Predicting the binding preference of transcription factors to individual DNA k-mers.
Alleyne, Trevis M; Peña-Castillo, Lourdes; Badis, Gwenael; Talukder, Shaheynoor; Berger, Michael F; Gehrke, Andrew R; Philippakis, Anthony A; Bulyk, Martha L; Morris, Quaid D; Hughes, Timothy R
2009-04-15
Recognition of specific DNA sequences is a central mechanism by which transcription factors (TFs) control gene expression. Many TF-binding preferences, however, are unknown or poorly characterized, in part due to the difficulty associated with determining their specificity experimentally, and an incomplete understanding of the mechanisms governing sequence specificity. New techniques that estimate the affinity of TFs to all possible k-mers provide a new opportunity to study DNA-protein interaction mechanisms, and may facilitate inference of binding preferences for members of a given TF family when such information is available for other family members. We employed a new dataset consisting of the relative preferences of mouse homeodomains for all eight-base DNA sequences in order to ask how well we can predict the binding profiles of homeodomains when only their protein sequences are given. We evaluated a panel of standard statistical inference techniques, as well as variations of the protein features considered. Nearest neighbour among functionally important residues emerged among the most effective methods. Our results underscore the complexity of TF-DNA recognition, and suggest a rational approach for future analyses of TF families.
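The nearest-neighbour idea mentioned in this abstract can be sketched in a few lines. This is a minimal illustration under our own assumptions, not the authors' implementation: the similarity measure (fraction of identical residues at pre-aligned positions), the function names, and all data values are invented for the example.

```python
def residue_identity(a, b):
    """Fraction of aligned positions at which two residue strings agree."""
    if len(a) != len(b):
        raise ValueError("residue strings must be aligned to the same length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def predict_profile(query_residues, characterized):
    """Return the name and k-mer profile of the most similar characterized protein.

    `characterized` maps a protein name to (key_residues, kmer_scores).
    """
    best_name = max(
        characterized,
        key=lambda name: residue_identity(query_residues, characterized[name][0]),
    )
    return best_name, characterized[best_name][1]


# Toy, invented data: residues at hypothetical DNA-contacting positions and
# made-up relative preferences for two 8-mers. Not taken from the study.
characterized = {
    "HD_A": ("RVQNK", {"TAATTAAT": 0.9, "TAATCCGG": 0.1}),
    "HD_B": ("RVKNK", {"TAATTAAT": 0.2, "TAATCCGG": 0.8}),
}

# The query shares 4 of 5 residues with HD_B and 3 of 5 with HD_A.
name, profile = predict_profile("RIKNK", characterized)
print(name, profile)
```

The uncharacterized protein simply inherits the profile of its closest characterized relative; restricting the comparison to functionally important residues is what the abstract reports as effective.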
Lee, Dong-Kee; Suh, Dongchul; Edenberg, Howard J; Hur, Man-Wook
2002-07-26
The POZ domain is a protein-protein interaction motif that is found in many transcription factors, which are important for development, oncogenesis, apoptosis, and transcription repression. We cloned the POZ domain transcription factor, FBI-1, that recognizes the cis-element (bp -38 to -22) located just upstream of the core Sp1 binding sites (bp -22 to +22) of the ADH5/FDH minimal promoter (bp -38 to +61) in vitro and in vivo, as revealed by electrophoretic mobility shift assay and chromatin immunoprecipitation assay. The ADH5/FDH minimal promoter is potently repressed by FBI-1. Glutathione S-transferase fusion protein pull-down showed that the POZ domains of FBI-1, Plzf, and Bcl-6 directly interact with the zinc finger DNA binding domain of Sp1. DNase I footprinting assays showed that the interaction prevents binding of Sp1 to the GC boxes of the ADH5/FDH promoter. Gal4-POZ domain fusions targeted proximal to the GC boxes repress transcription of the Gal4 upstream activator sequence-Sp1-adenovirus major late promoter. Our data suggest that the POZ domain represses transcription by interacting with Sp1 zinc fingers and by interfering with the DNA binding activity of Sp1.
Structure of the MLL CXXC domain – DNA complex and its functional role in MLL-AF9 leukemia
Cierpicki, Tomasz; Risner, Laurie E.; Grembecka, Jolanta; Lukasik, Stephen M.; Popovic, Relja; Omonkowska, Monika; Shultis, David S.; Zeleznik-Le, Nancy J.; Bushweller, John H.
2010-01-01
MLL (Mixed Lineage Leukemia) is the target of chromosomal translocations which cause leukemias with poor prognosis. All leukemogenic MLL fusion proteins retain the CXXC domain which binds to nonmethylated CpG DNA. We present the solution structure of the MLL CXXC domain in complex with DNA, showing for the first time how the CXXC domain distinguishes nonmethylated from methylated CpG DNA. Based on the structure, we designed point mutations which disrupt DNA binding. Introduction of these mutations into MLL-AF9 results in increased DNA methylation of specific CpG nucleotides in Hoxa9, increased H3K9 methylation, decreased expression of Hoxa9 locus transcripts, loss of immortalization potential, and inability to induce leukemia in mice. These results establish that DNA binding by the CXXC domain and protection against DNA methylation is essential for MLL fusion leukemia. They also provide support for this interaction as a potential target for therapeutic intervention. PMID:20010842
Cooperative interactions between paired domain and homeodomain.
Jun, S; Desplan, C
1996-09-01
The Pax proteins are a family of transcriptional regulators involved in many developmental processes in all higher eukaryotes. They are characterized by the presence of a paired domain (PD), a bipartite DNA binding domain composed of two helix-turn-helix (HTH) motifs, the PAI and RED domains. The PD is also often associated with a homeodomain (HD) which is itself able to form homo- and hetero-dimers on DNA. Many of these proteins therefore contain three HTH motifs each able to recognize DNA. However, all PDs recognize highly related DNA sequences, and most HDs also recognize almost identical sites. We show here that different Pax proteins use multiple combinations of their HTHs to recognize several types of target sites. For instance, the Drosophila Paired protein can bind, in vitro, exclusively through its PAI domain, or through a dimer of its HD, or through cooperative interaction between PAI domain and HD. However, prd function in vivo requires the synergistic action of both the PAI domain and the HD. Pax proteins with only a PD appear to require both PAI and RED domains, while a Pax-6 isoform and a new Pax protein, Lune, may rely on the RED domain and HD. We propose a model by which Pax proteins recognize different target genes in vivo through various combinations of their DNA binding domains, thus expanding their recognition repertoire.
Pavlov, Andrey R.; Pavlova, Nadejda V.; Kozyavkin, Sergei A.; Slesarev, Alexei I.
2012-01-01
We have previously introduced a general kinetic approach for comparative study of processivity, thermostability, and resistance to inhibitors of DNA polymerases (Pavlov et al., (2002) Proc. Natl. Acad. Sci. USA 99, 13510–13515). The proposed method was successfully applied to characterize hybrid DNA polymerases created by fusing catalytic DNA polymerase domains with various non-specific DNA binding domains. Here we use the developed kinetic analysis to assess basic parameters of DNA elongation by DNA polymerases and to further study the interdomain interactions in both previously constructed and new chimeric DNA polymerases. We show that connecting Helix-hairpin-Helix (HhH) domains to catalytic polymerase domains can increase thermostability, not only of DNA polymerases from extremely thermophilic species, but also of the enzyme from a facultative thermophilic bacterium Bacillus stearothermophilus. We also demonstrate that addition of TopoV HhH domains extends efficient DNA synthesis by chimerical polymerases up to 105°C by maintaining processivity of DNA synthesis at high temperatures. We also found that reversible high-temperature structural transitions in DNA polymerases decrease the rates of binding of these enzymes to the templates. Furthermore, activation energies and pre-exponential factors of the Arrhenius equation suggest that the mechanism of electrostatic enhancement of diffusion-controlled association plays a minor role in binding templates to DNA polymerases. PMID:22320201
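For readers unfamiliar with how activation energies and pre-exponential factors are extracted, the Arrhenius equation k = A exp(-Ea/(RT)) becomes a straight line when ln k is plotted against 1/T, with slope -Ea/R and intercept ln A. The sketch below fits that line by ordinary least squares. It is a generic illustration, not the authors' kinetic analysis; the function name and the synthetic rate data are assumptions made for the example.

```python
import math

R = 8.314  # gas constant, J/(mol*K)


def arrhenius_fit(temps_k, rates):
    """Least-squares fit of ln k = ln A - Ea/(R*T); returns (Ea in J/mol, A)."""
    xs = [1.0 / t for t in temps_k]
    ys = [math.log(k) for k in rates]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return -slope * R, math.exp(intercept)


# Synthetic rates generated from assumed parameters (not data from the study).
true_ea, true_a = 50_000.0, 1.0e8
temps = [323.15, 333.15, 343.15, 353.15]
rates = [true_a * math.exp(-true_ea / (R * t)) for t in temps]

ea, a = arrhenius_fit(temps, rates)
print(round(ea), f"{a:.3g}")
```

Because the synthetic rates are noise-free, the fit should recover the assumed parameters; with real measurements the scatter of the points around the line indicates how well a single activation energy describes the process.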
Hishiki, Asami; Hara, Kodai; Ikegaya, Yuzu; Yokoyama, Hideshi; Shimizu, Toshiyuki; Sato, Mamoru; Hashimoto, Hiroshi
2015-05-22
HLTF (helicase-like transcription factor) is a yeast RAD5 homolog found in mammals. HLTF has E3 ubiquitin ligase and DNA helicase activities, and plays a pivotal role in the template-switching pathway of DNA damage tolerance. HLTF has an N-terminal domain that has been designated the HIRAN (HIP116 and RAD5 N-terminal) domain. The HIRAN domain has been hypothesized to play a role in DNA binding; however, the structural basis of, and functional evidence for, the HIRAN domain in DNA binding has remained unclear. Here we show for the first time the crystal structure of the HIRAN domain of human HLTF in complex with DNA. The HIRAN domain is composed of six β-strands and two α-helices, forming an OB-fold structure frequently found in ssDNA-binding proteins, including in replication factor A (RPA). Interestingly, this study reveals that the HIRAN domain interacts not only with single-stranded DNA but also with duplex DNA. Furthermore, the structure unexpectedly clarifies that the HIRAN domain specifically recognizes the 3'-end of DNA. These results suggest that the HIRAN domain functions as a sensor to the 3'-end of the primer strand at the stalled replication fork and that the domain facilitates fork regression. HLTF is recruited to a damaged site through the HIRAN domain at the stalled replication fork. Furthermore, our results have implications for the mechanism of template switching. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Vlahovicek, K; Munteanu, M G; Pongor, S
1999-01-01
Bending is a local conformational micropolymorphism of DNA in which the original B-DNA structure is only distorted but not extensively modified. Bending can be predicted by simple static geometry models as well as by a recently developed elastic model that incorporates sequence-dependent anisotropic bendability (SDAB). The SDAB model qualitatively explains phenomena including affinity of protein binding, kinking, as well as sequence-dependent vibrational properties of DNA. The vibrational properties of DNA segments can be studied by finite element analysis of a model subjected to an initial bending moment. The frequency spectrum is obtained by applying Fourier analysis to the displacement values in the time domain. This analysis shows that the spectrum of the bending vibrations depends quite sensitively on the sequence; for example, the spectrum of a curved sequence is characteristically different from the spectrum of straight sequence motifs of identical basepair composition. Curvature distributions are genome-specific, and pronounced differences are found between protein-coding and regulatory regions; that is, sites of extreme curvature and/or bendability are less frequent in protein-coding regions. A WWW server is set up for the prediction of curvature and generation of 3D models from DNA sequences (http://www.icgeb.trieste.it/dna).
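The step from time-domain displacements to a frequency spectrum can be illustrated with a plain discrete Fourier transform. The sketch below is generic and does not reproduce the finite element model or the SDAB parameters; the function name, the sampling rate, and the synthetic single-frequency trace are assumptions chosen so that the result is easy to check.

```python
import cmath
import math


def amplitude_spectrum(samples, dt):
    """Return (frequencies, amplitudes) from a plain discrete Fourier transform."""
    n = len(samples)
    freqs, amps = [], []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(samples)
        )
        freqs.append(k / (n * dt))
        amps.append(abs(coeff) / n)
    return freqs, amps


# Synthetic displacement trace: one oscillation at 5 cycles per time unit,
# sampled 64 times per time unit. Illustrative only, not DNA vibration data.
dt = 1.0 / 64
signal = [math.sin(2 * math.pi * 5.0 * i * dt) for i in range(64)]

freqs, amps = amplitude_spectrum(signal, dt)
peak = max(range(len(amps)), key=amps.__getitem__)
print(freqs[peak])
```

For a trace dominated by one vibration, the spectrum has a single peak at that frequency; a sequence-dependent change in bending stiffness would show up as a shift or redistribution of such peaks.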
Aamir, Mohd; Singh, Vinay K.; Meena, Mukesh; Upadhyay, Ram S.; Gupta, Vijai K.; Singh, Surendra
2017-01-01
The WRKY transcription factors (TFs) play a crucial role in plant defense response against various abiotic and biotic stresses. The role of WRKY3 and WRKY4 genes in plant defense response against necrotrophic pathogens is well reported. However, their functional annotation in tomato is largely unknown. In the present work, we have characterized the structural and functional attributes of the two identified tomato WRKY transcription factors, WRKY3 (SlWRKY3) and WRKY4 (SlWRKY4), using computational approaches. Arabidopsis WRKY3 (AtWRKY3: NP_178433) and WRKY4 (AtWRKY4: NP_172849) protein sequences were retrieved from the TAIR database, and a protein BLAST search was done to find their sequence homologs in tomato. Sequence alignment, phylogenetic classification, and motif composition analysis revealed remarkable sequence variation between these two WRKYs. The tomato WRKY3 and WRKY4 cluster with Solanum pennellii, showing a monophyletic origin and evolution from their wild homolog. The functional domain regions responsible for sequence-specific DNA binding in both proteins were modeled [using AtWRKY4 (PDB ID: 1WJ2) and AtWRKY1 (PDB ID: 2AYD) as template protein structures] through homology modeling using Discovery Studio 3.0. The generated models were further evaluated for their accuracy and reliability based on qualitative and quantitative parameters. The modeled proteins were found to satisfy all the crucial energy parameters and showed acceptable Ramachandran statistics when compared to the experimentally resolved NMR solution structures and/or X-ray crystal structures (templates). The superimposition of the functional WRKY domains from SlWRKY3 and SlWRKY4 revealed remarkable structural similarity. The sequence-specific DNA binding of the two WRKYs was explored through DNA-protein interaction using the Hex Docking server.
The interaction studies found that SlWRKY4 binds the W-box DNA through WRKYGQK together with Tyr408, Arg409, and Lys419, with the initial flanking sequences also involved in binding. In contrast, SlWRKY3 interacted through RKYGQK along with residues from the zinc finger motifs. Protein-protein interaction studies were done using STRING version 10.0 to explore all the possible protein partners involved in associative functional interaction networks. The Gene Ontology enrichment analysis revealed the functional dimension and characterized the identified WRKYs based on their functional annotation. PMID:28611792
Nucleic acids encoding human trithorax protein
Evans, Glen A.; Djabali, Malek; Selleri, Licia; Parry, Pauline
2001-01-01
In accordance with the present invention, there is provided an isolated peptide having the characteristics of human trithorax protein (as well as DNA encoding same, antisense DNA derived therefrom and antagonists therefor). The invention peptide is characterized by having a DNA binding domain comprising multiple zinc fingers and at least 40% amino acid identity with respect to the DNA binding domain of Drosophila trithorax protein and at least 70% conserved sequence with respect to the DNA binding domain of Drosophila trithorax protein, and wherein said peptide is encoded by a gene located at chromosome 11 of the human genome at q23. Also provided are methods for the treatment of subject(s) suffering from immunodeficiency, developmental abnormality, inherited disease, or cancer by administering to said subject a therapeutically effective amount of one of the above-described agents (i.e., peptide, antagonist therefor, DNA encoding said peptide or antisense DNA derived therefrom). Also provided is a method for the diagnosis, in a subject, of immunodeficiency, developmental abnormality, inherited disease, or cancer associated with disruption of chromosome 11 at q23.
Izsvák, Zsuzsanna; Khare, Dheeraj; Behlke, Joachim; Heinemann, Udo; Plasterk, Ronald H; Ivics, Zoltán
2002-09-13
Sleeping Beauty (SB) is the most active Tc1/mariner-like transposon in vertebrate species. Each of the terminal inverted repeats (IRs) of SB contains two transposase-binding sites (DRs). This feature, termed the IR/DR structure, is conserved in a group of Tc1-like transposons. The DNA-binding region of SB transposase, similar to the paired domain of Pax proteins, consists of two helix-turn-helix subdomains (PAI + RED = PAIRED). The N-terminal PAI subdomain was found to play a dominant role in contacting the DRs. Transposase was able to bind to mutant sites retaining the 3' part of the DRs; thus, primary DNA binding is not sufficient to determine the specificity of the transposition reaction. The PAI subdomain was also found to bind to a transpositional enhancer-like sequence within the left IR of SB, and to mediate protein-protein interactions between transposase subunits. A tetrameric form of the transposase was detected in solution, consistent with an interaction between the IR/DR structure and a transposase tetramer. We propose a model in which the transpositional enhancer and the PAI subdomain stabilize complexes formed by a transposase tetramer bound at the IR/DR. These interactions may result in enhanced stability of synaptic complexes, which might explain the efficient transposition of Sleeping Beauty in vertebrate cells.
Kim, Taehyung; Tyndel, Marc S; Huang, Haiming; Sidhu, Sachdev S; Bader, Gary D; Gfeller, David; Kim, Philip M
2012-03-01
Peptide recognition domains and transcription factors play crucial roles in cellular signaling. They bind linear stretches of amino acids or nucleotides, respectively, with high specificity. Experimental techniques that assess the binding specificity of these domains, such as microarrays or phage display, can retrieve thousands of distinct ligands, providing detailed insight into binding specificity. In particular, the advent of next-generation sequencing has recently increased the throughput of such methods by several orders of magnitude. These advances have helped reveal the presence of distinct binding specificity classes that co-exist within a set of ligands interacting with the same target. Here, we introduce a software system called MUSI that can rapidly analyze very large data sets of binding sequences to determine the relevant binding specificity patterns. Our pipeline provides two major advances. First, it can detect previously unrecognized multiple specificity patterns in any data set. Second, it offers integrated processing of very large data sets from next-generation sequencing machines. The results are visualized as multiple sequence logos describing the different binding preferences of the protein under investigation. We demonstrate the performance of MUSI by analyzing recent phage display data for human SH3 domains as well as microarray data for mouse transcription factors.
Zheng, Wenjun
2017-02-01
In the adaptive immune systems of many bacteria and archaea, the Cas9 endonuclease forms a complex with specific guide/scaffold RNA to identify and cleave complementary target sequences in foreign DNA. This DNA targeting machinery has been exploited in numerous applications of genome editing and transcription control. However, the molecular mechanism of the Cas9 system is still obscure. Recently, high-resolution structures have been solved for Cas9 in different structural forms (e.g., unbound forms, RNA-bound binary complexes, and RNA-DNA-bound tertiary complexes, corresponding to an inactive state, a pre-target-bound state, and a cleavage-competent or product state), which offered key structural insights into the Cas9 mechanism. To further probe the structural dynamics of Cas9 interacting with RNA and DNA at the amino-acid level of detail, we have performed systematic coarse-grained modeling using an elastic network model and related analyses. Our normal mode analysis predicted a few key modes of collective motions that capture the observed conformational changes featuring large domain motions triggered by binding of RNA and DNA. Our flexibility analysis identified specific regions with high or low flexibility that coincide with key functional sites (such as DNA/RNA-binding sites, nuclease cleavage sites, and key hinges). We also identified a small set of hotspot residues that control the energetics of functional motions, which overlap with known functional sites and offer promising targets for future mutagenesis efforts to improve the specificity of Cas9. Finally, we modeled the conformational transitions of Cas9 from the unbound form to the binary complex and then the tertiary complex, and predicted a distinct sequence of domain motions. In sum, our findings have offered rich structural and dynamic details relevant to the Cas9 machinery, and will guide future investigation and engineering of the Cas9 systems. Proteins 2017; 85:342-353.
© 2016 Wiley Periodicals, Inc.
Global Analysis of Transcription Factor-Binding Sites in Yeast Using ChIP-Seq
Lefrançois, Philippe; Gallagher, Jennifer E. G.; Snyder, Michael
2016-01-01
Transcription factors influence gene expression through their ability to bind DNA at specific regulatory elements. Specific DNA-protein interactions can be isolated through the chromatin immunoprecipitation (ChIP) procedure, in which DNA fragments bound by the protein of interest are recovered. ChIP is followed by high-throughput DNA sequencing (Seq) to determine the genomic provenance of ChIP DNA fragments and their relative abundance in the sample. This chapter describes a ChIP-Seq strategy adapted for budding yeast to enable the genome-wide characterization of binding sites of transcription factors (TFs) and other DNA-binding proteins in an efficient and cost-effective way. Yeast strains with epitope-tagged TFs are most commonly used for ChIP-Seq, along with their matching untagged control strains. The initial step of ChIP involves the cross-linking of DNA and proteins. Next, yeast cells are lysed and sonicated to shear chromatin into smaller fragments. An antibody against an epitope-tagged TF is used to pull down chromatin complexes containing DNA and the TF of interest. DNA is then purified and proteins degraded. Specific barcoded adapters for multiplex DNA sequencing are ligated to ChIP DNA. Short DNA sequence reads (28–36 base pairs) are parsed according to the barcode and aligned against the yeast reference genome, thus generating a nucleotide-resolution map of transcription factor-binding sites and their occupancy. PMID:25213249
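The step in which reads are "parsed according to the barcode" can be sketched as follows. This is a simplified illustration, not the pipeline described in the chapter: it assumes the barcode sits at the 5' end of each read and must match exactly, and the barcodes, sample names, and reads are invented.

```python
from collections import defaultdict


def demultiplex(reads, barcodes):
    """Group reads by a leading barcode and strip the barcode from each read.

    `barcodes` maps barcode sequence -> sample name. Reads with no exact
    barcode match at the 5' end are collected under "unassigned".
    """
    by_sample = defaultdict(list)
    for read in reads:
        for barcode, sample in barcodes.items():
            if read.startswith(barcode):
                by_sample[sample].append(read[len(barcode):])
                break
        else:
            # No barcode matched this read.
            by_sample["unassigned"].append(read)
    return dict(by_sample)


# Invented barcodes and reads, for illustration only.
barcodes = {"ACGT": "tagged_TF", "TGCA": "untagged_control"}
reads = ["ACGTGGATCCTTAG", "TGCAGGATCCTTAG", "ACGTTTTTCCGGAA", "NNNNGGATCCTTAG"]

result = demultiplex(reads, barcodes)
for sample, seqs in result.items():
    print(sample, seqs)
```

After this step, each sample's trimmed reads would be aligned to the reference genome separately, so that the tagged strain can be compared against its untagged control.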
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-01-01
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. PMID:26481363
Wang, H Y; Paul, W E; Keegan, A D
1996-02-01
IL-4 binds to a cell surface receptor complex that consists of the IL-4 binding protein (IL-4R alpha) and the gamma chain of the IL-2 receptor complex (gamma c). The receptors for IL-4 and IL-2 have several features in common; both use the gamma c as a receptor component, and both activate the Janus kinases JAK-1 and JAK-3. In spite of these similarities, IL-4 evokes specific responses, including the tyrosine phosphorylation of 4PS/IRS-2 and the induction of CD23. To determine whether sequences within the cytoplasmic domain of the IL-4R alpha specify these IL-4-specific responses, we transplanted the insulin IL-4 receptor motif (I4R motif) of the huIL-4R alpha to the cytoplasmic domain of a truncated IL-2R beta. In addition, we transplanted a region that contains peptide sequences shown to block Stat6 binding to DNA. We analyzed the ability of cells expressing these IL-2R-IL-4R chimeric constructs to respond to IL-2. We found that IL-4 function could be transplanted to the IL-2 receptor by these regions and that proliferative and differentiative functions can be induced by different receptor sequences.
NASA Astrophysics Data System (ADS)
Moreland, Blythe; Oman, Kenji; Curfman, John; Yan, Pearlly; Bundschuh, Ralf
Methyl-binding domain (MBD) protein pulldown experiments have been a valuable tool in measuring the levels of methylated CpG dinucleotides. Due to the frequent use of this technique, high-throughput sequencing data sets are available that allow a detailed quantitative characterization of the underlying interaction between methylated DNA and MBD proteins. Analyzing such data sets, we first found that two such proteins cannot bind closer to each other than 2 bp, consistent with structural models of the DNA-protein interaction. Second, the large amount of sequencing data allowed us to find rather weak but nevertheless clearly statistically significant sequence preferences for several bases around the required CpG. These results demonstrate that pulldown sequencing is a high-precision tool in characterizing DNA-protein interactions. This material is based upon work supported by the National Science Foundation under Grant No. DMR-1410172.
Programmable RNA recognition and cleavage by CRISPR/Cas9.
O'Connell, Mitchell R; Oakes, Benjamin L; Sternberg, Samuel H; East-Seletsky, Alexandra; Kaplan, Matias; Doudna, Jennifer A
2014-12-11
The CRISPR-associated protein Cas9 is an RNA-guided DNA endonuclease that uses RNA-DNA complementarity to identify target sites for sequence-specific double-stranded DNA (dsDNA) cleavage. In its native context, Cas9 acts on DNA substrates exclusively because both binding and catalysis require recognition of a short DNA sequence, known as the protospacer adjacent motif (PAM), next to and on the strand opposite the twenty-nucleotide target site in dsDNA. Cas9 has proven to be a versatile tool for genome engineering and gene regulation in a large range of prokaryotic and eukaryotic cell types, and in whole organisms, but it has been thought to be incapable of targeting RNA. Here we show that Cas9 binds with high affinity to single-stranded RNA (ssRNA) targets matching the Cas9-associated guide RNA sequence when the PAM is presented in trans as a separate DNA oligonucleotide. Furthermore, PAM-presenting oligonucleotides (PAMmers) stimulate site-specific endonucleolytic cleavage of ssRNA targets, similar to PAM-mediated stimulation of Cas9-catalysed DNA cleavage. Using specially designed PAMmers, Cas9 can be specifically directed to bind or cut RNA targets while avoiding corresponding DNA sequences, and we demonstrate that this strategy enables the isolation of a specific endogenous messenger RNA from cells. These results reveal a fundamental connection between PAM binding and substrate selection by Cas9, and highlight the utility of Cas9 for programmable transcript recognition without the need for tags.
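The PAM requirement described here can be made concrete with a small search routine that lists 20-nucleotide sites immediately followed by a PAM. The sketch assumes the 5'-NGG-3' PAM of Streptococcus pyogenes Cas9, which the abstract does not name; it scans only the strand given, and the function name and test sequence are invented for illustration.

```python
import re


def find_target_sites(seq, pam_regex="[ACGT]GG", target_len=20):
    """Yield (start, target, pam) for each target immediately 5' of a PAM.

    Only the given strand is scanned; the lookahead keeps overlapping sites.
    """
    pattern = re.compile(rf"(?=([ACGT]{{{target_len}}})({pam_regex}))")
    for match in pattern.finditer(seq.upper()):
        yield match.start(), match.group(1), match.group(2)


# Invented sequence: a 20-nt stretch followed by AGG and two extra bases.
seq = "GATTACAGATTACAGATTAC" + "AGG" + "TT"
sites = list(find_target_sites(seq))
print(sites)
```

In the PAMmer strategy reported in the abstract, the PAM is supplied in trans on a separate DNA oligonucleotide, so an RNA target would not need to contain this motif itself.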
Programmable RNA recognition and cleavage by CRISPR/Cas9
O’Connell, Mitchell R.; Oakes, Benjamin L.; Sternberg, Samuel H.; East-Seletsky, Alexandra; Kaplan, Matias; Doudna, Jennifer A.
2014-01-01
The CRISPR-associated protein Cas9 is an RNA-guided DNA endonuclease that uses RNA:DNA complementarity to identify target sites for sequence-specific double-stranded DNA (dsDNA) cleavage. In its native context, Cas9 acts on DNA substrates exclusively because both binding and catalysis require recognition of a short DNA sequence, the protospacer adjacent motif (PAM), next to and on the strand opposite the 20-nucleotide target site in dsDNA. Cas9 has proven to be a versatile tool for genome engineering and gene regulation in many cell types and organisms, but it has been thought to be incapable of targeting RNA. Here we show that Cas9 binds with high affinity to single-stranded RNA (ssRNA) targets matching the Cas9-associated guide RNA sequence when the PAM is presented in trans as a separate DNA oligonucleotide. Furthermore, PAM-presenting oligonucleotides (PAMmers) stimulate site-specific endonucleolytic cleavage of ssRNA targets, similar to PAM-mediated stimulation of Cas9-catalyzed DNA cleavage. Using specially designed PAMmers, Cas9 can be specifically directed to bind or cut RNA targets while avoiding corresponding DNA sequences, and we demonstrate that this strategy enables the isolation of a specific endogenous mRNA from cells. These results reveal a fundamental connection between PAM binding and substrate selection by Cas9, and highlight the utility of Cas9 for programmable and tagless transcript recognition. PMID:25274302
Merino, Felipe; Bouvier, Benjamin; Cojocaru, Vlad
2015-01-01
Highly specific transcriptional regulation depends on the cooperative association of transcription factors into enhanceosomes. Usually, their DNA-binding cooperativity originates from either direct interactions or DNA-mediated allostery. Here, we performed unbiased molecular simulations followed by simulations of protein-DNA unbinding and free energy profiling to study the cooperative DNA recognition by OCT4 and SOX2, key components of enhanceosomes in pluripotent cells. We found that SOX2 influences the orientation and dynamics of the DNA-bound configuration of OCT4. In addition SOX2 modifies the unbinding free energy profiles of both DNA-binding domains of OCT4, the POU specific and POU homeodomain, despite interacting directly only with the first. Thus, we demonstrate that the OCT4-SOX2 cooperativity is modulated by an interplay between protein-protein interactions and DNA-mediated allostery. Further, we estimated the change in OCT4-DNA binding free energy due to the cooperativity with SOX2, observed a good agreement with experimental measurements, and found that SOX2 affects the relative DNA-binding strength of the two OCT4 domains. Based on these findings, we propose that available interaction partners in different biological contexts modulate the DNA exploration routes of multi-domain transcription factors such as OCT4. We consider the OCT4-SOX2 cooperativity as a paradigm of how specificity of transcriptional regulation is achieved through concerted modulation of protein-DNA recognition by different types of interactions. PMID:26067358
Structure of an XPF endonuclease with and without DNA suggests a model for substrate recognition
Newman, Matthew; Murray-Rust, Judith; Lally, John; Rudolf, Jana; Fadden, Andrew; Knowles, Philip P; White, Malcolm F; McDonald, Neil Q
2005-01-01
The XPF/Mus81 structure-specific endonucleases cleave double-stranded DNA (dsDNA) within asymmetric branched DNA substrates and play an essential role in nucleotide excision repair, recombination and genome integrity. We report the structure of an archaeal XPF homodimer alone and bound to dsDNA. Superposition of these structures reveals a large domain movement upon binding DNA, indicating how the (HhH)2 domain and the nuclease domain are coupled to allow the recognition of double-stranded/single-stranded DNA junctions. We identify two nonequivalent DNA-binding sites and propose a model in which XPF distorts the 3′ flap substrate in order to engage both binding sites and promote strand cleavage. The model rationalises published biochemical data and implies a novel role for the ERCC1 subunit of eukaryotic XPF complexes. PMID:15719018
Left-handed Z-DNA: structure and function
NASA Technical Reports Server (NTRS)
Herbert, A.; Rich, A.
1999-01-01
Z-DNA is a high energy conformer of B-DNA that forms in vivo during transcription as a result of torsional strain generated by a moving polymerase. An understanding of the biological role of Z-DNA has advanced with the discovery that the RNA editing enzyme double-stranded RNA adenosine deaminase type I (ADAR1) has motifs specific for the Z-DNA conformation. Editing by ADAR1 requires a double-stranded RNA substrate. In the cases known, the substrate is formed by folding an intron back onto the exon that is targeted for modification. The use of introns to direct processing of exons requires that editing occurs before splicing. Recognition of Z-DNA by ADAR1 may allow editing of nascent transcripts to be initiated immediately after transcription, ensuring that editing and splicing are performed in the correct sequence. Structural characterization of the Z-DNA binding domain indicates that it belongs to the winged helix-turn-helix class of proteins and is similar to the globular domain of histone-H5.
Dossani, Zain Y.; Reider Apel, Amanda; Szmidt‐Middleton, Heather; Hillson, Nathan J.; Deutsch, Samuel; Keasling, Jay D.
2017-01-01
Despite the need for inducible promoters in strain development efforts, the majority of engineering in Saccharomyces cerevisiae continues to rely on a few constitutively active or inducible promoters. Building on advances that use the modular nature of both transcription factors and promoter regions, we have built a library of hybrid promoters that are regulated by a synthetic transcription factor. The hybrid promoters consist of native S. cerevisiae promoters, in which the operator regions have been replaced with sequences that are recognized by the bacterial LexA DNA binding protein. Correspondingly, the synthetic transcription factor (TF) consists of the DNA binding domain of the LexA protein, fused with the human estrogen binding domain and the viral activator domain, VP16. The resulting system with a bacterial DNA binding domain avoids the transcription of native S. cerevisiae genes, and the hybrid promoters can be induced using estradiol, a compound with no detectable impact on S. cerevisiae physiology. Using combinations of one, two or three operator sequence repeats and a set of native S. cerevisiae promoters, we obtained a series of hybrid promoters that can be induced to different levels, using the same synthetic TF and a given estradiol concentration. This set of promoters, in combination with our synthetic TF, has the potential to regulate numerous genes or pathways simultaneously, to multiple desired levels, in a single strain. PMID:29084380
Complete complementary DNA-derived amino acid sequence of canine cardiac phospholamban.
Fujii, J; Ueno, A; Kitano, K; Tanaka, S; Kadoma, M; Tada, M
1987-01-01
Complementary DNA (cDNA) clones specific for phospholamban of sarcoplasmic reticulum membranes have been isolated from a canine cardiac cDNA library. The amino acid sequence deduced from the cDNA sequence indicates that phospholamban consists of 52 amino acid residues and lacks an amino-terminal signal sequence. The protein has an inferred mol wt 6,080 that is in agreement with its apparent monomeric mol wt 6,000, estimated previously by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. Phospholamban contains two distinct domains, a hydrophilic region at the amino terminus (domain I) and a hydrophobic region at the carboxy terminus (domain II). We propose that domain I is localized at the cytoplasmic surface and offers phosphorylatable sites whereas domain II is anchored into the sarcoplasmic reticulum membrane. PMID:3793929
Sperm 1: a POU-domain gene transiently expressed immediately before meiosis I in the male germ cell.
Andersen, B; Pearse, R V; Schlegel, P N; Cichon, Z; Schonemann, M D; Bardin, C W; Rosenfeld, M G
1993-01-01
Members of the POU-domain gene family encode transcriptional regulatory molecules that are important for terminal differentiation of several organ systems, including anterior pituitary, sensory neurons, and B lymphocytes. We have identified a POU-domain factor, referred to as sperm 1 (Sprm-1). This factor is most related to the transactivator Oct-3/4, which is expressed in the early embryo, primordial germ cells, and the egg. However, in contrast with Oct-3/4, rat Sprm-1 is selectively expressed during a 36- to 48-hr period immediately preceding meiosis I in male germ cells. Although the POU-domain of Sprm-1 is divergent from the POU-domains of Oct-1 and Oct-2, random-site-selection assay reveals that Sprm-1 preferentially binds to a specific variant of the classic octamer DNA-response element in which the optimal sequence differs from that preferred by Oct-1 and Pit-1. These data suggest that the Sprm-1 gene encodes a DNA-binding protein that may exert a regulatory function in meiotic events that are required for terminal differentiation of the male germ cell. PMID:7902581
Brucet, Marina; Querol-Audí, Jordi; Serra, Maria; Ramirez-Espain, Ximena; Bertlik, Kamila; Ruiz, Lidia; Lloberas, Jorge; Macias, Maria J; Fita, Ignacio; Celada, Antonio
2007-05-11
TREX1 is the most abundant mammalian 3' --> 5' DNA exonuclease. It has been described to form part of the SET complex and is responsible for the Aicardi-Goutières syndrome in humans. Here we show that the exonuclease activity is correlated to the binding preferences toward certain DNA sequences. In particular, we have found three motifs that are selected, GAG, ACA, and CTGC. To elucidate how the discrimination occurs, we determined the crystal structures of two murine TREX1 complexes, with a nucleotide product of the exonuclease reaction, and with a single-stranded DNA substrate. Using confocal microscopy, we observed TREX1 both in nuclear and cytoplasmic subcellular compartments. Remarkably, the presence of TREX1 in the nucleus requires the loss of a C-terminal segment, which we named leucine-rich repeat 3. Furthermore, we detected the presence of a conserved proline-rich region on the surface of TREX1. This observation points to interactions with proline-binding domains. The potential interacting motif "PPPVPRPP" does not contain aromatic residues and thus resembles other sequences that select SH3 and/or Group 2 WW domains. By means of nuclear magnetic resonance titration experiments, we show that, indeed, a polyproline peptide derived from the murine TREX1 sequence interacted with the WW2 domain of the elongation transcription factor CA150. Co-immunoprecipitation studies confirmed this interaction with the full-length TREX1 protein, thereby suggesting that TREX1 participates in more functional complexes than previously thought.
Structure-based Analysis of HU-DNA Binding
DOE Office of Scientific and Technical Information (OSTI.GOV)
Swinger, K.; Rice, P.
2007-01-01
HU and IHF are prokaryotic proteins that induce very large bends in DNA. They are present in high concentrations in the bacterial nucleoid and aid in chromosomal compaction. They also function as regulatory cofactors in many processes, such as site-specific recombination and the initiation of replication and transcription. HU and IHF have become paradigms for understanding DNA bending and indirect readout of sequence. While IHF shows significant sequence specificity, HU binds preferentially to certain damaged or distorted DNAs. However, none of the structurally diverse HU substrates previously studied in vitro is identical with the distorted substrates in the recently published Anabaena HU(AHU)-DNA cocrystal structures. Here, we report binding affinities for AHU and the DNA in the cocrystal structures. The binding free energies for formation of these AHU-DNA complexes range from 10 to 14.5 kcal/mol, representing Kd values in the nanomolar to low picomolar range, and a maximum stabilization of at least 6.3 kcal/mol relative to complexes with undistorted, non-specific DNA. We investigated IHF binding and found that appropriate structural distortions can greatly enhance its affinity. On the basis of the coupling of structural and relevant binding data, we estimate the amount of conformational strain in an IHF-mediated DNA kink that is relieved by a nick (at least 0.76 kcal/mol) and pinpoint the location of the strain. We show that AHU has a sequence preference for an A+T-rich region in the center of its DNA-binding site, correlating with an unusually narrow minor groove. This is similar to sequence preferences shown by the eukaryotic nucleosome.
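The link between the quoted binding free energies and the dissociation constants can be checked with the standard relation Kd = exp(-ΔG/RT), where ΔG is the magnitude of the favorable binding free energy and Kd is expressed relative to a 1 M standard state. The sketch below is illustrative only: the temperature of 298.15 K is an assumption (the abstract does not state the experimental temperature), and the function names are hypothetical.

```python
import math

# Gas constant in kcal/(mol*K).
R_KCAL = 1.987204e-3


def kd_from_binding_free_energy(dg_kcal_per_mol, temperature_k=298.15):
    """Return Kd in mol/L for a favorable binding free energy.

    dg_kcal_per_mol is the magnitude of the binding free energy
    (positive number). The default temperature is an assumption.
    """
    return math.exp(-dg_kcal_per_mol / (R_KCAL * temperature_k))


def affinity_fold_change(ddg_kcal_per_mol, temperature_k=298.15):
    """Return the ratio of Kd values implied by a free-energy difference."""
    return math.exp(ddg_kcal_per_mol / (R_KCAL * temperature_k))


if __name__ == "__main__":
    for dg in (10.0, 14.5):
        print(f"dG = {dg} kcal/mol -> Kd = {kd_from_binding_free_energy(dg):.2e} M")
    print(f"6.3 kcal/mol stabilization -> {affinity_fold_change(6.3):.0f}-fold tighter binding")
```

Under the assumed temperature, 10 kcal/mol corresponds to a Kd of a few tens of nanomolar and 14.5 kcal/mol to a few tens of picomolar, which matches the range quoted in the abstract; a 6.3 kcal/mol stabilization corresponds to a change in affinity on the order of 10^4-fold.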
Binding site size limit of the 2:1 pyrrole-imidazole polyamide-DNA motif.
Kelly, J J; Baird, E E; Dervan, P B
1996-01-01
Polyamides containing N-methylimidazole (Im) and N-methylpyrrole (Py) amino acids can be combined in antiparallel side-by-side dimeric complexes for sequence-specific recognition in the minor groove of DNA. Six polyamides containing three to eight rings bind DNA sites 5-10 bp in length, respectively. Quantitative DNase I footprint titration experiments demonstrate that affinity maximizes and is similar at ring sizes of five, six, and seven. Sequence specificity decreases as the length of the polyamides increases beyond five rings. These results provide useful guidelines for the design of new polyamides that bind longer DNA sites with enhanced affinity and specificity. PMID:8692930
Glinsky, Gennadi V
2016-09-19
Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique to human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8-10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P < 0.0001 in all instances). 
Present analysis supports the idea that phenotypic divergence of Homo sapiens is driven by the evolution of human-specific genomic regulatory networks via at least two mechanistically distinct pathways of creation of divergent sequences of regulatory DNA: (i) recombination-associated exaptation of the highly conserved ancestral regulatory DNA segments; (ii) human-specific insertions of transposable elements. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Human DNA ligase III recognizes DNA ends by dynamic switching between two DNA-bound states.
Cotner-Gohara, Elizabeth; Kim, In-Kwon; Hammel, Michal; Tainer, John A; Tomkinson, Alan E; Ellenberger, Tom
2010-07-27
Human DNA ligase III has essential functions in nuclear and mitochondrial DNA replication and repair and contains a PARP-like zinc finger (ZnF) that increases the extent of DNA nick joining and intermolecular DNA ligation, yet the bases for ligase III specificity and structural variation among human ligases are not understood. Here combined crystal structure and small-angle X-ray scattering results reveal dynamic switching between two nick-binding components of ligase III: the ZnF-DNA binding domain (DBD) forms a crescent-shaped surface used for DNA end recognition which switches to a ring formed by the nucleotidyl transferase (NTase) and OB-fold (OBD) domains for catalysis. Structural and mutational analyses indicate that high flexibility and distinct DNA binding domain features in ligase III assist both nick sensing and the transition from nick sensing by the ZnF to nick joining by the catalytic core. The collective results support a "jackknife model" in which the ZnF loads ligase III onto nicked DNA and conformational changes deliver DNA into the active site. This work has implications for the biological specificity of DNA ligases and functions of PARP-like zinc fingers.
NASA Technical Reports Server (NTRS)
Reddy, A. S.; Reddy, V. S.; Golovkin, M.
2000-01-01
Calmodulin (CaM), a key calcium sensor in all eukaryotes, regulates diverse cellular processes by interacting with other proteins. To isolate CaM binding proteins involved in ethylene signal transduction, we screened an expression library prepared from ethylene-treated Arabidopsis seedlings with 35S-labeled CaM. A cDNA clone, EICBP (Ethylene-Induced CaM Binding Protein), encoding a protein that interacts with activated CaM was isolated in this screening. The CaM binding domain in EICBP was mapped to the C-terminus of the protein. These results indicate that calcium, through CaM, could regulate the activity of EICBP. The EICBP is expressed in different tissues and its expression in seedlings is induced by ethylene. The EICBP contains, in addition to a CaM binding domain, several features that are typical of transcription factors. These include a DNA-binding domain at the N terminus, an acidic region at the C terminus, and nuclear localization signals. In database searches a partial cDNA (CG-1) encoding a DNA-binding motif from parsley and an ethylene up-regulated partial cDNA from tomato (ER66) showed significant similarity to EICBP. In addition, five hypothetical proteins in the Arabidopsis genome also showed a very high sequence similarity with EICBP, indicating that there are several EICBP-related proteins in Arabidopsis. The structural features of EICBP are conserved in all EICBP-related proteins in Arabidopsis, suggesting that they may constitute a new family of DNA binding proteins and are likely to be involved in modulating gene expression in the presence of ethylene.
Structures of apo IRF-3 and IRF-7 DNA binding domains: effect of loop L1 on DNA binding
DOE Office of Scientific and Technical Information (OSTI.GOV)
De Ioannes, Pablo; Escalante, Carlos R.; Aggarwal, Aneel K.
2013-11-20
Interferon regulatory factors IRF-3 and IRF-7 are transcription factors essential in the activation of the interferon-β (IFN-β) gene in response to viral infections. Although both proteins recognize the same consensus IRF binding site AANNGAAA, they have distinct DNA binding preferences for sites in vivo. The X-ray structures of IRF-3 and IRF-7 DNA binding domains (DBDs) bound to IFN-β promoter elements revealed flexibility in the loops (L1-L3) and the residues that make contacts with the target sequence. To characterize the conformational changes that occur on DNA binding and how they differ between IRF family members, we have solved the X-ray structures of IRF-3 and IRF-7 DBDs in the absence of DNA. We found that loop L1, carrying the conserved histidine that interacts with the DNA minor groove, is disordered in apo IRF-3 but is ordered in apo IRF-7. This is reflected in differences in DNA binding affinities when the conserved histidine in loop L1 is mutated to alanine in the two proteins. The stability of loop L1 in IRF-7 derives from a unique combination of hydrophobic residues that pack against the protein core. Together, our data show that differences in flexibility of loop L1 are an important determinant of differential IRF-DNA binding.
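The consensus IRF binding site AANNGAAA quoted above can be located in a DNA sequence with a simple pattern scan. The following is a minimal sketch, not a method from the study: the function names and the example sequences are hypothetical, N is taken to mean any of A, C, G, or T, and both strands are scanned by searching the reverse complement.

```python
import re

# Consensus AANNGAAA; the lookahead allows overlapping matches to be reported.
CONSENSUS = re.compile(r"(?=(AA[ACGT]{2}GAAA))")
COMPLEMENT = str.maketrans("ACGT", "TGCA")
SITE_LENGTH = 8


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_irf_sites(seq):
    """Return sorted (start, strand, site) tuples for consensus matches.

    start is the 0-based position on the forward strand of the leftmost
    base covered by the site; site is read 5'->3' on the matching strand.
    """
    seq = seq.upper()
    hits = [(m.start(), "+", m.group(1)) for m in CONSENSUS.finditer(seq)]
    rc = reverse_complement(seq)
    for m in CONSENSUS.finditer(rc):
        # Convert the reverse-strand coordinate back to the forward strand.
        start = len(seq) - (m.start() + SITE_LENGTH)
        hits.append((start, "-", m.group(1)))
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical sequence used only to exercise the scan.
    print(find_irf_sites("GGAAGTGAAACTTT"))
```

A scan of this kind only reports matches to the shared consensus; it says nothing about the distinct in vivo preferences of IRF-3 and IRF-7 described in the abstract.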
Sharma, Amit; Jenkins, Katherine R.; Héroux, Annie; Bowman, Gregory D.
2011-01-01
Chromatin remodelers are ATP-dependent machines that dynamically alter the chromatin packaging of eukaryotic genomes by assembling, sliding, and displacing nucleosomes. The Chd1 chromatin remodeler possesses a C-terminal DNA-binding domain that is required for efficient nucleosome sliding and believed to be essential for sensing the length of DNA flanking the nucleosome core. The structure of the Chd1 DNA-binding domain was recently shown to consist of a SANT and SLIDE domain, analogous to the DNA-binding domain of the ISWI family, yet the details of how Chd1 recognized DNA were not known. Here we present the crystal structure of the Saccharomyces cerevisiae Chd1 DNA-binding domain in complex with a DNA duplex. The bound DNA duplex is straight, consistent with the preference exhibited by the Chd1 DNA-binding domain for extranucleosomal DNA. Comparison of this structure with the recently solved ISW1a DNA-binding domain bound to DNA reveals that DNA lies across each protein at a distinct angle, yet contacts similar surfaces on the SANT and SLIDE domains. In contrast to the minor groove binding seen for Isw1 and predicted for Chd1, the SLIDE domain of the Chd1 DNA-binding domain contacts the DNA major groove. The majority of direct contacts with the phosphate backbone occur only on one DNA strand, suggesting that Chd1 may not strongly discriminate between major and minor grooves. PMID:22033927
DNA binding by FOXP3 domain-swapped dimer suggests mechanisms of long-range chromosomal interactions
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chen, Y.; Chen, C.; Zhang, Z.
2015-01-07
FOXP3 is a lineage-specific transcription factor that is required for regulatory T cell development and function. In this study, we determined the crystal structure of the FOXP3 forkhead domain bound to DNA. The structure reveals that FOXP3 can form a stable domain-swapped dimer to bridge DNA in the absence of cofactors, suggesting that FOXP3 may play a role in long-range gene interactions. To test this hypothesis, we used circular chromosome conformation capture coupled with high throughput sequencing (4C-seq) to analyze FOXP3-dependent genomic contacts around a known FOXP3-bound locus, Ptpn22. Our studies reveal that FOXP3 induces significant changes in the chromatin contacts between the Ptpn22 locus and other Foxp3-regulated genes, reflecting a mechanism by which FOXP3 reorganizes the genome architecture to coordinate the expression of its target genes. Our results suggest that FOXP3 mediates long-range chromatin interactions as part of its mechanisms to regulate specific gene expression in regulatory T cells.
Interactions of Ku70/80 with Double-Strand DNA: Energetic, Dynamics, and Functional Implications
NASA Technical Reports Server (NTRS)
Hu, Shaowen; Cucinotta, Francis A.
2010-01-01
Space radiation is a proficient inducer of DNA damage leading to mutation, aberrant cell signaling, and cancer formation. Ku is among the first responding proteins in the nucleus to recognize and bind DNA double strand breaks (DSBs) whenever they are introduced. Once loaded, Ku works as a scaffold to recruit other repair factors of non-homologous end joining and facilitates the following repair processes. The crystallographic study of the Ku70/80 heterodimer indicates that the core structure of this protein shows virtually no conformational change after binding with DNA. To investigate the dynamical features as well as the energetic characteristics of Ku-DNA binding, we conduct multi-nanosecond molecular dynamics simulations of a modeled Ku70/80 structure and several complexes with two 24-bp DNA duplexes. Free energy calculations show significant energy differences between the complexes with Ku bound at DSBs and those with Ku associated at an internal site of a chromosome. The results also reveal detailed interactions between different nucleotides and the amino acids along the DNA-binding cradle of Ku, indicating subtle binding preference of Ku at specific DNA sequences. The covariance matrix analyses along the trajectories demonstrate the protein is stimulated to undergo correlated motions of different domains once bound to DNA ends. Additionally, principal component analyses identify these low frequency collective motions suitable for binding with and translocation along duplex DNA. It is proposed that the modification of dynamical properties of Ku upon binding with DSBs may provide a signal for the further recruitment of other repair factors such as DNA-PKcs, XLF, and XRCC4.
QueTAL: a suite of tools to classify and compare TAL effectors functionally and phylogenetically
Pérez-Quintero, Alvaro L.; Lamy, Léo; Gordon, Jonathan L.; Escalon, Aline; Cunnac, Sébastien; Szurek, Boris; Gagnevin, Lionel
2015-01-01
Transcription Activator-Like (TAL) effectors from Xanthomonas plant pathogenic bacteria can bind to the promoter region of plant genes and induce their expression. DNA-binding specificity is governed by a central domain made of nearly identical repeats, each determining the recognition of one base pair via two amino acid residues (a.k.a. Repeat Variable Di-residue, or RVD). Knowing how TAL effectors differ from each other within and between strains would be useful to infer functional and evolutionary relationships, but their repetitive nature precludes reliable use of traditional alignment methods. The suite QueTAL was therefore developed to offer tailored tools for comparison of TAL effector genes. The program DisTAL considers each repeat as a unit, transforms a TAL effector sequence into a sequence of coded repeats and makes pair-wise alignments between these coded sequences to construct trees. The program FuncTAL is aimed at finding TAL effectors with similar DNA-binding capabilities. It calculates correlations between position weight matrices of potential target DNA sequence predicted from the RVD sequence, and builds trees based on these correlations. The programs accurately represented phylogenetic and functional relationships between TAL effectors using either simulated or literature-curated data. When using the programs on a large set of TAL effector sequences, the DisTAL tree largely reflected the expected species phylogeny. In contrast, FuncTAL showed that TAL effectors with similar binding capabilities can be found between phylogenetically distant taxa. This suite will help users to rapidly analyse any TAL effector genes of interest and compare them to other available TAL genes and should improve our understanding of TAL effectors evolution. It is available at http://bioinfo-web.mpl.ird.fr/cgi-bin2/quetal/quetal.cgi. PMID:26284082
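The idea of treating each repeat as a unit, as described above for DisTAL, can be illustrated with a token-level comparison. The sketch below is not the DisTAL algorithm: it assumes a plain unit-cost edit distance between coded repeat sequences, the function names are hypothetical, and the example effectors are invented, with RVD labels standing in for whole repeats.

```python
def encode_repeats(effectors):
    """Map each distinct repeat to an integer code, per effector.

    effectors: dict of name -> list of repeats (any hashable label).
    Returns a dict of name -> list of integer codes.
    """
    codes = {}
    coded = {}
    for name, repeats in effectors.items():
        coded[name] = [codes.setdefault(rep, len(codes)) for rep in repeats]
    return coded


def repeat_edit_distance(a, b):
    """Levenshtein distance in which each element is one whole repeat."""
    prev = list(range(len(b) + 1))
    for i, unit_a in enumerate(a, start=1):
        curr = [i]
        for j, unit_b in enumerate(b, start=1):
            cost = 0 if unit_a == unit_b else 1
            curr.append(min(prev[j] + 1,          # delete a repeat from a
                            curr[j - 1] + 1,      # insert a repeat from b
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def pairwise_distances(effectors):
    """Return {(name1, name2): distance} for all unordered pairs."""
    coded = encode_repeats(effectors)
    names = sorted(coded)
    return {(p, q): repeat_edit_distance(coded[p], coded[q])
            for i, p in enumerate(names) for q in names[i + 1:]}


if __name__ == "__main__":
    # Invented effectors; labels stand in for full repeat sequences.
    example = {
        "talA": ["NI", "HD", "NG", "NN"],
        "talB": ["NI", "HD", "NN"],
        "talC": ["HD", "HD", "NG", "NN", "NG"],
    }
    print(pairwise_distances(example))
```

A distance matrix of this kind could be passed to any standard tree-building method. Working on coded repeats rather than raw residues sidesteps the problem, noted in the abstract, that near-identical repeats confuse conventional sequence alignment.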
BclxL changes conformation upon binding to wild-type but not mutant p53 DNA binding domain.
Hagn, Franz; Klein, Christian; Demmer, Oliver; Marchenko, Natasha; Vaseva, Angelina; Moll, Ute M; Kessler, Horst
2010-01-29
p53 can induce apoptosis through mitochondrial membrane permeabilization by interaction of its DNA binding region with the anti-apoptotic proteins BclxL and Bcl2. However, little is known about the action of p53 at the mitochondria in molecular detail. By using NMR spectroscopy and fluorescence polarization we characterized the binding of wild-type and mutant p53 DNA binding domains to BclxL and show that the wild-type p53 DNA binding domain leads to structural changes in the BH3 binding region of BclxL, whereas mutants fail to induce such effects due to reduced affinity. This was probed by induced chemical shift and residual dipolar coupling data. These data imply that p53 partly achieves its pro-apoptotic function at the mitochondria by facilitating interaction between BclxL and BH3-only proteins in an allosteric mode of action. Furthermore, we characterize for the first time the binding behavior of Pifithrin-mu, a specific small molecule inhibitor of the p53-BclxL interaction, and present a structural model of the protein-ligand complex. A rather unusual behavior is revealed whereby Pifithrin-mu binds to both sides of the protein-protein complex. These data should facilitate the rational design of more potent specific BclxL-p53 inhibitors.
NMR studies of DNA oligomers and their interactions with minor groove binding ligands
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fagan, Patricia A.
1996-05-01
The cationic peptide ligands distamycin and netropsin bind noncovalently to the minor groove of DNA. The binding site, orientation, stoichiometry, and qualitative affinity of distamycin binding to several short DNA oligomers were investigated by NMR spectroscopy. The oligomers studied contain A,T-rich or I,C-rich binding sites, where I = 2-desaminodeoxyguanosine. I•C base pairs are functional analogs of A•T base pairs in the minor groove. The different behaviors exhibited by distamycin and netropsin binding to various DNA sequences suggested that these ligands are sensitive probes of DNA structure. For sites of five or more base pairs, distamycin can form 1:1 or 2:1 ligand:DNA complexes. Cooperativity in distamycin binding is low in sites such as AAAAA, which has narrow minor grooves, and is higher in sites with wider minor grooves such as ATATAT. The distamycin binding and base pair opening lifetimes of I,C-containing DNA oligomers suggest that the I,C minor groove is structurally different from the A,T minor groove. Molecules which direct chemistry to a specific DNA sequence could be used as antiviral compounds, diagnostic probes, or molecular biology tools. The author studied two ligands in which reactive groups were tethered to a distamycin to increase the sequence specificity of the reactive agent.
Habu, Toshiyuki; Wakabayashi, Nobunao; Yoshida, Kayo; Yomogida, Kenntaro; Nishimune, Yoshitake; Morita, Takashi
2004-06-01
The tumor suppressor protein p53 is specifically expressed during meiosis in spermatocytes. Subsets of p53 knockout mice exhibit testicular giant cell degenerative syndrome, which suggests p53 may be associated with meiotic cell cycle and/or DNA metabolism. Here, we show that p53 binds to the mouse meiosis-specific RecA-like protein Mus musculus DMC1 (MmDMC1). The C-terminal domain (amino acid 234-340) of MmDMC1 binds to DNA-binding domain of p53 protein. p53 might be involved in homologous recombination and/or checkpoint function by directly binding to DMC1 protein to repress genomic instability in meiotic germ cells.
MCM ring hexamerization is a prerequisite for DNA-binding
Froelich, Clifford A.; Nourse, Amanda; Enemark, Eric J.
2015-09-13
The hexameric Minichromosome Maintenance (MCM) protein complex forms a ring that unwinds DNA at the replication fork in eukaryotes and archaea. Our recent crystal structure of an archaeal MCM N-terminal domain bound to single-stranded DNA (ssDNA) revealed ssDNA associating across tight subunit interfaces but not at the loose interfaces, indicating that DNA-binding is governed not only by the DNA-binding residues of the subunits (MCM ssDNA-binding motif, MSSB) but also by the relative orientation of the subunits. We now extend these findings to show that DNA-binding by the MCM N-terminal domain of the archaeal organism Pyrococcus furiosus occurs specifically in the hexameric oligomeric form. We show that mutants defective for hexamerization are defective in binding ssDNA despite retaining all the residues observed to interact with ssDNA in the crystal structure. One mutation that exhibits severely defective hexamerization and ssDNA-binding is at a conserved phenylalanine that aligns with the mouse Mcm4(Chaos3) mutation associated with chromosomal instability, cancer, and decreased intersubunit association.
Secondary structure prediction and structure-specific sequence analysis of single-stranded DNA.
Dong, F; Allawi, H T; Anderson, T; Neri, B P; Lyamichev, V I
2001-08-01
DNA sequence analysis by oligonucleotide binding is often affected by interference with the secondary structure of the target DNA. Here we describe an approach that improves DNA secondary structure prediction by combining enzymatic probing of DNA by structure-specific 5'-nucleases with an energy minimization algorithm that utilizes the 5'-nuclease cleavage sites as constraints. The method can identify structural differences between two DNA molecules caused by minor sequence variations such as a single nucleotide mutation. It also demonstrates the existence of long-range interactions between DNA regions separated by >300 nt and the formation of multiple alternative structures by a 244 nt DNA molecule. The differences in the secondary structure of DNA molecules revealed by 5'-nuclease probing were used to design structure-specific probes for mutation discrimination that target the regions of structural, rather than sequence, differences. We also demonstrate the performance of structure-specific 'bridge' probes complementary to non-contiguous regions of the target molecule. The structure-specific probes do not require the high stringency binding conditions necessary for methods based on mismatch formation and permit mutation detection at temperatures from 4 to 37 degrees C. Structure-specific sequence analysis is applied for mutation detection in the Mycobacterium tuberculosis katG gene and for genotyping of the hepatitis C virus.
Leblanc, B; Read, C; Moss, T
1993-02-01
The interaction of the ribosomal transcription factor xUBF with the RNA polymerase I core promoter of Xenopus laevis has been studied both at the DNA and protein levels. It is shown that a single xUBF-DNA complex forms over the 40S initiation site (+1) and involves at least the DNA sequences between -20 and +60 bp. DNA sequences upstream of +10 and downstream of +18 are each sufficient to direct complex formation independently. HMG box 1 of xUBF independently recognizes the sequences -20 to -1 and +1 to +22 and the addition of the N-terminal dimerization domain to HMG box 1 stabilizes its interaction with these sequences approximately 10-fold. HMG boxes 2/3 interact with the DNA downstream of +22 and can independently position xUBF across the initiation site. The C-terminal segment of xUBF, HMG boxes 4, 5 or the acidic domain, directly or indirectly interact with HMG box 1, making the core promoter sequences between -11 and -15 hypersensitive to DNase. This interaction also requires the DNA sequences between +17 and +32, i.e. the HMG box 2/3 binding site. The data suggest extensive folding of the core promoter within the xUBF complex.
The nucleoid protein Dps binds genomic DNA of Escherichia coli in a non-random manner
Kondrashov, F. A.; Toshchakov, S. V.; Dominova, I.; Shvyreva, U. S.; Vrublevskaya, V. V.; Morenkov, O. S.; Panyukov, V. V.
2017-01-01
Dps is a multifunctional homododecameric protein that oxidizes Fe2+ ions, accumulating them in the form of Fe2O3 within its protein cavity, interacts with DNA, tightly condensing the bacterial nucleoid upon starvation, and performs some other functions. In the two decades since the discovery of this protein, its ferroxidase activity has become rather well studied, but the mechanism of Dps interaction with DNA still remains enigmatic. The crucial role of lysine residues in the unstructured N-terminal tails led to the conventional point of view that Dps binds DNA without sequence or structural specificity. However, deletion of dps changed the profile of proteins in starved cells, a SELEX screen revealed genomic regions preferentially bound in vitro, and a certain affinity of Dps for artificial branched molecules was detected by atomic force microscopy. Here we report a non-random distribution of Dps binding sites across the bacterial chromosome in exponentially growing cells and show their enrichment with inverted repeats prone to form secondary structures. We found that the Dps-bound regions overlap with sites occupied by other nucleoid proteins, and contain overrepresented motifs typical for their consensus sequences. Of the two types of genomic domains with extensive protein occupancy, which can be highly expressed or transcriptionally silent, only those that are enriched with RNA polymerase molecules were preferentially occupied by Dps. In the dps-null mutant we, therefore, observed a differentially altered expression of several targeted genes and found suppressed transcription from the dps promoter. In most cases this can be explained by the relieved interference with Dps for nucleoid proteins exploiting sequence-specific modes of DNA binding. Thus, protecting bacterial cells from different stresses during exponential growth, Dps can modulate transcriptional integrity of the bacterial chromosome, hampering RNA biosynthesis from some genes via competition with RNA polymerase or, vice versa, competing with inhibitors to activate transcription. PMID:28800583
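The abstract above reports that Dps-bound regions are enriched with inverted repeats prone to form secondary structures. As a rough illustration of what an inverted repeat is at the sequence level, the following Python sketch lists pairs of short arms in which the second arm is the reverse complement of the first, separated by a short loop. This is not the authors' analysis pipeline; the function names, the arm length, the maximum loop length, and the example sequence are all assumptions chosen for illustration.

```python
# Hypothetical helper names; parameters (arm=4, max_loop=6) are illustrative assumptions.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq):
    """Reverse complement of an upper-case A/C/G/T string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def find_inverted_repeats(seq, arm=4, max_loop=6):
    """Return (left_start, right_start, left_arm) for each inverted repeat.

    A hit is a left arm of length `arm` followed, after a loop of
    0..max_loop bases, by the reverse complement of that arm.
    Positions are 0-based.
    """
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 2 * arm + 1):
        left = seq[i:i + arm]
        if any(base not in COMPLEMENT for base in left):
            continue  # skip arms containing N or other ambiguity codes
        target = reverse_complement(left)
        for loop in range(max_loop + 1):
            j = i + arm + loop
            if j + arm > len(seq):
                break
            if seq[j:j + arm] == target:
                hits.append((i, j, left))
    return hits

if __name__ == "__main__":
    # Made-up sequence: GCTG ... CAGC can pair as a hairpin stem.
    print(find_inverted_repeats("AAGCTGTTTTCAGCAA"))
```

A genome-scale enrichment analysis would additionally need a background model (for example, shuffled or unbound control regions) to decide whether the observed count is higher than expected by chance.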
Roux-Rouquie, Magali; Marilley, Monique
2000-01-01
We have modeled local DNA sequence parameters to search for DNA architectural motifs involved in transcription regulation and promotion within the Xenopus laevis ribosomal gene promoter and the intergenic spacer (IGS) sequences. The IGS was found to be shaped into distinct topological domains. First, intrinsic bends split the IGS into domains of common but different helical features. Local parameters at inter-domain junctions exhibit a high variability with respect to intrinsic curvature, bendability and thermal stability. Secondly, the repeated sequence blocks of the IGS exhibit right-handed supercoiled structures which could be related to their enhancer properties. Thirdly, the gene promoter presents both inherent curvature and minor groove narrowing which may be viewed as motifs of a structural code for protein recognition and binding. Such pre-existing deformations could simply be remodeled during the binding of the transcription complex. Alternatively, these deformations could pre-shape the promoter in such a way that further remodeling is facilitated. Mutations shown to abolish promoter curvature as well as intrinsic minor groove narrowing, in a variant which maintained full transcriptional activity, bring circumstantial evidence for structurally-preorganized motifs in relation to transcription regulation and promotion. Using well documented X.laevis rDNA regulatory sequences we showed that computer modeling may be of invaluable assistance in assessing encrypted architectural motifs. The evidence of these DNA topological motifs with respect to the concept of structural code is discussed. PMID:10982860
Structure of p73 DNA-binding domain tetramer modulates p73 transactivation
Ethayathulla, Abdul S.; Tse, Pui-Wah; Monti, Paola; Nguyen, Sonha; Inga, Alberto; Fronza, Gilberto; Viadiu, Hector
2012-01-01
The transcription factor p73 triggers developmental pathways and overlaps stress-induced p53 transcriptional pathways. How p53-family response elements determine and regulate transcriptional specificity remains an unsolved problem. In this work, we have determined the first crystal structures of p73 DNA-binding domain tetramer bound to response elements with spacers of different length. The structure and function of the adaptable tetramer are determined by the distance between two half-sites. The structures with zero and one base-pair spacers show compact p73 DNA-binding domain tetramers with large tetramerization interfaces; a two base-pair spacer results in DNA unwinding and a smaller tetramerization interface, whereas a four base-pair spacer hinders tetramerization. Functionally, p73 is more sensitive to spacer length than p53, with one base-pair spacer reducing 90% of transactivation activity and longer spacers reducing transactivation to basal levels. Our results establish the quaternary structure of the p73 DNA-binding domain required as a scaffold to promote transactivation. PMID:22474346
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-11-16
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Different modes of interaction by TIAR and HuR with target RNA and DNA
Kim, Henry S.; Wilce, Matthew C. J.; Yoga, Yano M. K.; Pendini, Nicole R.; Gunzburg, Menachem J.; Cowieson, Nathan P.; Wilson, Gerald M.; Williams, Bryan R. G.; Gorospe, Myriam; Wilce, Jacqueline A.
2011-01-01
TIAR and HuR are mRNA-binding proteins that play important roles in the regulation of translation. They both possess three RNA recognition motifs (RRMs) and bind to AU-rich elements (AREs), with seemingly overlapping specificity. Here we show using SPR that TIAR and HuR bind to both U-rich and AU-rich RNA in the nanomolar range, with higher overall affinity for U-rich RNA. However, the higher affinity for U-rich sequences is mainly due to faster association with U-rich RNA, which we propose is a reflection of the higher probability of association. Differences between TIAR and HuR are observed in their modes of binding to RNA. TIAR is able to bind deoxy-oligonucleotides with nanomolar affinity, whereas HuR affinity is reduced to a micromolar level. Studies with U-rich DNA reveal that TIAR binding depends less on the 2′-hydroxyl group of RNA than HuR binding. Finally we show that SAXS data, recorded for the first two domains of TIAR in complex with RNA, are more consistent with a flexible, elongated shape and not the compact shape that the first two domains of Hu proteins adopt upon binding to RNA. We thus propose that these triple-RRM proteins, which compete for the same binding sites in cells, interact with their targets in fundamentally different ways. PMID:21233170
Different modes of interaction by TIAR and HuR with target RNA and DNA.
Kim, Henry S; Wilce, Matthew C J; Yoga, Yano M K; Pendini, Nicole R; Gunzburg, Menachem J; Cowieson, Nathan P; Wilson, Gerald M; Williams, Bryan R G; Gorospe, Myriam; Wilce, Jacqueline A
2011-02-01
TIAR and HuR are mRNA-binding proteins that play important roles in the regulation of translation. They both possess three RNA recognition motifs (RRMs) and bind to AU-rich elements (AREs), with seemingly overlapping specificity. Here we show using SPR that TIAR and HuR bind to both U-rich and AU-rich RNA in the nanomolar range, with higher overall affinity for U-rich RNA. However, the higher affinity for U-rich sequences is mainly due to faster association with U-rich RNA, which we propose is a reflection of the higher probability of association. Differences between TIAR and HuR are observed in their modes of binding to RNA. TIAR is able to bind deoxy-oligonucleotides with nanomolar affinity, whereas HuR affinity is reduced to a micromolar level. Studies with U-rich DNA reveal that TIAR binding depends less on the 2'-hydroxyl group of RNA than HuR binding. Finally we show that SAXS data, recorded for the first two domains of TIAR in complex with RNA, are more consistent with a flexible, elongated shape and not the compact shape that the first two domains of Hu proteins adopt upon binding to RNA. We thus propose that these triple-RRM proteins, which compete for the same binding sites in cells, interact with their targets in fundamentally different ways.
Vitolo, Joseph M.; Thiriet, Christophe; Hayes, Jeffrey J.
2000-01-01
Reconstitution of a DNA fragment containing a Xenopus borealis somatic type 5S rRNA gene into a nucleosome greatly restricts the binding of transcription factor IIIA (TFIIIA) to its cognate DNA sequence within the internal promoter of the gene. Removal of all core histone tail domains by limited trypsin proteolysis or acetylation of the core histone tails significantly relieves this inhibition and allows TFIIIA to exhibit high-affinity binding to nucleosomal DNA. Since only a single tail or a subset of tails may be primarily responsible for this effect, we determined whether removal of the individual tail domains of the H2A-H2B dimer or the H3-H4 tetramer affects TFIIIA binding to its cognate DNA site within the 5S nucleosome in vitro. The results show that the tail domains of H3 and H4, but not those of H2A and/or H2B, directly modulate the ability of TFIIIA to bind nucleosomal DNA. In vitro transcription assays carried out with nucleosomal templates lacking individual tail domains show that transcription efficiency parallels the binding of TFIIIA. In addition, we show that the stoichiometry of core histones within the 5S DNA-core histone-TFIIIA triple complex is not changed upon TFIIIA association. Thus, TFIIIA binding occurs by displacement of H2A-H2B–DNA contacts but without complete loss of the dimer from the nucleoprotein complex. These data, coupled with previous reports (M. Vettese-Dadey, P. A. Grant, T. R. Hebbes, C. Crane-Robinson, C. D. Allis, and J. L. Workman, EMBO J. 15:2508–2518, 1996; L. Howe, T. A. Ranalli, C. D. Allis, and J. Ausio, J. Biol. Chem. 273:20693–20696, 1998), suggest that the H3/H4 tails are the primary arbiters of transcription factor access to intranucleosomal DNA. PMID:10688663
Crystal Structures of Poly(ADP-ribose) Polymerase-1 (PARP-1) Zinc Fingers Bound to DNA
DOE Office of Scientific and Technical Information (OSTI.GOV)
M Langelier; J Planck; S Roy
2011-12-31
Poly(ADP-ribose) polymerase-1 (PARP-1) has two homologous zinc finger domains, Zn1 and Zn2, that bind to a variety of DNA structures to stimulate poly(ADP-ribose) synthesis activity and to mediate PARP-1 interaction with chromatin. The structural basis for interaction with DNA is unknown, which limits our understanding of PARP-1 regulation and involvement in DNA repair and transcription. Here, we have determined crystal structures for the individual Zn1 and Zn2 domains in complex with a DNA double strand break, providing the first views of PARP-1 zinc fingers bound to DNA. The Zn1-DNA and Zn2-DNA structures establish a novel, bipartite mode of sequence-independent DNA interaction that engages a continuous region of the phosphodiester backbone and the hydrophobic faces of exposed nucleotide bases. Biochemical and cell biological analyses indicate that the Zn1 and Zn2 domains perform distinct functions. The Zn2 domain exhibits high binding affinity to DNA compared with the Zn1 domain. However, the Zn1 domain is essential for DNA-dependent PARP-1 activity in vitro and in vivo, whereas the Zn2 domain is not strictly required. Structural differences between the Zn1-DNA and Zn2-DNA complexes, combined with mutational and structural analysis, indicate that a specialized region of the Zn1 domain is re-configured through the hydrophobic interaction with exposed nucleotide bases to initiate PARP-1 activation.
Brady, J; Radonovich, M; Thoren, M; Das, G; Salzman, N P
1984-01-01
We have previously identified an 11-base DNA sequence, 5'-G-G-T-A-C-C-T-A-A-C-C-3' (simian virus 40 [SV40] map position 294 to 304), which is important in the control of SV40 late RNA expression in vitro and in vivo (Brady et al., Cell 31:625-633, 1982). We report here the identification of another domain of the SV40 late promoter. A series of mutants with deletions extending from SV40 map position 0 to 300 was prepared by nuclease BAL 31 treatment. The cloned templates were then analyzed for efficiency and accuracy of late SV40 RNA expression in the Manley in vitro transcription system. Our studies showed that, in addition to the promoter domain near map position 300, there are essential DNA sequences between nucleotide positions 74 and 95 that are required for efficient expression of late SV40 RNA. Included in this SV40 DNA sequence were two of the six GGGCGG SV40 repeat sequences and an 11-nucleotide segment which showed strong homology with the upstream sequences required for the efficient in vitro and in vivo expression of the histone H2A gene. This upstream promoter sequence supported transcription with the same efficiency even when it was moved 72 nucleotides closer to the major late cap site. In vitro promoter competition analysis demonstrated that the upstream promoter sequence, independent of the 294 to 304 promoter element, is capable of binding polymerase-transcription factors required for SV40 late gene transcription. Finally, we show that DNA sequences which control the specificity of RNA initiation at nucleotide 325 lie downstream of map position 294. PMID:6321950
Van Damme, Els J. M.; Nakamura-Tsuruta, Sachiko; Smith, David F.; Ongenaert, Maté; Winter, Harry C.; Rougé, Pierre; Goldstein, Irwin J.; Mo, Hanqing; Kominami, Junko; Culerrier, Raphaël; Barre, Annick; Hirabayashi, Jun; Peumans, Willy J.
2007-01-01
A re-investigation of the occurrence and taxonomic distribution of proteins built up of protomers consisting of two tandem arrayed domains equivalent to the GNA [Galanthus nivalis (snowdrop) agglutinin] revealed that these are widespread among monocotyledonous plants. Phylogenetic analysis of the available sequences indicated that these proteins do not represent a monophylogenetic group but most probably result from multiple independent domain duplication/in tandem insertion events. To corroborate the relationship between inter-domain sequence divergence and the widening of specificity range, a detailed comparative analysis was made of the sequences and specificity of a set of two-domain GNA-related lectins. Glycan microarray analyses, frontal affinity chromatography and surface plasmon resonance measurements demonstrated that the two-domain GNA-related lectins acquired a marked diversity in carbohydrate-binding specificity that strikingly contrasts the canonical exclusive specificity of their single domain counterparts towards mannose. Moreover, it appears that most two-domain GNA-related lectins interact with both high mannose and complex N-glycans and that this dual specificity relies on the simultaneous presence of at least two different independently acting binding sites. The combined phylogenetic, specificity and structural data strongly suggest that plants used domain duplication followed by divergent evolution as a mechanism to generate multispecific lectins from a single mannose-binding domain. Taking into account that the shift in specificity of some binding sites from high mannose to complex type N-glycans implies that the two-domain GNA-related lectins are primarily directed against typical animal glycans, it is tempting to speculate that plants developed two-domain GNA-related lectins for defence purposes. PMID:17288538
Sidell, Neil; Mathad, Raveendra I.; Shu, Feng-jue; Zhang, Zhenjiang; Kallen, Caleb B.; Yang, Danzhou
2011-01-01
DNA-intercalating molecules can impair DNA replication, DNA repair, and gene transcription. We previously demonstrated that XR5944, a DNA bis-intercalator, specifically blocks binding of estrogen receptor-α (ERα) to the consensus estrogen response element (ERE). The consensus ERE sequence is AGGTCAnnnTGACCT, where nnn is known as the tri-nucleotide spacer. Recent work has shown that the tri-nucleotide spacer can modulate ERα-ERE binding affinity and ligand-mediated transcriptional responses. To further understand the mechanism by which XR5944 inhibits ERα-ERE binding, we tested its ability to interact with consensus EREs with variable tri-nucleotide spacer sequences and with natural but non-consensus ERE sequences using one dimensional nuclear magnetic resonance (1D 1H NMR) titration studies. We found that the tri-nucleotide spacer sequence significantly modulates the binding of XR5944 to EREs. Of the sequences that were tested, EREs with CGG and AGG spacers showed the best binding specificity with XR5944, while those spaced with TTT demonstrated the least specific binding. The binding stoichiometry of XR5944 with EREs was 2:1, which can explain why the spacer influences the drug-DNA interaction; each XR5944 spans four nucleotides (including portions of the spacer) when intercalating with DNA. To validate our NMR results, we conducted functional studies using reporter constructs containing consensus EREs with tri-nucleotide spacers CGG, CTG, and TTT. Results of reporter assays in MCF-7 cells indicated that XR5944 was significantly more potent in inhibiting the activity of CGG- than TTT-spaced EREs, consistent with our NMR results. Taken together, these findings predict that the anti-estrogenic effects of XR5944 will depend not only on ERE half-site composition but also on the tri-nucleotide spacer sequence of EREs located in the promoters of estrogen-responsive genes. PMID:21333738
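The consensus ERE quoted in the abstract above, AGGTCAnnnTGACCT, lends itself to a short pattern-matching illustration. The Python sketch below locates consensus EREs in a DNA string and reports the tri-nucleotide spacer of each one, which is the feature the study found to modulate XR5944 binding. The function name `find_eres` and the example sequence are hypothetical and do not come from the study; the sketch only handles the exact consensus, not the natural non-consensus EREs that were also tested.

```python
import re

# Consensus ERE from the abstract: AGGTCA nnn TGACCT (nnn = tri-nucleotide spacer).
# A lookahead is used so that overlapping sites would also be reported.
ERE_PATTERN = re.compile(r"(?=AGGTCA([ACGT]{3})TGACCT)")

def find_eres(sequence):
    """Return (start, spacer) for every consensus ERE in `sequence`.

    `start` is the 0-based position of the first base of the left half-site.
    """
    seq = sequence.upper()
    return [(m.start(), m.group(1)) for m in ERE_PATTERN.finditer(seq)]

if __name__ == "__main__":
    # Made-up sequence with one CGG-spaced and one TTT-spaced ERE.
    example = "ttAGGTCACGGTGACCTaaAGGTCATTTTGACCTgg"
    for start, spacer in find_eres(example):
        print(start, spacer)
```

Grouping the hits by spacer (for example CGG versus TTT) would be one simple way to flag which predicted sites the abstract suggests are more or less sensitive to the drug, though any such prediction would need experimental confirmation.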
He, Qiye; Johnston, Jeff; Zeitlinger, Julia
2014-01-01
Understanding how eukaryotic enhancers are bound and regulated by specific combinations of transcription factors is still a major challenge. To better map transcription factor binding genome-wide at nucleotide resolution in vivo, we have developed a robust ChIP-exo protocol called ChIP experiments with nucleotide resolution through exonuclease, unique barcode and single ligation (ChIP-nexus), which utilizes an efficient DNA self-circularization step during library preparation. Application of ChIP-nexus to four proteins—human TBP and Drosophila NFkB, Twist and Max— demonstrates that it outperforms existing ChIP protocols in resolution and specificity, pinpoints relevant binding sites within enhancers containing multiple binding motifs and allows the analysis of in vivo binding specificities. Notably, we show that Max frequently interacts with DNA sequences next to its motif, and that this binding pattern correlates with local DNA sequence features such as DNA shape. ChIP-nexus will be broadly applicable to studying in vivo transcription factor binding specificity and its relationship to cis-regulatory changes in humans and model organisms. PMID:25751057
APE1 incision activity at abasic sites in tandem repeat sequences.
Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M
2014-05-29
Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.
The EWS–Oct-4 fusion gene encodes a transforming gene
Lee, Jungwoon; Kim, Ja Young; Kang, In Young; Kim, Hye Kyoung; Han, Yong-Mahn; Kim, Jungho
2007-01-01
The t(6;22)(p21;q12) translocation associated with human bone and soft-tissue tumours results in a chimaeric molecule fusing the NTD (N-terminal domain) of the EWS (Ewing's sarcoma) gene to the CTD (C-terminal domain) of the Oct-4 (octamer-4) embryonic gene. Since the N-terminal domains of EWS and Oct-4 are structurally different, in the present study we have assessed the functional consequences of the EWS–Oct-4 fusion. We find that this chimaeric gene encodes a nuclear protein which binds DNA with the same sequence specificity as the parental Oct-4 protein. Comparison of the transactivation properties of EWS–Oct-4 and Oct-4 indicates that the former has higher transactivation activity for a known target reporter gene containing Oct-4 binding. Deletion analysis of the functional domains of EWS–Oct-4 indicates that the EWS (NTD), the POU domain and the CTD of EWS–Oct-4 are necessary for full transactivation potential. EWS–Oct-4 induced the expression of fgf-4 (fibroblast growth factor 4) and nanog, which are potent mitogens as well as Oct-4 downstream target genes whose promoters contain potential Oct-4-binding sites. Finally, ectopic expression of EWS–Oct-4 in Oct-4-null ZHBTc4 ES (embryonic stem) cells resulted in increased tumorigenic growth potential in nude mice. These results suggest that the oncogenic effect of the t(6;22) translocation is due to the EWS–Oct-4 chimaeric protein and that fusion of the EWS NTD to the Oct-4 DNA-binding domain produces a transforming chimaeric product. PMID:17564582
Substrate sequence selectivity of APOBEC3A implicates intra-DNA interactions.
Silvas, Tania V; Hou, Shurong; Myint, Wazo; Nalivaika, Ellen; Somasundaran, Mohan; Kelch, Brian A; Matsuo, Hiroshi; Kurt Yilmaz, Nese; Schiffer, Celia A
2018-05-14
The APOBEC3 (A3) family of human cytidine deaminases is renowned for providing a first line of defense against many exogenous and endogenous retroviruses. However, the ability of these proteins to deaminate deoxycytidines in ssDNA makes A3s a double-edged sword. When overexpressed, A3s can mutate endogenous genomic DNA, resulting in a variety of cancers. Although the sequence context for mutating DNA varies among A3s, the mechanism for substrate sequence specificity is not well understood. To characterize substrate specificity of A3A, a systematic approach was used to quantify the affinity for substrate as a function of sequence context, length, secondary structure, and solution pH. We identified the A3A ssDNA binding motif as (T/C)TC(A/G), which correlated with enzymatic activity. We also validated that A3A binds RNA in a sequence-specific manner. A3A bound more tightly to the substrate binding motif within a hairpin loop than within a linear oligonucleotide, suggesting A3A affinity is modulated by substrate structure. Based on these findings and previously published A3A-ssDNA co-crystal structures, we propose a new model with intra-DNA interactions for the molecular mechanism underlying A3A sequence preference. Overall, the sequence and structural preferences identified for A3A lead to a new paradigm for identifying A3A's involvement in mutation of endogenous or exogenous DNA.
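The binding motif reported in the abstract above, (T/C)TC(A/G), can be written as the regular expression `[TC]TC[AG]`. The Python sketch below scans a single-stranded DNA sequence for that motif; it is only an illustration of the motif notation, not the authors' method, and the function name `find_a3a_motifs` and the example sequence are hypothetical.

```python
import re

# (T/C)TC(A/G) from the abstract, written as a character-class regex.
# The lookahead reports overlapping occurrences as well.
A3A_MOTIF = re.compile(r"(?=([TC]TC[AG]))")

def find_a3a_motifs(ssdna):
    """Return (start, matched_tetramer) for each motif occurrence (0-based start)."""
    seq = ssdna.upper()
    return [(m.start(), m.group(1)) for m in A3A_MOTIF.finditer(seq)]

if __name__ == "__main__":
    # Made-up oligonucleotide containing TTCA and CTCG.
    print(find_a3a_motifs("ATTCAGCTCG"))
```

Because the abstract reports that affinity also depends on whether the motif sits in a hairpin loop, a sequence match alone would not be enough to rank candidate sites; structural context would have to be considered separately.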
Burger, C; Fanning, E
1983-04-15
Large tumor antigen (T antigen) occurs in at least three different oligomeric subclasses in cells infected or transformed by simian virus 40 (SV40): 5-7 S, 14-16 S, and 23-25 S. The 23-25 S form is complexed with a host phosphoprotein (p53). The DNA binding properties of these three subclasses of T antigen from nine different cell lines and free p53 protein were compared using an immunoprecipitation assay. All three subclasses of T antigen bound specifically to SV40 DNA sequences near the origin of replication. However, the DNA binding activity varied between different cell lines over a 40- to 50-fold range. The 23-25 S and 14-16 S forms from most of the cell lines tested bound much less SV40 origin DNA than 5-7 S T antigen. The free p53 phosphoprotein did not bind specifically to any SV40 DNA sequences.
Structure-Function Relationships in Human Testis-determining Factor SRY
Racca, Joseph D.; Chen, Yen-Shan; Maloy, James D.; Wickramasinghe, Nalinda; Phillips, Nelson B.; Weiss, Michael A.
2014-01-01
Human testis determination is initiated by SRY, a Y-encoded architectural transcription factor. Mutations in SRY cause 46 XY gonadal dysgenesis with female somatic phenotype (Swyer syndrome) and confer a high risk of malignancy (gonadoblastoma). Such mutations cluster in the SRY high mobility group (HMG) box, a conserved motif of specific DNA binding and bending. To explore structure-function relationships, we constructed all possible substitutions at a site of clinical mutation (W70L). Our studies thus focused on a core aromatic residue (position 15 of the consensus HMG box) that is invariant among SRY-related HMG box transcription factors (the SOX family) and conserved as aromatic (Phe or Tyr) among other sequence-specific boxes. In a yeast one-hybrid system sensitive to specific SRY-DNA binding, the variant domains exhibited reduced (Phe and Tyr) or absent activity (the remaining 17 substitutions). Representative nonpolar variants with partial or absent activity (Tyr, Phe, Leu, and Ala in order of decreasing side-chain volume) were chosen for study in vitro and in mammalian cell culture. The clinical mutation (Leu) was found to markedly impair multiple biochemical and cellular activities as respectively probed through the following: (i) in vitro assays of specific DNA binding and protein stability, and (ii) cell culture-based assays of proteasomal degradation, nuclear import, enhancer DNA occupancy, and SRY-dependent transcriptional activation. Surprisingly, however, DNA bending is robust to this or the related Ala substitution that profoundly impairs box stability. Together, our findings demonstrate that the folding, trafficking, and gene-regulatory function of SRY require an invariant aromatic “buttress” beneath its specific DNA-bending surface. PMID:25258310
Jakubec, David; Laskowski, Roman A.; Vondrasek, Jiri
2016-01-01
Decades of intensive experimental studies of the recognition of DNA sequences by proteins have provided us with a view of a diverse and complicated world in which few to no features are shared between individual DNA-binding protein families. The originally conceived direct readout of DNA residue sequences by amino acid side chains offers very limited capacity for sequence recognition, while the effects of the dynamic properties of the interacting partners remain difficult to quantify and almost impossible to generalise. In this work we investigated the energetic characteristics of all DNA residue—amino acid side chain combinations in the conformations found at the interaction interface in a very large set of protein—DNA complexes by the means of empirical potential-based calculations. General specificity-defining criteria were derived and utilised to look beyond the binding motifs considered in previous studies. Linking energetic favourability to the observed geometrical preferences, our approach reveals several additional amino acid motifs which can distinguish between individual DNA bases. Our results remained valid in environments with various dielectric properties. PMID:27384774
Woo, Hye Ryun; Dittmer, Travis A.; Richards, Eric J.
2008-01-01
Methylcytosine-binding proteins decipher the epigenetic information encoded by DNA methylation and provide a link between DNA methylation, modification of chromatin structure, and gene silencing. VARIANT IN METHYLATION 1 (VIM1) encodes an SRA (SET- and RING-associated) domain methylcytosine-binding protein in Arabidopsis thaliana, and loss of VIM1 function causes centromere DNA hypomethylation and centromeric heterochromatin decondensation in interphase. In the Arabidopsis genome, there are five VIM genes that share very high sequence similarity and encode proteins containing a PHD domain, two RING domains, and an SRA domain. To gain further insight into the function and potential redundancy among the VIM proteins, we investigated strains combining different vim mutations and transgenic vim knock-down lines that down-regulate multiple VIM family genes. The vim1 vim3 double mutant and the transgenic vim knock-down lines showed decreased DNA methylation primarily at CpG sites in genic regions, as well as repeated sequences in heterochromatic regions. In addition, transcriptional silencing was released in these plants at most heterochromatin regions examined. Interestingly, the vim1 vim3 mutant and vim knock-down lines gained ectopic CpHpH methylation in the 5S rRNA genes against a background of CpG hypomethylation. The vim1 vim2 vim3 triple mutant displayed abnormal morphological phenotypes including late flowering, which is associated with DNA hypomethylation of the 5′ region of FWA and release of FWA gene silencing. Our findings demonstrate that VIM1, VIM2, and VIM3 have overlapping functions in maintenance of global CpG methylation and epigenetic transcriptional silencing. PMID:18704160
Roles of JnRAP2.6-like from the transition zone of black walnut in hormone signaling
Zhonglian Huang; Peng Zhao; Jose Medina; Richard Meilan; Keith Woeste
2013-01-01
An EST sequence, designated JnRAP2-like, was isolated from tissue at the heartwood/sapwood transition zone (TZ) in black walnut (Juglans nigra L.). The deduced amino acid sequence of JnRAP2-like protein consists of a single AP2-containing domain with significant similarity to conserved AP2/ERF DNA-binding domains in other...
Bonham, Andrew J.; Wenta, Nikola; Osslund, Leah M.; Prussin, Aaron J.; Vinkemeier, Uwe; Reich, Norbert O.
2013-01-01
The DNA-binding specificity and affinity of the dimeric human transcription factor (TF) STAT1 were assessed by total internal reflectance fluorescence protein-binding microarrays (TIRF-PBM) to evaluate the effects of protein phosphorylation, higher-order polymerization and small-molecule inhibition. Active, phosphorylated STAT1 showed binding preferences consistent with prior characterization, whereas unphosphorylated STAT1 showed a weak-binding preference for one-half of the GAS consensus site, consistent with recent models of STAT1 structure and function in response to phosphorylation. This altered-binding preference was further tested by use of the inhibitor LLL3, which we show to disrupt STAT1 binding in a sequence-dependent fashion. To determine if this sequence-dependence is specific to STAT1 and not a general feature of human TF biology, the TF Myc/Max was analysed and tested with the inhibitor Mycro3. Myc/Max inhibition by Mycro3 is sequence independent, suggesting that the sequence-dependent inhibition of STAT1 may be specific to this system and a useful target for future inhibitor design. PMID:23180800
An immunoassay for the study of DNA-binding activities of herpes simplex virus protein ICP8.
Lee, C K; Knipe, D M
1985-06-01
An immunoassay was used to examine the interaction between a herpes simplex virus protein, ICP8, and various types of DNA. The advantage of this assay is that the protein is not subjected to harsh purification procedures. We characterized the binding of ICP8 to both single-stranded (ss) and double-stranded (ds) DNA. ICP8 bound ss DNA fivefold more efficiently than ds DNA, and both binding activities were most efficient in 150 mM NaCl. Two lines of evidence indicate that the binding activities were not identical: (i) ds DNA failed to compete with ss DNA binding even with a large excess of ds DNA; (ii) Scatchard plots of DNA binding with various amounts of DNA were fundamentally different for ss DNA and ds DNA. However, the two activities were related in that ss DNA efficiently competed with the binding of ds DNA. We conclude that the ds DNA-binding activity of ICP8 is probably distinct from the ss DNA-binding activity. No evidence for sequence-specific ds DNA binding was obtained for either the entire herpes simplex virus genome or cloned viral sequences.
Radiation-induced tetramer-to-dimer transition of Escherichia coli lactose repressor
DOE Office of Scientific and Technical Information (OSTI.GOV)
Goffinont, S.; Davidkova, M.; Spotheim-Maurizot, M., E-mail: spotheim@cnrs-orleans.fr
2009-08-21
The wild-type lactose repressor of Escherichia coli is a tetrameric protein formed by two identical dimers. They are associated via a C-terminal 4-helix bundle (called the tetramerization domain) whose stability is ensured by the interaction of leucine zipper motifs. Upon in vitro γ-irradiation, the repressor loses its ability to bind the operator DNA sequence due to damage to its DNA-binding domains. Using an engineered dimeric repressor for comparison, we show here that irradiation also induces a change in the repressor oligomerisation state from tetramer to dimer. The splitting of the tetramer into dimers can result from the oxidation of the leucine residues of the tetramerization domain.
Liu, Qiang; Su, Shifeng; Blackwelder, Amanda J.; Minges, John T.; Wilson, Elizabeth M.
2011-01-01
Male sex development and growth occur in response to high affinity androgen binding to the androgen receptor (AR). In contrast to complete amino acid sequence conservation in the AR DNA and ligand binding domains among mammals, a primate-specific difference in the AR NH2-terminal region that regulates the NH2- and carboxyl-terminal (N/C) interaction enables direct binding to melanoma antigen-A11 (MAGE-11), an AR coregulator that is also primate-specific. Human, mouse, and rat AR share the same NH2-terminal 23FQNLF27 sequence that mediates the androgen-dependent N/C interaction. However, the mouse and rat AR FXXLF motif is flanked by Ala33 that evolved to Val33 in primates. Human AR Val33 was required to interact directly with MAGE-11 and for the inhibitory effect of the AR N/C interaction on activation function 2 that was relieved by MAGE-11. The functional importance of MAGE-11 was indicated by decreased human AR regulation of an androgen-dependent endogenous gene using lentivirus short hairpin RNAs and by the greater transcriptional strength of human compared with mouse AR. MAGE-11 increased progesterone and glucocorticoid receptor activity independently of binding an FXXLF motif by interacting with p300 and p160 coactivators. We conclude that the coevolution of the AR NH2-terminal sequence and MAGE-11 expression among primates provides increased regulatory control over activation domain dominance. Primate-specific expression of MAGE-11 results in greater steroid receptor transcriptional activity through direct interactions with the human AR FXXLF motif region and indirectly through steroid receptor-associated p300 and p160 coactivators. PMID:21730049
DNA sequence+shape kernel enables alignment-free modeling of transcription factor binding.
Ma, Wenxiu; Yang, Lin; Rohs, Remo; Noble, William Stafford
2017-10-01
Transcription factors (TFs) bind to specific DNA sequence motifs. Several lines of evidence suggest that TF-DNA binding is mediated in part by properties of the local DNA shape: the width of the minor groove, the relative orientations of adjacent base pairs, etc. Several methods have been developed to jointly account for DNA sequence and shape properties in predicting TF binding affinity. However, a limitation of these methods is that they typically require a training set of aligned TF binding sites. We describe a sequence + shape kernel that leverages DNA sequence and shape information to better understand protein-DNA binding preference and affinity. This kernel extends an existing class of k-mer based sequence kernels, based on the recently described di-mismatch kernel. Using three in vitro benchmark datasets, derived from universal protein binding microarrays (uPBMs), genomic context PBMs (gcPBMs) and SELEX-seq data, we demonstrate that incorporating DNA shape information improves our ability to predict protein-DNA binding affinity. In particular, we observe that (i) the k-spectrum + shape model performs better than the classical k-spectrum kernel, particularly for small k values; (ii) the di-mismatch kernel performs better than the k-mer kernel, for larger k; and (iii) the di-mismatch + shape kernel performs better than the di-mismatch kernel for intermediate k values. The software is available at https://bitbucket.org/wenxiu/sequence-shape.git. Contact: rohs@usc.edu or william-noble@uw.edu. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
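For readers unfamiliar with the classical k-spectrum kernel that the abstract above uses as its baseline, the following is a minimal sketch, not the authors' software. It assumes the textbook definition (the inner product of overlapping k-mer count vectors) and omits the shape features and the di-mismatch extension entirely; the function names `kmer_counts` and `spectrum_kernel` are hypothetical.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every overlapping k-mer in a DNA sequence (case-insensitive)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(a, b, k):
    """Inner product of the k-mer count vectors of sequences a and b.

    Illustrative only: no shape features and no mismatch handling.
    """
    counts_a = kmer_counts(a, k)
    counts_b = kmer_counts(b, k)
    # Counter returns 0 for k-mers absent from b, so only shared k-mers contribute.
    return sum(n * counts_b[kmer] for kmer, n in counts_a.items())
```

Because the kernel compares k-mer content rather than positions, it needs no alignment of the binding sites, which is the property the abstract emphasises.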
Alcántara, Cristina; Sarmiento-Rubiano, Luz Adriana; Monedero, Vicente; Deutscher, Josef; Pérez-Martínez, Gaspar; Yebra, María J.
2008-01-01
Sequence analysis of the five genes (gutRMCBA) downstream from the previously described sorbitol-6-phosphate dehydrogenase-encoding Lactobacillus casei gutF gene revealed that they constitute a sorbitol (glucitol) utilization operon. The gutRM genes encode putative regulators, while the gutCBA genes encode the EIIC, EIIBC, and EIIA proteins of a phosphoenolpyruvate-dependent sorbitol phosphotransferase system (PTSGut). The gut operon is transcribed as a polycistronic gutFRMCBA messenger, the expression of which is induced by sorbitol and repressed by glucose. gutR encodes a transcriptional regulator with two PTS-regulated domains, a galactitol-specific EIIB-like domain (EIIBGat domain) and a mannitol/fructose-specific EIIA-like domain (EIIAMtl domain). Its inactivation abolished gut operon transcription and sorbitol uptake, indicating that it acts as a transcriptional activator. In contrast, cells carrying a gutB mutation expressed the gut operon constitutively, but they failed to transport sorbitol, indicating that EIIBCGut negatively regulates GutR. A footprint analysis showed that GutR binds to a 35-bp sequence upstream from the gut promoter. A sequence comparison with the presumed promoter region of gut operons from various firmicutes revealed a GutR consensus motif that includes an inverted repeat. The regulation mechanism of the L. casei gut operon is therefore likely to be operative in other firmicutes. Finally, gutM codes for a conserved protein of unknown function present in all sequenced gut operons. A gutM mutant, the first constructed in a firmicute, showed drastically reduced gut operon expression and sorbitol uptake, indicating a regulatory role also for GutM. PMID:18676710
Lopez, Christopher R; Singh, Shivani; Hambarde, Shashank; Griffin, Wezley C; Gao, Jun; Chib, Shubeena; Yu, Yang; Ira, Grzegorz; Raney, Kevin D; Kim, Nayun
2017-06-02
G-quadruplex or G4 DNA is a non-B secondary DNA structure consisting of a stacked array of guanine-quartets that can disrupt critical cellular functions such as replication and transcription. When sequences that can adopt non-B structures, including G4 DNA, are located within actively transcribed genes, the reshaping of DNA topology necessary for the transcription process stimulates secondary structure formation, thereby amplifying the potential for genome instability. Using a reporter assay designed to study G4-induced recombination in the context of an actively transcribed locus in Saccharomyces cerevisiae, we tested whether co-transcriptional activator Sub1, recently identified as a G4-binding factor, contributes to genome maintenance at G4-forming sequences. Our data indicate that, upon Sub1-disruption, genome instability linked to co-transcriptionally formed G4 DNA in Top1-deficient cells is significantly augmented and that its highly conserved DNA binding domain or the human homolog PC4 is sufficient to suppress G4-associated genome instability. We also show that Sub1 interacts specifically with co-transcriptionally formed G4 DNA in vivo and that yeast cells become highly sensitive to G4-stabilizing chemical ligands upon loss of Sub1. Finally, we demonstrate the physical and genetic interaction of Sub1 with the G4-resolving helicase Pif1, suggesting a possible mechanism by which Sub1 suppresses instability at G4 DNA. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Teh, Huey Fang; Peh, Wendy Y X; Su, Xiaodi; Thomsen, Jane S
2007-02-27
Specific protein-DNA interactions play a central role in transcription and other biological processes. A comprehensive characterization of protein-DNA interactions should include information about binding affinity, kinetics, sequence specificity, and binding stoichiometry. In this study, we have used surface plasmon resonance spectroscopy (SPR) to study the interactions between human estrogen receptors (ER, alpha and beta subtypes) and estrogen response elements (ERE), with four assay schemes. First, we determined the sequence-dependent receptors' binding capacity by monitoring the binding of ER to various ERE sequences immobilized on a sensor surface (assay format denoted as the direct assay). Second, we screened the relative affinity of ER for various ERE sequences using a competition assay, in which the receptors bind to an ERE-immobilized surface in the presence of competitor ERE sequences. Third, we monitored the assembly of ER-ERE complexes on a SPR surface and thereafter the removal and/or dissociation of the ER (assay scheme denoted as the dissociation assay) to determine the binding stoichiometry. Last, a sandwich assay (ER binding to ERE followed by anti-ER recognition of a specific ER subtype) was performed in an effort to understand how ERalpha and ERbeta may associate and compete when binding to the DNA. With these assay schemes, we reaffirmed that (1) ERalpha is more sensitive than ERbeta to base pair change(s) in the consensus ERE, (2) ERalpha and ERbeta form a heterodimer when they bind to the consensus ERE, and (3) the binding stoichiometry of both ERalpha- and ERbeta-ERE complexes is dependent on salt concentration. With this study, we demonstrate the versatility of the SPR analysis. With the involvement of various assay arrangements, the SPR analysis can be further extended to more than kinetics and affinity study.
Comparative analysis of the XopD T3S effector family in plant pathogenic bacteria
Kim, Jung-Gun; Taylor, Kyle W.; Mudgett, Mary Beth
2011-01-01
XopD is a type III effector protein that is required for Xanthomonas campestris pathovar vesicatoria (Xcv) growth in tomato. It is a modular protein consisting of an N-terminal DNA-binding domain, two EAR transcriptional repressor motifs, and a C-terminal SUMO protease. In tomato, XopD functions as a transcriptional repressor, resulting in the suppression of defense responses at late stages of infection. A survey of available genome sequences for phytopathogenic bacteria revealed that XopD homologs are limited to species within three genera of Proteobacteria – Xanthomonas, Acidovorax, and Pseudomonas. While the EAR motif(s) and SUMO protease domain are conserved in all the XopD-like proteins, variation exists in the length and sequence identity of the N-terminal domains. Comparative analysis of the DNA sequences surrounding xopD and xopD-like genes led to revised annotation of the xopD gene. Edman degradation sequence analysis and functional complementation studies confirmed that the xopD gene from Xcv encodes a 760 amino acid protein with a longer N-terminal domain than previously predicted. None of the XopD-like proteins studied complemented Xcv ΔxopD mutant phenotypes in tomato leaves, suggesting that the N-terminus of XopD defines functional specificity. Xcv ΔxopD strains expressing chimeric fusion proteins containing the N-terminus of XopD fused to the EAR motif(s) and SUMO protease domain of the XopD-like protein from Xanthomonas campestris pathovar campestris strain B100 were fully virulent in tomato, demonstrating that the N-terminus of XopD controls specificity in tomato. PMID:21726373
Song, Wei; Guo, Jun-Tao
2015-01-01
Transcription factors regulate gene expression through binding to specific DNA sequences. How transcription factors achieve high binding specificity is still not well understood. In this paper, we investigated the role of protein flexibility in protein-DNA-binding specificity by comparative molecular dynamics (MD) simulations. Protein flexibility has been considered as a key factor in molecular recognition, which is intrinsically a dynamic process involving fine structural fitting between binding components. In this study, we performed comparative MD simulations on wild-type and F10V mutant P22 Arc repressor in both free and complex conformations. The F10V mutant has lower DNA-binding specificity though both the bound and unbound main-chain structures between the wild-type and F10V mutant Arc are highly similar. We found that the DNA-binding motif of wild-type Arc is structurally more flexible than the F10V mutant in the unbound state, especially for the six DNA base-contacting residues in each dimer. We demonstrated that the flexible side chains of wild-type Arc lead to a higher DNA-binding specificity through forming more hydrogen bonds with DNA bases upon binding. Our simulations also showed a possible conformational selection mechanism for Arc-DNA binding. These results indicate the important roles of protein flexibility and dynamic properties in protein-DNA-binding specificity.
Kazanov, Marat D.; Li, Xiaoqing; Gelfand, Mikhail S.; Osterman, Andrei L.; Rodionov, Dmitry A.
2013-01-01
Large and functionally heterogeneous families of transcription factors have complex evolutionary histories. What shapes specificities toward effectors and DNA sites in paralogous regulators is a fundamental question in biology. Bacteria from the deep-branching lineage Thermotogae possess multiple paralogs of the repressor, open reading frame, kinase (ROK) family regulators that are characterized by carbohydrate-sensing domains shared with sugar kinases. We applied an integrated genomic approach to study functions and specificities of regulators from this family. A comparative analysis of 11 Thermotogae genomes revealed novel mechanisms of transcriptional regulation of the sugar utilization networks, DNA-binding motifs and specific functions. Reconstructed regulons for seven groups of ROK regulators were validated by DNA-binding assays using purified recombinant proteins from the model bacterium Thermotoga maritima. All tested regulators demonstrated specific binding to their predicted cognate DNA sites, and this binding was inhibited by specific effectors, mono- or disaccharides from their respective sugar catabolic pathways. By comparing ligand-binding domains of regulators with structurally characterized kinases from the ROK family, we elucidated signature amino acid residues determining sugar-ligand regulator specificity. Observed correlations between signature residues and the sugar-ligand specificities provide the framework for structure functional classification of the entire ROK family. PMID:23209028
Jaeger, Alex M.; Makley, Leah N.; Gestwicki, Jason E.; Thiele, Dennis J.
2014-01-01
The heat shock transcription factor 1 (HSF1) activates expression of a variety of genes involved in cell survival, including protein chaperones, the protein degradation machinery, anti-apoptotic proteins, and transcription factors. Although HSF1 activation has been linked to amelioration of neurodegenerative disease, cancer cells exhibit a dependence on HSF1 for survival. Indeed, HSF1 drives a program of gene expression in cancer cells that is distinct from that activated in response to proteotoxic stress, and HSF1 DNA binding activity is elevated in cycling cells as compared with arrested cells. Active HSF1 homotrimerizes and binds to a DNA sequence consisting of inverted repeats of the pentameric sequence nGAAn, known as heat shock elements (HSEs). Recent comprehensive ChIP-seq experiments demonstrated that the architecture of HSEs is very diverse in the human genome, with deviations from the consensus sequence in the spacing, orientation, and extent of HSE repeats that could influence HSF1 DNA binding efficacy and the kinetics and magnitude of target gene expression. To understand the mechanisms that dictate binding specificity, HSF1 was purified as either a monomer or trimer and used to evaluate DNA-binding site preferences in vitro using fluorescence polarization and thermal denaturation profiling. These results were compared with quantitative chromatin immunoprecipitation assays in vivo. We demonstrate a role for specific orientations of extended HSE sequences in driving preferential HSF1 DNA binding to target loci in vivo. These studies provide a biochemical basis for understanding differential HSF1 target gene recognition and transcription in neurodegenerative disease and in cancer. PMID:25204655
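The consensus heat shock element described in the abstract above (inverted repeats of the pentamer nGAAn) can be illustrated with a small scanner. This is a rough sketch under stated assumptions, not the method used in the study: it only finds perfect, contiguous, alternating pentamers, whereas the abstract stresses that genomic HSEs deviate in spacing, orientation, and extent. The function name `find_hse` and the default of three units are hypothetical choices.

```python
import re

# Pentamer units written as regexes; "." stands for the unspecified base n.
HEAD = ".GAA."  # nGAAn
TAIL = ".TTC."  # reverse complement of nGAAn


def find_hse(seq, units=3):
    """Return sorted (start, window) pairs where `units` alternating pentamers match.

    Assumption: only perfect, contiguous inverted repeats are reported.
    """
    seq = seq.upper()
    width = 5 * units
    hits = []
    # An element may begin with either orientation of the pentamer.
    for first, second in ((HEAD, TAIL), (TAIL, HEAD)):
        pattern = "".join(first if i % 2 == 0 else second for i in range(units))
        regex = re.compile(pattern)
        # Slide one base at a time so overlapping candidates are not missed.
        for start in range(len(seq) - width + 1):
            window = seq[start:start + width]
            if regex.fullmatch(window):
                hits.append((start, window))
    return sorted(hits)
```

For example, `find_hse("CCAGAAGCTTCTAGAAGCC")` reports the single 15-bp window starting at index 2. Relaxing the pattern to tolerate mismatches or gaps would be needed to approach the diversity of sites the abstract describes.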
Ibrahim, Nouhou; Wicklund, April; Wiebe, Matthew S
2011-11-01
The barrier to autointegration factor (BAF) is an essential cellular protein with functions in mitotic nuclear reassembly, retroviral preintegration complex stability, and transcriptional regulation. Molecular properties of BAF include the ability to bind double-stranded DNA in a sequence-independent manner, homodimerize, and bind proteins containing a LEM domain. These capabilities allow BAF to compact DNA and assemble higher-order nucleoprotein complexes, the nature of which is poorly understood. Recently, it was revealed that BAF also acts as a potent host defense against poxviral DNA replication in the cytoplasm. Here, we extend these observations by examining the molecular mechanism through which BAF acts as a host defense against vaccinia virus replication and cytoplasmic DNA in general. Interestingly, BAF rapidly relocalizes to transfected DNA from a variety of sources, demonstrating that BAF's activity as a host defense factor is not limited to poxviral infection. BAF's relocalization to cytoplasmic foreign DNA is highly dependent upon its DNA binding and dimerization properties but does not appear to require its LEM domain binding activity. However, the LEM domain protein emerin is recruited to cytoplasmic DNA in a BAF-dependent manner during both transfection and vaccinia virus infection. Finally, we demonstrate that the DNA binding and dimerization capabilities of BAF are essential for its function as an antipoxviral effector, while the presence of emerin is not required. Together, these data provide further mechanistic insight into which of BAF's molecular properties are employed by cells to impair the replication of poxviruses or respond to foreign DNA in general.
Aydin, Özge Z.; Marteijn, Jurgen A.; Ribeiro-Silva, Cristina; Rodríguez López, Aida; Wijgers, Nils; Smeenk, Godelieve; van Attikum, Haico; Poot, Raymond A.; Vermeulen, Wim; Lans, Hannes
2014-01-01
Chromatin compaction of deoxyribonucleic acid (DNA) presents a major challenge to the detection and removal of DNA damage. Helix-distorting DNA lesions that block transcription are specifically repaired by transcription-coupled nucleotide excision repair, which is initiated by binding of the CSB protein to lesion-stalled RNA polymerase II. Using live cell imaging, we identify a novel function for two distinct mammalian ISWI adenosine triphosphate (ATP)-dependent chromatin remodeling complexes in resolving lesion-stalled transcription. Human ISWI isoform SMARCA5/SNF2H and its binding partners ACF1 and WSTF are rapidly recruited to UV-C induced DNA damage to specifically facilitate CSB binding and to promote transcription recovery. SMARCA5 targeting to UV-C damage depends on transcription and histone modifications and requires functional SWI2/SNF2-ATPase and SLIDE domains. After initial recruitment to UV damage, SMARCA5 re-localizes away from the center of DNA damage, requiring its HAND domain. Our studies support a model in which SMARCA5 targeting to DNA damage-stalled transcription sites is controlled by an ATP-hydrolysis-dependent scanning and proofreading mechanism, highlighting how SWI2/SNF2 chromatin remodelers identify and bind nucleosomes containing damaged DNA. PMID:24990377
DNA sequences of three beta-1,4-endoglucanase genes from Thermomonospora fusca.
Lao, G; Ghangas, G S; Jung, E D; Wilson, D B
1991-01-01
The DNA sequences of the Thermomonospora fusca genes encoding cellulases E2 and E5 and the N-terminal end of E4 were determined. Each sequence contains an identical 14-bp inverted repeat upstream of the initiation codon. There were no significant homologies between the coding regions of the three genes. The E2 gene is 73% identical to the celA gene from Microbispora bispora, but this was the only homology found with other cellulase genes. E2 belongs to a family of cellulases that includes celA from M. bispora, cenA from Cellulomonas fimi, casA from an alkalophilic Streptomyces strain, and cellobiohydrolase II from Trichoderma reesei. E4 shows 44% identity to an avocado cellulase, while E5 belongs to the Bacillus cellulase family. There were strong similarities between the amino acid sequences of the E2 and E5 cellulose binding domains, and these regions also showed homology with C. fimi and Pseudomonas fluorescens cellulose binding domains. PMID:1904434
Gauthier-Rouvière, C; Cavadore, J C; Blanchard, J M; Lamb, N J; Fernandez, A
1991-01-01
Indirect immunofluorescence analysis, using antibodies directed against peptide sequences outside the DNA-binding domain of the 67-kDa serum response factor (p67SRF), revealed a punctuated nuclear staining, constant throughout the cell cycle and in all different cell lines tested. p67SRF was also tightly associated with chromatin through all stages of mitosis. Inhibition of p67SRF activity in vivo, through microinjection of anti-p67SRF antibodies, specifically suppressed DNA synthesis induced after serum addition or ras microinjection, suggesting that these antibodies were effective in preventing expression of serum response element (SRE)-regulated genes. A similar inhibition was also obtained in cells injected with oligonucleotides corresponding to the DNA binding sequence for p67SRF protein, SRE. Moreover, this inhibition of DNA synthesis by anti-p67SRF or SRE injection was still observed in cells injected during late G1, well after c-fos induction. These data imply that genes regulated by p67SRF are continuously involved in the proliferation pathway throughout G1 and that p67SRF forms an integral component of mammalian cell transcriptional control. PMID:1782216
Functional domains of the poliovirus receptor
DOE Office of Scientific and Technical Information (OSTI.GOV)
Koike, Satoshi; Ise, Iku; Nomoto, Akio
1991-05-15
A number of mutant cDNAs of the human poliovirus receptor were constructed to identify essential regions of the molecule as the receptor. All mutant cDNAs carrying the sequence coding for the entire N-terminal immunoglobulin-like domain (domain I) confer permissiveness for poliovirus to mouse L cells, but a mutant cDNA lacking the sequence for domain I does not. The transformants permissive for poliovirus were able to bind the virus and were also recognized by monoclonal antibody D171, which competes with poliovirus for the cellular receptor. These results strongly suggest that the poliovirus binding site resides in domain I of the receptor. Mutant cDNAs for the sequence encoding the intracellular peptide were also constructed and expressed in mouse L cells. Susceptibility of these cells to poliovirus revealed that the entire putative cytoplasmic domain is not essential for virus infection. Thus, the cytoplasmic domain of the molecule appears not to play a role in the penetration of poliovirus.
Finarov, Igal; Moor, Nina; Kessler, Naama; Klipcan, Liron; Safro, Mark G
2010-03-10
The existence of three types of phenylalanyl-tRNA synthetase (PheRS), bacterial (αβ)2, eukaryotic/archaeal cytosolic (αβ)2, and mitochondrial α, is a prominent example of structural diversity within the aaRS family. PheRSs have considerably diverged in primary sequences, domain compositions, and subunit organizations. Loss of the anticodon-binding domain B8 in human cytosolic PheRS (hcPheRS) is indicative of variations in tRNA(Phe) binding and recognition as compared to bacterial PheRSs. We report herein the crystal structure of hcPheRS in complex with phenylalanine at 3.3 Å resolution. A novel structural module has been revealed at the N terminus of the α subunit. It extends approximately 80 Å into the solvent and is made up of three structural domains (DBDs) possessing a DNA-binding fold. The dramatic reduction of aminoacylation activity for truncated N-terminus variants, coupled with structural data and a tRNA-docking model, indicates that the DBDs play a crucial role in hcPheRS activity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Benasutti, M.; Ejadi, S.; Whitlow, M.D.
The mutagenic and carcinogenic chemical aflatoxin B1 (AFB1) reacts almost exclusively at the N(7)-position of guanine following activation to its reactive form, the 8,9-epoxide (AFB1 oxide). In general N(7)-guanine adducts yield DNA strand breaks when heated in base, a property that serves as the basis for the Maxam-Gilbert DNA sequencing reaction specific for guanine. Using DNA sequencing methods, other workers have shown that AFB1 oxide gives strand breaks at positions of guanines; however, the guanine bands varied in intensity. This phenomenon has been used to infer that AFB1 oxide prefers to react with guanines in some sequence contexts more than in others and has been referred to as sequence specificity of binding. Herein, data on the reaction of AFB1 oxide with several synthetic DNA polymers with different sequences are presented, and (following hydrolysis) adduct levels are determined by high-pressure liquid chromatography. These results reveal that for AFB1 oxide (1) the N(7)-guanine adduct is the major adduct found in all of the DNA polymers, (2) adduct levels vary in different sequences, and, thus, sequence specificity is also observed by this more direct method, and (3) the intensity of bands in DNA sequencing gels is likely to reflect adduct levels formed at the N(7)-position of guanine. Knowing this, a reinvestigation of the reactivity of guanines in different DNA sequences using DNA sequencing methods was undertaken. Methods are developed to determine the X (5'-side) base and the Y (3'-side) base are most influential in determining guanine reactivity. These rules in conjunction with molecular modeling studies were used to assess the binding sites that might be utilized by AFB1 oxide in its reaction with DNA.
The linker region of AraC protein.
Eustance, R J; Schleif, R F
1996-01-01
AraC protein, a transcriptional regulator of the L-arabinose operon in Escherichia coli, is dimeric. Each monomer consists of a domain for DNA binding plus transcription activation and a domain for dimerization plus arabinose binding. These are connected to one another by a linker region of at least 5 amino acids. Here we have addressed the question of whether any of the amino acids in the linker region play active, specific, and crucial structural roles or whether these amino acids merely serve as passive spacers between the functional domains. We found that all but one of the linker amino acids can be changed to other amino acids individually and in small groups without substantially affecting the ability of AraC protein to activate transcription when arabinose is present. When, however, the entire linker region is replaced with linker sequences from other proteins, the functioning of AraC is impaired. PMID:8955380
Accurate and sensitive quantification of protein-DNA binding affinity.
Rastogi, Chaitanya; Rube, H Tomas; Kribelbauer, Judith F; Crocker, Justin; Loker, Ryan E; Martini, Gabriella D; Laptenko, Oleg; Freed-Pastor, William A; Prives, Carol; Stern, David L; Mann, Richard S; Bussemaker, Harmen J
2018-04-17
Transcription factors (TFs) control gene expression by binding to genomic DNA in a sequence-specific manner. Mutations in TF binding sites are increasingly found to be associated with human disease, yet we currently lack robust methods to predict these sites. Here, we developed a versatile maximum likelihood framework named No Read Left Behind (NRLB) that infers a biophysical model of protein-DNA recognition across the full affinity range from a library of in vitro selected DNA binding sites. NRLB predicts human Max homodimer binding in near-perfect agreement with existing low-throughput measurements. It can capture the specificity of the p53 tetramer and distinguish multiple binding modes within a single sample. Additionally, we confirm that newly identified low-affinity enhancer binding sites are functional in vivo, and that their contribution to gene expression matches their predicted affinity. Our results establish a powerful paradigm for identifying protein binding sites and interpreting gene regulatory sequences in eukaryotic genomes. Copyright © 2018 the Author(s). Published by PNAS. PMID:29610332
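The idea of a biophysical model of protein-DNA recognition can be illustrated with a short sketch. It is not the NRLB implementation: it only assumes a generic additive free-energy model in which each position of a binding window contributes an energy penalty (in units of RT) and the relative affinity of a longer sequence is the sum over all windows on both strands. The function names and the energy values used to exercise it are hypothetical.

```python
import math

# Translation table for complementing DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def window_affinity(window, ddg):
    """Relative affinity of one window under an additive energy model.

    ddg[i][base] is a hypothetical energy penalty (in units of RT)
    for finding `base` at position i, relative to the optimal base.
    """
    return math.exp(-sum(ddg[i][base] for i, base in enumerate(window)))


def sequence_affinity(seq, ddg):
    """Sum window affinities over every offset on both strands."""
    k = len(ddg)
    total = 0.0
    for strand in (seq, reverse_complement(seq)):
        for start in range(len(strand) - k + 1):
            total += window_affinity(strand[start:start + k], ddg)
    return total
```

Summing over every offset and both strands, rather than scoring only the single best window, is what lets a model of this kind account for low-affinity sites; the parameters themselves would have to be fitted to selection data, which this sketch does not attempt.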
Scholze, Heidi; Boch, Jens
2010-01-01
TAL effectors are important virulence factors of bacterial plant pathogenic Xanthomonas, which infect a wide variety of plants including valuable crops like pepper, rice, and citrus. TAL proteins are translocated via the bacterial type III secretion system into host cells and induce transcription of plant genes by binding to target gene promoters. Members of the TAL effector family differ mainly in their central domain of tandemly arranged repeats of typically 34 amino acids each with hypervariable di-amino acids at positions 12 and 13. We recently showed that target DNA-recognition specificity of TAL effectors is encoded in a modular and clearly predictable mode. The repeats of TAL effectors feature a surprising one repeat-to-one-bp correlation with different repeat types exhibiting a different DNA base pair specificity. Accordingly, we predicted DNA specificities of TAL effectors and generated artificial TAL proteins with novel DNA recognition specificities. We describe here novel artificial TALs and discuss implications for the DNA recognition specificity. The unique TAL-DNA binding domain allows design of proteins with potentially any given DNA recognition specificity enabling many uses for biotechnology.
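The one repeat-to-one-bp correspondence can be sketched in a few lines. The mapping below uses the commonly cited preferences of the di-amino acids at positions 12 and 13 (NI for A, HD for C, NG for T, NN for G or A, NS for any base); these values come from the wider TAL effector literature rather than from this abstract, and the function names are illustrative only. Real repeats show graded rather than all-or-nothing preferences, so this is a simplification.

```python
# Simplified base preferences of the hypervariable di-amino acids
# (assumed from the broader TAL effector literature, not from this abstract).
RVD_TO_BASES = {
    "NI": "A",
    "HD": "C",
    "NG": "T",
    "NN": "GA",    # G or A
    "NS": "ACGT",  # no strong preference
}


def predict_target(rvds):
    """Return the allowed bases for each repeat, in repeat order."""
    return [RVD_TO_BASES[rvd] for rvd in rvds]


def matches(rvds, site):
    """Check whether a DNA site is compatible with a repeat array.

    One repeat is paired with one base; lengths must agree.
    """
    if len(rvds) != len(site):
        return False
    return all(base in RVD_TO_BASES[rvd]
               for rvd, base in zip(rvds, site.upper()))
```

For example, `matches(["NI", "HD", "NG", "NN"], "ACTG")` evaluates to `True` under this table, while the same repeats are not compatible with `"ACTC"`.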
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hashimoto, Hideharu; Zhang, Xing; Zheng, Yu
Mutations in human zinc-finger transcription factor WT1 result in abnormal development of the kidneys and genitalia and an array of pediatric problems including nephropathy, blastoma, gonadal dysgenesis and genital discordance. Several overlapping phenotypes are associated with WT1 mutations, including Wilms tumors, Denys-Drash syndrome (DDS), Frasier syndrome (FS) and WAGR syndrome (Wilms tumor, aniridia, genitourinary malformations, and mental retardation). These conditions vary in severity from individual to individual; they can be fatal in early childhood, or relatively benign into adulthood. DDS mutations cluster predominantly in zinc fingers (ZF) 2 and 3 at the C-terminus of WT1, which together with ZF4 determine the sequence-specificity of DNA binding. We examined three DDS associated mutations in ZF2 of human WT1 where the normal glutamine at position 369 is replaced by arginine (Q369R), lysine (Q369K) or histidine (Q369H). These mutations alter the sequence-specificity of ZF2, we find, changing its affinity for certain bases and certain epigenetic forms of cytosine. X-ray crystallography of the DNA binding domains of normal WT1, Q369R and Q369H in complex with preferred sequences revealed the molecular interactions responsible for these affinity changes. DDS is inherited in an autosomal dominant fashion, implying a gain of function by mutant WT1 proteins. This gain, we speculate, might derive from the ability of the mutant proteins to sequester WT1 into unproductive oligomers, or to erroneously bind to variant target sequences.
Slama-Schwok, A; Zakrzewska, K; Léger, G; Leroux, Y; Takahashi, M; Käs, E; Debey, P
2000-01-01
Using spectroscopic methods, we have studied the structural changes induced in both protein and DNA upon binding of the High-Mobility Group I (HMG-I) protein to a 21-bp sequence derived from mouse satellite DNA. We show that these structural changes depend on the stoichiometry of the protein/DNA complexes formed, as determined by Job plots derived from experiments using pyrene-labeled duplexes. Circular dichroism and melting temperature experiments extended in the far ultraviolet range show that while native HMG-I is mainly random coiled in solution, it adopts a beta-turn conformation upon forming a 1:1 complex in which the protein first binds to one of two dA.dT stretches present in the duplex. HMG-I structure in the 1:1 complex is dependent on the sequence of its DNA target. A 3:1 HMG-I/DNA complex can also form and is characterized by a small increase in the DNA natural bend and/or compaction coupled to a change in the protein conformation, as determined from fluorescence resonance energy transfer (FRET) experiments. In addition, a peptide corresponding to an extended DNA-binding domain of HMG-I induces an ordered condensation of DNA duplexes. Based on the constraints derived from pyrene excimer measurements, we present a model of these nucleated structures. Our results illustrate an extreme case of protein structure induced by DNA conformation that may bear on the evolutionary conservation of the DNA-binding motifs of HMG-I. We discuss the functional relevance of the structural flexibility of HMG-I associated with the nature of its DNA targets and the implications of the binding stoichiometry for several aspects of chromatin structure and gene regulation. PMID:10777751
NASA Technical Reports Server (NTRS)
Hu, Shaowen; Cucinotta, Francis A.
2009-01-01
The Ku70/80 heterodimer is the first repair protein in the initial binding of double-strand break (DSB) ends following DNA damage, and is a component of nonhomologous end joining repair, the primary pathway for DSB repair in mammalian cells. In this study we constructed a full-length human Ku70 structure based on its crystal structure, and performed 20 ns conventional molecular dynamic (CMD) simulations on this protein and several other complexes with short DNA duplexes of different sequences. The trajectories of these simulations indicated that, without the topological support of Ku80, the residues in the bridge and C-terminal arm of Ku70 are more flexible than other experimentally identified domains. We studied the two missing loops in the crystal structure and predicted that they are also very flexible. Simulations revealed that they make an important contribution to the Ku70 interaction with DNA. Dislocation of the previously studied SAP domain was observed in several systems, implying its role in DNA binding. Targeted molecular dynamic (TMD) simulation was also performed for one system with a far-away 14bp DNA duplex. The TMD trajectory and energetic analysis disclosed detailed interactions of the DNA-binding residues during the DNA dislocation, and revealed a possible conformational transition for a DSB end when encountering Ku70 in solution. Compared to experimentally based analysis, this study identified more detailed interactions between DNA and Ku70. Free energy analysis indicated Ku70 alone is able to bind DNA with relatively high affinity, with consistent contributions from various domains of Ku70 in different systems. The functional implications of these domains in the processes of Ku heterodimerization and DNA damage recognition and repair can be characterized in detail based upon this analysis.
Dossani, Zain Y.; Reider Apel, Amanda; Szmidt-Middleton, Heather; ...
2017-10-30
Despite the need for inducible promoters in strain development efforts, the majority of engineering in Saccharomyces cerevisiae continues to rely on a few constitutively active or inducible promoters. Building on advances that use the modular nature of both transcription factors and promoter regions, we have built a library of hybrid promoters that are regulated by a synthetic transcription factor. The hybrid promoters consist of native S. cerevisiae promoters, in which the operator regions have been replaced with sequences that are recognized by the bacterial LexA DNA binding protein. Correspondingly, the synthetic transcription factor (TF) consists of the DNA binding domain of the LexA protein, fused with the human estrogen binding domain and the viral activator domain, VP16. The resulting system with a bacterial DNA binding domain avoids the transcription of native S. cerevisiae genes, and the hybrid promoters can be induced using estradiol, a compound with no detectable impact on S. cerevisiae physiology. Using combinations of one, two or three operator sequence repeats and a set of native S. cerevisiae promoters, we obtained a series of hybrid promoters that can be induced to different levels, using the same synthetic TF and a given estradiol. Finally, this set of promoters, in combination with our synthetic TF, has the potential to regulate numerous genes or pathways simultaneously, to multiple desired levels, in a single strain.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Eun-Sung; Yang, Seung-Woo; Hong, Dong-Ki
Non-viral gene delivery is a safe and suitable alternative to viral vector-mediated delivery to overcome the immunogenicity and tumorigenesis associated with viral vectors. Using the novel, human-origin Hph-1 protein transduction domain that can facilitate the transduction of protein into cells, we developed a new strategy to deliver naked DNA in vitro and in vivo. The new DNA delivery system contains Hph-1-GAL4 DNA-binding domain (DBD) fusion protein and enhanced green fluorescent protein (EGFP) reporter plasmid that includes the five repeats of GAL4 upstream activating sequence (UAS). Hph-1-GAL4-DBD protein formed a complex with plasmid DNA through the specific interaction between GAL4-DBD and UAS, and was delivered into the cells via the Hph-1-PTD. The pEGFP DNA was successfully delivered by the Hph-1-GAL4 system, and the EGFP was effectively expressed in mammalian cells such as HeLa and Jurkat, as well as in Bright Yellow-2 (BY-2) plant cells. When 10 μg of pEGFP DNA was intranasally administered to mice using Hph-1-GAL4 protein, a high level of EGFP expression was detected throughout the lung tissue for 7 days. These results suggest that an Hph-1-PTD-mediated DNA delivery strategy may be a useful non-viral DNA delivery system for gene therapy and DNA vaccines.
Comparative genomics of pyridoxal 5′-phosphate-dependent transcription factor regulons in Bacteria
Suvorova, Inna A.
2016-01-01
The MocR-subfamily transcription factors (MocR-TFs) characterized by the GntR-family DNA-binding domain and aminotransferase-like sensory domain are broadly distributed among certain lineages of Bacteria. Characterized MocR-TFs bind pyridoxal 5′-phosphate (PLP) and control transcription of genes involved in PLP, gamma aminobutyric acid (GABA) and taurine metabolism via binding specific DNA operator sites. To identify putative target genes and DNA binding motifs of MocR-TFs, we performed comparative genomics analysis of over 250 bacterial genomes. The reconstructed regulons for 825 MocR-TFs comprise structural genes from over 200 protein families involved in diverse biological processes. Using the genome context and metabolic subsystem analysis we tentatively assigned functional roles for 38 out of 86 orthologous groups of studied regulators. Most of these MocR-TF regulons are involved in PLP metabolism, as well as utilization of GABA, taurine and ectoine. The remaining studied MocR-TF regulators presumably control genes encoding enzymes involved in reduction/oxidation processes, various transporters and PLP-dependent enzymes, for example aminotransferases. Predicted DNA binding motifs of MocR-TFs are generally similar in each orthologous group and are characterized by two to four repeated sequences. Identified motifs were classified according to their structures. Motifs with direct and/or inverted repeat symmetry constitute the majority of inferred DNA motifs, suggesting preferable TF dimerization in head-to-tail or head-to-head configuration. The obtained genomic collection of in silico reconstructed MocR-TF motifs and regulons in Bacteria provides a basis for future experimental characterization of molecular mechanisms for various regulators in this family. PMID:28348826
Tóth, Júlia; van Aelst, Kara; Salmons, Hannah; Szczelkun, Mark D.
2012-01-01
DNA cleavage by the Type III Restriction–Modification (RM) enzymes requires the binding of a pair of RM enzymes at two distant, inversely orientated recognition sequences followed by helicase-catalysed ATP hydrolysis and long-range communication. Here we addressed the dissociation from DNA of these enzymes at two stages: during long-range communication and following DNA cleavage. First, we demonstrated that a communicating species can be trapped in a DNA domain without a recognition site, with a non-specific DNA association lifetime of ∼200 s. If free DNA ends were present the lifetime became too short to measure, confirming that ends accelerate dissociation. Secondly, we observed that Type III RM enzymes can dissociate upon DNA cleavage and go on to cleave further DNA molecules (they can ‘turnover’, albeit inefficiently). The relationship between the observed cleavage rate and enzyme concentration indicated independent binding of each site and a requirement for simultaneous interaction of at least two enzymes per DNA to achieve cleavage. In light of various mechanisms for helicase-driven motion on DNA, we suggest these results are most consistent with a thermally driven random 1D search model (i.e. ‘DNA sliding’). PMID:22523084
Dequard-Chablat, Michelle; Allandt, Cynthia
2002-08-01
In the filamentous fungus Podospora anserina, two degenerative processes which result in growth arrest are associated with mitochondrial genome (mitochondrial DNA [mtDNA]) instability. Senescence is correlated with mtDNA rearrangements and amplification of specific regions (senDNAs). Premature death syndrome is characterized by the accumulation of specific mtDNA deletions. This accumulation is due to indirect effects of the AS1-4 mutation, which alters a cytosolic ribosomal protein gene. The mthmg1 gene has been identified as a double-copy suppressor of premature death. It greatly delays premature death and the accumulation of deletions when it is present in two copies in an AS1-4 context. The duplication of mthmg1 has no significant effect on the wild-type life span or on senDNA patterns. In an AS1+ context, deletion of the mthmg1 gene alters germination, growth, and fertility and reduces the life span. The Δmthmg1 senescent strains display a particular senDNA pattern. This deletion is lethal in an AS1-4 context. According to its physical properties (very basic protein with putative mitochondrial targeting sequence and HMG-type DNA-binding domains) and the cellular localization of an mtHMG1-green fluorescent protein fusion, mtHMG1 appears to be a mitochondrial protein possibly associated with mtDNA. It is noteworthy that it is the first example of a protein combining the two DNA-binding domains, AT-hook motif and HMG-1 boxes. It may be involved in the stability and/or transmission of the mitochondrial genome. To date, no structural homologues have been found in other organisms. However, mtHMG1 displays functional similarities with the Saccharomyces cerevisiae mitochondrial HMG-box protein Abf2.
Kelemen, Zsolt; Sebastian, Alvaro; Xu, Wenjia; Grain, Damaris; Salsac, Fabien; Avon, Alexandra; Berger, Nathalie; Tran, Joseph; Dubreucq, Bertrand; Lurin, Claire; Lepiniec, Loïc; Contreras-Moreira, Bruno; Dubos, Christian
2015-01-01
The control of growth and development of all living organisms is a complex and dynamic process that requires the harmonious expression of numerous genes. Gene expression is mainly controlled by the activity of sequence-specific DNA binding proteins called transcription factors (TFs). Amongst the various classes of eukaryotic TFs, the MYB superfamily is one of the largest and most diverse, and it has considerably expanded in the plant kingdom. R2R3-MYBs have been extensively studied over the last 15 years. However, DNA-binding specificity has been characterized for only a small subset of these proteins. Therefore, one of the remaining challenges is the exhaustive characterization of the DNA-binding specificity of all R2R3-MYB proteins. In this study, we have developed a library of Arabidopsis thaliana R2R3-MYB open reading frames, whose DNA-binding activities were assayed in vivo (yeast one-hybrid experiments) with a pool of selected cis-regulatory elements. Altogether 1904 interactions were assayed, leading to the discovery of specific patterns of interactions between the various R2R3-MYB subgroups and their DNA target sequences and to the identification of key features that govern these interactions. The present work provides a comprehensive in vivo analysis of R2R3-MYB binding activities that should help in predicting new DNA motifs and identifying new putative target genes for each member of this very large family of TFs. In a broader perspective, the generated data will help to better understand how TFs interact with their target DNA sequences. PMID:26484765
Duquesnoy, P; Sobrier, M L; Amselem, S; Goossens, M
1991-01-01
Mutations in the growth hormone receptor (GHR) gene can cause growth hormone (GH) resistance. Given the sequence homology between the extracellular domain of the GHR and a soluble GH-binding protein (GH-BP), it is remarkable that GH-BP binding activity is absent from the serum of patients with Laron-type GH insensitivity, a hereditary form of severe dwarfism. We have previously identified a mutation within the extracellular domain of this receptor, replacing phenylalanine by serine at position 96 of the mature protein, in a patient with Laron syndrome. We have now investigated the effect of this Phe→Ser substitution on hormone binding activity by expressing the total human GHR cDNA and mutant form in eukaryotic cells. The wild-type protein expressed was able to bind GH but no plasma membrane binding was detectable on cells transfected with the mutant cDNA; this was also the case of cells transfected with a Phe96→Ala mutant cDNA, suggesting that the lack of binding activity is not due to a posttranslational modification of serine. Examination of the variant proteins in subcellular fractions revealed the presence of specific GH binding activity in the lysosomal fraction, whereas immunofluorescence studies located mutant proteins in the cytosol. Our findings suggest that these mutant GHRs fail to follow the correct intracellular transport pathway and underline the potential importance of this phenylalanine residue, which is conserved among the GH, prolactin, and erythropoietin receptors that belong to the same cytokine receptor superfamily. PMID:1719554
Recognition of platinum-DNA adducts by HMGB1a.
Ramachandran, Srinivas; Temple, Brenda; Alexandrova, Anastassia N; Chaney, Stephen G; Dokholyan, Nikolay V
2012-09-25
Cisplatin (CP) and oxaliplatin (OX), platinum-based drugs used widely in chemotherapy, form adducts on intrastrand guanines (5'GG) in genomic DNA. DNA damage recognition proteins, transcription factors, mismatch repair proteins, and DNA polymerases discriminate between CP- and OX-GG DNA adducts, which could partly account for differences in the efficacy, toxicity, and mutagenicity of CP and OX. In addition, differential recognition of CP- and OX-GG adducts is highly dependent on the sequence context of the Pt-GG adduct. In particular, DNA binding protein domain HMGB1a binds to CP-GG DNA adducts with up to 53-fold greater affinity than to OX-GG adducts in the TGGA sequence context but shows much smaller differences in binding in the AGGC or TGGT sequence contexts. Here, simulations of the HMGB1a-Pt-DNA complex in the three sequence contexts revealed a higher number of interface contacts for the CP-DNA complex in the TGGA sequence context than in the OX-DNA complex. However, the number of interface contacts was similar in the TGGT and AGGC sequence contexts. The higher number of interface contacts in the CP-TGGA sequence context corresponded to a larger roll of the Pt-GG base pair step. Furthermore, geometric analysis of stacking of phenylalanine 37 in HMGB1a (Phe37) with the platinated guanines revealed more favorable stacking modes correlated with a larger roll of the Pt-GG base pair step in the TGGA sequence context. These data are consistent with our previous molecular dynamics simulations showing that the CP-TGGA complex was able to sample larger roll angles than the OX-TGGA complex or either CP- or OX-DNA complexes in the AGGC or TGGT sequences. We infer that the high binding affinity of HMGB1a for CP-TGGA is due to the greater flexibility of CP-TGGA compared to OX-TGGA and other Pt-DNA adducts. 
This increased flexibility is reflected in the ability of CP-TGGA to sample larger roll angles, which allows for a higher number of interface contacts between the Pt-DNA adduct and HMGB1a.
In silico modeling of epigenetic-induced changes in photoreceptor cis-regulatory elements.
Hossain, Reafa A; Dunham, Nicholas R; Enke, Raymond A; Berndsen, Christopher E
2018-01-01
DNA methylation is a well-characterized epigenetic repressor of mRNA transcription in many plant and vertebrate systems. However, the mechanism of this repression is not fully understood. The process of transcription is controlled by proteins that regulate recruitment and activity of RNA polymerase by binding to specific cis-regulatory sequences. Cone-rod homeobox (CRX) is a well-characterized mammalian transcription factor that controls photoreceptor cell-specific gene expression. Although much is known about the functions and DNA binding specificity of CRX, little is known about how DNA methylation modulates CRX binding affinity to genomic cis-regulatory elements. We used bisulfite pyrosequencing of human ocular tissues to measure DNA methylation levels of the regulatory regions of RHO, PDE6B, PAX6, and LINE1 retrotransposon repeats. To describe the molecular mechanism of repression, we used molecular modeling to illustrate the effect of DNA methylation on human RHO regulatory sequences. In this study, we demonstrate an inverse correlation between DNA methylation in regulatory regions adjacent to the human RHO and PDE6B genes and their subsequent transcription in human ocular tissues. Docking of CRX to the DNA models shows that CRX interacts with the grooves of these sequences, suggesting changes in groove structure could regulate binding. Molecular dynamics simulations of the RHO promoter and enhancer regions show changes in the flexibility and groove width upon epigenetic modification. Models also demonstrate changes in the local dynamics of CRX binding sites within RHO regulatory sequences which may account for the repression of CRX-dependent transcription. Collectively, these data demonstrate epigenetic regulation of CRX binding sites in human retinal tissue and provide insight into the mechanism of this mode of epigenetic regulation to be tested in future experiments.
Structural and sequencing analysis of local target DNA recognition by MLV integrase.
Aiyer, Sriram; Rossi, Paolo; Malani, Nirav; Schneider, William M; Chandar, Ashwin; Bushman, Frederic D; Montelione, Gaetano T; Roth, Monica J
2015-06-23
Target-site selection by retroviral integrase (IN) proteins profoundly affects viral pathogenesis. We describe the solution nuclear magnetic resonance structure of the Moloney murine leukemia virus IN (M-MLV) C-terminal domain (CTD) and a structural homology model of the catalytic core domain (CCD). In solution, the isolated MLV IN CTD adopts an SH3 domain fold flanked by a C-terminal unstructured tail. We generated a concordant MLV IN CCD structural model using SWISS-MODEL, MMM-tree and I-TASSER. Using the X-ray crystal structure of the prototype foamy virus IN target capture complex together with our MLV domain structures, residues within the CCD α2 helical region and the CTD β1-β2 loop were predicted to bind target DNA. The role of these residues was analyzed in vivo through point mutants and motif interchanges. Viable viruses with substitutions at the IN CCD α2 helical region and the CTD β1-β2 loop were tested for effects on integration target site selection. Next-generation sequencing and analysis of integration target sequences indicate that the CCD α2 helical region, in particular P187, interacts with the sequences distal to the scissile bonds whereas the CTD β1-β2 loop binds to residues proximal to it. These findings validate our structural model and disclose IN-DNA interactions relevant to target site selection. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Chen, Zhen-Yong; Guo, Xiao-Jiang; Chen, Zhong-Xu; Chen, Wei-Ying; Wang, Ji-Rui
2017-06-01
The binding sites of transcription factors (TFs) in upstream DNA regions are called transcription factor binding sites (TFBSs). TFBSs are important elements for regulating gene expression. To date, there have been few studies on the profiles of TFBSs in plants. In total, 4,873 sequences with 5' upstream regions from 8,530 wheat fl-cDNA sequences were used to predict TFBSs. We found 4,572 TFBSs for the MADS TF family, which was twice as many as for bHLH (1,951), B3 (1,951), HB superfamily (1,914), ERF (1,820), and AP2/ERF (1,725) TFs, and was approximately four times higher than the remaining TFBS types. The percentage of TFBSs and TF members showed a distinct distribution in different tissues. Overall, the distribution of TFBSs in the upstream regions of wheat fl-cDNA sequences differed significantly. Meanwhile, high frequencies of some types of TFBSs were found in specific regions in the upstream sequences. Both TFs and fl-cDNA with TFBSs predicted in the same tissues exhibited specific distribution preferences for regulating gene expression. The tissue-specific analysis of TFs and fl-cDNA with TFBSs provides useful information for functional research, and can be used to identify relationships between tissue-specific TFs and fl-cDNA with TFBSs. Moreover, the positional distribution of TFBSs indicates that some types of wheat TFBS have different positional distribution preferences in the upstream regions of genes.
Shchelkunov, S N; Taranov, O S; Tregubchak, T V; Maksyutov, R A; Silkov, A N; Nesterov, A E; Sennikov, S V
2016-07-01
Wistar rats with collagen-induced arthritis were intramuscularly injected with the recombinant plasmid pcDNA/sTNF-BD encoding the sequence of the TNF-binding protein domain of variola virus CrmB protein (VARV sTNF-BD) or the pcDNA3.1 vector. Quantitative analysis showed that the histopathological changes in the hind-limb joints of rats were most severe in the animals injected with pcDNA3.1 and much less severe in the group of rats injected with pcDNA/sTNF-BD, which indicates that gene therapy of rheumatoid arthritis is promising in the case of local administration of plasmids governing the synthesis of VARV immunomodulatory proteins.
Ferreira, L M; Hazlewood, G P; Barker, P J; Gilbert, H J
1991-01-01
A genomic library of Pseudomonas fluorescens subsp. cellulosa DNA was constructed in pUC18 and Escherichia coli recombinants expressing 4-methylumbelliferyl beta-D-cellobioside-hydrolysing activity (MUCase) were isolated. Enzyme produced by MUCase-positive clones did not hydrolyse either cellobiose or cellotriose but converted cellotetraose into cellobiose and cleaved cellopentaose and cellohexaose, producing a mixture of cellobiose and cellotriose. There was no activity against CM-cellulose, insoluble cellulose or xylan. On this basis, the enzyme is identified as an endo-acting cellodextrinase and is designated cellodextrinase C (CELC). Nucleotide sequencing of the gene (celC) which directs the synthesis of CELC revealed an open reading frame of 2153 bp, encoding a protein of Mr 80,189. The deduced primary sequence of CELC was confirmed by the Mr of purified CELC (77,000) and by the experimentally determined N-terminus of the enzyme which was identical with residues 38-47 of the translated sequence. The N-terminal region of CELC showed strong homology with endoglucanase, xylanases and an arabinofuranosidase of Ps. fluorescens subsp. cellulosa; homologous sequences included highly conserved serine-rich regions. Full-length CELC bound tightly to crystalline cellulose. Truncated forms of celC from which the DNA sequence encoding the conserved domain had been deleted, directed the synthesis of a functional cellodextrinase that did not bind to crystalline cellulose. This is consistent with the N-terminal region of CELC comprising a non-catalytic cellulose-binding domain which is distinct from the catalytic domain. The role of the cellulose-binding region is discussed. PMID:1953673
Deryusheva, Evgeniia I; Machulin, Andrey V; Selivanova, Olga M; Galzitskaya, Oxana V
2017-04-01
Proteins of the nucleic acid-binding proteins superfamily perform such functions as processing, transport, storage, stretching, translation, and degradation of RNA. It is one of the 16 superfamilies containing the OB-fold in protein structures. Here, we have analyzed the superfamily of nucleic acid-binding proteins (the number of sequences exceeds 200,000) and found that this superfamily consists predominantly of proteins containing the cold shock DNA-binding domain (ca. 131,000 protein sequences). Proteins containing the S1 domain make up 57% of the cold shock DNA-binding domain family. Furthermore, we have found that, among the proteins available in the UniProt database, the S1 domain was identified mainly in bacterial proteins (ca. 83%) rather than in eukaryotic and archaeal proteins. We have found that the number of multiple repeats of the S1 domain in S1 domain-containing proteins depends on the taxonomic affiliation. All archaeal proteins contain one copy of the S1 domain, while the number of repeats in the eukaryotic proteins varies between 1 and 15 and correlates with the protein size. In the bacterial proteins, the number of repeats is no more than 6, regardless of the protein size. The large variation in the repeat number of the S1 domain, as one of the structural variants of the OB-fold, is a distinctive feature of S1 domain-containing proteins. Proteins from the other families and superfamilies either have one OB-fold or vary only slightly in repeat number. On the whole, it can be supposed that the repeat number is vital for the multifunctional activity of the S1 domain-containing proteins. Proteins 2017; 85:602-613. © 2016 Wiley Periodicals, Inc.
Structure-Templated Predictions of Novel Protein Interactions from Sequence Information
Betel, Doron; Breitkreuz, Kevin E; Isserlin, Ruth; Dewar-Darch, Danielle; Tyers, Mike; Hogue, Christopher W. V
2007-01-01
The multitude of functions performed in the cell are largely controlled by a set of carefully orchestrated protein interactions often facilitated by specific binding of conserved domains in the interacting proteins. Interacting domains commonly exhibit distinct binding specificity to short and conserved recognition peptides called binding profiles. Although many conserved domains are known in nature, only a few have well-characterized binding profiles. Here, we describe a novel predictive method known as domain–motif interactions from structural topology (D-MIST) for elucidating the binding profiles of interacting domains. A set of domains and their corresponding binding profiles were derived from extant protein structures and protein interaction data and then used to predict novel protein interactions in yeast. A number of the predicted interactions were verified experimentally, including new interactions of the mitotic exit network, RNA polymerases, nucleotide metabolism enzymes, and the chaperone complex. These results demonstrate that new protein interactions can be predicted exclusively from sequence information. PMID:17892321
Cooley, Anne E; Riley, Sean P; Kral, Keith; Miller, M Clarke; DeMoll, Edward; Fried, Michael G; Stevenson, Brian
2009-07-13
Genes orthologous to the ybaB loci of Escherichia coli and Haemophilus influenzae are widely distributed among eubacteria. Several years ago, the three-dimensional structures of the YbaB orthologs of both E. coli and H. influenzae were determined, revealing a novel "tweezer"-like structure. However, a function for YbaB had remained elusive, with an early study of the H. influenzae ortholog failing to detect DNA-binding activity. Our group recently determined that the Borrelia burgdorferi YbaB ortholog, EbfC, is a DNA-binding protein. To reconcile those results, we assessed the abilities of both the H. influenzae and E. coli YbaB proteins to bind DNA to which B. burgdorferi EbfC can bind. Both the H. influenzae and the E. coli YbaB proteins bound to tested DNAs. DNA-binding was not well competed with poly-dI-dC, indicating some sequence preferences for those two proteins. Analyses of binding characteristics determined that both YbaB orthologs bind as homodimers. Different DNA sequence preferences were observed between H. influenzae YbaB, E. coli YbaB and B. burgdorferi EbfC, consistent with amino acid differences in the putative DNA-binding domains of these proteins. Three distinct members of the YbaB/EbfC bacterial protein family have now been demonstrated to bind DNA. Members of this protein family are encoded by a broad range of bacteria, including many pathogenic species, and results of our studies suggest that all such proteins have DNA-binding activities. The functions of YbaB/EbfC family members in each bacterial species are as-yet unknown, but given the ubiquity of these DNA-binding proteins among Eubacteria, further investigations are warranted.
Smagulova, Fatima; Brick, Kevin; Pu, Yongmei; Sengupta, Uttara; Camerini-Otero, R Daniel; Petukhova, Galina V
2013-07-22
Homologous recombination is the key process that generates genetic diversity and drives evolution. SPO11 protein triggers recombination by introducing DNA double-stranded breaks at discrete areas of the genome called recombination hotspots. The hotspot locations are largely determined by the DNA binding specificity of the PRDM9 protein in humans, mice and most other mammals. In budding yeast Saccharomyces cerevisiae, which lacks a Prdm9 gene, meiotic breaks are formed opportunistically in the regions of accessible chromatin, primarily at gene promoters. The genome-wide distribution of hotspots in this organism can be altered by tethering Spo11 protein to Gal4 recognition sequences in the strain expressing Spo11 attached to the DNA binding domain of the Gal4 transcription factor. To establish whether similar re-targeting of meiotic breaks can be achieved in PRDM9-containing organisms we have generated a Gal4BD-Spo11 mouse that expresses SPO11 protein joined to the DNA binding domain of yeast Gal4. We have mapped the genome-wide distribution of the recombination initiation sites in the Gal4BD-Spo11 mice. More than two hundred of the hotspots in these mice were novel and were likely defined by Gal4BD, as the Gal4 consensus motif was clustered around the centers in these hotspots. Surprisingly, meiotic DNA breaks in the Gal4BD-Spo11 mice were significantly depleted near the ends of chromosomes. The effect is particularly striking at the pseudoautosomal region of the X and Y chromosomes - normally the hottest region in the genome. Our data suggest that specific, yet-unidentified factors influence the initiation of meiotic recombination at subtelomeric chromosomal regions.
Majumder, P; Choudhury, A; Banerjee, M; Lahiri, A; Bhattacharyya, N P
2007-08-01
To investigate the mechanism of increased expression of caspase-1 caused by exogenous Hippi, observed earlier in HeLa and Neuro2A cells, in this work we identified a specific motif AAAGACATG (-101 to -93) at the caspase-1 gene upstream sequence where HIPPI could bind. Various mutations in this specific sequence compromised the interaction, showing the specificity of the interactions. In the luciferase reporter assay, when the reporter gene was driven by caspase-1 gene upstream sequences (-151 to -92) with the mutation G to T at position -98, luciferase activity was decreased significantly in green fluorescent protein-Hippi-expressing HeLa cells in comparison to that obtained with the wild-type caspase-1 gene 60 bp upstream sequence, indicating the biological significance of such binding. It was observed that the C-terminal 'pseudo' death effector domain of HIPPI interacted with the 60 bp (-151 to -92) upstream sequence of the caspase-1 gene containing the motif. We further observed that expression of caspase-8 and caspase-10 was increased in green fluorescent protein-Hippi-expressing HeLa cells. In addition, HIPPI interacted in vitro with putative promoter sequences of these genes, containing a similar motif. In summary, we identified a novel function of HIPPI; it binds to specific upstream sequences of the caspase-1, caspase-8 and caspase-10 genes and alters the expression of the genes. This result showed the motif-specific interaction of HIPPI with DNA, and indicates that it could act as a transcription regulator.
Sequence-specific DNA binding by MYC/MAX to low-affinity non-E-box motifs.
Allevato, Michael; Bolotin, Eugene; Grossman, Mark; Mane-Padros, Daniel; Sladek, Frances M; Martinez, Ernest
2017-01-01
The MYC oncoprotein regulates transcription of a large fraction of the genome as an obligatory heterodimer with the transcription factor MAX. The MYC:MAX heterodimer and MAX:MAX homodimer (hereafter MYC/MAX) bind Enhancer box (E-box) DNA elements (CANNTG) and have the greatest affinity for the canonical MYC E-box (CME) CACGTG. However, MYC:MAX also recognizes E-box variants and was reported to bind DNA in a "non-specific" fashion in vitro and in vivo. Here, in order to identify potential additional non-canonical binding sites for MYC/MAX, we employed high throughput in vitro protein-binding microarrays, along with electrophoretic mobility-shift assays and bioinformatic analyses of MYC-bound genomic loci in vivo. We identified all hexameric motifs preferentially bound by MYC/MAX in vitro, which include the low-affinity non-E-box sequence AACGTT, and found that the vast majority (87%) of MYC-bound genomic sites in a human B cell line contain at least one of the top 21 motifs bound by MYC:MAX in vitro. We further show that high MYC/MAX concentrations are needed for specific binding to the low-affinity sequence AACGTT in vitro and that elevated MYC levels in vivo more markedly increase the occupancy of AACGTT sites relative to CME sites, especially at distal intergenic and intragenic loci. Hence, MYC binds diverse DNA motifs with a broad range of affinities in a sequence-specific and dose-dependent manner, suggesting that MYC overexpression has more selective effects on the tumor transcriptome than previously thought.
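The motif classes named in the abstract above (the general E-box CANNTG, the canonical MYC E-box CACGTG, and the non-E-box hexamer AACGTT) can be illustrated with a small sequence scan. This is only a sketch for orientation, not the authors' analysis pipeline; the function names `find_eboxes` and `classify` and the example sequence are hypothetical.

```python
import re

# CANNTG is the E-box consensus quoted in the abstract; N stands for any base.
# The lookahead lets overlapping matches be reported as well.
EBOX = re.compile(r"(?=(CA[ACGT]{2}TG))")


def find_eboxes(seq):
    """Return (0-based position, hexamer) for every E-box match in seq."""
    return [(m.start(), m.group(1)) for m in EBOX.finditer(seq.upper())]


def classify(hexamer):
    """Label a hexamer using the three classes mentioned in the abstract."""
    hexamer = hexamer.upper()
    if hexamer == "CACGTG":
        return "canonical MYC E-box (CME)"
    if re.fullmatch(r"CA[ACGT]{2}TG", hexamer):
        return "E-box variant"
    return "non-E-box"


if __name__ == "__main__":
    # Made-up test sequence, not taken from the study.
    for pos, hexamer in find_eboxes("ttCACGTGaaCAGCTGaa"):
        print(pos, hexamer, classify(hexamer))
    print("AACGTT", classify("AACGTT"))
```

A plain pattern match like this says nothing about affinity; the abstract's point is that binding to sequences such as AACGTT depends on MYC/MAX concentration, which a motif scan alone cannot capture.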
Weber, Axel; Borghouts, Corina; Brendel, Christian; Moriggl, Richard; Delis, Natalia; Brill, Boris; Vafaizadeh, Vida; Groner, Bernd
2013-01-01
The signal transducer and activator of transcription Stat5 is transiently activated by growth factor and cytokine signals in normal cells, but its persistent activation has been observed in a wide range of human tumors. Aberrant Stat5 activity was initially observed in leukemias, but subsequently also found in carcinomas. We investigated the importance of Stat5 in human tumor cell lines. shRNA mediated downregulation of Stat5 revealed the dependence of prostate and breast cancer cells on the expression of this transcription factor. We extended these inhibition studies and derived a peptide aptamer (PA) ligand, which directly interacts with the DNA-binding domain of Stat5 in a yeast-two-hybrid screen. The Stat5 specific PA sequence is embedded in a thioredoxin (hTRX) scaffold protein. The resulting recombinant protein S5-DBD-PA was expressed in bacteria, purified and introduced into tumor cells by protein transduction. Alternatively, S5-DBD-PA was expressed in the tumor cells after infection with a S5-DBD-PA encoding gene transfer vector. Both strategies impaired the DNA-binding ability of Stat5, suppressed Stat5 dependent transactivation and caused its intracellular degradation. Our experiments describe a peptide based inhibitor of Stat5 protein activity which can serve as a lead for the development of a clinically useful compound for cancer treatment. PMID:24276378
Kanai, Akio; Oida, Hanako; Matsuura, Nana; Doi, Hirofumi
2003-01-01
We systematically screened a genomic DNA library to identify proteins of the hyperthermophilic archaeon Pyrococcus furiosus using an expression cloning method. One gene product, which we named FAU-1 (P. furiosus AU-binding), demonstrated the strongest binding activity of all the genomic library-derived proteins tested against an AU-rich RNA sequence. The protein was purified to near homogeneity as a 54 kDa single polypeptide, and the gene locus corresponding to this FAU-1 activity was also sequenced. The FAU-1 gene encoded a 472-amino-acid protein that was characterized by highly charged domains consisting of both acidic and basic amino acids. The N-terminal half of the gene had a degree of similarity (25%) with RNase E from Escherichia coli. Five rounds of RNA-binding-site selection and footprinting analysis showed that the FAU-1 protein binds specifically to the AU-rich sequence in a loop region of a possible RNA ligand. Moreover, we demonstrated that the FAU-1 protein acts as an oligomer, and mainly as a trimer. These results showed that the FAU-1 protein is a novel heat-stable protein with an RNA loop-binding characteristic. PMID:12614195
Non-coding RNA generated following lariat-debranching mediates targeting of AID to DNA
Zheng, Simin; Vuong, Bao Q.; Vaidyanathan, Bharat; Lin, Jia-Yu; Huang, Feng-Ting; Chaudhuri, Jayanta
2015-01-01
Transcription through immunoglobulin switch (S) regions is essential for class switch recombination (CSR) but no molecular function of the transcripts has been described. Likewise, recruitment of activation-induced cytidine deaminase (AID) to S regions is critical for CSR; however, the underlying mechanism has not been fully elucidated. Here, we demonstrate that intronic switch RNA acts in trans to target AID to S region DNA. AID binds directly to switch RNA through G-quadruplexes formed by the RNA molecules. Disruption of this interaction by mutation of a key residue in the putative RNA-binding domain of AID impairs recruitment of AID to S region DNA, thereby abolishing CSR. Additionally, inhibition of RNA lariat processing leads to loss of AID localization to S regions and compromises CSR; both defects can be rescued by exogenous expression of switch transcripts in a sequence-specific manner. These studies uncover an RNA-mediated mechanism of targeting AID to DNA. PMID:25957684
Epigenetic regulatory mechanisms in vertebrate eye development and disease
Cvekl, A; Mitton, KP
2014-01-01
Eukaryotic DNA is organized as a nucleoprotein polymer termed chromatin with nucleosomes serving as its repetitive architectural units. Cellular differentiation is a dynamic process driven by activation and repression of specific sets of genes, partitioning the genome into transcriptionally active and inactive chromatin domains. Chromatin architecture at individual genes/loci may remain stable through cell divisions, from a single mother cell to its progeny during mitosis, and represents an example of epigenetic phenomena. Epigenetics refers to heritable changes caused by mechanisms distinct from the primary DNA sequence. Recent studies have shown a number of links between chromatin structure, gene expression, extracellular signaling, and cellular differentiation during eye development. This review summarizes recent advances in this field, and the relationship between sequence-specific DNA-binding transcription factors and their roles in recruitment of chromatin remodeling enzymes. In addition, lens and retinal differentiation is accompanied by specific changes in the nucleolar organization, expression of non-coding RNAs, and DNA methylation. Epigenetic regulatory mechanisms in ocular tissues represent exciting areas of research that have opened new avenues for understanding normal eye development, inherited eye diseases and eye diseases related to aging and the environment. PMID:20179734
In vitro fluorescence studies of transcription factor IIB-DNA interaction.
Górecki, Andrzej; Figiel, Małgorzata; Dziedzicka-Wasylewska, Marta
2015-01-01
General transcription factor TFIIB is one of the basal constituents of the preinitiation complex of eukaryotic RNA polymerase II, acting as a bridge between the preinitiation complex and the polymerase, and binding promoter DNA in an asymmetric manner, thereby defining the direction of the transcription. Methods of fluorescence spectroscopy together with circular dichroism spectroscopy were used to observe conformational changes in the structure of recombinant human TFIIB after binding to specific DNA sequence. To facilitate the exploration of the structural changes, several site-directed mutations have been introduced altering the fluorescence properties of the protein. Our observations showed that binding of specific DNA sequences changed the protein structure and dynamics, and TFIIB may exist in two conformational states, which can be described by a different microenvironment of W52. Fluorescence studies using both intrinsic and exogenous fluorophores showed that these changes significantly depended on the recognition sequence and concerned various regions of the protein, including those interacting with other transcription factors and RNA polymerase II. DNA binding can cause rearrangements in regions of proteins interacting with the polymerase in a manner dependent on the recognized sequences, and therefore, influence the gene expression.
Schorderet, Daniel F; Escher, Pascal
2009-11-01
NR2E3, also called photoreceptor-specific nuclear receptor (PNR), is a transcription factor of the nuclear hormone receptor superfamily whose expression is uniquely restricted to photoreceptors. There, its physiological activity is essential for proper rod and cone photoreceptor development and maintenance. Thirty-two different mutations in NR2E3 have been identified in either homozygous or compound heterozygous state in the recessively inherited enhanced S-cone sensitivity syndrome (ESCS), Goldmann-Favre syndrome (GFS), and clumped pigmentary retinal degeneration (CPRD). The clinical phenotype common to all these patients is night blindness, rudimental or absent rod function, and hyperfunction of the "blue" S-cones. A single p.G56R mutation is inherited in a dominant manner and causes retinitis pigmentosa (RP). We have established a new locus-specific database for NR2E3 (www.LOVD.nl/eye), containing all reported mutations, polymorphisms, and unclassified sequence variants, including novel ones. A high proportion of mutations are located in the evolutionarily-conserved DNA-binding domains (DBDs) and ligand-binding domains (LBDs) of NR2E3. Based on homology modeling of these NR2E3 domains, we propose a structural localization of mutated residues. The high variability of clinical phenotypes observed in patients affected by NR2E3-linked retinal degenerations may be caused by different disease mechanisms, including absence of DNA-binding, altered interactions with transcriptional coregulators, and differential activity of modifier genes.
Protein sequences bound to mineral surfaces persist into deep time
Demarchi, Beatrice; Hall, Shaun; Roncal-Herrero, Teresa; Freeman, Colin L; Woolley, Jos; Crisp, Molly K; Wilson, Julie; Fotakis, Anna; Fischer, Roman; Kessler, Benedikt M; Rakownikow Jersie-Christensen, Rosa; Olsen, Jesper V; Haile, James; Thomas, Jessica; Marean, Curtis W; Parkington, John; Presslee, Samantha; Lee-Thorp, Julia; Ditchfield, Peter; Hamilton, Jacqueline F; Ward, Martyn W; Wang, Chunting Michelle; Shaw, Marvin D; Harrison, Terry; Domínguez-Rodrigo, Manuel; MacPhee, Ross DE; Kwekason, Amandus; Ecker, Michaela; Kolska Horwitz, Liora; Chazan, Michael; Kröger, Roland; Thomas-Oates, Jane; Harding, John H; Cappellini, Enrico; Penkman, Kirsty; Collins, Matthew J
2016-01-01
Proteins persist longer in the fossil record than DNA, but the longevity, survival mechanisms and substrates remain contested. Here, we demonstrate the role of mineral binding in preserving the protein sequence in ostrich (Struthionidae) eggshell, including from the palaeontological sites of Laetoli (3.8 Ma) and Olduvai Gorge (1.3 Ma) in Tanzania. By tracking protein diagenesis back in time we find consistent patterns of preservation, demonstrating authenticity of the surviving sequences. Molecular dynamics simulations of struthiocalcin-1 and -2, the dominant proteins within the eggshell, reveal that distinct domains bind to the mineral surface. It is the domain with the strongest calculated binding energy to the calcite surface that is selectively preserved. Thermal age calculations demonstrate that the Laetoli and Olduvai peptides are 50 times older than any previously authenticated sequence (equivalent to ~16 Ma at a constant 10°C). DOI: http://dx.doi.org/10.7554/eLife.17092.001 PMID:27668515
An ancient protein-DNA interaction underlying metazoan sex determination.
Murphy, Mark W; Lee, John K; Rojo, Sandra; Gearhart, Micah D; Kurahashi, Kayo; Banerjee, Surajit; Loeuille, Guy-André; Bashamboo, Anu; McElreavey, Kenneth; Zarkower, David; Aihara, Hideki; Bardwell, Vivian J
2015-06-01
DMRT transcription factors are deeply conserved regulators of metazoan sexual development. They share the DM DNA-binding domain, a unique intertwined double zinc-binding module followed by a C-terminal recognition helix, which binds a pseudopalindromic target DNA. Here we show that DMRT proteins use a unique binding interaction, inserting two adjacent antiparallel recognition helices into a widened DNA major groove to make base-specific contacts. Versatility in how specific base contacts are made allows human DMRT1 to use multiple DNA binding modes (tetramer, trimer and dimer). Chromatin immunoprecipitation with exonuclease treatment (ChIP-exo) indicates that multiple DNA binding modes also are used in vivo. We show that mutations affecting residues crucial for DNA recognition are associated with an intersex phenotype in flies and with male-to-female sex reversal in humans. Our results illuminate an ancient molecular interaction underlying much of metazoan sexual development.
An ancient protein-DNA interaction underlying metazoan sex determination
Murphy, Mark W.; Lee, John K.; Rojo, Sandra; ...
2015-05-25
DMRT transcription factors are deeply conserved regulators of metazoan sexual development. They share the DM DNA-binding domain, a unique intertwined double zinc-binding module followed by a C-terminal recognition helix, which binds a pseudopalindromic target DNA. In this paper, we show that DMRT proteins use a unique binding interaction, inserting two adjacent antiparallel recognition helices into a widened DNA major groove to make base-specific contacts. Versatility in how specific base contacts are made allows human DMRT1 to use multiple DNA binding modes (tetramer, trimer and dimer). Chromatin immunoprecipitation with exonuclease treatment (ChIP-exo) indicates that multiple DNA binding modes also are used in vivo. We show that mutations affecting residues crucial for DNA recognition are associated with an intersex phenotype in flies and with male-to-female sex reversal in humans. Finally, our results illuminate an ancient molecular interaction underlying much of metazoan sexual development.
An ancient protein-DNA interaction underlying metazoan sex determination
DOE Office of Scientific and Technical Information (OSTI.GOV)
Murphy, Mark W.; Lee, John K.; Rojo, Sandra
DMRT transcription factors are deeply conserved regulators of metazoan sexual development. They share the DM DNA-binding domain, a unique intertwined double zinc-binding module followed by a C-terminal recognition helix, which binds a pseudopalindromic target DNA. In this paper, we show that DMRT proteins use a unique binding interaction, inserting two adjacent antiparallel recognition helices into a widened DNA major groove to make base-specific contacts. Versatility in how specific base contacts are made allows human DMRT1 to use multiple DNA binding modes (tetramer, trimer and dimer). Chromatin immunoprecipitation with exonuclease treatment (ChIP-exo) indicates that multiple DNA binding modes also are used in vivo. We show that mutations affecting residues crucial for DNA recognition are associated with an intersex phenotype in flies and with male-to-female sex reversal in humans. Finally, our results illuminate an ancient molecular interaction underlying much of metazoan sexual development.
Direct observation of transcription activator-like effector (TALE) protein dynamics
NASA Astrophysics Data System (ADS)
Cuculis, Luke; Abil, Zhanar; Zhao, Huimin; Schroeder, Charles M.
2014-03-01
In this work, we describe a single molecule assay to probe the site-search dynamics of transcription activator-like effector (TALE) proteins along DNA. In modern genetics, the ability to selectively edit the human genome is an unprecedented development, driven by recent advances in targeted nuclease proteins. Specific gene editing can be accomplished using TALE proteins, which are programmable DNA-binding proteins that can be fused to a nuclease domain. In this way, TALENs are a leading technology that has shown great success in the genomic editing of pluripotent stem cells. A major hurdle facing clinical implementation, however, is the potential for deleterious off-target binding events. For these reasons, a molecular-level understanding of TALE binding and target sequence search on DNA is essential. To this end, we developed a single-molecule fluorescence imaging assay that provides a first-of-its-kind view of the 1-D diffusion of TALE proteins along stretched DNA. Taken together with co-crystal structures of DNA-bound TALEs, our results suggest a rotationally-coupled, major groove tracking model for diffusion. We further report diffusion constants for TALE proteins as a function of salt concentration, consistent with previously described models of 1-D protein diffusion.
Characterization of Prdm9 in equids and sterility in mules.
Steiner, Cynthia C; Ryder, Oliver A
2013-01-01
Prdm9 (Meisetz) is the first speciation gene discovered in vertebrates conferring reproductive isolation. This locus encodes a meiosis-specific histone H3 methyltransferase that specifies meiotic recombination hotspots during gametogenesis. Allelic differences in Prdm9, characterized for a variable number of zinc finger (ZF) domains, have been associated with hybrid sterility in male house mice via spermatogenic failure at the pachytene stage. The mule, a classic example of hybrid sterility in mammals also exhibits a similar spermatogenesis breakdown, making Prdm9 an interesting candidate to evaluate in equine hybrids. In this study, we characterized the Prdm9 gene in all species of equids by analyzing sequence variation of the ZF domains and estimating positive selection. We also evaluated the role of Prdm9 in hybrid sterility by assessing allelic differences of ZF domains in equine hybrids. We found remarkable variation in the sequence and number of ZF domains among equid species, ranging from five domains in the Tibetan kiang and Asiatic wild ass, to 14 in the Grevy's zebra. Positive selection was detected in all species at amino acid sites known to be associated with DNA-binding specificity of ZF domains in mice and humans. Equine hybrids, in particular a quartet pedigree composed of a fertile mule showed a mosaic of sequences and number of ZF domains suggesting that Prdm9 variation does not seem by itself to contribute to equine hybrid sterility.
Characterization of Prdm9 in Equids and Sterility in Mules
Steiner, Cynthia C.; Ryder, Oliver A.
2013-01-01
Prdm9 (Meisetz) is the first speciation gene discovered in vertebrates conferring reproductive isolation. This locus encodes a meiosis-specific histone H3 methyltransferase that specifies meiotic recombination hotspots during gametogenesis. Allelic differences in Prdm9, characterized for a variable number of zinc finger (ZF) domains, have been associated with hybrid sterility in male house mice via spermatogenic failure at the pachytene stage. The mule, a classic example of hybrid sterility in mammals also exhibits a similar spermatogenesis breakdown, making Prdm9 an interesting candidate to evaluate in equine hybrids. In this study, we characterized the Prdm9 gene in all species of equids by analyzing sequence variation of the ZF domains and estimating positive selection. We also evaluated the role of Prdm9 in hybrid sterility by assessing allelic differences of ZF domains in equine hybrids. We found remarkable variation in the sequence and number of ZF domains among equid species, ranging from five domains in the Tibetan kiang and Asiatic wild ass, to 14 in the Grevy’s zebra. Positive selection was detected in all species at amino acid sites known to be associated with DNA-binding specificity of ZF domains in mice and humans. Equine hybrids, in particular a quartet pedigree composed of a fertile mule showed a mosaic of sequences and number of ZF domains suggesting that Prdm9 variation does not seem by itself to contribute to equine hybrid sterility. PMID:23613924
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, T; Huang, S; Zhao, XF
Recent studies indicate that the DNA recognition domain of transcription activator-like (TAL) effectors can be combined with the nuclease domain of FokI restriction enzyme to produce TAL effector nucleases (TALENs) that, in pairs, bind adjacent DNA target sites and produce double-strand breaks between the target sequences, stimulating non-homologous end-joining and homologous recombination. Here, we exploit the four prevalent TAL repeats and their DNA recognition cipher to develop a 'modular assembly' method for rapid production of designer TALENs (dTALENs) that recognize unique DNA sequence up to 23 bases in any gene. We have used this approach to engineer 10 dTALENs to target specific loci in native yeast chromosomal genes. All dTALENs produced high rates of site-specific gene disruptions and created strains with expected mutant phenotypes. Moreover, dTALENs stimulated high rates (up to 34%) of gene replacement by homologous recombination. Finally, dTALENs caused no detectable cytotoxicity and minimal levels of undesired genetic mutations in the treated yeast strains. These studies expand the realm of verified TALEN activity from cultured human cells to an intact eukaryotic organism and suggest that low-cost, highly dependable dTALENs can assume a significant role for gene modifications of value in human and animal health, agriculture and industry.
Phosphorylation of serine-515 activates the Mammalian maintenance methyltransferase Dnmt1.
Goyal, Rachna; Rathert, Philipp; Laser, Heike; Gowher, Humaira; Jeltsch, Albert
2007-09-01
DNA methyltransferase 1 methylates hemi-methylated CG sites generated during DNA replication. Serine 515 of this enzyme has been shown to be phosphorylated. To explore the importance of S515 phosphorylation, we generated mutants of Dnmt1 which removed the phosphorylation potential (S515A) or mimicked phosphoserine (S515E), purified the proteins from insect cells and analyzed their DNA methylation activity in vitro. The S515E mutant was found to be active, while the S515A mutant had a severe loss in activity when compared to the wild type protein. The loss of activity of the S515A variant was not due to loss of DNA binding capacity. Furthermore, we show that a phosphorylated peptide whose sequence mimics the surrounding of Ser515 (EKIYIS(P)KIVVE) inhibited the activity of wild type Dnmt1 ten-fold more than the non-phosphorylated peptide. The inhibition was specific for Dnmt1 and for the particular peptide sequence. Our data suggest that phosphorylation of Ser515 is important for an interaction between the N-terminal domain of Dnmt1 and its catalytic domain that is necessary for activity and that this interaction is specifically disrupted by the phosphorylated peptide. We conclude that phosphorylation of Dnmt1 at Ser515 could be an important regulator of Dnmt1 activity during cell cycle and after proliferative stimuli.
Molecular mechanism of transcription inhibition by phage T7 gp2 protein.
Mekler, Vladimir; Minakhin, Leonid; Sheppard, Carol; Wigneshweraraj, Sivaramesh; Severinov, Konstantin
2011-11-11
Escherichia coli T7 bacteriophage gp2 protein is a potent inhibitor of host RNA polymerase (RNAP). gp2 inhibits formation of open promoter complex by binding to the β' jaw, an RNAP domain that interacts with downstream promoter DNA. Here, we used an engineered promoter with an optimized sequence to obtain and characterize a specific promoter complex containing RNAP and gp2. In this complex, localized melting of promoter DNA is initiated but does not propagate to include the point of the transcription start. As a result, the complex is transcriptionally inactive. Using a highly sensitive RNAP beacon assay, we performed quantitative real-time measurements of specific binding of the RNAP-gp2 complex to promoter DNA and various promoter fragments. In this way, the effect of gp2 on RNAP interaction with promoters was dissected. As expected, gp2 greatly decreased RNAP affinity to downstream promoter duplex. However, gp2 also inhibited RNAP binding to promoter fragments that lacked downstream promoter DNA that interacts with the β' jaw. The inhibition was caused by gp2-mediated decrease of the RNAP binding affinity to template and non-template strand segments of the transcription bubble downstream of the -10 promoter element. The inhibition of RNAP interactions with single-stranded segments of the transcription bubble by gp2 is a novel effect, which may occur via allosteric mechanism that is set in motion by the gp2 binding to the β' jaw. Copyright © 2011 Elsevier Ltd. All rights reserved.
Screening for Protein-DNA Interactions by Automatable DNA-Protein Interaction ELISA
Schüssler, Axel; Kolukisaoglu, H. Üner; Koch, Grit; Wallmeroth, Niklas; Hecker, Andreas; Thurow, Kerstin; Zell, Andreas; Harter, Klaus; Wanke, Dierk
2013-01-01
DNA-binding proteins (DBPs), such as transcription factors, constitute about 10% of the protein-coding genes in eukaryotic genomes and play pivotal roles in the regulation of chromatin structure and gene expression by binding to short stretches of DNA. Despite their number and importance, only for a minor portion of DBPs the binding sequence had been disclosed. Methods that allow the de novo identification of DNA-binding motifs of known DBPs, such as protein binding microarray technology or SELEX, are not yet suited for high-throughput and automation. To close this gap, we report an automatable DNA-protein-interaction (DPI)-ELISA screen of an optimized double-stranded DNA (dsDNA) probe library that allows the high-throughput identification of hexanucleotide DNA-binding motifs. In contrast to other methods, this DPI-ELISA screen can be performed manually or with standard laboratory automation. Furthermore, output evaluation does not require extensive computational analysis to derive a binding consensus. We could show that the DPI-ELISA screen disclosed the full spectrum of binding preferences for a given DBP. As an example, AtWRKY11 was used to demonstrate that the automated DPI-ELISA screen revealed the entire range of in vitro binding preferences. In addition, protein extracts of AtbZIP63 and the DNA-binding domain of AtWRKY33 were analyzed, which led to a refinement of their known DNA-binding consensi. Finally, we performed a DPI-ELISA screen to disclose the DNA-binding consensus of a yet uncharacterized putative DBP, AtTIFY1. A palindromic TGATCA-consensus was uncovered and we could show that the GATC-core is compulsory for AtTIFY1 binding. This specific interaction between AtTIFY1 and its DNA-binding motif was confirmed by in vivo plant one-hybrid assays in protoplasts. 
Thus, the DPI-ELISA screen is a valuable and broadly applicable method for de novo binding site identification of DBPs, including under automated conditions, and a promising approach toward a deeper understanding of gene regulation in any organism of choice. PMID:24146751
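The palindromic nature of the TGATCA consensus and its GATC core can be checked mechanically: a double-stranded site is palindromic when it equals its own reverse complement. The following is a minimal sketch of that check; the helper names are hypothetical and are not part of the DPI-ELISA workflow described above.

```python
# Watson-Crick complement table for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_palindromic(seq):
    """True if the site reads the same on both strands (5' to 3')."""
    return seq.upper() == reverse_complement(seq)


if __name__ == "__main__":
    for motif in ("TGATCA", "GATC", "TGACCA"):
        print(motif, is_palindromic(motif))
```

A palindromic site presents the same sequence on either strand, which is often taken as a hint that a factor may bind as a dimer, although the abstract makes no such claim for AtTIFY1.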
Bhattacharya, D; Steinkötter, J; Melkonian, M
1993-12-01
Centrin (= caltractin) is a ubiquitous, cytoskeletal protein which is a member of the EF-hand superfamily of calcium-binding proteins. A centrin-coding cDNA was isolated and characterized from the prasinophyte green alga Scherffelia dubia. Centrin PCR amplification primers were used to isolate partial, homologous cDNA sequences from the green algae Tetraselmis striata and Spermatozopsis similis. Annealing analyses suggested that centrin is a single-copy-coding region in T. striata and S. similis and other green algae studied. Centrin-coding regions from S. dubia, S. similis and T. striata encode four colinear EF-hand domains which putatively bind calcium. Phylogenetic analyses, including homologous sequences from Chlamydomonas reinhardtii and the land plant Atriplex nummularia, demonstrate that the domains of centrins are congruent and arose from the two-fold duplication of an ancestral EF hand with Domains 1+3 and Domains 2+4 clustering. The domains of centrins are also congruent with those of calmodulins demonstrating that, like calmodulin, centrin is an ancient protein which arose within the ancestor of all eukaryotes via gene duplication. Phylogenetic relationships inferred from centrin-coding region comparisons mirror results of small subunit ribosomal RNA sequence analyses suggesting that centrin-coding regions are useful evolutionary markers within the green algae.
A Feature-Based Approach to Modeling Protein–DNA Interactions
Segal, Eran
2008-01-01
Transcription factor (TF) binding to its DNA target site is a fundamental regulatory interaction. The most common model used to represent TF binding specificities is a position specific scoring matrix (PSSM), which assumes independence between binding positions. However, in many cases, this simplifying assumption does not hold. Here, we present feature motif models (FMMs), a novel probabilistic method for modeling TF–DNA interactions, based on log-linear models. Our approach uses sequence features to represent TF binding specificities, where each feature may span multiple positions. We develop the mathematical formulation of our model and devise an algorithm for learning its structural features from binding site data. We also developed a discriminative motif finder, which discovers de novo FMMs that are enriched in target sets of sequences compared to background sets. We evaluate our approach on synthetic data and on the widely used TF chromatin immunoprecipitation (ChIP) dataset of Harbison et al. We then apply our algorithm to high-throughput TF ChIP data from mouse and human, reveal sequence features that are present in the binding specificities of mouse and human TFs, and show that FMMs explain TF binding significantly better than PSSMs. Our FMM learning and motif finder software are available at http://genie.weizmann.ac.il/. PMID:18725950
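To make the independence assumption of a PSSM concrete, here is a small sketch that builds a log-odds matrix from aligned sites and scores a sequence as a plain sum over positions. The toy sites, the pseudocount, and the uniform background are illustrative assumptions, and this is not the FMM software referenced in the abstract.

```python
import math

BASES = "ACGT"


def build_pssm(sites, pseudocount=1.0, background=0.25):
    """Build a log2-odds PSSM from equal-length aligned binding sites."""
    length = len(sites[0])
    pssm = []
    for pos in range(length):
        counts = {base: pseudocount for base in BASES}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        pssm.append(
            {base: math.log2((counts[base] / total) / background) for base in BASES}
        )
    return pssm


def score(pssm, seq):
    """Score a sequence as the sum of per-position terms.

    The sum is exactly the independence assumption: no term depends
    on more than one position.
    """
    return sum(column[base] for column, base in zip(pssm, seq))


if __name__ == "__main__":
    toy_sites = ["TGATCA", "TGATCA", "TGACCA"]  # made-up example sites
    pssm = build_pssm(toy_sites)
    print(score(pssm, "TGATCA"), score(pssm, "AAAAAA"))
```

A feature-based model of the kind described above would add terms that span several positions at once (for example, a weight for a specific dinucleotide), which a per-position sum cannot express.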
Grove, A; Galeone, A; Mayol, L; Geiduschek, E P
1996-07-12
TF1 is a member of the family of type II DNA-binding proteins, which also includes the bacterial HU proteins and the Escherichia coli integration host factor (IHF). Distinctive to TF1, which is encoded by the Bacillus subtilis bacteriophage SPO1, is its preferential binding to DNA in which thymine is replaced by 5-hydroxymethyluracil (hmU), as it is in the phage genome. TF1 binds to preferred sites within the phage genome and generates pronounced DNA bending. The extent to which DNA flexibility contributes to the sequence-specific binding of TF1, and the connection between hmU preference and DNA flexibility, have been examined. Model flexible sites, consisting of consecutive mismatches, increase the affinity of thymine-containing DNA for TF1. In particular, tandem mismatches separated by nine base-pairs generate an increase, by orders of magnitude, in the affinity of TF1 for T-containing DNA with the sequence of a preferred TF1 binding site, and fully match the affinity of TF1 for this cognate site in hmU-containing DNA (Kd approximately 3 nM). Other placements of loops generate suboptimal binding. This is consistent with a significant contribution of site-specific DNA flexibility to complex formation. Analysis of complexes with hmU-DNA of decreasing length shows that a major part of the binding affinity is generated within a central 19 bp segment (delta G0 = 41.7 kJ mol-1) with more-distal DNA contributing modestly to the affinity (delta delta G = -0.42 kJ mol-1 bp-1 on increasing duplex length to 37 bp). However, a previously characterised thermostable and more tightly binding mutant TF1, TF1(E15G/T32I), derives most of its extra affinity from interaction with flanking DNA. We propose that inherent but sequence-dependent deformability of hmU-containing DNA underlies the preferential binding of TF1 and that TF1-induced DNA bending is a result of distortions at two distinct sites separated by 9 bp of duplex DNA.
Kamenova, Ivanka; Warfield, Linda
2014-01-01
Most RNA polymerase (Pol) II promoters lack a TATA element, yet nearly all Pol II transcription requires TATA binding protein (TBP). While the TBP-TATA interaction is critical for transcription at TATA-containing promoters, it has been unclear whether TBP sequence-specific DNA contacts are required for transcription at TATA-less genes. Transcription factor IID (TFIID), the TBP-containing coactivator that functions at most TATA-less genes, recognizes short sequence-specific promoter elements in metazoans, but analogous promoter elements have not been identified in Saccharomyces cerevisiae. We generated a set of mutations in the yeast TBP DNA binding surface and found that most support growth of yeast. Both in vivo and in vitro, many of these mutations are specifically defective for transcription of two TATA-containing genes with only minor defects in transcription of two TATA-less, TFIID-dependent genes. TBP binds several TATA-less promoters with apparent high affinity, but our results suggest that this binding is not important for transcription activity. Our results are consistent with the model that sequence-specific TBP-DNA contacts are not important at yeast TATA-less genes and suggest that other general transcription factors or coactivator subunits are responsible for recognition of TATA-less promoters. Our results also explain why yeast TBP derivatives defective for TATA binding appear defective in activated transcription. PMID:24865972
Kamenova, Ivanka; Warfield, Linda; Hahn, Steven
2014-08-01
Most RNA polymerase (Pol) II promoters lack a TATA element, yet nearly all Pol II transcription requires TATA binding protein (TBP). While the TBP-TATA interaction is critical for transcription at TATA-containing promoters, it has been unclear whether TBP sequence-specific DNA contacts are required for transcription at TATA-less genes. Transcription factor IID (TFIID), the TBP-containing coactivator that functions at most TATA-less genes, recognizes short sequence-specific promoter elements in metazoans, but analogous promoter elements have not been identified in Saccharomyces cerevisiae. We generated a set of mutations in the yeast TBP DNA binding surface and found that most support growth of yeast. Both in vivo and in vitro, many of these mutations are specifically defective for transcription of two TATA-containing genes with only minor defects in transcription of two TATA-less, TFIID-dependent genes. TBP binds several TATA-less promoters with apparent high affinity, but our results suggest that this binding is not important for transcription activity. Our results are consistent with the model that sequence-specific TBP-DNA contacts are not important at yeast TATA-less genes and suggest that other general transcription factors or coactivator subunits are responsible for recognition of TATA-less promoters. Our results also explain why yeast TBP derivatives defective for TATA binding appear defective in activated transcription. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Specific and Modular Binding Code for Cytosine Recognition in Pumilio/FBF (PUF) RNA-binding Domains
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dong, Shuyun; Wang, Yang; Cassidy-Amstutz, Caleb
2011-10-28
Pumilio/fem-3 mRNA-binding factor (PUF) proteins possess a recognition code for bases A, U, and G, allowing designed RNA sequence specificity of their modular Pumilio (PUM) repeats. However, recognition side chains in a PUM repeat for cytosine are unknown. Here we report identification of a cytosine-recognition code by screening random amino acid combinations at conserved RNA recognition positions using a yeast three-hybrid system. This C-recognition code is specific and modular as specificity can be transferred to different positions in the RNA recognition sequence. A crystal structure of a modified PUF domain reveals specific contacts between an arginine side chain and the cytosine base. We applied the C-recognition code to design PUF domains that recognize targets with multiple cytosines and to generate engineered splicing factors that modulate alternative splicing. Finally, we identified a divergent yeast PUF protein, Nop9p, that may recognize natural target RNAs with cytosine. This work deepens our understanding of natural PUF protein target recognition and expands the ability to engineer PUF domains to recognize any RNA sequence.
Heintz, Udo; Schlichting, Ilme
2016-01-01
The design of synthetic optogenetic tools that allow precise spatiotemporal control of biological processes previously inaccessible to optogenetic control has developed rapidly over the last years. Rational design of such tools requires detailed knowledge of allosteric light signaling in natural photoreceptors. To understand allosteric communication between sensor and effector domains, characterization of all relevant signaling states is required. Here, we describe the mechanism of light-dependent DNA binding of the light-oxygen-voltage (LOV) transcription factor Aureochrome 1a from Phaeodactylum tricornutum (PtAu1a) and present crystal structures of a dark state LOV monomer and a fully light-adapted LOV dimer. In combination with hydrogen/deuterium-exchange, solution scattering data and DNA-binding experiments, our studies reveal a light-sensitive interaction between the LOV and basic region leucine zipper DNA-binding domain that together with LOV dimerization results in modulation of the DNA affinity of PtAu1a. We discuss the implications of these results for the design of synthetic LOV-based photosensors with application in optogenetics. DOI: http://dx.doi.org/10.7554/eLife.11860.001 PMID:26754770
Specific DNA binding of the two chicken Deformed family homeodomain proteins, Chox-1.4 and Chox-a.
Sasaki, H; Yokoyama, E; Kuroiwa, A
1990-01-01
The cDNA clones encoding two chicken Deformed (Dfd) family homeobox containing genes Chox-1.4 and Chox-a were isolated. Comparison of their amino acid sequences with another chicken Dfd family homeodomain protein and with those of mouse homologues revealed that strong homologies are located in the amino terminal regions and around the homeodomains. Although homologies in other regions were relatively low, some short conserved sequences were also identified. E. coli-made full length proteins were purified and used for the production of specific antibodies and for DNA binding studies. The binding profiles of these proteins to the 5'-leader and 5'-upstream sequences of Chox-1.4 and Chox-a coding regions were analyzed by immunoprecipitation and DNase I footprint assays. These two Chox proteins bound to the same sites in the 5'-flanking sequences of their coding regions with various affinities and their binding affinities to each site were nearly the same. The consensus sequences of the high and low affinity binding sites were TAATGA(C/G) and CTAATTTT, respectively. A clustered binding site was identified in the 5'-upstream of the Chox-a gene, suggesting that this clustered binding site works as a cis-regulatory element for auto- and/or cross-regulation of Chox-a gene expression. PMID:1970866
Štros, Michal; Kučírek, Martin; Sani, Soodabeh Abbasi; Polanská, Eva
2018-03-01
HMGB1 is a chromatin-associated protein that has been implicated in many important biological processes such as transcription, recombination, DNA repair, and genome stability. These functions include the enhancement of binding of a number of transcription factors, including the tumor suppressor protein p53, to their specific DNA-binding sites. HMGB1 is composed of two highly conserved HMG boxes, linked to an intrinsically disordered acidic C-terminal tail. Previous reports have suggested that the ability of HMGB1 to bend DNA may explain the in vitro HMGB1-mediated increase in sequence-specific DNA binding by p53. The aim of this study was to reinvestigate the importance of HMGB1-induced DNA bending in relationship to the ability of the protein to promote the specific binding of p53 to short DNA duplexes in vitro, and to transactivate two major p53-regulated human genes: Mdm2 and p21/WAF1. Using a number of HMGB1 mutants, we report that the HMGB1-mediated increase in sequence-specific p53 binding to DNA duplexes in vitro depends very little on HMGB1-mediated DNA bending. The presence of the acidic C-terminal tail of HMGB1 and/or the oxidation of the protein can reduce the HMGB1-mediated p53 binding. Interestingly, the induction of transactivation of p53-responsive gene promoters by HMGB1 requires both the ability of the protein to bend DNA and the acidic C-terminal tail, and is promoter-specific. We propose that the efficient transactivation of p53-responsive gene promoters by HMGB1 depends on complex events, rather than solely on the promotion of p53 binding to its DNA cognate sites. Copyright © 2018 Elsevier B.V. All rights reserved.
Wyhs, Nicolas; Walker, David; Giovinazzo, Hugh; Yegnasubramanian, Srinivasan; Nelson, William G
2014-08-01
Methylated DNA binding proteins such as Methyl-CpG Binding Domain Protein 2 (MBD2) can transduce DNA methylation alterations into a repressive signal by recruiting transcriptional co-repressor complexes. Interfering with MBD2 could lead to reactivation of tumor suppressor genes and therefore represents an attractive strategy for epigenetic therapy. We developed and compared fluorescence polarization (FP) and time-resolved fluorescence resonance energy transfer (TR-FRET)-based high-throughput screening (HTS) assays to identify small-molecule inhibitors of the interaction between the methyl binding domain of MBD2 (MBD2-MBD) and methylated DNA. Although both assays performed well in 96-well format, the TR-FRET assay (Z' factor = 0.58) emerged as a superior screening strategy compared with FP (Z' factor = 0.08) when evaluated in an HTS 384-well plate format. Using TR-FRET, we screened the Sigma LOPAC library for MBD2-MBD inhibitors and identified four compounds that also validated in a dose-response series. This included two known DNA intercalators (mitoxantrone and idarubicin) among two other inhibitory compounds (NF449 and aurintricarboxylic acid). All four compounds also inhibited the binding of SP-1, a transcription factor with a GC-rich binding sequence, to a methylated oligonucleotide, demonstrating that the activity was nonspecific. Our results provide proof of principle for using TR-FRET-based HTS to identify small-molecule inhibitors of MBD2 and other DNA-protein interactions. © 2014 Society for Laboratory Automation and Screening.
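The Z' factors quoted above (0.58 for TR-FRET, 0.08 for FP) summarize how well an assay separates positive and negative controls. The abstract does not give the formula; the sketch below uses the widely used definition, Z' = 1 - 3(SD_pos + SD_neg) / |mean_pos - mean_neg|, and the control readings are invented purely for illustration.

```python
from statistics import mean, stdev


def z_prime(positive_controls, negative_controls):
    """Compute the Z' factor from control-well readings.

    Uses sample standard deviations. Values near 1 indicate a wide,
    clean separation between controls; values near or below 0
    indicate overlapping control distributions.
    """
    separation = abs(mean(positive_controls) - mean(negative_controls))
    if separation == 0:
        raise ValueError("control means are identical")
    spread = stdev(positive_controls) + stdev(negative_controls)
    return 1.0 - 3.0 * spread / separation


if __name__ == "__main__":
    # Invented readings, not data from the study.
    pos = [100.0, 102.0, 98.0]
    neg = [10.0, 12.0, 8.0]
    print(round(z_prime(pos, neg), 3))
```

By this measure an assay is usually considered suitable for high-throughput screening when Z' is at least about 0.5, which is consistent with TR-FRET being preferred over FP in the 384-well format.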
NASA Astrophysics Data System (ADS)
Tsao, Shih-Ming; Lai, Ji-Ching; Horng, Horng-Er; Liu, Tu-Chen; Hong, Chin-Yih
2017-04-01
Aptamers are oligonucleotides that can bind to specific target molecules. Most aptamers are generated using random libraries in the standard systematic evolution of ligands by exponential enrichment (SELEX). Each random library contains oligonucleotides with a randomized central region and two fixed primer regions at both ends. The fixed primer regions are necessary for amplifying target-bound sequences by PCR. However, these extra sequences may cause non-specific binding, which potentially interferes with the binding of the random sequences. The Magnetic-Assisted Rapid Aptamer Selection (MARAS) is a newly developed protocol for generating single-strand DNA aptamers. No repeat selection cycle is required in the protocol. This study proposes and demonstrates a method to isolate aptamers for C-reactive proteins (CRP) from a randomized ssDNA library containing no fixed sequences at 5′ and 3′ termini using the MARAS platform. Furthermore, the isolated primer-free aptamer was sequenced and binding affinity for CRP was analyzed. The specificity of the obtained aptamer was validated using blind serum samples. The result was consistent with monoclonal antibody-based nephelometry analysis, which indicated that a primer-free aptamer has high specificity toward targets. MARAS is a feasible platform for efficiently generating primer-free aptamers for clinical diagnoses.
Malhotra, Sony; Sowdhamini, Ramanathan
2013-08-01
The interaction of proteins with their respective DNA targets is known to control many high-fidelity cellular processes. Performing a comprehensive survey of the sequenced genomes for DNA-binding proteins (DBPs) will help in understanding their distribution and the associated functions in a particular genome. Availability of fully sequenced genome of Arabidopsis thaliana enables the review of distribution of DBPs in this model plant genome. We used profiles of both structure and sequence-based DNA-binding families, derived from PDB and PFam databases, to perform the survey. This resulted in 4471 proteins, identified as DNA-binding in Arabidopsis genome, which are distributed across 300 different PFam families. Apart from several plant-specific DNA-binding families, certain RING fingers and leucine zippers also had high representation. Our search protocol helped to assign DNA-binding property to several proteins that were previously marked as unknown, putative or hypothetical in function. The distribution of Arabidopsis genes having a role in plant DNA repair were particularly studied and noted for their functional mapping. The functions observed to be overrepresented in the plant genome harbour DNA-3-methyladenine glycosylase activity, alkylbase DNA N-glycosylase activity and DNA-(apurinic or apyrimidinic site) lyase activity, suggesting their role in specialized functions such as gene regulation and DNA repair.
Phosphorylation-regulated Binding of RNA Polymerase II to Fibrous Polymers of Low Complexity Domains
Xiang, Siheng; Wu, Leeju; Theodoropoulos, Pano; Mirzaei, Hamid; Han, Tina; Xie, Shanhai; Corden, Jeffry L.; McKnight, Steven L.
2014-01-01
SUMMARY The low complexity (LC) domains of the products of the fused in sarcoma (FUS), Ewings sarcoma (EWS) and TAF15 genes are translocated onto a variety of different DNA-binding domains and thereby assist in driving the formation of cancerous cells. In the context of the translocated fusion proteins, these LC sequences function as transcriptional activation domains. Here we show that polymeric fibers formed from these LC domains directly bind the C-terminal domain (CTD) of RNA polymerase II in a manner reversible by phosphorylation of the iterated, heptad repeats of the CTD. Mutational analysis indicates that the degree of binding between the CTD and the LC domain polymers correlates with the strength of transcriptional activation. These studies offer a simple means of conceptualizing how RNA polymerase II is recruited to active genes in its unphosphorylated state, and released for elongation following phosphorylation of the CTD. PMID:24267890
Alam, Tanfis I; Rao, Venigalla B
2008-03-07
Translocation of double-stranded DNA into a preformed capsid by tailed bacteriophages is driven by powerful motors assembled at the special portal vertex. The motor is thought to drive processive cycles of DNA binding, movement, and release to package the viral genome. In phage T4, there is evidence that the large terminase protein, gene product 17 (gp17), assembles into a multisubunit motor and translocates DNA by an inchworm mechanism. gp17 consists of two domains; an N-terminal ATPase domain (amino acids 1-360) that powers translocation of DNA, and a C-terminal nuclease domain (amino acids 361-610) that cuts concatemeric DNA to generate a headful-size viral genome. While the functional motifs of ATPase and nuclease have been well defined and the ATPase atomic structure has been solved, the DNA binding motif(s) responsible for viral DNA recognition, cutting, and translocation are unknown. Here we report the first evidence for the presence of a double-stranded DNA binding activity in the gp17 ATPase domain. Binding to DNA is sensitive to Mg(2+) and salt, but not the type of DNA used. DNA fragments as short as 20 bp can bind to the ATPase but preferential binding was observed to DNA greater than 1 kb. A high molecular weight ATPase-DNA complex was isolated by gel filtration, suggesting oligomerization of ATPase following DNA interaction. DNA binding was not observed with the full-length gp17, or the C-terminal nuclease domain. The small terminase protein, gp16, inhibited DNA binding, which was further accentuated by ATP. The presence of a DNA binding site in the ATPase domain and its binding properties implicate a role in the DNA packaging mechanism.
Novel DNA Motif Binding Activity Observed In Vivo With an Estrogen Receptor α Mutant Mouse
Li, Leping; Grimm, Sara A.; Winuthayanon, Wipawee; Hamilton, Katherine J.; Pockette, Brianna; Rubel, Cory A.; Pedersen, Lars C.; Fargo, David; Lanz, Rainer B.; DeMayo, Francesco J.; Schütz, Günther; Korach, Kenneth S.
2014-01-01
Estrogen receptor α (ERα) interacts with DNA directly or indirectly via other transcription factors, referred to as “tethering.” Evidence for tethering is based on in vitro studies and a widely used “KIKO” mouse model containing mutations that prevent direct estrogen response element DNA- binding. KIKO mice are infertile, due in part to the inability of estradiol (E2) to induce uterine epithelial proliferation. To elucidate the molecular events that prevent KIKO uterine growth, regulation of the pro-proliferative E2 target gene Klf4 and of Klf15, a progesterone (P4) target gene that opposes the pro-proliferative activity of KLF4, was evaluated. Klf4 induction was impaired in KIKO uteri; however, Klf15 was induced by E2 rather than by P4. Whole uterine chromatin immunoprecipitation-sequencing revealed enrichment of KIKO ERα binding to hormone response elements (HREs) motifs. KIKO binding to HRE motifs was verified using reporter gene and DNA-binding assays. Because the KIKO ERα has HRE DNA-binding activity, we evaluated the “EAAE” ERα, which has more severe DNA-binding domain mutations, and demonstrated a lack of estrogen response element or HRE reporter gene induction or DNA-binding. The EAAE mouse has an ERα null–like phenotype, with impaired uterine growth and transcriptional activity. Our findings demonstrate that the KIKO mouse model, which has been used by numerous investigators, cannot be used to establish biological functions for ERα tethering, because KIKO ERα effectively stimulates transcription using HRE motifs. The EAAE-ERα DNA-binding domain mutant mouse demonstrates that ERα DNA-binding is crucial for biological and transcriptional processes in reproductive tissues and that ERα tethering may not contribute to estrogen responsiveness in vivo. PMID:24713037
Freemont, P S; Ollis, D L; Steitz, T A; Joyce, C M
1986-09-01
The Klenow fragment of DNA polymerase I from Escherichia coli has two enzymatic activities: DNA polymerase and 3'-5' exonuclease. The crystal structure showed that the fragment is folded into two distinct domains. The smaller domain has a binding site for deoxynucleoside monophosphate and a divalent metal ion that is thought to identify the 3'-5' exonuclease active site. The larger C-terminal domain contains a deep cleft that is believed to bind duplex DNA. Several lines of evidence suggested that the large domain also contains the polymerase active site. To test this hypothesis, we have cloned the DNA coding for the large domain into an expression system and purified the protein product. We find that the C-terminal domain has polymerase activity (albeit at a lower specific activity than the native Klenow fragment) but no measurable 3'-5' exonuclease activity. These data are consistent with the hypothesis that each of the three enzymatic activities of DNA polymerase I from E. coli resides on a separate protein structural domain.
Hüntelmann, Bettina; Staab, Julia; Herrmann-Lingen, Christoph; Meyer, Thomas
2014-01-01
Binding to specific palindromic sequences termed gamma-activated sites (GAS) is a hallmark of gene activation by members of the STAT (signal transducer and activator of transcription) family of cytokine-inducible transcription factors. However, the precise molecular mechanisms involved in the signal-dependent finding of target genes by STAT dimers have not yet been very well studied. In this study, we have characterized a sequence motif in the STAT1 linker domain which is highly conserved among the seven human STAT proteins and includes surface-exposed residues in close proximity to the bound DNA. Using site-directed mutagenesis, we have demonstrated that a lysine residue in position 567 of the full-length molecule is required for GAS recognition. The substitution of alanine for this residue completely abolished both binding to high-affinity GAS elements and transcriptional activation of endogenous target genes in cells stimulated with interferon-γ (IFNγ), while the time course of transient nuclear accumulation and tyrosine phosphorylation were virtually unchanged. In contrast, two glutamic acid residues (E559 and E563) on each monomer are important for the dissociation of dimeric STAT1 from DNA and, when mutated to alanine, result in elevated levels of tyrosine-phosphorylated STAT1 as well as prolonged IFNγ-stimulated nuclear accumulation. In conclusion, our data indicate that the kinetics of signal-dependent GAS binding is determined by an array of glutamic acid residues located at the interior surface of the STAT1 dimer. These negatively charged residues appear to align the long axis of the STAT1 dimer in a position perpendicular to the DNA, thereby facilitating the interaction between lysine 567 and the phosphodiester backbone of a bound GAS element, which is a prerequisite for transient gene induction.
Roy Choudhury, Swarup; Roy, Sujit; Nag, Anish; Singh, Sanjay Kumar; Sengupta, Dibyendu N.
2012-01-01
The MADS-box family of genes has been shown to play a significant role in the development of reproductive organs, including dry and fleshy fruits. In this study, the molecular properties of an AGAMOUS like MADS box transcription factor in banana cultivar Giant governor (Musa sp, AAA group, subgroup Cavendish) have been elucidated. We have detected a CArG-box sequence binding AGAMOUS MADS-box protein in banana flower and fruit nuclear extracts in DNA-protein interaction assays. The protein fraction in the DNA-protein complex was analyzed by mass spectrometry and using this information we have obtained the full length cDNA of the corresponding protein. The deduced protein sequence showed ∼95% amino acid sequence homology with MA-MADS5, a MADS-box protein described previously from banana. We have characterized the domains of the identified AGAMOUS MADS-box protein involved in DNA binding and homodimer formation in vitro using full-length and truncated versions of affinity purified recombinant proteins. Furthermore, in order to gain insight about how DNA bending is achieved by this MADS-box factor, we performed circular permutation and phasing analysis using the wild type recombinant protein. The AGAMOUS MADS-box protein identified in this study has been found to predominantly accumulate in the climacteric fruit pulp and also in female flower ovary. In vivo and in vitro assays have revealed specific binding of the identified AGAMOUS MADS-box protein to CArG-box sequence in the promoters of major ripening genes in banana fruit. Overall, the expression patterns of this MADS-box protein in banana female flower ovary and during various phases of fruit ripening, along with the interaction of the protein with the CArG-box sequence in the promoters of major ripening genes, lead to an interesting assumption about the possible involvement of this AGAMOUS MADS-box factor in banana fruit ripening and floral reproductive organ development. PMID:22984496
Understanding the mechanisms of protein-DNA interactions
NASA Astrophysics Data System (ADS)
Lavery, Richard
2004-03-01
Structural, biochemical and thermodynamic data on protein-DNA interactions show that specific recognition cannot be reduced to a simple set of binary interactions between the partners (such as hydrogen bonds, ion pairs or steric contacts). The mechanical properties of the partners also play a role and, in the case of DNA, variations in both conformation and flexibility as a function of base sequence can be a significant factor in guiding a protein to the correct binding site. All-atom molecular modeling offers a means of analyzing the role of different binding mechanisms within protein-DNA complexes of known structure. This however requires estimating the binding strengths for the full range of sequences with which a given protein can interact. Since this number grows exponentially with the length of the binding site it is necessary to find a method to accelerate the calculations. We have achieved this by using a multi-copy approach (ADAPT) which allows us to build a DNA fragment with a variable base sequence. The results obtained with this method correlate well with experimental consensus binding sequences. They enable us to show that indirect recognition mechanisms involving the sequence dependent properties of DNA play a significant role in many complexes. This approach also offers a means of predicting protein binding sites on the basis of binding energies, which is complementary to conventional lexical techniques.
Position specific variation in the rate of evolution in transcription factor binding sites
Moses, Alan M; Chiang, Derek Y; Kellis, Manolis; Lander, Eric S; Eisen, Michael B
2003-01-01
Background: The binding sites of sequence specific transcription factors are an important and relatively well-understood class of functional non-coding DNAs. Although a wide variety of experimental and computational methods have been developed to characterize transcription factor binding sites, they remain difficult to identify. Comparison of non-coding DNA from related species has shown considerable promise in identifying these functional non-coding sequences, even though relatively little is known about their evolution. Results: Here we analyse the genome sequences of the budding yeasts Saccharomyces cerevisiae, S. bayanus, S. paradoxus and S. mikatae to study the evolution of transcription factor binding sites. As expected, we find that both experimentally characterized and computationally predicted binding sites evolve slower than surrounding sequence, consistent with the hypothesis that they are under purifying selection. We also observe position-specific variation in the rate of evolution within binding sites. We find that the position-specific rate of evolution is positively correlated with degeneracy among binding sites within S. cerevisiae. We test theoretical predictions for the rate of evolution at positions where the base frequencies deviate from background due to purifying selection and find reasonable agreement with the observed rates of evolution. Finally, we show how the evolutionary characteristics of real binding motifs can be used to distinguish them from artefacts of computational motif finding algorithms. Conclusion: As has been observed for protein sequences, the rate of evolution in transcription factor binding sites varies with position, suggesting that some regions are under stronger functional constraint than others. This variation likely reflects the varying importance of different positions in the formation of the protein-DNA complex. 
The characterization of the pattern of evolution in known binding sites will likely contribute to the effective use of comparative sequence data in the identification of transcription factor binding sites and is an important step toward understanding the evolution of functional non-coding DNA. PMID:12946282
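The notion of position-specific degeneracy in the abstract above can be illustrated with a small calculation. The following is only a minimal sketch and not the authors' method: it assumes a hypothetical set of aligned binding sites (invented here for illustration) and uses per-column Shannon entropy as one common measure of degeneracy, where 0 bits means a fully conserved position.

```python
import math
from collections import Counter


def column_entropies(sites):
    """Return the Shannon entropy (in bits) of each column of aligned sites.

    `sites` is a list of equal-length strings; higher entropy means a more
    degenerate (less conserved) position.
    """
    if not sites:
        raise ValueError("need at least one site")
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")

    entropies = []
    for i in range(length):
        counts = Counter(s[i] for s in sites)
        total = sum(counts.values())
        h = -sum((c / total) * math.log2(c / total) for c in counts.values())
        entropies.append(h)
    return entropies


# Hypothetical aligned sites, invented for illustration only.
example_sites = ["TGACTC", "TGACTA", "TGAGTC", "TGACTT"]
entropies = column_entropies(example_sites)
```

Under the correlation reported in the abstract, columns with higher entropy would be expected to evolve faster across species than fully conserved columns; the sketch only computes the degeneracy side of that comparison.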
Stewart, Mikaela; Dunlap, Tori; Dourlain, Elizabeth; Grant, Bryce; McFail-Isom, Lori
2013-01-01
The fine conformational subtleties of DNA structure modulate many fundamental cellular processes including gene activation/repression, cellular division, and DNA repair. Most of these cellular processes rely on the conformational heterogeneity of specific DNA sequences. Factors including those structural characteristics inherent in the particular base sequence as well as those induced through interaction with solvent components combine to produce fine DNA structural variation including helical flexibility and conformation. Cation-pi interactions between solvent cations or their first hydration shell waters and the faces of DNA bases form sequence selectively and contribute to DNA structural heterogeneity. In this paper, we detect and characterize the binding patterns found in cation-pi interactions between solvent cations and DNA bases in a set of high resolution x-ray crystal structures. Specifically, we found that monovalent cations (Tl⁺) and the polarized first hydration shell waters of divalent cations (Mg²⁺, Ca²⁺) form cation-pi interactions with DNA bases stabilizing unstacked conformations. When these cation-pi interactions are combined with electrostatic interactions a pattern of specific binding motifs is formed within the grooves. PMID:23940752
Robinson, Lois; Panayiotakis, Alexandra; Papas, Takis S.; Kola, Ismail; Seth, Arun
1997-01-01
ETS transcription factors play important roles in hematopoiesis, angiogenesis, and organogenesis during murine development. The ETS genes also have a role in neoplasia, for example in Ewing’s sarcomas and retrovirally induced cancers. The ETS genes encode transcription factors that bind to specific DNA sequences and activate transcription of various cellular and viral genes. To isolate novel ETS target genes, we used two approaches. In the first approach, we isolated genes by the RNA differential display technique. Previously, we have shown that the overexpression of ETS1 and ETS2 genes effects transformation of NIH 3T3 cells and specific transformants produce high levels of the ETS proteins. To isolate ETS1 and ETS2 responsive genes in these transformed cells, we prepared RNA from ETS1, ETS2 transformants, and normal NIH 3T3 cell lines and converted it into cDNA. This cDNA was amplified by PCR and displayed on sequencing gels. The differentially displayed bands were subcloned into plasmid vectors. By Northern blot analysis, several clones showed differential patterns of mRNA expression in the NIH 3T3-, ETS1-, and ETS2-expressing cell lines. Sixteen clones were analyzed by DNA sequence analysis, and 13 of them appeared to be unique because their DNA sequences did not match with any of the known genes present in the gene bank. Three known genes were found to be identical to the CArG box binding factor, phospholipase A2-activating protein, and early growth response 1 (Egr1) genes. In the second approach, to isolate ETS target promoters directly, we performed ETS1 binding with MboI-cleaved genomic DNA in the presence of a specific mAb followed by whole genome PCR. The immune complex-bound ETS binding sites containing DNA fragments were amplified and subcloned into pBluescript and subjected to DNA sequence and computer analysis. We found that, of a large number of clones isolated, 43 represented unique sequences not previously identified. 
Three clones turned out to contain regulatory sequences derived from human serglycin, preproapolipoprotein C II, and Egr1 genes. The ETS binding sites derived from these three regulatory sequences showed specific binding with recombinant ETS proteins. Of interest, Egr1 was identified by both of these techniques, suggesting strongly that it is indeed an ETS target gene. PMID:9207063
A DEK Domain-Containing Protein Modulates Chromatin Structure and Function in Arabidopsis
Waidmann, Sascha; Kusenda, Branislav; Mayerhofer, Juliane; Mechtler, Karl; Jonak, Claudia
2014-01-01
Chromatin is a major determinant in the regulation of virtually all DNA-dependent processes. Chromatin architectural proteins interact with nucleosomes to modulate chromatin accessibility and higher-order chromatin structure. The evolutionarily conserved DEK domain-containing protein is implicated in important chromatin-related processes in animals, but little is known about its DNA targets and protein interaction partners. In plants, the role of DEK has remained elusive. In this work, we identified DEK3 as a chromatin-associated protein in Arabidopsis thaliana. DEK3 specifically binds histones H3 and H4. Purification of other proteins associated with nuclear DEK3 also established DNA topoisomerase 1α and proteins of the cohesion complex as in vivo interaction partners. Genome-wide mapping of DEK3 binding sites by chromatin immunoprecipitation followed by deep sequencing revealed enrichment of DEK3 at protein-coding genes throughout the genome. Using DEK3 knockout and overexpressor lines, we show that DEK3 affects nucleosome occupancy and chromatin accessibility and modulates the expression of DEK3 target genes. Furthermore, functional levels of DEK3 are crucial for stress tolerance. Overall, data indicate that DEK3 contributes to modulation of Arabidopsis chromatin structure and function. PMID:25387881
Warfield, Linda; Tuttle, Lisa M; Pacheco, Derek; Klevit, Rachel E; Hahn, Steven
2014-08-26
Although many transcription activators contact the same set of coactivator complexes, the mechanism and specificity of these interactions have been unclear. For example, do intrinsically disordered transcription activation domains (ADs) use sequence-specific motifs, or do ADs of seemingly different sequence have common properties that encode activation function? We find that the central activation domain (cAD) of the yeast activator Gcn4 functions through a short, conserved sequence-specific motif. Optimizing the residues surrounding this short motif by inserting additional hydrophobic residues creates very powerful ADs that bind the Mediator subunit Gal11/Med15 with high affinity via a "fuzzy" protein interface. In contrast to Gcn4, the activity of these synthetic ADs is not strongly dependent on any one residue of the AD, and this redundancy is similar to that of some natural ADs in which few if any sequence-specific residues have been identified. The additional hydrophobic residues in the synthetic ADs likely allow multiple faces of the AD helix to interact with the Gal11 activator-binding domain, effectively forming a fuzzier interface than that of the wild-type cAD.
Sheridan, P L; Schorpp, M; Voz, M L; Jones, K A
1995-03-03
We have isolated a human cDNA clone encoding HIP116, a protein that binds to the SPH repeats of the SV40 enhancer and to the TATA/inhibitor region of the human immunodeficiency virus (HIV)-1 promoter. The predicted HIP116 protein is related to the yeast SNF2/SWI2 transcription factor and to other members of this extended family and contains seven domains similar to those found in the vaccinia NTP1 ATPase. Interestingly, HIP116 also contains a C3HC4 zinc-binding motif (RING finger) interspersed between the ATPase motifs in an arrangement similar to that found in the yeast RAD5 and RAD16 proteins. The HIP116 amino terminus is unique among the members of this family, and houses a specific DNA-binding domain. Antiserum raised against HIP116 recognizes a 116-kDa nuclear protein in Western blots and specifically supershifts SV40 and HIV-1 protein-DNA complexes in gel shift experiments. The binding site for HIP116 on the SV40 enhancer directly overlaps the site for TEF-1, and like TEF-1, binding of HIP116 to the SV40 enhancer is destroyed by mutations that inhibit SPH enhancer activity in vivo. Purified fractions of HIP116 display strong ATPase activity that is preferentially stimulated by SPH DNA and can be inhibited specifically by antibodies to HIP116. These findings suggest that HIP116 might affect transcription, directly or indirectly, by acting as a DNA binding site-specific ATPase.
Morea, Edna G O; Viviescas, Maria Alejandra; Fernandes, Carlos A H; Matioli, Fabio F; Lira, Cristina B B; Fernandez, Maribel F; Moraes, Barbara S; da Silva, Marcelo S; Storti, Camila B; Fontes, Marcos R M; Cano, Maria Isabel N
2017-11-01
Leishmania spp. telomeres are composed of 5'-TTAGGG-3' repeats associated with proteins. We have previously identified LaRbp38 and LaRPA-1 as proteins that bind the G-rich telomeric strand. At that time, we had also partially characterized a protein:DNA complex, named LaGT1, but we could not identify its protein component. Using protein-DNA interaction and competition assays, we confirmed that LaGT1 is highly specific to the G-rich telomeric single-stranded DNA. Three protein bands with LaGT1 activity were isolated from affinity-purified protein extracts, in-gel digested, and sequenced de novo using mass spectrometry analysis. In silico analysis of the digested peptides identified them as a putative calmodulin with sequences identical to the T. cruzi calmodulin. In the Leishmania genome, the calmodulin ortholog is present in three identical copies. We cloned and sequenced one of the gene copies, named it LCalA, and obtained the recombinant protein. Multiple sequence alignment and molecular modeling showed that LCalA shares homology with most eukaryotic calmodulins. In addition, we demonstrated that LCalA is nuclear, partially co-localizes with telomeres and binds the G-rich telomeric strand in vivo. Recombinant LCalA can bind specifically and with relative affinity to the G-rich telomeric single-strand and to a 3'G-overhang, and DNA binding is calcium dependent. We have described a novel candidate component of Leishmania telomeres, LCalA, a nuclear calmodulin that binds the G-rich telomeric strand with high specificity and relative affinity, in a calcium-dependent manner. LCalA is the first reported calmodulin that binds telomeric DNA in vivo.
NASA Astrophysics Data System (ADS)
Zhang, Xirui; Daaboul, George G.; Spuhler, Philipp S.; Dröge, Peter; Ünlü, M. Selim
2016-03-01
DNA-binding proteins play crucial roles in the maintenance and functions of the genome and yet, their specific binding mechanisms are not fully understood. Recently, it was discovered that DNA-binding proteins recognize specific binding sites to carry out their functions through an indirect readout mechanism by recognizing and capturing DNA conformational flexibility and deformation. High-throughput DNA microarray-based methods that provide large-scale protein-DNA binding information have shown effective and comprehensive analysis of protein-DNA binding affinities, but do not provide information of DNA conformational changes in specific protein-DNA complexes. Building on the high-throughput capability of DNA microarrays, we demonstrate a quantitative approach that simultaneously measures the amount of protein binding to DNA and nanometer-scale DNA conformational change induced by protein binding in a microarray format. Both measurements rely on spectral interferometry on a layered substrate using a single optical instrument in two distinct modalities. In the first modality, we quantitate the amount of binding of protein to surface-immobilized DNA in each DNA spot using a label-free spectral reflectivity technique that accurately measures the surface densities of protein and DNA accumulated on the substrate. In the second modality, for each DNA spot, we simultaneously measure DNA conformational change using a fluorescence vertical sectioning technique that determines average axial height of fluorophores tagged to specific nucleotides of the surface-immobilized DNA. 
The approach presented in this paper, when combined with current high-throughput DNA microarray-based technologies, has the potential to serve as a rapid and simple method for quantitative and large-scale characterization of conformational specific protein-DNA interactions. Electronic supplementary information (ESI) available: DNA sequences and nomenclature (Table 1S); SDS-PAGE assay of IHF stock solution (Fig. 1S); determination of the concentration of IHF stock solution by Bradford assay (Fig. 2S); equilibrium binding isotherm fitting results of other DNA sequences (Table 2S); calculation of dissociation constants (Fig. 3S, 4S; Table 2S); geometric model for quantitation of DNA bending angle induced by specific IHF binding (Fig. 4S); customized flow cell assembly (Fig. 5S); real-time measurement of average fluorophore height change by SSFM (Fig. 6S); summary of binding parameters obtained from additive isotherm model fitting (Table 3S); average surface densities of 10 dsDNA spots and bound IHF at equilibrium (Table 4S); effects of surface densities on the binding and bending of dsDNA (Tables 5S, 6S and Fig. 7S-10S). See DOI: 10.1039/c5nr06785e
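The record above refers to equilibrium binding isotherms and dissociation constants. As a rough illustration only, the sketch below evaluates a simple single-site (Langmuir) isotherm. This is an assumption made for the example: the paper itself reports fitting an additive isotherm model, which is not reproduced here, and the function name and numbers are hypothetical.

```python
def fraction_bound(protein_conc, kd):
    """Fraction of DNA sites occupied under a single-site Langmuir model.

    protein_conc: free protein concentration (same units as kd).
    kd: equilibrium dissociation constant; must be positive.
    """
    if protein_conc < 0:
        raise ValueError("protein concentration must be non-negative")
    if kd <= 0:
        raise ValueError("kd must be positive")
    return protein_conc / (kd + protein_conc)


# Hypothetical values chosen for illustration, not taken from the paper.
half_point = fraction_bound(10.0, 10.0)   # concentration equal to Kd
no_protein = fraction_bound(0.0, 10.0)    # no protein present
```

In this simplified model the dissociation constant is the protein concentration at which half of the sites are occupied, which is why `half_point` evaluates to one half.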
NASA Astrophysics Data System (ADS)
Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.
1984-08-01
A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β-lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tewary, Sunil K.; Liang, Lingfei; Lin, Zihan
Members of the Parvoviridae family all encode a non-structural protein 1 (NS1) that directs replication of single-stranded viral DNA, packages viral DNA into capsid, and serves as a potent transcriptional activator. Here we report the X-ray structure of the minute virus of mice (MVM) NS1 N-terminal domain at 1.45 Å resolution, showing that sites for dsDNA binding, ssDNA binding and cleavage, nuclear localization, and other functions are integrated on a canonical fold of the histidine-hydrophobic-histidine superfamily of nucleases, including elements specific for this Protoparvovirus but distinct from its Bocaparvovirus or Dependoparvovirus orthologs. High resolution structural analysis reveals a nickase active site with an architecture that allows highly versatile metal ligand binding. The structures support a unified mechanism of replication origin recognition for homotelomeric and heterotelomeric parvoviruses, mediated by a basic-residue-rich hairpin and an adjacent helix in the initiator proteins and by tandem tetranucleotide motifs in the replication origins. - Highlights: • The structure of a parvovirus replication initiator protein has been determined; • The structure sheds light on mechanisms of ssDNA binding and cleavage; • The nickase active site is preconfigured for versatile metal ligand binding; • The binding site for the double-stranded replication origin DNA is identified; • A single domain integrates multiple functions in virus replication.
Structure and Biochemical Activities of Escherichia coli MgsA
DOE Office of Scientific and Technical Information (OSTI.GOV)
Page, Asher N.; George, Nicholas P.; Marceau, Aimee H.
2012-02-27
Bacterial 'maintenance of genome stability protein A' (MgsA) and related eukaryotic enzymes play important roles in cellular responses to stalled DNA replication processes. Sequence information identifies MgsA enzymes as members of the clamp loader clade of AAA+ proteins, but structural information defining the family has been limited. Here, the x-ray crystal structure of Escherichia coli MgsA is described, revealing a homotetrameric arrangement for the protein that distinguishes it from other clamp loader clade AAA+ proteins. Each MgsA protomer is composed of three elements as follows: ATP-binding and helical lid domains (conserved among AAA+ proteins) and a tetramerization domain. Although the tetramerization domains bury the greatest amount of surface area in the MgsA oligomer, each of the domains participates in oligomerization to form a highly intertwined quaternary structure. Phosphate is bound at each AAA+ ATP-binding site, but the active sites do not appear to be in a catalytically competent conformation due to displacement of Arg finger residues. E. coli MgsA is also shown to form a complex with the single-stranded DNA-binding protein through co-purification and biochemical studies. MgsA DNA-dependent ATPase activity is inhibited by single-stranded DNA-binding protein. Together, these structural and biochemical observations provide insights into the mechanisms of MgsA family AAA+ proteins.
Structure and Biochemical Activities of Escherichia coli MgsA
Page, Asher N.; George, Nicholas P.; Marceau, Aimee H.; Cox, Michael M.; Keck, James L.
2011-01-01
Bacterial “maintenance of genome stability protein A” (MgsA) and related eukaryotic enzymes play important roles in cellular responses to stalled DNA replication processes. Sequence information identifies MgsA enzymes as members of the clamp loader clade of AAA+ proteins, but structural information defining the family has been limited. Here, the x-ray crystal structure of Escherichia coli MgsA is described, revealing a homotetrameric arrangement for the protein that distinguishes it from other clamp loader clade AAA+ proteins. Each MgsA protomer is composed of three elements as follows: ATP-binding and helical lid domains (conserved among AAA+ proteins) and a tetramerization domain. Although the tetramerization domains bury the greatest amount of surface area in the MgsA oligomer, each of the domains participates in oligomerization to form a highly intertwined quaternary structure. Phosphate is bound at each AAA+ ATP-binding site, but the active sites do not appear to be in a catalytically competent conformation due to displacement of Arg finger residues. E. coli MgsA is also shown to form a complex with the single-stranded DNA-binding protein through co-purification and biochemical studies. MgsA DNA-dependent ATPase activity is inhibited by single-stranded DNA-binding protein. Together, these structural and biochemical observations provide insights into the mechanisms of MgsA family AAA+ proteins. PMID:21297161
Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng
2014-04-01
Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.
Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.
Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G
2000-12-15
The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.
Structure-affinity relationships for the binding of actinomycin D to DNA
NASA Astrophysics Data System (ADS)
Gallego, José; Ortiz, Angel R.; de Pascual-Teresa, Beatriz; Gago, Federico
1997-03-01
Molecular models of the complexes between actinomycin D and 14 different DNA hexamers were built based on the X-ray crystal structure of the actinomycin-d(GAAGCTTC)2 complex. The DNA sequences included the canonical GpC binding step flanked by different base pairs, nonclassical binding sites such as GpG and GpT, and sites containing 2,6-diaminopurine. A good correlation was found between the intermolecular interaction energies calculated for the refined complexes and the relative preferences of actinomycin binding to standard and modified DNA. A detailed energy decomposition into van der Waals and electrostatic components for the interactions between the DNA base pairs and either the chromophore or the peptidic part of the antibiotic was performed for each complex. The resulting energy matrix was then subjected to principal component analysis, which showed that actinomycin D discriminates among different DNA sequences by an interplay of hydrogen bonding and stacking interactions. The structure-affinity relationships for this important antitumor drug are thus rationalized and may be used to advantage in the design of novel sequence-specific DNA-binding agents.
Binding of sulphonated indigo derivatives to RepA-WH1 inhibits DNA-induced protein amyloidogenesis
Gasset-Rosa, Fátima; Maté, María Jesús; Dávila-Fajardo, Cristina; Bravo, Jerónimo; Giraldo, Rafael
2008-01-01
The quest for inducers and inhibitors of protein amyloidogenesis is of utmost interest, since they are key tools to understand the molecular bases of proteinopathies such as Alzheimer, Parkinson, Huntington and Creutzfeldt–Jakob diseases. It is also expected that such molecules could lead to valid therapeutic agents. In common with the mammalian prion protein (PrP), the N-terminal Winged-Helix (WH1) domain of the pPS10 plasmid replication protein (RepA) assembles in vitro into a variety of amyloid nanostructures upon binding to different specific dsDNA sequences. Here we show that di- (S2) and tetra-sulphonated (S4) derivatives of indigo stain dock at the DNA recognition interface in the RepA-WH1 dimer. They compete binding of RepA to its natural target dsDNA repeats, found at the repA operator and at the origin of replication of the plasmid. Calorimetry points to the existence of a major site, with micromolar affinity, for S4-indigo in RepA-WH1 dimers. As revealed by electron microscopy, in the presence of inducer dsDNA, both S2/S4 stains inhibit the assembly of RepA-WH1 into fibres. These results validate the concept that DNA can promote protein assembly into amyloids and reveal that the binding sites of effector molecules can be targeted to inhibit amyloidogenesis. PMID:18285361
Huh, T L; Ryu, J H; Huh, J W; Sung, H C; Oh, I U; Song, B J; Veech, R L
1993-01-01
Mitochondrial NADP(+)-specific isocitrate dehydrogenase (IDP) was co-purified with the pyruvate dehydrogenase complex from bovine kidney mitochondria. The determination of its N-terminal 16-amino-acid sequence revealed that it is highly similar to the IDP from yeast. A cDNA clone (1.8 kb long) encoding this protein was isolated from a bovine kidney lambda gt11 cDNA library using a synthetic oligodeoxynucleotide. The deduced protein sequence of this cDNA clone rendered a precursor protein of 452 amino-acid residues (50,830 Da) and a mature protein of 413 amino-acid residues (46,519 Da). It is 100% identical to the internal tryptic peptide sequences of the autologous form from pig heart and 62% similar to that from yeast. However, it shares little similarity with the mitochondrial NAD(+)-specific isoenzyme from yeast. Structural analyses of the deduced proteins of IDP isoenzymes from different species indicated that similarity exists in certain regions, which may represent the common domains for the active sites or coenzyme-binding sites. In Northern-blot analysis, one species of mRNA (about 2.2 kb for both bovine and human) was hybridized with a 32P-labelled cDNA probe. Southern-blot analysis of genomic DNAs verified simple patterns of hybridization with this cDNA. These results strongly indicate that the mitochondrial IDP may be derived from a single gene family which does not appear to be closely related to that of the NAD(+)-specific isoenzyme. PMID:8318002
Freeman, R M; Plutzky, J; Neel, B G
1992-01-01
src homology 2 (SH2) domains direct binding to specific phosphotyrosyl proteins. Recently, SH2-containing protein-tyrosine-phosphatases (PTPs) were identified. Using degenerate oligonucleotides and the PCR, we have cloned a cDNA for an additional PTP, SH-PTP2, which contains two SH2 domains and is expressed ubiquitously. When expressed in Escherichia coli, SH-PTP2 displays tyrosine-specific phosphatase activity. Strong sequence similarity between SH-PTP2 and the Drosophila gene corkscrew (csw) and their similar patterns of expression suggest that SH-PTP2 is the human corkscrew homolog. Sequence comparisons between SH-PTP2, SH-PTP1, corkscrew, and other SH2-containing proteins suggest the existence of a subfamily of SH2 domains found specifically in PTPs, whereas comparison of the PTP domains of the SH2-containing PTPs with other tyrosine phosphatases suggests the existence of a subfamily of PTPs containing SH2 domains. Since corkscrew, a member of the terminal class signal transduction pathway, acts in concert with D-raf to positively transduce the signal generated by the receptor tyrosine kinase torso, these findings suggest several mechanisms by which SH-PTP2 may participate in mammalian signal transduction. PMID:1280823
Quantifying domain-ligand affinities and specificities by high-throughput holdup assay
Vincentelli, Renaud; Luck, Katja; Poirson, Juline; Polanowska, Jolanta; Abdat, Julie; Blémont, Marilyne; Turchetto, Jeremy; Iv, François; Ricquier, Kevin; Straub, Marie-Laure; Forster, Anne; Cassonnet, Patricia; Borg, Jean-Paul; Jacob, Yves; Masson, Murielle; Nominé, Yves; Reboul, Jérôme; Wolff, Nicolas; Charbonnier, Sebastian; Travé, Gilles
2015-01-01
Many protein interactions are mediated by small linear motifs interacting specifically with defined families of globular domains. Quantifying the specificity of a motif requires measuring and comparing its binding affinities to all its putative target domains. To this aim, we developed the high-throughput holdup assay, a chromatographic approach that can measure up to a thousand domain-motif equilibrium binding affinities per day. Extracts of overexpressed domains are incubated with peptide-coated resins and subjected to filtration. Binding affinities are deduced from microfluidic capillary electrophoresis of flow-throughs. After benchmarking the approach on 210 PDZ-peptide pairs with known affinities, we determined the affinities of two viral PDZ-binding motifs derived from Human Papillomavirus E6 oncoproteins for 209 PDZ domains covering 79% of the human PDZome. We obtained exquisite sequence-dependent binding profiles, describing quantitatively the PDZome recognition specificity of each motif. This approach, applicable to many categories of domain-ligand interactions, has a wide potential for quantifying the specificities of interactomes. PMID:26053890
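The abstract above does not spell out how affinities are deduced from the flow-through measurements. As a hedged illustration only, the following sketch assumes simple 1:1 equilibrium binding and derives a dissociation constant from the fraction of domain depleted by the peptide-coated resin. The function name, parameter names, and the example numbers are hypothetical and are not taken from the paper.

```python
def holdup_kd(i_control, i_ligand, ligand_conc, domain_conc=0.0):
    """Estimate Kd from a depletion measurement, assuming 1:1 binding.

    i_control   -- domain signal in the flow-through of a control resin
    i_ligand    -- domain signal in the flow-through of the peptide resin
    ligand_conc -- total peptide concentration on the resin
    domain_conc -- total domain concentration (same units as ligand_conc);
                   leave at 0 when the peptide is in large excess
    """
    if i_control <= 0:
        raise ValueError("control signal must be positive")
    # Fraction of the domain retained on the resin (bound fraction).
    bound_fraction = (i_control - i_ligand) / i_control
    if not 0 < bound_fraction < 1:
        raise ValueError("bound fraction must lie strictly between 0 and 1")
    # Mass action: Kd = [D]free * [L]free / [DL]
    free_ligand = ligand_conc - bound_fraction * domain_conc
    kd = (1 - bound_fraction) * free_ligand / bound_fraction
    return bound_fraction, kd


# Hypothetical numbers: half of the domain is depleted at 10 uM peptide.
fraction, kd = holdup_kd(i_control=100.0, i_ligand=50.0, ligand_conc=10.0)
print(f"bound fraction = {fraction:.2f}, Kd = {kd:.1f} uM")
```

With half of the domain retained and the peptide in excess, the estimated Kd equals the peptide concentration, which is the usual half-saturation reading of a 1:1 binding curve.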
Shilling, F M; Krätzschmar, J; Cai, H; Weskamp, G; Gayko, U; Leibow, J; Myles, D G; Nuccitelli, R; Blobel, C P
1997-06-15
Proteins containing a membrane-anchored metalloprotease domain, a disintegrin domain, and a cysteine-rich region (MDC proteins) are thought to play an important role in mammalian fertilization, as well as in somatic cell-cell interactions. We have identified PCR sequence tags encoding the disintegrin domain of five distinct MDC proteins from Xenopus laevis testis cDNA. Four of these sequence tags (xMDC9, xMDC11.1, xMDC11.2, and xMDC13) showed strong similarity to known mammalian MDC proteins, whereas the fifth (xMDC16) apparently represents a novel family member. Northern blot analysis revealed that the mRNA for xMDC16 was only expressed in testis, and not in heart, muscle, liver, ovaries, or eggs, whereas the mRNAs corresponding to the four other PCR products were expressed in testis and in some or all somatic tissues tested. The xMDC16 protein sequence, as predicted from the full-length cDNA, contains a metalloprotease domain with the active-site sequence HEXXH, a disintegrin domain, a cysteine-rich region, an EGF repeat, a transmembrane domain, and a short cytoplasmic tail. To study a potential role for these xMDC proteins in fertilization, peptides corresponding to the predicted integrin-binding domain of each protein were tested for their ability to inhibit X. laevis fertilization. Cyclic and linear xMDC16 peptides inhibited fertilization in a concentration-dependent manner, whereas xMDC16 peptides that were scrambled or had certain amino acid replacements in the predicted integrin-binding domain did not affect fertilization. Cyclic and linear xMDC9 peptides and linear xMDC13 peptides also inhibited fertilization similarly to xMDC16 peptides, whereas peptides corresponding to the predicted integrin-binding site of xMDC11.1 and xMDC11.2 did not. These results are discussed in the context of a model in which multiple MDC protein-receptor interactions are necessary for fertilization to occur.
JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles
Portales-Casamar, Elodie; Thongjuea, Supat; Kwon, Andrew T.; Arenillas, David; Zhao, Xiaobei; Valen, Eivind; Yusuf, Dimas; Lenhard, Boris; Wasserman, Wyeth W.; Sandelin, Albin
2010-01-01
JASPAR (http://jaspar.genereg.net) is the leading open-access database of matrix profiles describing the DNA-binding patterns of transcription factors (TFs) and other proteins interacting with DNA in a sequence-specific manner. Its fourth major release is the largest expansion of the core database to date: the database now holds 457 non-redundant, curated profiles. The new entries include the first batch of profiles derived from ChIP-seq and ChIP-chip whole-genome binding experiments, and 177 yeast TF binding profiles. The introduction of a yeast division brings the convenience of JASPAR to an active research community. As binding models are refined by newer data, the JASPAR database now uses versioning of matrices: in this release, 12% of the older models were updated to improved versions. Classification of TF families has been improved by adopting a new DNA-binding domain nomenclature. A curated catalog of mammalian TFs is provided, extending the use of the JASPAR profiles to additional TFs belonging to the same structural family. The changes in the database set the system ready for more rapid acquisition of new high-throughput data sources. Additionally, three new special collections provide matrix profile data produced by recent alternative high-throughput approaches. PMID:19906716
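Matrix profiles of the kind JASPAR stores are commonly used by converting per-position base counts into log-odds weights and sliding them along a sequence. The sketch below illustrates that general technique in plain Python. The count matrix, pseudocount, and uniform background are made-up assumptions for illustration, not a JASPAR entry or the database's own scoring code.

```python
import math

BASES = "ACGT"

# Hypothetical 4-position count matrix (10 aligned sites); not a real profile.
example_pfm = {
    "A": [10, 0, 0, 10],
    "C": [0, 0, 10, 0],
    "G": [0, 10, 0, 0],
    "T": [0, 0, 0, 0],
}


def pfm_to_pwm(pfm, pseudocount=1.0, background=0.25):
    """Convert base counts per position into log2-odds weights."""
    pwm = []
    for i in range(len(pfm["A"])):
        total = sum(pfm[b][i] for b in BASES)
        column = {}
        for b in BASES:
            # The pseudocount avoids log(0) for bases never observed.
            freq = (pfm[b][i] + pseudocount * background) / (total + pseudocount)
            column[b] = math.log2(freq / background)
        pwm.append(column)
    return pwm


def score(pwm, site):
    """Sum the position weights for a site of the same width as the matrix."""
    if len(site) != len(pwm):
        raise ValueError("site length must equal matrix width")
    return sum(column[base] for column, base in zip(pwm, site))


def best_hit(pwm, sequence):
    """Return (score, offset) of the highest-scoring window in sequence."""
    width = len(pwm)
    windows = range(len(sequence) - width + 1)
    return max((score(pwm, sequence[i:i + width]), i) for i in windows)


pwm = pfm_to_pwm(example_pfm)
print(best_hit(pwm, "TTAGCATT"))
```

Scanning only the forward strand keeps the sketch short; a practical scan would also score the reverse complement and compare hits against a score threshold.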
Identification and functional characterization of BTas transactivator as a DNA-binding protein.
Tan, Juan; Hao, Peng; Jia, Rui; Yang, Wei; Liu, Ruichang; Wang, Jinzhong; Xi, Zhen; Geng, Yunqi; Qiao, Wentao
2010-09-30
The genome of bovine foamy virus (BFV) encodes a transcriptional transactivator, namely BTas, that remarkably enhances gene expression by binding to the viral long-terminal repeat promoter (LTR) and internal promoter (IP). In this report, we characterized the functional domains of BFV BTas. BTas contains two major functional domains: the N-terminal DNA-binding domain (residues 1-133) and the C-terminal activation domain (residues 198-249). The complete BTas responsive regions were mapped to the positions -380/-140 of LTR and 9205/9276 of IP. Four BTas responsive elements were identified at the positions -368/-346, -327/-307, -306/-285 and -186/-165 of the BFV LTR, and one element was identified at the position 9243/9264 of the BFV IP. Unlike other foamy viruses, the five BTas responsive elements in BFV shared obvious sequence homology. These data suggest that among the complex retroviruses, BFV appears to have a unique transactivation mechanism.
The structural role of the zinc ion can be dispensable in prokaryotic zinc-finger domains
Baglivo, Ilaria; Russo, Luigi; Esposito, Sabrina; Malgieri, Gaetano; Renda, Mario; Salluzzo, Antonio; Di Blasio, Benedetto; Isernia, Carla; Fattorusso, Roberto; Pedone, Paolo V.
2009-01-01
The recent characterization of the prokaryotic Cys2His2 zinc-finger domain, identified in Ros protein from Agrobacterium tumefaciens, has demonstrated that, although possessing a similar zinc coordination sphere, this domain is structurally very different from its eukaryotic counterpart. A search in the databases has identified ≈300 homologues with a high sequence identity to the Ros protein, including the amino acids that form the extensive hydrophobic core in Ros. Surprisingly, the Cys2His2 zinc coordination sphere is generally poorly conserved in the Ros homologues, raising the question of whether the zinc ion is always preserved in these proteins. Here, we present a functional and structural study of a point mutant of Ros protein, Ros56–142C82D, in which the second coordinating cysteine is replaced by an aspartate, 5 previously-uncharacterized representative Ros homologues from Mesorhizobium loti, and 2 mutants of the homologues. Our results indicate that the prokaryotic zinc-finger domain, which in Ros protein tetrahedrally coordinates Zn(II) through the typical Cys2His2 coordination, in Ros homologues can either exploit a CysAspHis2 coordination sphere, previously never described in DNA binding zinc finger domains to our knowledge, or lose the metal, while still preserving the DNA-binding activity. We demonstrate that this class of prokaryotic zinc-finger domains is structurally very adaptable, and surprisingly single mutations can transform a zinc-binding domain into a nonzinc-binding domain and vice versa, without affecting the DNA-binding ability. In light of our findings an evolutionary link between the prokaryotic and eukaryotic zinc-finger domains, based on bacteria-to-eukaryota horizontal gene transfer, is discussed. PMID:19369210
Zhang, Yun; Liu, Fang; Nie, Jinfang; Jiang, Fuyang; Zhou, Caibin; Yang, Jiani; Fan, Jinlong; Li, Jianping
2014-05-07
In this paper, we report for the first time an electrochemical biosensor for single-step, reagentless, and picomolar detection of a sequence-specific DNA-binding protein using a double-stranded, electrode-bound DNA probe terminally modified with a redox active label close to the electrode surface. This new methodology is based upon local repression of electrolyte diffusion associated with protein-DNA binding that leads to reduction of the electrochemical response of the label. In the proof-of-concept study, the resulting electrochemical biosensor was quantitatively sensitive to the concentrations of the TATA binding protein (TBP, a model analyte) ranging from 40 pM to 25.4 nM with an estimated detection limit of ∼10.6 pM (∼80 to 400-fold improvement on the detection limit over previous electrochemical analytical systems).
2015-01-01
Type IB topoisomerases unwind positive and negative DNA supercoils and play a key role in removing supercoils that would otherwise accumulate at replication and transcription forks. An interesting question is whether topoisomerase activity is regulated by the topological state of the DNA, thereby providing a mechanism for targeting the enzyme to highly supercoiled DNA domains in genomes. The type IB enzyme from variola virus (vTopo) has proven to be useful in addressing mechanistic questions about topoisomerase function because it forms a reversible 3′-phosphotyrosyl adduct with the DNA backbone at a specific target sequence (5′-CCCTT-3′) from which DNA unwinding can proceed. We have synthesized supercoiled DNA minicircles (MCs) containing a single vTopo target site that provides highly defined substrates for exploring the effects of supercoil density on DNA binding, strand cleavage and ligation, and unwinding. We observed no topological dependence for binding of vTopo to these supercoiled MC DNAs, indicating that affinity-based targeting to supercoiled DNA regions by vTopo is unlikely. Similarly, the cleavage and religation rates of the MCs were not topologically dependent, but topoisomers with low superhelical densities were found to unwind more slowly than highly supercoiled topoisomers, suggesting that reduced torque at low superhelical densities leads to an increased number of cycles of cleavage and ligation before a successful unwinding event. The K271E charge reversal mutant has an impaired interaction with the rotating DNA segment that leads to an increase in the number of supercoils that were unwound per cleavage event. This result provides evidence that interactions of the enzyme with the rotating DNA segment can restrict the number of supercoils that are unwound. We infer that both superhelical density and transient contacts between vTopo and the rotating DNA determine the efficiency of supercoil unwinding. 
Such determinants are likely to be important in regulating the steady-state superhelical density of DNA domains in the cell. PMID:24945825
Freimuth, Paul I.
2010-04-06
The invention provides recombinant human CAR (coxsackievirus and adenovirus receptor) polypeptides which bind adenovirus. Specifically, polypeptides corresponding to adenovirus binding domain D1 and the entire extracellular domain of human CAR protein comprising D1 and D2 are provided. In another aspect, the invention provides nucleic acid sequences encoding these domains and expression vectors for producing the domains and bacterial cells containing such vectors. The invention also includes an isolated fusion protein comprised of the D1 polypeptide fused to a polypeptide which facilitates folding of D1 when expressed in bacteria. The functional D1 domain finds application in a therapeutic method for treating a patient infected with a CAR D1-binding virus, and also in a method for identifying an antiviral compound which interferes with viral attachment. The invention also provides a method for specifically targeting a cell for infection by a virus which binds to D1.
Ma, Xin; Guo, Jing; Sun, Xiao
2016-01-01
DNA-binding proteins are fundamentally important in cellular processes. Several computational-based methods have been developed to improve the prediction of DNA-binding proteins in previous years. However, insufficient work has been done on the prediction of DNA-binding proteins from protein sequence information. In this paper, a novel predictor, DNABP (DNA-binding proteins), was designed to predict DNA-binding proteins using the random forest (RF) classifier with a hybrid feature. The hybrid feature contains two types of novel sequence features, which reflect information about the conservation of physicochemical properties of the amino acids, and the binding propensity of DNA-binding residues and non-binding propensities of non-binding residues. The comparisons with each feature demonstrated that these two novel features contributed most to the improvement in predictive ability. Furthermore, to improve the prediction performance of the DNABP model, feature selection using the minimum redundancy maximum relevance (mRMR) method combined with incremental feature selection (IFS) was carried out during the model construction. The results showed that the DNABP model could achieve 86.90% accuracy, 83.76% sensitivity, 90.03% specificity and a Matthews correlation coefficient of 0.727. High prediction accuracy and performance comparisons with previous research suggested that DNABP could be a useful approach to identify DNA-binding proteins from sequence information. The DNABP web server system is freely available at http://www.cbi.seu.edu.cn/DNABP/.
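The four figures quoted above (accuracy, sensitivity, specificity, Matthews correlation coefficient) are all functions of a binary confusion matrix. The following sketch shows the standard definitions. The counts used in the example are hypothetical and are not the DNABP benchmark data, so the printed values are not expected to reproduce the reported ones.

```python
import math


def binary_metrics(tp, fn, tn, fp):
    """Standard binary classification metrics from confusion-matrix counts."""
    total = tp + fn + tn + fp
    accuracy = (tp + tn) / total
    sensitivity = tp / (tp + fn)   # true positive rate (recall)
    specificity = tn / (tn + fp)   # true negative rate
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # MCC is conventionally set to 0 when any marginal total is zero.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "mcc": mcc,
    }


# Hypothetical counts for illustration only.
for name, value in binary_metrics(tp=8, fn=2, tn=9, fp=1).items():
    print(f"{name}: {value:.3f}")
```

Unlike accuracy, the Matthews coefficient uses all four cells of the confusion matrix, which is why it is often reported alongside sensitivity and specificity for predictors trained on unbalanced data.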
DNA/RNA hybrid substrates modulate the catalytic activity of purified AID.
Abdouni, Hala S; King, Justin J; Ghorbani, Atefeh; Fifield, Heather; Berghuis, Lesley; Larijani, Mani
2018-01-01
Activation-induced cytidine deaminase (AID) converts cytidine to uridine at Immunoglobulin (Ig) loci, initiating somatic hypermutation and class switching of antibodies. In vitro, AID acts on single stranded DNA (ssDNA), but neither double-stranded DNA (dsDNA) oligonucleotides nor RNA, and it is believed that transcription is the in vivo generator of ssDNA targeted by AID. It is also known that the Ig loci, particularly the switch (S) regions targeted by AID are rich in transcription-generated DNA/RNA hybrids. Here, we examined the binding and catalytic behavior of purified AID on DNA/RNA hybrid substrates bearing either random sequences or GC-rich sequences simulating Ig S regions. If substrates were made up of a random sequence, AID preferred substrates composed entirely of DNA over DNA/RNA hybrids. In contrast, if substrates were composed of S region sequences, AID preferred to mutate DNA/RNA hybrids over substrates composed entirely of DNA. Accordingly, AID exhibited a significantly higher affinity for binding DNA/RNA hybrid substrates composed specifically of S region sequences, than any other substrates composed of DNA. Thus, in the absence of any other cellular processes or factors, AID itself favors binding and mutating DNA/RNA hybrids composed of S region sequences. AID:DNA/RNA complex formation and supporting mutational analyses suggest that recognition of DNA/RNA hybrids is an inherent structural property of AID.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chakraborty, Kaushik; Bandyopadhyay, Sanjoy, E-mail: sanjoy@chem.iitkgp.ernet.in
2015-07-28
Single-stranded DNA (ss-DNA) binding proteins specifically bind to the single-stranded regions of the DNA and protect it from premature annealing, thereby stabilizing the DNA structure. We have carried out atomistic molecular dynamics simulations of the aqueous solutions of two DNA binding K homology (KH) domains (KH3 and KH4) of the far upstream element binding protein complexed with two short ss-DNA segments. Attempts have been made to explore the influence of the formation of such complex structures on the microscopic dynamics and hydrogen bond properties of the interfacial water molecules. It is found that the water molecules involved in bridging the ss-DNA segments and the protein domains form a highly constrained thin layer with extremely retarded mobility. These water molecules play important roles in freezing the conformational oscillations of the ss-DNA oligomers and thereby forming rigid complex structures. Further, it is demonstrated that the effect of complexation on the slow long-time relaxations of hydrogen bonds at the interface is correlated with hindered motions of the surrounding water molecules. Importantly, it is observed that the highly restricted motions of the water molecules bridging the protein and the DNA components in the complexed forms originate from more frequent hydrogen bond reformations.
Escherichia coli ArgR mutants defective in cer/Xer recombination, but not in DNA binding.
Sénéchal, Hélène; Delesques, Jérémy; Szatmari, George
2010-04-01
The Escherichia coli arginine repressor (ArgR) is an L-arginine-dependent DNA-binding protein that controls the expression of the arginine biosynthetic genes and is required as an accessory factor for Xer site-specific recombination at cer and related recombination sites in plasmids. We used the technique of pentapeptide scanning mutagenesis to isolate a series of ArgR mutants that were considerably reduced in cer recombination, but were still able to repress an argA::lacZ fusion. DNA sequence analysis showed that all of the mutants mapped to the same nucleotide, resulting in a five amino acid insertion between residues 149 and 150 of ArgR, corresponding to the end of the alpha6 helix. A truncated ArgR containing a stop codon at residue 150 displayed the same phenotype as the protein with the five amino acid insertion, and both mutants displayed sequence-specific DNA-binding activity that was L-arginine dependent. These results show that the C-terminus of ArgR is more important in cer/Xer site-specific recombination than in DNA binding.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Yanli; Sheng, Gang; Juranek, Stefan
The slicer activity of the RNA-induced silencing complex is associated with argonaute, the RNase H-like PIWI domain of which catalyses guide-strand-mediated sequence-specific cleavage of target messenger RNA. Here we report on the crystal structure of Thermus thermophilus argonaute bound to a 5'-phosphorylated 21-base DNA guide strand, thereby identifying the nucleic-acid-binding channel positioned between the PAZ- and PIWI-containing lobes, as well as the pivot-like conformational changes associated with complex formation. The bound guide strand is anchored at both of its ends, with the solvent-exposed Watson-Crick edges of stacked bases 2 to 6 positioned for nucleation with the mRNA target, whereas two critically positioned arginines lock bases 10 and 11 at the cleavage site into an unanticipated orthogonal alignment. Biochemical studies indicate that key amino acid residues at the active site and those lining the 5'-phosphate-binding pocket made up of the Mid domain are critical for cleavage activity, whereas alterations of residues lining the 2-nucleotide 3'-end-binding pocket made up of the PAZ domain show little effect.
The functional landscape bound to the transcription factors of Escherichia coli K-12.
Pérez-Rueda, Ernesto; Tenorio-Salgado, Silvia; Huerta-Saquero, Alejandro; Balderas-Martínez, Yalbi I; Moreno-Hagelsieb, Gabriel
2015-10-01
Motivated by the experimental evidence accumulated in the last ten years and based on information deposited in RegulonDB, literature searches, and sequence analysis, we analyze the repertoire of 304 DNA-binding transcription factors (TFs) in Escherichia coli K-12. These regulators were grouped in 78 evolutionary families and regulate almost half of the total genes in this bacterium. In structural terms, 60% of TFs are composed of two domains, 30% are monodomain, and 10% have three or four structural domains. As previously noticed, the most abundant DNA-binding domain corresponds to the winged helix-turn-helix, with few alternative DNA-binding structures, resembling the hypothesis of successful protein structures with the emergence of new ones at low scales. In summary, we identified and described the characteristics associated with the DNA-binding TFs in E. coli K-12. We also identified twelve functional modules based on a co-regulated gene matrix. Finally, diverse regulons were predicted based on direct associations between the TFs and potential regulated genes. This analysis should increase our knowledge about gene regulation in the bacterium E. coli K-12, and provide additional clues for comprehensive modelling of transcriptional regulatory networks in other bacteria.
ERIC Educational Resources Information Center
Kugel, Jennifer F.
2008-01-01
An undergraduate biochemistry laboratory experiment that will teach the technique of fluorescence resonance energy transfer (FRET) while analyzing protein-induced DNA bending is described. The experiment uses the protein TATA binding protein (TBP), which is a general transcription factor that recognizes and binds specific DNA sequences known as…
Finding the target sites of RNA-binding proteins
Li, Xiao; Kazan, Hilal; Lipshitz, Howard D; Morris, Quaid D
2014-01-01
RNA–protein interactions differ from DNA–protein interactions because of the central role of RNA secondary structure. Some RNA-binding domains (RBDs) recognize their target sites mainly by their shape and geometry and others are sequence-specific but are sensitive to secondary structure context. A number of small- and large-scale experimental approaches have been developed to measure RNAs associated in vitro and in vivo with RNA-binding proteins (RBPs). Generalizing outside of the experimental conditions tested by these assays requires computational motif finding. Often RBP motif finding is done by adapting DNA motif finding methods; but modeling secondary structure context leads to better recovery of RBP-binding preferences. Genome-wide assessment of mRNA secondary structure has recently become possible, but these data must be combined with computational predictions of secondary structure before they add value in predicting in vivo binding. There are two main approaches to incorporating structural information into motif models: supplementing primary sequence motif models with preferred secondary structure contexts (e.g., MEMERIS and RNAcontext) and directly modeling secondary structure recognized by the RBP using stochastic context-free grammars (e.g., CMfinder and RNApromo). The former better reconstruct known binding preferences for sequence-specific RBPs but are not suitable for modeling RBPs that recognize shape and geometry of RNAs. Future work in RBP motif finding should incorporate interactions between multiple RBDs and multiple RBPs in binding to RNA. WIREs RNA 2014, 5:111–130. doi: 10.1002/wrna.1201 PMID:24217996
Rattanaporn, Onnicha; Utarabhand, Prapaporn
2011-02-01
A diverse class of pattern-recognition proteins called lectins play important roles in shrimp innate immunity. A novel C-type lectin gene (FmLC) was cloned from the hepatopancreas of banana shrimp Fenneropenaeus merguiensis by means of PCR and 5' and 3' rapid amplification of cDNA ends (RACE). The full-length cDNA consists of 1118 bp with one 1002 bp open reading frame, encoding 333 amino acids. Its deduced amino acid sequence contains a putative signal peptide of 20 amino acids. FmLC contains two carbohydrate recognition domains, CRD1 and CRD2, that share only 30% identity with each other. The first CRD comprises a QPD motif with specificity for binding galactose and a single Ca(2+) binding site, while the second CRD consists of an EPN motif for a mannose-specific binding site. FmLC had a close evolutionary relationship to other dual-CRD lectins of penaeid shrimp. Expression results showed that transcripts of FmLC were detected only in the hepatopancreas; none was found in other tissues. After challenging either whole shrimp or hepatopancreas tissue fragments with Vibrio harveyi, the expression of FmLC was up-regulated. This indicates that FmLC is inducible and may be involved in a shrimp immune response to recognize potential bacterial pathogens.
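The relation between the reported open reading frame length and the protein length is simple codon arithmetic: 1002 bp is 334 codons, and if the last codon is the stop codon (an assumption, since the abstract does not say so explicitly) the translated product has 333 residues. A minimal sketch, with hypothetical function names:

```python
def orf_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of the given length in bp."""
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if includes_stop else codons


def mature_length(precursor_aa, signal_peptide_aa):
    """Residues remaining after removal of an N-terminal signal peptide."""
    if not 0 <= signal_peptide_aa < precursor_aa:
        raise ValueError("signal peptide must be shorter than the precursor")
    return precursor_aa - signal_peptide_aa


precursor = orf_protein_length(1002)   # 333 residues
mature = mature_length(precursor, 20)  # 313 residues after signal cleavage
print(precursor, mature)
```

The mature length of 313 residues follows only if the putative 20-residue signal peptide is actually cleaved, which the abstract predicts but does not demonstrate.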
Wang, Yupeng; Khan, Iram F.; Boissel, Sandrine; Jarjour, Jordan; Pangallo, Joseph; Thyme, Summer; Baker, David; Scharenberg, Andrew M.; Rawlings, David J.
2014-01-01
LAGLIDADG homing endonucleases (LHEs) are compact endonucleases with 20–22 bp recognition sites, and thus are ideal scaffolds for engineering site-specific DNA cleavage enzymes for genome editing applications. Here, we describe a general approach to LHE engineering that combines rational design with directed evolution, using a yeast surface display high-throughput cleavage selection. This approach was employed to alter the binding and cleavage specificity of the I-AniI LHE to recognize a mutation in the mouse Bruton tyrosine kinase (Btk) gene causative for mouse X-linked immunodeficiency (XID), a model of human X-linked agammaglobulinemia (XLA). The required re-targeting of I-AniI involved progressive resculpting of the DNA contact interface to accommodate nine base differences from the native cleavage sequence. The enzyme emerging from the progressive engineering process was specific for the XID mutant allele versus the wild-type (WT) allele, and exhibited activity equivalent to WT I-AniI in vitro and in cellulo reporter assays. Fusion of the enzyme to a site-specific DNA binding domain of transcription activator-like effector (TALE) resulted in a further enhancement of gene editing efficiency. These results illustrate the potential of LHE enzymes as specific and efficient tools for therapeutic genome engineering. PMID:24682825
Keyamura, Kenji; Katayama, Tsutomu
2011-08-19
Chromosomal replication is initiated from the replication origin oriC in Escherichia coli by the active ATP-bound form of DnaA protein. The regulatory inactivation of DnaA (RIDA) system, a complex of the ADP-bound Hda and the DNA-loaded replicase clamp, represses extra initiations by facilitating DnaA-bound ATP hydrolysis, yielding the inactive ADP-bound form of DnaA. However, the mechanisms involved in promoting the DnaA-Hda interaction have not been determined except for the involvement of an interaction between the AAA+ domains of the two. This study revealed that DnaA Leu-422 and Pro-423 residues within DnaA domain IV, including a typical DNA-binding HTH motif, are specifically required for RIDA-dependent ATP hydrolysis in vitro and that these residues support efficient interaction with the DNA-loaded clamp·Hda complex and with Hda in vitro. Consistently, substitutions of these residues caused accumulation of ATP-bound DnaA in vivo and oriC-dependent inhibition of cell growth. Leu-422 plays a more important role in these activities than Pro-423. By contrast, neither of these residues is crucial for DNA replication from oriC, although they are highly conserved in DnaA orthologues. Structural analysis of a DnaA·Hda complex model suggested that these residues make contact with residues in the vicinity of the Hda AAA+ sensor I that participates in formation of a nucleotide-interacting surface. Together, the results show that functional DnaA-Hda interactions require a second interaction site within DnaA domain IV in addition to the AAA+ domain and suggest that these interactions are crucial for the formation of RIDA complexes that are active for DnaA-ATP hydrolysis. PMID:21708944
Toral-López, Jaime; González-Huerta, Luz M; Martín-Del Campo, Mónica; Messina-Baas, Olga; Cuevas-Covarrubias, Sergio A
2018-05-01
The proband in this study was a 4-year-old Mexican girl with Blau syndrome. She and her affected family members had skin rash and arthritis but no uveitis. Exome sequencing and DNA direct sequencing from blood samples revealed a novel nucleotide-binding oligomerization domain-containing protein 2 gene mutation in the affected family members. This study is the first report of a Mexican family with Blau syndrome showing good infliximab treatment response. The novel mutation in the nucleotide-binding oligomerization domain-containing protein 2 gene (c.1808A>G) enriches the mutation spectrum in Blau syndrome. This family represents one of the few cases of autosomal Blau syndrome with no uveitis; because of phenotype variability, it is important to recognize Blau syndrome's clinical spectrum and recommend genetic consultation. © 2018 Wiley Periodicals, Inc.
MIPS: a calmodulin-binding protein of Gracilaria lemaneiformis under heat shock.
Zhang, Xuan; Zhou, Huiyue; Zang, Xiaonan; Gong, Le; Sun, Hengyi; Zhang, Xuecheng
2014-08-01
To study the Ca(2+)/Calmodulin (CaM) signal transduction pathway of Gracilaria lemaneiformis under heat stress, myo-inositol-1-phosphate synthase (MIPS), a calmodulin-binding protein, was isolated using the yeast two-hybrid system. cDNA and DNA sequences of mips were cloned from G. lemaneiformis by using 5'RACE and genome walking procedures. The MIPS DNA sequence was 2,067 nucleotides long, containing an open reading frame (ORF) of 1,623 nucleotides with no intron. The mips ORF was predicted to encode 540 amino acids, which included the conserved MIPS domain and was 61-67% similar to that of other species. After analyzing the amino acid sequence of MIPS, the CaM-Binding Domain (CaMBD) was inferred to be at a site spanning from amino acid 212 to amino acid 236. The yeast two-hybrid results proved that MIPS can interact with CaM and that MIPS is a type of calmodulin-binding protein. Next, the expression of CaM and MIPS in wild-type G. lemaneiformis and a heat-tolerant G. lemaneiformis cultivar, "981," were analyzed using real-time PCR under a heat shock of 32 °C. The expression level displayed a cyclical upward trend. Compared with wild type, the CaM expression levels of cultivar 981 were higher, which might directly relate to its resistance to high temperatures. This paper indicates that MIPS and CaM may play important roles in the high-temperature resistance of G. lemaneiformis.
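The ORF length and the predicted protein length quoted in this abstract are mutually consistent if the 1,623-nucleotide count includes the stop codon; the abstract does not state this, so it is an assumption here. A minimal Python sketch of the arithmetic (the function name is illustrative and not from the paper):

```python
def protein_length_from_orf(orf_nt: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by an ORF of orf_nt nucleotides."""
    if orf_nt % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_nt // 3
    # A stop codon terminates translation and encodes no residue.
    return codons - 1 if includes_stop else codons


# Assumption: the reported 1,623 nt include the stop codon.
mips_residues = protein_length_from_orf(1623)
print(mips_residues)
```

Under that assumption, 1,623 nucleotides give 541 codons, of which 540 encode residues, matching the reported protein length.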
Belak, Zachery R; Ovsenek, Nicholas; Eskiw, Christopher H
2018-05-23
Yin-Yang 1 (YY1) is a highly conserved transcription factor possessing RNA-binding activity. A putative YY1 homologue was previously identified in the developmental model organism Strongylocentrotus purpuratus (the purple sea urchin) by genomic sequencing. We identified a high degree of sequence similarity with YY1 homologues of vertebrate origin which shared 100% protein sequence identity over the DNA- and RNA-binding zinc-finger region with high similarity in the N-terminal transcriptional activation domain. SpYY1 demonstrated identical DNA- and RNA-binding characteristics between Xenopus laevis and S. purpuratus indicating that it maintains similar functional and biochemical properties across widely divergent deuterostome species. SpYY1 binds to the consensus YY1 DNA element, and also to U-rich RNA sequences. Although we detected SpYY1 RNA-binding activity in ova lysates and observed cytoplasmic localization, SpYY1 was not associated with maternal mRNA in ova. SpYY1 expressed in Xenopus oocytes was excluded from the nucleus and associated with maternally expressed cytoplasmic mRNA molecules. These data demonstrate the existence of a YY1 homologue in S. purpuratus with similar structural and biochemical features to those of the well-studied vertebrate YY1; however, the data reveal major differences in the biological role of YY1 in the regulation of maternally expressed mRNA in the two species.
Effect of DNA Binding on Geminate CO Recombination Kinetics in CO-sensing Transcription Factor CooA
Benabbas, Abdelkrim; Karunakaran, Venugopal; Youn, Hwan; Poulos, Thomas L.; Champion, Paul M.
2012-01-01
Carbon monoxide oxidation activator (CooA) proteins are heme-based CO-sensing transcription factors. Here we study the ultrafast dynamics of geminate CO rebinding in two CooA homologues, Rhodospirillum rubrum (RrCooA) and Carboxydothermus hydrogenoformans (ChCooA). The effects of DNA binding and the truncation of the DNA-binding domain on the CO geminate recombination kinetics were specifically investigated. The CO rebinding kinetics in these CooA complexes take place on ultrafast time scales but remain non-exponential over many decades in time. We show that this non-exponential kinetic response is due to a quenched enthalpic barrier distribution resulting from a distribution of heme geometries that is frozen or slowly evolving on the time scale of CO rebinding. We also show that, upon CO binding, the distal pocket of the heme in the CooA proteins relaxes to form a very efficient hydrophobic trap for CO. DNA binding further tightens the narrow distal pocket and slightly weakens the iron-proximal histidine bond. Comparison of the CO rebinding kinetics of RrCooA, truncated RrCooA, and DNA-bound RrCooA proteins reveals that the uncomplexed and inherently flexible DNA-binding domain adds additional structural heterogeneity to the heme doming coordinate. When CooA forms a complex with DNA, the flexibility of the DNA-binding domain decreases, and the distribution of the conformations available in the heme domain becomes restricted. The kinetic studies also offer insights into how the architecture of the heme environment can tune entropic barriers in order to control the geminate recombination of CO in heme proteins, whereas spin selection rules play a minor or non-existent role. PMID:22544803
Effect of DNA binding on geminate CO recombination kinetics in CO-sensing transcription factor CooA.
Benabbas, Abdelkrim; Karunakaran, Venugopal; Youn, Hwan; Poulos, Thomas L; Champion, Paul M
2012-06-22
Carbon monoxide oxidation activator (CooA) proteins are heme-based CO-sensing transcription factors. Here we study the ultrafast dynamics of geminate CO rebinding in two CooA homologues, Rhodospirillum rubrum (RrCooA) and Carboxydothermus hydrogenoformans (ChCooA). The effects of DNA binding and the truncation of the DNA-binding domain on the CO geminate recombination kinetics were specifically investigated. The CO rebinding kinetics in these CooA complexes take place on ultrafast time scales but remain non-exponential over many decades in time. We show that this non-exponential kinetic response is due to a quenched enthalpic barrier distribution resulting from a distribution of heme geometries that is frozen or slowly evolving on the time scale of CO rebinding. We also show that, upon CO binding, the distal pocket of the heme in the CooA proteins relaxes to form a very efficient hydrophobic trap for CO. DNA binding further tightens the narrow distal pocket and slightly weakens the iron-proximal histidine bond. Comparison of the CO rebinding kinetics of RrCooA, truncated RrCooA, and DNA-bound RrCooA proteins reveals that the uncomplexed and inherently flexible DNA-binding domain adds additional structural heterogeneity to the heme doming coordinate. When CooA forms a complex with DNA, the flexibility of the DNA-binding domain decreases, and the distribution of the conformations available in the heme domain becomes restricted. The kinetic studies also offer insights into how the architecture of the heme environment can tune entropic barriers in order to control the geminate recombination of CO in heme proteins, whereas spin selection rules play a minor or non-existent role.
SivaRaman, L; Subramanian, S; Thimmappaya, B
1986-01-01
Utilizing the gel electrophoresis/DNA binding assay, a factor specific for the upstream transcriptional control sequence of the EIA-inducible adenovirus EIIA-early promoter has been detected in HeLa cell nuclear extract. Analysis of linker-scanning mutants of the promoter by DNA binding assays and methylation-interference experiments shows that the factor binds to the 17-nucleotide sequence 5' TGGAGATGACGTAGTTT 3' located between positions -66 and -82 upstream from the cap site. This sequence has been shown to be essential for transcription of this promoter. The EIIA-early-promoter specific factor was found to be present at comparable levels in uninfected HeLa cells and in cells infected with either wild-type adenovirus or the EIA-deletion mutant dl312 under conditions in which the EIA proteins are induced to high levels [7 or 20 hr after infection in the presence of arabinonucleoside (cytosine arabinoside)]. Based on the quantitation in DNA binding assays, it appears that the mechanism of EIA-activated transcription of the EIIA-early promoter does not involve a net change in the amounts of this factor. PMID:2942943
AtSPX1 affects the AtPHR1-DNA-binding equilibrium by binding monomeric AtPHR1 in solution.
Qi, Wanjun; Manfield, Iain W; Muench, Stephen P; Baker, Alison
2017-10-23
Phosphorus is an essential macronutrient for plant growth and is deficient in ∼50% of agricultural soils. The transcription factor phosphate starvation response 1 (PHR1) plays a central role in regulating the expression of a subset of phosphate starvation-induced (PSI) genes through binding to a cis-acting DNA element termed P1BS (PHR1-binding sequences). In Arabidopsis and rice, activity of AtPHR1/OsPHR2 is regulated in part by their downstream target SPX (Syg1, Pho81, Xpr1) proteins through protein-protein interaction. Here, we provide kinetic and affinity data for interaction between AtPHR1 and P1BS sites. Using surface plasmon resonance, a tandem P1BS sequence showed ∼50-fold higher affinity for MBPAtdPHR1 (a fusion protein comprising the DNA-binding domain and coiled-coil domain of AtPHR1 fused to maltose-binding protein) than a single site. The affinity difference was largely reflected in a much slower dissociation rate from the 2× P1BS-binding site, suggesting an important role for protein co-operativity. Injection of AtSPX1 in the presence of phosphate or inositol hexakisphosphate (InsP6) failed to alter the MBPAtdPHR1-P1BS dissociation rate, while pre-mixing of these two proteins in the presence of either 5 mM Pi or 500 µM InsP6 resulted in a much lower DNA-binding signal from MBPAtdPHR1. These data suggest that, in the Pi-restored condition, AtSPX1 can bind to monomeric AtPHR1 in solution and therefore regulate PSI gene expression by tuning the AtPHR1-DNA-binding equilibrium. This Pi-dependent regulation of AtPHR1-DNA-binding equilibrium also generates a negative feedback loop on the expression of AtSPX1 itself, providing a tight control of PSI gene expression. © 2017 The Author(s).
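The link this abstract draws between the affinity difference and the dissociation rate can be made explicit with the standard relation between the equilibrium dissociation constant and the rate constants. This is only a sketch: it assumes a simple 1:1 binding model, which co-operative binding to the tandem site may not strictly follow, and the superscripts 1× and 2× are labels introduced here for the single and tandem P1BS sites.

```latex
K_D = \frac{k_{\mathrm{off}}}{k_{\mathrm{on}}}, \qquad
\frac{K_D^{1\times}}{K_D^{2\times}}
  = \frac{k_{\mathrm{off}}^{1\times}}{k_{\mathrm{off}}^{2\times}}
    \cdot \frac{k_{\mathrm{on}}^{2\times}}{k_{\mathrm{on}}^{1\times}}
  \approx 50
```

If the association rate constants for the two sites are similar, the second factor is close to 1 and the roughly 50-fold affinity gain is carried almost entirely by the ratio of dissociation rate constants, which is the pattern the abstract describes.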
Fibronectin tetrapeptide is target for syphilis spirochete cytadherence
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thomas, D.D.; Baseman, J.B.; Alderete, J.F.
1985-11-01
The syphilis bacterium, Treponema pallidum, parasitizes host cells through recognition of fibronectin (Fn) on cell surfaces. The active site of the Fn molecule has been identified as a four-amino acid sequence, arg-gly-asp-ser (RGDS), located on each monomer of the cell-binding domain. The synthetic heptapeptide gly-arg-gly-asp-ser-pro-cys (GRGDSPC), with the active site sequence RGDS, specifically competed with 125I-labeled cell-binding domain acquisition by T. pallidum. Additionally, the same heptapeptide with the RGDS sequence diminished treponemal attachment to HEp-2 and HT1080 cell monolayers. Related heptapeptides altered in one key amino acid within the RGDS sequence failed to inhibit Fn cell-binding domain acquisition or parasitism of host cells by T. pallidum. The data support the view that T. pallidum cytadherence of host cells is through recognition of the RGDS sequence, which is also important for eukaryotic cell-Fn binding.
Yu, Haixiang; Canoura, Juan; Guntupalli, Bhargav; Lou, Xinhui
2017-01-01
Sensors employing split aptamers that reassemble in the presence of a target can achieve excellent specificity, but the accompanying reduction of target affinity mitigates any overall gains in sensitivity. For the first time, we have developed a split aptamer that achieves enhanced target-binding affinity through cooperative binding. We have generated a split cocaine-binding aptamer that incorporates two binding domains, such that target binding at one domain greatly increases the affinity of the second domain. We experimentally demonstrate that the resulting cooperative-binding split aptamer (CBSA) exhibits higher target binding affinity and is far more responsive in terms of target-induced aptamer assembly compared to the single-domain parent split aptamer (PSA) from which it was derived. We further confirm that the target-binding affinity of our CBSA can be affected by the cooperativity of its binding domains and the intrinsic affinity of its PSA. To the best of our knowledge, CBSA-5335 has the highest cocaine affinity of any split aptamer described to date. The CBSA-based assay also demonstrates excellent performance in target detection in complex samples. Using this CBSA, we achieved specific, ultra-sensitive, one-step fluorescence detection of cocaine within fifteen minutes at concentrations as low as 50 nM in 10% saliva without signal amplification. This limit of detection meets the standards recommended by the European Union's Driving under the Influence of Drugs, Alcohol and Medicines program. Our assay also demonstrates excellent reproducibility of results, confirming that this CBSA-platform represents a robust and sensitive means for cocaine detection in actual clinical samples. PMID:28451157
Cross-talk between the ligand- and DNA-binding domains of estrogen receptor.
Huang, Wei; Greene, Geoffrey L; Ravikumar, Krishnakumar M; Yang, Sichun
2013-11-01
Estrogen receptor alpha (ERα) is a hormone-responsive transcription factor that contains several discrete functional domains, including a ligand-binding domain (LBD) and a DNA-binding domain (DBD). Despite a wealth of knowledge about the behaviors of individual domains, the molecular mechanisms of cross-talk between LBD and DBD during signal transduction from hormone to DNA-binding of ERα remain elusive. Here, we apply a multiscale approach combining coarse-grained (CG) and atomistically detailed simulations to characterize this cross-talk mechanism via an investigation of the ERα conformational landscape. First, a CG model of ERα is built based on crystal structures of individual LBDs and DBDs, with more emphasis on their interdomain interactions. Second, molecular dynamics simulations are implemented and enhanced sampling is achieved via the "push-pull-release" strategy in the search for different LBD-DBD orientations. Third, multiple energetically stable ERα conformations are identified on the landscape. A key finding is that estradiol-bound LBDs utilize the well-described activation helix H12 to pack and stabilize LBD-DBD interactions. Our results suggest that the estradiol-bound LBDs can serve as a scaffold to position and stabilize the DBD-DNA complex, consistent with experimental observations of enhanced DNA binding with the LBD. Final assessment using atomic-level simulations shows that these CG-predicted models are significantly stable within a 15-ns simulation window and that specific pairs of lysine residues in close proximity at the domain interfaces could serve as candidate sites for chemical cross-linking studies. Together, these simulation results provide a molecular view of the role of ERα domain interactions in response to hormone binding. Copyright © 2013 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Buchman, A.R.; Kimmerly, W.J.; Rine, J.
1988-01-01
Two DNA-binding factors from Saccharomyces cerevisiae have been characterized, GRFI (general regulatory factor I) and ABFI (ARS-binding factor I), that recognize specific sequences within diverse genetic elements. GRFI bound to sequences at the negative regulatory elements (silencers) of the silent mating type loci HML E and HMR E and to the upstream activating sequence (UAS) required for transcription of the MATα genes. A putative conserved UAS located at genes involved in translation (RPG box) was also recognized by GRFI. In addition, GRFI bound with high affinity to sequences within the (C1-3A)-repeat region at yeast telomeres. Binding sites for GRFI with the highest affinity appeared to be of the form 5'-(A/G)(A/C)ACCCANNCA(T/C)(T/C)-3', where N is any nucleotide. ABFI-binding sites were located next to autonomously replicating sequences (ARSs) at controlling elements of the silent mating type loci HMR E, HMR I, and HML I and were associated with ARS1, ARS2, and the 2μm plasmid ARS. Two tandem ABFI binding sites were found between the HIS3 and DED1 genes, several kilobase pairs from any ARS, indicating that ABFI-binding sites are not restricted to ARSs. The sequences recognized by ABFI showed partial dyad-symmetry and appeared to be variations of the consensus 5'-TATCATTNNNNACGA-3'. GRFI and ABFI were both abundant DNA-binding factors and did not appear to be encoded by the SIR genes, whose products are required for repression of the silent mating type loci. Together, these results indicate that both GRFI and ABFI play multiple roles within the cell.
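The two consensus sequences in this abstract can be turned into simple pattern matchers. The sketch below is a hypothetical illustration, not the authors' method: it reads the GRFI consensus as a contiguous 13-mer, treats N as any of A/C/G/T, scans only the strand it is given, and uses made-up test sequences. The helper names are introduced here for illustration.

```python
import re


def consensus_to_regex(consensus: str) -> str:
    """Translate a consensus such as '(A/G)(A/C)ACCCANNCA(T/C)(T/C)' into a regex."""
    # '(A/G)' style alternatives become character classes such as '[AG]'.
    pattern = re.sub(
        r"\(([ACGT](?:/[ACGT])+)\)",
        lambda m: "[" + m.group(1).replace("/", "") + "]",
        consensus,
    )
    # 'N' stands for any nucleotide.
    return pattern.replace("N", "[ACGT]")


GRFI = re.compile(consensus_to_regex("(A/G)(A/C)ACCCANNCA(T/C)(T/C)"))
ABFI = re.compile(consensus_to_regex("TATCATTNNNNACGA"))


def find_sites(regex: "re.Pattern[str]", seq: str) -> list:
    """Return (start, matched_site) pairs for non-overlapping matches on the given strand."""
    return [(m.start(), m.group()) for m in regex.finditer(seq.upper())]


# Made-up sequences used only to exercise the matchers.
print(find_sites(GRFI, "ggAAACCCATTCATTgg"))
print(find_sites(ABFI, "TATCATTGGCCACGA"))
```

A fuller scan would also search the reverse complement, and because the abstract describes ABFI sites as variations of the consensus, an exact-match regex like this one would likely miss some real sites.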
Palermo, Giulia; Miao, Yinglong; Walker, Ross C; Jinek, Martin; McCammon, J Andrew
2016-10-26
The CRISPR (clustered regularly interspaced short palindromic repeats)-Cas9 system recently emerged as a transformative genome-editing technology that is innovating basic bioscience and applied medicine and biotechnology. The endonuclease Cas9 associates with a guide RNA to match and cleave complementary sequences in double stranded DNA, forming an RNA:DNA hybrid and a displaced non-target DNA strand. Although extensive structural studies are ongoing, the conformational dynamics of Cas9 and its interplay with the nucleic acids during association and DNA cleavage are largely unclear. Here, by employing multi-microsecond time scale molecular dynamics, we reveal the conformational plasticity of Cas9 and identify key determinants that allow its large-scale conformational changes during nucleic acid binding and processing. We show how the "closure" of the protein, which accompanies nucleic acid binding, fundamentally relies on highly coupled and specific motions of the protein domains, collectively initiating the prominent conformational changes needed for nucleic acid association. We further reveal a key role of the non-target DNA during the process of activation of the nuclease HNH domain, showing how the non-target DNA positioning triggers local conformational changes that favor the formation of a catalytically competent Cas9. Finally, a remarkable conformational plasticity is identified as an intrinsic property of the HNH domain, constituting a necessary element that allows for the HNH repositioning. These novel findings constitute a reference for future experimental studies aimed at a full characterization of the dynamic features of the CRISPR-Cas9 system and, more importantly, call for novel structure engineering efforts that are of fundamental importance for the rational design of new genome-engineering applications.
Structure and Sequence Search on Aptamer-Protein Docking
NASA Astrophysics Data System (ADS)
Xiao, Jiajie; Bonin, Keith; Guthold, Martin; Salsbury, Freddie
2015-03-01
Interactions between proteins and deoxyribonucleic acid (DNA) play a significant role in living systems, especially through gene regulation. However, short nucleic acid sequences (aptamers) with specific binding affinity to specific proteins exhibit clinical potential as therapeutics. Our capillary and gel electrophoresis selection experiments show that specific sequences of aptamers can be selected that bind specific proteins. Computationally, given the experimentally determined structure and sequence of a thrombin-binding aptamer, we can successfully dock the aptamer onto thrombin in agreement with experimental structures of the complex. In order to further study the conformational flexibility of this thrombin-binding aptamer and to potentially develop a predictive computational model of aptamer binding, we use GPU-enabled molecular dynamics simulations both to examine the conformational flexibility of the aptamer in the absence of binding to thrombin and to determine our ability to fold an aptamer. This study should help further de novo predictions of aptamer sequences by enabling the study of structural and sequence-dependent effects on aptamer-protein docking specificity.