sequence-specific dna-binding proteins: Topics by Science.gov

Sample records for sequence-specific dna-binding proteins

Isolation and characterization of target sequences of the chicken CdxA homeobox gene.

PubMed Central

Margalit, Y; Yarus, S; Shapira, E; Gruenbaum, Y; Fainsod, A

1993-01-01

The DNA binding specificity of the chicken homeodomain protein CDXA was studied. Using a CDXA-glutathione-S-transferase fusion protein, DNA fragments containing the binding site for this protein were isolated. The sources of DNA were oligonucleotides with random sequence and chicken genomic DNA. The DNA fragments isolated were sequenced and tested in DNA binding assays. Sequencing revealed that most DNA fragments are AT rich which is a common feature of homeodomain binding sites. By electrophoretic mobility shift assays it was shown that the different target sequences isolated bind to the CDXA protein with different affinities. The specific sequences bound by the CDXA protein in the genomic fragments isolated, were determined by DNase I footprinting. From the footprinted sequences, the CDXA consensus binding site was determined. The CDXA protein binds the consensus sequence A, A/T, T, A/T, A, T, A/G. The CAUDAL binding site in the ftz promoter is also included in this consensus sequence. When tested, some of the genomic target sequences were capable of enhancing the transcriptional activity of reporter plasmids when introduced into CDXA expressing cells. This study determined the DNA sequence specificity of the CDXA protein and it also shows that this protein can further activate transcription in cells in culture. Images PMID:7909943
Specific minor groove solvation is a crucial determinant of DNA binding site recognition

PubMed Central

Harris, Lydia-Ann; Williams, Loren Dean; Koudelka, Gerald B.

2014-01-01

The DNA sequence preferences of nearly all sequence specific DNA binding proteins are influenced by the identities of bases that are not directly contacted by protein. Discrimination between non-contacted base sequences is commonly based on the differential abilities of DNA sequences to allow narrowing of the DNA minor groove. However, the factors that govern the propensity of minor groove narrowing are not completely understood. Here we show that the differential abilities of various DNA sequences to support formation of a highly ordered and stable minor groove solvation network are a key determinant of non-contacted base recognition by a sequence-specific binding protein. In addition, disrupting the solvent network in the non-contacted region of the binding site alters the protein's ability to recognize contacted base sequences at positions 5–6 bases away. This observation suggests that DNA solvent interactions link contacted and non-contacted base recognition by the protein. PMID:25429976
A novel class of plant-specific zinc-dependent DNA-binding protein that binds to A/T-rich DNA sequences

PubMed Central

Nagano, Yukio; Furuhashi, Hirofumi; Inaba, Takehito; Sasaki, Yukiko

2001-01-01

Complementary DNA encoding a DNA-binding protein, designated PLATZ1 (plant AT-rich sequence- and zinc-binding protein 1), was isolated from peas. The amino acid sequence of the protein is similar to those of other uncharacterized proteins predicted from the genome sequences of higher plants. However, no paralogous sequences have been found outside the plant kingdom. Multiple alignments among these paralogous proteins show that several cysteine and histidine residues are invariant, suggesting that these proteins are a novel class of zinc-dependent DNA-binding proteins with two distantly located regions, C-x2-H-x11-C-x2-C-x(4–5)-C-x2-C-x(3–7)-H-x2-H and C-x2-C-x(10–11)-C-x3-C. In an electrophoretic mobility shift assay, the zinc chelator 1,10-o-phenanthroline inhibited DNA binding, and two distant zinc-binding regions were required for DNA binding. A protein blot with 65ZnCl2 showed that both regions are required for zinc-binding activity. The PLATZ1 protein non-specifically binds to A/T-rich sequences, including the upstream region of the pea GTPase pra2 and plastocyanin petE genes. Expression of the PLATZ1 repressed those of the reporter constructs containing the coding sequence of luciferase gene driven by the cauliflower mosaic virus (CaMV) 35S90 promoter fused to the tandem repeat of the A/T-rich sequences. These results indicate that PLATZ1 is a novel class of plant-specific zinc-dependent DNA-binding protein responsible for A/T-rich sequence-mediated transcriptional repression. PMID:11600698
HMG-D is an architecture-specific protein that preferentially binds to DNA containing the dinucleotide TG.

PubMed Central

Churchill, M E; Jones, D N; Glaser, T; Hefner, H; Searles, M A; Travers, A A

1995-01-01

The high mobility group (HMG) protein HMG-D from Drosophila melanogaster is a highly abundant chromosomal protein that is closely related to the vertebrate HMG domain proteins HMG1 and HMG2. In general, chromosomal HMG domain proteins lack sequence specificity. However, using both NMR spectroscopy and standard biochemical techniques we show that binding of HMG-D to a single DNA site is sequence selective. The preferred duplex DNA binding site comprises at least 5 bp and contains the deformable dinucleotide TG embedded in A/T-rich sequences. The TG motif constitutes a common core element in the binding sites of the well-characterized sequence-specific HMG domain proteins. We show that a conserved aromatic residue in helix 1 of the HMG domain may be involved in recognition of this core sequence. In common with other HMG domain proteins HMG-D binds preferentially to DNA sites that are stably bent and underwound, therefore HMG-D can be considered an architecture-specific protein. Finally, we show that HMG-D bends DNA and may confer a superhelical DNA conformation at a natural DNA binding site in the Drosophila fushi tarazu scaffold-associated region. Images PMID:7720717
How proteins bind to DNA: target discrimination and dynamic sequence search by the telomeric protein TRF1

PubMed Central

2017-01-01

Abstract Target search as performed by DNA-binding proteins is a complex process, in which multiple factors contribute to both thermodynamic discrimination of the target sequence from overwhelmingly abundant off-target sites and kinetic acceleration of dynamic sequence interrogation. TRF1, the protein that binds to telomeric tandem repeats, faces an intriguing variant of the search problem where target sites are clustered within short fragments of chromosomal DNA. In this study, we use extensive (>0.5 ms in total) MD simulations to study the dynamical aspects of sequence-specific binding of TRF1 at both telomeric and non-cognate DNA. For the first time, we describe the spontaneous formation of a sequence-specific native protein–DNA complex in atomistic detail, and study the mechanism by which proteins avoid off-target binding while retaining high affinity for target sites. Our calculated free energy landscapes reproduce the thermodynamics of sequence-specific binding, while statistical approaches allow for a comprehensive description of intermediate stages of complex formation. PMID:28633355
Structure and DNA-Binding Sites of the SWI1 AT-rich Interaction Domain (ARID) Suggest Determinants for Sequence-Specific DNA Recognition

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kim, Suhkmann; Zhang, Ziming; Upchurch, Sean

2004-04-16

2 ARID is a homologous family of DNA-binding domains that occur in DNA binding proteins from a wide variety of species, ranging from yeast to nematodes, insects, mammals and plants. SWI1, a member of the SWI/SNF protein complex that is involved in chromatin remodeling during transcription, contains the ARID motif. The ARID domain of human SWI1 (also known as p270) does not select for a specific DNA sequence from a random sequence pool. The lack of sequence specificity shown by the SWI1 ARID domain stands in contrast to the other characterized ARID domains, which recognize specific AT-rich sequences. We havemore » solved the three-dimensional structure of human SWI1 ARID using solution NMR methods. In addition, we have characterized non-specific DNA-binding by the SWI1 ARID domain. Results from this study indicate that a flexible long internal loop in ARID motif is likely to be important for sequence specific DNA-recognition. The structure of human SWI1 ARID domain also represents a distinct structural subfamily. Studies of ARID indicate that boundary of the DNA binding structural and functional domains can extend beyond the sequence homologous region in a homologous family of proteins. Structural studies of homologous domains such as ARID family of DNA-binding domains should provide information to better predict the boundary of structural and functional domains in structural genomic studies. Key Words: ARID, SWI1, NMR, structural genomics, protein-DNA interaction.« less
Molecular cloning of MSSP-2, a c-myc gene single-strand binding protein: characterization of binding specificity and DNA replication activity.

PubMed Central

Takai, T; Nishita, Y; Iguchi-Ariga, S M; Ariga, H

1994-01-01

We have previously reported the human cDNA encoding MSSP-1, a sequence-specific double- and single-stranded DNA binding protein [Negishi, Nishita, Saëgusa, Kakizaki, Galli, Kihara, Tamai, Miyajima, Iguchi-Ariga and Ariga (1994) Oncogene, 9, 1133-1143]. MSSP-1 binds to a DNA replication origin/transcriptional enhancer of the human c-myc gene and has turned out to be identical with Scr2, a human protein which complements the defect of cdc2 kinase in S.pombe [Kataoka and Nojima (1994) Nucleic Acid Res., 22, 2687-2693]. We have cloned the cDNA for MSSP-2, another member of the MSSP family of proteins. The MSSP-2 cDNA shares highly homologous sequences with MSSP-1 cDNA, except for the insertion of 48 bp coding 16 amino acids near the C-terminus. Like MSSP-1, MSSP-2 has RNP-1 consensus sequences. The results of the experiments using bacterially expressed MSSP-2, and its deletion mutants, as histidine fusion proteins suggested that the binding specificity of MSSP-2 to double- and single-stranded DNA is the same as that of MSSP-1, and that the RNP consensus sequences are required for the DNA binding of the protein. MSSP-2 stimulated the DNA replication of an SV40-derived plasmid containing the binding sequence for MSSP-1 or -2. MSSP-2 is hence suggested to play an important role in regulation of DNA replication. Images PMID:7838710
Context influences on TALE–DNA binding revealed by quantitative profiling

PubMed Central

Rogers, Julia M.; Barrera, Luis A.; Reyon, Deepak; Sander, Jeffry D.; Kellis, Manolis; Joung, J Keith; Bulyk, Martha L.

2015-01-01

Transcription activator-like effector (TALE) proteins recognize DNA using a seemingly simple DNA-binding code, which makes them attractive for use in genome engineering technologies that require precise targeting. Although this code is used successfully to design TALEs to target specific sequences, off-target binding has been observed and is difficult to predict. Here we explore TALE–DNA interactions comprehensively by quantitatively assaying the DNA-binding specificities of 21 representative TALEs to ∼5,000–20,000 unique DNA sequences per protein using custom-designed protein-binding microarrays (PBMs). We find that protein context features exert significant influences on binding. Thus, the canonical recognition code does not fully capture the complexity of TALE–DNA binding. We used the PBM data to develop a computational model, Specificity Inference For TAL-Effector Design (SIFTED), to predict the DNA-binding specificity of any TALE. We provide SIFTED as a publicly available web tool that predicts potential genomic off-target sites for improved TALE design. PMID:26067805
Context influences on TALE-DNA binding revealed by quantitative profiling.

PubMed

Rogers, Julia M; Barrera, Luis A; Reyon, Deepak; Sander, Jeffry D; Kellis, Manolis; Joung, J Keith; Bulyk, Martha L

2015-06-11

Transcription activator-like effector (TALE) proteins recognize DNA using a seemingly simple DNA-binding code, which makes them attractive for use in genome engineering technologies that require precise targeting. Although this code is used successfully to design TALEs to target specific sequences, off-target binding has been observed and is difficult to predict. Here we explore TALE-DNA interactions comprehensively by quantitatively assaying the DNA-binding specificities of 21 representative TALEs to ∼5,000-20,000 unique DNA sequences per protein using custom-designed protein-binding microarrays (PBMs). We find that protein context features exert significant influences on binding. Thus, the canonical recognition code does not fully capture the complexity of TALE-DNA binding. We used the PBM data to develop a computational model, Specificity Inference For TAL-Effector Design (SIFTED), to predict the DNA-binding specificity of any TALE. We provide SIFTED as a publicly available web tool that predicts potential genomic off-target sites for improved TALE design.
Sequence specificity of single-stranded DNA-binding proteins: a novel DNA microarray approach

PubMed Central

Morgan, Hugh P.; Estibeiro, Peter; Wear, Martin A.; Max, Klaas E.A.; Heinemann, Udo; Cubeddu, Liza; Gallagher, Maurice P.; Sadler, Peter J.; Walkinshaw, Malcolm D.

2007-01-01

We have developed a novel DNA microarray-based approach for identification of the sequence-specificity of single-stranded nucleic-acid-binding proteins (SNABPs). For verification, we have shown that the major cold shock protein (CspB) from Bacillus subtilis binds with high affinity to pyrimidine-rich sequences, with a binding preference for the consensus sequence, 5′-GTCTTTG/T-3′. The sequence was modelled onto the known structure of CspB and a cytosine-binding pocket was identified, which explains the strong preference for a cytosine base at position 3. This microarray method offers a rapid high-throughput approach for determining the specificity and strength of ss DNA–protein interactions. Further screening of this newly emerging family of transcription factors will help provide an insight into their cellular function. PMID:17488853
Isolation from genomic DNA of sequences binding specific regulatory proteins by the acceleration of protein electrophoretic mobility upon DNA binding.

PubMed

Subrahmanyam, S; Cronan, J E

1999-01-21

We report an efficient and flexible in vitro method for the isolation of genomic DNA sequences that are the binding targets of a given DNA binding protein. This method takes advantage of the fact that binding of a protein to a DNA molecule generally increases the rate of migration of the protein in nondenaturing gel electrophoresis. By the use of a radioactively labeled DNA-binding protein and nonradioactive DNA coupled with PCR amplification from gel slices, we show that specific binding sites can be isolated from Escherichia coli genomic DNA. We have applied this method to isolate a binding site for FadR, a global regulator of fatty acid metabolism in E. coli. We have also isolated a second binding site for BirA, the biotin operon repressor/biotin ligase, from the E. coli genome that has a very low binding efficiency compared with the bio operator region.
Molecular cloning and analysis of Schizosaccharomyces pombe Reb1p: sequence-specific recognition of two sites in the far upstream rDNA intergenic spacer.

PubMed Central

Zhao, A; Guo, A; Liu, Z; Pape, L

1997-01-01

The coding sequences for a Schizosaccharomyces pombe sequence-specific DNA binding protein, Reb1p, have been cloned. The predicted S. pombe Reb1p is 24-29% identical to mouse TTF-1 (transcription termination factor-1) and Saccharomyces cerevisiae REB1 protein, both of which direct termination of RNA polymerase I catalyzed transcripts. The S.pombe Reb1 cDNA encodes a predicted polypeptide of 504 amino acids with a predicted molecular weight of 58.4 kDa. The S. pombe Reb1p is unusual in that the bipartite DNA binding motif identified originally in S.cerevisiae and Klyveromyces lactis REB1 proteins is uninterrupted and thus S.pombe Reb1p may contain the smallest natural REB1 homologous DNA binding domain. Its genomic coding sequences were shown to be interrupted by two introns. A recombinant histidine-tagged Reb1 protein bearing the rDNA binding domain has two homologous, sequence-specific binding sites in the S. pomber DNA intergenic spacer, located between 289 and 480 nt downstream of the end of the approximately 25S rRNA coding sequences. Each binding site is 13-14 bp downstream of two of the three proposed in vivo termination sites. The core of this 17 bp site, AGGTAAGGGTAATGCAC, is specifically protected by Reb1p in footprinting analysis. PMID:9016645
TFBSshape: a motif database for DNA shape features of transcription factor binding sites.

PubMed

Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W; Gordân, Raluca; Rohs, Remo

2014-01-01

Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein-DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone.
APOBEC3G Interacts with ssDNA by Two Modes: AFM Studies

NASA Astrophysics Data System (ADS)

Shlyakhtenko, Luda S.; Dutta, Samrat; Banga, Jaspreet; Li, Ming; Harris, Reuben S.; Lyubchenko, Yuri L.

2015-10-01

APOBEC3G (A3G) protein has antiviral activity against HIV and other pathogenic retroviruses. A3G has two domains: a catalytic C-terminal domain (CTD) that deaminates cytidine, and a N-terminal domain (NTD) that binds to ssDNA. Although abundant information exists about the biological activities of A3G protein, the interplay between sequence specific deaminase activity and A3G binding to ssDNA remains controversial. We used the topographic imaging and force spectroscopy modalities of Atomic Force Spectroscopy (AFM) to characterize the interaction of A3G protein with deaminase specific and nonspecific ssDNA substrates. AFM imaging demonstrated that A3G has elevated affinity for deaminase specific ssDNA than for nonspecific ssDNA. AFM force spectroscopy revealed two distinct binding modes by which A3G interacts with ssDNA. One mode requires sequence specificity, as demonstrated by stronger and more stable complexes with deaminase specific ssDNA than with nonspecific ssDNA. Overall these observations enforce prior studies suggesting that both domains of A3G contribute to the sequence specific binding of ssDNA.
APOBEC3G Interacts with ssDNA by Two Modes: AFM Studies.

PubMed

Shlyakhtenko, Luda S; Dutta, Samrat; Banga, Jaspreet; Li, Ming; Harris, Reuben S; Lyubchenko, Yuri L

2015-10-27

APOBEC3G (A3G) protein has antiviral activity against HIV and other pathogenic retroviruses. A3G has two domains: a catalytic C-terminal domain (CTD) that deaminates cytidine, and a N-terminal domain (NTD) that binds to ssDNA. Although abundant information exists about the biological activities of A3G protein, the interplay between sequence specific deaminase activity and A3G binding to ssDNA remains controversial. We used the topographic imaging and force spectroscopy modalities of Atomic Force Spectroscopy (AFM) to characterize the interaction of A3G protein with deaminase specific and nonspecific ssDNA substrates. AFM imaging demonstrated that A3G has elevated affinity for deaminase specific ssDNA than for nonspecific ssDNA. AFM force spectroscopy revealed two distinct binding modes by which A3G interacts with ssDNA. One mode requires sequence specificity, as demonstrated by stronger and more stable complexes with deaminase specific ssDNA than with nonspecific ssDNA. Overall these observations enforce prior studies suggesting that both domains of A3G contribute to the sequence specific binding of ssDNA.
A single amino-acid substitution in the Ets domain alters core DNA binding specificity of Ets1 to that of the related transcription factors Elf1 and E74.

PubMed

Bosselut, R; Levin, J; Adjadj, E; Ghysdael, J

1993-11-11

Ets proteins form a family of sequence specific DNA binding proteins which bind DNA through a 85 aminoacids conserved domain, the Ets domain, whose sequence is unrelated to any other characterized DNA binding domain. Unlike all other known Ets proteins, which bind specific DNA sequences centered over either GGAA or GGAT core motifs, E74 and Elf1 selectively bind to GGAA corecontaining sites. Elf1 and E74 differ from other Ets proteins in three residues located in an otherwise highly conserved region of the Ets domain, referred to as conserved region III (CRIII). We show that a restricted selectivity for GGAA core-containing sites could be conferred to Ets1 upon changing a single lysine residue within CRIII to the threonine found in Elf1 and E74 at this position. Conversely, the reciprocal mutation in Elf1 confers to this protein the ability to bind to GGAT core containing EBS. This, together with the fact that mutation of two invariant arginine residues in CRIII abolishes DNA binding, indicates that CRIII plays a key role in Ets domain recognition of the GGAA/T core motif and lead us to discuss a model of Ets proteins--core motif interaction.
Role of DNA conformation & energetic insights in Msx-1-DNA recognition as revealed by molecular dynamics studies on specific and nonspecific complexes.

PubMed

Kachhap, Sangita; Singh, Balvinder

2015-01-01

In most of homeodomain-DNA complexes, glutamine or lysine is present at 50th position and interacts with 5th and 6th nucleotide of core recognition region. Molecular dynamics simulations of Msx-1-DNA complex (Q50-TG) and its variant complexes, that is specific (Q50K-CC), nonspecific (Q50-CC) having mutation in DNA and (Q50K-TG) in protein, have been carried out. Analysis of protein-DNA interactions and structure of DNA in specific and nonspecific complexes show that amino acid residues use sequence-dependent shape of DNA to interact. The binding free energies of all four complexes were analysed to define role of amino acid residue at 50th position in terms of binding strength considering the variation in DNA on stability of protein-DNA complexes. The order of stability of protein-DNA complexes shows that specific complexes are more stable than nonspecific ones. Decomposition analysis shows that N-terminal amino acid residues have been found to contribute maximally in binding free energy of protein-DNA complexes. Among specific protein-DNA complexes, K50 contributes more as compared to Q50 towards binding free energy in respective complexes. The sequence dependence of local conformation of DNA enables Q50/Q50K to make hydrogen bond with nucleotide(s) of DNA. The changes in amino acid sequence of protein are accommodated and stabilized around TAAT core region of DNA having variation in nucleotides.
DNA binding specificity of the basic-helix-loop-helix protein MASH-1.

PubMed

Meierhan, D; el-Ariss, C; Neuenschwander, M; Sieber, M; Stackhouse, J F; Allemann, R K

1995-09-05

Despite the high degree of sequence similarity in their basic-helix-loop-helix (BHLH) domains, MASH-1 and MyoD are involved in different biological processes. In order to define possible differences between the DNA binding specificities of these two proteins, we investigated the DNA binding properties of MASH-1 by circular dichroism spectroscopy and by electrophoretic mobility shift assays (EMSA). Upon binding to DNA, the BHLH domain of MASH-1 underwent a conformational change from a mainly unfolded to a largely alpha-helical form, and surprisingly, this change was independent of the specific DNA sequence. The same conformational transition could be induced by the addition of 20% 2,2,2-trifluoroethanol. The apparent dissociation constants (KD) of the complexes of full-length MASH-1 with various oligonucleotides were determined from half-saturation points in EMSAs. MASH-1 bound as a dimer to DNA sequences containing an E-box with high affinity KD = 1.4-4.1 x 10(-14) M2). However, the specificity of DNA binding was low. The dissociation constant for the complex between MASH-1 and the highest affinity E-box sequence (KD = 1.4 x 10(-14) M2) was only a factor of 10 smaller than for completely unrelated DNA sequences (KD = approximately 1 x 10(-13) M2). The DNA binding specificity of MASH-1 was not significantly increased by the formation of an heterodimer with the ubiquitous E12 protein. MASH-1 and MyoD displayed similar binding site preferences, suggesting that their different target gene specificities cannot be explained solely by differential DNA binding. An explanation for these findings is provided on the basis of the known crystal structure of the BHLH domain of MyoD.
Differences in DNA Binding Specificity of Floral Homeotic Protein Complexes Predict Organ-Specific Target Genes.

PubMed

Smaczniak, Cezary; Muiño, Jose M; Chen, Dijun; Angenent, Gerco C; Kaufmann, Kerstin

2017-08-01

Floral organ identities in plants are specified by the combinatorial action of homeotic master regulatory transcription factors. However, how these factors achieve their regulatory specificities is still largely unclear. Genome-wide in vivo DNA binding data show that homeotic MADS domain proteins recognize partly distinct genomic regions, suggesting that DNA binding specificity contributes to functional differences of homeotic protein complexes. We used in vitro systematic evolution of ligands by exponential enrichment followed by high-throughput DNA sequencing (SELEX-seq) on several floral MADS domain protein homo- and heterodimers to measure their DNA binding specificities. We show that specification of reproductive organs is associated with distinct binding preferences of a complex formed by SEPALLATA3 and AGAMOUS. Binding specificity is further modulated by different binding site spacing preferences. Combination of SELEX-seq and genome-wide DNA binding data allows differentiation between targets in specification of reproductive versus perianth organs in the flower. We validate the importance of DNA binding specificity for organ-specific gene regulation by modulating promoter activity through targeted mutagenesis. Our study shows that intrafamily protein interactions affect DNA binding specificity of floral MADS domain proteins. Differential DNA binding of MADS domain protein complexes plays a role in the specificity of target gene regulation. © 2017 American Society of Plant Biologists. All rights reserved.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats

PubMed Central

de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

2015-01-01

Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. PMID:26481363

APOBEC3G Interacts with ssDNA by Two Modes: AFM Studies

PubMed Central

Shlyakhtenko, Luda S.; Dutta, Samrat; Banga, Jaspreet; Li, Ming; Harris, Reuben S.; Lyubchenko, Yuri L.

2015-01-01

APOBEC3G (A3G) protein has antiviral activity against HIV and other pathogenic retroviruses. A3G has two domains: a catalytic C-terminal domain (CTD) that deaminates cytidine, and a N-terminal domain (NTD) that binds to ssDNA. Although abundant information exists about the biological activities of A3G protein, the interplay between sequence specific deaminase activity and A3G binding to ssDNA remains controversial. We used the topographic imaging and force spectroscopy modalities of Atomic Force Spectroscopy (AFM) to characterize the interaction of A3G protein with deaminase specific and nonspecific ssDNA substrates. AFM imaging demonstrated that A3G has elevated affinity for deaminase specific ssDNA than for nonspecific ssDNA. AFM force spectroscopy revealed two distinct binding modes by which A3G interacts with ssDNA. One mode requires sequence specificity, as demonstrated by stronger and more stable complexes with deaminase specific ssDNA than with nonspecific ssDNA. Overall these observations enforce prior studies suggesting that both domains of A3G contribute to the sequence specific binding of ssDNA. PMID:26503602
Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain

PubMed Central

de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

2014-01-01

The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. PMID:24792163
A statistical model for investigating binding probabilities of DNA nucleotide sequences using microarrays.

PubMed

Lee, Mei-Ling Ting; Bulyk, Martha L; Whitmore, G A; Church, George M

2002-12-01

There is considerable scientific interest in knowing the probability that a site-specific transcription factor will bind to a given DNA sequence. Microarray methods provide an effective means for assessing the binding affinities of a large number of DNA sequences as demonstrated by Bulyk et al. (2001, Proceedings of the National Academy of Sciences, USA 98, 7158-7163) in their study of the DNA-binding specificities of Zif268 zinc fingers using microarray technology. In a follow-up investigation, Bulyk, Johnson, and Church (2002, Nucleic Acid Research 30, 1255-1261) studied the interdependence of nucleotides on the binding affinities of transcription proteins. Our article is motivated by this pair of studies. We present a general statistical methodology for analyzing microarray intensity measurements reflecting DNA-protein interactions. The log probability of a protein binding to a DNA sequence on an array is modeled using a linear ANOVA model. This model is convenient because it employs familiar statistical concepts and procedures and also because it is effective for investigating the probability structure of the binding mechanism.
Non-B-Form DNA Is Enriched at Centromeres

PubMed Central

Henikoff, Steven

2018-01-01

Abstract Animal and plant centromeres are embedded in repetitive “satellite” DNA, but are thought to be epigenetically specified. To define genetic characteristics of centromeres, we surveyed satellite DNA from diverse eukaryotes and identified variation in <10-bp dyad symmetries predicted to adopt non-B-form conformations. Organisms lacking centromeric dyad symmetries had binding sites for sequence-specific DNA-binding proteins with DNA-bending activity. For example, human and mouse centromeres are depleted for dyad symmetries, but are enriched for non-B-form DNA and are associated with binding sites for the conserved DNA-binding protein CENP-B, which is required for artificial centromere function but is paradoxically nonessential. We also detected dyad symmetries and predicted non-B-form DNA structures at neocentromeres, which form at ectopic loci. We propose that centromeres form at non-B-form DNA because of dyad symmetries or are strengthened by sequence-specific DNA binding proteins. This may resolve the CENP-B paradox and provide a general basis for centromere specification. PMID:29365169
Functional specificity of a Hox protein mediated by the recognition of minor groove structure.

PubMed

Joshi, Rohit; Passner, Jonathan M; Rohs, Remo; Jain, Rinku; Sosinsky, Alona; Crickmore, Michael A; Jacob, Vinitha; Aggarwal, Aneel K; Honig, Barry; Mann, Richard S

2007-11-02

The recognition of specific DNA-binding sites by transcription factors is a critical yet poorly understood step in the control of gene expression. Members of the Hox family of transcription factors bind DNA by making nearly identical major groove contacts via the recognition helices of their homeodomains. In vivo specificity, however, often depends on extended and unstructured regions that link Hox homeodomains to a DNA-bound cofactor, Extradenticle (Exd). Using a combination of structure determination, computational analysis, and in vitro and in vivo assays, we show that Hox proteins recognize specific Hox-Exd binding sites via residues located in these extended regions that insert into the minor groove but only when presented with the correct DNA sequence. Our results suggest that these residues, which are conserved in a paralog-specific manner, confer specificity by recognizing a sequence-dependent DNA structure instead of directly reading a specific DNA sequence.
Global Analysis of Transcription Factor-Binding Sites in Yeast Using ChIP-Seq

PubMed Central

Lefrançois, Philippe; Gallagher, Jennifer E. G.; Snyder, Michael

2016-01-01

Transcription factors influence gene expression through their ability to bind DNA at specific regulatory elements. Specific DNA-protein interactions can be isolated through the chromatin immunoprecipitation (ChIP) procedure, in which DNA fragments bound by the protein of interest are recovered. ChIP is followed by high-throughput DNA sequencing (Seq) to determine the genomic provenance of ChIP DNA fragments and their relative abundance in the sample. This chapter describes a ChIP-Seq strategy adapted for budding yeast to enable the genome-wide characterization of binding sites of transcription factors (TFs) and other DNA-binding proteins in an efficient and cost-effective way. Yeast strains with epitope-tagged TFs are most commonly used for ChIP-Seq, along with their matching untagged control strains. The initial step of ChIP involves the cross-linking of DNA and proteins. Next, yeast cells are lysed and sonicated to shear chromatin into smaller fragments. An antibody against an epitope-tagged TF is used to pull down chromatin complexes containing DNA and the TF of interest. DNA is then purified and proteins degraded. Specific barcoded adapters for multiplex DNA sequencing are ligated to ChIP DNA. Short DNA sequence reads (28–36 base pairs) are parsed according to the barcode and aligned against the yeast reference genome, thus generating a nucleotide-resolution map of transcription factor-binding sites and their occupancy. PMID:25213249
Identification of high-specificity H-NS binding site in LEE5 promoter of enteropathogenic Esherichia coli (EPEC).

PubMed

Bhat, Abhay Prasad; Shin, Minsang; Choy, Hyon E

2014-07-01

Histone-like nucleoid structuring protein (H-NS) is a small but abundant protein present in enteric bacteria and is involved in compaction of the DNA and regulation of the transcription. Recent reports have suggested that H-NS binds to a specific AT rich DNA sequence than to intrinsically curved DNA in sequence independent manner. We detected two high-specificity H-NS binding sites in LEE5 promoter of EPEC centered at -110 and -138, which were close to the proposed consensus H-NS binding motif. To identify H-NS binding sequence in LEE5 promoter, we took a random mutagenesis approach and found the mutations at around -138 were specifically defective in the regulation by H-NS. It was concluded that H-NS exerts maximum repression via the specific sequence at around -138 and subsequently contacts a subunit of RNAP through oligomerization.
Influence of quasi-specific sites on kinetics of target DNA search by a sequence-specific DNA-binding protein.

PubMed

Kemme, Catherine A; Esadze, Alexandre; Iwahara, Junji

2015-11-10

Functions of transcription factors require formation of specific complexes at particular sites in cis-regulatory elements of genes. However, chromosomal DNA contains numerous sites that are similar to the target sequences recognized by transcription factors. The influence of such "quasi-specific" sites on functions of the transcription factors is not well understood at present by experimental means. In this work, using fluorescence methods, we have investigated the influence of quasi-specific DNA sites on the efficiency of target location by the zinc finger DNA-binding domain of the inducible transcription factor Egr-1, which recognizes a 9 bp sequence. By stopped-flow assays, we measured the kinetics of Egr-1's association with a target site on 143 bp DNA in the presence of various competitor DNAs, including nonspecific and quasi-specific sites. The presence of quasi-specific sites on competitor DNA significantly decelerated the target association by the Egr-1 protein. The impact of the quasi-specific sites depended strongly on their affinity, their concentration, and the degree of their binding to the protein. To quantitatively describe the kinetic impact of the quasi-specific sites, we derived an analytical form of the apparent kinetic rate constant for the target association and used it for fitting to the experimental data. Our kinetic data with calf thymus DNA as a competitor suggested that there are millions of high-affinity quasi-specific sites for Egr-1 among the 3 billion bp of genomic DNA. This study quantitatively demonstrates that naturally abundant quasi-specific sites on DNA can considerably impede the target search processes of sequence-specific DNA-binding proteins.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

PubMed

de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

2015-11-16

Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Influence of Quasi-Specific Sites on Kinetics of Target DNA Search by a Sequence-Specific DNA-Binding Protein

PubMed Central

2015-01-01

Functions of transcription factors require formation of specific complexes at particular sites in cis-regulatory elements of genes. However, chromosomal DNA contains numerous sites that are similar to the target sequences recognized by transcription factors. The influence of such “quasi-specific” sites on functions of the transcription factors is not well understood at present by experimental means. In this work, using fluorescence methods, we have investigated the influence of quasi-specific DNA sites on the efficiency of target location by the zinc finger DNA-binding domain of the inducible transcription factor Egr-1, which recognizes a 9 bp sequence. By stopped-flow assays, we measured the kinetics of Egr-1’s association with a target site on 143 bp DNA in the presence of various competitor DNAs, including nonspecific and quasi-specific sites. The presence of quasi-specific sites on competitor DNA significantly decelerated the target association by the Egr-1 protein. The impact of the quasi-specific sites depended strongly on their affinity, their concentration, and the degree of their binding to the protein. To quantitatively describe the kinetic impact of the quasi-specific sites, we derived an analytical form of the apparent kinetic rate constant for the target association and used it for fitting to the experimental data. Our kinetic data with calf thymus DNA as a competitor suggested that there are millions of high-affinity quasi-specific sites for Egr-1 among the 3 billion bp of genomic DNA. This study quantitatively demonstrates that naturally abundant quasi-specific sites on DNA can considerably impede the target search processes of sequence-specific DNA-binding proteins. PMID:26502071
Does TATA matter? A structural exploration of the selectivity determinants in its complexes with TATA box-binding protein.

PubMed Central

Pastor, N; Pardo, L; Weinstein, H

1997-01-01

The binding of the TATA box-binding protein (TBP) to a TATA sequence in DNA is essential for eukaryotic basal transcription. TBP binds in the minor groove of DNA, causing a large distortion of the DNA helix. Given the apparent stereochemical equivalence of AT and TA basepairs in the minor groove, DNA deformability must play a significant role in binding site selection, because not all AT-rich sequences are bound effectively by TBP. To gain insight into the precise role that the properties of the TATA sequence have in determining the specificity of the DNA substrates of TBP, the solution structure and dynamics of seven DNA dodecamers have been studied by using molecular dynamics simulations. The analysis of the structural properties of basepair steps in these TATA sequences suggests a reason for the preference for alternating pyrimidine-purine (YR) sequences, but indicates that these properties cannot be the sole determinant of the sequence specificity of TBP. Rather, recognition depends on the interplay between the inherent deformability of the DNA and steric complementarity at the molecular interface. Images FIGURE 2 PMID:9251783
A rapid, generally applicable method to engineer zinc fingers illustrated by targeting the HIV-1 promoter.

PubMed

Isalan, M; Klug, A; Choo, Y

2001-07-01

DNA-binding domains with predetermined sequence specificity are engineered by selection of zinc finger modules using phage display, allowing the construction of customized transcription factors. Despite remarkable progress in this field, the available protein-engineering methods are deficient in many respects, thus hampering the applicability of the technique. Here we present a rapid and convenient method that can be used to design zinc finger proteins against a variety of DNA-binding sites. This is based on a pair of pre-made zinc finger phage-display libraries, which are used in parallel to select two DNA-binding domains each of which recognizes given 5 base pair sequences, and whose products are recombined to produce a single protein that recognizes a composite (9 base pair) site of predefined sequence. Engineering using this system can be completed in less than two weeks and yields proteins that bind sequence-specifically to DNA with Kd values in the nanomolar range. To illustrate the technique, we have selected seven different proteins to bind various regions of the human immunodeficiency virus 1 (HIV-1) promoter.
Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain.

PubMed

de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

2014-06-01

The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Predicting the binding preference of transcription factors to individual DNA k-mers.

PubMed

Alleyne, Trevis M; Peña-Castillo, Lourdes; Badis, Gwenael; Talukder, Shaheynoor; Berger, Michael F; Gehrke, Andrew R; Philippakis, Anthony A; Bulyk, Martha L; Morris, Quaid D; Hughes, Timothy R

2009-04-15

Recognition of specific DNA sequences is a central mechanism by which transcription factors (TFs) control gene expression. Many TF-binding preferences, however, are unknown or poorly characterized, in part due to the difficulty associated with determining their specificity experimentally, and an incomplete understanding of the mechanisms governing sequence specificity. New techniques that estimate the affinity of TFs to all possible k-mers provide a new opportunity to study DNA-protein interaction mechanisms, and may facilitate inference of binding preferences for members of a given TF family when such information is available for other family members. We employed a new dataset consisting of the relative preferences of mouse homeodomains for all eight-base DNA sequences in order to ask how well we can predict the binding profiles of homeodomains when only their protein sequences are given. We evaluated a panel of standard statistical inference techniques, as well as variations of the protein features considered. Nearest neighbour among functionally important residues emerged among the most effective methods. Our results underscore the complexity of TF-DNA recognition, and suggest a rational approach for future analyses of TF families.
Impact of cadmium, cobalt and nickel on sequence-specific DNA binding of p63 and p73 in vitro and in cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

Adámik, Matej; Bažantová, Pavla; Department of Biology and Ecology, Faculty of Science, University of Ostrava, Chittussiho 10, 701 03 Ostrava

Highlights: • DNA binding of p53 family core domains is inhibited by cadmium, cobalt and nickel. • Binding to DNA protects p53 family core domains from metal induced inhibition. • Cadmium, cobalt and nickel induced inhibition was reverted by EDTA in vitro. - Abstract: Site-specific DNA recognition and binding activity belong to common attributes of all three members of tumor suppressor p53 family proteins: p53, p63 and p73. It was previously shown that heavy metals can affect p53 conformation, sequence-specific binding and suppress p53 response to DNA damage. Here we report for the first time that cadmium, nickel and cobalt,more » which have already been shown to disturb various DNA repair mechanisms, can also influence p63 and p73 sequence-specific DNA binding activity and transactivation of p53 family target genes. Based on results of electrophoretic mobility shift assay and luciferase reporter assay, we conclude that cadmium inhibits sequence-specific binding of all three core domains to p53 consensus sequences and abolishes transactivation of several promoters (e.g. BAX and MDM2) by 50 μM concentrations. In the presence of specific DNA, all p53 family core domains were partially protected against loss of DNA binding activity due to cadmium treatment. Effective cadmium concentration to abolish DNA–protein interactions was about two times higher for p63 and p73 proteins than for p53. Furthermore, we detected partial reversibility of cadmium inhibition for all p53 family members by EDTA. DTT was able to reverse cadmium inhibition only for p53 and p73. Nickel and cobalt abolished DNA–p53 interaction at sub-millimolar concentrations while inhibition of p63 and p73 DNA binding was observed at millimolar concentrations. In summary, cadmium strongly inhibits p53, p63 and p73 DNA binding in vitro and in cells in comparison to nickel and cobalt. The role of cadmium inhibition of p53 tumor suppressor family in carcinogenesis is discussed.« less
A calmodulin-like protein (LCALA) is a new Leishmania amazonensis candidate for telomere end-binding protein.

PubMed

Morea, Edna G O; Viviescas, Maria Alejandra; Fernandes, Carlos A H; Matioli, Fabio F; Lira, Cristina B B; Fernandez, Maribel F; Moraes, Barbara S; da Silva, Marcelo S; Storti, Camila B; Fontes, Marcos R M; Cano, Maria Isabel N

2017-11-01

Leishmania spp. telomeres are composed of 5'-TTAGGG-3' repeats associated with proteins. We have previously identified LaRbp38 and LaRPA-1 as proteins that bind the G-rich telomeric strand. At that time, we had also partially characterized a protein: DNA complex, named LaGT1, but we could not identify its protein component. Using protein-DNA interaction and competition assays, we confirmed that LaGT1 is highly specific to the G-rich telomeric single-stranded DNA. Three protein bands, with LaGT1 activity, were isolated from affinity-purified protein extracts in-gel digested, and sequenced de novo using mass spectrometry analysis. In silico analysis of the digested peptide identified them as a putative calmodulin with sequences identical to the T. cruzi calmodulin. In the Leishmania genome, the calmodulin ortholog is present in three identical copies. We cloned and sequenced one of the gene copies, named it LCalA, and obtained the recombinant protein. Multiple sequence alignment and molecular modeling showed that LCalA shares homology to most eukaryotes calmodulin. In addition, we demonstrated that LCalA is nuclear, partially co-localizes with telomeres and binds in vivo the G-rich telomeric strand. Recombinant LCalA can bind specifically and with relative affinity to the G-rich telomeric single-strand and to a 3'G-overhang, and DNA binding is calcium dependent. We have described a novel candidate component of Leishmania telomeres, LCalA, a nuclear calmodulin that binds the G-rich telomeric strand with high specificity and relative affinity, in a calcium-dependent manner. LCalA is the first reported calmodulin that binds in vivo telomeric DNA. Copyright © 2017 Elsevier B.V. All rights reserved.
Quantitative characterization of conformational-specific protein-DNA binding using a dual-spectral interferometric imaging biosensor

NASA Astrophysics Data System (ADS)

Zhang, Xirui; Daaboul, George G.; Spuhler, Philipp S.; Dröge, Peter; Ünlü, M. Selim

2016-03-01

DNA-binding proteins play crucial roles in the maintenance and functions of the genome and yet, their specific binding mechanisms are not fully understood. Recently, it was discovered that DNA-binding proteins recognize specific binding sites to carry out their functions through an indirect readout mechanism by recognizing and capturing DNA conformational flexibility and deformation. High-throughput DNA microarray-based methods that provide large-scale protein-DNA binding information have shown effective and comprehensive analysis of protein-DNA binding affinities, but do not provide information of DNA conformational changes in specific protein-DNA complexes. Building on the high-throughput capability of DNA microarrays, we demonstrate a quantitative approach that simultaneously measures the amount of protein binding to DNA and nanometer-scale DNA conformational change induced by protein binding in a microarray format. Both measurements rely on spectral interferometry on a layered substrate using a single optical instrument in two distinct modalities. In the first modality, we quantitate the amount of binding of protein to surface-immobilized DNA in each DNA spot using a label-free spectral reflectivity technique that accurately measures the surface densities of protein and DNA accumulated on the substrate. In the second modality, for each DNA spot, we simultaneously measure DNA conformational change using a fluorescence vertical sectioning technique that determines average axial height of fluorophores tagged to specific nucleotides of the surface-immobilized DNA. The approach presented in this paper, when combined with current high-throughput DNA microarray-based technologies, has the potential to serve as a rapid and simple method for quantitative and large-scale characterization of conformational specific protein-DNA interactions.DNA-binding proteins play crucial roles in the maintenance and functions of the genome and yet, their specific binding mechanisms are not fully understood. Recently, it was discovered that DNA-binding proteins recognize specific binding sites to carry out their functions through an indirect readout mechanism by recognizing and capturing DNA conformational flexibility and deformation. High-throughput DNA microarray-based methods that provide large-scale protein-DNA binding information have shown effective and comprehensive analysis of protein-DNA binding affinities, but do not provide information of DNA conformational changes in specific protein-DNA complexes. Building on the high-throughput capability of DNA microarrays, we demonstrate a quantitative approach that simultaneously measures the amount of protein binding to DNA and nanometer-scale DNA conformational change induced by protein binding in a microarray format. Both measurements rely on spectral interferometry on a layered substrate using a single optical instrument in two distinct modalities. In the first modality, we quantitate the amount of binding of protein to surface-immobilized DNA in each DNA spot using a label-free spectral reflectivity technique that accurately measures the surface densities of protein and DNA accumulated on the substrate. In the second modality, for each DNA spot, we simultaneously measure DNA conformational change using a fluorescence vertical sectioning technique that determines average axial height of fluorophores tagged to specific nucleotides of the surface-immobilized DNA. The approach presented in this paper, when combined with current high-throughput DNA microarray-based technologies, has the potential to serve as a rapid and simple method for quantitative and large-scale characterization of conformational specific protein-DNA interactions. Electronic supplementary information (ESI) available: DNA sequences and nomenclature (Table 1S); SDS-PAGE assay of IHF stock solution (Fig. 1S); determination of the concentration of IHF stock solution by Bradford assay (Fig. 2S); equilibrium binding isotherm fitting results of other DNA sequences (Table 2S); calculation of dissociation constants (Fig. 3S, 4S; Table 2S); geometric model for quantitation of DNA bending angle induced by specific IHF binding (Fig. 4S); customized flow cell assembly (Fig. 5S); real-time measurement of average fluorophore height change by SSFM (Fig. 6S); summary of binding parameters obtained from additive isotherm model fitting (Table 3S); average surface densities of 10 dsDNA spots and bound IHF at equilibrium (Table 4S); effects of surface densities on the binding and bending of dsDNA (Tables 5S, 6S and Fig. 7S-10S). See DOI: 10.1039/c5nr06785e
DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

NASA Astrophysics Data System (ADS)

Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

1984-08-01

A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.
Specific and non-specific interactions of ParB with DNA: implications for chromosome segregation

PubMed Central

Taylor, James A.; Pastrana, Cesar L.; Butterer, Annika; Pernstich, Christian; Gwynn, Emma J.; Sobott, Frank; Moreno-Herrero, Fernando; Dillingham, Mark S.

2015-01-01

The segregation of many bacterial chromosomes is dependent on the interactions of ParB proteins with centromere-like DNA sequences called parS that are located close to the origin of replication. In this work, we have investigated the binding of Bacillus subtilis ParB to DNA in vitro using a variety of biochemical and biophysical techniques. We observe tight and specific binding of a ParB homodimer to the parS sequence. Binding of ParB to non-specific DNA is more complex and displays apparent positive co-operativity that is associated with the formation of larger, poorly defined, nucleoprotein complexes. Experiments with magnetic tweezers demonstrate that non-specific binding leads to DNA condensation that is reversible by protein unbinding or force. The condensed DNA structure is not well ordered and we infer that it is formed by many looping interactions between neighbouring DNA segments. Consistent with this view, ParB is also able to stabilize writhe in single supercoiled DNA molecules and to bridge segments from two different DNA molecules in trans. The experiments provide no evidence for the promotion of non-specific DNA binding and/or condensation events by the presence of parS sequences. The implications of these observations for chromosome segregation are discussed. PMID:25572315
Using FRET to Measure the Angle at Which a Protein Bends DNA: TBP Binding a TATA Box as a Model System

ERIC Educational Resources Information Center

Kugel, Jennifer F.

2008-01-01

An undergraduate biochemistry laboratory experiment that will teach the technique of fluorescence resonance energy transfer (FRET) while analyzing protein-induced DNA bending is described. The experiment uses the protein TATA binding protein (TBP), which is a general transcription factor that recognizes and binds specific DNA sequences known as…

TRF1 and TRF2 use different mechanisms to find telomeric DNA but share a novel mechanism to search for protein partners at telomeres.

PubMed

Lin, Jiangguo; Countryman, Preston; Buncher, Noah; Kaur, Parminder; E, Longjiang; Zhang, Yiyun; Gibson, Greg; You, Changjiang; Watkins, Simon C; Piehler, Jacob; Opresko, Patricia L; Kad, Neil M; Wang, Hong

2014-02-01

Human telomeres are maintained by the shelterin protein complex in which TRF1 and TRF2 bind directly to duplex telomeric DNA. How these proteins find telomeric sequences among a genome of billions of base pairs and how they find protein partners to form the shelterin complex remains uncertain. Using single-molecule fluorescence imaging of quantum dot-labeled TRF1 and TRF2, we study how these proteins locate TTAGGG repeats on DNA tightropes. By virtue of its basic domain TRF2 performs an extensive 1D search on nontelomeric DNA, whereas TRF1's 1D search is limited. Unlike the stable and static associations observed for other proteins at specific binding sites, TRF proteins possess reduced binding stability marked by transient binding (∼ 9-17 s) and slow 1D diffusion on specific telomeric regions. These slow diffusion constants yield activation energy barriers to sliding ∼ 2.8-3.6 κ(B)T greater than those for nontelomeric DNA. We propose that the TRF proteins use 1D sliding to find protein partners and assemble the shelterin complex, which in turn stabilizes the interaction with specific telomeric DNA. This 'tag-team proofreading' represents a more general mechanism to ensure a specific set of proteins interact with each other on long repetitive specific DNA sequences without requiring external energy sources.
STAT1:DNA sequence-dependent binding modulation by phosphorylation, protein:protein interactions and small-molecule inhibition

PubMed Central

Bonham, Andrew J.; Wenta, Nikola; Osslund, Leah M.; Prussin, Aaron J.; Vinkemeier, Uwe; Reich, Norbert O.

2013-01-01

The DNA-binding specificity and affinity of the dimeric human transcription factor (TF) STAT1, were assessed by total internal reflectance fluorescence protein-binding microarrays (TIRF-PBM) to evaluate the effects of protein phosphorylation, higher-order polymerization and small-molecule inhibition. Active, phosphorylated STAT1 showed binding preferences consistent with prior characterization, whereas unphosphorylated STAT1 showed a weak-binding preference for one-half of the GAS consensus site, consistent with recent models of STAT1 structure and function in response to phosphorylation. This altered-binding preference was further tested by use of the inhibitor LLL3, which we show to disrupt STAT1 binding in a sequence-dependent fashion. To determine if this sequence-dependence is specific to STAT1 and not a general feature of human TF biology, the TF Myc/Max was analysed and tested with the inhibitor Mycro3. Myc/Max inhibition by Mycro3 is sequence independent, suggesting that the sequence-dependent inhibition of STAT1 may be specific to this system and a useful target for future inhibitor design. PMID:23180800
Investigation of arc repressor DNA-binding specificity by comparative molecular dynamics simulations.

PubMed

Song, Wei; Guo, Jun-Tao

2015-01-01

Transcription factors regulate gene expression through binding to specific DNA sequences. How transcription factors achieve high binding specificity is still not well understood. In this paper, we investigated the role of protein flexibility in protein-DNA-binding specificity by comparative molecular dynamics (MD) simulations. Protein flexibility has been considered as a key factor in molecular recognition, which is intrinsically a dynamic process involving fine structural fitting between binding components. In this study, we performed comparative MD simulations on wild-type and F10V mutant P22 Arc repressor in both free and complex conformations. The F10V mutant has lower DNA-binding specificity though both the bound and unbound main-chain structures between the wild-type and F10V mutant Arc are highly similar. We found that the DNA-binding motif of wild-type Arc is structurally more flexible than the F10V mutant in the unbound state, especially for the six DNA base-contacting residues in each dimer. We demonstrated that the flexible side chains of wild-type Arc lead to a higher DNA-binding specificity through forming more hydrogen bonds with DNA bases upon binding. Our simulations also showed a possible conformational selection mechanism for Arc-DNA binding. These results indicate the important roles of protein flexibility and dynamic properties in protein-DNA-binding specificity.
TFBSshape: a motif database for DNA shape features of transcription factor binding sites

PubMed Central

Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W.; Gordân, Raluca; Rohs, Remo

2014-01-01

Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein–DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone. PMID:24214955
DNABP: Identification of DNA-Binding Proteins Based on Feature Selection Using a Random Forest and Predicting Binding Residues.

PubMed

Ma, Xin; Guo, Jing; Sun, Xiao

2016-01-01

DNA-binding proteins are fundamentally important in cellular processes. Several computational-based methods have been developed to improve the prediction of DNA-binding proteins in previous years. However, insufficient work has been done on the prediction of DNA-binding proteins from protein sequence information. In this paper, a novel predictor, DNABP (DNA-binding proteins), was designed to predict DNA-binding proteins using the random forest (RF) classifier with a hybrid feature. The hybrid feature contains two types of novel sequence features, which reflect information about the conservation of physicochemical properties of the amino acids, and the binding propensity of DNA-binding residues and non-binding propensities of non-binding residues. The comparisons with each feature demonstrated that these two novel features contributed most to the improvement in predictive ability. Furthermore, to improve the prediction performance of the DNABP model, feature selection using the minimum redundancy maximum relevance (mRMR) method combined with incremental feature selection (IFS) was carried out during the model construction. The results showed that the DNABP model could achieve 86.90% accuracy, 83.76% sensitivity, 90.03% specificity and a Matthews correlation coefficient of 0.727. High prediction accuracy and performance comparisons with previous research suggested that DNABP could be a useful approach to identify DNA-binding proteins from sequence information. The DNABP web server system is freely available at http://www.cbi.seu.edu.cn/DNABP/.
Specificity determinants for the abscisic acid response element.

PubMed

Sarkar, Aditya Kumar; Lahiri, Ansuman

2013-01-01

Abscisic acid (ABA) response elements (ABREs) are a group of cis-acting DNA elements that have been identified from promoter analysis of many ABA-regulated genes in plants. We are interested in understanding the mechanism of binding specificity between ABREs and a class of bZIP transcription factors known as ABRE binding factors (ABFs). In this work, we have modeled the homodimeric structure of the bZIP domain of ABRE binding factor 1 from Arabidopsis thaliana (AtABF1) and studied its interaction with ACGT core motif-containing ABRE sequences. We have also examined the variation in the stability of the protein-DNA complex upon mutating ABRE sequences using the protein design algorithm FoldX. The high throughput free energy calculations successfully predicted the ability of ABF1 to bind to alternative core motifs like GCGT or AAGT and also rationalized the role of the flanking sequences in determining the specificity of the protein-DNA interaction.
Theory on the mechanism of site-specific DNA-protein interactions in the presence of traps

NASA Astrophysics Data System (ADS)

Niranjani, G.; Murugan, R.

2016-08-01

The speed of site-specific binding of transcription factor (TFs) proteins with genomic DNA seems to be strongly retarded by the randomly occurring sequence traps. Traps are those DNA sequences sharing significant similarity with the original specific binding sites (SBSs). It is an intriguing question how the naturally occurring TFs and their SBSs are designed to manage the retarding effects of such randomly occurring traps. We develop a simple random walk model on the site-specific binding of TFs with genomic DNA in the presence of sequence traps. Our dynamical model predicts that (a) the retarding effects of traps will be minimum when the traps are arranged around the SBS such that there is a negative correlation between the binding strength of TFs with traps and the distance of traps from the SBS and (b) the retarding effects of sequence traps can be appeased by the condensed conformational state of DNA. Our computational analysis results on the distribution of sequence traps around the putative binding sites of various TFs in mouse and human genome clearly agree well the theoretical predictions. We propose that the distribution of traps can be used as an additional metric to efficiently identify the SBSs of TFs on genomic DNA.
SELMAP - SELEX affinity landscape MAPping of transcription factor binding sites using integrated microfluidics

PubMed Central

Chen, Dana; Orenstein, Yaron; Golodnitsky, Rada; Pellach, Michal; Avrahami, Dorit; Wachtel, Chaim; Ovadia-Shochat, Avital; Shir-Shapira, Hila; Kedmi, Adi; Juven-Gershon, Tamar; Shamir, Ron; Gerber, Doron

2016-01-01

Transcription factors (TFs) alter gene expression in response to changes in the environment through sequence-specific interactions with the DNA. These interactions are best portrayed as a landscape of TF binding affinities. Current methods to study sequence-specific binding preferences suffer from limited dynamic range, sequence bias, lack of specificity and limited throughput. We have developed a microfluidic-based device for SELEX Affinity Landscape MAPping (SELMAP) of TF binding, which allows high-throughput measurement of 16 proteins in parallel. We used it to measure the relative affinities of Pho4, AtERF2 and Btd full-length proteins to millions of different DNA binding sites, and detected both high and low-affinity interactions in equilibrium conditions, generating a comprehensive landscape of the relative TF affinities to all possible DNA 6-mers, and even DNA10-mers with increased sequencing depth. Low quantities of both the TFs and DNA oligomers were sufficient for obtaining high-quality results, significantly reducing experimental costs. SELMAP allows in-depth screening of hundreds of TFs, and provides a means for better understanding of the regulatory processes that govern gene expression. PMID:27628341
Contribution of the first K-homology domain of poly(C)-binding protein 1 to its affinity and specificity for C-rich oligonucleotides

PubMed Central

Yoga, Yano M. K.; Traore, Daouda A. K.; Sidiqi, Mahjooba; Szeto, Chris; Pendini, Nicole R.; Barker, Andrew; Leedman, Peter J.; Wilce, Jacqueline A.; Wilce, Matthew C. J.

2012-01-01

Poly-C-binding proteins are triple KH (hnRNP K homology) domain proteins with specificity for single stranded C-rich RNA and DNA. They play diverse roles in the regulation of protein expression at both transcriptional and translational levels. Here, we analyse the contributions of individual αCP1 KH domains to binding C-rich oligonucleotides using biophysical and structural methods. Using surface plasmon resonance (SPR), we demonstrate that KH1 makes the most stable interactions with both RNA and DNA, KH3 binds with intermediate affinity and KH2 only interacts detectibly with DNA. The crystal structure of KH1 bound to a 5′-CCCTCCCT-3′ DNA sequence shows a 2:1 protein:DNA stoichiometry and demonstrates a molecular arrangement of KH domains bound to immediately adjacent oligonucleotide target sites. SPR experiments, with a series of poly-C-sequences reveals that cytosine is preferred at all four positions in the oligonucleotide binding cleft and that a C-tetrad binds KH1 with 10 times higher affinity than a C-triplet. The basis for this high affinity interaction is finally detailed with the structure determination of a KH1.W.C54S mutant bound to 5′-ACCCCA-3′ DNA sequence. Together, these data establish the lead role of KH1 in oligonucleotide binding by αCP1 and reveal the molecular basis of its specificity for a C-rich tetrad. PMID:22344691
Contribution of the first K-homology domain of poly(C)-binding protein 1 to its affinity and specificity for C-rich oligonucleotides.

PubMed

Yoga, Yano M K; Traore, Daouda A K; Sidiqi, Mahjooba; Szeto, Chris; Pendini, Nicole R; Barker, Andrew; Leedman, Peter J; Wilce, Jacqueline A; Wilce, Matthew C J

2012-06-01

Poly-C-binding proteins are triple KH (hnRNP K homology) domain proteins with specificity for single stranded C-rich RNA and DNA. They play diverse roles in the regulation of protein expression at both transcriptional and translational levels. Here, we analyse the contributions of individual αCP1 KH domains to binding C-rich oligonucleotides using biophysical and structural methods. Using surface plasmon resonance (SPR), we demonstrate that KH1 makes the most stable interactions with both RNA and DNA, KH3 binds with intermediate affinity and KH2 only interacts detectibly with DNA. The crystal structure of KH1 bound to a 5'-CCCTCCCT-3' DNA sequence shows a 2:1 protein:DNA stoichiometry and demonstrates a molecular arrangement of KH domains bound to immediately adjacent oligonucleotide target sites. SPR experiments, with a series of poly-C-sequences reveals that cytosine is preferred at all four positions in the oligonucleotide binding cleft and that a C-tetrad binds KH1 with 10 times higher affinity than a C-triplet. The basis for this high affinity interaction is finally detailed with the structure determination of a KH1.W.C54S mutant bound to 5'-ACCCCA-3' DNA sequence. Together, these data establish the lead role of KH1 in oligonucleotide binding by αCP1 and reveal the molecular basis of its specificity for a C-rich tetrad.
Understanding the mechanisms of protein-DNA interactions

NASA Astrophysics Data System (ADS)

Lavery, Richard

2004-03-01

Structural, biochemical and thermodynamic data on protein-DNA interactions show that specific recognition cannot be reduced to a simple set of binary interactions between the partners (such as hydrogen bonds, ion pairs or steric contacts). The mechanical properties of the partners also play a role and, in the case of DNA, variations in both conformation and flexibility as a function of base sequence can be a significant factor in guiding a protein to the correct binding site. All-atom molecular modeling offers a means of analyzing the role of different binding mechanisms within protein-DNA complexes of known structure. This however requires estimating the binding strengths for the full range of sequences with which a given protein can interact. Since this number grows exponentially with the length of the binding site it is necessary to find a method to accelerate the calculations. We have achieved this by using a multi-copy approach (ADAPT) which allows us to build a DNA fragment with a variable base sequence. The results obtained with this method correlate well with experimental consensus binding sequences. They enable us to show that indirect recognition mechanisms involving the sequence dependent properties of DNA play a significant role in many complexes. This approach also offers a means of predicting protein binding sites on the basis of binding energies, which is complementary to conventional lexical techniques.
Interaction of the alpha-subunit of Escherichia coli RNA polymerase with DNA: rigid body nature of the protein-DNA contact.

PubMed

Heyduk, E; Baichoo, N; Heyduk, T

2001-11-30

The alpha-subunit of Escherichia coli RNA polymerase plays an important role in the activity of many promoters by providing a direct protein-DNA contact with a specific sequence (UP element) located upstream of the core promoter sequence. To obtain insight into the nature of thermodynamic forces involved in the formation of this protein-DNA contact, the binding of the alpha-subunit of E. coli RNA polymerase to a fluorochrome-labeled DNA fragment containing the rrnB P1 promoter UP element sequence was quantitatively studied using fluorescence polarization. The alpha dimer and DNA formed a 1:1 complex in solution. Complex formation at 25 degrees C was enthalpy-driven, the binding was accompanied by a net release of 1-2 ions, and no significant specific ion effects were observed. The van't Hoff plot of temperature dependence of binding was linear suggesting that the heat capacity change (Deltac(p)) was close to zero. Protein footprinting with hydroxyradicals showed that the protein did not change its conformation upon protein-DNA contact formation. No conformational changes in the DNA molecule were detected by CD spectroscopy upon protein-DNA complex formation. The thermodynamic characteristics of the binding together with the lack of significant conformational changes in the protein and in the DNA suggested that the alpha-subunit formed a rigid body-like contact with the DNA in which a tight complementary recognition interface between alpha-subunit and DNA was not formed.
Specific DNA binding of the two chicken Deformed family homeodomain proteins, Chox-1.4 and Chox-a.

PubMed Central

Sasaki, H; Yokoyama, E; Kuroiwa, A

1990-01-01

The cDNA clones encoding two chicken Deformed (Dfd) family homeobox containing genes Chox-1.4 and Chox-a were isolated. Comparison of their amino acid sequences with another chicken Dfd family homeodomain protein and with those of mouse homologues revealed that strong homologies are located in the amino terminal regions and around the homeodomains. Although homologies in other regions were relatively low, some short conserved sequences were also identified. E. coli-made full length proteins were purified and used for the production of specific antibodies and for DNA binding studies. The binding profiles of these proteins to the 5'-leader and 5'-upstream sequences of Chox-1.4 and Chox-a coding regions were analyzed by immunoprecipitation and DNase I footprint assays. These two Chox proteins bound to the same sites in the 5'-flanking sequences of their coding regions with various affinities and their binding affinities to each site were nearly the same. The consensus sequences of the high and low affinity binding sites were TAATGA(C/G) and CTAATTTT, respectively. A clustered binding site was identified in the 5'-upstream of the Chox-a gene, suggesting that this clustered binding site works as a cis-regulatory element for auto- and/or cross-regulation of Chox-a gene expression. Images PMID:1970866
Interactions between the R2R3-MYB Transcription Factor, AtMYB61, and Target DNA Binding Sites

PubMed Central

Prouse, Michael B.; Campbell, Malcolm M.

2013-01-01

Despite the prominent roles played by R2R3-MYB transcription factors in the regulation of plant gene expression, little is known about the details of how these proteins interact with their DNA targets. For example, while Arabidopsis thaliana R2R3-MYB protein AtMYB61 is known to alter transcript abundance of a specific set of target genes, little is known about the specific DNA sequences to which AtMYB61 binds. To address this gap in knowledge, DNA sequences bound by AtMYB61 were identified using cyclic amplification and selection of targets (CASTing). The DNA targets identified using this approach corresponded to AC elements, sequences enriched in adenosine and cytosine nucleotides. The preferred target sequence that bound with the greatest affinity to AtMYB61 recombinant protein was ACCTAC, the AC-I element. Mutational analyses based on the AC-I element showed that ACC nucleotides in the AC-I element served as the core recognition motif, critical for AtMYB61 binding. Molecular modelling predicted interactions between AtMYB61 amino acid residues and corresponding nucleotides in the DNA targets. The affinity between AtMYB61 and specific target DNA sequences did not correlate with AtMYB61-driven transcriptional activation with each of the target sequences. CASTing-selected motifs were found in the regulatory regions of genes previously shown to be regulated by AtMYB61. Taken together, these findings are consistent with the hypothesis that AtMYB61 regulates transcription from specific cis-acting AC elements in vivo. The results shed light on the specifics of DNA binding by an important family of plant-specific transcriptional regulators. PMID:23741471
Genome-wide survey of DNA-binding proteins in Arabidopsis thaliana: analysis of distribution and functions.

PubMed

Malhotra, Sony; Sowdhamini, Ramanathan

2013-08-01

The interaction of proteins with their respective DNA targets is known to control many high-fidelity cellular processes. Performing a comprehensive survey of the sequenced genomes for DNA-binding proteins (DBPs) will help in understanding their distribution and the associated functions in a particular genome. Availability of fully sequenced genome of Arabidopsis thaliana enables the review of distribution of DBPs in this model plant genome. We used profiles of both structure and sequence-based DNA-binding families, derived from PDB and PFam databases, to perform the survey. This resulted in 4471 proteins, identified as DNA-binding in Arabidopsis genome, which are distributed across 300 different PFam families. Apart from several plant-specific DNA-binding families, certain RING fingers and leucine zippers also had high representation. Our search protocol helped to assign DNA-binding property to several proteins that were previously marked as unknown, putative or hypothetical in function. The distribution of Arabidopsis genes having a role in plant DNA repair were particularly studied and noted for their functional mapping. The functions observed to be overrepresented in the plant genome harbour DNA-3-methyladenine glycosylase activity, alkylbase DNA N-glycosylase activity and DNA-(apurinic or apyrimidinic site) lyase activity, suggesting their role in specialized functions such as gene regulation and DNA repair.
Zinc-binding Domain of the Bacteriophage T7 DNA Primase Modulates Binding to the DNA Template*

PubMed Central

Lee, Seung-Joo; Zhu, Bin; Akabayov, Barak; Richardson, Charles C.

2012-01-01

The zinc-binding domain (ZBD) of prokaryotic DNA primases has been postulated to be crucial for recognition of specific sequences in the single-stranded DNA template. To determine the molecular basis for this role in recognition, we carried out homolog-scanning mutagenesis of the zinc-binding domain of DNA primase of bacteriophage T7 using a bacterial homolog from Geobacillus stearothermophilus. The ability of T7 DNA primase to catalyze template-directed oligoribonucleotide synthesis is eliminated by substitution of any five-amino acid residue-long segment within the ZBD. The most significant defect occurs upon substitution of a region (Pro-16 to Cys-20) spanning two cysteines that coordinate the zinc ion. The role of this region in primase function was further investigated by generating a protein library composed of multiple amino acid substitutions for Pro-16, Asp-18, and Asn-19 followed by genetic screening for functional proteins. Examination of proteins selected from the screening reveals no change in sequence-specific recognition. However, the more positively charged residues in the region facilitate DNA binding, leading to more efficient oligoribonucleotide synthesis on short templates. The results suggest that the zinc-binding mode alone is not responsible for sequence recognition, but rather its interaction with the RNA polymerase domain is critical for DNA binding and for sequence recognition. Consequently, any alteration in the ZBD that disturbs its conformation leads to loss of DNA-dependent oligoribonucleotide synthesis. PMID:23024359
Structure and Sequence Search on Aptamer-Protein Docking

NASA Astrophysics Data System (ADS)

Xiao, Jiajie; Bonin, Keith; Guthold, Martin; Salsbury, Freddie

2015-03-01

Interactions between proteins and deoxyribonucleic acid (DNA) play a significant role in the living systems, especially through gene regulation. However, short nucleic acids sequences (aptamers) with specific binding affinity to specific proteins exhibit clinical potential as therapeutics. Our capillary and gel electrophoresis selection experiments show that specific sequences of aptamers can be selected that bind specific proteins. Computationally, given the experimentally-determined structure and sequence of a thrombin-binding aptamer, we can successfully dock the aptamer onto thrombin in agreement with experimental structures of the complex. In order to further study the conformational flexibility of this thrombin-binding aptamer and to potentially develop a predictive computational model of aptamer-binding, we use GPU-enabled molecular dynamics simulations to both examine the conformational flexibility of the aptamer in the absence of binding to thrombin, and to determine our ability to fold an aptamer. This study should help further de-novo predictions of aptamer sequences by enabling the study of structural and sequence-dependent effects on aptamer-protein docking specificity.
In vitro fluorescence studies of transcription factor IIB-DNA interaction.

PubMed

Górecki, Andrzej; Figiel, Małgorzata; Dziedzicka-Wasylewska, Marta

2015-01-01

General transcription factor TFIIB is one of the basal constituents of the preinitiation complex of eukaryotic RNA polymerase II, acting as a bridge between the preinitiation complex and the polymerase, and binding promoter DNA in an asymmetric manner, thereby defining the direction of the transcription. Methods of fluorescence spectroscopy together with circular dichroism spectroscopy were used to observe conformational changes in the structure of recombinant human TFIIB after binding to specific DNA sequence. To facilitate the exploration of the structural changes, several site-directed mutations have been introduced altering the fluorescence properties of the protein. Our observations showed that binding of specific DNA sequences changed the protein structure and dynamics, and TFIIB may exist in two conformational states, which can be described by a different microenvironment of W52. Fluorescence studies using both intrinsic and exogenous fluorophores showed that these changes significantly depended on the recognition sequence and concerned various regions of the protein, including those interacting with other transcription factors and RNA polymerase II. DNA binding can cause rearrangements in regions of proteins interacting with the polymerase in a manner dependent on the recognized sequences, and therefore, influence the gene expression.
Herpes simplex virus DNA packaging sequences adopt novel structures that are specifically recognized by a component of the cleavage and packaging machinery.

PubMed

Adelman, K; Salmon, B; Baines, J D

2001-03-13

The product of the herpes simplex virus type 1 U(L)28 gene is essential for cleavage of concatemeric viral DNA into genome-length units and packaging of this DNA into viral procapsids. To address the role of U(L)28 in this process, purified U(L)28 protein was assayed for the ability to recognize conserved herpesvirus DNA packaging sequences. We report that DNA fragments containing the pac1 DNA packaging motif can be induced by heat treatment to adopt novel DNA conformations that migrate faster than the corresponding duplex in nondenaturing gels. Surprisingly, these novel DNA structures are high-affinity substrates for U(L)28 protein binding, whereas double-stranded DNA of identical sequence composition is not recognized by U(L)28 protein. We demonstrate that only one strand of the pac1 motif is responsible for the formation of novel DNA structures that are bound tightly and specifically by U(L)28 protein. To determine the relevance of the observed U(L)28 protein-pac1 interaction to the cleavage and packaging process, we have analyzed the binding affinity of U(L)28 protein for pac1 mutants previously shown to be deficient in cleavage and packaging in vivo. Each of the pac1 mutants exhibited a decrease in DNA binding by U(L)28 protein that correlated directly with the reported reduction in cleavage and packaging efficiency, thereby supporting a role for the U(L)28 protein-pac1 interaction in vivo. These data therefore suggest that the formation of novel DNA structures by the pac1 motif confers added specificity on recognition of DNA packaging sequences by the U(L)28-encoded component of the herpesvirus cleavage and packaging machinery.
Regulation of the yeast RAD2 gene: DNA damage-dependent induction correlates with protein binding to regulatory sequences and their deletion influences survival.

PubMed

Siede, W; Friedberg, E C

1992-03-01

In the yeast Saccharomyces cerevisiae the RAD2 gene is absolutely required for damage-specific incision of DNA during nucleotide excision repair and is inducible by DNA-damaging agents. In the present study we correlated sensitivity to killing by DNA-damaging agents with the deletion of previously defined specific promoter elements. Deletion of the element DRE2 increased the UV sensitivity of cells in both the G1/early S and S/G2 phases of the cell cycle as well as in stationary phase. On the other hand, increased UV sensitivity associated with deletion of the sequence-related element DRE1 was restricted to cells irradiated in G1/S. Specific binding of protein(s) to the promoter elements DRE1 and DRE2 was observed under non-inducing conditions using gel retardation assays. Exposure of cells to DNA-damaging agents resulted in increased protein binding that was dependent on de novo protein synthesis.

Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis

PubMed Central

Jakubec, David; Laskowski, Roman A.; Vondrasek, Jiri

2016-01-01

Decades of intensive experimental studies of the recognition of DNA sequences by proteins have provided us with a view of a diverse and complicated world in which few to no features are shared between individual DNA-binding protein families. The originally conceived direct readout of DNA residue sequences by amino acid side chains offers very limited capacity for sequence recognition, while the effects of the dynamic properties of the interacting partners remain difficult to quantify and almost impossible to generalise. In this work we investigated the energetic characteristics of all DNA residue—amino acid side chain combinations in the conformations found at the interaction interface in a very large set of protein—DNA complexes by the means of empirical potential-based calculations. General specificity-defining criteria were derived and utilised to look beyond the binding motifs considered in previous studies. Linking energetic favourability to the observed geometrical preferences, our approach reveals several additional amino acid motifs which can distinguish between individual DNA bases. Our results remained valid in environments with various dielectric properties. PMID:27384774
Curated collection of yeast transcription factor DNA binding specificity data reveals novel structural and gene regulatory insights

PubMed Central

2011-01-01

Background Transcription factors (TFs) play a central role in regulating gene expression by interacting with cis-regulatory DNA elements associated with their target genes. Recent surveys have examined the DNA binding specificities of most Saccharomyces cerevisiae TFs, but a comprehensive evaluation of their data has been lacking. Results We analyzed in vitro and in vivo TF-DNA binding data reported in previous large-scale studies to generate a comprehensive, curated resource of DNA binding specificity data for all characterized S. cerevisiae TFs. Our collection comprises DNA binding site motifs and comprehensive in vitro DNA binding specificity data for all possible 8-bp sequences. Investigation of the DNA binding specificities within the basic leucine zipper (bZIP) and VHT1 regulator (VHR) TF families revealed unexpected plasticity in TF-DNA recognition: intriguingly, the VHR TFs, newly characterized by protein binding microarrays in this study, recognize bZIP-like DNA motifs, while the bZIP TF Hac1 recognizes a motif highly similar to the canonical E-box motif of basic helix-loop-helix (bHLH) TFs. We identified several TFs with distinct primary and secondary motifs, which might be associated with different regulatory functions. Finally, integrated analysis of in vivo TF binding data with protein binding microarray data lends further support for indirect DNA binding in vivo by sequence-specific TFs. Conclusions The comprehensive data in this curated collection allow for more accurate analyses of regulatory TF-DNA interactions, in-depth structural studies of TF-DNA specificity determinants, and future experimental investigations of the TFs' predicted target genes and regulatory roles. PMID:22189060
An electrochemical sensing platform based on local repression of electrolyte diffusion for single-step, reagentless, sensitive detection of a sequence-specific DNA-binding protein.

PubMed

Zhang, Yun; Liu, Fang; Nie, Jinfang; Jiang, Fuyang; Zhou, Caibin; Yang, Jiani; Fan, Jinlong; Li, Jianping

2014-05-07

In this paper, we report for the first time an electrochemical biosensor for single-step, reagentless, and picomolar detection of a sequence-specific DNA-binding protein using a double-stranded, electrode-bound DNA probe terminally modified with a redox active label close to the electrode surface. This new methodology is based upon local repression of electrolyte diffusion associated with protein-DNA binding that leads to reduction of the electrochemical response of the label. In the proof-of-concept study, the resulting electrochemical biosensor was quantitatively sensitive to the concentrations of the TATA binding protein (TBP, a model analyte) ranging from 40 pM to 25.4 nM with an estimated detection limit of ∼10.6 pM (∼80 to 400-fold improvement on the detection limit over previous electrochemical analytical systems).
Toward rules relating zinc finger protein sequences and DNA binding site preferences.

PubMed

Desjarlais, J R; Berg, J M

1992-08-15

Zinc finger proteins of the Cys2-His2 type consist of tandem arrays of domains, where each domain appears to contact three adjacent base pairs of DNA through three key residues. We have designed and prepared a series of variants of the central zinc finger within the DNA binding domain of Sp1 by using information from an analysis of a large data base of zinc finger protein sequences. Through systematic variations at two of the three contact positions (underlined), relatively specific recognition of sequences of the form 5'-GGGGN(G or T)GGG-3' has been achieved. These results provide the basis for rules that may develop into a code that will allow the design of zinc finger proteins with preselected DNA site specificity.
Accurate and sensitive quantification of protein-DNA binding affinity.

PubMed

Rastogi, Chaitanya; Rube, H Tomas; Kribelbauer, Judith F; Crocker, Justin; Loker, Ryan E; Martini, Gabriella D; Laptenko, Oleg; Freed-Pastor, William A; Prives, Carol; Stern, David L; Mann, Richard S; Bussemaker, Harmen J

2018-04-17

Transcription factors (TFs) control gene expression by binding to genomic DNA in a sequence-specific manner. Mutations in TF binding sites are increasingly found to be associated with human disease, yet we currently lack robust methods to predict these sites. Here, we developed a versatile maximum likelihood framework named No Read Left Behind (NRLB) that infers a biophysical model of protein-DNA recognition across the full affinity range from a library of in vitro selected DNA binding sites. NRLB predicts human Max homodimer binding in near-perfect agreement with existing low-throughput measurements. It can capture the specificity of the p53 tetramer and distinguish multiple binding modes within a single sample. Additionally, we confirm that newly identified low-affinity enhancer binding sites are functional in vivo, and that their contribution to gene expression matches their predicted affinity. Our results establish a powerful paradigm for identifying protein binding sites and interpreting gene regulatory sequences in eukaryotic genomes. Copyright © 2018 the Author(s). Published by PNAS.
Accurate and sensitive quantification of protein-DNA binding affinity

PubMed Central

Rastogi, Chaitanya; Rube, H. Tomas; Kribelbauer, Judith F.; Crocker, Justin; Loker, Ryan E.; Martini, Gabriella D.; Laptenko, Oleg; Freed-Pastor, William A.; Prives, Carol; Stern, David L.; Mann, Richard S.; Bussemaker, Harmen J.

2018-01-01

Transcription factors (TFs) control gene expression by binding to genomic DNA in a sequence-specific manner. Mutations in TF binding sites are increasingly found to be associated with human disease, yet we currently lack robust methods to predict these sites. Here, we developed a versatile maximum likelihood framework named No Read Left Behind (NRLB) that infers a biophysical model of protein-DNA recognition across the full affinity range from a library of in vitro selected DNA binding sites. NRLB predicts human Max homodimer binding in near-perfect agreement with existing low-throughput measurements. It can capture the specificity of the p53 tetramer and distinguish multiple binding modes within a single sample. Additionally, we confirm that newly identified low-affinity enhancer binding sites are functional in vivo, and that their contribution to gene expression matches their predicted affinity. Our results establish a powerful paradigm for identifying protein binding sites and interpreting gene regulatory sequences in eukaryotic genomes. PMID:29610332
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

PubMed

Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

2014-11-01

As many structures of protein-DNA complexes have been known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, its inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure 66.3% and a MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and a MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and a MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
An immunoassay for the study of DNA-binding activities of herpes simplex virus protein ICP8.

PubMed

Lee, C K; Knipe, D M

1985-06-01

An immunoassay was used to examine the interaction between a herpes simplex virus protein, ICP8, and various types of DNA. The advantage of this assay is that the protein is not subjected to harsh purification procedures. We characterized the binding of ICP8 to both single-stranded (ss) and double-stranded (ds) DNA. ICP8 bound ss DNA fivefold more efficiently than ds DNA, and both binding activities were most efficient in 150 mM NaCl. Two lines of evidence indicate that the binding activities were not identical: (i) ds DNA failed to complete with ss DNA binding even with a large excess of ds DNA; (ii) Scatchard plots of DNA binding with various amounts of DNA were fundamentally different for ss DNA and ds DNA. However, the two activities were related in that ss DNA efficiently competed with the binding of ds DNA. We conclude that the ds DNA-binding activity of ICP8 is probably distinct from the ss DNA-binding activity. No evidence for sequence-specific ds DNA binding was obtained for either the entire herpes simplex virus genome or cloned viral sequences.
Deep-sea vent phage DNA polymerase specifically initiates DNA synthesis in the absence of primers.

PubMed

Zhu, Bin; Wang, Longfei; Mitsunobu, Hitoshi; Lu, Xueling; Hernandez, Alfredo J; Yoshida-Takashima, Yukari; Nunoura, Takuro; Tabor, Stanley; Richardson, Charles C

2017-03-21

A DNA polymerase is encoded by the deep-sea vent phage NrS-1. NrS-1 has a unique genome organization containing genes that are predicted to encode a helicase and a single-stranded DNA (ssDNA)-binding protein. The gene for an unknown protein shares weak homology with the bifunctional primase-polymerases (prim-pols) from archaeal plasmids but is missing the zinc-binding domain typically found in primases. We show that this gene product has efficient DNA polymerase activity and is processive in DNA synthesis in the presence of the NrS-1 helicase and ssDNA-binding protein. Remarkably, this NrS-1 DNA polymerase initiates DNA synthesis from a specific template DNA sequence in the absence of any primer. The de novo DNA polymerase activity resides in the N-terminal domain of the protein, whereas the C-terminal domain enhances DNA binding.
HMGB1-mediated DNA bending: Distinct roles in increasing p53 binding to DNA and the transactivation of p53-responsive gene promoters.

PubMed

Štros, Michal; Kučírek, Martin; Sani, Soodabeh Abbasi; Polanská, Eva

2018-03-01

HMGB1 is a chromatin-associated protein that has been implicated in many important biological processes such as transcription, recombination, DNA repair, and genome stability. These functions include the enhancement of binding of a number of transcription factors, including the tumor suppressor protein p53, to their specific DNA-binding sites. HMGB1 is composed of two highly conserved HMG boxes, linked to an intrinsically disordered acidic C-terminal tail. Previous reports have suggested that the ability of HMGB1 to bend DNA may explain the in vitro HMGB1-mediated increase in sequence-specific DNA binding by p53. The aim of this study was to reinvestigate the importance of HMGB1-induced DNA bending in relationship to the ability of the protein to promote the specific binding of p53 to short DNA duplexes in vitro, and to transactivate two major p53-regulated human genes: Mdm2 and p21/WAF1. Using a number of HMGB1 mutants, we report that the HMGB1-mediated increase in sequence-specific p53 binding to DNA duplexes in vitro depends very little on HMGB1-mediated DNA bending. The presence of the acidic C-terminal tail of HMGB1 and/or the oxidation of the protein can reduce the HMGB1-mediated p53 binding. Interestingly, the induction of transactivation of p53-responsive gene promoters by HMGB1 requires both the ability of the protein to bend DNA and the acidic C-terminal tail, and is promoter-specific. We propose that the efficient transactivation of p53-responsive gene promoters by HMGB1 depends on complex events, rather than solely on the promotion of p53 binding to its DNA cognate sites. Copyright © 2018 Elsevier B.V. All rights reserved.
Crystal Structure of the Minimalist Max-E47 Protein Chimera

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ahmadpour, Faraz; Ghirlando, Rodolfo; De Jong, Antonia T.

Max-E47 is a protein chimera generated from the fusion of the DNA-binding basic region of Max and the dimerization region of E47, both members of the basic region/helix-loop-helix (bHLH) superfamily of transcription factors. Like native Max, Max-E47 binds with high affinity and specificity to the E-box site, 5'-CACGTG, both in vivo and in vitro. We have determined the crystal structure of Max-E47 at 1.7 Å resolution, and found that it associates to form a well-structured dimer even in the absence of its cognate DNA. Analytical ultracentrifugation confirms that Max-E47 is dimeric even at low micromolar concentrations, indicating that the Max-E47more » dimer is stable in the absence of DNA. Circular dichroism analysis demonstrates that both non-specific DNA and the E-box site induce similar levels of helical secondary structure in Max-E47. These results suggest that Max-E47 may bind to the E-box following the two-step mechanism proposed for other bHLH proteins. In this mechanism, a rapid step where protein binds to DNA without sequence specificity is followed by a slow step where specific protein:DNA interactions are fine-tuned, leading to sequence-specific recognition. Collectively, these results show that the designed Max-E47 protein chimera behaves both structurally and functionally like its native counterparts.« less
Evolutionary and biophysical relationships among the papillomavirus E2 proteins.

PubMed

Blakaj, Dukagjin M; Fernandez-Fuentes, Narcis; Chen, Zigui; Hegde, Rashmi; Fiser, Andras; Burk, Robert D; Brenowitz, Michael

2009-01-01

Infection by human papillomavirus (HPV) may result in clinical conditions ranging from benign warts to invasive cancer. The HPV E2 protein represses oncoprotein transcription and is required for viral replication. HPV E2 binds to palindromic DNA sequences of highly conserved four base pair sequences flanking an identical length variable 'spacer'. E2 proteins directly contact the conserved but not the spacer DNA. Variation in naturally occurring spacer sequences results in differential protein affinity that is dependent on their sensitivity to the spacer DNA's unique conformational and/or dynamic properties. This article explores the biophysical character of this core viral protein with the goal of identifying characteristics that associated with risk of virally caused malignancy. The amino acid sequence, 3d structure and electrostatic features of the E2 protein DNA binding domain are highly conserved; specific interactions with DNA binding sites have also been conserved. In contrast, the E2 protein's transactivation domain does not have extensive surfaces of highly conserved residues. Rather, regions of high conservation are localized to small surface patches. Implications to cancer biology are discussed.
DNA sequence determinants controlling affinity, stability and shape of DNA complexes bound by the nucleoid protein Fis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio

The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequencesmore » in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.« less
DNA sequence determinants controlling affinity, stability and shape of DNA complexes bound by the nucleoid protein Fis

DOE PAGES

Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio; ...

2016-03-09

The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequencesmore » in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.« less
Multiple conformational states of DnaA protein regulate its interaction with DnaA boxes in the initiation of DNA replication.

PubMed

Patel, Meera J; Bhatia, Lavesh; Yilmaz, Gulden; Biswas-Fiss, Esther E; Biswas, Subhasis B

2017-09-01

DnaA protein is the initiator of genomic DNA replication in prokaryotes. It binds to specific DNA sequences in the origin of DNA replication and unwinds small AT-rich sequences downstream for the assembly of the replisome. The mechanism of activation of DnaA that enables it to bind and organize the origin DNA and leads to replication initiation remains unclear. In this study, we have developed double-labeled fluorescent DnaA probes to analyze conformational states of DnaA protein upon binding DNA, nucleotide, and Soj sporulation protein using Fluorescence Resonance Energy Transfer (FRET). Our studies demonstrate that DnaA protein undergoes large conformational changes upon binding to substrates and there are multiple distinct conformational states that enable it to initiate DNA replication. DnaA protein adopted a relaxed conformation by expanding ~15Å upon binding ATP and DNA to form the ATP·DnaA·DNA complex. Hydrolysis of bound ATP to ADP led to a contraction of DnaA within the complex. The relaxed conformation of DnaA is likely required for the formation of the multi-protein ATP·DnaA·DNA complex. In the initiation of sporulation, Soj binding to DnaA prevented relaxation of its conformation. Soj·ADP appeared to block the activation of DnaA, suggesting a mechanism for Soj·ADP in switching initiation of DNA replication to sporulation. Our studies demonstrate that multiple conformational states of DnaA protein regulate its binding to DNA in the initiation of DNA replication. Copyright © 2017 Elsevier B.V. All rights reserved.
Human immunodeficiency virus type 1 LTR TATA and TAR region sequences required for transcriptional regulation.

PubMed Central

Garcia, J A; Harrich, D; Soultanakis, E; Wu, F; Mitsuyasu, R; Gaynor, R B

1989-01-01

The human immunodeficiency virus (HIV) type 1 LTR is regulated at the transcriptional level by both cellular and viral proteins. Using HeLa cell extracts, multiple regions of the HIV LTR were found to serve as binding sites for cellular proteins. An untranslated region binding protein UBP-1 has been purified and fractions containing this protein bind to both the TAR and TATA regions. To investigate the role of cellular proteins binding to both the TATA and TAR regions and their potential interaction with other HIV DNA binding proteins, oligonucleotide-directed mutagenesis of both these regions was performed followed by DNase I footprinting and transient expression assays. In the TATA region, two direct repeats TC/AAGC/AT/AGCTGC surround the TATA sequence. Mutagenesis of both of these direct repeats or of the TATA sequence interrupted binding over the TATA region on the coding strand, but only a mutation of the TATA sequence affected in vivo assays for tat-activation. In addition to TAR serving as the site of binding of cellular proteins, RNA transcribed from TAR is capable of forming a stable stem-loop structure. To determine the relative importance of DNA binding proteins as compared to secondary structure, oligonucleotide-directed mutations in the TAR region were studied. Local mutations that disrupted either the stem or loop structure were defective in gene expression. However, compensatory mutations which restored base pairing in the stem resulted in complete tat-activation. This indicated a significant role for the stem-loop structure in HIV gene expression. To determine the role of TAR binding proteins, mutations were constructed which extensively changed the primary structure of the TAR region, yet left stem base pairing, stem energy and the loop sequence intact. These mutations resulted in decreased protein binding to TAR DNA and defects in tat-activation, and revealed factor binding specifically to the loop DNA sequence. Further mutagenesis which inverted this stem and loop mutation relative to the HIV LTR mRNA start site resulted in even larger decreases in tat-activation. This suggests that multiple determinants, including protein binding, the loop sequence, and RNA or DNA secondary structure, are important in tat-activation and suggests that tat may interact with cellular proteins binding to DNA to increase HIV gene expression. Images PMID:2721501
A close relative of the nuclear, chromosomal high-mobility group protein HMG1 in yeast mitochondria.

PubMed Central

Diffley, J F; Stillman, B

1991-01-01

ABF2 (ARS-binding factor 2), a small, basic DNA-binding protein that binds specifically to the autonomously replicating sequence ARS1, is located primarily in the mitochondria of the yeast Saccharomyces cerevisiae. The abundance of ABF2 and the phenotype of abf2- null mutants argue that this protein plays a key role in the structure, maintenance, and expression of the yeast mitochondrial genome. The predicted amino acid sequence of ABF2 is closely related to the high-mobility group proteins HMG1 and HMG2 from vertebrate cell nuclei and to several other DNA-binding proteins. Additionally, ABF2 and the other HMG-related proteins are related to a globular domain from the heat shock protein hsp70 family. ABF2 interacts with DNA both nonspecifically and in a specific manner within regulatory regions, suggesting a mechanism whereby it may aid in compacting the mitochondrial genome without interfering with expression. Images PMID:1881919
DNA Recognition by a σ 54 Transcriptional Activator from Aquifex aeolicus

DOE PAGES

Vidangos, Natasha K.; Heideker, Johanna; Lyubimov, Artem; ...

2014-08-23

Transcription initiation by bacterial σ 54-polymerase requires the action of a transcriptional activator protein. Activators bind sequence-specifically upstream of the transcription initiation site via a DNA-binding domain. The structurally characterized DNA-binding domains from activators all belong to the Factor for Inversion Stimulation (Fis) family of helix-turn-helix DNA-binding proteins. We report here structures of the free and DNA-bound forms of the DNA-binding domain of NtrC4 (4DBD) from Aquifex aeolicus, a member of the NtrC family of σ 54 activators. Two NtrC4 binding sites were identified upstream (-145 and -85 base pairs) from the start of the lpxC gene, which is responsiblemore » for the first committed step in Lipid A biosynthesis. This is the first experimental evidence for σ 54 regulation in lpxC expression. 4DBD was crystallized both without DNA and in complex with the -145 binding site. The structures, together with biochemical data, indicate that NtrC4 binds to DNA in a manner that is similar to that of its close homologue, Fis. Ultimately, the greater sequence specificity for the binding of 4DBD relative to Fis seems to arise from a larger number of base specific contacts contributing to affinity than for Fis.« less
DNA sequence+shape kernel enables alignment-free modeling of transcription factor binding.

PubMed

Ma, Wenxiu; Yang, Lin; Rohs, Remo; Noble, William Stafford

2017-10-01

Transcription factors (TFs) bind to specific DNA sequence motifs. Several lines of evidence suggest that TF-DNA binding is mediated in part by properties of the local DNA shape: the width of the minor groove, the relative orientations of adjacent base pairs, etc. Several methods have been developed to jointly account for DNA sequence and shape properties in predicting TF binding affinity. However, a limitation of these methods is that they typically require a training set of aligned TF binding sites. We describe a sequence + shape kernel that leverages DNA sequence and shape information to better understand protein-DNA binding preference and affinity. This kernel extends an existing class of k-mer based sequence kernels, based on the recently described di-mismatch kernel. Using three in vitro benchmark datasets, derived from universal protein binding microarrays (uPBMs), genomic context PBMs (gcPBMs) and SELEX-seq data, we demonstrate that incorporating DNA shape information improves our ability to predict protein-DNA binding affinity. In particular, we observe that (i) the k-spectrum + shape model performs better than the classical k-spectrum kernel, particularly for small k values; (ii) the di-mismatch kernel performs better than the k-mer kernel, for larger k; and (iii) the di-mismatch + shape kernel performs better than the di-mismatch kernel for intermediate k values. The software is available at https://bitbucket.org/wenxiu/sequence-shape.git. rohs@usc.edu or william-noble@uw.edu. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
Structural Analysis of HMGD-DNA Complexes Reveal Influence of Intercalation on Sequence Selectivity and DNA Bending

PubMed Central

Churchill, Mair E.A.; Klass, Janet; Zoetewey, David L.

2010-01-01

The ubiquitous eukaryotic High-Mobility-Group-Box (HMGB) chromosomal proteins promote many chromatin-mediated cellular activities through their non-sequence-specific binding and bending of DNA. Minor groove DNA binding by the HMG box results in substantial DNA bending toward the major groove owing to electrostatic interactions, shape complementarity and DNA intercalation that occurs at two sites. Here, the structures of the complexes formed with DNA by a partially DNA intercalation-deficient mutant of Drosophila melanogaster HMGD have been determined by X-ray crystallography at a resolution of 2.85 Å. The six proteins and fifty base pairs of DNA in the crystal structure revealed a variety of bound conformations. All of the proteins bound in the minor groove, bridging DNA molecules, presumably because these DNA regions are easily deformed. The loss of the primary site of DNA intercalation decreased overall DNA bending and shape complementarity. However, DNA bending at the secondary site of intercalation was retained and most protein-DNA contacts were preserved. The mode of binding resembles the HMGB1-boxA-cisplatin-DNA complex, which also lacks a primary intercalating residue. This study provides new insights into the binding mechanisms used by HMG boxes to recognize varied DNA structures and sequences as well as modulate DNA structure and DNA bending. PMID:20800069

Genomic Heat Shock Element Sequences Drive Cooperative Human Heat Shock Factor 1 DNA Binding and Selectivity*

PubMed Central

Jaeger, Alex M.; Makley, Leah N.; Gestwicki, Jason E.; Thiele, Dennis J.

2014-01-01

The heat shock transcription factor 1 (HSF1) activates expression of a variety of genes involved in cell survival, including protein chaperones, the protein degradation machinery, anti-apoptotic proteins, and transcription factors. Although HSF1 activation has been linked to amelioration of neurodegenerative disease, cancer cells exhibit a dependence on HSF1 for survival. Indeed, HSF1 drives a program of gene expression in cancer cells that is distinct from that activated in response to proteotoxic stress, and HSF1 DNA binding activity is elevated in cycling cells as compared with arrested cells. Active HSF1 homotrimerizes and binds to a DNA sequence consisting of inverted repeats of the pentameric sequence nGAAn, known as heat shock elements (HSEs). Recent comprehensive ChIP-seq experiments demonstrated that the architecture of HSEs is very diverse in the human genome, with deviations from the consensus sequence in the spacing, orientation, and extent of HSE repeats that could influence HSF1 DNA binding efficacy and the kinetics and magnitude of target gene expression. To understand the mechanisms that dictate binding specificity, HSF1 was purified as either a monomer or trimer and used to evaluate DNA-binding site preferences in vitro using fluorescence polarization and thermal denaturation profiling. These results were compared with quantitative chromatin immunoprecipitation assays in vivo. We demonstrate a role for specific orientations of extended HSE sequences in driving preferential HSF1 DNA binding to target loci in vivo. These studies provide a biochemical basis for understanding differential HSF1 target gene recognition and transcription in neurodegenerative disease and in cancer. PMID:25204655
Characterization of protein--DNA interactions using surface plasmon resonance spectroscopy with various assay schemes.

PubMed

Teh, Huey Fang; Peh, Wendy Y X; Su, Xiaodi; Thomsen, Jane S

2007-02-27

Specific protein-DNA interactions play a central role in transcription and other biological processes. A comprehensive characterization of protein-DNA interactions should include information about binding affinity, kinetics, sequence specificity, and binding stoichiometry. In this study, we have used surface plasmon resonance spectroscopy (SPR) to study the interactions between human estrogen receptors (ER, alpha and beta subtypes) and estrogen response elements (ERE), with four assay schemes. First, we determined the sequence-dependent receptors' binding capacity by monitoring the binding of ER to various ERE sequences immobilized on a sensor surface (assay format denoted as the direct assay). Second, we screened the relative affinity of ER for various ERE sequences using a competition assay, in which the receptors bind to an ERE-immobilized surface in the presence of competitor ERE sequences. Third, we monitored the assembly of ER-ERE complexes on a SPR surface and thereafter the removal and/or dissociation of the ER (assay scheme denoted as the dissociation assay) to determine the binding stoichiometry. Last, a sandwich assay (ER binding to ERE followed by anti-ER recognition of a specific ER subtype) was performed in an effort to understand how ERalpha and ERbeta may associate and compete when binding to the DNA. With these assay schemes, we reaffirmed that (1) ERalpha is more sensitive than ERbeta to base pair change(s) in the consensus ERE, (2) ERalpha and ERbeta form a heterodimer when they bind to the consensus ERE, and (3) the binding stoichiometry of both ERalpha- and ERbeta-ERE complexes is dependent on salt concentration. With this study, we demonstrate the versatility of the SPR analysis. With the involvement of various assay arrangements, the SPR analysis can be further extended to more than kinetics and affinity study.
NKX3.1 Genotype and IGF-1 Interact in Prostate Cancer Risk

DTIC Science & Technology

2009-05-01

Steadman DJ, Giuffrida D, Gelmann EP. DNA-binding sequence of the human prostate-specific homeodomain protein NKX3.1. Nucleic Acids Res 2000;28...Gelmann EP. DNA-binding sequence of the human prostate-specific homeodomain protein NKX3.1. Nucleic Acids Res 2000;28:2389–95. 20. Wu X, Senechal K...3212836 /UG=Hs.21765 fatty acid desaturase 3 204733_at 5.74 gb:NM_002774.1 /DEF=Homo sapiens kallikrein 6 (neurosin, zyme) (KLK6), mRNA. /FEA=mRNA /GEN
A cDNA from a mouse pancreatic beta cell encoding a putative transcription factor of the insulin gene.

PubMed Central

Walker, M D; Park, C W; Rosen, A; Aronheim, A

1990-01-01

Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. Images PMID:2181401
Single-stranded DNA Binding by the Helix-Hairpin-Helix Domain of XPF Protein Contributes to the Substrate Specificity of the ERCC1-XPF Protein Complex*

PubMed Central

Das, Devashish; Faridounnia, Maryam; Kovacic, Lidija; Kaptein, Robert; Boelens, Rolf; Folkers, Gert E.

2017-01-01

The nucleotide excision repair protein complex ERCC1-XPF is required for incision of DNA upstream of DNA damage. Functional studies have provided insights into the binding of ERCC1-XPF to various DNA substrates. However, because no structure for the ERCC1-XPF-DNA complex has been determined, the mechanism of substrate recognition remains elusive. Here we biochemically characterize the substrate preferences of the helix-hairpin-helix (HhH) domains of XPF and ERCC-XPF and show that the binding to single-stranded DNA (ssDNA)/dsDNA junctions is dependent on joint binding to the DNA binding domain of ERCC1 and XPF. We reveal that the homodimeric XPF is able to bind various ssDNA sequences but with a clear preference for guanine-containing substrates. NMR titration experiments and in vitro DNA binding assays also show that, within the heterodimeric ERCC1-XPF complex, XPF specifically recognizes ssDNA. On the other hand, the HhH domain of ERCC1 preferentially binds dsDNA through the hairpin region. The two separate non-overlapping DNA binding domains in the ERCC1-XPF heterodimer jointly bind to an ssDNA/dsDNA substrate and, thereby, at least partially dictate the incision position during damage removal. Based on structural models, NMR titrations, DNA-binding studies, site-directed mutagenesis, charge distribution, and sequence conservation, we propose that the HhH domain of ERCC1 binds to dsDNA upstream of the damage, and XPF binds to the non-damaged strand within a repair bubble. PMID:28028171
Characterization of the DNA binding properties of polyomavirus capsid protein

NASA Technical Reports Server (NTRS)

Chang, D.; Cai, X.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)

1993-01-01

The DNA binding properties of the polyomavirus structural proteins VP1, VP2, and VP3 were studied by Southwestern analysis. The major viral structural protein VP1 and host-contributed histone proteins of polyomavirus virions were shown to exhibit DNA binding activity, but the minor capsid proteins VP2 and VP3 failed to bind DNA. The N-terminal first five amino acids (Ala-1 to Lys-5) were identified as the VP1 DNA binding domain by genetic and biochemical approaches. Wild-type VP1 expressed in Escherichia coli (RK1448) exhibited DNA binding activity, but the N-terminal truncated VP1 mutants (lacking Ala-1 to Lys-5 and Ala-1 to Cys-11) failed to bind DNA. The synthetic peptide (Ala-1 to Cys-11) was also shown to have an affinity for DNA binding. Site-directed mutagenesis of the VP1 gene showed that the point mutations at Pro-2, Lys-3, and Arg-4 on the VP1 molecule did not affect DNA binding properties but that the point mutation at Lys-5 drastically reduced DNA binding affinity. The N-terminal (Ala-1 to Lys-5) region of VP1 was found to be essential and specific for DNA binding, while the DNA appears to be non-sequence specific. The DNA binding domain and the nuclear localization signal are located in the same N-terminal region.
The Arabidopsis class I TCP transcription factor AtTCP11 is a developmental regulator with distinct DNA-binding properties due to the presence of a threonine residue at position 15 of the TCP domain.

PubMed

Viola, Ivana L; Uberti Manassero, Nora G; Ripoll, Rodrigo; Gonzalez, Daniel H

2011-04-01

The TCP domain is a DNA-binding domain present in plant transcription factors that modulate different processes. In the present study, we show that Arabidopsis class I TCP proteins are able to interact with a dyad-symmetric sequence composed of two GTGGG half-sites. TCP20 establishes symmetric interactions with the 5' half of each strand, whereas TCP11 interacts mainly with the 3' half. SELEX (systematic evolution of ligands by exponential enrichment) experiments with TCP15 and TCP20 indicated that these proteins have similar, although not identical, DNA-binding preferences and are able to interact with non-palindromic binding sites of the type GTGGGNCCNN. TCP11 shows a different DNA-binding specificity, with a preference for the sequence GTGGGCCNNN. The distinct DNA-binding properties of TCP11 are due to the presence of a threonine residue at position 15 of the TCP domain, a position that is occupied by an arginine residue in most TCP proteins. TCP11 also forms heterodimers with TCP15 that have increased DNA-binding efficiency. The expression in plants of a repressor form of TCP11 demonstrated that this protein is a developmental regulator that influences the growth of leaves, stems and petioles, and pollen development. The results suggest that changes in DNA-binding preferences may be one of the mechanisms through which class I TCP proteins achieve functional specificity.
Concerted formation of macromolecular Suppressor–mutator transposition complexes

PubMed Central

Raina, Ramesh; Schläppi, Michael; Karunanandaa, Balasulojini; Elhofy, Adam; Fedoroff, Nina

1998-01-01

Transposition of the maize Suppressor–mutator (Spm) transposon requires two element-encoded proteins, TnpA and TnpD. Although there are multiple TnpA binding sites near each element end, binding of TnpA to DNA is not cooperative, and the binding affinity is not markedly affected by the number of binding sites per DNA fragment. However, intermolecular complexes form cooperatively between DNA fragments with three or more TnpA binding sites. TnpD, itself not a sequence-specific DNA-binding protein, binds to TnpA and stabilizes the TnpA–DNA complex. The high redundancy of TnpA binding sites at both element ends and the protein–protein interactions between DNA-bound TnpA complexes and between these and TnpD imply a concerted transition of the element from a linear to a protein crosslinked transposition complex within a very narrow protein concentration range. PMID:9671711
Analysis of the DNA-Binding Activities of the Arabidopsis R2R3-MYB Transcription Factor Family by One-Hybrid Experiments in Yeast

PubMed Central

Kelemen, Zsolt; Sebastian, Alvaro; Xu, Wenjia; Grain, Damaris; Salsac, Fabien; Avon, Alexandra; Berger, Nathalie; Tran, Joseph; Dubreucq, Bertrand; Lurin, Claire; Lepiniec, Loïc; Contreras-Moreira, Bruno; Dubos, Christian

2015-01-01

The control of growth and development of all living organisms is a complex and dynamic process that requires the harmonious expression of numerous genes. Gene expression is mainly controlled by the activity of sequence-specific DNA binding proteins called transcription factors (TFs). Amongst the various classes of eukaryotic TFs, the MYB superfamily is one of the largest and most diverse, and it has considerably expanded in the plant kingdom. R2R3-MYBs have been extensively studied over the last 15 years. However, DNA-binding specificity has been characterized for only a small subset of these proteins. Therefore, one of the remaining challenges is the exhaustive characterization of the DNA-binding specificity of all R2R3-MYB proteins. In this study, we have developed a library of Arabidopsis thaliana R2R3-MYB open reading frames, whose DNA-binding activities were assayed in vivo (yeast one-hybrid experiments) with a pool of selected cis-regulatory elements. Altogether 1904 interactions were assayed leading to the discovery of specific patterns of interactions between the various R2R3-MYB subgroups and their DNA target sequences and to the identification of key features that govern these interactions. The present work provides a comprehensive in vivo analysis of R2R3-MYB binding activities that should help in predicting new DNA motifs and identifying new putative target genes for each member of this very large family of TFs. In a broader perspective, the generated data will help to better understand how TF interact with their target DNA sequences. PMID:26484765
The GAGA protein of Drosophila is phosphorylated by CK2.

PubMed

Bonet, Carles; Fernández, Irene; Aran, Xavier; Bernués, Jordi; Giralt, Ernest; Azorín, Fernando

2005-08-19

The GAGA factor of Drosophila is a sequence-specific DNA-binding protein that contributes to multiple processes from the regulation of gene expression to the structural organisation of heterochromatin and chromatin remodelling. GAGA is known to interact with various other proteins (tramtrack, pipsqueak, batman and dSAP18) and protein complexes (PRC1, NURF and FACT). GAGA functions are likely regulated at the level of post-translational modifications. Little is known, however, about its actual pattern of modification. It was proposed that GAGA can be O-glycosylated. Here, we report that GAGA519 isoform is a phosphoprotein that is phosphorylated by CK2 at the region of the DNA-binding domain. Our results indicate that phosphorylation occurs at S388 and, to a lesser extent, at S378. These two residues are located in a region of the DNA-binding domain that makes no direct contact with DNA, being dispensable for sequence-specific recognition. Phosphorylation at these sites does not abolish DNA binding but reduces the affinity of the interaction. These results are discussed in the context of the various functions and interactions that GAGA supports.
Xenopus origin recognition complex (ORC) initiates DNA replication preferentially at sequences targeted by Schizosaccharomyces pombe ORC

PubMed Central

Kong, Daochun; Coleman, Thomas R.; DePamphilis, Melvin L.

2003-01-01

Budding yeast (Saccharomyces cerevisiae) origin recognition complex (ORC) requires ATP to bind specific DNA sequences, whereas fission yeast (Schizosaccharomyces pombe) ORC binds to specific, asymmetric A:T-rich sites within replication origins, independently of ATP, and frog (Xenopus laevis) ORC seems to bind DNA non-specifically. Here we show that despite these differences, ORCs are functionally conserved. Firstly, SpOrc1, SpOrc4 and SpOrc5, like those from other eukaryotes, bound ATP and exhibited ATPase activity, suggesting that ATP is required for pre-replication complex (pre-RC) assembly rather than origin specificity. Secondly, SpOrc4, which is solely responsible for binding SpORC to DNA, inhibited up to 70% of XlORC-dependent DNA replication in Xenopus egg extract by preventing XlORC from binding to chromatin and assembling pre-RCs. Chromatin-bound SpOrc4 was located at AT-rich sequences. XlORC in egg extract bound preferentially to asymmetric A:T-sequences in either bare DNA or in sperm chromatin, and it recruited XlCdc6 and XlMcm proteins to these sequences. These results reveal that XlORC initiates DNA replication preferentially at the same or similar sites to those targeted in S.pombe. PMID:12840006
In silico characterization and analysis of RTBP1 and NgTRF1 protein through MD simulation and molecular docking - A comparative study.

PubMed

Mukherjee, Koel; Pandey, Dev Mani; Vidyarthi, Ambarish Saran

2015-02-06

Gaining access to sequence and structure information of telomere binding proteins helps in understanding the essential biological processes involve in conserved sequence specific interaction between DNA and the proteins. Rice telomere binding protein (RTBP1) and Nicotiana glutinosa telomere repeat binding factor (NgTRF1) are helix turn helix motif type of proteins that plays role in telomeric DNA protection and length regulation. Both the proteins share same type of domain but till now there is very less communication on the in silico studies of these complete proteins.Here we intend to do a comparative study between two proteins through modeling of the complete proteins, physiochemical characterization, MD simulation and DNA-protein docking. I-TASSER and CLC protein work bench was performed to find out the protein 3D structure as well as the different parameters to characterize the proteins. MD simulation was completed by GROMOS forcefield of GROMACS for 10 ns of time stretch. The simulated 3D structures were docked with template DNA (3D DNA modeled through 3D-DART) of TTTAGGG conserved sequence motif using HADDOCK web server.Digging up all the facts about the proteins it was reveled that around 120 amino acids in the tail part was showing a good sequence similarity between the proteins. Molecular modeling, sequence characterization and secondary structure prediction also indicates the similarity between the protein's structure and sequence. The result of MD simulation highlights on the RMSD, RMSF, Rg, PCA and Energy plots which also conveys the similar type of motional behavior between them. The best complex formation for both the proteins in docking result also indicates for the first interaction site which is mainly the helix3 region of the DNA binding domain. The overall computational analysis reveals that RTBP1 and NgTRF1 proteins display good amount of similarity in their physicochemical properties, structure, dynamics and binding mode.
In Silico Characterization and Analysis of RTBP1 and NgTRF1 Protein Through MD Simulation and Molecular Docking: A Comparative Study.

PubMed

Mukherjee, Koel; Pandey, Dev Mani; Vidyarthi, Ambarish Saran

2015-09-01

Gaining access to sequence and structure information of telomere-binding proteins helps in understanding the essential biological processes involve in conserved sequence-specific interaction between DNA and the proteins. Rice telomere-binding protein (RTBP1) and Nicotiana glutinosa telomere repeat binding factor (NgTRF1) are helix-turn-helix motif type of proteins that plays role in telomeric DNA protection and length regulation. Both the proteins share same type of domain, but till now there is very less communication on the in silico studies of these complete proteins. Here we intend to do a comparative study between two proteins through modeling of the complete proteins, physiochemical characterization, MD simulation and DNA-protein docking. I-TASSER and CLC protein work bench was performed to find out the protein 3D structure as well as the different parameters to characterize the proteins. MD simulation was completed by GROMOS forcefield of GROMACS for 10 ns of time stretch. The simulated 3D structures were docked with template DNA (3D DNA modeled through 3D-DART) of TTTAGGG conserved sequence motif using HADDOCK Web server. By digging up all the facts about the proteins, it was revealed that around 120 amino acids in the tail part were showing a good sequence similarity between the proteins. Molecular modeling, sequence characterization and secondary structure prediction also indicate the similarity between the protein's structure and sequence. The result of MD simulation highlights on the RMSD, RMSF, Rg, PCA and energy plots which also conveys the similar type of motional behavior between them. The best complex formation for both the proteins in docking result also indicates for the first interaction site which is mainly the helix3 region of the DNA-binding domain. The overall computational analysis reveals that RTBP1 and NgTRF1 proteins display good amount of similarity in their physicochemical properties, structure, dynamics and binding mode.
The 87-kD A gamma-globin enhancer-binding protein is a product of the HOXB2(HOX2H) locus.

PubMed

Sengupta, P K; Lavelle, D E; DeSimone, J

1994-03-01

Developmental regulation of globin gene expression may be controlled by developmental stage-specific nuclear proteins that influence interactions between the locus control region and local regulatory sequences near individual globin genes. We previously isolated an 87-kD nuclear protein from K562 cells that bound to DNA sequences in the beta-globin locus control region, gamma-globin promoter, and A gamma-globin enhancer. The presence of this protein in fetal globin-expressing cells and its absence in adult globin-expressing cells suggested that it may be a developmental stage-specific factor. A lambda gt11 K562 cDNA clone encoding a portion of the HOXB2 (formerly HOX2H) homeobox gene was isolated on the basis of the ability of its beta-galactosidase fusion protein to bind to the same DNA sequences as the 87-kD K562 protein. Because no other relationship had been established between the 87-kD K562 protein and the HOXB2 protein other than their ability to bind ot the same DNA sequences, we have investigated whether the two proteins are related antigenically. Our data show that antisera produced against the HOXB2-beta-gal fusion protein and a synthetic HOXB2 decapeptide react specifically with an 87-kD protein from K562 nuclear extract, showing that the 87-kD K562 nuclear protein is a product of the HOXB2 locus, and is the first demonstration of cellular HOXB2 protein.
Computational Design of DNA-Binding Proteins.

PubMed

Thyme, Summer; Song, Yifan

2016-01-01

Predicting the outcome of engineered and naturally occurring sequence perturbations to protein-DNA interfaces requires accurate computational modeling technologies. It has been well established that computational design to accommodate small numbers of DNA target site substitutions is possible. This chapter details the basic method of design used in the Rosetta macromolecular modeling program that has been successfully used to modulate the specificity of DNA-binding proteins. More recently, combining computational design and directed evolution has become a common approach for increasing the success rate of protein engineering projects. The power of such high-throughput screening depends on computational methods producing multiple potential solutions. Therefore, this chapter describes several protocols for increasing the diversity of designed output. Lastly, we describe an approach for building comparative models of protein-DNA complexes in order to utilize information from homologous sequences. These models can be used to explore how nature modulates specificity of protein-DNA interfaces and potentially can even be used as starting templates for further engineering.
Electrophoretic mobility shift assay reveals a novel recognition sequence for Setaria italica NAC protein.

PubMed

Puranik, Swati; Kumar, Karunesh; Srivastava, Prem S; Prasad, Manoj

2011-10-01

The NAC (NAM/ATAF1,2/CUC2) proteins are among the largest family of plant transcription factors. Its members have been associated with diverse plant processes and intricately regulate the expression of several genes. Inspite of this immense progress, knowledge of their DNA-binding properties are still limited. In our recent publication,1 we reported isolation of a membrane-associated NAC domain protein from Setaria italica (SiNAC). Transactivation analysis revealed that it was a functionally active transcription factor as it could stimulate expression of reporter genes in vivo. Truncations of the transmembrane region of the protein lead to its nuclear localization. Here we describe expression and purification of SiNAC DNA-binding domain. We further report identification of a novel DNA-binding site, [C/G][A/T][T/A][G/C]TC[C/G][A/T][C/G][G/C] for SiNAC by electrophoretic mobility shift assay. The SiNAC-GST protein could bind to the NAC recognition sequence in vitro as well as to sequences where some bases had been reshuffled. The results presented here contribute to our understanding of the DNA-binding specificity of SiNAC protein.
Electrophoretic mobility shift assay reveals a novel recognition sequence for Setaria italica NAC protein

PubMed Central

Puranik, Swati; Kumar, Karunesh; Srivastava, Prem S

2011-01-01

The NAC (NAM/ATAF1,2/CUC2) proteins are among the largest family of plant transcription factors. Its members have been associated with diverse plant processes and intricately regulate the expression of several genes. Inspite of this immense progress, knowledge of their DNA-binding properties are still limited. In our recent publication,1 we reported isolation of a membrane-associated NAC domain protein from Setaria italica (SiNAC). Transactivation analysis revealed that it was a functionally active transcription factor as it could stimulate expression of reporter genes in vivo. Truncation of the transmembrane region of the protein lead to its nuclear localization. Here we describe expression and purification of SiNAC DNA-binding domain. We further report identification of a novel DNA-binding site, [C/G][A/T] [T/A][G/C]TC[C/G][A/T][C/G][G/C] for SiNAC by electrophoretic mobility shift assay. The SiNAC-GST protein could bind to the NAC recognition sequence in vitro as well as to sequences where some bases had been reshuffled. The results presented here contribute to our understanding of the DNA-binding specificity of SiNAC protein. PMID:21918373
Intramolecular control of transcriptional activity by the NK2-specific domain in NK-2 homeodomain proteins

PubMed Central

Watada, Hirotaka; Mirmira, Raghavendra G.; Kalamaras, Julie; German, Michael S.

2000-01-01

The developmentally important homeodomain transcription factors of the NK-2 class contain a highly conserved region, the NK2-specific domain (NK2-SD). The function of this domain, however, remains unknown. The primary structure of the NK2-SD suggests that it might function as an accessory DNA-binding domain or as a protein–protein interaction interface. To assess the possibility that the NK2-SD may contribute to DNA-binding specificity, we used a PCR-based approach to identify a consensus DNA-binding sequences for Nkx2.2, an NK-2 family member involved in pancreas and central nervous system development. The consensus sequence (TCTAAGTGAGCTT) is similar to the known binding sequences for other NK-2 homeodomain proteins, but we show that the NK2-SD does not contribute significantly to specific DNA binding to this sequence. To determine whether the NK2-SD contributes to transactivation, we used GAL4-Nkx2.2 fusion constructs to map a powerful transcriptional activation domain in the C-terminal region beyond the conserved NK2-SD. Interestingly, this C-terminal region functions as a transcriptional activator only in the absence of an intact NK2-SD. The NK2-SD also can mask transactivation from the paired homeodomain transcription factor Pax6, but it has no effect on transcription by itself. These results demonstrate that the NK2-SD functions as an intramolecular regulator of the C-terminal activation domain in Nkx2.2 and support a model in which interactions through the NK2-SD regulate the ability of NK-2-class proteins to activate specific genes during development. PMID:10944215
Molecular beacons for DNA binding proteins: an emerging technology for detection of DNA binding proteins and their ligands.

PubMed

Dummitt, Benjamin; Chang, Yie-Hwa

2006-06-01

Quantitation of the level or activity of specific proteins is one of the most commonly performed experiments in biomedical research. Protein detection has historically been difficult to adapt to high throughput platforms because of heavy reliance upon antibodies for protein detection. Molecular beacons for DNA binding proteins is a recently developed technology that attempts to overcome such limitations. Protein detection is accomplished using inexpensive, easy-to-synthesize oligonucleotides, accompanied by a fluorescence readout. Importantly, detection of the protein and reporting of the signal occur simultaneously, allowing for one-step protocols and increased potential for use in high throughput analysis. While the initial iteration of the technology allowed only for the detection of sequence-specific DNA binding proteins, more recent adaptations allow for the possibility of development of beacons for any protein, independent of native DNA binding activity. Here, we discuss the development of the technology, the mechanism of the reaction, and recent improvements and modifications made to improve the assay in terms of sensitivity, potential for multiplexing, and broad applicability.
RNA from the 5' end of the R2 retrotransposon controls R2 protein binding to and cleavage of its DNA target site.

PubMed

Christensen, Shawn M; Ye, Junqiang; Eickbush, Thomas H

2006-11-21

Non-LTR retrotransposons insert into eukaryotic genomes by target-primed reverse transcription (TPRT), a process in which cleaved DNA targets are used to prime reverse transcription of the element's RNA transcript. Many of the steps in the integration pathway of these elements can be characterized in vitro for the R2 element because of the rigid sequence specificity of R2 for both its DNA target and its RNA template. R2 retrotransposition involves identical subunits of the R2 protein bound to different DNA sequences upstream and downstream of the insertion site. The key determinant regulating which DNA-binding conformation the protein adopts was found to be a 320-nt RNA sequence from near the 5' end of the R2 element. In the absence of this 5' RNA the R2 protein binds DNA sequences upstream of the insertion site, cleaves the first DNA strand, and conducts TPRT when RNA containing the 3' untranslated region of the R2 transcript is present. In the presence of the 320-nt 5' RNA, the R2 protein binds DNA sequences downstream of the insertion site. Cleavage of the second DNA strand by the downstream subunit does not appear to occur until after the 5' RNA is removed from this subunit. We postulate that the removal of the 5' RNA normally occurs during reverse transcription, and thus provides a critical temporal link to first- and second-strand DNA cleavage in the R2 retrotransposition reaction.

The Fanconi anemia associated protein FAAP24 uses two substrate specific binding surfaces for DNA recognition

PubMed Central

Wienk, Hans; Slootweg, Jack C.; Speerstra, Sietske; Kaptein, Robert; Boelens, Rolf; Folkers, Gert E.

2013-01-01

To maintain the integrity of the genome, multiple DNA repair systems exist to repair damaged DNA. Recognition of altered DNA, including bulky adducts, pyrimidine dimers and interstrand crosslinks (ICL), partially depends on proteins containing helix-hairpin-helix (HhH) domains. To understand how ICL is specifically recognized by the Fanconi anemia proteins FANCM and FAAP24, we determined the structure of the HhH domain of FAAP24. Although it resembles other HhH domains, the FAAP24 domain contains a canonical hairpin motif followed by distorted motif. The HhH domain can bind various DNA substrates; using nuclear magnetic resonance titration experiments, we demonstrate that the canonical HhH motif is required for double-stranded DNA (dsDNA) binding, whereas the unstructured N-terminus can interact with single-stranded DNA. Both DNA binding surfaces are used for binding to ICL-like single/double-strand junction-containing DNA substrates. A structural model for FAAP24 bound to dsDNA has been made based on homology with the translesion polymerase iota. Site-directed mutagenesis, sequence conservation and charge distribution support the dsDNA-binding model. Analogous to other HhH domain-containing proteins, we suggest that multiple FAAP24 regions together contribute to binding to single/double-strand junction, which could contribute to specificity in ICL DNA recognition. PMID:23661679
[Screening specific recognition motif of RNA-binding proteins by SELEX in combination with next-generation sequencing technique].

PubMed

Zhang, Lu; Xu, Jinhao; Ma, Jinbiao

2016-07-25

RNA-binding protein exerts important biological function by specifically recognizing RNA motif. SELEX (Systematic evolution of ligands by exponential enrichment), an in vitro selection method, can obtain consensus motif with high-affinity and specificity for many target molecules from DNA or RNA libraries. Here, we combined SELEX with next-generation sequencing to study the protein-RNA interaction in vitro. A pool of RNAs with 20 bp random sequences were transcribed by T7 promoter, and target protein was inserted into plasmid containing SBP-tag, which can be captured by streptavidin beads. Through only one cycle, the specific RNA motif can be obtained, which dramatically improved the selection efficiency. Using this method, we found that human hnRNP A1 RRMs domain (UP1 domain) bound RNA motifs containing AGG and AG sequences. The EMSA experiment indicated that hnRNP A1 RRMs could bind the obtained RNA motif. Taken together, this method provides a rapid and effective method to study the RNA binding specificity of proteins.
Effect of NaeI-L43K mutation on protein dynamics and DNA conformation: Insights from molecular dynamics simulations.

PubMed

Ramachandrakurup, Sreelakshmi; Ramakrishnan, Vigneshwar

2017-09-01

Protein-DNA interactions are an important class of biomolecular interactions inside the cell. Delineating the mechanisms of protein-DNA interactions and more specifically, how proteins search and bind to their specific cognate sequences has been the quest of many in the scientific community. Restriction enzymes have served as useful model systems to this end. In this work, we have investigated using molecular dynamics simulations the effect of L43K mutation on NaeI, a type IIE restriction enzyme. NaeI has two domains, the Topo and the Endo domains, each binding to identical strands of DNA sequences (GCCGGC) 2 . The binding of the DNA to the Topo domain is thought to enhance the binding and cleavage of DNA at the Endo domain. Interestingly, it has been found that the mutation of an amino acid that is distantly-located from the DNA cleavage site (L43K) converts the restriction endonuclease to a topoisomerase. Our investigations reveal that the L43K mutation not only induces local structural changes (as evidenced by changes in hydrogen bond propensities and differences in the percentage of secondary structure assignments of the residues in the ligase-like domain) but also alters the overall protein dynamics and DNA conformation which probably leads to the loss of specific cleavage of the recognition site. In a larger context, our study underscores the importance of considering the role of distantly-located amino acids in understanding protein-DNA interactions. Copyright © 2017 Elsevier Inc. All rights reserved.
A duplex DNA-gold nanoparticle probe composed as a colorimetric biosensor for sequence-specific DNA-binding proteins.

PubMed

Ahn, Junho; Choi, Yeonweon; Lee, Ae-Ree; Lee, Joon-Hwa; Jung, Jong Hwa

2016-03-21

Using duplex DNA-AuNP aggregates, a sequence-specific DNA-binding protein, SQUAMOSA Promoter-binding-Like protein 12 (SPL-12), was directly determined by SPL-12-duplex DNA interaction-based colorimetric actions of DNA-Au assemblies. In order to prepare duplex DNA-Au aggregates, thiol-modified DNA 1 and DNA 2 were attached onto the surface of AuNPs, respectively, by the salt-aging method and then the DNA-attached AuNPs were mixed. Duplex-DNA-Au aggregates having the average size of 160 nm diameter and the maximum absorption at 529 nm were able to recognize SPL-12 and reached the equivalent state by the addition of ∼30 equivalents of SPL-12 accompanying a color change from red to blue with a red shift of the maximum absorption at 570 nm. As a result, the aggregation size grew to about 247 nm. Also, at higher temperatures of the mixture of duplex-DNA-Au aggregate solution and SPL-12, the equivalent state was reached rapidly. On the contrary, in the control experiment using Bovine Serum Albumin (BSA), no absorption band shift of duplex-DNA-Au aggregates was observed.
TALE proteins search DNA using a rotationally decoupled mechanism.

PubMed

Cuculis, Luke; Abil, Zhanar; Zhao, Huimin; Schroeder, Charles M

2016-10-01

Transcription activator-like effector (TALE) proteins are a class of programmable DNA-binding proteins used extensively for gene editing. Despite recent progress, however, little is known about their sequence search mechanism. Here, we use single-molecule experiments to study TALE search along DNA. Our results show that TALEs utilize a rotationally decoupled mechanism for nonspecific search, despite remaining associated with DNA templates during the search process. Our results suggest that the protein helical structure enables TALEs to adopt a loosely wrapped conformation around DNA templates during nonspecific search, facilitating rapid one-dimensional (1D) diffusion under a range of solution conditions. Furthermore, this model is consistent with a previously reported two-state mechanism for TALE search that allows these proteins to overcome the search speed-stability paradox. Taken together, our results suggest that TALE search is unique among the broad class of sequence-specific DNA-binding proteins and supports efficient 1D search along DNA.
UniPROBE, update 2015: new tools and content for the online database of protein-binding microarray data on protein-DNA interactions.

PubMed

Hume, Maxwell A; Barrera, Luis A; Gisselbrecht, Stephen S; Bulyk, Martha L

2015-01-01

The Universal PBM Resource for Oligonucleotide Binding Evaluation (UniPROBE) serves as a convenient source of information on published data generated using universal protein-binding microarray (PBM) technology, which provides in vitro data about the relative DNA-binding preferences of transcription factors for all possible sequence variants of a length k ('k-mers'). The database displays important information about the proteins and displays their DNA-binding specificity data in terms of k-mers, position weight matrices and graphical sequence logos. This update to the database documents the growth of UniPROBE since the last update 4 years ago, and introduces a variety of new features and tools, including a new streamlined pipeline that facilitates data deposition by universal PBM data generators in the research community, a tool that generates putative nonbinding (i.e. negative control) DNA sequences for one or more proteins and novel motifs obtained by analyzing the PBM data using the BEEML-PBM algorithm for motif inference. The UniPROBE database is available at http://uniprobe.org. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
The Drosophila telomere-capping protein Verrocchio binds single-stranded DNA and protects telomeres from DNA damage response

PubMed Central

Cicconi, Alessandro; Micheli, Emanuela; Vernì, Fiammetta; Jackson, Alison; Gradilla, Ana Citlali; Cipressa, Francesca; Raimondo, Domenico; Bosso, Giuseppe; Wakefield, James G.; Ciapponi, Laura; Cenci, Giovanni; Gatti, Maurizio

2017-01-01

Abstract Drosophila telomeres are sequence-independent structures maintained by transposition to chromosome ends of three specialized retroelements rather than by telomerase activity. Fly telomeres are protected by the terminin complex that includes the HOAP, HipHop, Moi and Ver proteins. These are fast evolving, non-conserved proteins that localize and function exclusively at telomeres, protecting them from fusion events. We have previously suggested that terminin is the functional analogue of shelterin, the multi-protein complex that protects human telomeres. Here, we use electrophoretic mobility shift assay (EMSA) and atomic force microscopy (AFM) to show that Ver preferentially binds single-stranded DNA (ssDNA) with no sequence specificity. We also show that Moi and Ver form a complex in vivo. Although these two proteins are mutually dependent for their localization at telomeres, Moi neither binds ssDNA nor facilitates Ver binding to ssDNA. Consistent with these results, we found that Ver-depleted telomeres form RPA and γH2AX foci, like the human telomeres lacking the ssDNA-binding POT1 protein. Collectively, our findings suggest that Drosophila telomeres possess a ssDNA overhang like the other eukaryotes, and that the terminin complex is architecturally and functionally similar to shelterin. PMID:27940556
Generalized theory on the mechanism of site-specific DNA-protein interactions

NASA Astrophysics Data System (ADS)

Niranjani, G.; Murugan, R.

2016-05-01

We develop a generalized theoretical framework on the binding of transcription factor proteins (TFs) with specific sites on DNA that takes into account the interplay of various factors regarding overall electrostatic potential at the DNA-protein interface, occurrence of kinetic traps along the DNA sequence, presence of other roadblock protein molecules along DNA and crowded environment, conformational fluctuations in the DNA binding domains (DBDs) of TFs, and the conformational state of the DNA. Starting from a Smolochowski type theoretical framework on site-specific binding of TFs we logically build our model by adding the effects of these factors one by one. Our generalized two-step model suggests that the electrostatic attractive forces present inbetween the positively charged DBDs of TFs and the negatively charged phosphate backbone of DNA, along with the counteracting shielding effects of solvent ions, is the core factor that creates a fluidic type environment at the DNA-protein interface. This in turn facilitates various one-dimensional diffusion (1Dd) processes such as sliding, hopping and intersegmental transfers. These facilitating processes as well as flipping dynamics of conformational states of DBDs of TFs between stationary and mobile states can enhance the 1Dd coefficient on a par with three-dimensional diffusion (3Dd). The random coil conformation of DNA also plays critical roles in enhancing the site-specific association rate. The extent of enhancement over the 3Dd controlled rate seems to be directly proportional to the maximum possible 1Dd length. We show that the overall site-specific binding rate scales with the length of DNA in an asymptotic way. For relaxed DNA, the specific binding rate will be independent of the length of DNA as length increases towards infinity. For condensed DNA as in in vivo conditions, the specific binding rate depends on the length of DNA in a turnover way with a maximum. This maximum rate seems to scale with the maximum possible 1Dd length of TFs in a square root manner. Results suggest that 1Dd processes contribute much less to the enhancement of specific binding rate under in vivo conditions for condensed DNA. There exists a critical length of binding stretch of TFs beyond which the probability associated with the random occurrence of similar specific binding sites will be close to zero. TFs in natural systems from prokaryotes to eukaryotes seem to handle sequence-mediated kinetic traps via increasing the length of their recognition stretch or combinatorial binding. TFs overcome the hurdles of roadblocks via switching efficiently between sliding, hopping and intersegmental transfer modes. The site-specific binding rate as well as the maximum possible 1Dd length seem to be directly proportional to the square root of the probability (p R) of finding a nonspecific binding site to be free from dynamic roadblocks. Here p R seems to be a function of the number of nsbs available per DNA binding protein (ϕ) inside the living cell. It seems that p R > 0.8 when ϕ > 10 which is true for the Escherichia coli cell system.
Fusion of GFP to the M.EcoKI DNA methyltransferase produces a new probe of Type I DNA restriction and modification enzymes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chen, Kai; Roberts, Gareth A.; Stephanou, Augoustinos S.

2010-07-23

Research highlights: {yields} Successful fusion of GFP to M.EcoKI DNA methyltransferase. {yields} GFP located at C-terminal of sequence specificity subunit does not later enzyme activity. {yields} FRET confirms structural model of M.EcoKI bound to DNA. -- Abstract: We describe the fusion of enhanced green fluorescent protein to the C-terminus of the HsdS DNA sequence-specificity subunit of the Type I DNA modification methyltransferase M.EcoKI. The fusion expresses well in vivo and assembles with the two HsdM modification subunits. The fusion protein functions as a sequence-specific DNA methyltransferase protecting DNA against digestion by the EcoKI restriction endonuclease. The purified enzyme shows Foerstermore » resonance energy transfer to fluorescently-labelled DNA duplexes containing the target sequence and to fluorescently-labelled ocr protein, a DNA mimic that binds to the M.EcoKI enzyme. Distances determined from the energy transfer experiments corroborate the structural model of M.EcoKI.« less
Two-step interrogation then recognition of DNA binding site by Integration Host Factor: an architectural DNA-bending protein.

PubMed

Velmurugu, Yogambigai; Vivas, Paula; Connolly, Mitchell; Kuznetsov, Serguei V; Rice, Phoebe A; Ansari, Anjum

2018-02-28

The dynamics and mechanism of how site-specific DNA-bending proteins initially interrogate potential binding sites prior to recognition have remained elusive for most systems. Here we present these dynamics for Integration Host factor (IHF), a nucleoid-associated architectural protein, using a μs-resolved T-jump approach. Our studies show two distinct DNA-bending steps during site recognition by IHF. While the faster (∼100 μs) step is unaffected by changes in DNA or protein sequence that alter affinity by >100-fold, the slower (1-10 ms) step is accelerated ∼5-fold when mismatches are introduced at DNA sites that are sharply kinked in the specific complex. The amplitudes of the fast phase increase when the specific complex is destabilized and decrease with increasing [salt], which increases specificity. Taken together, these results indicate that the fast phase is non-specific DNA bending while the slow phase, which responds only to changes in DNA flexibility at the kink sites, is specific DNA kinking during site recognition. Notably, the timescales for the fast phase overlap with one-dimensional diffusion times measured for several proteins on DNA, suggesting that these dynamics reflect partial DNA bending during interrogation of potential binding sites by IHF as it scans DNA.
On the connection between inherent DNA flexure and preferred binding of hydroxymethyluracil-containing DNA by the type II DNA-binding protein TF1.

PubMed

Grove, A; Galeone, A; Mayol, L; Geiduschek, E P

1996-07-12

TF1 is a member of the family of type II DNA-binding proteins, which also includes the bacterial HU proteins and the Escherichia coli integration host factor (IHF). Distinctive to TF1, which is encoded by the Bacillus subtilis bacteriophage SPO1, is its preferential binding to DNA in which thymine is replaced by 5-hydroxymethyluracil (hmU), as it is in the phage genome. TF1 binds to preferred sites within the phage genome and generates pronounced DNA bending. The extent to which DNA flexibility contributes to the sequence-specific binding of TF1, and the connection between hmU preference and DNA flexibility has been examined. Model flexible sites, consisting of consecutive mismatches, increase the affinity of thymine-containing DNA for TF1. In particular, tandem mismatches separated by nine base-pairs generate an increase, by orders of magnitude, in the affinity of TF1 for T-containing DNA with the sequence of a preferred TF1 binding site, and fully match the affinity of TF1 for this cognate site in hmU-containing DNA (Kd approximately 3 nM). Other placements of loops generate suboptimal binding. This is consistent with a significant contribution of site-specific DNA flexibility to complex formation. Analysis of complexes with hmU-DNA of decreasing length shows that a major part of the binding affinity is generated within a central 19 bp segment (delta G0 = 41.7 kJ mol-1) with more-distal DNA contributing modestly to the affinity (delta delta G = -0.42 kJ mol-1 bp-1 on increasing duplex length to 37 bp). However, a previously characterised thermostable and more tightly binding mutant TF1, TF1(E15G/T32I), derives most of its extra affinity from interaction with flanking DNA. We propose that inherent but sequence-dependent deformability of hmU-containing DNA underlies the preferential binding of TF1 and that TF1-induced DNA bendings is a result of distortions at two distinct sites separated by 9 bp of duplex DNA.
DNA-binding regulates site-specific ubiquitination of IRF-1.

PubMed

Landré, Vivien; Pion, Emmanuelle; Narayan, Vikram; Xirodimas, Dimitris P; Ball, Kathryn L

2013-02-01

Understanding the determinants for site-specific ubiquitination by E3 ligase components of the ubiquitin machinery is proving to be a challenge. In the present study we investigate the role of an E3 ligase docking site (Mf2 domain) in an intrinsically disordered domain of IRF-1 [IFN (interferon) regulatory factor-1], a short-lived IFNγ-regulated transcription factor, in ubiquitination of the protein. Ubiquitin modification of full-length IRF-1 by E3 ligases such as CHIP [C-terminus of the Hsc (heat-shock cognate) 70-interacting protein] and MDM2 (murine double minute 2), which dock to the Mf2 domain, was specific for lysine residues found predominantly in loop structures that extend from the DNA-binding domain, whereas no modification was detected in the more conformationally flexible C-terminal half of the protein. The E3 docking site was not available when IRF-1 was in its DNA-bound conformation and cognate DNA-binding sequences strongly suppressed ubiquitination, highlighting a strict relationship between ligase binding and site-specific modification at residues in the DNA-binding domain. Hyperubiquitination of a non-DNA-binding mutant supports a mechanism where an active DNA-bound pool of IRF-1 is protected from polyubiquitination and degradation.
Heterogeneous RNA-binding protein M4 is a receptor for carcinoembryonic antigen in Kupffer cells.

PubMed

Bajenova, O V; Zimmer, R; Stolper, E; Salisbury-Rowswell, J; Nanji, A; Thomas, P

2001-08-17

Here we report the isolation of the recombinant cDNA clone from rat macrophages, Kupffer cells (KC) that encodes a protein interacting with carcinoembryonic antigen (CEA). To isolate and identify the CEA receptor gene we used two approaches: screening of a KC cDNA library with a specific antibody and the yeast two-hybrid system for protein interaction using as a bait the N-terminal part of the CEA encoding the binding site. Both techniques resulted in the identification of the rat heterogeneous RNA-binding protein (hnRNP) M4 gene. The rat ortholog cDNA sequence has not been previously described. The open reading frame for this gene contains a 2351-base pair sequence with the polyadenylation signal AATAAA and a termination poly(A) tail. The mRNA shows ubiquitous tissue expression as a 2.4-kilobase transcript. The deduced amino acid sequence comprised a 78-kDa membrane protein with 3 putative RNA-binding domains, arginine/methionine/glutamine-rich C terminus and 3 potential membrane spanning regions. When hnRNP M4 protein is expressed in pGEX4T-3 vector system in Escherichia coli it binds (125)I-labeled CEA in a Ca(2+)-dependent fashion. Transfection of rat hnRNP M4 cDNA into a non-CEA binding mouse macrophage cell line p388D1 resulted in CEA binding. These data provide evidence for a new function of hnRNP M4 protein as a CEA-binding protein in Kupffer cells.
Identification of DNA-binding proteins by combining auto-cross covariance transformation and ensemble learning.

PubMed

Liu, Bin; Wang, Shanyi; Dong, Qiwen; Li, Shumin; Liu, Xuan

2016-04-20

DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. With the rapid development of next generation of sequencing technique, the number of protein sequences is unprecedentedly increasing. Thus it is necessary to develop computational methods to identify the DNA-binding proteins only based on the protein sequence information. In this study, a novel method called iDNA-KACC is presented, which combines the Support Vector Machine (SVM) and the auto-cross covariance transformation. The protein sequences are first converted into profile-based protein representation, and then converted into a series of fixed-length vectors by the auto-cross covariance transformation with Kmer composition. The sequence order effect can be effectively captured by this scheme. These vectors are then fed into Support Vector Machine (SVM) to discriminate the DNA-binding proteins from the non DNA-binding ones. iDNA-KACC achieves an overall accuracy of 75.16% and Matthew correlation coefficient of 0.5 by a rigorous jackknife test. Its performance is further improved by employing an ensemble learning approach, and the improved predictor is called iDNA-KACC-EL. Experimental results on an independent dataset shows that iDNA-KACC-EL outperforms all the other state-of-the-art predictors, indicating that it would be a useful computational tool for DNA binding protein identification. .
DNA breathing dynamics distinguish binding from nonbinding consensus sites for transcription factor YY1 in cells.

PubMed

Alexandrov, Boian S; Fukuyo, Yayoi; Lange, Martin; Horikoshi, Nobuo; Gelev, Vladimir; Rasmussen, Kim Ø; Bishop, Alan R; Usheva, Anny

2012-11-01

The genome-wide mapping of the major gene expression regulators, the transcription factors (TFs) and their DNA binding sites, is of great importance for describing cellular behavior and phenotypic diversity. Presently, the methods for prediction of genomic TF binding produce a large number of false positives, most likely due to insufficient description of the physiochemical mechanisms of protein-DNA binding. Growing evidence suggests that, in the cell, the double-stranded DNA (dsDNA) is subject to local transient strands separations (breathing) that contribute to genomic functions. By using site-specific chromatin immunopecipitations, gel shifts, BIOBASE data, and our model that accurately describes the melting behavior and breathing dynamics of dsDNA we report a specific DNA breathing profile found at YY1 binding sites in cells. We find that the genomic flanking sequence variations and SNPs, may exert long-range effects on DNA dynamics and predetermine YY1 binding. The ubiquitous TF YY1 has a fundamental role in essential biological processes by activating, initiating or repressing transcription depending upon the sequence context it binds. We anticipate that consensus binding sequences together with the related DNA dynamics profile may significantly improve the accuracy of genomic TF binding sites and TF binding-related functional SNPs.
Expression of simian virus 40 T antigen in Escherichia coli: localization of T-antigen origin DNA-binding domain to within 129 amino acids.

PubMed Central

Arthur, A K; Höss, A; Fanning, E

1988-01-01

The genomic coding sequence of the large T antigen of simian virus 40 (SV40) was cloned into an Escherichia coli expression vector by joining new restriction sites, BglII and BamHI, introduced at the intron boundaries of the gene. Full-length large T antigen, as well as deletion and amino acid substitution mutants, were inducibly expressed from the lac promoter of pUC9, albeit with different efficiencies and protein stabilities. Specific interaction with SV40 origin DNA was detected for full-length T antigen and certain mutants. Deletion mutants lacking T-antigen residues 1 to 130 and 260 to 708 retained specific origin-binding activity, demonstrating that the region between residues 131 and 259 must carry the essential binding domain for DNA-binding sites I and II. A sequence between residues 302 and 320 homologous to a metal-binding "finger" motif is therefore not required for origin-specific binding. However, substitution of serine for either of two cysteine residues in this motif caused a dramatic decrease in origin DNA-binding activity. This region, as well as other regions of the full-length protein, may thus be involved in stabilizing the DNA-binding domain and altering its preference for binding to site I or site II DNA. Images PMID:2835505
Engineering and Application of Zinc Finger Proteins and TALEs for Biomedical Research.

PubMed

Kim, Moon-Soo; Kini, Anu Ganesh

2017-08-01

Engineered DNA-binding domains provide a powerful technology for numerous biomedical studies due to their ability to recognize specific DNA sequences. Zinc fingers (ZF) are one of the most common DNA-binding domains and have been extensively studied for a variety of applications, such as gene regulation, genome engineering and diagnostics. Another novel DNA-binding domain known as a transcriptional activator-like effector (TALE) has been more recently discovered, which has a previously undescribed DNA-binding mode. Due to their modular architecture and flexibility, TALEs have been rapidly developed into artificial gene targeting reagents. Here, we describe the methods used to design these DNA-binding proteins and their key applications in biomedical research.
Escherichia coli ArgR mutants defective in cer/Xer recombination, but not in DNA binding.

PubMed

Sénéchal, Hélène; Delesques, Jérémy; Szatmari, George

2010-04-01

The Escherichia coli arginine repressor (ArgR) is an L-arginine-dependent DNA-binding protein that controls the expression of the arginine biosynthetic genes and is required as an accessory factor for Xer site-specific recombination at cer and related recombination sites in plasmids. We used the technique of pentapeptide scanning mutagenesis to isolate a series of ArgR mutants that were considerably reduced in cer recombination, but were still able to repress an argA::lacZ fusion. DNA sequence analysis showed that all of the mutants mapped to the same nucleotide, resulting in a five amino acid insertion between residues 149 and 150 of ArgR, corresponding to the end of the alpha6 helix. A truncated ArgR containing a stop codon at residue 150 displayed the same phenotype as the protein with the five amino acid insertion, and both mutants displayed sequence-specific DNA-binding activity that was L-arginine dependent. These results show that the C-terminus of ArgR is more important in cer/Xer site-specific recombination than in DNA binding.
Strong minor groove base conservation in sequence logos implies DNA distortion or base flipping during replication and transcription initiation.

PubMed

Schneider, T D

2001-12-01

The sequence logo for DNA binding sites of the bacteriophage P1 replication protein RepA shows unusually high sequence conservation ( approximately 2 bits) at a minor groove that faces RepA. However, B-form DNA can support only 1 bit of sequence conservation via contacts into the minor groove. The high conservation in RepA sites therefore implies a distorted DNA helix with direct or indirect contacts to the protein. Here I show that a high minor groove conservation signature also appears in sequence logos of sites for other replication origin binding proteins (Rts1, DnaA, P4 alpha, EBNA1, ORC) and promoter binding proteins (sigma(70), sigma(D) factors). This finding implies that DNA binding proteins generally use non-B-form DNA distortion such as base flipping to initiate replication and transcription.
Making the Bend: DNA Tertiary Structure and Protein-DNA Interactions

PubMed Central

Harteis, Sabrina; Schneider, Sabine

2014-01-01

DNA structure functions as an overlapping code to the DNA sequence. Rapid progress in understanding the role of DNA structure in gene regulation, DNA damage recognition and genome stability has been made. The three dimensional structure of both proteins and DNA plays a crucial role for their specific interaction, and proteins can recognise the chemical signature of DNA sequence (“base readout”) as well as the intrinsic DNA structure (“shape recognition”). These recognition mechanisms do not exist in isolation but, depending on the individual interaction partners, are combined to various extents. Driving force for the interaction between protein and DNA remain the unique thermodynamics of each individual DNA-protein pair. In this review we focus on the structures and conformations adopted by DNA, both influenced by and influencing the specific interaction with the corresponding protein binding partner, as well as their underlying thermodynamics. PMID:25026169

Molecular mechanisms of floral organ specification by MADS domain proteins.

PubMed

Yan, Wenhao; Chen, Dijun; Kaufmann, Kerstin

2016-02-01

Flower development is a model system to understand organ specification in plants. The identities of different types of floral organs are specified by homeotic MADS transcription factors that interact in a combinatorial fashion. Systematic identification of DNA-binding sites and target genes of these key regulators show that they have shared and unique sets of target genes. DNA binding by MADS proteins is not based on 'simple' recognition of a specific DNA sequence, but depends on DNA structure and combinatorial interactions. Homeotic MADS proteins regulate gene expression via alternative mechanisms, one of which may be to modulate chromatin structure and accessibility in their target gene promoters. Copyright © 2015 Elsevier Ltd. All rights reserved.
Method for nucleic acid hybridization using single-stranded DNA binding protein

DOEpatents

Tabor, Stanley; Richardson, Charles C.

1996-01-01

Method of nucleic acid hybridization for detecting the presence of a specific nucleic acid sequence in a population of different nucleic acid sequences using a nucleic acid probe. The nucleic acid probe hybridizes with the specific nucleic acid sequence but not with other nucleic acid sequences in the population. The method includes contacting a sample (potentially including the nucleic acid sequence) with the nucleic acid probe under hybridizing conditions in the presence of a single-stranded DNA binding protein provided in an amount which stimulates renaturation of a dilute solution (i.e., one in which the t.sub.1/2 of renaturation is longer than 3 weeks) of single-stranded DNA greater than 500 fold (i.e., to a t.sub.1/2 less than 60 min, preferably less than 5 min, and most preferably about 1 min.) in the absence of nucleotide triphosphates.
Unusual Characteristics of the DNA Binding Domain of Epigenetic Regulatory Protein MeCP2 Determine Its Binding Specificity

PubMed Central

2015-01-01

The protein MeCP2 mediates epigenetic regulation by binding methyl-CpG (mCpG) sites on chromatin. MeCP2 consists of six domains of which one, the methyl binding domain (MBD), binds mCpG sites in duplex DNA. We show that solution conditions with physiological or greater salt concentrations or the presence of nonspecific competitor DNA is necessary for the MBD to discriminate mCpG from CpG with high specificity. The specificity for mCpG over CpG is >100-fold under these solution conditions. In contrast, the MBD does not discriminate hydroxymethyl-CpG from CpG. The MBD is unusual among site-specific DNA binding proteins in that (i) specificity is not conferred by the enhanced affinity for the specific site but rather by suppression of its affinity for generic DNA, (ii) its specific binding to mCpG is highly electrostatic, and (iii) it takes up as well as displaces monovalent cations upon DNA binding. The MBD displays an unusually high affinity for single-stranded DNA independent of modification or sequence. In addition, the MBD forms a discrete dimer on DNA via a noncooperative binding pathway. Because the affinity of the second monomer is 1 order of magnitude greater than that of nonspecific binding, the MBD dimer is a unique molecular complex. The significance of these results in the context of neuronal function and development and MeCP2-related developmental disorders such as Rett syndrome is discussed. PMID:24828757
Electrophoretic mobility shift scanning using an automated infrared DNA sequencer.

PubMed

Sano, M; Ohyama, A; Takase, K; Yamamoto, M; Machida, M

2001-11-01

Electrophoretic mobility shift assay (EMSA) is widely used in the study of sequence-specific DNA-binding proteins, including transcription factors and mismatch binding proteins. We have established a non-radioisotope-based protocol for EMSA that features an automated DNA sequencer with an infrared fluorescent dye (IRDye) detection unit. Our modification of the elec- trophoresis unit, which includes cooling the gel plates with a reduced well-to-read length, has made it possible to detect shifted bands within 1 h. Further, we have developed a rapid ligation-based method for generating IRDye-labeled probes with an approximately 60% cost reduction. This method has the advantages of real-time scanning, stability of labeled probes, and better safety associated with nonradioactive methods of detection. Analysis of a promoter from an industrially important filamentous fungus, Aspergillus oryzae, in a prototype experiment revealed that the method we describe has potential for use in systematic scanning and identification of the functionally important elements to which cellular factors bind in a sequence-specific manner.
Cyclosporin A and FK-506 both affect DNA binding of regulatory nuclear proteins to the human interleukin-2 promoter.

PubMed

Baumann, G; Geisse, S; Sullivan, M

1991-03-01

The structurally unrelated immunosuppressive drugs cyclosporin A (Sandimmun) and FK-506 both interfere with the process of T-cell proliferation by blocking the transcription of the T-cell growth factor interleukin-2 (IL-2). Here we demonstrate that the transcriptional activation of this gene requires the binding of regulatory nuclear proteins to a promoter element with sequence similarity to the consensus binding site for NF-kappa B-related transcription factors. We present evidence that the binding by regulatory nuclear proteins to the kappa B element of the IL-2 promoter is affected negatively by cyclosporin A and FK-506 at concentrations paralleling their immunosuppressive activity in vivo. The decrease in DNA-protein complex formation induced by the immunosuppressive drugs correlates with a decrease in IL-2 production. FK-506 is 10 to 100 times more potent than cyclosporin A in its ability to inhibit sequence-specific DNA binding and IL-2 production. Our findings suggest that the actions of both drugs converge at the level of DNA-protein interaction.
An ultrasensitive label-free biosensor for assaying of sequence-specific DNA-binding protein based on amplifying fluorescent conjugated polymer.

PubMed

Liu, Xingfen; Ouyang, Lan; Cai, Xiaohui; Huang, Yanqin; Feng, Xiaomiao; Fan, Quli; Huang, Wei

2013-03-15

Sensitive, reliable, and simple detection of sequence-specific DNA-binding proteins (DBP) is of paramount importance in the area of proteomics, genomics, and biomedicine. We describe herein a novel fluorescent-amplified strategy for ultrasensitive, visual, quantitative, and "turn-on" detection of DBP. A Förster resonance energy transfer (FRET) assay utilizing a cationic conjugated polymer (CCP) and an intercalating dye was designed to detect a key transcription factor, nuclear factor-kappa B (NF-κB), the model target. A series of label-free DNA probes bearing one or two protein-binding sites (PBS) were used to identify the target protein specifically. The binding DBP protects the probe from digestion by exonuclease III, resulting in high efficient FRET due to the high affinity between the intercalating dye and duplex DNA, as well as strong electrostatic interactions between the CCP and DNA probe. By using label-free hairpin DNA or double-stranded DNA containing two PBS as probe, we could detect as low as 1 pg/μL of NF-κB in HeLa nuclear extracts, which is 10000-fold more sensitive than the previously reported methods. The approach also allows naked-eye detection by observing fluorescent color of solutions with the assistance of a hand-held UV lamp. Additionally, a less than 10% relative standard deviation was obtained, which offers a new platform for superior precision, low-cost, and simple detection of DBP. The features of our optical biosensor shows promising potential for early diagnosis of many diseases and high-throughput screening of new drugs targeted to DNA-binding proteins. Copyright © 2012 Elsevier B.V. All rights reserved.
Protein Cofactors Are Essential for High-Affinity DNA Binding by the Nuclear Factor κB RelA Subunit.

PubMed

Mulero, Maria Carmen; Shahabi, Shandy; Ko, Myung Soo; Schiffer, Jamie M; Huang, De-Bin; Wang, Vivien Ya-Fan; Amaro, Rommie E; Huxford, Tom; Ghosh, Gourisankar

2018-05-22

Transcription activator proteins typically contain two functional domains: a DNA binding domain (DBD) that binds to DNA with sequence specificity and an activation domain (AD) whose established function is to recruit RNA polymerase. In this report, we show that purified recombinant nuclear factor κB (NF-κB) RelA dimers bind specific κB DNA sites with an affinity significantly lower than that of the same dimers from nuclear extracts of activated cells, suggesting that additional nuclear cofactors might facilitate DNA binding by the RelA dimers. Additionally, recombinant RelA binds DNA with relatively low affinity at a physiological salt concentration in vitro. The addition of p53 or RPS3 (ribosomal protein S3) increases RelA:DNA binding affinity 2- to >50-fold depending on the protein and ionic conditions. These cofactor proteins do not form stable ternary complexes, suggesting that they stabilize the RelA:DNA complex through dynamic interactions. Surprisingly, the RelA-DBD alone fails to bind DNA under the same solution conditions even in the presence of cofactors, suggesting an important role of the RelA-AD in DNA binding. Reduced RelA:DNA binding at a physiological ionic strength suggests that multiple cofactors might be acting simultaneously to mitigate the electrolyte effect and stabilize the RelA:DNA complex in vivo. Overall, our observations suggest that the RelA-AD and multiple cofactor proteins function cooperatively to prime the RelA-DBD and stabilize the RelA:DNA complex in cells. Our study provides a mechanism for nuclear cofactor proteins in NF-κB-dependent gene regulation.
Probing the electrostatics and pharmacologic modulation of sequence-specific binding by the DNA-binding domain of the ETS-family transcription factor PU.1: a binding affinity and kinetics investigation

PubMed Central

Munde, Manoj; Poon, Gregory M. K.; Wilson, W. David

2013-01-01

Members of the ETS family of transcription factors regulate a functionally diverse array of genes. All ETS proteins share a structurally-conserved but sequence-divergent DNA-binding domain, known as the ETS domain. Although the structure and thermodynamics of the ETS-DNA complexes are well known, little is known about the kinetics of sequence recognition, a facet that offers potential insight into its molecular mechanism. We have characterized DNA binding by the ETS domain of PU.1 by biosensor-surface plasmon resonance (SPR). SPR analysis revealed a striking kinetic profile for DNA binding by the PU.1 ETS domain. At low salt concentrations, it binds high-affinity cognate DNA with a very slow association rate constant (≤105 M−1 s−1), compensated by a correspondingly small dissociation rate constant. The kinetics are strongly salt-dependent but mutually balance to produce a relatively weak dependence in the equilibrium constant. This profile contrasts sharply with reported data for other ETS domains (e.g., Ets-1, TEL) for which high-affinity binding is driven by rapid association (>107 M−1 s−1). We interpret this difference in terms of the hydration properties of ETS-DNA binding and propose that at least two mechanisms of sequence recognition are employed by this family of DNA-binding domain. Additionally, we use SPR to demonstrate the potential for pharmacological inhibition of sequence-specific ETS-DNA binding, using the minor groove-binding distamycin as a model compound. Our work establishes SPR as a valuable technique for extending our understanding of the molecular mechanisms of ETS-DNA interactions as well as developing potential small-molecule agents for biotechnological and therapeutic purposes. PMID:23416556
Improved detection of DNA-binding proteins via compression technology on PSSM information.

PubMed

Wang, Yubo; Ding, Yijie; Guo, Fei; Wei, Leyi; Tang, Jijun

2017-01-01

Since the importance of DNA-binding proteins in multiple biomolecular functions has been recognized, an increasing number of researchers are attempting to identify DNA-binding proteins. In recent years, the machine learning methods have become more and more compelling in the case of protein sequence data soaring, because of their favorable speed and accuracy. In this paper, we extract three features from the protein sequence, namely NMBAC (Normalized Moreau-Broto Autocorrelation), PSSM-DWT (Position-specific scoring matrix-Discrete Wavelet Transform), and PSSM-DCT (Position-specific scoring matrix-Discrete Cosine Transform). We also employ feature selection algorithm on these feature vectors. Then, these features are fed into the training SVM (support vector machine) model as classifier to predict DNA-binding proteins. Our method applys three datasets, namely PDB1075, PDB594 and PDB186, to evaluate the performance of our approach. The PDB1075 and PDB594 datasets are employed for Jackknife test and the PDB186 dataset is used for the independent test. Our method achieves the best accuracy in the Jacknife test, from 79.20% to 86.23% and 80.5% to 86.20% on PDB1075 and PDB594 datasets, respectively. In the independent test, the accuracy of our method comes to 76.3%. The performance of independent test also shows that our method has a certain ability to be effectively used for DNA-binding protein prediction. The data and source code are at https://doi.org/10.6084/m9.figshare.5104084.
Prediction of TF target sites based on atomistic models of protein-DNA complexes

PubMed Central

Angarica, Vladimir Espinosa; Pérez, Abel González; Vasconcelos, Ana T; Collado-Vides, Julio; Contreras-Moreira, Bruno

2008-01-01

Background The specific recognition of genomic cis-regulatory elements by transcription factors (TFs) plays an essential role in the regulation of coordinated gene expression. Studying the mechanisms determining binding specificity in protein-DNA interactions is thus an important goal. Most current approaches for modeling TF specific recognition rely on the knowledge of large sets of cognate target sites and consider only the information contained in their primary sequence. Results Here we describe a structure-based methodology for predicting sequence motifs starting from the coordinates of a TF-DNA complex. Our algorithm combines information regarding the direct and indirect readout of DNA into an atomistic statistical model, which is used to estimate the interaction potential. We first measure the ability of our method to correctly estimate the binding specificities of eight prokaryotic and eukaryotic TFs that belong to different structural superfamilies. Secondly, the method is applied to two homology models, finding that sampling of interface side-chain rotamers remarkably improves the results. Thirdly, the algorithm is compared with a reference structural method based on contact counts, obtaining comparable predictions for the experimental complexes and more accurate sequence motifs for the homology models. Conclusion Our results demonstrate that atomic-detail structural information can be feasibly used to predict TF binding sites. The computational method presented here is universal and might be applied to other systems involving protein-DNA recognition. PMID:18922190
A DNA-binding protein from Candida albicans that binds to the RPG box of Saccharomyces cerevisiae and the telomeric repeat sequence of C. albicans.

PubMed

Ishii, N; Yamamoto, M; Lahm, H W; Iizumi, S; Yoshihara, F; Nakayama, H; Arisawa, M; Aoki, Y

1997-02-01

Electromobility shift assays with a DNA probe containing the Saccharomyces cerevisiae ENO1 RPG box identified a specific DNA-binding protein in total protein extracts of Candida albicans. The protein, named Rbf1p (RPG-box-binding protein 1), bound to other S. cerevisiae RPG boxes, although the nucleotide recognition profile was not completely the same as that of S. cerevisiae Rap 1p (repressor-activator protein 1), an RPG-box-binding protein. The repetitive sequence of the C. albicans chromosomal telomere also competed with RPG-box binding to Rbf1p. For further analysis, we purified Rbf1p 57,600-fold from C. albicans total protein extracts, raised mAbs against the purified protein and immunologically cloned the gene, whose ORF specified a protein of 527 aa. The bacterially expressed protein showed RPG-box-binding activity with the same profile as that of the purified one. The Rbf1p, containing two glutamine-rich regions that are found in many transcription factors, showed transcriptional activation capability in S. cerevisiae and was predominantly observed in nuclei. These results suggest that Rbf1p is a transcription factor with telomere-binding activity in C. albicans.
ETS target genes: Identification of Egr1 as a target by RNA differential display and whole genome PCR techniques

PubMed Central

Robinson, Lois; Panayiotakis, Alexandra; Papas, Takis S.; Kola, Ismail; Seth, Arun

1997-01-01

ETS transcription factors play important roles in hematopoiesis, angiogenesis, and organogenesis during murine development. The ETS genes also have a role in neoplasia, for example in Ewing’s sarcomas and retrovirally induced cancers. The ETS genes encode transcription factors that bind to specific DNA sequences and activate transcription of various cellular and viral genes. To isolate novel ETS target genes, we used two approaches. In the first approach, we isolated genes by the RNA differential display technique. Previously, we have shown that the overexpression of ETS1 and ETS2 genes effects transformation of NIH 3T3 cells and specific transformants produce high levels of the ETS proteins. To isolate ETS1 and ETS2 responsive genes in these transformed cells, we prepared RNA from ETS1, ETS2 transformants, and normal NIH 3T3 cell lines and converted it into cDNA. This cDNA was amplified by PCR and displayed on sequencing gels. The differentially displayed bands were subcloned into plasmid vectors. By Northern blot analysis, several clones showed differential patterns of mRNA expression in the NIH 3T3-, ETS1-, and ETS2-expressing cell lines. Sixteen clones were analyzed by DNA sequence analysis, and 13 of them appeared to be unique because their DNA sequences did not match with any of the known genes present in the gene bank. Three known genes were found to be identical to the CArG box binding factor, phospholipase A2-activating protein, and early growth response 1 (Egr1) genes. In the second approach, to isolate ETS target promoters directly, we performed ETS1 binding with MboI-cleaved genomic DNA in the presence of a specific mAb followed by whole genome PCR. The immune complex-bound ETS binding sites containing DNA fragments were amplified and subcloned into pBluescript and subjected to DNA sequence and computer analysis. We found that, of a large number of clones isolated, 43 represented unique sequences not previously identified. Three clones turned out to contain regulatory sequences derived from human serglycin, preproapolipoprotein C II, and Egr1 genes. The ETS binding sites derived from these three regulatory sequences showed specific binding with recombinant ETS proteins. Of interest, Egr1 was identified by both of these techniques, suggesting strongly that it is indeed an ETS target gene. PMID:9207063
Human T-cell leukemia virus type 1 Tax requires direct access to DNA for recruitment of CREB binding protein to the viral promoter.

PubMed

Lenzmeier, B A; Giebler, H A; Nyborg, J K

1998-02-01

Efficient human T-cell leukemia virus type 1 (HTLV-1) replication and viral gene expression are dependent upon the virally encoded oncoprotein Tax. To activate HTLV-1 transcription, Tax interacts with the cellular DNA binding protein cyclic AMP-responsive element binding protein (CREB) and recruits the coactivator CREB binding protein (CBP), forming a nucleoprotein complex on the three viral cyclic AMP-responsive elements (CREs) in the HTLV-1 promoter. Short stretches of dG-dC-rich (GC-rich) DNA, immediately flanking each of the viral CREs, are essential for Tax recruitment of CBP in vitro and Tax transactivation in vivo. Although the importance of the viral CRE-flanking sequences is well established, several studies have failed to identify an interaction between Tax and the DNA. The mechanistic role of the viral CRE-flanking sequences has therefore remained enigmatic. In this study, we used high resolution methidiumpropyl-EDTA iron(II) footprinting to show that Tax extended the CREB footprint into the GC-rich DNA flanking sequences of the viral CRE. The Tax-CREB footprint was enhanced but not extended by the KIX domain of CBP, suggesting that the coactivator increased the stability of the nucleoprotein complex. Conversely, the footprint pattern of CREB on a cellular CRE lacking GC-rich flanking sequences did not change in the presence of Tax or Tax plus KIX. The minor-groove DNA binding drug chromomycin A3 bound to the GC-rich flanking sequences and inhibited the association of Tax and the Tax-CBP complex without affecting CREB binding. Tax specifically cross-linked to the viral CRE in the 5'-flanking sequence, and this cross-link was blocked by chromomycin A3. Together, these data support a model where Tax interacts directly with both CREB and the minor-groove viral CRE-flanking sequences to form a high-affinity binding site for the recruitment of CBP to the HTLV-1 promoter.
Role of protein structure and the role of individual fingers in zinc finger protein-DNA recognition: a molecular dynamics simulation study and free energy calculations

NASA Astrophysics Data System (ADS)

Hamed, Mazen Y.

2018-05-01

Molecular dynamics and MM_GBSA energy calculations on various zinc finger proteins containing three and four fingers bound to their target DNA gave insights into the role of each finger in the DNA binding process as part of the protein structure. The wild type Zif 268 (PDB code: 1AAY) gave a ΔG value of - 76.1 (14) kcal/mol. Zinc fingers ZF1, ZF2 and ZF3 were mutated in one experiment and in another experiment one finger was cut and the rest of the protein was studied for binding. The ΔΔG values for the Zinc Finger protein with both ZF1 and ZF2 mutated was + 80 kcal/mol, while mutating only ZF1 the ΔΔG value was + 52 kcal/mol (relative to the wild type). Cutting ZF3 and studying the protein consisting only of ZF1 linked to ZF2 gave a ΔΔG value of + 68 kcal/mol. Upon cutting ZF1, the resulting ZF2 linked to ZF3 protein gave a ΔΔG value of + 41 kcal/mol. The above results shed light on the importance of each finger in the binding process, especially the role of ZF1 as the anchoring finger followed in importance by ZF2 and ZF3. The energy difference between the binding of the wild type protein Zif268 (1AAY) and that for individual finger binding to DNA according to the formula: ΔΔGlinkers, otherstructuralfactors = ΔGzif268 - (ΔGF1+F2+F3) gave a value = - 44.5 kcal/mol. This stabilization can be attributed to the contribution of linkers and other structural factors in the intact protein in the DNA binding process. DNA binding energies of variant proteins of the wild type Zif268 which differ in their ZF1 amino acid sequence gave evidence of a good relationship between binding energy and recognition and specificity, this finding confirms the reported vital role of ZF1 in the ZF protein scanning and anchoring to the target DNA sequence. The role of hydrogen bonds in both specific and nonspecific amino acid-DNA contacts is discussed in relation to mutations. The binding energies of variant Zinc Finger proteins confirmed the role of ZF1 in the recognition, specificity and anchoring of the zinc finger protein to DNA.
Role of protein structure and the role of individual fingers in zinc finger protein-DNA recognition: a molecular dynamics simulation study and free energy calculations.

PubMed

Hamed, Mazen Y

2018-05-03

Molecular dynamics and MM_GBSA energy calculations on various zinc finger proteins containing three and four fingers bound to their target DNA gave insights into the role of each finger in the DNA binding process as part of the protein structure. The wild type Zif 268 (PDB code: 1AAY) gave a ΔG value of - 76.1 (14) kcal/mol. Zinc fingers ZF1, ZF2 and ZF3 were mutated in one experiment and in another experiment one finger was cut and the rest of the protein was studied for binding. The ΔΔG values for the Zinc Finger protein with both ZF1 and ZF2 mutated was + 80 kcal/mol, while mutating only ZF1 the ΔΔG value was + 52 kcal/mol (relative to the wild type). Cutting ZF3 and studying the protein consisting only of ZF1 linked to ZF2 gave a ΔΔG value of + 68 kcal/mol. Upon cutting ZF1, the resulting ZF2 linked to ZF3 protein gave a ΔΔG value of + 41 kcal/mol. The above results shed light on the importance of each finger in the binding process, especially the role of ZF1 as the anchoring finger followed in importance by ZF2 and ZF3. The energy difference between the binding of the wild type protein Zif268 (1AAY) and that for individual finger binding to DNA according to the formula: ΔΔG linkers, otherstructuralfactors = ΔG zif268 - (ΔG F1+F2+F3 ) gave a value = - 44.5 kcal/mol. This stabilization can be attributed to the contribution of linkers and other structural factors in the intact protein in the DNA binding process. DNA binding energies of variant proteins of the wild type Zif268 which differ in their ZF1 amino acid sequence gave evidence of a good relationship between binding energy and recognition and specificity, this finding confirms the reported vital role of ZF1 in the ZF protein scanning and anchoring to the target DNA sequence. The role of hydrogen bonds in both specific and nonspecific amino acid-DNA contacts is discussed in relation to mutations. The binding energies of variant Zinc Finger proteins confirmed the role of ZF1 in the recognition, specificity and anchoring of the zinc finger protein to DNA.
ΔN-P63α and TA-P63α exhibit intrinsic differences in transactivation specificities that depend on distinct features of DNA target sites

PubMed Central

Foggetti, Giorgia; Raimondi, Ivan; Campomenosi, Paola; Menichini, Paola

2014-01-01

TP63 is a member of the TP53 gene family that encodes for up to ten different TA and ΔN isoforms through alternative promoter usage and alternative splicing. Besides being a master regulator of gene expression for squamous epithelial proliferation, differentiation and maintenance, P63, through differential expression of its isoforms, plays important roles in tumorigenesis. All P63 isoforms share an immunoglobulin-like folded DNA binding domain responsible for binding to sequence-specific response elements (REs), whose overall consensus sequence is similar to that of the canonical p53 RE. Using a defined assay in yeast, where P63 isoforms and RE sequences are the only variables, and gene expression assays in human cell lines, we demonstrated that human TA- and ΔN-P63α proteins exhibited differences in transactivation specificity not observed with the corresponding P73 or P53 protein isoforms. These differences 1) were dependent on specific features of the RE sequence, 2) could be related to intrinsic differences in their oligomeric state and cooperative DNA binding, and 3) appeared to be conserved in evolution. Since genotoxic stress can change relative ratio of TA- and ΔN-P63α protein levels, the different transactivation specificity of each P63 isoform could potentially influence cellular responses to specific stresses. PMID:24926492
Neisseria conserved protein DMP19 is a DNA mimic protein that prevents DNA binding to a hypothetical nitrogen-response transcription factor

PubMed Central

Wang, Hao-Ching; Ko, Tzu-Ping; Wu, Mao-Lun; Ku, Shan-Chi; Wu, Hsing-Ju; Wang, Andrew H.-J.

2012-01-01

DNA mimic proteins occupy the DNA binding sites of DNA-binding proteins, and prevent these sites from being accessed by DNA. We show here that the Neisseria conserved hypothetical protein DMP19 acts as a DNA mimic. The crystal structure of DMP19 shows a dsDNA-like negative charge distribution on the surface, suggesting that this protein should be added to the short list of known DNA mimic proteins. The crystal structure of another related protein, NHTF (Neisseria hypothetical transcription factor), provides evidence that it is a member of the xenobiotic-response element (XRE) family of transcriptional factors. NHTF binds to a palindromic DNA sequence containing a 5′-TGTNAN11TNACA-3′ recognition box that controls the expression of an NHTF-related operon in which the conserved nitrogen-response protein [i.e. (Protein-PII) uridylyltransferase] is encoded. The complementary surface charges between DMP19 and NHTF suggest specific charge–charge interaction. In a DNA-binding assay, we found that DMP19 can prevent NHTF from binding to its DNA-binding sites. Finally, we used an in situ gene regulation assay to provide evidence that NHTF is a repressor of its down-stream genes and that DMP19 can neutralize this effect. We therefore conclude that the interaction of DMP19 and NHTF provides a novel gene regulation mechanism in Neisseria spps. PMID:22373915
Odorant-binding proteins from a primitive termite.

PubMed

Ishida, Yuko; Chiang, Vicky P; Haverty, Michael I; Leal, Walter S

2002-09-01

Hitherto, odorant-binding proteins (OBPs) have been identified from insects belonging to more highly evolved insect orders (Lepidoptera, Coleoptera, Diptera, Hymenoptera, and Hemiptera), whereas only chemosensory proteins have been identified from more primitive species, such as orthopteran and phasmid species. Here, we report for the first time the isolation and cloning of odorant-binding proteins from a primitive termite species, the dampwood termite. Zootermopsis nevadensis nevadensis (Isoptera: Termopsidae). A major antennae-specific protein was detected by native PAGE along with four other minor proteins, which were also absent in the extract from control tissues (hindlegs). Multiple cDNA cloning led to the full characterization of the major antennae-specific protein (ZnevOBP1) and to the identification of two other antennae-specific cDNAs, encoding putative odorant-binding proteins (ZnevOBP2 and ZnevOBP3). N-terminal amino acid sequencing of the minor antennal bands and cDNA cloning showed that olfaction in Z. n. nevadensis may involve multiple odorant-binding proteins. Database searches suggest that the OBPs from this primitive termite are homologues of the pheromone-binding proteins from scarab beetles and antennal-binding proteins from moths.
Detection and quantitation of single nucleotide polymorphisms, DNA sequence variations, DNA mutations, DNA damage and DNA mismatches

DOEpatents

McCutchen-Maloney, Sandra L.

2002-01-01

DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dips sticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such at TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera which gives a decrease in signal.
Role of indirect readout mechanism in TATA box binding protein-DNA interaction.

PubMed

Mondal, Manas; Choudhury, Devapriya; Chakrabarti, Jaydeb; Bhattacharyya, Dhananjay

2015-03-01

Gene expression generally initiates from recognition of TATA-box binding protein (TBP) to the minor groove of DNA of TATA box sequence where the DNA structure is significantly different from B-DNA. We have carried out molecular dynamics simulation studies of TBP-DNA system to understand how the DNA structure alters for efficient binding. We observed rigid nature of the protein while the DNA of TATA box sequence has an inherent flexibility in terms of bending and minor groove widening. The bending analysis of the free DNA and the TBP bound DNA systems indicate presence of some similar structures. Principal coordinate ordination analysis also indicates some structural features of the protein bound and free DNA are similar. Thus we suggest that the DNA of TATA box sequence regularly oscillates between several alternate structures and the one suitable for TBP binding is induced further by the protein for proper complex formation.

Identification and characterization of TF1(phox), a DNA-binding protein that increases expression of gp91(phox) in PLB985 myeloid leukemia cells.

PubMed

Eklund, E A; Kakar, R

1997-04-04

The CYBB gene encodes gp91(phox), the heavy chain of the phagocyte-specific NADPH oxidase. CYBB is transcriptionally inactive until the promyelocyte stage of myelopoiesis, and in mature phagocytes, expression of gp91(phox) is further increased by interferon-gamma (IFN-gamma) and other inflammatory mediators. The CYBB promoter region contains several lineage-specific cis-elements involved in the IFN-gamma response. We screened a leukocyte cDNA expression library for proteins able to bind to one of these cis-elements (-214 to -262 base pairs) and identified TF1(phox), a protein with sequence-specific binding to the CYBB promoter. Electrophoretic mobility shift assay with nuclear proteins from a variety of cell lines demonstrated binding of a protein to the CYBB promoter that was cross-immunoreactive with TF1(phox). DNA binding of this protein was increased by IFN-gamma treatment in the myeloid cell line PLB985, but not in the non-myeloid cell line HeLa. Overexpression of recombinant TF1(phox) in PLB985 cells increased endogenous gp91(phox) message abundance, but did not lead to cellular differentiation. Overexpression of TF1(phox) in myeloid leukemia cell lines increased reporter gene expression from artificial promoter constructs containing CYBB promoter sequence. These data suggested that TF1(phox) increased expression of gp91(phox).
GBshape: a genome browser database for DNA shape annotations

PubMed Central

Chiu, Tsu-Pei; Yang, Lin; Zhou, Tianyin; Main, Bradley J.; Parker, Stephen C.J.; Nuzhdin, Sergey V.; Tullius, Thomas D.; Rohs, Remo

2015-01-01

Many regulatory mechanisms require a high degree of specificity in protein-DNA binding. Nucleotide sequence does not provide an answer to the question of why a protein binds only to a small subset of the many putative binding sites in the genome that share the same core motif. Whereas higher-order effects, such as chromatin accessibility, cooperativity and cofactors, have been described, DNA shape recently gained attention as another feature that fine-tunes the DNA binding specificities of some transcription factor families. Our Genome Browser for DNA shape annotations (GBshape; freely available at http://rohslab.cmb.usc.edu/GBshape/) provides minor groove width, propeller twist, roll, helix twist and hydroxyl radical cleavage predictions for the entire genomes of 94 organisms. Additional genomes can easily be added using the GBshape framework. GBshape can be used to visualize DNA shape annotations qualitatively in a genome browser track format, and to download quantitative values of DNA shape features as a function of genomic position at nucleotide resolution. As biological applications, we illustrate the periodicity of DNA shape features that are present in nucleosome-occupied sequences from human, fly and worm, and we demonstrate structural similarities between transcription start sites in the genomes of four Drosophila species. PMID:25326329
Chimeric proteins for detection and quantitation of DNA mutations, DNA sequence variations, DNA damage and DNA mismatches

DOEpatents

McCutchen-Maloney, Sandra L.

2002-01-01

Chimeric proteins having both DNA mutation binding activity and nuclease activity are synthesized by recombinant technology. The proteins are of the general formula A-L-B and B-L-A where A is a peptide having DNA mutation binding activity, L is a linker and B is a peptide having nuclease activity. The chimeric proteins are useful for detection and identification of DNA sequence variations including DNA mutations (including DNA damage and mismatches) by binding to the DNA mutation and cutting the DNA once the DNA mutation is detected.
Molecular determinants of origin discrimination by Orc1 initiators in archaea.

PubMed

Dueber, Erin C; Costa, Alessandro; Corn, Jacob E; Bell, Stephen D; Berger, James M

2011-05-01

Unlike bacteria, many eukaryotes initiate DNA replication from genomic sites that lack apparent sequence conservation. These loci are identified and bound by the origin recognition complex (ORC), and subsequently activated by a cascade of events that includes recruitment of an additional factor, Cdc6. Archaeal organisms generally possess one or more Orc1/Cdc6 homologs, belonging to the Initiator clade of ATPases associated with various cellular activities (AAA(+)) superfamily; however, these proteins recognize specific sequences within replication origins. Atomic resolution studies have shown that archaeal Orc1 proteins contact double-stranded DNA through an N-terminal AAA(+) domain and a C-terminal winged-helix domain (WHD), but use remarkably few base-specific contacts. To investigate the biochemical effects of these associations, we mutated the DNA-interacting elements of the Orc1-1 and Orc1-3 paralogs from the archaeon Sulfolobus solfataricus, and tested their effect on origin binding and deformation. We find that the AAA(+) domain has an unpredicted role in controlling the sequence selectivity of DNA binding, despite an absence of base-specific contacts to this region. Our results show that both the WHD and ATPase region influence origin recognition by Orc1/Cdc6, and suggest that not only DNA sequence, but also local DNA structure help define archaeal initiator binding sites. © The Author(s) 2011. Published by Oxford University Press.
Comprehensive Interrogation of Natural TALE DNA Binding Modules and Transcriptional Repressor Domains

PubMed Central

Cong, Le; Zhou, Ruhong; Kuo, Yu-chi; Cunniff, Margaret; Zhang, Feng

2012-01-01

Transcription activator-like effectors (TALE) are sequence-specific DNA binding proteins that harbor modular, repetitive DNA binding domains. TALEs have enabled the creation of customizable designer transcriptional factors and sequence-specific nucleases for genome engineering. Here we report two improvements of the TALE toolbox for achieving efficient activation and repression of endogenous gene expression in mammalian cells. We show that the naturally occurring repeat variable diresidue (RVD) Asn-His (NH) has high biological activity and specificity for guanine, a highly prevalent base in mammalian genomes. We also report an effective TALE transcriptional repressor architecture for targeted inhibition of transcription in mammalian cells. These findings will improve the precision and effectiveness of genome engineering that can be achieved using TALEs. PMID:22828628
Specific DNA binding activity of T antigen subclasses varies among different SV40-transformed cell lines.

PubMed

Burger, C; Fanning, E

1983-04-15

Large tumor antigen (T antigen) occurs in at least three different oligomeric subclasses in cells infected or transformed by simian virus 40 (SV40): 5-7 S, 14-16 S, and 23-25 S. The 23-25 S form is complexed with a host phosphoprotein (p53). The DNA binding properties of these three subclasses of T antigen from nine different cell lines and free p53 protein were compared using an immunoprecipitation assay. All three subclasses of T antigen bound specifically to SV40 DNA sequences near the origin of replication. However, the DNA binding activity varied between different cell lines over a 40- to 50-fold range. The 23-25 S and 14-16 S forms from most of the cell lines tested bound much less SV40 origin DNA than 5-7 S T antigen. The free p53 phosphoprotein did not bind specifically to any SV40 DNA sequences.
Two new insulator proteins, Pita and ZIPIC, target CP190 to chromatin

PubMed Central

Maksimenko, Oksana; Bartkuhn, Marek; Stakhov, Viacheslav; Herold, Martin; Zolotarev, Nickolay; Jox, Theresa; Buxa, Melanie K.; Kirsch, Ramona; Bonchuk, Artem; Fedotova, Anna; Kyrchanova, Olga

2015-01-01

Insulators are multiprotein–DNA complexes that regulate the nuclear architecture. The Drosophila CP190 protein is a cofactor for the DNA-binding insulator proteins Su(Hw), CTCF, and BEAF-32. The fact that CP190 has been found at genomic sites devoid of either of the known insulator factors has until now been unexplained. We have identified two DNA-binding zinc-finger proteins, Pita, and a new factor named ZIPIC, that interact with CP190 in vivo and in vitro at specific interaction domains. Genomic binding sites for these proteins are clustered with CP190 as well as with CTCF and BEAF-32. Model binding sites for Pita or ZIPIC demonstrate a partial enhancer-blocking activity and protect gene expression from PRE-mediated silencing. The function of the CTCF-bound MCP insulator sequence requires binding of Pita. These results identify two new insulator proteins and emphasize the unifying function of CP190, which can be recruited by many DNA-binding insulator proteins. PMID:25342723
Solution structure of CEH-37 homeodomain of the nematode Caenorhabditis elegans

DOE Office of Scientific and Technical Information (OSTI.GOV)

Moon, Sunjin; Lee, Yong Woo; Kim, Woo Taek

Highlights: •We have determined solution structures of CEH-37 homedomain. •CEH-37 HD has a compact α-helical structure with HTH DNA binding motif. •Solution structure of CEH-37 HD shares its molecular topology with that of the homeodomain proteins. •Residues in the N-terminal region and HTH motif are important in binding to Caenorhabditis elegans telomeric DNA. •CEH-37 could play an important role in telomere function via DNA binding. -- Abstract: The nematode Caenorhabditis elegans protein CEH-37 belongs to the paired OTD/OTX family of homeobox-containing homeodomain proteins. CEH-37 shares sequence similarity with homeodomain proteins, although it specifically binds to double-stranded C. elegans telomeric DNA,more » which is unusual to homeodomain proteins. Here, we report the solution structure of CEH-37 homeodomain and molecular interaction with double-stranded C. elegans telomeric DNA using nuclear magnetic resonance (NMR) spectroscopy. NMR structure shows that CEH-37 homeodomain is composed of a flexible N-terminal region and three α-helices with a helix-turn-helix (HTH) DNA binding motif. Data from size-exclusion chromatography and fluorescence spectroscopy reveal that CEH-37 homeodomain interacts strongly with double-stranded C. elegans telomeric DNA. NMR titration experiments identified residues responsible for specific binding to nematode double-stranded telomeric DNA. These results suggest that C. elegans homeodomain protein, CEH-37 could play an important role in telomere function via DNA binding.« less
MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.

PubMed

Ozaki, Haruka; Iwasaki, Wataru

2016-08-01

As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.
Simultaneously measuring multiple protein interactions and their correlations in a cell by Protein-interactome Footprinting

PubMed Central

Luo, Si-Wei; Liang, Zhi; Wu, Jia-Rui

2017-01-01

Quantitatively detecting correlations of multiple protein-protein interactions (PPIs) in vivo is a big challenge. Here we introduce a novel method, termed Protein-interactome Footprinting (PiF), to simultaneously measure multiple PPIs in one cell. The principle of PiF is that each target physical PPI in the interactome is simultaneously transcoded into a specific DNA sequence based on dimerization of the target proteins fused with DNA-binding domains. The interaction intensity of each target protein is quantified as the copy number of the specific DNA sequences bound by each fusion protein dimers. Using PiF, we quantitatively reveal dynamic patterns of PPIs and their correlation network in E. coli two-component systems. PMID:28338015
In vitro selection of DNA elements highly responsive to the human T-cell lymphotropic virus type I transcriptional activator, Tax.

PubMed

Paca-Uccaralertkun, S; Zhao, L J; Adya, N; Cross, J V; Cullen, B R; Boros, I M; Giam, C Z

1994-01-01

The human T-cell lymphotropic virus type I (HTLV-I) transactivator, Tax, the ubiquitous transcriptional factor cyclic AMP (cAMP) response element-binding protein (CREB protein), and the 21-bp repeats in the HTLV-I transcriptional enhancer form a ternary nucleoprotein complex (L. J. Zhao and C. Z. Giam, Proc. Natl. Acad. Sci. USA 89:7070-7074, 1992). Using an antibody directed against the COOH-terminal region of Tax along with purified Tax and CREB proteins, we selected DNA elements bound specifically by the Tax-CREB complex in vitro. Two distinct but related groups of sequences containing the cAMP response element (CRE) flanked by long runs of G and C residues in the 5' and 3' regions, respectively, were preferentially recognized by Tax-CREB. In contrast, CREB alone binds only to CRE motifs (GNTGACG[T/C]) without neighboring G- or C-rich sequences. The Tax-CREB-selected sequences bear a striking resemblance to the 5' or 3' two-thirds of the HTLV-I 21-bp repeats and are highly inducible by Tax. Gel electrophoretic mobility shift assays, DNA transfection, and DNase I footprinting analyses indicated that the G- and C-rich sequences flanking the CRE motif are crucial for Tax-CREB-DNA ternary complex assembly and Tax transactivation but are not in direct contact with the Tax-CREB complex. These data show that Tax recruits CREB to form a multiprotein complex that specifically recognizes the viral 21-bp repeats. The expanded DNA binding specificity of Tax-CREB and the obligatory role the ternary Tax-CREB-DNA complex plays in transactivation reveal a novel mechanism for regulating the transcriptional activity of leucine zipper proteins like CREB.
COUP-TF (chicken ovalbumin upstream promoter transcription factor)-interacting protein 1 (CTIP1) is a sequence-specific DNA binding protein.

PubMed Central

Avram, Dorina; Fields, Andrew; Senawong, Thanaset; Topark-Ngarm, Acharawan; Leid, Mark

2002-01-01

Chicken ovalbumin upstream promoter transcription factor (COUP-TF)-interacting proteins 1 and 2 [CTIP1/Evi9/B cell leukaemia (Bcl) l1a and CTIP2/Bcl11b respectively] are highly related C(2)H(2) zinc finger proteins that are abundantly expressed in brain and the immune system, and are associated with immune system malignancies. A selection procedure was employed to isolate high-affinity DNA binding sites for CTIP1. The core binding site on DNA identified in these studies, 5'-GGCCGG-3' (upper strand), is highly related to the canonical GC box and was bound by a CTIP1 oligomeric complex(es) in vitro. Furthermore, both CTIP1 and CTIP2 repressed transcription of a reporter gene harbouring a multimerized CTIP binding site, and this repression was neither reversed by trichostatin A (an inhibitor of known class I and II histone deacetylases) nor stimulated by co-transfection of a COUP-TF family member. These results demonstrate that CTIP1 is a sequence-specific DNA binding protein and a bona fide transcriptional repressor that is capable of functioning independently of COUP-TF family members. These findings may be relevant to the physiological and/or pathological action(s) of CTIPs in cells that do not express COUP-TF family members, such as cells of the haematopoietic and immune systems. PMID:12196208
Position specific variation in the rate of evolution in transcription factor binding sites

PubMed Central

Moses, Alan M; Chiang, Derek Y; Kellis, Manolis; Lander, Eric S; Eisen, Michael B

2003-01-01

Background The binding sites of sequence specific transcription factors are an important and relatively well-understood class of functional non-coding DNAs. Although a wide variety of experimental and computational methods have been developed to characterize transcription factor binding sites, they remain difficult to identify. Comparison of non-coding DNA from related species has shown considerable promise in identifying these functional non-coding sequences, even though relatively little is known about their evolution. Results Here we analyse the genome sequences of the budding yeasts Saccharomyces cerevisiae, S. bayanus, S. paradoxus and S. mikatae to study the evolution of transcription factor binding sites. As expected, we find that both experimentally characterized and computationally predicted binding sites evolve slower than surrounding sequence, consistent with the hypothesis that they are under purifying selection. We also observe position-specific variation in the rate of evolution within binding sites. We find that the position-specific rate of evolution is positively correlated with degeneracy among binding sites within S. cerevisiae. We test theoretical predictions for the rate of evolution at positions where the base frequencies deviate from background due to purifying selection and find reasonable agreement with the observed rates of evolution. Finally, we show how the evolutionary characteristics of real binding motifs can be used to distinguish them from artefacts of computational motif finding algorithms. Conclusion As has been observed for protein sequences, the rate of evolution in transcription factor binding sites varies with position, suggesting that some regions are under stronger functional constraint than others. This variation likely reflects the varying importance of different positions in the formation of the protein-DNA complex. The characterization of the pattern of evolution in known binding sites will likely contribute to the effective use of comparative sequence data in the identification of transcription factor binding sites and is an important step toward understanding the evolution of functional non-coding DNA. PMID:12946282
Dissociation free-energy profiles of specific and nonspecific DNA-protein complexes.

PubMed

Yonetani, Yoshiteru; Kono, Hidetoshi

2013-06-27

DNA-binding proteins recognize DNA sequences with at least two different binding modes: specific and nonspecific. Experimental structures of such complexes provide us a static view of the bindings. However, it is difficult to reveal further mechanisms of their target-site search and recognition only from static information because the transition process between the bound and unbound states is not clarified by static information. What is the difference between specific and nonspecific bindings? Here we performed adaptive biasing force molecular dynamics simulations with the specific and nonspecific structures of DNA-Lac repressor complexes to investigate the dissociation process. The resultant free-energy profiles showed that the specific complex has a sharp, deep well consistent with tight binding, whereas the nonspecific complex has a broad, shallow well consistent with loose binding. The difference in the well depth, ~5 kcal/mol, was in fair agreement with the experimentally obtained value and was found to mainly come from the protein conformational difference, particularly in the C-terminal tail. Also, the free-energy profiles were found to be correlated with changes in the number of protein-DNA contacts and that of surface water molecules. The derived protein spatial distributions around the DNA indicate that any large dissociation occurs rarely, regardless of the specific and nonspecific sites. Comparison of the free-energy barrier for sliding [~8.7 kcal/mol; Furini J. Phys. Chem. B 2010, 114, 2238] and that for dissociation (at least ~16 kcal/mol) calculated in this study suggests that sliding is much preferred to dissociation.
Conflict RNA modification, host-parasite co-evolution, and the origins of DNA and DNA-binding proteins1.

PubMed

McLaughlin, Paul J; Keegan, Liam P

2014-08-01

Nearly 150 different enzymatically modified forms of the four canonical residues in RNA have been identified. For instance, enzymes of the ADAR (adenosine deaminase acting on RNA) family convert adenosine residues into inosine in cellular dsRNAs. Recent findings show that DNA endonuclease V enzymes have undergone an evolutionary transition from cleaving 3' to deoxyinosine in DNA and ssDNA to cleaving 3' to inosine in dsRNA and ssRNA in humans. Recent work on dsRNA-binding domains of ADARs and other proteins also shows that a degree of sequence specificity is achieved by direct readout in the minor groove. However, the level of sequence specificity observed is much less than that of DNA major groove-binding helix-turn-helix proteins. We suggest that the evolution of DNA-binding proteins following the RNA to DNA genome transition represents the major advantage that DNA genomes have over RNA genomes. We propose that a hypothetical RNA modification, a RRAR (ribose reductase acting on genomic dsRNA) produced the first stretches of DNA in RNA genomes. We discuss why this is the most satisfactory explanation for the origin of DNA. The evolution of this RNA modification and later steps to DNA genomes are likely to have been driven by cellular genome co-evolution with viruses and intragenomic parasites. RNA modifications continue to be involved in host-virus conflicts; in vertebrates, edited cellular dsRNAs with inosine-uracil base pairs appear to be recognized as self RNA and to suppress activation of innate immune sensors that detect viral dsRNA.
Programmable RNA recognition and cleavage by CRISPR/Cas9.

PubMed

O'Connell, Mitchell R; Oakes, Benjamin L; Sternberg, Samuel H; East-Seletsky, Alexandra; Kaplan, Matias; Doudna, Jennifer A

2014-12-11

The CRISPR-associated protein Cas9 is an RNA-guided DNA endonuclease that uses RNA-DNA complementarity to identify target sites for sequence-specific double-stranded DNA (dsDNA) cleavage. In its native context, Cas9 acts on DNA substrates exclusively because both binding and catalysis require recognition of a short DNA sequence, known as the protospacer adjacent motif (PAM), next to and on the strand opposite the twenty-nucleotide target site in dsDNA. Cas9 has proven to be a versatile tool for genome engineering and gene regulation in a large range of prokaryotic and eukaryotic cell types, and in whole organisms, but it has been thought to be incapable of targeting RNA. Here we show that Cas9 binds with high affinity to single-stranded RNA (ssRNA) targets matching the Cas9-associated guide RNA sequence when the PAM is presented in trans as a separate DNA oligonucleotide. Furthermore, PAM-presenting oligonucleotides (PAMmers) stimulate site-specific endonucleolytic cleavage of ssRNA targets, similar to PAM-mediated stimulation of Cas9-catalysed DNA cleavage. Using specially designed PAMmers, Cas9 can be specifically directed to bind or cut RNA targets while avoiding corresponding DNA sequences, and we demonstrate that this strategy enables the isolation of a specific endogenous messenger RNA from cells. These results reveal a fundamental connection between PAM binding and substrate selection by Cas9, and highlight the utility of Cas9 for programmable transcript recognition without the need for tags.
Programmable RNA recognition and cleavage by CRISPR/Cas9

PubMed Central

O’Connell, Mitchell R.; Oakes, Benjamin L.; Sternberg, Samuel H.; East-Seletsky, Alexandra; Kaplan, Matias; Doudna, Jennifer A.

2014-01-01

The CRISPR-associated protein Cas9 is an RNA-guided DNA endonuclease that uses RNA:DNA complementarity to identify target sites for sequence-specific doublestranded DNA (dsDNA) cleavage1-5. In its native context, Cas9 acts on DNA substrates exclusively because both binding and catalysis require recognition of a short DNA sequence, the protospacer adjacent motif (PAM), next to and on the strand opposite the 20-nucleotide target site in dsDNA4-7. Cas9 has proven to be a versatile tool for genome engineering and gene regulation in many cell types and organisms8, but it has been thought to be incapable of targeting RNA5. Here we show that Cas9 binds with high affinity to single-stranded RNA (ssRNA) targets matching the Cas9-associated guide RNA sequence when the PAM is presented in trans as a separate DNA oligonucleotide. Furthermore, PAM-presenting oligonucleotides (PAMmers) stimulate site-specific endonucleolytic cleavage of ssRNA targets, similar to PAM-mediated stimulation of Cas9-catalyzed DNA cleavage7. Using specially designed PAMmers, Cas9 can be specifically directed to bind or cut RNA targets while avoiding corresponding DNA sequences, and we demonstrate that this strategy enables the isolation of a specific endogenous mRNA from cells. These results reveal a fundamental connection between PAM binding and substrate selection by Cas9, and highlight the utility of Cas9 for programmable and tagless transcript recognition. PMID:25274302
p53 Specifically Binds Triplex DNA In Vitro and in Cells

PubMed Central

Brázdová, Marie; Tichý, Vlastimil; Helma, Robert; Bažantová, Pavla; Polášková, Alena; Krejčí, Aneta; Petr, Marek; Navrátilová, Lucie; Tichá, Olga; Nejedlý, Karel; Bennink, Martin L.; Subramaniam, Vinod; Bábková, Zuzana; Martínek, Tomáš; Lexa, Matej; Adámik, Matej

2016-01-01

Triplex DNA is implicated in a wide range of biological activities, including regulation of gene expression and genomic instability leading to cancer. The tumor suppressor p53 is a central regulator of cell fate in response to different type of insults. Sequence and structure specific modes of DNA recognition are core attributes of the p53 protein. The focus of this work is the structure-specific binding of p53 to DNA containing triplex-forming sequences in vitro and in cells and the effect on p53-driven transcription. This is the first DNA binding study of full-length p53 and its deletion variants to both intermolecular and intramolecular T.A.T triplexes. We demonstrate that the interaction of p53 with intermolecular T.A.T triplex is comparable to the recognition of CTG-hairpin non-B DNA structure. Using deletion mutants we determined the C-terminal DNA binding domain of p53 to be crucial for triplex recognition. Furthermore, strong p53 recognition of intramolecular T.A.T triplexes (H-DNA), stabilized by negative superhelicity in plasmid DNA, was detected by competition and immunoprecipitation experiments, and visualized by AFM. Moreover, chromatin immunoprecipitation revealed p53 binding T.A.T forming sequence in vivo. Enhanced reporter transactivation by p53 on insertion of triplex forming sequence into plasmid with p53 consensus sequence was observed by luciferase reporter assays. In-silico scan of human regulatory regions for the simultaneous presence of both consensus sequence and T.A.T motifs identified a set of candidate p53 target genes and p53-dependent activation of several of them (ABCG5, ENOX1, INSR, MCC, NFAT5) was confirmed by RT-qPCR. Our results show that T.A.T triplex comprises a new class of p53 binding sites targeted by p53 in a DNA structure-dependent mode in vitro and in cells. The contribution of p53 DNA structure-dependent binding to the regulation of transcription is discussed. PMID:27907175
Comparison between TRF2 and TRF1 of their telomeric DNA-bound structures and DNA-binding activities

PubMed Central

Hanaoka, Shingo; Nagadoi, Aritaka; Nishimura, Yoshifumi

2005-01-01

Mammalian telomeres consist of long tandem arrays of double-stranded telomeric TTAGGG repeats packaged by the telomeric DNA-binding proteins TRF1 and TRF2. Both contain a similar C-terminal Myb domain that mediates sequence-specific binding to telomeric DNA. In a DNA complex of TRF1, only the single Myb-like domain consisting of three helices can bind specifically to double-stranded telomeric DNA. TRF2 also binds to double-stranded telomeric DNA. Although the DNA binding mode of TRF2 is likely identical to that of TRF1, TRF2 plays an important role in the t-loop formation that protects the ends of telomeres. Here, to clarify the details of the double-stranded telomeric DNA-binding modes of TRF1 and TRF2, we determined the solution structure of the DNA-binding domain of human TRF2 bound to telomeric DNA; it consists of three helices, and like TRF1, the third helix recognizes TAGGG sequence in the major groove of DNA with the N-terminal arm locating in the minor groove. However, small but significant differences are observed; in contrast to the minor groove recognition of TRF1, in which an arginine residue recognizes the TT sequence, a lysine residue of TRF2 interacts with the TT part. We examined the telomeric DNA-binding activities of both DNA-binding domains of TRF1 and TRF2 and found that TRF1 binds more strongly than TRF2. Based on the structural differences of both domains, we created several mutants of the DNA-binding domain of TRF2 with stronger binding activities compared to the wild-type TRF2. PMID:15608118
TIA-1 RRM23 binding and recognition of target oligonucleotides

PubMed Central

Waris, Saboora; García-Mauriño, Sofía M.; Sivakumaran, Andrew; Beckham, Simone A.; Loughlin, Fionna E.; Gorospe, Myriam; Díaz-Moreno, Irene; Wilce, Matthew C.J.

2017-01-01

Abstract TIA-1 (T-cell restricted intracellular antigen-1) is an RNA-binding protein involved in splicing and translational repression. It mainly interacts with RNA via its second and third RNA recognition motifs (RRMs), with specificity for U-rich sequences directed by RRM2. It has recently been shown that RRM3 also contributes to binding, with preferential binding for C-rich sequences. Here we designed UC-rich and CU-rich 10-nt sequences for engagement of both RRM2 and RRM3 and demonstrated that the TIA-1 RRM23 construct preferentially binds the UC-rich RNA ligand (5΄-UUUUUACUCC-3΄). Interestingly, this binding depends on the presence of Lys274 that is C-terminal to RRM3 and binding to equivalent DNA sequences occurs with similar affinity. Small-angle X-ray scattering was used to demonstrate that, upon complex formation with target RNA or DNA, TIA-1 RRM23 adopts a compact structure, showing that both RRMs engage with the target 10-nt sequences to form the complex. We also report the crystal structure of TIA-1 RRM2 in complex with DNA to 2.3 Å resolution providing the first atomic resolution structure of any TIA protein RRM in complex with oligonucleotide. Together our data support a specific mode of TIA-1 RRM23 interaction with target oligonucleotides consistent with the role of TIA-1 in binding RNA to regulate gene expression. PMID:28184449

TIA-1 RRM23 binding and recognition of target oligonucleotides.

PubMed

Waris, Saboora; García-Mauriño, Sofía M; Sivakumaran, Andrew; Beckham, Simone A; Loughlin, Fionna E; Gorospe, Myriam; Díaz-Moreno, Irene; Wilce, Matthew C J; Wilce, Jacqueline A

2017-05-05

TIA-1 (T-cell restricted intracellular antigen-1) is an RNA-binding protein involved in splicing and translational repression. It mainly interacts with RNA via its second and third RNA recognition motifs (RRMs), with specificity for U-rich sequences directed by RRM2. It has recently been shown that RRM3 also contributes to binding, with preferential binding for C-rich sequences. Here we designed UC-rich and CU-rich 10-nt sequences for engagement of both RRM2 and RRM3 and demonstrated that the TIA-1 RRM23 construct preferentially binds the UC-rich RNA ligand (5΄-UUUUUACUCC-3΄). Interestingly, this binding depends on the presence of Lys274 that is C-terminal to RRM3 and binding to equivalent DNA sequences occurs with similar affinity. Small-angle X-ray scattering was used to demonstrate that, upon complex formation with target RNA or DNA, TIA-1 RRM23 adopts a compact structure, showing that both RRMs engage with the target 10-nt sequences to form the complex. We also report the crystal structure of TIA-1 RRM2 in complex with DNA to 2.3 Å resolution providing the first atomic resolution structure of any TIA protein RRM in complex with oligonucleotide. Together our data support a specific mode of TIA-1 RRM23 interaction with target oligonucleotides consistent with the role of TIA-1 in binding RNA to regulate gene expression. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
A conserved mechanism for replication origin recognition and binding in archaea.

PubMed

Majerník, Alan I; Chong, James P J

2008-01-15

To date, methanogens are the only group within the archaea where firing DNA replication origins have not been demonstrated in vivo. In the present study we show that a previously identified cluster of ORB (origin recognition box) sequences do indeed function as an origin of replication in vivo in the archaeon Methanothermobacter thermautotrophicus. Although the consensus sequence of ORBs in M. thermautotrophicus is somewhat conserved when compared with ORB sequences in other archaea, the Cdc6-1 protein from M. thermautotrophicus (termed MthCdc6-1) displays sequence-specific binding that is selective for the MthORB sequence and does not recognize ORBs from other archaeal species. Stabilization of in vitro MthORB DNA binding by MthCdc6-1 requires additional conserved sequences 3' to those originally described for M. thermautotrophicus. By testing synthetic sequences bearing mutations in the MthORB consensus sequence, we show that Cdc6/ORB binding is critically dependent on the presence of an invariant guanine found in all archaeal ORB sequences. Mutation of a universally conserved arginine residue in the recognition helix of the winged helix domain of archaeal Cdc6-1 shows that specific origin sequence recognition is dependent on the interaction of this arginine residue with the invariant guanine. Recognition of a mutated origin sequence can be achieved by mutation of the conserved arginine residue to a lysine or glutamine residue. Thus despite a number of differences in protein and DNA sequences between species, the mechanism of origin recognition and binding appears to be conserved throughout the archaea.
Recombinant antibody mediated delivery of organelle-specific DNA pH sensors along endocytic pathways

NASA Astrophysics Data System (ADS)

Modi, Souvik; Halder, Saheli; Nizak, Clément; Krishnan, Yamuna

2013-12-01

DNA has been used to build nanomachines with potential in cellulo and in vivo applications. However their different in cellulo applications are limited by the lack of generalizable strategies to deliver them to precise intracellular locations. Here we describe a new molecular design of DNA pH sensors with response times that are nearly 20 fold faster. Further, by changing the sequence of the pH sensitive domain of the DNA sensor, we have been able to tune their pH sensitive regimes and create a family of DNA sensors spanning ranges from pH 4 to 7.6. To enable a generalizable targeting methodology, this new sensor design also incorporates a `handle' domain. We have identified, using a phage display screen, a set of three recombinant antibodies (scFv) that bind sequence specifically to the handle domain. Sequence analysis of these antibodies revealed several conserved residues that mediate specific interactions with the cognate DNA duplex. We also found that all three scFvs clustered into different branches indicating that their specificity arises from mutations in key residues. When one of these scFvs is fused to a membrane protein (furin) that traffics via the cell surface, the scFv-furin chimera binds the `handle' and ferries a family of DNA pH sensors along the furin endocytic pathway. Post endocytosis, all DNA nanodevices retain their functionality in cellulo and provide spatiotemporal pH maps of retrogradely trafficking furin inside living cells. This new molecular technology of DNA-scFv-protein chimeras can be used to site-specifically complex DNA nanostructures for bioanalytical applications.DNA has been used to build nanomachines with potential in cellulo and in vivo applications. However their different in cellulo applications are limited by the lack of generalizable strategies to deliver them to precise intracellular locations. Here we describe a new molecular design of DNA pH sensors with response times that are nearly 20 fold faster. Further, by changing the sequence of the pH sensitive domain of the DNA sensor, we have been able to tune their pH sensitive regimes and create a family of DNA sensors spanning ranges from pH 4 to 7.6. To enable a generalizable targeting methodology, this new sensor design also incorporates a `handle' domain. We have identified, using a phage display screen, a set of three recombinant antibodies (scFv) that bind sequence specifically to the handle domain. Sequence analysis of these antibodies revealed several conserved residues that mediate specific interactions with the cognate DNA duplex. We also found that all three scFvs clustered into different branches indicating that their specificity arises from mutations in key residues. When one of these scFvs is fused to a membrane protein (furin) that traffics via the cell surface, the scFv-furin chimera binds the `handle' and ferries a family of DNA pH sensors along the furin endocytic pathway. Post endocytosis, all DNA nanodevices retain their functionality in cellulo and provide spatiotemporal pH maps of retrogradely trafficking furin inside living cells. This new molecular technology of DNA-scFv-protein chimeras can be used to site-specifically complex DNA nanostructures for bioanalytical applications. Electronic supplementary information (ESI) available: Detailed description of all oligonucleotide sequences used in this study; list of figures that support claims from the main text. Mainly these show sensor sequences, phage display results, scFv purification and binding data, cell images clamped at different pH and co-localization studies with endocytic tracers. See DOI: 10.1039/c3nr03769j
Molecular dynamics studies on the DNA-binding process of ERG.

PubMed

Beuerle, Matthias G; Dufton, Neil P; Randi, Anna M; Gould, Ian R

2016-11-15

The ETS family of transcription factors regulate gene targets by binding to a core GGAA DNA-sequence. The ETS factor ERG is required for homeostasis and lineage-specific functions in endothelial cells, some subset of haemopoietic cells and chondrocytes; its ectopic expression is linked to oncogenesis in multiple tissues. To date details of the DNA-binding process of ERG including DNA-sequence recognition outside the core GGAA-sequence are largely unknown. We combined available structural and experimental data to perform molecular dynamics simulations to study the DNA-binding process of ERG. In particular we were able to reproduce the ERG DNA-complex with a DNA-binding simulation starting in an unbound configuration with a final root-mean-square-deviation (RMSD) of 2.1 Å to the core ETS domain DNA-complex crystal structure. This allowed us to elucidate the relevance of amino acids involved in the formation of the ERG DNA-complex and to identify Arg385 as a novel key residue in the DNA-binding process. Moreover we were able to show that water-mediated hydrogen bonds are present between ERG and DNA in our simulations and that those interactions have the potential to achieve sequence recognition outside the GGAA core DNA-sequence. The methodology employed in this study shows the promising capabilities of modern molecular dynamics simulations in the field of protein DNA-interactions.
MotifMark: Finding regulatory motifs in DNA sequences.

PubMed

Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L; Wang, May D

2017-07-01

The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity between proteins and DNA motifs. Despite their success, these technologies have their own limitations and fall short in precise characterization of motifs, and as a result, require further downstream analysis to extract useful and interpretable information from a haystack of noisy and inaccurate data. Here we propose MotifMark, a new algorithm based on graph theory and machine learning, that can find binding sites on candidate probes and rank their specificity in regard to the underlying transcription factor. We developed a pipeline to analyze experimental data derived from compact universal protein binding microarrays and benchmarked it against two of the most accurate motif search methods. Our results indicate that MotifMark can be a viable alternative technique for prediction of motif from protein binding microarrays and possibly other related high-throughput techniques.
Specification of anteroposterior cell fates in Caenorhabditis elegans by Drosophila Hox proteins.

PubMed

Hunter, C P; Kenyon, C

1995-09-21

Antennapedia class homeobox (Hox) genes specify cell fates in successive anteroposterior body domains in vertebrates, insects and nematodes. The DNA-binding homeodomain sequences are very similar between vertebrate and Drosophila Hox proteins, and this similarity allows vertebrate Hox proteins to function in Drosophila. In contrast, the Caenorhabditis elegans homeodomains are substantially divergent. Further, C. elegans differs from both insects and vertebrates in having a non-segmented body as well as a distinctive mode of development that involves asymmetric early cleavages and invariant cell lineages. Here we report that, despite these differences, Drosophila Hox proteins expressed in C. elegans can substitute for C. elegans Hox proteins in the control of three different cell-fate decisions: the regulation of cell migration, the specification of serotonergic neurons, and the specification of a sensory structure. We also show that the specificity of one C. elegans Hox protein is partly determined by two amino acids that have been implicated in sequence-specific DNA binding. Together these findings suggest that factors important for target recognition by specific Hox proteins have been conserved throughout much of the animal kingdom.
Cell-type-specific profiling of protein-DNA interactions without cell isolation using targeted DamID with next-generation sequencing.

PubMed

Marshall, Owen J; Southall, Tony D; Cheetham, Seth W; Brand, Andrea H

2016-09-01

This protocol is an extension to: Nat. Protoc. 2, 1467-1478 (2007); doi:10.1038/nprot.2007.148; published online 7 June 2007The ability to profile transcription and chromatin binding in a cell-type-specific manner is a powerful aid to understanding cell-fate specification and cellular function in multicellular organisms. We recently developed targeted DamID (TaDa) to enable genome-wide, cell-type-specific profiling of DNA- and chromatin-binding proteins in vivo without cell isolation. As a protocol extension, this article describes substantial modifications to an existing protocol, and it offers additional applications. TaDa builds upon DamID, a technique for detecting genome-wide DNA-binding profiles of proteins, by coupling it with the GAL4 system in Drosophila to enable both temporal and spatial resolution. TaDa ensures that Dam-fusion proteins are expressed at very low levels, thus avoiding toxicity and potential artifacts from overexpression. The modifications to the core DamID technique presented here also increase the speed of sample processing and throughput, and adapt the method to next-generation sequencing technology. TaDa is robust, reproducible and highly sensitive. Compared with other methods for cell-type-specific profiling, the technique requires no cell-sorting, cross-linking or antisera, and binding profiles can be generated from as few as 10,000 total induced cells. By profiling the genome-wide binding of RNA polymerase II (Pol II), TaDa can also identify transcribed genes in a cell-type-specific manner. Here we describe a detailed protocol for carrying out TaDa experiments and preparing the material for next-generation sequencing. Although we developed TaDa in Drosophila, it should be easily adapted to other organisms with an inducible expression system. Once transgenic animals are obtained, the entire experimental procedure-from collecting tissue samples to generating sequencing libraries-can be accomplished within 5 d.
Molecular mechanisms of conformational specificity: A study of Hox in vivo target DNA binding specificities and the structure of a Ure2p mutation that affects fibril formation rates

NASA Astrophysics Data System (ADS)

Bauer, William Joseph, Jr.

The fate of an individual cell, or even an entire organism, is often determined by minute, yet very specific differences in the conformation of a single protein species. Very often, proteins take on alternate folds or even side chain conformations to deal with different situations present within the cell. These differences can be as large as a whole domain or as subtle as the alteration of a single amino acid side chain. Yet, even these seemingly minor side chain conformational differences can determine the development of a cell type during differentiation or even dictate whether a cell will live or die. Two examples of situations where minor conformational differences within a specific protein could lead to major differences in the life cycle of a cell are described herein. The first example describes the variations seen in DNA conformations which can lead to slightly different Hox protein binding conformations responsible for recognizing biologically relevant regulatory sites. These specific differences occur in the minor groove of the bound DNA and are limited to the conformation of only two side chains. The conformation of the bound DNA, however, is not solely determined by the sequence of the DNA, as multiple sequences can result in the same DNA conformation. The second example takes place in the context of a yeast prion protein which contains a mutation that decreases the frequency at which fibrils form. While the specific interactions leading to this physiological change were not directly detected, it can be ascertained from the crystal structure that the structural changes are subtle and most likely involve another binding partner. In both cases, these conformational changes are very slight but have a profound effect on the downstream processes.
Non-B-DNA structures on the interferon-beta promoter?

PubMed

Robbe, K; Bonnefoy, E

1998-01-01

The high mobility group (HMG) I protein intervenes as an essential factor during the virus induced expression of the interferon-beta (IFN-beta) gene. It is a non-histone chromatine associated protein that has the dual capacity of binding to a non-B-DNA structure such as cruciform-DNA as well as to AT rich B-DNA sequences. In this work we compare the binding affinity of HMGI for a synthetic cruciform-DNA to its binding affinity for the HMGI-binding-site present in the positive regulatory domain II (PRDII) of the IFN-beta promoter. Using gel retardation experiments, we show that HMGI protein binds with at least ten times more affinity to the synthetic cruciform-DNA structure than to the PRDII B-DNA sequence. DNA hairpin sequences are present in both the human and the murine PRDII-DNAs. We discuss in this work the presence of, yet putative, non-B-DNA structures in the IFN-beta promoter.
Structure-based Analysis to Hu-DNA Binding

DOE Office of Scientific and Technical Information (OSTI.GOV)

Swinger,K.; Rice, P.

2007-01-01

HU and IHF are prokaryotic proteins that induce very large bends in DNA. They are present in high concentrations in the bacterial nucleoid and aid in chromosomal compaction. They also function as regulatory cofactors in many processes, such as site-specific recombination and the initiation of replication and transcription. HU and IHF have become paradigms for understanding DNA bending and indirect readout of sequence. While IHF shows significant sequence specificity, HU binds preferentially to certain damaged or distorted DNAs. However, none of the structurally diverse HU substrates previously studied in vitro is identical with the distorted substrates in the recently publishedmore » Anabaena HU(AHU)-DNA cocrystal structures. Here, we report binding affinities for AHU and the DNA in the cocrystal structures. The binding free energies for formation of these AHU-DNA complexes range from 10-14.5 kcal/mol, representing K{sub d} values in the nanomolar to low picomolar range, and a maximum stabilization of at least 6.3 kcal/mol relative to complexes with undistorted, non-specific DNA. We investigated IHF binding and found that appropriate structural distortions can greatly enhance its affinity. On the basis of the coupling of structural and relevant binding data, we estimate the amount of conformational strain in an IHF-mediated DNA kink that is relieved by a nick (at least 0.76 kcal/mol) and pinpoint the location of the strain. We show that AHU has a sequence preference for an A+T-rich region in the center of its DNA-binding site, correlating with an unusually narrow minor groove. This is similar to sequence preferences shown by the eukaryotic nucleosome.« less
Alpha-crystallins are involved in specific interactions with the murine gamma D/E/F-crystallin-encoding gene.

PubMed

Pietrowski, D; Durante, M J; Liebstein, A; Schmitt-John, T; Werner, T; Graw, J

1994-07-08

The promoter of the murine gamma E-crystallin (gamma E-Cry) encoding gene (gamma E-cry) was analyzed for specific interactions with lenticular proteins in a gel-retardation assay. A 21-bp fragment immediately downstream of the transcription initiation site (DOTIS) is demonstrated to be responsible for specific interactions with lens extracts. The DOTIS-binding protein(s) accept only the sense DNA strand as target; anti-sense or double-stranded DNA do not interact with these proteins. The DOTIS sequence element is highly conserved among the murine gamma D-, gamma E- and gamma F-cry and is present at comparable positions in the orthologous rat genes. Only a weak or even no protein-binding activity is observed if a few particular bases are changed, as in the rat gamma A-, gamma C- and gamma E-cry elements. DOTIS-binding proteins were found in commercially available bovine alpha-Cry preparations. The essential participation of alpha-Cry in the DNA-binding protein complex was confirmed using alpha-Cry-specific monoclonal antibody. The results reported here point to a novel function of alpha-Cry besides the structural properties in the lens.
Understanding the recognition mechanisms of Zα domain of human editing enzyme ADAR1 (hZα(ADAR1)) and various Z-DNAs from molecular dynamics simulation.

PubMed

Wang, Qianqian; Li, Lanlan; Wang, Xiaoting; Liu, Huanxiang; Yao, Xiaojun

2014-11-01

The Z-DNA-binding domain of human double-stranded RNA adenosine deaminase I (hZαADAR1) can specifically recognize the left-handed Z-DNA which preferentially occurs at alternating purine-pyrimidine repeats, especially the CG-repeats. The interactions of hZαADAR1 and Z-DNAs in different sequence contexts can affect many important biological functions including gene regulation and chromatin remodeling. Therefore it is of great necessity to fully understand their recognition mechanisms. However, most existing studies are aimed at the standard CG-repeat Z-DNA rather than the non-CG-repeats, and whether the molecular basis of hZαADAR1 binding to various Z-DNAs are identical or not is still unclear on the atomic level. Here, based on the recently determined crystal structures of three representative non-CG-repeat Z-DNAs (d(CACGTG)2, d(CGTACG)2 and d(CGGCCG)2) in complex with hZαADAR1, 40 ns molecular dynamics simulation together with binding free energy calculation were performed for each system. For comparison, the standard CG-repeat Z-DNA (d(CGCGCG)2) complexed with hZαADAR1 was also simulated. The consistent results demonstrate that nonpolar interaction is the driving force during the protein-DNA binding process, and that polar interaction mainly from helix α3 also provides important contributions. Five common hot-spot residues were identified, namely Lys169, Lys170, Asn173, Arg174 and Tyr177. Hydrogen bond analysis coupled with surface charge distribution further reveal the interfacial information between hZαADAR1 and Z-DNA in detail. All of the analysis illustrate that four complexes share the common key features and the similar binding modes irrespective of Z-DNA sequences, suggesting that Z-DNA recognition by hZαADAR1 is conformation-specific rather than sequence-specific. Additionally, by analyzing the conformational changes of hZαADAR1, we found that the binding of Z-DNA could effectively stabilize hZαADAR1 protein. Our study can provide some valuable information for better understanding the binding mechanism between hZαADAR1 or even other Z-DNA-binding protein and Z-DNA.
GBshape: a genome browser database for DNA shape annotations.

PubMed

Chiu, Tsu-Pei; Yang, Lin; Zhou, Tianyin; Main, Bradley J; Parker, Stephen C J; Nuzhdin, Sergey V; Tullius, Thomas D; Rohs, Remo

2015-01-01

Many regulatory mechanisms require a high degree of specificity in protein-DNA binding. Nucleotide sequence does not provide an answer to the question of why a protein binds only to a small subset of the many putative binding sites in the genome that share the same core motif. Whereas higher-order effects, such as chromatin accessibility, cooperativity and cofactors, have been described, DNA shape recently gained attention as another feature that fine-tunes the DNA binding specificities of some transcription factor families. Our Genome Browser for DNA shape annotations (GBshape; freely available at http://rohslab.cmb.usc.edu/GBshape/) provides minor groove width, propeller twist, roll, helix twist and hydroxyl radical cleavage predictions for the entire genomes of 94 organisms. Additional genomes can easily be added using the GBshape framework. GBshape can be used to visualize DNA shape annotations qualitatively in a genome browser track format, and to download quantitative values of DNA shape features as a function of genomic position at nucleotide resolution. As biological applications, we illustrate the periodicity of DNA shape features that are present in nucleosome-occupied sequences from human, fly and worm, and we demonstrate structural similarities between transcription start sites in the genomes of four Drosophila species. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
pUL34 binding near the human cytomegalovirus origin of lytic replication enhances DNA replication and viral growth.

PubMed

Slayton, Mark; Hossain, Tanvir; Biegalke, Bonita J

2018-05-01

The human cytomegalovirus (HCMV) UL34 gene encodes sequence-specific DNA-binding proteins (pUL34) which are required for viral replication. Interactions of pUL34 with DNA binding sites represses transcription of two viral immune evasion genes, US3 and US9. 12 additional predicted pUL34-binding sites are present in the HCMV genome (strain AD169) with three binding sites concentrated near the HCMV origin of lytic replication (oriLyt). We used ChIP-seq analysis of pUL34-DNA interactions to confirm that pUL34 binds to the oriLyt region during infection. Mutagenesis of the UL34-binding sites in an oriLyt-containing plasmid significantly reduced viral-mediated oriLyt-dependent DNA replication. Mutagenesis of these sites in the HCMV genome reduced the replication efficiencies of the resulting viruses. Protein-protein interaction analyses demonstrated that pUL34 interacts with the viral proteins IE2, UL44, and UL84, that are essential for viral DNA replication, suggesting that pUL34-DNA interactions in the oriLyt region are involved in the DNA replication cascade. Copyright © 2018 Elsevier Inc. All rights reserved.
Direct DNA binding by Brca1.

PubMed

Paull, T T; Cortez, D; Bowers, B; Elledge, S J; Gellert, M

2001-05-22

The tumor suppressor Brca1 plays an important role in protecting mammalian cells against genomic instability, but little is known about its modes of action. In this work we demonstrate that recombinant human Brca1 protein binds strongly to DNA, an activity conferred by a domain in the center of the Brca1 polypeptide. As a result of this binding, Brca1 inhibits the nucleolytic activities of the Mre11/Rad50/Nbs1 complex, an enzyme implicated in numerous aspects of double-strand break repair. Brca1 displays a preference for branched DNA structures and forms protein-DNA complexes cooperatively between multiple DNA strands, but without DNA sequence specificity. This fundamental property of Brca1 may be an important part of its role in DNA repair and transcription.
DNA mimic proteins: functions, structures, and bioinformatic analysis.

PubMed

Wang, Hao-Ching; Ho, Chun-Han; Hsu, Kai-Cheng; Yang, Jinn-Moon; Wang, Andrew H-J

2014-05-13

DNA mimic proteins have DNA-like negative surface charge distributions, and they function by occupying the DNA binding sites of DNA binding proteins to prevent these sites from being accessed by DNA. DNA mimic proteins control the activities of a variety of DNA binding proteins and are involved in a wide range of cellular mechanisms such as chromatin assembly, DNA repair, transcription regulation, and gene recombination. However, the sequences and structures of DNA mimic proteins are diverse, making them difficult to predict by bioinformatic search. To date, only a few DNA mimic proteins have been reported. These DNA mimics were not found by searching for functional motifs in their sequences but were revealed only by structural analysis of their charge distribution. This review highlights the biological roles and structures of 16 reported DNA mimic proteins. We also discuss approaches that might be used to discover new DNA mimic proteins.
A mammary cell-specific enhancer in mouse mammary tumor virus DNA is composed of multiple regulatory elements including binding sites for CTF/NFI and a novel transcription factor, mammary cell-activating factor.

PubMed Central

Mink, S; Härtig, E; Jennewein, P; Doppler, W; Cato, A C

1992-01-01

Mouse mammary tumor virus (MMTV) is a milk-transmitted retrovirus involved in the neoplastic transformation of mouse mammary gland cells. The expression of this virus is regulated by mammary cell type-specific factors, steroid hormones, and polypeptide growth factors. Sequences for mammary cell-specific expression are located in an enhancer element in the extreme 5' end of the long terminal repeat region of this virus. This enhancer, when cloned in front of the herpes simplex thymidine kinase promoter, endows the promoter with mammary cell-specific response. Using functional and DNA-protein-binding studies with constructs mutated in the MMTV long terminal repeat enhancer, we have identified two main regulatory elements necessary for the mammary cell-specific response. These elements consist of binding sites for a transcription factor in the family of CTF/NFI proteins and the transcription factor mammary cell-activating factor (MAF) that recognizes the sequence G Pu Pu G C/G A A G G/T. Combinations of CTF/NFI- and MAF-binding sites or multiple copies of either one of these binding sites but not solitary binding sites mediate mammary cell-specific expression. The functional activities of these two regulatory elements are enhanced by another factor that binds to the core sequence ACAAAG. Interdigitated binding sites for CTF/NFI, MAF, and/or the ACAAAG factor are also found in the 5' upstream regions of genes encoding whey milk proteins from different species. These findings suggest that mammary cell-specific regulation is achieved by a concerted action of factors binding to multiple regulatory sites. Images PMID:1328867
Adjacent DNA sequences modulate Sox9 transcriptional activation at paired Sox sites in three chondrocyte-specific enhancer elements

PubMed Central

Bridgewater, Laura C.; Walker, Marlan D.; Miller, Gwen C.; Ellison, Trevor A.; Holsinger, L. Daniel; Potter, Jennifer L.; Jackson, Todd L.; Chen, Reuben K.; Winkel, Vicki L.; Zhang, Zhaoping; McKinney, Sandra; de Crombrugghe, Benoit

2003-01-01

Expression of the type XI collagen gene Col11a2 is directed to cartilage by at least three chondrocyte-specific enhancer elements, two in the 5′ region and one in the first intron of the gene. The three enhancers each contain two heptameric sites with homology to the Sox protein-binding consensus sequence. The two sites are separated by 3 or 4 bp and arranged in opposite orientation to each other. Targeted mutational analyses of these three enhancers showed that in the intronic enhancer, as in the other two enhancers, both Sox sites in a pair are essential for enhancer activity. The transcription factor Sox9 binds as a dimer at the paired sites, and the introduction of insertion mutations between the sites demonstrated that physical interactions between the adjacently bound proteins are essential for enhancer activity. Additional mutational analyses demonstrated that although Sox9 binding at the paired Sox sites is necessary for enhancer activity, it alone is not sufficient. Adjacent DNA sequences in each enhancer are also required, and mutation of those sequences can eliminate enhancer activity without preventing Sox9 binding. The data suggest a new model in which adjacently bound proteins affect the DNA bend angle produced by Sox9, which in turn determines whether an active transcriptional enhancer complex is assembled. PMID:12595563
MFP1 is a thylakoid-associated, nucleoid-binding protein with a coiled-coil structure

PubMed Central

Jeong, Sun Yong; Rose, Annkatrin; Meier, Iris

2003-01-01

Plastid DNA, like bacterial and mitochondrial DNA, is organized into protein–DNA complexes called nucleoids. Plastid nucleoids are believed to be associated with the inner envelope in developing plastids and the thylakoid membranes in mature chloroplasts, but the mechanism for this re-localization is unknown. Here, we present the further characterization of the coiled-coil DNA-binding protein MFP1 as a protein associated with nucleoids and with the thylakoid membranes in mature chloroplasts. MFP1 is located in plastids in both suspension culture cells and leaves and is attached to the thylakoid membranes with its C-terminal DNA-binding domain oriented towards the stroma. It has a major DNA-binding activity in mature Arabidopsis chloroplasts and binds to all tested chloroplast DNA fragments without detectable sequence specificity. Its expression is tightly correlated with the accumulation of thylakoid membranes. Importantly, it is associated in vivo with nucleoids, suggesting a function for MFP1 at the interface between chloroplast nucleoids and the developing thylakoid membrane system. PMID:12930969
Substrate sequence selectivity of APOBEC3A implicates intra-DNA interactions.

PubMed

Silvas, Tania V; Hou, Shurong; Myint, Wazo; Nalivaika, Ellen; Somasundaran, Mohan; Kelch, Brian A; Matsuo, Hiroshi; Kurt Yilmaz, Nese; Schiffer, Celia A

2018-05-14

The APOBEC3 (A3) family of human cytidine deaminases is renowned for providing a first line of defense against many exogenous and endogenous retroviruses. However, the ability of these proteins to deaminate deoxycytidines in ssDNA makes A3s a double-edged sword. When overexpressed, A3s can mutate endogenous genomic DNA resulting in a variety of cancers. Although the sequence context for mutating DNA varies among A3s, the mechanism for substrate sequence specificity is not well understood. To characterize substrate specificity of A3A, a systematic approach was used to quantify the affinity for substrate as a function of sequence context, length, secondary structure, and solution pH. We identified the A3A ssDNA binding motif as (T/C)TC(A/G), which correlated with enzymatic activity. We also validated that A3A binds RNA in a sequence specific manner. A3A bound tighter to substrate binding motif within a hairpin loop compared to linear oligonucleotide, suggesting A3A affinity is modulated by substrate structure. Based on these findings and previously published A3A-ssDNA co-crystal structures, we propose a new model with intra-DNA interactions for the molecular mechanism underlying A3A sequence preference. Overall, the sequence and structural preferences identified for A3A leads to a new paradigm for identifying A3A's involvement in mutation of endogenous or exogenous DNA.

A structural-alphabet-based strategy for finding structural motifs across protein families

PubMed Central

Wu, Chih Yuan; Chen, Yao Chi; Lim, Carmay

2010-01-01

Proteins with insignificant sequence and overall structure similarity may still share locally conserved contiguous structural segments; i.e. structural/3D motifs. Most methods for finding 3D motifs require a known motif to search for other similar structures or functionally/structurally crucial residues. Here, without requiring a query motif or essential residues, a fully automated method for discovering 3D motifs of various sizes across protein families with different folds based on a 16-letter structural alphabet is presented. It was applied to structurally non-redundant proteins bound to DNA, RNA, obligate/non-obligate proteins as well as free DNA-binding proteins (DBPs) and proteins with known structures but unknown function. Its usefulness was illustrated by analyzing the 3D motifs found in DBPs. A non-specific motif was found with a ‘corner’ architecture that confers a stable scaffold and enables diverse interactions, making it suitable for binding not only DNA but also RNA and proteins. Furthermore, DNA-specific motifs present ‘only’ in DBPs were discovered. The motifs found can provide useful guidelines in detecting binding sites and computational protein redesign. PMID:20525797
Multiple structure-intrinsic disorder interactions regulate and coordinate Hox protein function

NASA Astrophysics Data System (ADS)

Bondos, Sarah

During animal development, Hox transcription factors determine fate of developing tissues to generate diverse organs and appendages. Hox proteins are famous for their bizarre mutant phenotypes, such as replacing antennae with legs. Clearly, the functions of individual Hox proteins must be distinct and reliable in vivo, or the organism risks malformation or death. However, within the Hox protein family, the DNA-binding homeodomains are highly conserved and the amino acids that contact DNA are nearly invariant. These observations raise the question: How do different Hox proteins correctly identify their distinct target genes using a common DNA binding domain? One possible means to modulate DNA binding is through the influence of the non-homeodomain protein regions, which differ significantly among Hox proteins. However genetic approaches never detected intra-protein interactions, and early biochemical attempts were hindered because the special features of ``intrinsically disordered'' sequences were not appreciated. We propose the first-ever structural model of a Hox protein to explain how specific contacts between distant, intrinsically disordered regions of the protein and the homeodomain regulate DNA binding and coordinate this activity with other Hox molecular functions.
Electrostatic control of DNA intersegmental translocation by the ETS transcription factor ETV6.

PubMed

Vo, Tam; Wang, Shuo; Poon, Gregory M K; Wilson, W David

2017-08-11

To find their DNA target sites in complex solution environments containing excess heterogeneous DNA, sequence-specific DNA-binding proteins execute various translocation mechanisms known collectively as facilitated diffusion. For proteins harboring a single DNA contact surface, long-range translocation occurs by jumping between widely spaced DNA segments. We have configured biosensor-based surface plasmon resonance to directly measure the affinity and kinetics of this intersegmental jumping by the ETS-family transcription factor ETS variant 6 (ETV6). To isolate intersegmental target binding in a functionally defined manner, we pre-equilibrated ETV6 with excess salmon sperm DNA, a heterogeneous polymer, before exposing the nonspecifically bound protein to immobilized oligomeric DNA harboring a high-affinity ETV6 site. In this way, the mechanism of ETV6-target association could be toggled electrostatically through varying NaCl concentration in the bulk solution. Direct measurements of association and dissociation kinetics of the site-specific complex indicated that 1) freely diffusive binding by ETV6 proceeds through a nonspecific-like intermediate, 2) intersegmental jumping is rate-limited by dissociation from the nonspecific polymer, and 3) dissociation of the specific complex is independent of the history of complex formation. These results show that target searches by proteins with an ETS domain, such as ETV6, whose single DNA-binding domain cannot contact both source and destination sites simultaneously, are nonetheless strongly modulated by intersegmental jumping in heterogeneous site environments. Our findings establish biosensors as a general technique for directly and specifically measuring target site search by DNA-binding proteins via intersegmental translocation. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
A Novel DNA Binding Mechanism for maf Basic Region-Leucine Zipper Factors Inferred from a MafA-DNA Complex Structure and Binding Specificities

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lu, Xun; Guanga, Gerald P; Wan, Cheng

2012-11-13

MafA is a proto-oncoprotein and is critical for insulin gene expression in pancreatic β-cells. Maf proteins belong to the AP1 superfamily of basic region-leucine zipper (bZIP) transcription factors. Residues in the basic helix and an ancillary N-terminal domain, the Extended Homology Region (EHR), endow maf proteins with unique DNA binding properties: binding a 13 bp consensus site consisting of a core AP1 site (TGACTCA) flanked by TGC sequences and binding DNA stably as monomers. To further characterize maf DNA binding, we determined the structure of a MafA–DNA complex. MafA forms base-specific hydrogen bonds with the flanking G –5C –4 andmore » central C 0/G 0 bases, but not with the core-TGA bases. However, in vitro binding studies utilizing a pulse–chase electrophoretic mobility shift assay protocol revealed that mutating either the core-TGA or flanking-TGC bases dramatically increases the binding off rate. Comparing the known maf structures, we propose that DNA binding specificity results from positioning the basic helix through unique phosphate contacts. The EHR does not contact DNA directly but stabilizes DNA binding by contacting the basic helix. Collectively, these results suggest a novel multistep DNA binding process involving a conformational change from contacting the core-TGA to contacting the flanking-TGC bases.« less
Two new insulator proteins, Pita and ZIPIC, target CP190 to chromatin.

PubMed

Maksimenko, Oksana; Bartkuhn, Marek; Stakhov, Viacheslav; Herold, Martin; Zolotarev, Nickolay; Jox, Theresa; Buxa, Melanie K; Kirsch, Ramona; Bonchuk, Artem; Fedotova, Anna; Kyrchanova, Olga; Renkawitz, Rainer; Georgiev, Pavel

2015-01-01

Insulators are multiprotein-DNA complexes that regulate the nuclear architecture. The Drosophila CP190 protein is a cofactor for the DNA-binding insulator proteins Su(Hw), CTCF, and BEAF-32. The fact that CP190 has been found at genomic sites devoid of either of the known insulator factors has until now been unexplained. We have identified two DNA-binding zinc-finger proteins, Pita, and a new factor named ZIPIC, that interact with CP190 in vivo and in vitro at specific interaction domains. Genomic binding sites for these proteins are clustered with CP190 as well as with CTCF and BEAF-32. Model binding sites for Pita or ZIPIC demonstrate a partial enhancer-blocking activity and protect gene expression from PRE-mediated silencing. The function of the CTCF-bound MCP insulator sequence requires binding of Pita. These results identify two new insulator proteins and emphasize the unifying function of CP190, which can be recruited by many DNA-binding insulator proteins. © 2015 Maksimenko et al.; Published by Cold Spring Harbor Laboratory Press.
Multiplex single-molecule interaction profiling of DNA-barcoded proteins.

PubMed

Gu, Liangcai; Li, Chao; Aach, John; Hill, David E; Vidal, Marc; Church, George M

2014-11-27

In contrast with advances in massively parallel DNA sequencing, high-throughput protein analyses are often limited by ensemble measurements, individual analyte purification and hence compromised quality and cost-effectiveness. Single-molecule protein detection using optical methods is limited by the number of spectrally non-overlapping chromophores. Here we introduce a single-molecular-interaction sequencing (SMI-seq) technology for parallel protein interaction profiling leveraging single-molecule advantages. DNA barcodes are attached to proteins collectively via ribosome display or individually via enzymatic conjugation. Barcoded proteins are assayed en masse in aqueous solution and subsequently immobilized in a polyacrylamide thin film to construct a random single-molecule array, where barcoding DNAs are amplified into in situ polymerase colonies (polonies) and analysed by DNA sequencing. This method allows precise quantification of various proteins with a theoretical maximum array density of over one million polonies per square millimetre. Furthermore, protein interactions can be measured on the basis of the statistics of colocalized polonies arising from barcoding DNAs of interacting proteins. Two demanding applications, G-protein coupled receptor and antibody-binding profiling, are demonstrated. SMI-seq enables 'library versus library' screening in a one-pot assay, simultaneously interrogating molecular binding affinity and specificity.
Multiplex single-molecule interaction profiling of DNA barcoded proteins

PubMed Central

Gu, Liangcai; Li, Chao; Aach, John; Hill, David E.; Vidal, Marc; Church, George M.

2014-01-01

In contrast with advances in massively parallel DNA sequencing1, high-throughput protein analyses2-4 are often limited by ensemble measurements, individual analyte purification and hence compromised quality and cost-effectiveness. Single-molecule (SM) protein detection achieved using optical methods5 is limited by the number of spectrally nonoverlapping chromophores. Here, we introduce a single molecular interaction-sequencing (SMI-Seq) technology for parallel protein interaction profiling leveraging SM advantages. DNA barcodes are attached to proteins collectively via ribosome display6 or individually via enzymatic conjugation. Barcoded proteins are assayed en masse in aqueous solution and subsequently immobilized in a polyacrylamide (PAA) thin film to construct a random SM array, where barcoding DNAs are amplified into in situ polymerase colonies (polonies)7 and analyzed by DNA sequencing. This method allows precise quantification of various proteins with a theoretical maximum array density of over one million polonies per square millimeter. Furthermore, protein interactions can be measured based on the statistics of colocalized polonies arising from barcoding DNAs of interacting proteins. Two demanding applications, G-protein coupled receptor (GPCR) and antibody binding profiling, were demonstrated. SMI-Seq enables “library vs. library” screening in a one-pot assay, simultaneously interrogating molecular binding affinity and specificity. PMID:25252978
The Electronic Behavior of Zinc-Finger Protein Binding Sites in the Context of the DNA Extended Ladder Model

NASA Astrophysics Data System (ADS)

Oiwa, Nestor; Cordeiro, Claudette; Heermann, Dieter

2016-05-01

Instead of ATCG letter alignments, typically used in bioinformatics, we propose a new alignment method using the probability distribution function of the bottom of the occupied molecular orbital (BOMO), highest occupied molecular orbital (HOMO) and lowest unoccupied orbital (LUMO). We apply the technique to transcription factors with Cys2His2 zinc fingers. These transcription factors search for binding sites, probing for the electronic patterns at the minor and major DNA groves. The eukaryotic Cys2His2 zinc finger proteins bind to DNA ubiquitously at highly conserved domains. They are responsible for gene regulation and the spatial organization of DNA. To study and understand these zinc finger DNA-protein interactions, we use the extended ladder in the DNA model proposed by Zhu, Rasmussen, Balatsky & Bishop (2007) te{Zhu-2007}. Considering one single spinless electron in each nucleotide π-orbital along a double DNA chain (dDNA), we find a typical pattern for the bottom of BOMO, HOMO and LUMO along the binding sites. We specifically looked at two members of zinc finger protein family: specificity protein 1 (SP1) and early grown response 1 transcription factors (EGR1). When the valence band is filled, we find electrons in the purines along the nucleotide sequence, compatible with the electric charges of the binding amino acids in SP1 and EGR1 zinc finger.
RPA and POT1: friends or foes at telomeres?

PubMed

Flynn, Rachel Litman; Chang, Sandy; Zou, Lee

2012-02-15

Telomere maintenance in cycling cells relies on both DNA replication and capping by the protein complex shelterin. Two single-stranded DNA (ssDNA)-binding proteins, replication protein A (RPA) and protection of telomere 1 (POT1) play critical roles in DNA replication and telomere capping, respectively. While RPA binds to ssDNA in a non-sequence-specific manner, POT1 specifically recognizes singlestranded TTAGGG telomeric repeats. Loss of POT1 leads to aberrant accumulation of RPA at telomeres and activation of the ataxia telangiectasia and Rad3-related kinase (ATR)-mediated checkpoint response, suggesting that POT1 antagonizes RPA binding to telomeric ssDNA. The requirement for both POT1 and RPA in telomere maintenance and the antagonism between the two proteins raises the important question of how they function in concert on telomeric ssDNA. Two interesting models were proposed by recent studies to explain the regulation of POT1 and RPA at telomeres. Here, we discuss how these models help unravel the coordination, and also the antagonism, between POT1 and RPA during the cell cycle.
Effects of nucleoside analog incorporation on DNA binding to the DNA binding domain of the GATA-1 erythroid transcription factor.

PubMed

Foti, M; Omichinski, J G; Stahl, S; Maloney, D; West, J; Schweitzer, B I

1999-02-05

We investigate here the effects of the incorporation of the nucleoside analogs araC (1-beta-D-arabinofuranosylcytosine) and ganciclovir (9-[(1,3-dihydroxy-2-propoxy)methyl] guanine) into the DNA binding recognition sequence for the GATA-1 erythroid transcription factor. A 10-fold decrease in binding affinity was observed for the ganciclovir-substituted DNA complex in comparison to an unmodified DNA of the same sequence composition. AraC substitution did not result in any changes in binding affinity. 1H-15N HSQC and NOESY NMR experiments revealed a number of chemical shift changes in both DNA and protein in the ganciclovir-modified DNA-protein complex when compared to the unmodified DNA-protein complex. These changes in chemical shift and binding affinity suggest a change in the binding mode of the complex when ganciclovir is incorporated into the GATA DNA binding site.
Common fold in helix–hairpin–helix proteins

PubMed Central

Shao, Xuguang; Grishin, Nick V.

2000-01-01

Helix–hairpin–helix (HhH) is a widespread motif involved in non-sequence-specific DNA binding. The majority of HhH motifs function as DNA-binding modules, however, some of them are used to mediate protein–protein interactions or have acquired enzymatic activity by incorporating catalytic residues (DNA glycosylases). From sequence and structural analysis of HhH-containing proteins we conclude that most HhH motifs are integrated as a part of a five-helical domain, termed (HhH)2 domain here. It typically consists of two consecutive HhH motifs that are linked by a connector helix and displays pseudo-2-fold symmetry. (HhH)2 domains show clear structural integrity and a conserved hydrophobic core composed of seven residues, one residue from each α-helix and each hairpin, and deserves recognition as a distinct protein fold. In addition to known HhH in the structures of RuvA, RadA, MutY and DNA-polymerases, we have detected new HhH motifs in sterile alpha motif and barrier-to-autointegration factor domains, the α-subunit of Escherichia coli RNA-polymerase, DNA-helicase PcrA and DNA glycosylases. Statistically significant sequence similarity of HhH motifs and pronounced structural conservation argue for homology between (HhH)2 domains in different protein families. Our analysis helps to clarify how non-symmetric protein motifs bind to the double helix of DNA through the formation of a pseudo-2-fold symmetric (HhH)2 functional unit. PMID:10908318
Different domains of the murine RNA polymerase I-specific termination factor mTTF-I serve distinct functions in transcription termination.

PubMed

Evers, R; Smid, A; Rudloff, U; Lottspeich, F; Grummt, I

1995-03-15

Termination of mouse ribosomal gene transcription by RNA polymerase I (Pol I) requires the specific interaction of a DNA binding protein, mTTF-I, with an 18 bp sequence element located downstream of the rRNA coding region. Here we describe the molecular cloning and functional characterization of the cDNA encoding this transcription termination factor. Recombinant mTTF-I binds specifically to the murine terminator elements and terminates Pol I transcription in a reconstituted in vitro system. Deletion analysis has defined a modular structure of mTTF-I comprising a dispensable N-terminal half, a large C-terminal DNA binding region and an internal domain which is required for transcription termination. Significantly, the C-terminal region of mTTF-I reveals striking homology to the DNA binding domains of the proto-oncogene c-Myb and the yeast transcription factor Reb1p. Site-directed mutagenesis of one of the tryptophan residues that is conserved in the homology region of c-Myb, Reb1p and mTTF-I abolishes specific DNA binding, a finding which underscores the functional relevance of these residues in DNA-protein interactions.
Different domains of the murine RNA polymerase I-specific termination factor mTTF-I serve distinct functions in transcription termination.

PubMed Central

Evers, R; Smid, A; Rudloff, U; Lottspeich, F; Grummt, I

1995-01-01

Termination of mouse ribosomal gene transcription by RNA polymerase I (Pol I) requires the specific interaction of a DNA binding protein, mTTF-I, with an 18 bp sequence element located downstream of the rRNA coding region. Here we describe the molecular cloning and functional characterization of the cDNA encoding this transcription termination factor. Recombinant mTTF-I binds specifically to the murine terminator elements and terminates Pol I transcription in a reconstituted in vitro system. Deletion analysis has defined a modular structure of mTTF-I comprising a dispensable N-terminal half, a large C-terminal DNA binding region and an internal domain which is required for transcription termination. Significantly, the C-terminal region of mTTF-I reveals striking homology to the DNA binding domains of the proto-oncogene c-Myb and the yeast transcription factor Reb1p. Site-directed mutagenesis of one of the tryptophan residues that is conserved in the homology region of c-Myb, Reb1p and mTTF-I abolishes specific DNA binding, a finding which underscores the functional relevance of these residues in DNA-protein interactions. Images PMID:7720715
IFI16 Preferentially Binds to DNA with Quadruplex Structure and Enhances DNA Quadruplex Formation.

PubMed

Hároníková, Lucia; Coufal, Jan; Kejnovská, Iva; Jagelská, Eva B; Fojta, Miroslav; Dvořáková, Petra; Muller, Petr; Vojtesek, Borivoj; Brázda, Václav

2016-01-01

Interferon-inducible protein 16 (IFI16) is a member of the HIN-200 protein family, containing two HIN domains and one PYRIN domain. IFI16 acts as a sensor of viral and bacterial DNA and is important for innate immune responses. IFI16 binds DNA and binding has been described to be DNA length-dependent, but a preference for supercoiled DNA has also been demonstrated. Here we report a specific preference of IFI16 for binding to quadruplex DNA compared to other DNA structures. IFI16 binds to quadruplex DNA with significantly higher affinity than to the same sequence in double stranded DNA. By circular dichroism (CD) spectroscopy we also demonstrated the ability of IFI16 to stabilize quadruplex structures with quadruplex-forming oligonucleotides derived from human telomere (HTEL) sequences and the MYC promotor. A novel H/D exchange mass spectrometry approach was developed to assess protein interactions with quadruplex DNA. Quadruplex DNA changed the IFI16 deuteration profile in parts of the PYRIN domain (aa 0-80) and in structurally identical parts of both HIN domains (aa 271-302 and aa 586-617) compared to single stranded or double stranded DNAs, supporting the preferential affinity of IFI16 for structured DNA. Our results reveal the importance of quadruplex DNA structure in IFI16 binding and improve our understanding of how IFI16 senses DNA. IFI16 selectivity for quadruplex structure provides a mechanistic framework for IFI16 in immunity and cellular processes including DNA damage responses and cell proliferation.
The NMR solution structure of a mutant of the Max b/HLH/LZ free of DNA: insights into the specific and reversible DNA binding mechanism of dimeric transcription factors.

PubMed

Sauvé, Simon; Tremblay, Luc; Lavigne, Pierre

2004-09-17

Basic region-helix1-loop-helix2-leucine zipper (b/H(1)LH(2)/LZ) transcription factors bind specific DNA sequence in their target gene promoters as dimers. Max, a b/H(1)LH(2)/LZ transcription factor, is the obligate heterodimeric partner of the related b/H(1)LH(2)/LZ proteins of the Myc and Mad families. These heterodimers specifically bind E-box DNA sequence (CACGTG) to activate (e.g. c-Myc/Max) and repress (e.g. Mad1/Max) transcription. Max can also homodimerize and bind E-box sequences in c-Myc target gene promoters. While the X-ray structure of the Max b/H(1)LH(2)/LZ/DNA complex and that of others have been reported, the precise sequence of events leading to the reversible and specific binding of these important transcription factors is still largely unknown. In order to provide insights into the DNA binding mechanism, we have solved the NMR solution structure of a covalently homodimerized version of a Max b/H(1)LH(2)/LZ protein with two stabilizing mutations in the LZ, and characterized its backbone dynamics from (15)N spin-relaxation measurements in the absence of DNA. Apart from minor differences in the pitch of the LZ, possibly resulting from the mutations in the construct, we observe that the packing of the helices in the H(1)LH(2) domain is almost identical to that of the two crystal structures, indicating that no important conformational change in these helices occurs upon DNA binding. Conversely to the crystal structures of the DNA complexes, the first 14 residues of the basic region are found to be mostly unfolded while the loop is observed to be flexible. This indicates that these domains undergo conformational changes upon DNA binding. On the other hand, we find the last four residues of the basic region form a persistent helical turn contiguous to H(1). In addition, we provide evidence of the existence of internal motions in the backbone of H(1) that are of larger amplitude and longer time-scale (nanoseconds) than the ones in the H(2) and LZ domain. Most interestingly, we note that conformers in the ensemble of calculated structures have highly conserved basic residues (located in the persistent helical turn of the basic region and in the loop) known to be important for specific binding in a conformation that matches that of the DNA-bound state. These partially prefolded conformers can directly fit into the major groove of DNA and as such are proposed to lie on the pathway leading to the reversible and specific DNA binding. In these conformers, the conserved basic side-chains form a cluster that elevates the local electrostatic potential and could provide the necessary driving force for the generation of the internal motions localized in the H(1) and therefore link structural determinants with the DNA binding function. Overall, our results suggests that the Max homodimeric b/H(1)LH(2)/LZ can rapidly and preferentially bind DNA sequence through transient and partially prefolded states and subsequently, adopt the fully helical bound state in a DNA-assisted mechanism or induced-fit.
Characterization of monomeric DNA-binding protein Histone H1 in Leishmania braziliensis.

PubMed

Carmelo, Emma; González, Gloria; Cruz, Teresa; Osuna, Antonio; Hernández, Mariano; Valladares, Basilio

2011-08-01

Histone H1 in Leishmania presents relevant differences compared to higher eukaryote counterparts, such as the lack of a DNA-binding central globular domain. Despite that, it is apparently fully functional since its differential expression levels have been related to changes in chromatin condensation and infectivity, among other features. The localization and the aggregation state of L. braziliensis H1 has been determined by immunolocalization, mass spectrometry, cross-linking and electrophoretic mobility shift assays. Analysis of H1 sequences from the Leishmania Genome Database revealed that our protein is included in a very divergent group of histones H1 that is present only in L. braziliensis. An antibody raised against recombinant L. braziliensis H1 recognized specifically that protein by immunoblot in L. braziliensis extracts, but not in other Leishmania species, a consequence of the sequence divergences observed among Leishmania species. Mass spectrometry analysis and in vitro DNA-binding experiments have also proven that L. braziliensis H1 is monomeric in solution, but oligomerizes upon binding to DNA. Finally, despite the lack of a globular domain, L. braziliensis H1 is able to form complexes with DNA in vitro, with higher affinity for supercoiled compared to linear DNA.
Identification of natural and artificial DNA substrates for the light-activated LOV-HTH transcription factor EL222

PubMed Central

Rivera-Cancel, Giomar; Motta-Mena, Laura B.; Gardner, Kevin H.

2012-01-01

Light-oxygen-voltage (LOV) domains serve as the photosensory modules for a wide range of plant and bacterial proteins, conferring blue light dependent regulation to effector activities as diverse as enzymes and DNA binding. LOV domains can also be engineered into a variety of exogenous targets, enabling similar regulation for new protein-based reagents. Common to these proteins is the ability for LOV domains to reversibly form a photochemical adduct between an internal flavin chromophore and the surrounding protein, using this to trigger conformational changes that affect output activity. Using the Erythrobacter litoralis protein EL222 model system which links LOV regulation to a helix-turn-helix (HTH) DNA binding domain, we demonstrated that the LOV domain binds and inhibits the HTH domain in the dark, releasing these interactions upon illumination [Nash et al. (2011) Proc. Natl. Acad. Sci. USA 108, 9449–9454]. Here we combine genomic and in vitro selection approaches to identify optimal DNA binding sites for EL222. Within the bacterial host, we observe binding several genomic sites using a 12 bp sequence consensus that is also found by in vitro selection methods. Sequence-specific alterations in the DNA consensus reduce EL222-binding affinity in a manner consistent with the expected binding mode: a protein dimer binding to two repeats. Finally, we demonstrate the light-dependent activation of transcription of two genes adjacent to an EL222 binding site. Taken together, these results shed light on the native function of EL222 and provide useful reagents for further basic and applications research of this versatile protein. PMID:23205774
The DnaA Tale

PubMed Central

Hansen, Flemming G.; Atlung, Tove

2018-01-01

More than 50 years have passed since the presentation of the Replicon Model which states that a positively acting initiator interacts with a specific site on a circular chromosome molecule to initiate DNA replication. Since then, the origin of chromosome replication, oriC, has been determined as a specific region that carries sequences required for binding of positively acting initiator proteins, DnaA-boxes and DnaA proteins, respectively. In this review we will give a historical overview of significant findings which have led to the very detailed knowledge we now possess about the initiation process in bacteria using Escherichia coli as the model organism, but emphasizing that virtually all bacteria have DnaA proteins that interacts with DnaA boxes to initiate chromosome replication. We will discuss the dnaA gene regulation, the special features of the dnaA gene expression, promoter strength, and translation efficiency, as well as, the DnaA protein, its concentration, its binding to DnaA-boxes, and its binding of ATP or ADP. Furthermore, we will discuss the different models for regulation of initiation which have been proposed over the years, with particular emphasis on the Initiator Titration Model. PMID:29541066
SRY, like HMG1, recognizes sharp angles in DNA.

PubMed Central

Ferrari, S; Harley, V R; Pontiggia, A; Goodfellow, P N; Lovell-Badge, R; Bianchi, M E

1992-01-01

HMG boxes are DNA binding domains present in chromatin proteins, general transcription factors for nucleolar and mitochondrial RNA polymerases, and gene- and tissue-specific transcriptional regulators. The HMG boxes of HMG1, an abundant component of chromatin, interact specifically with four-way junctions, DNA structures that are cross-shaped and contain angles of approximately 60 and 120 degrees between their arms. We show here also that the HMG box of SRY, the protein that determines the expression of male-specific genes in humans, recognizes four-way junction DNAs irrespective of their sequence. In addition, when SRY binds to linear duplex DNA containing its specific target AACAAAG, it produces a sharp bend. Therefore, the interaction between HMG boxes and DNA appears to be predominantly structure-specific. The production of the recognition of a kink in DNA can serve several distinct functions, such as the repair of DNA lesions, the folding of DNA segments with bound transcriptional factors into productive complexes or the wrapping of DNA in chromatin. Images PMID:1425584
The multi-zinc finger protein ZNF217 contacts DNA through a two-finger domain.

PubMed

Nunez, Noelia; Clifton, Molly M K; Funnell, Alister P W; Artuz, Crisbel; Hallal, Samantha; Quinlan, Kate G R; Font, Josep; Vandevenne, Marylène; Setiyaputra, Surya; Pearson, Richard C M; Mackay, Joel P; Crossley, Merlin

2011-11-04

Classical C2H2 zinc finger proteins are among the most abundant transcription factors found in eukaryotes, and the mechanisms through which they recognize their target genes have been extensively investigated. In general, a tandem array of three fingers separated by characteristic TGERP links is required for sequence-specific DNA recognition. Nevertheless, a significant number of zinc finger proteins do not contain a hallmark three-finger array of this type, raising the question of whether and how they contact DNA. We have examined the multi-finger protein ZNF217, which contains eight classical zinc fingers. ZNF217 is implicated as an oncogene and in repressing the E-cadherin gene. We show that two of its zinc fingers, 6 and 7, can mediate contacts with DNA. We examine its putative recognition site in the E-cadherin promoter and demonstrate that this is a suboptimal site. NMR analysis and mutagenesis is used to define the DNA binding surface of ZNF217, and we examine the specificity of the DNA binding activity using fluorescence anisotropy titrations. Finally, sequence analysis reveals that a variety of multi-finger proteins also contain two-finger units, and our data support the idea that these may constitute a distinct subclass of DNA recognition motif.

The Multi-zinc Finger Protein ZNF217 Contacts DNA through a Two-finger Domain*

PubMed Central

Nunez, Noelia; Clifton, Molly M. K.; Funnell, Alister P. W.; Artuz, Crisbel; Hallal, Samantha; Quinlan, Kate G. R.; Font, Josep; Vandevenne, Marylène; Setiyaputra, Surya; Pearson, Richard C. M.; Mackay, Joel P.; Crossley, Merlin

2011-01-01

Classical C2H2 zinc finger proteins are among the most abundant transcription factors found in eukaryotes, and the mechanisms through which they recognize their target genes have been extensively investigated. In general, a tandem array of three fingers separated by characteristic TGERP links is required for sequence-specific DNA recognition. Nevertheless, a significant number of zinc finger proteins do not contain a hallmark three-finger array of this type, raising the question of whether and how they contact DNA. We have examined the multi-finger protein ZNF217, which contains eight classical zinc fingers. ZNF217 is implicated as an oncogene and in repressing the E-cadherin gene. We show that two of its zinc fingers, 6 and 7, can mediate contacts with DNA. We examine its putative recognition site in the E-cadherin promoter and demonstrate that this is a suboptimal site. NMR analysis and mutagenesis is used to define the DNA binding surface of ZNF217, and we examine the specificity of the DNA binding activity using fluorescence anisotropy titrations. Finally, sequence analysis reveals that a variety of multi-finger proteins also contain two-finger units, and our data support the idea that these may constitute a distinct subclass of DNA recognition motif. PMID:21908891
Sa-Lrp from Sulfolobus acidocaldarius is a versatile, glutamine-responsive, and architectural transcriptional regulator

PubMed Central

Vassart, Amelia; Wolferen, Marleen; Orell, Alvaro; Hong, Ye; Peeters, Eveline; Albers, Sonja-Verena; Charlier, Daniel

2013-01-01

Sa-Lrp is a member of the leucine-responsive regulatory protein (Lrp)-like family of transcriptional regulators in Sulfolobus acidocaldarius. Previously, we demonstrated the binding of Sa-Lrp to the control region of its own gene in vitro. However, the function and cofactor of Sa-Lrp remained an enigma. In this work, we demonstrate that glutamine is the cofactor of Sa-Lrp by inducing the formation of octamers and increasing the DNA-binding affinity and sequence specificity. In vitro protein-DNA interaction assays indicate that Sa-Lrp binds to promoter regions of genes with a variety of functions including ammonia assimilation, transcriptional control, and UV-induced pili synthesis. DNA binding occurs with a specific affinity for AT-rich binding sites, and the protein induces DNA bending and wrapping upon binding, indicating an architectural role of the regulator. Furthermore, by analyzing an Sa-lrp deletion mutant, we demonstrate that the protein affects transcription of some of the genes of which the promoter region is targeted and that it is an important determinant of the cellular aggregation phenotype. Taking all these results into account, we conclude that Sa-Lrp is a glutamine-responsive global transcriptional regulator with an additional architectural role. PMID:23255531
Site- and strand-specific nicking of DNA by fusion proteins derived from MutH and I-SceI or TALE repeats.

PubMed

Gabsalilow, Lilia; Schierling, Benno; Friedhoff, Peter; Pingoud, Alfred; Wende, Wolfgang

2013-04-01

Targeted genome engineering requires nucleases that introduce a highly specific double-strand break in the genome that is either processed by homology-directed repair in the presence of a homologous repair template or by non-homologous end-joining (NHEJ) that usually results in insertions or deletions. The error-prone NHEJ can be efficiently suppressed by 'nickases' that produce a single-strand break rather than a double-strand break. Highly specific nickases have been produced by engineering of homing endonucleases and more recently by modifying zinc finger nucleases (ZFNs) composed of a zinc finger array and the catalytic domain of the restriction endonuclease FokI. These ZF-nickases work as heterodimers in which one subunit has a catalytically inactive FokI domain. We present two different approaches to engineer highly specific nickases; both rely on the sequence-specific nicking activity of the DNA mismatch repair endonuclease MutH which we fused to a DNA-binding module, either a catalytically inactive variant of the homing endonuclease I-SceI or the DNA-binding domain of the TALE protein AvrBs4. The fusion proteins nick strand specifically a bipartite recognition sequence consisting of the MutH and the I-SceI or TALE recognition sequences, respectively, with a more than 1000-fold preference over a stand-alone MutH site. TALE-MutH is a programmable nickase.
Chimeric TALE recombinases with programmable DNA sequence specificity.

PubMed

Mercer, Andrew C; Gaj, Thomas; Fuller, Roberta P; Barbas, Carlos F

2012-11-01

Site-specific recombinases are powerful tools for genome engineering. Hyperactivated variants of the resolvase/invertase family of serine recombinases function without accessory factors, and thus can be re-targeted to sequences of interest by replacing native DNA-binding domains (DBDs) with engineered zinc-finger proteins (ZFPs). However, imperfect modularity with particular domains, lack of high-affinity binding to all DNA triplets, and difficulty in construction has hindered the widespread adoption of ZFPs in unspecialized laboratories. The discovery of a novel type of DBD in transcription activator-like effector (TALE) proteins from Xanthomonas provides an alternative to ZFPs. Here we describe chimeric TALE recombinases (TALERs): engineered fusions between a hyperactivated catalytic domain from the DNA invertase Gin and an optimized TALE architecture. We use a library of incrementally truncated TALE variants to identify TALER fusions that modify DNA with efficiency and specificity comparable to zinc-finger recombinases in bacterial cells. We also show that TALERs recombine DNA in mammalian cells. The TALER architecture described herein provides a platform for insertion of customized TALE domains, thus significantly expanding the targeting capacity of engineered recombinases and their potential applications in biotechnology and medicine.
Quantitative determination of testosterone levels with biolayer interferometry.

PubMed

Zhang, Hao; Li, Wei; Luo, Hong; Xiong, Guangming; Yu, Yuanhua

2017-10-01

Natural and synthetic steroid hormones are widely spread in the environment and are considered as pollutants due to their endocrine activities, even at low concentrations, which are harmful to human health. To detect steroid hormones in the environment, a novel biosensor system was developed based on the principle of biolayer interferometry. Detection is based on changes in the interference pattern of white light reflected from the surface of an optical fiber with bound biomolecules. Monitoring interactions between molecules does not require radioactive, enzymatic, or fluorescent labels. Here, 2 double-stranded DNA fragments of operator 1 (OP1) and OP2 containing 10-bp palindromic sequences in chromosomal Comamonas testosteroni DNA (ATCC11996) were surface-immobilized to streptavidin sensors. Interference changes were detected when repressor protein RepA bound the DNA sequences. DNA-protein interactions were characterized and kinetic parameters were obtained. The dissociation constants between the OP1 and OP2 DNA sequences and RepA were 9.865 × 10 -9 M and 2.750 × 10 -8 M, respectively. The reactions showed high specifically and affinity. Because binding of the 10-bp palindromic sequence and RepA was affected by RepA-testosterone binding, the steroid could be quantitatively determined rapidly using the biosensor system. The mechanism of the binding assay was as follows. RepA could bind both OP1 and testosterone. RepA binding to testosterone changed the protein conformation, which influenced the binding between RepA and OP1. The percentage of the signal detected negative correlation with the testosterone concentration. A standard curve was obtained, and the correlation coefficient value was approximately 0.97. We could quantitatively determine testosterone levels between 2.13 and 136.63 ng/ml. Each sample could be quantitatively detected in 17 min. These results suggested that the specific interaction between double-stranded OP1 DNA and the RepA protein could be used to rapidly and quantitatively determine environmental testosterone levels by the biolayer interferometry technique. Copyright © 2017 Elsevier B.V. All rights reserved.
The FOXP2 forkhead domain binds to a variety of DNA sequences with different rates and affinities.

PubMed

Webb, Helen; Steeb, Olga; Blane, Ashleigh; Rotherham, Lia; Aron, Shaun; Machanick, Philip; Dirr, Heini; Fanucchi, Sylvia

2017-07-01

FOXP2 is a member of the P subfamily of FOX transcription factors, the DNA-binding domain of which is the winged helix forkhead domain (FHD). In this work we show that the FOXP2 FHD is able to bind to various DNA sequences, including a novel sequence identified in this work, with different affinities and rates as detected using surface plasmon resonance. Combining the experimental work with molecular docking, we show that high-affinity sequences remain bound to the protein for longer, form a greater number of interactions with the protein and induce a greater structural change in the protein than low-affinity sequences. We propose a binding model for the FOXP2 FHD that involves three types of binding sequence: low affinity sites which allow for rapid scanning of the genome by the protein in a partially unstructured state; moderate affinity sites which serve to locate the protein near target sites and high-affinity sites which secure the protein to the DNA and induce a conformational change necessary for functional binding and the possible initiation of downstream transcriptional events. © The Authors 2017. Published by Oxford University Press on behalf of the Japanese Biochemical Society. All rights reserved.
Replication of damaged DNA in vitro is blocked by p53

PubMed Central

Zhou, Jianmin; Prives, Carol

2003-01-01

The tumor suppressor protein p53 may have other roles and functions in addition to its well-documented ability to serve as a sequence-specific transcriptional activator in response to DNA damage. We showed previously that p53 can block the replication of polyomavirus origin-containing DNA (Py ori-DNA) in vitro when p53 binding sites are present on the late side of the Py ori. Here we have both further extended these observations and have also examined whether p53 might be able to bind directly to and inhibit the replication of damaged DNA. We found that p53 strongly inhibits replication of γ-irradiated Py ori-DNA and such inhibition requires both the central DNA binding domain and the extreme C-terminus of the p53 protein. An endogenous p53 binding site lies within the Py origin and is required for the ability of p53 to block initiation of replication from γ-irradiated Py ori-DNA, suggesting the possibility of DNA looping caused by p53 binding both non-specifically to sites of DNA damage and specifically to the endogenous site in the polyomavirus origin. Our results thus suggest the possibility that under some circumstances p53 might serve as a direct regulator of DNA replication and suggest as well an additional function for cooperation between its two autonomous DNA binding domains. PMID:12853603
Solution structure of the DNA-binding domain of RPA from Saccharomyces cerevisiae and its interaction with single-stranded DNA and SV40 T antigen

PubMed Central

Park, Chin-Ju; Lee, Joon-Hwa; Choi, Byong-Seok

2005-01-01

Replication protein A (RPA) is a three-subunit complex with multiple roles in DNA metabolism. DNA-binding domain A in the large subunit of human RPA (hRPA70A) binds to single-stranded DNA (ssDNA) and is responsible for the species-specific RPA–T antigen (T-ag) interaction required for Simian virus 40 replication. Although Saccharomyces cerevisiae RPA70A (scRPA70A) shares high sequence homology with hRPA70A, the two are not functionally equivalent. To elucidate the similarities and differences between these two homologous proteins, we determined the solution structure of scRPA70A, which closely resembled the structure of hRPA70A. The structure of ssDNA-bound scRPA70A, as simulated by residual dipolar coupling-based homology modeling, suggested that the positioning of the ssDNA is the same for scRPA70A and hRPA70A, although the conformational changes that occur in the two proteins upon ssDNA binding are not identical. NMR titrations of hRPA70A with T-ag showed that the T-ag binding surface is separate from the ssDNA-binding region and is more neutral than the corresponding part of scRPA70A. These differences might account for the species-specific nature of the hRPA70A–T-ag interaction. Our results provide insight into how these two homologous RPA proteins can exhibit functional differences, but still both retain their ability to bind ssDNA. PMID:16043636
The nucleoid protein Dps binds genomic DNA of Escherichia coli in a non-random manner

PubMed Central

Kondrashov, F. A.; Toshchakov, S. V.; Dominova, I.; Shvyreva, U. S.; Vrublevskaya, V. V.; Morenkov, O. S.; Panyukov, V. V.

2017-01-01

Dps is a multifunctional homododecameric protein that oxidizes Fe2+ ions accumulating them in the form of Fe2O3 within its protein cavity, interacts with DNA tightly condensing bacterial nucleoid upon starvation and performs some other functions. During the last two decades from discovery of this protein, its ferroxidase activity became rather well studied, but the mechanism of Dps interaction with DNA still remains enigmatic. The crucial role of lysine residues in the unstructured N-terminal tails led to the conventional point of view that Dps binds DNA without sequence or structural specificity. However, deletion of dps changed the profile of proteins in starved cells, SELEX screen revealed genomic regions preferentially bound in vitro and certain affinity of Dps for artificial branched molecules was detected by atomic force microscopy. Here we report a non-random distribution of Dps binding sites across the bacterial chromosome in exponentially growing cells and show their enrichment with inverted repeats prone to form secondary structures. We found that the Dps-bound regions overlap with sites occupied by other nucleoid proteins, and contain overrepresented motifs typical for their consensus sequences. Of the two types of genomic domains with extensive protein occupancy, which can be highly expressed or transcriptionally silent only those that are enriched with RNA polymerase molecules were preferentially occupied by Dps. In the dps-null mutant we, therefore, observed a differentially altered expression of several targeted genes and found suppressed transcription from the dps promoter. In most cases this can be explained by the relieved interference with Dps for nucleoid proteins exploiting sequence-specific modes of DNA binding. Thus, protecting bacterial cells from different stresses during exponential growth, Dps can modulate transcriptional integrity of the bacterial chromosome hampering RNA biosynthesis from some genes via competition with RNA polymerase or, vice versa, competing with inhibitors to activate transcription. PMID:28800583
Human mRNA polyadenylate binding protein: evolutionary conservation of a nucleic acid binding motif.

PubMed Central

Grange, T; de Sa, C M; Oddos, J; Pictet, R

1987-01-01

We have isolated a full length cDNA (cDNA) coding for the human poly(A) binding protein. The cDNA derived 73 kd basic translation product has the same Mr, isoelectric point and peptidic map as the poly(A) binding protein. DNA sequence analysis reveals a 70,244 dalton protein. The N terminal part, highly homologous to the yeast poly(A) binding protein, is sufficient for poly(A) binding activity. This domain consists of a four-fold repeated unit of approximately 80 amino acids present in other nucleic acid binding proteins. In the C terminal part there is, as in the yeast protein, a sequence of approximately 150 amino acids, rich in proline, alanine and glutamine which together account for 48% of the residues. A 2,9 kb mRNA corresponding to this cDNA has been detected in several vertebrate cell types and in Drosophila melanogaster at every developmental stage including oogenesis. Images PMID:2885805
Structural analysis of DNA binding by C.Csp231I, a member of a novel class of R-M controller proteins regulating gene expression

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shevtsov, M. B.; Streeter, S. D.; Thresh, S.-J.

2015-02-01

The structure of the new class of controller proteins (exemplified by C.Csp231I) in complex with its 21 bp DNA-recognition sequence is presented, and the molecular basis of sequence recognition in this class of proteins is discussed. An unusual extended spacer between the dimer binding sites suggests a novel interaction between the two C-protein dimers. In a wide variety of bacterial restriction–modification systems, a regulatory ‘controller’ protein (or C-protein) is required for effective transcription of its own gene and for transcription of the endonuclease gene found on the same operon. We have recently turned our attention to a new class ofmore » controller proteins (exemplified by C.Csp231I) that have quite novel features, including a much larger DNA-binding site with an 18 bp (∼60 Å) spacer between the two palindromic DNA-binding sequences and a very different recognition sequence from the canonical GACT/AGTC. Using X-ray crystallography, the structure of the protein in complex with its 21 bp DNA-recognition sequence was solved to 1.8 Å resolution, and the molecular basis of sequence recognition in this class of proteins was elucidated. An unusual aspect of the promoter sequence is the extended spacer between the dimer binding sites, suggesting a novel interaction between the two C-protein dimers when bound to both recognition sites correctly spaced on the DNA. A U-bend model is proposed for this tetrameric complex, based on the results of gel-mobility assays, hydrodynamic analysis and the observation of key contacts at the interface between dimers in the crystal.« less
Custom-Designed Molecular Scissors for Site-Specific Manipulation of the Plant and Mammalian Genomes

NASA Astrophysics Data System (ADS)

Kandavelou, Karthikeyan; Chandrasegaran, Srinivasan

Zinc finger nucleases (ZFNs) are custom-designed molecular scissors, engineered to cut at specific DNA sequences. ZFNs combine the zinc finger proteins (ZFPs) with the nonspecific cleavage domain of the FokI restriction enzyme. The DNA-binding specificity of ZFNs can be easily altered experimentally. This easy manipulation of the ZFN recognition specificity enables one to deliver a targeted double-strand break (DSB) to a genome. The targeted DSB stimulates local gene targeting by several orders of magnitude at that specific cut site via homologous recombination (HR). Thus, ZFNs have become an important experimental tool to make site-specific and permanent alterations to genomes of not only plants and mammals but also of many other organisms. Engineering of custom ZFNs involves many steps. The first step is to identify a ZFN site at or near the chosen chromosomal target within the genome to which ZFNs will bind and cut. The second step is to design and/or select various ZFP combinations that will bind to the chosen target site with high specificity and affinity. The DNA coding sequence for the designed ZFPs are then assembled by polymerase chain reaction (PCR) using oligonucleotides. The third step is to fuse the ZFP constructs to the FokI cleavage domain. The ZFNs are then expressed as proteins by using the rabbit reticulocyte in vitro transcription/translation system and the protein products assayed for their DNA cleavage specificity.
Generation of Aptamers from A Primer-Free Randomized ssDNA Library Using Magnetic-Assisted Rapid Aptamer Selection

NASA Astrophysics Data System (ADS)

Tsao, Shih-Ming; Lai, Ji-Ching; Horng, Horng-Er; Liu, Tu-Chen; Hong, Chin-Yih

2017-04-01

Aptamers are oligonucleotides that can bind to specific target molecules. Most aptamers are generated using random libraries in the standard systematic evolution of ligands by exponential enrichment (SELEX). Each random library contains oligonucleotides with a randomized central region and two fixed primer regions at both ends. The fixed primer regions are necessary for amplifying target-bound sequences by PCR. However, these extra-sequences may cause non-specific bindings, which potentially interfere with good binding for random sequences. The Magnetic-Assisted Rapid Aptamer Selection (MARAS) is a newly developed protocol for generating single-strand DNA aptamers. No repeat selection cycle is required in the protocol. This study proposes and demonstrates a method to isolate aptamers for C-reactive proteins (CRP) from a randomized ssDNA library containing no fixed sequences at 5‧ and 3‧ termini using the MARAS platform. Furthermore, the isolated primer-free aptamer was sequenced and binding affinity for CRP was analyzed. The specificity of the obtained aptamer was validated using blind serum samples. The result was consistent with monoclonal antibody-based nephelometry analysis, which indicated that a primer-free aptamer has high specificity toward targets. MARAS is a feasible platform for efficiently generating primer-free aptamers for clinical diagnoses.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Caberoy, Nora B.; Zhou, Yixiong; Alvarado, Gabriela

To efficiently elucidate the biological roles of phosphatidylserine (PS), we developed open-reading-frame (ORF) phage display to identify PS-binding proteins. The procedure of phage panning was optimized with a phage clone expressing MFG-E8, a well-known PS-binding protein. Three rounds of phage panning with ORF phage display cDNA library resulted in {approx}300-fold enrichment in PS-binding activity. A total of 17 PS-binding phage clones were identified. Unlike phage display with conventional cDNA libraries, all 17 PS-binding clones were ORFs encoding 13 real proteins. Sequence analysis revealed that all identified PS-specific phage clones had dimeric basic amino acid residues. GST fusion proteins were expressedmore » for 3 PS-binding proteins and verified for their binding activity to PS liposomes, but not phosphatidylcholine liposomes. These results elucidated previously unknown PS-binding proteins and demonstrated that ORF phage display is a versatile technology capable of efficiently identifying binding proteins for non-protein molecules like PS.« less
Proteome-wide Identification of Novel Ceramide-binding Proteins by Yeast Surface cDNA Display and Deep Sequencing.

PubMed

Bidlingmaier, Scott; Ha, Kevin; Lee, Nam-Kyung; Su, Yang; Liu, Bin

2016-04-01

Although the bioactive sphingolipid ceramide is an important cell signaling molecule, relatively few direct ceramide-interacting proteins are known. We used an approach combining yeast surface cDNA display and deep sequencing technology to identify novel proteins binding directly to ceramide. We identified 234 candidate ceramide-binding protein fragments and validated binding for 20. Most (17) bound selectively to ceramide, although a few (3) bound to other lipids as well. Several novel ceramide-binding domains were discovered, including the EF-hand calcium-binding motif, the heat shock chaperonin-binding motif STI1, the SCP2 sterol-binding domain, and the tetratricopeptide repeat region motif. Interestingly, four of the verified ceramide-binding proteins (HPCA, HPCAL1, NCS1, and VSNL1) and an additional three candidate ceramide-binding proteins (NCALD, HPCAL4, and KCNIP3) belong to the neuronal calcium sensor family of EF hand-containing proteins. We used mutagenesis to map the ceramide-binding site in HPCA and to create a mutant HPCA that does not bind to ceramide. We demonstrated selective binding to ceramide by mammalian cell-produced wild type but not mutant HPCA. Intriguingly, we also identified a fragment from prostaglandin D2synthase that binds preferentially to ceramide 1-phosphate. The wide variety of proteins and domains capable of binding to ceramide suggests that many of the signaling functions of ceramide may be regulated by direct binding to these proteins. Based on the deep sequencing data, we estimate that our yeast surface cDNA display library covers ∼60% of the human proteome and our selection/deep sequencing protocol can identify target-interacting protein fragments that are present at extremely low frequency in the starting library. Thus, the yeast surface cDNA display/deep sequencing approach is a rapid, comprehensive, and flexible method for the analysis of protein-ligand interactions, particularly for the study of non-protein ligands. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
N-acyl homoserine lactone binding to the CarR receptor determines quorum-sensing specificity in Erwinia.

PubMed

Welch, M; Todd, D E; Whitehead, N A; McGowan, S J; Bycroft, B W; Salmond, G P

2000-02-15

Quorum sensing via an N-acyl homoserine lactone (HSL) pheromone controls the biosynthesis of a carbapenem antibiotic in Erwinia carotovora. Transcription of the carbapenem biosynthetic genes is dependent on the LuxR-type activator protein, CarR. Equilibrium binding of a range of HSL molecules, which are thought to activate CarR to bind to its DNA target sequence, was examined using fluorescence quenching, DNA bandshift analysis, limited proteolysis and reporter gene assays. CarR bound the most physiologically relevant ligand, N-(3-oxohexanoyl)-L-homoserine lactone, with a stoichiometry of two molecules of ligand per dimer of protein and a dissociation constant of 1.8 microM, in good agreement with the concentration of HSL required to activate carbapenem production in vivo. In the presence of HSL, CarR formed a very high molecular weight complex with its target DNA, indicating that the ligand causes the protein to multimerize. Chemical cross-linking analysis supported this interpretation. Our data show that the ability of a given HSL to facilitate CarR binding to its target DNA sequence is directly proportional to the affinity of the HSL for the protein.
Characterization of an AGAMOUS-like MADS Box Protein, a Probable Constituent of Flowering and Fruit Ripening Regulatory System in Banana

PubMed Central

Roy Choudhury, Swarup; Roy, Sujit; Nag, Anish; Singh, Sanjay Kumar; Sengupta, Dibyendu N.

2012-01-01

The MADS-box family of genes has been shown to play a significant role in the development of reproductive organs, including dry and fleshy fruits. In this study, the molecular properties of an AGAMOUS like MADS box transcription factor in banana cultivar Giant governor (Musa sp, AAA group, subgroup Cavendish) has been elucidated. We have detected a CArG-box sequence binding AGAMOUS MADS-box protein in banana flower and fruit nuclear extracts in DNA-protein interaction assays. The protein fraction in the DNA-protein complex was analyzed by mass spectrometry and using this information we have obtained the full length cDNA of the corresponding protein. The deduced protein sequence showed ∼95% amino acid sequence homology with MA-MADS5, a MADS-box protein described previously from banana. We have characterized the domains of the identified AGAMOUS MADS-box protein involved in DNA binding and homodimer formation in vitro using full-length and truncated versions of affinity purified recombinant proteins. Furthermore, in order to gain insight about how DNA bending is achieved by this MADS-box factor, we performed circular permutation and phasing analysis using the wild type recombinant protein. The AGAMOUS MADS-box protein identified in this study has been found to predominantly accumulate in the climacteric fruit pulp and also in female flower ovary. In vivo and in vitro assays have revealed specific binding of the identified AGAMOUS MADS-box protein to CArG-box sequence in the promoters of major ripening genes in banana fruit. Overall, the expression patterns of this MADS-box protein in banana female flower ovary and during various phases of fruit ripening along with the interaction of the protein to the CArG-box sequence in the promoters of major ripening genes lead to interesting assumption about the possible involvement of this AGAMOUS MADS-box factor in banana fruit ripening and floral reproductive organ development. PMID:22984496
Characterization of an AGAMOUS-like MADS box protein, a probable constituent of flowering and fruit ripening regulatory system in banana.

PubMed

Roy Choudhury, Swarup; Roy, Sujit; Nag, Anish; Singh, Sanjay Kumar; Sengupta, Dibyendu N

2012-01-01

The MADS-box family of genes has been shown to play a significant role in the development of reproductive organs, including dry and fleshy fruits. In this study, the molecular properties of an AGAMOUS like MADS box transcription factor in banana cultivar Giant governor (Musa sp, AAA group, subgroup Cavendish) has been elucidated. We have detected a CArG-box sequence binding AGAMOUS MADS-box protein in banana flower and fruit nuclear extracts in DNA-protein interaction assays. The protein fraction in the DNA-protein complex was analyzed by mass spectrometry and using this information we have obtained the full length cDNA of the corresponding protein. The deduced protein sequence showed ~95% amino acid sequence homology with MA-MADS5, a MADS-box protein described previously from banana. We have characterized the domains of the identified AGAMOUS MADS-box protein involved in DNA binding and homodimer formation in vitro using full-length and truncated versions of affinity purified recombinant proteins. Furthermore, in order to gain insight about how DNA bending is achieved by this MADS-box factor, we performed circular permutation and phasing analysis using the wild type recombinant protein. The AGAMOUS MADS-box protein identified in this study has been found to predominantly accumulate in the climacteric fruit pulp and also in female flower ovary. In vivo and in vitro assays have revealed specific binding of the identified AGAMOUS MADS-box protein to CArG-box sequence in the promoters of major ripening genes in banana fruit. Overall, the expression patterns of this MADS-box protein in banana female flower ovary and during various phases of fruit ripening along with the interaction of the protein to the CArG-box sequence in the promoters of major ripening genes lead to interesting assumption about the possible involvement of this AGAMOUS MADS-box factor in banana fruit ripening and floral reproductive organ development.
The Activation Domain of the Bovine Papillomavirus E2 Protein Mediates Association of DNA-Bound Dimers to form DNA Loops

NASA Astrophysics Data System (ADS)

Knight, Jonathan D.; Li, Rong; Botchan, Michael

1991-04-01

The E2 transactivator protein of bovine papillomavirus binds its specific DNA target sequence as a dimer. We have found that E2 dimers, performed in solution independent of DNA, exhibit substantial cooperativity of DNA binding as detected by both nitrocellulose filter retention and footprint analysis techniques. If the binding sites are widely spaced, E2 forms stable DNA loops visible by electron microscopy. When three widely separated binding sites reside on te DNA, E2 condenses the molecule into a bow-tie structure. This implies that each E2 dimer has at least two independent surfaces for multimerization. Two naturally occurring shorter forms of the protein, E2C and D8/E2, which function in vivo as repressors of transcription, do not form such loops. Thus, the looping function of E2 maps to the 161-amino acid activation domain. These results support the looping model of transcription activation by enhancers.
Mutations on the DNA Binding Surface of TBP Discriminate between Yeast TATA and TATA-Less Gene Transcription

PubMed Central

Kamenova, Ivanka; Warfield, Linda

2014-01-01

Most RNA polymerase (Pol) II promoters lack a TATA element, yet nearly all Pol II transcription requires TATA binding protein (TBP). While the TBP-TATA interaction is critical for transcription at TATA-containing promoters, it has been unclear whether TBP sequence-specific DNA contacts are required for transcription at TATA-less genes. Transcription factor IID (TFIID), the TBP-containing coactivator that functions at most TATA-less genes, recognizes short sequence-specific promoter elements in metazoans, but analogous promoter elements have not been identified in Saccharomyces cerevisiae. We generated a set of mutations in the yeast TBP DNA binding surface and found that most support growth of yeast. Both in vivo and in vitro, many of these mutations are specifically defective for transcription of two TATA-containing genes with only minor defects in transcription of two TATA-less, TFIID-dependent genes. TBP binds several TATA-less promoters with apparent high affinity, but our results suggest that this binding is not important for transcription activity. Our results are consistent with the model that sequence-specific TBP-DNA contacts are not important at yeast TATA-less genes and suggest that other general transcription factors or coactivator subunits are responsible for recognition of TATA-less promoters. Our results also explain why yeast TBP derivatives defective for TATA binding appear defective in activated transcription. PMID:24865972

Mutations on the DNA binding surface of TBP discriminate between yeast TATA and TATA-less gene transcription.

PubMed

Kamenova, Ivanka; Warfield, Linda; Hahn, Steven

2014-08-01

Most RNA polymerase (Pol) II promoters lack a TATA element, yet nearly all Pol II transcription requires TATA binding protein (TBP). While the TBP-TATA interaction is critical for transcription at TATA-containing promoters, it has been unclear whether TBP sequence-specific DNA contacts are required for transcription at TATA-less genes. Transcription factor IID (TFIID), the TBP-containing coactivator that functions at most TATA-less genes, recognizes short sequence-specific promoter elements in metazoans, but analogous promoter elements have not been identified in Saccharomyces cerevisiae. We generated a set of mutations in the yeast TBP DNA binding surface and found that most support growth of yeast. Both in vivo and in vitro, many of these mutations are specifically defective for transcription of two TATA-containing genes with only minor defects in transcription of two TATA-less, TFIID-dependent genes. TBP binds several TATA-less promoters with apparent high affinity, but our results suggest that this binding is not important for transcription activity. Our results are consistent with the model that sequence-specific TBP-DNA contacts are not important at yeast TATA-less genes and suggest that other general transcription factors or coactivator subunits are responsible for recognition of TATA-less promoters. Our results also explain why yeast TBP derivatives defective for TATA binding appear defective in activated transcription. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Targeted DNA demethylation in human cells by fusion of a plant 5-methylcytosine DNA glycosylase to a sequence-specific DNA binding domain

PubMed Central

Parrilla-Doblas, Jara Teresa; Ariza, Rafael R.; Roldán-Arjona, Teresa

2017-01-01

ABSTRACT DNA methylation is a crucial epigenetic mark associated to gene silencing, and its targeted removal is a major goal of epigenetic editing. In animal cells, DNA demethylation involves iterative 5mC oxidation by TET enzymes followed by replication-dependent dilution and/or replication-independent DNA repair of its oxidized derivatives. In contrast, plants use specific DNA glycosylases that directly excise 5mC and initiate its substitution for unmethylated C in a base excision repair process. In this work, we have fused the catalytic domain of Arabidopsis ROS1 5mC DNA glycosylase (ROS1_CD) to the DNA binding domain of yeast GAL4 (GBD). We show that the resultant GBD-ROS1_CD fusion protein binds specifically a GBD-targeted DNA sequence in vitro. We also found that transient in vivo expression of GBD-ROS1_CD in human cells specifically reactivates transcription of a methylation-silenced reporter gene, and that such reactivation requires both ROS1_CD catalytic activity and GBD binding capacity. Finally, we show that reactivation induced by GBD-ROS1_CD is accompanied by decreased methylation levels at several CpG sites of the targeted promoter. All together, these results show that plant 5mC DNA glycosylases can be used for targeted active DNA demethylation in human cells. PMID:28277978
Quantitative characterization of conformational-specific protein-DNA binding using a dual-spectral interferometric imaging biosensor.

PubMed

Zhang, Xirui; Daaboul, George G; Spuhler, Philipp S; Dröge, Peter; Ünlü, M Selim

2016-03-14

DNA-binding proteins play crucial roles in the maintenance and functions of the genome and yet, their specific binding mechanisms are not fully understood. Recently, it was discovered that DNA-binding proteins recognize specific binding sites to carry out their functions through an indirect readout mechanism by recognizing and capturing DNA conformational flexibility and deformation. High-throughput DNA microarray-based methods that provide large-scale protein-DNA binding information have shown effective and comprehensive analysis of protein-DNA binding affinities, but do not provide information of DNA conformational changes in specific protein-DNA complexes. Building on the high-throughput capability of DNA microarrays, we demonstrate a quantitative approach that simultaneously measures the amount of protein binding to DNA and nanometer-scale DNA conformational change induced by protein binding in a microarray format. Both measurements rely on spectral interferometry on a layered substrate using a single optical instrument in two distinct modalities. In the first modality, we quantitate the amount of binding of protein to surface-immobilized DNA in each DNA spot using a label-free spectral reflectivity technique that accurately measures the surface densities of protein and DNA accumulated on the substrate. In the second modality, for each DNA spot, we simultaneously measure DNA conformational change using a fluorescence vertical sectioning technique that determines average axial height of fluorophores tagged to specific nucleotides of the surface-immobilized DNA. The approach presented in this paper, when combined with current high-throughput DNA microarray-based technologies, has the potential to serve as a rapid and simple method for quantitative and large-scale characterization of conformational specific protein-DNA interactions.
Human HMG box transcription factor HBP1: a role in hCD2 LCR function.

PubMed Central

Zhuma, T; Tyrrell, R; Sekkali, B; Skavdis, G; Saveliev, A; Tolaini, M; Roderick, K; Norton, T; Smerdon, S; Sedgwick, S; Festenstein, R; Kioussis, D

1999-01-01

The locus control region (LCR) of the human CD2 gene (hCD2) confers T cell-specific, copy-dependent and position-independent gene expression in transgenic mice. This LCR consists of a strong T cell-specific enhancer and an element without enhancer activity (designated HSS3), which is required for prevention of position effect variegation (PEV) in transgenic mice. Here, we identified the HMG box containing protein-1 (HBP1) as a factor binding to HSS3 of the hCD2 LCR. Within the LCR, HBP1 binds to a novel TTCATTCATTCA sequence that is higher in affinity than other recently reported HBP1-binding sites. Mice transgenic for a hCD2 LCR construct carrying a deletion of the HBP1-binding sequences show a propensity for PEV if the transgene integrates in a heterochromatic region of the chromosome such as the centromere or telomere. We propose that HBP1 plays an important role in chromatin opening and remodelling activities by binding to and bending the DNA, thus allowing DNA-protein and/or protein-protein interactions, which increase the probability of establishing an active locus. PMID:10562551
Identification of the DNA-Binding Domains of Human Replication Protein A That Recognize G-Quadruplex DNA

PubMed Central

Prakash, Aishwarya; Natarajan, Amarnath; Marky, Luis A.; Ouellette, Michel M.; Borgstahl, Gloria E. O.

2011-01-01

Replication protein A (RPA), a key player in DNA metabolism, has 6 single-stranded DNA-(ssDNA-) binding domains (DBDs) A-F. SELEX experiments with the DBDs-C, -D, and -E retrieve a 20-nt G-quadruplex forming sequence. Binding studies show that RPA-DE binds preferentially to the G-quadruplex DNA, a unique preference not observed with other RPA constructs. Circular dichroism experiments show that RPA-CDE-core can unfold the G-quadruplex while RPA-DE stabilizes it. Binding studies show that RPA-C binds pyrimidine- and purine-rich sequences similarly. This difference between RPA-C and RPA-DE binding was also indicated by the inability of RPA-CDE-core to unfold an oligonucleotide containing a TC-region 5′ to the G-quadruplex. Molecular modeling studies of RPA-DE and telomere-binding proteins Pot1 and Stn1 reveal structural similarities between the proteins and illuminate potential DNA-binding sites for RPA-DE and Stn1. These data indicate that DBDs of RPA have different ssDNA recognition properties. PMID:21772997
RecA binding to a single double-stranded DNA molecule: A possible role of DNA conformational fluctuations

PubMed Central

Leger, J. F.; Robert, J.; Bourdieu, L.; Chatenay, D.; Marko, J. F.

1998-01-01

Most genetic regulatory mechanisms involve protein–DNA interactions. In these processes, the classical Watson–Crick DNA structure sometimes is distorted severely, which in turn enables the precise recognition of the specific sites by the protein. Despite its key importance, very little is known about such deformation processes. To address this general question, we have studied a model system, namely, RecA binding to double-stranded DNA. Results from micromanipulation experiments indicate that RecA binds strongly to stretched DNA; based on this observation, we propose that spontaneous thermal stretching fluctuations may play a role in the binding of RecA to DNA. This has fundamental implications for the protein–DNA binding mechanism, which must therefore rely in part on a combination of flexibility and thermal fluctuations of the DNA structure. We also show that this mechanism is sequence sensitive. Theoretical simulations support this interpretation of our experimental results, and it is argued that this is of broad relevance to DNA–protein interactions. PMID:9770480
High-resolution mapping of transcription factor binding sites on native chromatin

PubMed Central

Kasinathan, Sivakanthan; Orsi, Guillermo A.; Zentner, Gabriel E.; Ahmad, Kami; Henikoff, Steven

2014-01-01

Sequence-specific DNA-binding proteins including transcription factors (TFs) are key determinants of gene regulation and chromatin architecture. Formaldehyde cross-linking and sonication followed by Chromatin ImmunoPrecipitation (X-ChIP) is widely used for profiling of TF binding, but is limited by low resolution and poor specificity and sensitivity. We present a simple protocol that starts with micrococcal nuclease-digested uncross-linked chromatin and is followed by affinity purification of TFs and paired-end sequencing. The resulting ORGANIC (Occupied Regions of Genomes from Affinity-purified Naturally Isolated Chromatin) profiles of Saccharomyces cerevisiae Abf1 and Reb1 provide highly accurate base-pair resolution maps that are not biased toward accessible chromatin, and do not require input normalization. We also demonstrate the high specificity of our method when applied to larger genomes by profiling Drosophila melanogaster GAGA Factor and Pipsqueak. Our results suggest that ORGANIC profiling is a widely applicable high-resolution method for sensitive and specific profiling of direct protein-DNA interactions. PMID:24336359
A Heterogeneous Nuclear Ribonucleoprotein A/B-Related Protein Binds to Single-Stranded DNA near the 5′ End or within the Genome of Feline Parvovirus and Can Modify Virus Replication

PubMed Central

Wang, Dai; Parrish, Colin R.

1999-01-01

Phage display of cDNA clones prepared from feline cells was used to identify host cell proteins that bound to DNA-containing feline panleukopenia virus (FPV) capsids but not to empty capsids. One gene found in several clones encoded a heterogeneous nuclear ribonucleoprotein (hnRNP)-related protein (DBP40) that was very similar in sequence to the A/B-type hnRNP proteins. DBP40 bound specifically to oligonucleotides representing a sequence near the 5′ end of the genome which is exposed on the outside of the full capsid but did not bind most other terminal sequences. Adding purified DBP40 to an in vitro fill-in reaction using viral DNA as a template inhibited the production of the second strand after nucleotide (nt) 289 but prior to nt 469. DBP40 bound to various regions of the viral genome, including a region between nt 295 and 330 of the viral genome which has been associated with transcriptional attenuation of the parvovirus minute virus of mice, which is mediated by a stem-loop structure of the DNA and cellular proteins. Overexpression of the protein in feline cells from a plasmid vector made them largely resistant to FPV infection. Mutagenesis of the protein binding site within the 5′ end viral genome did not affect replication of the virus. PMID:10438866
Specialized nucleoprotein structures at the origin of replication of bacteriophage lambda: localized unwinding of duplex DNA by a six-protein reaction.

PubMed Central

Dodson, M; Echols, H; Wickner, S; Alfano, C; Mensa-Wilmot, K; Gomes, B; LeBowitz, J; Roberts, J D; McMacken, R

1986-01-01

The O protein of bacteriophage lambda localizes the initiation of DNA replication to a unique site on the lambda genome, ori lambda. By means of electron microscopy, we infer that the binding of O to ori lambda initiates a series of protein addition and transfer reactions that culminate in localized unwinding of the origin DNA, generating a prepriming structure for the initiation of DNA replication. We can define three stages of this prepriming reaction, the first two of which we have characterized previously. First, dimeric O protein binds to multiple DNA binding sites and self-associates to form a nucleoprotein structure, the O-some. Second, lambda P and host DnaB proteins interact with the O-some to generate a larger complex that includes additional DNA from an A + T-rich region adjacent to the O binding sites. Third, the addition of the DnaJ, DnaK, and Ssb proteins and ATP results in an origin-specific unwinding reaction, probably catalyzed by the helicase activity of DnaB. The unwinding reaction is unidirectional, proceeding "rightward" from the origin. The minimal DNA sequence competent for unwinding consists of two O binding sites and the adjacent A + T-rich region to the right of the binding sites. We conclude that the lambda O protein localizes and initiates a six-protein sequential reaction responsible for but preceding the precise initiation of DNA replication. Specialized nucleoprotein structures similar to the O-some may be a general feature of DNA transactions requiring extraordinary precision in localization and control. Images PMID:3020552
iDNA-Prot: Identification of DNA Binding Proteins Using Random Forest with Grey Model

PubMed Central

Lin, Wei-Zhong; Fang, Jian-An; Xiao, Xuan; Chou, Kuo-Chen

2011-01-01

DNA-binding proteins play crucial roles in various cellular processes. Developing high throughput tools for rapidly and effectively identifying DNA-binding proteins is one of the major challenges in the field of genome annotation. Although many efforts have been made in this regard, further effort is needed to enhance the prediction power. By incorporating the features into the general form of pseudo amino acid composition that were extracted from protein sequences via the “grey model” and by adopting the random forest operation engine, we proposed a new predictor, called iDNA-Prot, for identifying uncharacterized proteins as DNA-binding proteins or non-DNA binding proteins based on their amino acid sequences information alone. The overall success rate by iDNA-Prot was 83.96% that was obtained via jackknife tests on a newly constructed stringent benchmark dataset in which none of the proteins included has pairwise sequence identity to any other in a same subset. In addition to achieving high success rate, the computational time for iDNA-Prot is remarkably shorter in comparison with the relevant existing predictors. Hence it is anticipated that iDNA-Prot may become a useful high throughput tool for large-scale analysis of DNA-binding proteins. As a user-friendly web-server, iDNA-Prot is freely accessible to the public at the web-site on http://icpr.jci.edu.cn/bioinfo/iDNA-Prot or http://www.jci-bioinfo.cn/iDNA-Prot. Moreover, for the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the web-server to get the desired results. PMID:21935457
Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.

PubMed

Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L

1987-08-01

To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded.
Sequence-specific DNA binding by MYC/MAX to low-affinity non-E-box motifs.

PubMed

Allevato, Michael; Bolotin, Eugene; Grossman, Mark; Mane-Padros, Daniel; Sladek, Frances M; Martinez, Ernest

2017-01-01

The MYC oncoprotein regulates transcription of a large fraction of the genome as an obligatory heterodimer with the transcription factor MAX. The MYC:MAX heterodimer and MAX:MAX homodimer (hereafter MYC/MAX) bind Enhancer box (E-box) DNA elements (CANNTG) and have the greatest affinity for the canonical MYC E-box (CME) CACGTG. However, MYC:MAX also recognizes E-box variants and was reported to bind DNA in a "non-specific" fashion in vitro and in vivo. Here, in order to identify potential additional non-canonical binding sites for MYC/MAX, we employed high throughput in vitro protein-binding microarrays, along with electrophoretic mobility-shift assays and bioinformatic analyses of MYC-bound genomic loci in vivo. We identified all hexameric motifs preferentially bound by MYC/MAX in vitro, which include the low-affinity non-E-box sequence AACGTT, and found that the vast majority (87%) of MYC-bound genomic sites in a human B cell line contain at least one of the top 21 motifs bound by MYC:MAX in vitro. We further show that high MYC/MAX concentrations are needed for specific binding to the low-affinity sequence AACGTT in vitro and that elevated MYC levels in vivo more markedly increase the occupancy of AACGTT sites relative to CME sites, especially at distal intergenic and intragenic loci. Hence, MYC binds diverse DNA motifs with a broad range of affinities in a sequence-specific and dose-dependent manner, suggesting that MYC overexpression has more selective effects on the tumor transcriptome than previously thought.
Single helically folded aromatic oligoamides that mimic the charge surface of double-stranded B-DNA

NASA Astrophysics Data System (ADS)

Ziach, Krzysztof; Chollet, Céline; Parissi, Vincent; Prabhakaran, Panchami; Marchivie, Mathieu; Corvaglia, Valentina; Bose, Partha Pratim; Laxmi-Reddy, Katta; Godde, Frédéric; Schmitter, Jean-Marie; Chaignepain, Stéphane; Pourquier, Philippe; Huc, Ivan

2018-05-01

Numerous essential biomolecular processes require the recognition of DNA surface features by proteins. Molecules mimicking these features could potentially act as decoys and interfere with pharmacologically or therapeutically relevant protein-DNA interactions. Although naturally occurring DNA-mimicking proteins have been described, synthetic tunable molecules that mimic the charge surface of double-stranded DNA are not known. Here, we report the design, synthesis and structural characterization of aromatic oligoamides that fold into single helical conformations and display a double helical array of negatively charged residues in positions that match the phosphate moieties in B-DNA. These molecules were able to inhibit several enzymes possessing non-sequence-selective DNA-binding properties, including topoisomerase 1 and HIV-1 integrase, presumably through specific foldamer-protein interactions, whereas sequence-selective enzymes were not inhibited. Such modular and synthetically accessible DNA mimics provide a versatile platform to design novel inhibitors of protein-DNA interactions.
Strand-Specific Analysis of DNA Synthesis and Proteins Association with DNA Replication Forks in Budding Yeast.

PubMed

Yu, Chuanhe; Gan, Haiyun; Zhang, Zhiguo

2018-01-01

DNA replication initiates at DNA replication origins after unwinding of double-strand DNA(dsDNA) by replicative helicase to generate single-stranded DNA (ssDNA) templates for the continuous synthesis of leading-strand and the discontinuous synthesis of lagging-strand. Therefore, methods capable of detecting strand-specific information will likely yield insight into the association of proteins at leading and lagging strand of DNA replication forks and the regulation of leading and lagging strand synthesis during DNA replication. The enrichment and Sequencing of Protein-Associated Nascent DNA (eSPAN), which measure the relative amounts of proteins at nascent leading and lagging strands of DNA replication forks, is a step-wise procedure involving the chromatin immunoprecipitation (ChIP) of a protein of interest followed by the enrichment of protein-associated nascent DNA through BrdU immunoprecipitation. The isolated ssDNA is then subjected to strand-specific sequencing. This method can detect whether a protein is enriched at leading or lagging strand of DNA replication forks. In addition to eSPAN, two other strand-specific methods, (ChIP-ssSeq), which detects potential protein-ssDNA binding and BrdU-IP-ssSeq, which can measure synthesis of both leading and lagging strand, were developed along the way. These methods can provide strand-specific and complementary information about the association of the target protein with DNA replication forks as well as synthesis of leading and lagging strands genome wide. Below, we describe the detailed eSPAN, ChIP-ssSeq, and BrdU-IP-ssSeq protocols.
In silico modeling of epigenetic-induced changes in photoreceptor cis-regulatory elements.

PubMed

Hossain, Reafa A; Dunham, Nicholas R; Enke, Raymond A; Berndsen, Christopher E

2018-01-01

DNA methylation is a well-characterized epigenetic repressor of mRNA transcription in many plant and vertebrate systems. However, the mechanism of this repression is not fully understood. The process of transcription is controlled by proteins that regulate recruitment and activity of RNA polymerase by binding to specific cis-regulatory sequences. Cone-rod homeobox (CRX) is a well-characterized mammalian transcription factor that controls photoreceptor cell-specific gene expression. Although much is known about the functions and DNA binding specificity of CRX, little is known about how DNA methylation modulates CRX binding affinity to genomic cis-regulatory elements. We used bisulfite pyrosequencing of human ocular tissues to measure DNA methylation levels of the regulatory regions of RHO , PDE6B, PAX6 , and LINE1 retrotransposon repeats. To describe the molecular mechanism of repression, we used molecular modeling to illustrate the effect of DNA methylation on human RHO regulatory sequences. In this study, we demonstrate an inverse correlation between DNA methylation in regulatory regions adjacent to the human RHO and PDE6B genes and their subsequent transcription in human ocular tissues. Docking of CRX to the DNA models shows that CRX interacts with the grooves of these sequences, suggesting changes in groove structure could regulate binding. Molecular dynamics simulations of the RHO promoter and enhancer regions show changes in the flexibility and groove width upon epigenetic modification. Models also demonstrate changes in the local dynamics of CRX binding sites within RHO regulatory sequences which may account for the repression of CRX-dependent transcription. Collectively, these data demonstrate epigenetic regulation of CRX binding sites in human retinal tissue and provide insight into the mechanism of this mode of epigenetic regulation to be tested in future experiments.
TALE-PvuII fusion proteins--novel tools for gene targeting.

PubMed

Yanik, Mert; Alzubi, Jamal; Lahaye, Thomas; Cathomen, Toni; Pingoud, Alfred; Wende, Wolfgang

2013-01-01

Zinc finger nucleases (ZFNs) consist of zinc fingers as DNA-binding module and the non-specific DNA-cleavage domain of the restriction endonuclease FokI as DNA-cleavage module. This architecture is also used by TALE nucleases (TALENs), in which the DNA-binding modules of the ZFNs have been replaced by DNA-binding domains based on transcription activator like effector (TALE) proteins. Both TALENs and ZFNs are programmable nucleases which rely on the dimerization of FokI to induce double-strand DNA cleavage at the target site after recognition of the target DNA by the respective DNA-binding module. TALENs seem to have an advantage over ZFNs, as the assembly of TALE proteins is easier than that of ZFNs. Here, we present evidence that variant TALENs can be produced by replacing the catalytic domain of FokI with the restriction endonuclease PvuII. These fusion proteins recognize only the composite recognition site consisting of the target site of the TALE protein and the PvuII recognition sequence (addressed site), but not isolated TALE or PvuII recognition sites (unaddressed sites), even at high excess of protein over DNA and long incubation times. In vitro, their preference for an addressed over an unaddressed site is > 34,000-fold. Moreover, TALE-PvuII fusion proteins are active in cellula with minimal cytotoxicity.
Identification of a factor in HeLa cells specific for an upstream transcriptional control sequence of an EIA-inducible adenovirus promoter and its relative abundance in infected and uninfected cells.

PubMed Central

SivaRaman, L; Subramanian, S; Thimmappaya, B

1986-01-01

Utilizing the gel electrophoresis/DNA binding assay, a factor specific for the upstream transcriptional control sequence of the EIA-inducible adenovirus EIIA-early promoter has been detected in HeLa cell nuclear extract. Analysis of linker-scanning mutants of the promoter by DNA binding assays and methylation-interference experiments show that the factor binds to the 17-nucleotide sequence 5' TGGAGATGACGTAGTTT 3' located between positions -66 and -82 upstream from the cap site. This sequence has been shown to be essential for transcription of this promoter. The EIIA-early-promoter specific factor was found to be present at comparable levels in uninfected HeLa cells and in cells infected with either wild-type adenovirus or the EIA-deletion mutant dl312 under conditions in which the EIA proteins are induced to high levels [7 or 20 hr after infection in the presence of arabinonucleoside (cytosine arabinoside)]. Based on the quantitation in DNA binding assays, it appears that the mechanism of EIA-activated transcription of the EIIA-early promoter does not involve a net change in the amounts of this factor. Images PMID:2942943
Dissecting the protein architecture of DNA-binding transcription factors in bacteria and archaea.

PubMed

Rivera-Gómez, Nancy; Martínez-Núñez, Mario Alberto; Pastor, Nina; Rodriguez-Vazquez, Katya; Perez-Rueda, Ernesto

2017-08-01

Gene regulation at the transcriptional level is a central process in all organisms where DNA-binding transcription factors play a fundamental role. This class of proteins binds specifically at DNA sequences, activating or repressing gene expression as a function of the cell's metabolic status, operator context and ligand-binding status, among other factors, through the DNA-binding domain (DBD). In addition, TFs may contain partner domains (PaDos), which are involved in ligand binding and protein-protein interactions. In this work, we systematically evaluated the distribution, abundance and domain organization of DNA-binding TFs in 799 non-redundant bacterial and archaeal genomes. We found that the distributions of the DBDs and their corresponding PaDos correlated with the size of the genome. We also identified specific combinations between the DBDs and their corresponding PaDos. Within each class of DBDs there are differences in the actual angle formed at the dimerization interface, responding to the presence/absence of ligands and/or crystallization conditions, setting the orientation of the resulting helices and wings facing the DNA. Our results highlight the importance of PaDos as central elements that enhance the diversity of regulatory functions in all bacterial and archaeal organisms, and our results also demonstrate the role of PaDos in sensing diverse signal compounds. The highly specific interactions between DBDs and PaDos observed in this work, together with our structural analysis highlighting the difficulty in predicting both inter-domain geometry and quaternary structure, suggest that these systems appeared once and evolved with diverse duplication events in all the analysed organisms.
Study of base pair mutations in proline-rich homeodomain (PRH)-DNA complexes using molecular dynamics.

PubMed

Jalili, Seifollah; Karami, Leila; Schofield, Jeremy

2013-06-01

Proline-rich homeodomain (PRH) is a regulatory protein controlling transcription and gene expression processes by binding to the specific sequence of DNA, especially to the sequence 5'-TAATNN-3'. The impact of base pair mutations on the binding between the PRH protein and DNA is investigated using molecular dynamics and free energy simulations to identify DNA sequences that form stable complexes with PRH. Three 20-ns molecular dynamics simulations (PRH-TAATTG, PRH-TAATTA and PRH-TAATGG complexes) in explicit solvent water were performed to investigate three complexes structurally. Structural analysis shows that the native TAATTG sequence forms a complex that is more stable than complexes with base pair mutations. It is also observed that upon mutation, the number and occupancy of the direct and water-mediated hydrogen bonds decrease. Free energy calculations performed with the thermodynamic integration method predict relative binding free energies of 0.64 and 2 kcal/mol for GC to AT and TA to GC mutations, respectively, suggesting that among the three DNA sequences, the PRH-TAATTG complex is more stable than the two mutated complexes. In addition, it is demonstrated that the stability of the PRH-TAATTA complex is greater than that of the PRH-TAATGG complex.
Sequence Based Prediction of DNA-Binding Proteins Based on Hybrid Feature Selection Using Random Forest and Gaussian Naïve Bayes

PubMed Central

Lou, Wangchao; Wang, Xiaoqing; Chen, Fan; Chen, Yixiao; Jiang, Bo; Zhang, Hua

2014-01-01

Developing an efficient method for determination of the DNA-binding proteins, due to their vital roles in gene regulation, is becoming highly desired since it would be invaluable to advance our understanding of protein functions. In this study, we proposed a new method for the prediction of the DNA-binding proteins, by performing the feature rank using random forest and the wrapper-based feature selection using forward best-first search strategy. The features comprise information from primary sequence, predicted secondary structure, predicted relative solvent accessibility, and position specific scoring matrix. The proposed method, called DBPPred, used Gaussian naïve Bayes as the underlying classifier since it outperformed five other classifiers, including decision tree, logistic regression, k-nearest neighbor, support vector machine with polynomial kernel, and support vector machine with radial basis function. As a result, the proposed DBPPred yields the highest average accuracy of 0.791 and average MCC of 0.583 according to the five-fold cross validation with ten runs on the training benchmark dataset PDB594. Subsequently, blind tests on the independent dataset PDB186 by the proposed model trained on the entire PDB594 dataset and by other five existing methods (including iDNA-Prot, DNA-Prot, DNAbinder, DNABIND and DBD-Threader) were performed, resulting in that the proposed DBPPred yielded the highest accuracy of 0.769, MCC of 0.538, and AUC of 0.790. The independent tests performed by the proposed DBPPred on completely a large non-DNA binding protein dataset and two RNA binding protein datasets also showed improved or comparable quality when compared with the relevant prediction methods. Moreover, we observed that majority of the selected features by the proposed method are statistically significantly different between the mean feature values of the DNA-binding and the non DNA-binding proteins. All of the experimental results indicate that the proposed DBPPred can be an alternative perspective predictor for large-scale determination of DNA-binding proteins. PMID:24475169

Vital Roles of the Second DNA-binding Site of Rad52 Protein in Yeast Homologous Recombination*

PubMed Central

Arai, Naoto; Kagawa, Wataru; Saito, Kengo; Shingu, Yoshinori; Mikawa, Tsutomu; Kurumizaka, Hitoshi; Shibata, Takehiko

2011-01-01

RecA/Rad51 proteins are essential in homologous DNA recombination and catalyze the ATP-dependent formation of D-loops from a single-stranded DNA and an internal homologous sequence in a double-stranded DNA. RecA and Rad51 require a “recombination mediator” to overcome the interference imposed by the prior binding of single-stranded binding protein/replication protein A to the single-stranded DNA. Rad52 is the prototype of recombination mediators, and the human Rad52 protein has two distinct DNA-binding sites: the first site binds to single-stranded DNA, and the second site binds to either double- or single-stranded DNA. We previously showed that yeast Rad52 extensively stimulates Rad51-catalyzed D-loop formation even in the absence of replication protein A, by forming a 2:1 stoichiometric complex with Rad51. However, the precise roles of Rad52 and Rad51 within the complex are unknown. In the present study, we constructed yeast Rad52 mutants in which the amino acid residues corresponding to the second DNA-binding site of the human Rad52 protein were replaced with either alanine or aspartic acid. We found that the second DNA-binding site is important for the yeast Rad52 function in vivo. Rad51-Rad52 complexes consisting of these Rad52 mutants were defective in promoting the formation of D-loops, and the ability of the complex to associate with double-stranded DNA was specifically impaired. Our studies suggest that Rad52 within the complex associates with double-stranded DNA to assist Rad51-mediated homologous pairing. PMID:21454474
TRX-LOGOS - a graphical tool to demonstrate DNA information content dependent upon backbone dynamics in addition to base sequence.

PubMed

Fortin, Connor H; Schulze, Katharina V; Babbitt, Gregory A

2015-01-01

It is now widely-accepted that DNA sequences defining DNA-protein interactions functionally depend upon local biophysical features of DNA backbone that are important in defining sites of binding interaction in the genome (e.g. DNA shape, charge and intrinsic dynamics). However, these physical features of DNA polymer are not directly apparent when analyzing and viewing Shannon information content calculated at single nucleobases in a traditional sequence logo plot. Thus, sequence logos plots are severely limited in that they convey no explicit information regarding the structural dynamics of DNA backbone, a feature often critical to binding specificity. We present TRX-LOGOS, an R software package and Perl wrapper code that interfaces the JASPAR database for computational regulatory genomics. TRX-LOGOS extends the traditional sequence logo plot to include Shannon information content calculated with regard to the dinucleotide-based BI-BII conformation shifts in phosphate linkages on the DNA backbone, thereby adding a visual measure of intrinsic DNA flexibility that can be critical for many DNA-protein interactions. TRX-LOGOS is available as an R graphics module offered at both SourceForge and as a download supplement at this journal. To demonstrate the general utility of TRX logo plots, we first calculated the information content for 416 Saccharomyces cerevisiae transcription factor binding sites functionally confirmed in the Yeastract database and matched to previously published yeast genomic alignments. We discovered that flanking regions contain significantly elevated information content at phosphate linkages than can be observed at nucleobases. We also examined broader transcription factor classifications defined by the JASPAR database, and discovered that many general signatures of transcription factor binding are locally more information rich at the level of DNA backbone dynamics than nucleobase sequence. We used TRX-logos in combination with MEGA 6.0 software for molecular evolutionary genetics analysis to visually compare the human Forkhead box/FOX protein evolution to its binding site evolution. We also compared the DNA binding signatures of human TP53 tumor suppressor determined by two different laboratory methods (SELEX and ChIP-seq). Further analysis of the entire yeast genome, center aligned at the start codon, also revealed a distinct sequence-independent 3 bp periodic pattern in information content, present only in coding region, and perhaps indicative of the non-random organization of the genetic code. TRX-LOGOS is useful in any situation in which important information content in DNA can be better visualized at the positions of phosphate linkages (i.e. dinucleotides) where the dynamic properties of the DNA backbone functions to facilitate DNA-protein interaction.
DNA-binding by Haemophilus influenzae and Escherichia coli YbaB, members of a widely-distributed bacterial protein family.

PubMed

Cooley, Anne E; Riley, Sean P; Kral, Keith; Miller, M Clarke; DeMoll, Edward; Fried, Michael G; Stevenson, Brian

2009-07-13

Genes orthologous to the ybaB loci of Escherichia coli and Haemophilus influenzae are widely distributed among eubacteria. Several years ago, the three-dimensional structures of the YbaB orthologs of both E. coli and H. influenzae were determined, revealing a novel "tweezer"-like structure. However, a function for YbaB had remained elusive, with an early study of the H. influenzae ortholog failing to detect DNA-binding activity. Our group recently determined that the Borrelia burgdorferi YbaB ortholog, EbfC, is a DNA-binding protein. To reconcile those results, we assessed the abilities of both the H. influenzae and E. coli YbaB proteins to bind DNA to which B. burgdorferi EbfC can bind. Both the H. influenzae and the E. coli YbaB proteins bound to tested DNAs. DNA-binding was not well competed with poly-dI-dC, indicating some sequence preferences for those two proteins. Analyses of binding characteristics determined that both YbaB orthologs bind as homodimers. Different DNA sequence preferences were observed between H. influenzae YbaB, E. coli YbaB and B. burgdorferi EbfC, consistent with amino acid differences in the putative DNA-binding domains of these proteins. Three distinct members of the YbaB/EbfC bacterial protein family have now been demonstrated to bind DNA. Members of this protein family are encoded by a broad range of bacteria, including many pathogenic species, and results of our studies suggest that all such proteins have DNA-binding activities. The functions of YbaB/EbfC family members in each bacterial species are as-yet unknown, but given the ubiquity of these DNA-binding proteins among Eubacteria, further investigations are warranted.
The artificial zinc finger coding gene 'Jazz' binds the utrophin promoter and activates transcription.

PubMed

Corbi, N; Libri, V; Fanciulli, M; Tinsley, J M; Davies, K E; Passananti, C

2000-06-01

Up-regulation of utrophin gene expression is recognized as a plausible therapeutic approach in the treatment of Duchenne muscular dystrophy (DMD). We have designed and engineered new zinc finger-based transcription factors capable of binding and activating transcription from the promoter of the dystrophin-related gene, utrophin. Using the recognition 'code' that proposes specific rules between zinc finger primary structure and potential DNA binding sites, we engineered a new gene named 'Jazz' that encodes for a three-zinc finger peptide. Jazz belongs to the Cys2-His2 zinc finger type and was engineered to target the nine base pair DNA sequence: 5'-GCT-GCT-GCG-3', present in the promoter region of both the human and mouse utrophin gene. The entire zinc finger alpha-helix region, containing the amino acid positions that are crucial for DNA binding, was specifically chosen on the basis of the contacts more frequently represented in the available list of the 'code'. Here we demonstrate that Jazz protein binds specifically to the double-stranded DNA target, with a dissociation constant of about 32 nM. Band shift and super-shift experiments confirmed the high affinity and specificity of Jazz protein for its DNA target. Moreover, we show that chimeric proteins, named Gal4-Jazz and Sp1-Jazz, are able to drive the transcription of a test gene from the human utrophin promoter.
Novel mechanism of gene regulation: the protein Rv1222 of Mycobacterium tuberculosis inhibits transcription by anchoring the RNA polymerase onto DNA.

PubMed

Rudra, Paulami; Prajapati, Ranjit Kumar; Banerjee, Rajdeep; Sengupta, Shreya; Mukhopadhyay, Jayanta

2015-07-13

We propose a novel mechanism of gene regulation in Mycobacterium tuberculosis where the protein Rv1222 inhibits transcription by anchoring RNA polymerase (RNAP) onto DNA. In contrast to our existing knowledge that transcriptional repressors function either by binding to DNA at specific sequences or by binding to RNAP, we show that Rv1222-mediated transcription inhibition requires simultaneous binding of the protein to both RNAP and DNA. We demonstrate that the positively charged C-terminus tail of Rv1222 is responsible for anchoring RNAP on DNA, hence the protein slows down the movement of RNAP along the DNA during transcription elongation. The interaction between Rv1222 and DNA is electrostatic, thus the protein could inhibit transcription from any gene. As Rv1222 slows down the RNA synthesis, upon expression of the protein in Mycobacterium smegmatis or Escherichia coli, the growth rate of the bacteria is severely impaired. The protein does not possess any significant affinity for DNA polymerase, thus, is unable to inhibit DNA synthesis. The proposed mechanism by which Rv1222 inhibits transcription reveals a new repertoire of prokaryotic gene regulation. © Crown copyright 2015.
Incorporation of native antibodies and Fc-fusion proteins on DNA nanostructures via a modular conjugation strategy† †Electronic supplementary information (ESI) available: Experimental methods, DNA origami design, DNA sequences, and additional experimental data. See DOI: 10.1039/c7cc04178k

PubMed Central

Rosier, Bas J. H. M.; Cremers, Glenn A. O.; Engelen, Wouter; Merkx, Maarten; Brunsveld, Luc

2017-01-01

A photocrosslinkable protein G variant was used as an adapter protein to covalently and site-specifically conjugate an antibody and an Fc-fusion protein to an oligonucleotide. This modular approach enables straightforward decoration of DNA nanostructures with complex native proteins while retaining their innate binding affinity, allowing precise control over the nanoscale spatial organization of such proteins for in vitro and in vivo biomedical applications. PMID:28617516
Measurements of nonlinear Hall-driven reconnection in the reversed field pinch

NASA Astrophysics Data System (ADS)

Tharp, Timothy D.

Complex organisms are able to develop because of the complex regulatory systems that control their gene expression. The first step in this regulation, transcription initiation, is controlled by transcription factors. Transcription factors are modular proteins composed of two distinct domains, the DNA binding domain and the regulatory domain. These molecules are involved in a plethora of important biological processes including embryogenesis, development, cell health, and cancer. Tissue enriched transcription factors Nkx-2.5 and Gata4 are involved in cardiac development and cardiac health. In this thesis the DNA binding specificity of Nkx-2.5 will be analyzed using a high throughput double stranded DNA platform called Cognate Site Identifier (CSI) arrays (Chapter 2). The full DNA binding specificity of Nkx-2.5 and Nkx-2.5 mutants will be visualized using Sequence Specificity Landscapes (SSLs). In Chapter 3, the definition of binding specificity will be investigated by evaluating a number of different DNA binding folds by CSI and SSLs. CSI and SSLs will also be used to evaluate different pyrrole/imidazole hairpin polyamides in order to better characterize these small molecule DNA binding domains. CSI and SSL data will be applied to the genome in order to explain the biological function an artificial transcription factor. Chapter 4 will discuss the mechanism of nonspecific DNA binding. The historical means of predicting DNA binding will be challenged by utilizing high throughput experiments. The effect of salt concentration on both specific and nonspecific binding will also be investigated. Finally, in Chapter 5, a generation of Protein DNA Dimerizer will be discussed. A PDD that regulates transcription on genomic DNA by binding cooperatively with the heart IF Gata4 will be characterized. These studies provide understanding of, and a means to control, how transcription factors sample the endless sea of DNA in the genome in order to regulate gene expression with such wonderful specificity.
Structure of homeodomain-leucine zipper/DNA complexes studied using hydroxyl radical cleavage of DNA and methylation interference.

PubMed

Tron, Adriana E; Comelli, Raúl N; Gonzalez, Daniel H

2005-12-27

Homeodomain-leucine zipper (HD-Zip) proteins, unlike most homeodomain proteins, bind a pseudopalindromic DNA sequence as dimers. We have investigated the structure of the DNA complexes formed by two HD-Zip proteins with different nucleotide preferences at the central position of the binding site using footprinting and interference methods. The results indicate that the respective complexes are not symmetric, with the strand bearing a central purine (top strand) showing higher protection around the central region and the bottom strand protected toward the 3' end. Binding to a sequence with a nonpreferred central base pair produces a decrease in protection in either the top or the bottom strand, depending upon the protein. Modeling studies derived from the complex formed by the monomeric Antennapedia homeodomain with DNA indicate that in the HD-Zip/DNA complex the recognition helix of one of the monomers is displaced within the major groove respective to the other one. This monomer seems to lose contacts with a part of the recognition sequence upon binding to the nonpreferred site. The results show that the structure of the complex formed by HD-Zip proteins with DNA is dependent upon both protein intrinsic characteristics and the nucleotides present at the central position of the recognition sequence.
Development of Genome Engineering Tools from Plant-Specific PPR Proteins Using Animal Cultured Cells.

PubMed

Kobayashi, Takehito; Yagi, Yusuke; Nakamura, Takahiro

2016-01-01

The pentatricopeptide repeat (PPR) motif is a sequence-specific RNA/DNA-binding module. Elucidation of the RNA/DNA recognition mechanism has enabled engineering of PPR motifs as new RNA/DNA manipulation tools in living cells, including for genome editing. However, the biochemical characteristics of PPR proteins remain unknown, mostly due to the instability and/or unfolding propensities of PPR proteins in heterologous expression systems such as bacteria and yeast. To overcome this issue, we constructed reporter systems using animal cultured cells. The cell-based system has highly attractive features for PPR engineering: robust eukaryotic gene expression; availability of various vectors, reagents, and antibodies; highly efficient DNA delivery ratio (>80 %); and rapid, high-throughput data production. In this chapter, we introduce an example of such reporter systems: a PPR-based sequence-specific translational activation system. The cell-based reporter system can be applied to characterize plant genes of interested and to PPR engineering.
BuD, a helix–loop–helix DNA-binding domain for genome modification

PubMed Central

Stella, Stefano; Molina, Rafael; López-Méndez, Blanca; Juillerat, Alexandre; Bertonati, Claudia; Daboussi, Fayza; Campos-Olivas, Ramon; Duchateau, Phillippe; Montoya, Guillermo

2014-01-01

DNA editing offers new possibilities in synthetic biology and biomedicine for modulation or modification of cellular functions to organisms. However, inaccuracy in this process may lead to genome damage. To address this important problem, a strategy allowing specific gene modification has been achieved through the addition, removal or exchange of DNA sequences using customized proteins and the endogenous DNA-repair machinery. Therefore, the engineering of specific protein–DNA interactions in protein scaffolds is key to providing ‘toolkits’ for precise genome modification or regulation of gene expression. In a search for putative DNA-binding domains, BurrH, a protein that recognizes a 19 bp DNA target, was identified. Here, its apo and DNA-bound crystal structures are reported, revealing a central region containing 19 repeats of a helix–loop–helix modular domain (BurrH domain; BuD), which identifies the DNA target by a single residue-to-nucleotide code, thus facilitating its redesign for gene targeting. New DNA-binding specificities have been engineered in this template, showing that BuD-derived nucleases (BuDNs) induce high levels of gene targeting in a locus of the human haemoglobin β (HBB) gene close to mutations responsible for sickle-cell anaemia. Hence, the unique combination of high efficiency and specificity of the BuD arrays can push forward diverse genome-modification approaches for cell or organism redesign, opening new avenues for gene editing. PMID:25004980
A Bromodomain-Containing Protein from Tomato Specifically Binds Potato Spindle Tuber Viroid RNA In Vitro and In Vivo

PubMed Central

Martínez de Alba, Angel Emilio; Sägesser, Rudolf; Tabler, Martin; Tsagris, Mina

2003-01-01

For the identification of RNA-binding proteins that specifically interact with potato spindle tuber viroid (PSTVd), we subjected a tomato cDNA expression library prepared from viroid-infected leaves to an RNA ligand screening procedure. We repeatedly identified cDNA clones that expressed a protein of 602 amino acids. The protein contains a bromodomain and was termed viroid RNA-binding protein 1 (VIRP1). The specificity of interaction of VIRP1 with viroid RNA was studied by different methodologies, which included Northwestern blotting, plaque lift, and electrophoretic mobility shift assays. VIRP1 interacted strongly and specifically with monomeric and oligomeric PSTVd positive-strand RNA transcripts. Other RNAs, for example, U1 RNA, did not bind to VIRP1. Further, we could immunoprecipitate complexes from infected tomato leaves that contained VIRP1 and viroid RNA in vivo. Analysis of the protein sequence revealed that VIRP1 is a member of a newly identified family of transcriptional regulators associated with chromatin remodeling. VIRP1 is the first member of this family of proteins, for which a specific RNA-binding activity is shown. A possible role of VIRP1 in viroid replication and in RNA mediated chromatin remodeling is discussed. PMID:12915580
In vivo binding of PRDM9 reveals interactions with noncanonical genomic sites

PubMed Central

Grey, Corinne; Clément, Julie A.J.; Buard, Jérôme; Leblanc, Benjamin; Gut, Ivo; Gut, Marta; Duret, Laurent

2017-01-01

In mouse and human meiosis, DNA double-strand breaks (DSBs) initiate homologous recombination and occur at specific sites called hotspots. The localization of these sites is determined by the sequence-specific DNA binding domain of the PRDM9 histone methyl transferase. Here, we performed an extensive analysis of PRDM9 binding in mouse spermatocytes. Unexpectedly, we identified a noncanonical recruitment of PRDM9 to sites that lack recombination activity and the PRDM9 binding consensus motif. These sites include gene promoters, where PRDM9 is recruited in a DSB-dependent manner. Another subset reveals DSB-independent interactions between PRDM9 and genomic sites, such as the binding sites for the insulator protein CTCF. We propose that these DSB-independent sites result from interactions between hotspot-bound PRDM9 and genomic sequences located on the chromosome axis. PMID:28336543
Characterization and modification of phage T7 DNA polymerase for use in DNA sequencing; Progress report, June 1, 1990--May 31, 1993

DOE Office of Scientific and Technical Information (OSTI.GOV)

Richardson, C.C.

1993-12-31

This project focuses on the DNA polymerase (gene 5 protein) of phage T7 for use in DNA sequence analysis. Gene 5 protein interacts with accessory proteins to acquire properties essential for DNA replication. One goal is to understand these interactions in order to modify the proteins for use in DNA sequencing. E. coli thioredoxin, binds to gene 5 protein and clamps it to a primer-template. They have analyzed the binding of gene 5 protein-thioredoxin to primer-templates and have defined the optimal conditions to form an extremely stable complex with a dNTP in the polymerase catalytic site. The spatial proximity ofmore » these components has been determined using fluorescence emission anisotropy. The T7 DNA binding protein, the gene 2.5 protein, interacts with gene 5 protein and gene 4 protein to increase processivity and primer synthesis, respectively. Mutant gene 2.5 proteins have been isolated that do not interact with T7 DNA polymerase and can not support T7 growth. The nucleotide binding site of the T7 helicase has been identified and mutations affecting the site provide information on how the hydrolysis of NTPs fuel its unidirectional translocation. The sequence, GTC, has been shown to be necessary and sufficient for recognition by the T7 primase. The T7 gene 5.5 protein interacts with the E. coli nucleoid protein, H-NS, and also overcomes the phage {lambda} rex restriction system.« less
Directing an artificial zinc finger protein to new targets by fusion to a non-DNA-binding domain.

PubMed

Lim, Wooi F; Burdach, Jon; Funnell, Alister P W; Pearson, Richard C M; Quinlan, Kate G R; Crossley, Merlin

2016-04-20

Transcription factors are often regarded as having two separable components: a DNA-binding domain (DBD) and a functional domain (FD), with the DBD thought to determine target gene recognition. While this holds true for DNA bindingin vitro, it appears thatin vivoFDs can also influence genomic targeting. We fused the FD from the well-characterized transcription factor Krüppel-like Factor 3 (KLF3) to an artificial zinc finger (AZF) protein originally designed to target the Vascular Endothelial Growth Factor-A (VEGF-A) gene promoter. We compared genome-wide occupancy of the KLF3FD-AZF fusion to that observed with AZF. AZF bound to theVEGF-Apromoter as predicted, but was also found to occupy approximately 25,000 other sites, a large number of which contained the expected AZF recognition sequence, GCTGGGGGC. Interestingly, addition of the KLF3 FD re-distributes the fusion protein to new sites, with total DNA occupancy detected at around 50,000 sites. A portion of these sites correspond to known KLF3-bound regions, while others contained sequences similar but not identical to the expected AZF recognition sequence. These results show that FDs can influence and may be useful in directing AZF DNA-binding proteins to specific targets and provide insights into how natural transcription factors operate. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Widespread evidence of cooperative DNA binding by transcription factors in Drosophila development.

PubMed

Kazemian, Majid; Pham, Hannah; Wolfe, Scot A; Brodsky, Michael H; Sinha, Saurabh

2013-09-01

Regulation of eukaryotic gene transcription is often combinatorial in nature, with multiple transcription factors (TFs) regulating common target genes, often through direct or indirect mutual interactions. Many individual examples of cooperative binding by directly interacting TFs have been identified, but it remains unclear how pervasive this mechanism is during animal development. Cooperative TF binding should be manifest in genomic sequences as biased arrangements of TF-binding sites. Here, we explore the extent and diversity of such arrangements related to gene regulation during Drosophila embryogenesis. We used the DNA-binding specificities of 322 TFs along with chromatin accessibility information to identify enriched spacing and orientation patterns of TF-binding site pairs. We developed a new statistical approach for this task, specifically designed to accurately assess inter-site spacing biases while accounting for the phenomenon of homotypic site clustering commonly observed in developmental regulatory regions. We observed a large number of short-range distance preferences between TF-binding site pairs, including examples where the preference depends on the relative orientation of the binding sites. To test whether these binding site patterns reflect physical interactions between the corresponding TFs, we analyzed 27 TF pairs whose binding sites exhibited short distance preferences. In vitro protein-protein binding experiments revealed that >65% of these TF pairs can directly interact with each other. For five pairs, we further demonstrate that they bind cooperatively to DNA if both sites are present with the preferred spacing. This study demonstrates how DNA-binding motifs can be used to produce a comprehensive map of sequence signatures for different mechanisms of combinatorial TF action.
Identification of a maize nucleic acid-binding protein (NBP) belonging to a family of nuclear-encoded chloroplast proteins.

PubMed Central

Cook, W B; Walker, J C

1992-01-01

A cDNA encoding a nuclear-encoded chloroplast nucleic acid-binding protein (NBP) has been isolated from maize. Identified as an in vitro DNA-binding activity, NBP belongs to a family of nuclear-encoded chloroplast proteins which share a common domain structure and are thought to be involved in posttranscriptional regulation of chloroplast gene expression. NBP contains an N-terminal chloroplast transit peptide, a highly acidic domain and a pair of ribonucleoprotein consensus sequence domains. NBP is expressed in a light-dependent, organ-specific manner which is consistent with its involvement in chloroplast biogenesis. The relationship of NBP to the other members of this protein family and their possible regulatory functions are discussed. Images PMID:1346929
In vitro selection of zinc fingers with altered DNA-binding specificity.

PubMed

Jamieson, A C; Kim, S H; Wells, J A

1994-05-17

We have used random mutagenesis and phage display to alter the DNA-binding specificity of Zif268, a transcription factor that contains three zinc finger domains. Four residues in the helix of finger 1 of Zif268 that potentially mediate DNA binding were identified from an X-ray structure of the Zif268-DNA complex. A library was constructed in which these residues were randomly mutated and the Zif268 variants were fused to a truncated version of the gene III coat protein on the surface of M13 filamentous phage particles. The phage displayed the mutant proteins in a monovalent fashion and were sorted by repeated binding and elution from affinity matrices containing different DNA sequences. When the matrix contained the natural nine base pair operator sequence 5'-GCG-TGG-GCG-3', native-like zinc fingers were isolated. New finger 1 variants were found by sorting with two different operators in which the singly modified triplets, GTG and TCG, replaced the native finger 1 triplet, GCG. Overall, the selected finger 1 variants contained a preponderance of polar residues at the four sites. Interestingly, the net charge of the four residues in any selected finger never derived more that one unit from neutrality despite the fact that about half the variants contained three or four charged residues over the four sites. Measurements of the dissociation constants for two of these purified finger 1 variants by gel-shift assay showed their specificities to vary over a 10-fold range, with the greatest affinity being for the DNA binding site for which they were sorted.(ABSTRACT TRUNCATED AT 250 WORDS)
A DNA sequence obtained by replacement of the dopamine RNA aptamer bases is not an aptamer.

PubMed

Álvarez-Martos, Isabel; Ferapontova, Elena E

2017-08-05

A unique specificity of the aptamer-ligand biorecognition and binding facilitates bioanalysis and biosensor development, contributing to discrimination of structurally related molecules, such as dopamine and other catecholamine neurotransmitters. The aptamer sequence capable of specific binding of dopamine is a 57 nucleotides long RNA sequence reported in 1997 (Biochemistry, 1997, 36, 9726). Later, it was suggested that the DNA homologue of the RNA aptamer retains the specificity of dopamine binding (Biochem. Biophys. Res. Commun., 2009, 388, 732). Here, we show that the DNA sequence obtained by the replacement of the RNA aptamer bases for their DNA analogues is not able of specific biorecognition of dopamine, in contrast to the original RNA aptamer sequence. This DNA sequence binds dopamine and structurally related catecholamine neurotransmitters non-specifically, as any DNA sequence, and, thus, is not an aptamer and cannot be used neither for in vivo nor in situ analysis of dopamine in the presence of structurally related neurotransmitters. Copyright © 2017 Elsevier Inc. All rights reserved.
Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.

PubMed Central

Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L

1987-01-01

To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded. Images PMID:2823109
Structure and specificity of the RNA-guided endonuclease Cas9 during DNA interrogation, target binding and cleavage

PubMed Central

Josephs, Eric A.; Kocak, D. Dewran; Fitzgibbon, Christopher J.; McMenemy, Joshua; Gersbach, Charles A.; Marszalek, Piotr E.

2015-01-01

CRISPR-associated endonuclease Cas9 cuts DNA at variable target sites designated by a Cas9-bound RNA molecule. Cas9's ability to be directed by single ‘guide RNA’ molecules to target nearly any sequence has been recently exploited for a number of emerging biological and medical applications. Therefore, understanding the nature of Cas9's off-target activity is of paramount importance for its practical use. Using atomic force microscopy (AFM), we directly resolve individual Cas9 and nuclease-inactive dCas9 proteins as they bind along engineered DNA substrates. High-resolution imaging allows us to determine their relative propensities to bind with different guide RNA variants to targeted or off-target sequences. Mapping the structural properties of Cas9 and dCas9 to their respective binding sites reveals a progressive conformational transformation at DNA sites with increasing sequence similarity to its target. With kinetic Monte Carlo (KMC) simulations, these results provide evidence of a ‘conformational gating’ mechanism driven by the interactions between the guide RNA and the 14th–17th nucleotide region of the targeted DNA, the stabilities of which we find correlate significantly with reported off-target cleavage rates. KMC simulations also reveal potential methodologies to engineer guide RNA sequences with improved specificity by considering the invasion of guide RNAs into targeted DNA duplex. PMID:26384421

Predicting DNA binding proteins using support vector machine with hybrid fractal features.

PubMed

Niu, Xiao-Hui; Hu, Xue-Hai; Shi, Feng; Xia, Jing-Bo

2014-02-21

DNA-binding proteins play a vitally important role in many biological processes. Prediction of DNA-binding proteins from amino acid sequence is a significant but not fairly resolved scientific problem. Chaos game representation (CGR) investigates the patterns hidden in protein sequences, and visually reveals previously unknown structure. Fractal dimensions (FD) are good tools to measure sizes of complex, highly irregular geometric objects. In order to extract the intrinsic correlation with DNA-binding property from protein sequences, CGR algorithm, fractal dimension and amino acid composition are applied to formulate the numerical features of protein samples in this paper. Seven groups of features are extracted, which can be computed directly from the primary sequence, and each group is evaluated by the 10-fold cross-validation test and Jackknife test. Comparing the results of numerical experiments, the group of amino acid composition and fractal dimension (21-dimension vector) gets the best result, the average accuracy is 81.82% and average Matthew's correlation coefficient (MCC) is 0.6017. This resulting predictor is also compared with existing method DNA-Prot and shows better performances. © 2013 The Authors. Published by Elsevier Ltd All rights reserved.
High-Mobility Group Chromatin Proteins 1 and 2 Functionally Interact with Steroid Hormone Receptors To Enhance Their DNA Binding In Vitro and Transcriptional Activity in Mammalian Cells

PubMed Central

Boonyaratanakornkit, Viroj; Melvin, Vida; Prendergast, Paul; Altmann, Magda; Ronfani, Lorenza; Bianchi, Marco E.; Taraseviciene, Laima; Nordeen, Steven K.; Allegretto, Elizabeth A.; Edwards, Dean P.

1998-01-01

We previously reported that the chromatin high-mobility group protein 1 (HMG-1) enhances the sequence-specific DNA binding activity of progesterone receptor (PR) in vitro, thus providing the first evidence that HMG-1 may have a coregulatory role in steroid receptor-mediated gene transcription. Here we show that HMG-1 and the highly related HMG-2 stimulate DNA binding by other steroid receptors, including estrogen, androgen, and glucocorticoid receptors, but have no effect on DNA binding by several nonsteroid nuclear receptors, including retinoid acid receptor (RAR), retinoic X receptor (RXR), and vitamin D receptor (VDR). As highly purified recombinant full-length proteins, all steroid receptors tested exhibited weak binding affinity for their optimal palindromic hormone response elements (HREs), and the addition of purified HMG-1 or -2 substantially increased their affinity for HREs. Purified RAR, RXR, and VDR also exhibited little to no detectable binding to their cognate direct repeat HREs but, in contrast to results with steroid receptors, the addition of HMG-1 or HMG-2 had no stimulatory effect. Instead, the addition of purified RXR enhanced RAR and VDR DNA binding through a heterodimerization mechanism and HMG-1 or HMG-2 had no further effect on DNA binding by RXR-RAR or RXR-VDR heterodimers. HMG-1 and HMG-2 (HMG-1/-2) themselves do not bind to progesterone response elements, but in the presence of PR they were detected as part of an HMG-PR-DNA ternary complex. HMG-1/-2 can also interact transiently in vitro with PR in the absence of DNA; however, no direct protein interaction was detected with VDR. These results, taken together with the fact that PR can bend its target DNA and that HMG-1/-2 are non-sequence-specific DNA binding proteins that recognize DNA structure, suggest that HMG-1/-2 are recruited to the PR-DNA complex by the combined effect of transient protein interaction and DNA bending. In transient-transfection assays, coexpression of HMG-1 or HMG-2 increased PR-mediated transcription in mammalian cells by as much as 7- to 10-fold without altering the basal promoter activity of target reporter genes. This increase in PR-mediated gene activation by coexpression of HMG-1/-2 was observed in different cell types and with different target promoters, suggesting a generality to the functional interaction between HMG-1/-2 and PR in vivo. Cotransfection of HMG-1 also increased reporter gene activation mediated by other steroid receptors, including glucocorticoid and androgen receptors, but it had a minimal influence on VDR-dependent transcription in vivo. These results support the conclusion that HMG-1/-2 are coregulatory proteins that increase the DNA binding and transcriptional activity of the steroid hormone class of receptors but that do not functionally interact with certain nonsteroid classes of nuclear receptors. PMID:9671457
Multiple Intrinsically Disordered Sequences Alter DNA Binding by the Homeodomain of the Drosophila Hox Protein Ultrabithorax*S⃞

PubMed Central

Liu, Ying; Matthews, Kathleen S.; Bondos, Sarah E.

2008-01-01

During animal development, distinct tissues, organs, and appendages are specified through differential gene transcription by Hox transcription factors. However, the conserved Hox homeodomains bind DNA with high affinity yet low specificity. We have therefore explored the structure of the Drosophila melanogaster Hox protein Ultrabithorax and the impact of its nonhomeodomain regions on DNA binding properties. Computational and experimental approaches identified several conserved, intrinsically disordered regions outside the homeodomain of Ultrabithorax that impact DNA binding by the homeodomain. Full-length Ultrabithorax bound to target DNA 2.5-fold weaker than its isolated homeodomain. Using N-terminal and C-terminal deletion mutants, we demonstrate that the YPWM region and the disordered microexons (termed the I1 region) inhibit DNA binding ∼2-fold, whereas the disordered I2 region inhibits homeodomain-DNA interaction a further ∼40-fold. Binding is restored almost to homeodomain affinity by the mostly disordered N-terminal 174 amino acids (R region) in a length-dependent manner. Both the I2 and R regions contain portions of the activation domain, functionally linking DNA binding and transcription regulation. Given that (i) the I1 region and a portion of the R region alter homeodomain-DNA binding as a function of pH and (ii) an internal deletion within I1 increases Ultrabithorax-DNA affinity, I1 must directly impact homeodomain-DNA interaction energetics. However, I2 appears to indirectly affect DNA binding in a manner countered by the N terminus. The amino acid sequences of I2 and much of the I1 and R regions vary significantly among Ultrabithorax orthologues, potentially diversifying Hox-DNA interactions. PMID:18508761
Single-Nucleotide-Specific Targeting of the Tf1 Retrotransposon Promoted by the DNA-Binding Protein Sap1 of Schizosaccharomyces pombe.

PubMed

Hickey, Anthony; Esnault, Caroline; Majumdar, Anasuya; Chatterjee, Atreyi Ghatak; Iben, James R; McQueen, Philip G; Yang, Andrew X; Mizuguchi, Takeshi; Grewal, Shiv I S; Levin, Henry L

2015-11-01

Transposable elements (TEs) constitute a substantial fraction of the eukaryotic genome and, as a result, have a complex relationship with their host that is both adversarial and dependent. To minimize damage to cellular genes, TEs possess mechanisms that target integration to sequences of low importance. However, the retrotransposon Tf1 of Schizosaccharomyces pombe integrates with a surprising bias for promoter sequences of stress-response genes. The clustering of integration in specific promoters suggests that Tf1 possesses a targeting mechanism that is important for evolutionary adaptation to changes in environment. We report here that Sap1, an essential DNA-binding protein, plays an important role in Tf1 integration. A mutation in Sap1 resulted in a 10-fold drop in Tf1 transposition, and measures of transposon intermediates support the argument that the defect occurred in the process of integration. Published ChIP-Seq data on Sap1 binding combined with high-density maps of Tf1 integration that measure independent insertions at single-nucleotide positions show that 73.4% of all integration occurs at genomic sequences bound by Sap1. This represents high selectivity because Sap1 binds just 6.8% of the genome. A genome-wide analysis of promoter sequences revealed that Sap1 binding and amounts of integration correlate strongly. More important, an alignment of the DNA-binding motif of Sap1 revealed integration clustered on both sides of the motif and showed high levels specifically at positions +19 and -9. These data indicate that Sap1 contributes to the efficiency and position of Tf1 integration. Copyright © 2015 by the Genetics Society of America.
Single-Nucleotide-Specific Targeting of the Tf1 Retrotransposon Promoted by the DNA-Binding Protein Sap1 of Schizosaccharomyces pombe

PubMed Central

Hickey, Anthony; Esnault, Caroline; Majumdar, Anasuya; Chatterjee, Atreyi Ghatak; Iben, James R.; McQueen, Philip G.; Yang, Andrew X.; Mizuguchi, Takeshi; Grewal, Shiv I. S.; Levin, Henry L.

2015-01-01

Transposable elements (TEs) constitute a substantial fraction of the eukaryotic genome and, as a result, have a complex relationship with their host that is both adversarial and dependent. To minimize damage to cellular genes, TEs possess mechanisms that target integration to sequences of low importance. However, the retrotransposon Tf1 of Schizosaccharomyces pombe integrates with a surprising bias for promoter sequences of stress-response genes. The clustering of integration in specific promoters suggests that Tf1 possesses a targeting mechanism that is important for evolutionary adaptation to changes in environment. We report here that Sap1, an essential DNA-binding protein, plays an important role in Tf1 integration. A mutation in Sap1 resulted in a 10-fold drop in Tf1 transposition, and measures of transposon intermediates support the argument that the defect occurred in the process of integration. Published ChIP-Seq data on Sap1 binding combined with high-density maps of Tf1 integration that measure independent insertions at single-nucleotide positions show that 73.4% of all integration occurs at genomic sequences bound by Sap1. This represents high selectivity because Sap1 binds just 6.8% of the genome. A genome-wide analysis of promoter sequences revealed that Sap1 binding and amounts of integration correlate strongly. More important, an alignment of the DNA-binding motif of Sap1 revealed integration clustered on both sides of the motif and showed high levels specifically at positions +19 and −9. These data indicate that Sap1 contributes to the efficiency and position of Tf1 integration. PMID:26358720
Genomic sequencing and in vivo footprinting of an expression-specific DNase I-hypersensitive site of avian vitellogenin II promoter reveal a demethylation of a mCpG and a change in specific interactions of proteins with DNA.

PubMed Central

Saluz, H P; Feavers, I M; Jiricny, J; Jost, J P

1988-01-01

Genomic sequencing was used to study the in vivo methylation pattern of two CpG sites in the promoter region of the avian vitellogenin gene. The CpG at position +10 was fully methylated in DNA isolated from tissues that do not express the gene but was unmethylated in the liver of mature hens and estradiol-treated roosters. In the latter tissue, this site became demethylated and DNase I hypersensitive after estradiol treatment. A second CpG (position -52) was unmethylated in all tissues examined. In vivo genomic footprinting with dimethyl sulfate revealed different patterns of DNA protection in silent and expressed genes. In rooster liver cells, at least 10 base pairs of DNA, including the methylated CpG, were protected by protein(s). Gel-shift assays indicated that a protein factor, present in rooster liver nuclear extract, bound at this site only when it was methylated. In hen liver cells, the same unmethylated CpG lies within a protected region of approximately equal to 20 base pairs. In vitro DNase I protection and gel-shift assays indicate that this sequence is bound by a protein, which binds both double- and single-stranded DNA. For the latter substrate, this factor was shown to bind solely the noncoding (i.e., mRNA-like) strand. Images PMID:3413118
Poxvirus uracil-DNA glycosylase-An unusual member of the family I uracil-DNA glycosylases: Poxvirus Uracil-DNA Glycosylase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schormann, Norbert; Zhukovskaya, Natalia; Bedwell, Gregory

We report that uracil-DNA glycosylases are ubiquitous enzymes, which play a key role repairing damages in DNA and in maintaining genomic integrity by catalyzing the first step in the base excision repair pathway. Within the superfamily of uracil-DNA glycosylases family I enzymes or UNGs are specific for recognizing and removing uracil from DNA. These enzymes feature conserved structural folds, active site residues and use common motifs for DNA binding, uracil recognition and catalysis. Within this family the enzymes of poxviruses are unique and most remarkable in terms of amino acid sequences, characteristic motifs and more importantly for their novel non-enzymaticmore » function in DNA replication. UNG of vaccinia virus, also known as D4, is the most extensively characterized UNG of the poxvirus family. D4 forms an unusual heterodimeric processivity factor by attaching to a poxvirus-specific protein A20, which also binds to the DNA polymerase E9 and recruits other proteins necessary for replication. D4 is thus integrated in the DNA polymerase complex, and its DNA-binding and DNA scanning abilities couple DNA processivity and DNA base excision repair at the replication fork. In conclusion, the adaptations necessary for taking on the new function are reflected in the amino acid sequence and the three-dimensional structure of D4. We provide an overview of the current state of the knowledge on the structure-function relationship of D4.« less
Chromatin immunoprecipitation of mouse embryos.

PubMed

Voss, Anne K; Dixon, Mathew P; McLennan, Tamara; Kueh, Andrew J; Thomas, Tim

2012-01-01

During prenatal development, a large number of different cell types are formed, the vast majority of which contain identical genetic material. The basis of the great variety in cell phenotype and function is the differential expression of the approximately 25,000 genes in the mammalian genome. Transcriptional activity is regulated at many levels by proteins, including members of the basal transcriptional apparatus, DNA-binding transcription factors, and chromatin-binding proteins. Importantly, chromatin structure dictates the availability of a specific genomic locus for transcriptional activation as well as the efficiency, with which transcription can occur. Chromatin immunoprecipitation (ChIP) is a method to assess if chromatin modifications or proteins are present at a specific locus. ChIP involves the cross linking of DNA and associated proteins and immunoprecipitation using specific antibodies to DNA-associated proteins followed by examination of the co-precipitated DNA sequences or proteins. In the last few years, ChIP has become an essential technique for scientists studying transcriptional regulation and chromatin structure. Using ChIP on mouse embryos, we can document the presence or absence of specific proteins and chromatin modifications at genomic loci in vivo during mammalian development. Here, we describe a ChIP technique adapted for mouse embryos.
Structural changes induced by binding of the high-mobility group I protein to a mouse satellite DNA sequence.

PubMed Central

Slama-Schwok, A; Zakrzewska, K; Léger, G; Leroux, Y; Takahashi, M; Käs, E; Debey, P

2000-01-01

Using spectroscopic methods, we have studied the structural changes induced in both protein and DNA upon binding of the High-Mobility Group I (HMG-I) protein to a 21-bp sequence derived from mouse satellite DNA. We show that these structural changes depend on the stoichiometry of the protein/DNA complexes formed, as determined by Job plots derived from experiments using pyrene-labeled duplexes. Circular dichroism and melting temperature experiments extended in the far ultraviolet range show that while native HMG-I is mainly random coiled in solution, it adopts a beta-turn conformation upon forming a 1:1 complex in which the protein first binds to one of two dA.dT stretches present in the duplex. HMG-I structure in the 1:1 complex is dependent on the sequence of its DNA target. A 3:1 HMG-I/DNA complex can also form and is characterized by a small increase in the DNA natural bend and/or compaction coupled to a change in the protein conformation, as determined from fluorescence resonance energy transfer (FRET) experiments. In addition, a peptide corresponding to an extended DNA-binding domain of HMG-I induces an ordered condensation of DNA duplexes. Based on the constraints derived from pyrene excimer measurements, we present a model of these nucleated structures. Our results illustrate an extreme case of protein structure induced by DNA conformation that may bear on the evolutionary conservation of the DNA-binding motifs of HMG-I. We discuss the functional relevance of the structural flexibility of HMG-I associated with the nature of its DNA targets and the implications of the binding stoichiometry for several aspects of chromatin structure and gene regulation. PMID:10777751
Design, synthesis and DNA interactions of a chimera between a platinum complex and an IHF mimicking peptide.

PubMed

Rao, Harita; Damian, Mariana S; Alshiekh, Alak; Elmroth, Sofi K C; Diederichsen, Ulf

2015-12-28

Conjugation of metal complexes with peptide scaffolds possessing high DNA binding affinity has shown to modulate their biological activities and to enhance their interaction with DNA. In this work, a platinum complex/peptide chimera was synthesized based on a model of the Integration Host Factor (IHF), an architectural protein possessing sequence specific DNA binding and bending abilities through its interaction with a minor groove. The model peptide consists of a cyclic unit resembling the minor grove binding subdomain of IHF, a positively charged lysine dendrimer for electrostatic interactions with the DNA phosphate backbone and a flexible glycine linker tethering the two units. A norvaline derived artificial amino acid was designed to contain a dimethylethylenediamine as a bidentate platinum chelating unit, and introduced into the IHF mimicking peptides. The interaction of the chimeric peptides with various DNA sequences was studied by utilizing the following experiments: thermal melting studies, agarose gel electrophoresis for plasmid DNA unwinding experiments, and native and denaturing gel electrophoresis to visualize non-covalent and covalent peptide-DNA adducts, respectively. By incorporation of the platinum metal center within the model peptide mimicking IHF we have attempted to improve its specificity and DNA targeting ability, particularly towards those sequences containing adjacent guanine residues.
Cryptic MCAT enhancer regulation in fibroblasts and smooth muscle cells. Suppression of TEF-1 mediated activation by the single-stranded DNA-binding proteins, Pur alpha, Pur beta, and MSY1.

PubMed

Carlini, Leslie E; Getz, Michael J; Strauch, Arthur R; Kelm, Robert J

2002-03-08

An asymmetric polypurine-polypyrimidine cis-element located in the 5' region of the mouse vascular smooth muscle alpha-actin gene serves as a binding site for multiple proteins with specific affinity for either single- or double-stranded DNA. Here, we test the hypothesis that single-stranded DNA-binding proteins are responsible for preventing a cryptic MCAT enhancer centered within this element from cooperating with a nearby serum response factor-interacting CArG motif to trans-activate the minimal promoter in fibroblasts and smooth muscle cells. DNA binding studies revealed that the core MCAT sequence mediates binding of transcription enhancer factor-1 to the double-stranded polypurine-polypyrimidine element while flanking nucleotides account for interaction of Pur alpha and Pur beta with the purine-rich strand and MSY1 with the complementary pyrimidine-rich strand. Mutations that selectively impaired high affinity single-stranded DNA binding by fibroblast or smooth muscle cell-derived Pur alpha, Pur beta, and MSY1 in vitro, released the cryptic MCAT enhancer from repression in transfected cells. Additional experiments indicated that Pur alpha, Pur beta, and MSY1 also interact specifically, albeit weakly, with double-stranded DNA and with transcription enhancer factor-1. These results are consistent with two plausible models of cryptic MCAT enhancer regulation by Pur alpha, Pur beta, and MSY1 involving either competitive single-stranded DNA binding or masking of MCAT-bound transcription enhancer factor-1.
Screening for Protein-DNA Interactions by Automatable DNA-Protein Interaction ELISA

PubMed Central

Schüssler, Axel; Kolukisaoglu, H. Üner; Koch, Grit; Wallmeroth, Niklas; Hecker, Andreas; Thurow, Kerstin; Zell, Andreas; Harter, Klaus; Wanke, Dierk

2013-01-01

DNA-binding proteins (DBPs), such as transcription factors, constitute about 10% of the protein-coding genes in eukaryotic genomes and play pivotal roles in the regulation of chromatin structure and gene expression by binding to short stretches of DNA. Despite their number and importance, only for a minor portion of DBPs the binding sequence had been disclosed. Methods that allow the de novo identification of DNA-binding motifs of known DBPs, such as protein binding microarray technology or SELEX, are not yet suited for high-throughput and automation. To close this gap, we report an automatable DNA-protein-interaction (DPI)-ELISA screen of an optimized double-stranded DNA (dsDNA) probe library that allows the high-throughput identification of hexanucleotide DNA-binding motifs. In contrast to other methods, this DPI-ELISA screen can be performed manually or with standard laboratory automation. Furthermore, output evaluation does not require extensive computational analysis to derive a binding consensus. We could show that the DPI-ELISA screen disclosed the full spectrum of binding preferences for a given DBP. As an example, AtWRKY11 was used to demonstrate that the automated DPI-ELISA screen revealed the entire range of in vitro binding preferences. In addition, protein extracts of AtbZIP63 and the DNA-binding domain of AtWRKY33 were analyzed, which led to a refinement of their known DNA-binding consensi. Finally, we performed a DPI-ELISA screen to disclose the DNA-binding consensus of a yet uncharacterized putative DBP, AtTIFY1. A palindromic TGATCA-consensus was uncovered and we could show that the GATC-core is compulsory for AtTIFY1 binding. This specific interaction between AtTIFY1 and its DNA-binding motif was confirmed by in vivo plant one-hybrid assays in protoplasts. Thus, the value and applicability of the DPI-ELISA screen for de novo binding site identification of DBPs, also under automatized conditions, is a promising approach for a deeper understanding of gene regulation in any organism of choice. PMID:24146751
Exo-Dye-based assay for rapid, inexpensive, and sensitive detection of DNA-binding proteins.

PubMed

Chen, Zaozao; Ji, Meiju; Hou, Peng; Lu, Zuhong

2006-07-07

We reported herein a rapid, inexpensive, and sensitive technique for detecting sequence-specific DNA-binding proteins. In this technique, the common exonuclease III (ExoIII) footprinting assay is coupled with simple SYBR Green I staining for monitoring the activities of DNA-binding proteins. We named this technique as ExoIII-Dye-based assay. In this assay, a duplex probe was designed to detect DNA-binding protein. One side of the probe contains one protein-binding site, and another side of it contains five protruding bases at 3' end for protection from ExoIII digestion. If a target protein is present, it will bind to binding sites of probe and produce a physical hindrance to ExoIII, which protects the duplex probe from digestion of ExoIII. SYBR Green I will bind to probe, which results in high fluorescence intensity. On the contrary, in the absence of the target protein, the naked duplex probe will be degraded by ExoIII. SYBR Green I will be released, which results in a low fluorescence intensity. In this study, we employed this technique to successfully detect transcription factor NF-kappaB in crude cell extracts. Moreover, it could also be used to evaluate the binding affinity of NF-kappaB. This technique has therefore wide potential application in research, medical diagnosis, and drug discovery.
Electrostatic study of Alanine mutational effects on transcription: application to GATA-3:DNA interaction complex.

PubMed

El-Assaad, Atlal; Dawy, Zaher; Nemer, Georges

2015-01-01

Protein-DNA interaction is of fundamental importance in molecular biology, playing roles in functions as diverse as DNA transcription, DNA structure formation, and DNA repair. Protein-DNA association is also important in medicine; understanding Protein-DNA binding kinetics can assist in identifying disease root causes which can contribute to drug development. In this perspective, this work focuses on the transcription process by the GATA Transcription Factor (TF). GATA TF binds to DNA promoter region represented by `G,A,T,A' nucleotides sequence, and initiates transcription of target genes. When proper regulation fails due to some mutations on the GATA TF protein sequence or on the DNA promoter sequence (weak promoter), deregulation of the target genes might lead to various disorders. In this study, we aim to understand the electrostatic mechanism behind GATA TF and DNA promoter interactions, in order to predict Protein-DNA binding in the presence of mutations, while elaborating on non-covalent binding kinetics. To generate a family of mutants for the GATA:DNA complex, we replaced every charged amino acid, one at a time, with a neutral amino acid like Alanine (Ala). We then applied Poisson-Boltzmann electrostatic calculations feeding into free energy calculations, for each mutation. These calculations delineate the contribution to binding from each Ala-replaced amino acid in the GATA:DNA interaction. After analyzing the obtained data in view of a two-step model, we are able to identify potential key amino acids in binding. Finally, we applied the model to GATA-3:DNA (crystal structure with PDB-ID: 3DFV) binding complex and validated it against experimental results from the literature.
Multiple conformations of the cytidine repressor DNA-binding domain coalesce to one upon recognition of a specific DNA surface.

PubMed

Moody, Colleen L; Tretyachenko-Ladokhina, Vira; Laue, Thomas M; Senear, Donald F; Cocco, Melanie J

2011-08-09

The cytidine repressor (CytR) is a member of the LacR family of bacterial repressors with distinct functional features. The Escherichia coli CytR regulon comprises nine operons whose palindromic operators vary in both sequence and, most significantly, spacing between the recognition half-sites. This suggests a strong likelihood that protein folding would be coupled to DNA binding as a mechanism to accommodate the variety of different operator architectures to which CytR is targeted. Such coupling is a common feature of sequence-specific DNA-binding proteins, including the LacR family repressors; however, there are no significant structural rearrangements upon DNA binding within the three-helix DNA-binding domains (DBDs) studied to date. We used nuclear magnetic resonance (NMR) spectroscopy to characterize the CytR DBD free in solution and to determine the high-resolution structure of a CytR DBD monomer bound specifically to one DNA half-site of the uridine phosphorylase (udp) operator. We find that the free DBD populates multiple distinct conformations distinguished by up to four sets of NMR peaks per residue. This structural heterogeneity is previously unknown in the LacR family. These stable structures coalesce into a single, more stable udp-bound form that features a three-helix bundle containing a canonical helix-turn-helix motif. However, this structure differs from all other LacR family members whose structures are known with regard to the packing of the helices and consequently their relative orientations. Aspects of CytR activity are unique among repressors; we identify here structural properties that are also distinct and that might underlie the different functional properties. © 2011 American Chemical Society
TALE-PvuII Fusion Proteins – Novel Tools for Gene Targeting

PubMed Central

Yanik, Mert; Alzubi, Jamal; Lahaye, Thomas; Cathomen, Toni; Pingoud, Alfred; Wende, Wolfgang

2013-01-01

Zinc finger nucleases (ZFNs) consist of zinc fingers as DNA-binding module and the non-specific DNA-cleavage domain of the restriction endonuclease FokI as DNA-cleavage module. This architecture is also used by TALE nucleases (TALENs), in which the DNA-binding modules of the ZFNs have been replaced by DNA-binding domains based on transcription activator like effector (TALE) proteins. Both TALENs and ZFNs are programmable nucleases which rely on the dimerization of FokI to induce double-strand DNA cleavage at the target site after recognition of the target DNA by the respective DNA-binding module. TALENs seem to have an advantage over ZFNs, as the assembly of TALE proteins is easier than that of ZFNs. Here, we present evidence that variant TALENs can be produced by replacing the catalytic domain of FokI with the restriction endonuclease PvuII. These fusion proteins recognize only the composite recognition site consisting of the target site of the TALE protein and the PvuII recognition sequence (addressed site), but not isolated TALE or PvuII recognition sites (unaddressed sites), even at high excess of protein over DNA and long incubation times. In vitro, their preference for an addressed over an unaddressed site is > 34,000-fold. Moreover, TALE-PvuII fusion proteins are active in cellula with minimal cytotoxicity. PMID:24349308
Molecular Dynamics Simulations of DNA-Free and DNA-Bound TAL Effectors

PubMed Central

Wan, Hua; Hu, Jian-ping; Li, Kang-shun; Tian, Xu-hong; Chang, Shan

2013-01-01

TAL (transcriptional activator-like) effectors (TALEs) are DNA-binding proteins, containing a modular central domain that recognizes specific DNA sequences. Recently, the crystallographic studies of TALEs revealed the structure of DNA-recognition domain. In this article, molecular dynamics (MD) simulations are employed to study two crystal structures of an 11.5-repeat TALE, in the presence and absence of DNA, respectively. The simulated results indicate that the specific binding of RVDs (repeat-variable diresidues) with DNA leads to the markedly reduced fluctuations of tandem repeats, especially at the two ends. In the DNA-bound TALE system, the base-specific interaction is formed mainly by the residue at position 13 within a TAL repeat. Tandem repeats with weak RVDs are unfavorable for the TALE-DNA binding. These observations are consistent with experimental studies. By using principal component analysis (PCA), the dominant motions are open-close movements between the two ends of the superhelical structure in both DNA-free and DNA-bound TALE systems. The open-close movements are found to be critical for the recognition and binding of TALE-DNA based on the analysis of free energy landscape (FEL). The conformational analysis of DNA indicates that the 5′ end of DNA target sequence has more remarkable structural deformability than the other sites. Meanwhile, the conformational change of DNA is likely associated with the specific interaction of TALE-DNA. We further suggest that the arrangement of N-terminal repeats with strong RVDs may help in the design of efficient TALEs. This study provides some new insights into the understanding of the TALE-DNA recognition mechanism. PMID:24130757
Sequence Dependencies of DNA Deformability and Hydration in the Minor Groove

PubMed Central

Yonetani, Yoshiteru; Kono, Hidetoshi

2009-01-01

Abstract DNA deformability and hydration are both sequence-dependent and are essential in specific DNA sequence recognition by proteins. However, the relationship between the two is not well understood. Here, systematic molecular dynamics simulations of 136 DNA sequences that differ from each other in their central tetramer revealed that sequence dependence of hydration is clearly correlated with that of deformability. We show that this correlation can be illustrated by four typical cases. Most rigid basepair steps are highly likely to form an ordered hydration pattern composed of one water molecule forming a bridge between the bases of distinct strands, but a few exceptions favor another ordered hydration composed of two water molecules forming such a bridge. Steps with medium deformability can display both of these hydration patterns with frequent transition. Highly flexible steps do not have any stable hydration pattern. A detailed picture of this correlation demonstrates that motions of hydration water molecules and DNA bases are tightly coupled with each other at the atomic level. These results contribute to our understanding of the entropic contribution from water molecules in protein or drug binding and could be applied for the purpose of predicting binding sites. PMID:19686662
DNA binding of the p21 repressor ZBTB2 is inhibited by cytosine hydroxymethylation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lafaye, Céline; Barbier, Ewa; Miscioscia, Audrey

2014-03-28

Highlights: • 5-hmC epigenetic modification is measurable in HeLa, SH-SY5Y and UT7-MPL cell lines. • ZBTB2 binds to DNA probes containing 5-mC but not to sequences containing 5-hmC. • This differential binding is verified with DNA sequences involved in p21 regulation. - Abstract: Recent studies have demonstrated that the modified base 5-hydroxymethylcytosine (5-hmC) is detectable at various rates in DNA extracted from human tissues. This oxidative product of 5-methylcytosine (5-mC) constitutes a new and important actor of epigenetic mechanisms. We designed a DNA pull down assay to trap and identify nuclear proteins bound to 5-hmC and/or 5-mC. We applied thismore » strategy to three cancerous cell lines (HeLa, SH-SY5Y and UT7-MPL) in which we also measured 5-mC and 5-hmC levels by HPLC-MS/MS. We found that the putative oncoprotein Zinc finger and BTB domain-containing protein 2 (ZBTB2) is associated with methylated DNA sequences and that this interaction is inhibited by the presence of 5-hmC replacing 5-mC. As published data mention ZBTB2 recognition of p21 regulating sequences, we verified that this sequence specific binding was also alleviated by 5-hmC. ZBTB2 being considered as a multifunctional cell proliferation activator, notably through p21 repression, this work points out new epigenetic processes potentially involved in carcinogenesis.« less
The octamer-binding proteins form multi-protein--DNA complexes with the HSV alpha TIF regulatory protein.

PubMed Central

Kristie, T M; LeBowitz, J H; Sharp, P A

1989-01-01

The herpes simplex virus transactivator, alpha TIF, stimulates transcription of the alpha/immediate early genes via a cis-acting site containing an octamer element and a conserved flanking sequence. The alpha TIF protein, produced in a baculovirus expression system, nucleates the formation of at least two DNA--protein complexes on this regulatory element. Both of these complexes contain the ubiquitous Oct-1 protein, whose POU domain alone is sufficient to allow assembly of the alpha TIF-dependent complexes. A second member of the POU domain family, the lymphoid specific Oct-2 protein, can also be assembled into similar complexes at high concentrations of alpha TIF protein. These complexes contain at least two cellular proteins in addition to Oct-1. One of these proteins is present in both insect and HeLa cells and probably recognizes sequences in the cis element. The second cellular protein, only present in HeLa cells, probably binds by protein-protein interactions. Images PMID:2556266

The octamer-binding proteins form multi-protein--DNA complexes with the HSV alpha TIF regulatory protein.

PubMed

Kristie, T M; LeBowitz, J H; Sharp, P A

1989-12-20

The herpes simplex virus transactivator, alpha TIF, stimulates transcription of the alpha/immediate early genes via a cis-acting site containing an octamer element and a conserved flanking sequence. The alpha TIF protein, produced in a baculovirus expression system, nucleates the formation of at least two DNA--protein complexes on this regulatory element. Both of these complexes contain the ubiquitous Oct-1 protein, whose POU domain alone is sufficient to allow assembly of the alpha TIF-dependent complexes. A second member of the POU domain family, the lymphoid specific Oct-2 protein, can also be assembled into similar complexes at high concentrations of alpha TIF protein. These complexes contain at least two cellular proteins in addition to Oct-1. One of these proteins is present in both insect and HeLa cells and probably recognizes sequences in the cis element. The second cellular protein, only present in HeLa cells, probably binds by protein-protein interactions.
Characterization of DNA-protein interactions using high-throughput sequencing data from pulldown experiments

NASA Astrophysics Data System (ADS)

Moreland, Blythe; Oman, Kenji; Curfman, John; Yan, Pearlly; Bundschuh, Ralf

Methyl-binding domain (MBD) protein pulldown experiments have been a valuable tool in measuring the levels of methylated CpG dinucleotides. Due to the frequent use of this technique, high-throughput sequencing data sets are available that allow a detailed quantitative characterization of the underlying interaction between methylated DNA and MBD proteins. Analyzing such data sets, we first found that two such proteins cannot bind closer to each other than 2 bp, consistent with structural models of the DNA-protein interaction. Second, the large amount of sequencing data allowed us to find rather weak but nevertheless clearly statistically significant sequence preferences for several bases around the required CpG. These results demonstrate that pulldown sequencing is a high-precision tool in characterizing DNA-protein interactions. This material is based upon work supported by the National Science Foundation under Grant No. DMR-1410172.
Increased expression of Aspergillus parasiticus aflR, encoding a sequence-specific DNA-binding protein, relieves nitrate inhibition of aflatoxin biosynthesis.

PubMed Central

Chang, P K; Ehrlich, K C; Yu, J; Bhatnagar, D; Cleveland, T E

1995-01-01

The aflR gene from Aspergillus parasiticus and Aspergillus flavus may be involved in the regulation of aflatoxin biosynthesis. The aflR gene product, AFLR, possesses a GAL4-type binuclear zinc finger DNA-binding domain. A transformant, SU1-N3 (pHSP), containing an additional copy of aflR, showed increased transcription of aflR and the aflatoxin pathway structural genes, nor-1, ver-1, and omt-1, when cells were grown in nitrate medium, which normally suppresses aflatoxin production. Electrophoretic mobility shift assays showed that the recombinant protein containing the DNA-binding domain, AFLR1, bound specifically to the palindromic sequence, TTAGGCCTAA, 120 bp upstream of the AFLR translation start site. Expression of aflR thus appears to be autoregulated. Increased expression of aflatoxin biosynthetic genes in the transformant might result from an elevated basal level of AFLR, allowing it to overcome nitrate inhibition and to bind to the aflR promotor region, thereby initiating aflatoxin biosynthesis. Results further suggest that aflR is involved in the regulation of multiple parts of the aflatoxin biosynthetic pathway. PMID:7793958
Study of intermolecular contacts in the proline-rich homeodomain (PRH)-DNA complex using molecular dynamics simulations.

PubMed

Jalili, Seifollah; Karami, Leila

2012-03-01

The proline-rich homeodomain (PRH)-DNA complex consists of a protein with 60 residues and a 13-base-pair DNA. The PRH protein is a transcription factor that plays a key role in the regulation of gene expression. PRH is a significant member of the Q50 class of homeodomain proteins. The homeodomain section of PRH is essential for binding to DNA and mediates sequence-specific DNA binding. Three 20-ns molecular dynamics (MD) simulations (free protein, free DNA and protein-DNA complex) in explicit solvent water were performed to elucidate the intermolecular contacts in the PRH-DNA complex and the role of dynamics of water molecules forming water-mediated contacts. The simulation provides a detailed explanation of the trajectory of hydration water molecules. The simulations show that some water molecules in the protein-DNA interface exchange with bulk waters. The simulation identifies that most of the contacts consisted of direct interactions between the protein and DNA including specific and non-specific contacts, but several water-mediated polar contacts were also observed. The specific interaction between Gln50 and C18 and water-mediated hydrogen bond between Gln50 and T7 were found to be present during almost the entire time of the simulation. These results show good consistency with experimental and previous computational studies. Structural properties such as root-mean-square deviations (RMSD), root-mean-square fluctuations (RMSF) and secondary structure were also analyzed as a function of time. Analyses of the trajectories showed that the dynamic fluctuations of both the protein and the DNA were lowered by the complex formation.
Structure solution of DNA-binding proteins and complexes with ARCIMBOLDO libraries

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pröpper, Kevin; Instituto de Biologia Molecular de Barcelona; Meindl, Kathrin

2014-06-01

The structure solution of DNA-binding protein structures and complexes based on the combination of location of DNA-binding protein motif fragments with density modification in a multi-solution frame is described. Protein–DNA interactions play a major role in all aspects of genetic activity within an organism, such as transcription, packaging, rearrangement, replication and repair. The molecular detail of protein–DNA interactions can be best visualized through crystallography, and structures emphasizing insight into the principles of binding and base-sequence recognition are essential to understanding the subtleties of the underlying mechanisms. An increasing number of high-quality DNA-binding protein structure determinations have been witnessed despite themore » fact that the crystallographic particularities of nucleic acids tend to pose specific challenges to methods primarily developed for proteins. Crystallographic structure solution of protein–DNA complexes therefore remains a challenging area that is in need of optimized experimental and computational methods. The potential of the structure-solution program ARCIMBOLDO for the solution of protein–DNA complexes has therefore been assessed. The method is based on the combination of locating small, very accurate fragments using the program Phaser and density modification with the program SHELXE. Whereas for typical proteins main-chain α-helices provide the ideal, almost ubiquitous, small fragments to start searches, in the case of DNA complexes the binding motifs and DNA double helix constitute suitable search fragments. The aim of this work is to provide an effective library of search fragments as well as to determine the optimal ARCIMBOLDO strategy for the solution of this class of structures.« less
Characterisation of a DNA sequence element that directs Dictyostelium stalk cell-specific gene expression.

PubMed

Ceccarelli, A; Zhukovskaya, N; Kawata, T; Bozzaro, S; Williams, J

2000-12-01

The ecmB gene of Dictyostelium is expressed at culmination both in the prestalk cells that enter the stalk tube and in ancillary stalk cell structures such as the basal disc. Stalk tube-specific expression is regulated by sequence elements within the cap-site proximal part of the promoter, the stalk tube (ST) promoter region. Dd-STATa, a member of the STAT transcription factor family, binds to elements present in the ST promoter-region and represses transcription prior to entry into the stalk tube. We have characterised an activatory DNA sequence element, that lies distal to the repressor elements and that is both necessary and sufficient for expression within the stalk tube. We have mapped this activator to a 28 nucleotide region (the 28-mer) within which we have identified a GA-containing sequence element that is required for efficient gene transcription. The Dd-STATa protein binds to the 28-mer in an in vitro binding assay, and binding is dependent upon the GA-containing sequence. However, the ecmB gene is expressed in a Dd-STATa null mutant, therefore Dd-STATa cannot be responsible for activating the 28-mer in vivo. Instead, we identified a distinct 28-mer binding activity in nuclear extracts from the Dd-STATa null mutant, the activity of this GA binding activity being largely masked in wild type extracts by the high affinity binding of the Dd-STATa protein. We suggest, that in addition to the long range repression exerted by binding to the two known repressor sites, Dd-STATa inhibits transcription by direct competition with this putative activator for binding to the GA sequence.
Understanding the structural and dynamic consequences of DNA epigenetic modifications: Computational insights into cytosine methylation and hydroxymethylation

PubMed Central

Carvalho, Alexandra T P; Gouveia, Leonor; Kanna, Charan Raju; Wärmländer, Sebastian K T S; Platts, Jamie A; Kamerlin, Shina Caroline Lynn

2014-01-01

We report a series of molecular dynamics (MD) simulations of up to a microsecond combined simulation time designed to probe epigenetically modified DNA sequences. More specifically, by monitoring the effects of methylation and hydroxymethylation of cytosine in different DNA sequences, we show, for the first time, that DNA epigenetic modifications change the molecule's dynamical landscape, increasing the propensity of DNA toward different values of twist and/or roll/tilt angles (in relation to the unmodified DNA) at the modification sites. Moreover, both the extent and position of different modifications have significant effects on the amount of structural variation observed. We propose that these conformational differences, which are dependent on the sequence environment, can provide specificity for protein binding. PMID:25625845
Interactions of DNA binding proteins with G-Quadruplex structures at the single molecule level

NASA Astrophysics Data System (ADS)

Ray, Sujay

Guanine-rich nucleic acid (DNA/RNA) sequences can form non-canonical secondary structures, known as G-quadruplex (GQ). Numerous in vivo and in vitro studies have demonstrated formation of these structures in telomeric and non-telomeric regions of the genome. Telomeric GQs protect the chromosome ends whereas non-telomeric GQs either act as road blocks or recognition sites for DNA metabolic machinery. These observations suggest the significance of these structures in regulation of different metabolic processes, such as replication and repair. GQs are typically thermodynamically more stable than the corresponding Watson-Crick base pairing formed by G-rich and C-rich strands, making protein activity a crucial factor for their destabilization. Inside the cell, GQs interact with different proteins and their enzymatic activity is the determining factor for their stability. We studied interactions of several proteins with GQs to understand the underlying principles of protein-GQ interactions using single-molecule FRET and other biophysical techniques. Replication Protein-A (RPA), a single stranded DNA (ssDNA) binding protein, is known to posses GQ unfolding activity. First, we compared the thermal stability of three potentially GQ-forming DNA sequences (PQS) to their stability against RPA-mediated unfolding. One of these sequences is the human telomeric repeat and the other two, located in the promoter region of tyrosine hydroxylase gene, are highly heterogeneous sequences that better represent PQS in the genome. The thermal stability of these structures do not necessarily correlate with their stability against protein-mediated unfolding. We conclude that thermal stability is not necessarily an adequate criterion for predicting the physiological viability of GQ structures. To determine the critical structural factors that influence protein-GQ interactions we studied two groups of GQ structures that have systematically varying loop lengths and number of G-tetrad layers. We observed a linear increase in the steady-state stability of the GQ against RPA-mediated unfolding with increasing number of layers or decreasing loop length. The stability demonstrated by different GQ structures varied by at least three orders of magnitude. Finally, we studied another protein-GQ system where a protein complex works synergistically with a GQ to suppress DNA damage signals by preventing RPA to bind to telomeric DNA. Human telomeres that terminate with a single-stranded 3' G-overhang can be recognized as a DNA damage site by RPA. The protection of telomere-1 (POT1) and POT1-interacting protein (TPP1) heterodimer, binds specifically to telomeric DNA and protects it against RPA binding. Using model telomeric DNA, we studied the competition between POT1/TPP1 and RPA to access telomeric GQs in vitro. Under physiological salt and pH conditions, POT1/TPP1 stably load to a minimal DNA sequence adjacent to a folded GQ and unfolds the anti-parallel GQ as the parallel conformation remains folded. We showed that GQ formation of telomeres enhances the ability of POT1/TPP1 to block RPA's access to telomeres by two orders of magnitude and contributes to suppress DNA damage signals.
ZifBASE: a database of zinc finger proteins and associated resources.

PubMed

Jayakanthan, Mannu; Muthukumaran, Jayaraman; Chandrasekar, Sanniyasi; Chawla, Konika; Punetha, Ankita; Sundar, Durai

2009-09-09

Information on the occurrence of zinc finger protein motifs in genomes is crucial to the developing field of molecular genome engineering. The knowledge of their target DNA-binding sequences is vital to develop chimeric proteins for targeted genome engineering and site-specific gene correction. There is a need to develop a computational resource of zinc finger proteins (ZFP) to identify the potential binding sites and its location, which reduce the time of in vivo task, and overcome the difficulties in selecting the specific type of zinc finger protein and the target site in the DNA sequence. ZifBASE provides an extensive collection of various natural and engineered ZFP. It uses standard names and a genetic and structural classification scheme to present data retrieved from UniProtKB, GenBank, Protein Data Bank, ModBase, Protein Model Portal and the literature. It also incorporates specialized features of ZFP including finger sequences and positions, number of fingers, physiochemical properties, classes, framework, PubMed citations with links to experimental structures (PDB, if available) and modeled structures of natural zinc finger proteins. ZifBASE provides information on zinc finger proteins (both natural and engineered ones), the number of finger units in each of the zinc finger proteins (with multiple fingers), the synergy between the adjacent fingers and their positions. Additionally, it gives the individual finger sequence and their target DNA site to which it binds for better and clear understanding on the interactions of adjacent fingers. The current version of ZifBASE contains 139 entries of which 89 are engineered ZFPs, containing 3-7F totaling to 296 fingers. There are 50 natural zinc finger protein entries ranging from 2-13F, totaling to 307 fingers. It has sequences and structures from literature, Protein Data Bank, ModBase and Protein Model Portal. The interface is cross linked to other public databases like UniprotKB, PDB, ModBase and Protein Model Portal and PubMed for making it more informative. A database is established to maintain the information of the sequence features, including the class, framework, number of fingers, residues, position, recognition site and physio-chemical properties (molecular weight, isoelectric point) of both natural and engineered zinc finger proteins and dissociation constant of few. ZifBASE can provide more effective and efficient way of accessing the zinc finger protein sequences and their target binding sites with the links to their three-dimensional structures. All the data and functions are available at the advanced web-based search interface http://web.iitd.ac.in/~sundar/zifbase.
The 193-base pair Gsg2 (haspin) promoter region regulates germ cell-specific expression bidirectionally and synchronously.

PubMed

Tokuhiro, Keizo; Miyagawa, Yasushi; Yamada, Shuichi; Hirose, Mika; Ohta, Hiroshi; Nishimune, Yoshitake; Tanaka, Hiromitsu

2007-03-01

Haspin is a unique protein kinase expressed predominantly in haploid male germ cells. The genomic structure of haspin (Gsg2) has revealed it to be intronless, and the entire transcription unit is in an intron of the integrin alphaE (Itgae) gene. Transcription occurs from a bidirectional promoter that also generates an alternatively spliced integrin alphaE-derived mRNA (Aed). In mice, the testis-specific alternative splicing of Aed is expressed bidirectionally downstream from the Gsg2 transcription initiation site, and a segment consisting of 26 bp transcribes both genomic DNA strands between Gsg2 and the Aed transcription initiation sites. To investigate the mechanisms for this unique gene regulation, we cloned and characterized the Gsg2 promoter region. The 193-bp genomic fragment from the 5' end of the Gsg2 and Aed genes, fused with EGFP and DsRed genes, drove the expression of both proteins in haploid germ cells of transgenic mice. This promoter element contained only a GC-rich sequence, and not the previously reported DNA sequences known to bind various transcription factors--with the exception of E2F1, TCFAP2A1 (AP2), and SP1. Here, we show that the 193-bp DNA sequence is sufficient for the specific, bidirectional, and synchronous expression in germ cells in the testis. We also demonstrate the existence of germ cell nuclear factors specifically bound to the promoter sequence. This activity may be regulated by binding to the promoter sequence with germ cell-specific nuclear complex(es) without regulation via DNA methylation.
Structural and Functional Insights into WRKY3 and WRKY4 Transcription Factors to Unravel the WRKY–DNA (W-Box) Complex Interaction in Tomato (Solanum lycopersicum L.). A Computational Approach

PubMed Central

Aamir, Mohd; Singh, Vinay K.; Meena, Mukesh; Upadhyay, Ram S.; Gupta, Vijai K.; Singh, Surendra

2017-01-01

The WRKY transcription factors (TFs), play crucial role in plant defense response against various abiotic and biotic stresses. The role of WRKY3 and WRKY4 genes in plant defense response against necrotrophic pathogens is well-reported. However, their functional annotation in tomato is largely unknown. In the present work, we have characterized the structural and functional attributes of the two identified tomato WRKY transcription factors, WRKY3 (SlWRKY3), and WRKY4 (SlWRKY4) using computational approaches. Arabidopsis WRKY3 (AtWRKY3: NP_178433) and WRKY4 (AtWRKY4: NP_172849) protein sequences were retrieved from TAIR database and protein BLAST was done for finding their sequential homologs in tomato. Sequence alignment, phylogenetic classification, and motif composition analysis revealed the remarkable sequential variation between, these two WRKYs. The tomato WRKY3 and WRKY4 clusters with Solanum pennellii showing the monophyletic origin and evolution from their wild homolog. The functional domain region responsible for sequence specific DNA-binding occupied in both proteins were modeled [using AtWRKY4 (PDB ID:1WJ2) and AtWRKY1 (PDBID:2AYD) as template protein structures] through homology modeling using Discovery Studio 3.0. The generated models were further evaluated for their accuracy and reliability based on qualitative and quantitative parameters. The modeled proteins were found to satisfy all the crucial energy parameters and showed acceptable Ramachandran statistics when compared to the experimentally resolved NMR solution structures and/or X-Ray diffracted crystal structures (templates). The superimposition of the functional WRKY domains from SlWRKY3 and SlWRKY4 revealed remarkable structural similarity. The sequence specific DNA binding for two WRKYs was explored through DNA-protein interaction using Hex Docking server. The interaction studies found that SlWRKY4 binds with the W-box DNA through WRKYGQK with Tyr408, Arg409, and Lys419 with the initial flanking sequences also get involved in binding. In contrast, the SlWRKY3 made interaction with RKYGQK along with the residues from zinc finger motifs. Protein-protein interactions studies were done using STRING version 10.0 to explore all the possible protein partners involved in associative functional interaction networks. The Gene ontology enrichment analysis revealed the functional dimension and characterized the identified WRKYs based on their functional annotation. PMID:28611792
A novel begomovirus isolated from sida contains putative cis- and trans-acting replication specificity determinants that have evolved independently in several geographical lineages.

PubMed

Mauricio-Castillo, J A; Torres-Herrera, S I; Cárdenas-Conejo, Y; Pastor-Palacios, G; Méndez-Lozano, J; Argüello-Astorga, G R

2014-09-01

A novel begomovirus isolated from a Sida rhombifolia plant collected in Sinaloa, Mexico, was characterized. The genomic components of sida mosaic Sinaloa virus (SiMSinV) shared highest sequence identity with DNA-A and DNA-B components of chino del tomate virus (CdTV), suggesting a vertical evolutionary relationship between these viruses. However, recombination analysis indicated that a short segment of SiMSinV DNA-A encompassing the plus-strand replication origin and the 5´-proximal 43 codons of the Rep gene was derived from tomato mottle Taino virus (ToMoTV). Accordingly, the putative cis- and trans-acting replication specificity determinants of SiMSinV were identical to those of ToMoTV but differed from those of CdTV. Modeling of the SiMSinV and CdTV Rep proteins revealed significant differences in the region comprising the small β1/β5 sheet element, where five putative DNA-binding specificity determinants (SPDs) of Rep (i.e., amino acid residues 5, 8, 10, 69 and 71) were previously identified. Computer-assisted searches of public databases led to identification of 33 begomoviruses from three continents encoding proteins with SPDs identical to those of the Rep encoded by SiMSinV. Sequence analysis of the replication origins demonstrated that all 33 begomoviruses harbor potential Rep-binding sites identical to those of SiMSinV. These data support the hypothesis that the Rep β1/β5 sheet region determines specificity of this protein for DNA replication origin sequences.
Expression cloning and characterization of a novel gene that encodes the RNA-binding protein FAU-1 from Pyrococcus furiosus.

PubMed Central

Kanai, Akio; Oida, Hanako; Matsuura, Nana; Doi, Hirofumi

2003-01-01

We systematically screened a genomic DNA library to identify proteins of the hyperthermophilic archaeon Pyrococcus furiosus using an expression cloning method. One gene product, which we named FAU-1 (P. furiosus AU-binding), demonstrated the strongest binding activity of all the genomic library-derived proteins tested against an AU-rich RNA sequence. The protein was purified to near homogeneity as a 54 kDa single polypeptide, and the gene locus corresponding to this FAU-1 activity was also sequenced. The FAU-1 gene encoded a 472-amino-acid protein that was characterized by highly charged domains consisting of both acidic and basic amino acids. The N-terminal half of the gene had a degree of similarity (25%) with RNase E from Escherichia coli. Five rounds of RNA-binding-site selection and footprinting analysis showed that the FAU-1 protein binds specifically to the AU-rich sequence in a loop region of a possible RNA ligand. Moreover, we demonstrated that the FAU-1 protein acts as an oligomer, and mainly as a trimer. These results showed that the FAU-1 protein is a novel heat-stable protein with an RNA loop-binding characteristic. PMID:12614195
Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities

PubMed Central

Narasimhan, Kamesh; Lambert, Samuel A; Yang, Ally WH; Riddell, Jeremy; Mnaimneh, Sanie; Zheng, Hong; Albu, Mihai; Najafabadi, Hamed S; Reece-Hoyes, John S; Fuxman Bass, Juan I; Walhout, Albertha JM; Weirauch, Matthew T; Hughes, Timothy R

2015-01-01

Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs. DOI: http://dx.doi.org/10.7554/eLife.06967.001 PMID:25905672
Sequence Discrimination by Alternatively Spliced Isoforms of a DNA Binding Zinc Finger Domain

NASA Astrophysics Data System (ADS)

Gogos, Joseph A.; Hsu, Tien; Bolton, Jesse; Kafatos, Fotis C.

1992-09-01

Two major developmentally regulated isoforms of the Drosophila chorion transcription factor CF2 differ by an extra zinc finger within the DNA binding domain. The preferred DNA binding sites were determined and are distinguished by an internal duplication of TAT in the site recognized by the isoform with the extra finger. The results are consistent with modular interactions between zinc fingers and trinucleotides and also suggest rules for recognition of AT-rich DNA sites by zinc finger proteins. The results show how modular finger interactions with trinucleotides can be used, in conjunction with alternative splicing, to alter the binding specificity and increase the spectrum of sites recognized by a DNA binding domain. Thus, CF2 may potentially regulate distinct sets of target genes during development.
A force-based, parallel assay for the quantification of protein-DNA interactions.

PubMed

Limmer, Katja; Pippig, Diana A; Aschenbrenner, Daniela; Gaub, Hermann E

2014-01-01

Analysis of transcription factor binding to DNA sequences is of utmost importance to understand the intricate regulatory mechanisms that underlie gene expression. Several techniques exist that quantify DNA-protein affinity, but they are either very time-consuming or suffer from possible misinterpretation due to complicated algorithms or approximations like many high-throughput techniques. We present a more direct method to quantify DNA-protein interaction in a force-based assay. In contrast to single-molecule force spectroscopy, our technique, the Molecular Force Assay (MFA), parallelizes force measurements so that it can test one or multiple proteins against several DNA sequences in a single experiment. The interaction strength is quantified by comparison to the well-defined rupture stability of different DNA duplexes. As a proof-of-principle, we measured the interaction of the zinc finger construct Zif268/NRE against six different DNA constructs. We could show the specificity of our approach and quantify the strength of the protein-DNA interaction.
Probing the Potential Role of Non-B DNA Structures at Yeast Meiosis-Specific DNA Double-Strand Breaks.

PubMed

Kshirsagar, Rucha; Khan, Krishnendu; Joshi, Mamata V; Hosur, Ramakrishna V; Muniyappa, K

2017-05-23

A plethora of evidence suggests that different types of DNA quadruplexes are widely present in the genome of all organisms. The existence of a growing number of proteins that selectively bind and/or process these structures underscores their biological relevance. Moreover, G-quadruplex DNA has been implicated in the alignment of four sister chromatids by forming parallel guanine quadruplexes during meiosis; however, the underlying mechanism is not well defined. Here we show that a G/C-rich motif associated with a meiosis-specific DNA double-strand break (DSB) in Saccharomyces cerevisiae folds into G-quadruplex, and the C-rich sequence complementary to the G-rich sequence forms an i-motif. The presence of G-quadruplex or i-motif structures upstream of the green fluorescent protein-coding sequence markedly reduces the levels of gfp mRNA expression in S. cerevisiae cells, with a concomitant decrease in green fluorescent protein abundance, and blocks primer extension by DNA polymerase, thereby demonstrating the functional significance of these structures. Surprisingly, although S. cerevisiae Hop1, a component of synaptonemal complex axial/lateral elements, exhibits strong affinity to G-quadruplex DNA, it displays a much weaker affinity for the i-motif structure. However, the Hop1 C-terminal but not the N-terminal domain possesses strong i-motif binding activity, implying that the C-terminal domain has a distinct substrate specificity. Additionally, we found that Hop1 promotes intermolecular pairing between G/C-rich DNA segments associated with a meiosis-specific DSB site. Our results support the idea that the G/C-rich motifs associated with meiosis-specific DSBs fold into intramolecular G-quadruplex and i-motif structures, both in vitro and in vivo, thus revealing an important link between non-B form DNA structures and Hop1 in meiotic chromosome synapsis and recombination. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Nuclear magnetic resonance-based model of a TF1/HmU-DNA complex.

PubMed

Silva, M V; Pasternack, L B; Kearns, D R

1997-12-15

Transcription factor 1 (TF1), a type II DNA-binding protein encoded by the Bacillus subtilis bacteriophage SPO1, has the capacity for sequence-selective DNA binding and a preference for 5-hydroxymethyl-2'-deoxyuridine (HmU)-containing DNA. In NMR studies of the TF1/HmU-DNA complex, intermolecular NOEs indicate that the flexible beta-ribbon and C-terminal alpha-helix are involved in the DNA-binding site of TF1, placing it in the beta-sheet category of DNA-binding proteins proposed to bind by wrapping two beta-ribbon "arms" around the DNA. Intermolecular and intramolecular NOEs were used to generate an energy-minimized model of the protein-DNA complex in which both DNA bending and protein structure changes are evident.
Effects of mutations at amino acid 61 in the arm of TF1 on its DNA-binding properties.

PubMed

Sayre, M H; Geiduschek, E P

1990-12-20

Transcription factor 1 (TF1) is the Bacillus subtilis phage SPO1-encoded member of the family of bacterial DNA-binding proteins that includes Escherichia coli HU and integration host factor (IHF). We have initiated a mutational analysis of the TF1 molecule to understand better its unique DNA-binding properties and to investigate its physiological function. We report here the consequences of mutating the putative DNA-binding "arms" of TF1. At position 61 in its primary sequence, TF1 contains a Phe residue in place of the Arg residue found in all other known members of the HU family. Substituting polar, uncharged residues for Phe61 substantially reduced the DNA-binding affinity and site-selectivity of TF1 in vitro, whereas the substitution of Tyr had no effect. Substituting Trp or Arg for Phe61 had little effect on the affinity of TF1 for SPO1 DNA, but altered the electrophoretic mobilities of protein-DNA complexes in non-denaturing gels. The Arg61 substitution increased the affinity of the protein for non-specific sites on thymine-containing DNA, thus reducing the natural preference of TF1 for (5-hydroxymethyluracil)-containing DNA. The Phe61-to-Arg mutation was also correlated with decreased phage yield and aberrant regulation of viral protein synthesis in vivo.
Fis protein induced λF-DNA bending observed by single-pair fluorescence resonance energy transfer

NASA Astrophysics Data System (ADS)

Chi-Cheng, Fu; Wunshain, Fann; Yuan Hanna, S.

2006-03-01

Fis, a site-specific DNA binding protein, regulates many biological processes including recombination, transcription, and replication in E.coli. Fis induced DNA bending plays an important role in regulating these functions and bending angle range from ˜50 to 95 dependent on the DNA sequence. For instance, the average bending angle of λF-DNA (26 bp, 8.8nm long, contained λF binding site on the center) measured by gel mobility shift assays was ˜ 94 . But the traditional method cannot provide information about the dynamics and the angle distribution. In this study, λF-DNA was labeled with donor (Alexa Fluor 546) and acceptor (Alexa Fluor 647) dyes on its two 5' ends and the donor-acceptor distances were measured using single-pair fluorescence resonance energy transfer (sp-FRET) with and without the present of Fis protein. Combing with structure information of Fis-DNA complex, the sp-FRET results are used to estimate the protein induced DNA bending angle distribution and dynamics.

Structural basis of DNA target recognition by the B3 domain of Arabidopsis epigenome reader VAL1

PubMed Central

Sasnauskas, Giedrius; Kauneckaitė, Kotryna; Siksnys, Virginijus

2018-01-01

Abstract Arabidopsis thaliana requires a prolonged period of cold exposure during winter to initiate flowering in a process termed vernalization. Exposure to cold induces epigenetic silencing of the FLOWERING LOCUS C (FLC) gene by Polycomb group (PcG) proteins. A key role in this epigenetic switch is played by transcriptional repressors VAL1 and VAL2, which specifically recognize Sph/RY DNA sequences within FLC via B3 DNA binding domains, and mediate recruitment of PcG silencing machinery. To understand the structural mechanism of site-specific DNA recognition by VAL1, we have solved the crystal structure of VAL1 B3 domain (VAL1-B3) bound to a 12 bp oligoduplex containing the canonical Sph/RY DNA sequence 5′-CATGCA-3′/5′-TGCATG-3′. We find that VAL1-B3 makes H-bonds and van der Waals contacts to DNA bases of all six positions of the canonical Sph/RY element. In agreement with the structure, in vitro DNA binding studies show that VAL1-B3 does not tolerate substitutions at any position of the 5′-TGCATG-3′ sequence. The VAL1-B3–DNA structure presented here provides a structural model for understanding the specificity of plant B3 domains interacting with the Sph/RY and other DNA sequences. PMID:29660015
Developmental roles of 21 Drosophila transcription factors are determined by quantitative differences in binding to an overlapping set of thousands of genomic regions

DOE Office of Scientific and Technical Information (OSTI.GOV)

MacArthur, Stewart; Li, Xiao-Yong; Li, Jingyi

2009-05-15

BACKGROUND: We previously established that six sequence-specific transcription factors that initiate anterior/posterior patterning in Drosophila bind to overlapping sets of thousands of genomic regions in blastoderm embryos. While regions bound at high levels include known and probable functional targets, more poorly bound regions are preferentially associated with housekeeping genes and/or genes not transcribed in the blastoderm, and are frequently found in protein coding sequences or in less conserved non-coding DNA, suggesting that many are likely non-functional. RESULTS: Here we show that an additional 15 transcription factors that regulate other aspects of embryo patterning show a similar quantitative continuum of functionmore » and binding to thousands of genomic regions in vivo. Collectively, the 21 regulators show a surprisingly high overlap in the regions they bind given that they belong to 11 DNA binding domain families, specify distinct developmental fates, and can act via different cis-regulatory modules. We demonstrate, however, that quantitative differences in relative levels of binding to shared targets correlate with the known biological and transcriptional regulatory specificities of these factors. CONCLUSIONS: It is likely that the overlap in binding of biochemically and functionally unrelated transcription factors arises from the high concentrations of these proteins in nuclei, which, coupled with their broad DNA binding specificities, directs them to regions of open chromatin. We suggest that most animal transcription factors will be found to show a similar broad overlapping pattern of binding in vivo, with specificity achieved by modulating the amount, rather than the identity, of bound factor.« less
UniPROBE, update 2011: expanded content and search tools in the online database of protein-binding microarray data on protein-DNA interactions.

PubMed

Robasky, Kimberly; Bulyk, Martha L

2011-01-01

The Universal PBM Resource for Oligonucleotide-Binding Evaluation (UniPROBE) database is a centralized repository of information on the DNA-binding preferences of proteins as determined by universal protein-binding microarray (PBM) technology. Each entry for a protein (or protein complex) in UniPROBE provides the quantitative preferences for all possible nucleotide sequence variants ('words') of length k ('k-mers'), as well as position weight matrix (PWM) and graphical sequence logo representations of the k-mer data. In this update, we describe >130% expansion of the database content, incorporation of a protein BLAST (blastp) tool for finding protein sequence matches in UniPROBE, the introduction of UniPROBE accession numbers and additional database enhancements. The UniPROBE database is available at http://uniprobe.org.
Direct observation of transcription activator-like effector (TALE) protein dynamics

NASA Astrophysics Data System (ADS)

Cuculis, Luke; Abil, Zhanar; Zhao, Huimin; Schroeder, Charles M.

2014-03-01

In this work, we describe a single molecule assay to probe the site-search dynamics of transcription activator-like effector (TALE) proteins along DNA. In modern genetics, the ability to selectively edit the human genome is an unprecedented development, driven by recent advances in targeted nuclease proteins. Specific gene editing can be accomplished using TALE proteins, which are programmable DNA-binding proteins that can be fused to a nuclease domain. In this way, TALENs are a leading technology that has shown great success in the genomic editing of pluripotent stem cells. A major hurdle facing clinical implementation, however, is the potential for deleterious off-target binding events. For these reasons, a molecular-level understanding of TALE binding and target sequence search on DNA is essential. To this end, we developed a single-molecule fluorescence imaging assay that provides a first-of-its-kind view of the 1-D diffusion of TALE proteins along stretched DNA. Taken together with co-crystal structures of DNA-bound TALEs, our results suggest a rotationally-coupled, major groove tracking model for diffusion. We further report diffusion constants for TALE proteins as a function of salt concentration, consistent with previously described models of 1-D protein diffusion.
Crystal structure of MboIIA methyltransferase.

PubMed

Osipiuk, Jerzy; Walsh, Martin A; Joachimiak, Andrzej

2003-09-15

DNA methyltransferases (MTases) are sequence-specific enzymes which transfer a methyl group from S-adenosyl-L-methionine (AdoMet) to the amino group of either cytosine or adenine within a recognized DNA sequence. Methylation of a base in a specific DNA sequence protects DNA from nucleolytic cleavage by restriction enzymes recognizing the same DNA sequence. We have determined at 1.74 A resolution the crystal structure of a beta-class DNA MTase MboIIA (M.MboIIA) from the bacterium Moraxella bovis, the smallest DNA MTase determined to date. M.MboIIA methylates the 3' adenine of the pentanucleotide sequence 5'-GAAGA-3'. The protein crystallizes with two molecules in the asymmetric unit which we propose to resemble the dimer when M.MboIIA is not bound to DNA. The overall structure of the enzyme closely resembles that of M.RsrI. However, the cofactor-binding pocket in M.MboIIA forms a closed structure which is in contrast to the open-form structures of other known MTases.
Coupled binding-bending-folding: The complex conformational dynamics of protein-DNA binding studied by atomistic molecular dynamics simulations.

PubMed

van der Vaart, Arjan

2015-05-01

Protein-DNA binding often involves dramatic conformational changes such as protein folding and DNA bending. While thermodynamic aspects of this behavior are understood, and its biological function is often known, the mechanism by which the conformational changes occur is generally unclear. By providing detailed structural and energetic data, molecular dynamics simulations have been helpful in elucidating and rationalizing protein-DNA binding. This review will summarize recent atomistic molecular dynamics simulations of the conformational dynamics of DNA and protein-DNA binding. A brief overview of recent developments in DNA force fields is given as well. Simulations have been crucial in rationalizing the intrinsic flexibility of DNA, and have been instrumental in identifying the sequence of binding events, the triggers for the conformational motion, and the mechanism of binding for a number of important DNA-binding proteins. Molecular dynamics simulations are an important tool for understanding the complex binding behavior of DNA-binding proteins. With recent advances in force fields and rapid increases in simulation time scales, simulations will become even more important for future studies. This article is part of a Special Issue entitled Recent developments of molecular dynamics. Copyright © 2014. Published by Elsevier B.V.
Cloning and expression of a nuclear encoded plastid specific 33 kDa ribonucleoprotein gene (33RNP) from pea that is light stimulated.

PubMed

Reddy, M K; Nair, S; Singh, B N; Mudgil, Y; Tewari, K K; Sopory, S K

2001-01-24

We report the cloning and sequencing of both cDNA and genomic DNA of a 33 kDa chloroplast ribonucleoprotein (33RNP) from pea. The analysis of the predicted amino acid sequence of the cDNA clone revealed that the encoded protein contains two RNA binding domains, including the conserved consensus ribonucleoprotein sequences CS-RNP1 and CS-RNP2, on the C-terminus half and the presence of a putative transit peptide sequence in the N-terminus region. The phylogenetic and multiple sequence alignment analysis of pea chloroplast RNP along with RNPs reported from the other plant sources revealed that the pea 33RNP is very closely related to Nicotiana sylvestris 31RNP and 28RNP and also to 31RNP and 28RNP of Arabidopsis and spinach, respectively. The pea 33RNP was expressed in Escherichia coli and purified to homogeneity. The in vitro import of precursor protein into chloroplasts confirmed that the N-terminus putative transit peptide is a bona fide transit peptide and 33RNP is localized in the chloroplast. The nucleic acid-binding properties of the recombinant protein, as revealed by South-Western analysis, showed that 33RNP has higher binding affinity for poly (U) and oligo dT than for ssDNA and dsDNA. The steady state transcript level was higher in leaves than in roots and the expression of this gene is light stimulated. Sequence analysis of the genomic clone revealed that the gene contains four exons and three introns. We have also isolated and analyzed the 5' flanking region of the pea 33RNP gene.
Modeling the interactions of the nucleotide excision repair UvrA(2) dimer with DNA.

PubMed

Gantchev, Tsvetan G; Hunting, Darel J

2010-12-28

The UvrA protein initiates the DNA damage recognition process by the bacterial nucleotide excision repair (NER) system. Recently, crystallographic structures of holo-UvrA(2) dimers from two different microorganisms have been released (Protein Data Bank entries 2r6f , 2vf7 , and 2vf8 ). However, the details of the DNA binding by UvrA(2) and other peculiarities involved in the damage recognition process remain unknown. We have undertaken a molecular modeling approach to appraise the possible modes of DNA-UvrA(2) interaction using molecular docking and short-scale guided molecular dynamics [continuum field, constrained, and/or unrestricted simulated annealing (SA)], taking into account the three-dimensional location of a series of mutation-identified UvrA residues implicated in DNA binding. The molecular docking was based on the assumptions that the UvrA(2) dimer is preformed prior to DNA binding and that no major protein conformational rearrangements, except moderate domain reorientations, are required for binding of undamaged DNA. As a first approximation, DNA was treated as a rigid ligand. From the electrostatic relief of the ventral surface of UvrA(2), we initially identified three, noncollinear DNA binding paths. Each of the three resulting nucleoprotein complexes (C1, C2, and C3) was analyzed separately, including calculation of binding energies, the number and type of interaction residues (including mutated ones), and the predominant mode of translational and rotational motion of specific protein domains after SA to ensure improved DNA binding. The UvrA(2) dimer can accommodate DNA in all three orientations, albeit with different binding strengths. One of the UvrA(2)-DNA complexes (C1) fulfilled most of the requirements (high interaction energy, proximity of DNA to mutated residues, etc.) expected for a natural, high-affinity DNA binding site. This nucleoprotein presents a structural organization that is designed to clamp and bend double-stranded DNA. We examined the binding site in more detail by docking DNAs of significantly different (AT- vs CG-enriched) sequences and by submitting the complexes to DNA-unrestricted SA. It was found that in a manner independent of the DNA sequence and applied MD protocols, UvrA(2) favors binding of a bent and unwound undamaged DNA, with a kink positioned in the proximity of the Zn3 hairpins, anticollinearly aligned at the bottom of the ventral protein surface. It is further hypothesized that the Zn3 modules play an essential role in the damage recognition process and that the apparent existence of a family of DNA binding sites might be biologically relevant. Our data should prove to be useful in rational (structure-based) mutation studies.
Colorimetric Detection of Specific DNA Segments Amplified by Polymerase Chain Reactions

NASA Astrophysics Data System (ADS)

Kemp, David J.; Smith, Donald B.; Foote, Simon J.; Samaras, N.; Peterson, M. Gregory

1989-04-01

The polymerase chain reaction (PCR) procedure has many potential applications in mass screening. We describe here a general assay for colorimetric detection of amplified DNA. The target DNA is first amplified by PCR, and then a second set of oligonucleotides, nested between the first two, is incorporated by three or more PCR cycles. These oligonucleotides bear ligands: for example, one can be biotinylated and the other can contain a site for a double-stranded DNA-binding protein. After linkage to an immobilized affinity reagent (such as a cloned DNA-binding protein, which we describe here) and labeling with a second affinity reagent (for example, avidin) linked to horseradish peroxidase, reaction with a chromogenic substrate allows detection of the amplified DNA. This amplified DNA assay (ADA) is rapid, is readily applicable to mass screening, and uses routine equipment. We show here that it can be used to detect human immunodeficiency virus sequences specifically against a background of human DNA.
Modulating the DNA affinity of Elk-1 with computationally selected mutations.

PubMed

Park, Sheldon; Boder, Eric T; Saven, Jeffery G

2005-04-22

In order to regulate gene expression, transcription factors must first bind their target DNA sequences. The affinity of this binding is determined by both the network of interactions at the interface and the entropy change associated with the complex formation. To study the role of structural fluctuation in fine-tuning DNA affinity, we performed molecular dynamics simulations of two highly homologous proteins, Elk-1 and SAP-1, that exhibit different sequence specificity. Simulation studies show that several residues in Elk have significantly higher main-chain root-mean-square deviations than their counterparts in SAP. In particular, a single residue, D69, may contribute to Elk's lower DNA affinity for P(c-fos) by structurally destabilizing the carboxy terminus of the recognition helix. While D69 does not contact DNA directly, the increased mobility in the region may contribute to its weaker binding. We measured the ability of single point mutants of Elk to bind P(c-fos) in a reporter assay, in which D69 of wild-type Elk has been mutated to other residues with higher helix propensity in order to stabilize the local conformation. The gains in transcriptional activity and the free energy of binding suggested from these measurements correlate well with stability gains computed from helix propensity and charge-macrodipole interactions. The study suggests that residues that are distal to the binding interface may indirectly modulate the binding affinity by stabilizing the protein scaffold required for efficient DNA interaction.
Discovery and information-theoretic characterization of transcription factor binding sites that act cooperatively.

PubMed

Clifford, Jacob; Adami, Christoph

2015-09-02

Transcription factor binding to the surface of DNA regulatory regions is one of the primary causes of regulating gene expression levels. A probabilistic approach to model protein-DNA interactions at the sequence level is through position weight matrices (PWMs) that estimate the joint probability of a DNA binding site sequence by assuming positional independence within the DNA sequence. Here we construct conditional PWMs that depend on the motif signatures in the flanking DNA sequence, by conditioning known binding site loci on the presence or absence of additional binding sites in the flanking sequence of each site's locus. Pooling known sites with similar flanking sequence patterns allows for the estimation of the conditional distribution function over the binding site sequences. We apply our model to the Dorsal transcription factor binding sites active in patterning the Dorsal-Ventral axis of Drosophila development. We find that those binding sites that cooperate with nearby Twist sites on average contain about 0.5 bits of information about the presence of Twist transcription factor binding sites in the flanking sequence. We also find that Dorsal binding site detectors conditioned on flanking sequence information make better predictions about what is a Dorsal site relative to background DNA than detection without information about flanking sequence features.
Tomato ASR1 abrogates the response to abscisic acid and glucose in Arabidopsis by competing with ABI4 for DNA binding.

PubMed

Shkolnik, Doron; Bar-Zvi, Dudy

2008-05-01

The manipulation of transacting factors is commonly used to achieve a wide change in the expression of a large number of genes in transgenic plants as a result of a change in the expression of a single gene product. This is mostly achieved by the overexpression of transactivator or repressor proteins. In this study, it is demonstrated that the overexpression of an exogenous DNA-binding protein can be used to compete with the expression of an endogenous transcription factor sharing the same DNA-binding sequence. Arabidopsis was transformed with cDNA encoding tomato abscisic acid stress ripening 1 (ASR1), a sequence-specific DNA protein that has no orthologues in the Arabidopsis genome. ASR1-overexpressing (ASR1-OE) plants display an abscisic acid-insensitive 4 (abi4) phenotype: seed germination is not sensitive to inhibition by abscisic acid (ABA), glucose, NaCl and paclobutrazol. ASR1 binds coupling element 1 (CE1), a cis-acting element bound by the ABI4 transcription factor, located in the ABI4-regulated promoters, including that of the ABI4 gene. Chromatin immunoprecipitation demonstrates that ASR1 is bound in vivo to the promoter of the ABI4 gene in ASR1-OE plants, but not to promoters of genes known to be regulated by the transcription factors ABI3 or ABI5. Real-time polymerase chain reaction (PCR) analysis confirmed that the expression of ABI4 and ABI4-regulated genes is markedly reduced in ASR1-OE plants. Therefore, it is concluded that the abi4 phenotype of ASR1-OE plants is the result of competition between the foreign ASR1 and the endogenous ABI4 on specific promoter DNA sequences. The biotechnological advantage of using this approach in crop plants from the Brassicaceae family to reduce the transactivation activity of ABI4 is discussed.
Specific Inhibition of the transcription factor Ci by a Cobalt(III)-Schiff base-DNA conjugate

PubMed Central

Hurtado, Ryan R.; Harney, Allison S.; Heffern, Marie C.; Holbrook, Robert J.; Holmgren, Robert A.; Meade, Thomas J.

2012-01-01

We describe the use of Co(III) Schiff base-DNA conjugates, a versatile class of research tools that target C2H2 transcription factors, to inhibit the Hedgehog (Hh) pathway. In developing mammalian embryos, Hh signaling is critical for the formation and development of many tissues and organs. Inappropriate activation of the Hedgehog (Hh) pathway has been implicated in a variety of cancers including medulloblastomas and basal cell carcinomas. It is well known that Hh regulates the activity of the Gli family of C2H2 zinc finger transcription factors in mammals. In Drosophila the function of the Gli proteins is performed by a single transcription factor with an identical DNA binding consensus sequence, Cubitus Interruptus (Ci). We have demonstrated previously that conjugation of a specific 17 base-pair oligonucleotide to a Co(III) Schiff base complex results in a targeted inhibitor of the Snail family C2H2 zinc finger transcription factors. Modification of the oligonucleotide sequence in the Co(III) Schiff base-DNA conjugate to that of Ci’s consensus sequence (Co(III)-Ci) generates an equally selective inhibitor of Ci. Co(III)-Ci irreversibly binds the Ci zinc finger domain and prevents it from binding DNA in vitro. In a Ci responsive tissue culture reporter gene assay, Co(III)-Ci reduces the transcriptional activity of Ci in a concentration dependent manner. In addition, injection of wild-type Drosophila embryos with Co(III)-Ci phenocopies a Ci loss of function phenotype, demonstrating effectiveness in vivo. This study provides evidence that Co(III) Schiff base-DNA conjugates are a versatile class of specific and potent tools for studying zinc finger domain proteins and have potential applications as customizable anti-cancer therapeutics. PMID:22214326
An Improved Method for TAL Effectors DNA-Binding Sites Prediction Reveals Functional Convergence in TAL Repertoires of Xanthomonas oryzae Strains

PubMed Central

Pérez-Quintero, Alvaro L.; Rodriguez-R, Luis M.; Dereeper, Alexis; López, Camilo; Koebnik, Ralf; Szurek, Boris; Cunnac, Sebastien

2013-01-01

Transcription Activators-Like Effectors (TALEs) belong to a family of virulence proteins from the Xanthomonas genus of bacterial plant pathogens that are translocated into the plant cell. In the nucleus, TALEs act as transcription factors inducing the expression of susceptibility genes. A code for TALE-DNA binding specificity and high-resolution three-dimensional structures of TALE-DNA complexes were recently reported. Accurate prediction of TAL Effector Binding Elements (EBEs) is essential to elucidate the biological functions of the many sequenced TALEs as well as for robust design of artificial TALE DNA-binding domains in biotechnological applications. In this work a program with improved EBE prediction performances was developed using an updated specificity matrix and a position weight correction function to account for the matching pattern observed in a validation set of TALE-DNA interactions. To gain a systems perspective on the large TALE repertoires from X. oryzae strains, this program was used to predict rice gene targets for 99 sequenced family members. Integrating predictions and available expression data in a TALE-gene network revealed multiple candidate transcriptional targets for many TALEs as well as several possible instances of functional convergence among TALEs. PMID:23869221
Conditional sterility in plants

DOEpatents

Meagher, Richard B.; McKinney, Elizabeth; Kim, Tehryung

2010-02-23

The present disclosure provides methods, recombinant DNA molecules, recombinant host cells containing the DNA molecules, and transgenic plant cells, plant tissue and plants which contain and express at least one antisense or interference RNA specific for a thiamine biosynthetic coding sequence or a thiamine binding protein or a thiamine-degrading protein, wherein the RNA or thiamine binding protein is expressed under the regulatory control of a transcription regulatory sequence which directs expression in male and/or female reproductive tissue. These transgenic plants are conditionally sterile; i.e., they are fertile only in the presence of exogenous thiamine. Such plants are especially appropriate for use in the seed industry or in the environment, for example, for use in revegetation of contaminated soils or phytoremediation, especially when those transgenic plants also contain and express one or more chimeric genes which confer resistance to contaminants.
Identification of Specific DNA Binding Residues in the TCP Family of Transcription Factors in Arabidopsis[W

PubMed Central

Aggarwal, Pooja; Das Gupta, Mainak; Joseph, Agnel Praveen; Chatterjee, Nirmalya; Srinivasan, N.; Nath, Utpal

2010-01-01

The TCP transcription factors control multiple developmental traits in diverse plant species. Members of this family share an ∼60-residue-long TCP domain that binds to DNA. The TCP domain is predicted to form a basic helix-loop-helix (bHLH) structure but shares little sequence similarity with canonical bHLH domain. This classifies the TCP domain as a novel class of DNA binding domain specific to the plant kingdom. Little is known about how the TCP domain interacts with its target DNA. We report biochemical characterization and DNA binding properties of a TCP member in Arabidopsis thaliana, TCP4. We have shown that the 58-residue domain of TCP4 is essential and sufficient for binding to DNA and possesses DNA binding parameters comparable to canonical bHLH proteins. Using a yeast-based random mutagenesis screen and site-directed mutants, we identified the residues important for DNA binding and dimer formation. Mutants defective in binding and dimerization failed to rescue the phenotype of an Arabidopsis line lacking the endogenous TCP4 activity. By combining structure prediction, functional characterization of the mutants, and molecular modeling, we suggest a possible DNA binding mechanism for this class of transcription factors. PMID:20363772
The NS1 polypeptide of the murine parvovirus minute virus of mice binds to DNA sequences containing the motif [ACCA]2-3.

PubMed Central

Cotmore, S F; Christensen, J; Nüesch, J P; Tattersall, P

1995-01-01

A DNA fragment containing the minute virus of mice 3' replication origin was specifically coprecipitated in immune complexes containing the virally coded NS1, but not the NS2, polypeptide. Antibodies directed against the amino- or carboxy-terminal regions of NS1 precipitated the NS1-origin complexes, but antibodies directed against NS1 amino acids 284 to 459 blocked complex formation. Using affinity-purified histidine-tagged NS1 preparations, we have shown that the specific protein-DNA interaction is of moderate affinity, being stable in 0.1 M salt but rapidly lost at higher salt concentrations. In contrast, generalized (or nonspecific) DNA binding by NS1 could be demonstrated only in low salt. Addition of ATP or gamma S-ATP enhanced specific DNA binding by wild-type NS1 severalfold, but binding was lost under conditions which favored ATP hydrolysis. NS1 molecules with mutations in a critical lysine residue (amino acid 405) in the consensus ATP-binding site bound to the origin, but this binding could not be enhanced by ATP addition. DNase I protection assays carried out with wild-type NS1 in the presence of gamma S-ATP gave footprints which extended over 43 nucleotides on both DNA strands, from the middle of the origin bubble sequence to a position some 14 bp beyond the nick site. The DNA-binding site for NS1 was mapped to a 22-bp fragment from the middle of the 3' replication origin which contains the sequence ACCAACCA. This conforms to a reiterated motif (ACCA)2-3, which occurs, in more or less degenerate form, at many sites throughout the minute virus of mice genome (J. W. Bodner, Virus Genes 2:167-182, 1989). Insertion of a single copy of the sequence (ACCA)3 was shown to be sufficient to confer NS1 binding on an otherwise unrecognized plasmid fragment. The functions of NS1 in the viral life cycle are reevaluated in the light of this result. PMID:7853501
The NS1 polypeptide of the murine parvovirus minute virus of mice binds to DNA sequences containing the motif [ACCA]2-3.

PubMed

Cotmore, S F; Christensen, J; Nüesch, J P; Tattersall, P

1995-03-01

A DNA fragment containing the minute virus of mice 3' replication origin was specifically coprecipitated in immune complexes containing the virally coded NS1, but not the NS2, polypeptide. Antibodies directed against the amino- or carboxy-terminal regions of NS1 precipitated the NS1-origin complexes, but antibodies directed against NS1 amino acids 284 to 459 blocked complex formation. Using affinity-purified histidine-tagged NS1 preparations, we have shown that the specific protein-DNA interaction is of moderate affinity, being stable in 0.1 M salt but rapidly lost at higher salt concentrations. In contrast, generalized (or nonspecific) DNA binding by NS1 could be demonstrated only in low salt. Addition of ATP or gamma S-ATP enhanced specific DNA binding by wild-type NS1 severalfold, but binding was lost under conditions which favored ATP hydrolysis. NS1 molecules with mutations in a critical lysine residue (amino acid 405) in the consensus ATP-binding site bound to the origin, but this binding could not be enhanced by ATP addition. DNase I protection assays carried out with wild-type NS1 in the presence of gamma S-ATP gave footprints which extended over 43 nucleotides on both DNA strands, from the middle of the origin bubble sequence to a position some 14 bp beyond the nick site. The DNA-binding site for NS1 was mapped to a 22-bp fragment from the middle of the 3' replication origin which contains the sequence ACCAACCA. This conforms to a reiterated motif (ACCA)2-3, which occurs, in more or less degenerate form, at many sites throughout the minute virus of mice genome (J. W. Bodner, Virus Genes 2:167-182, 1989). Insertion of a single copy of the sequence (ACCA)3 was shown to be sufficient to confer NS1 binding on an otherwise unrecognized plasmid fragment. The functions of NS1 in the viral life cycle are reevaluated in the light of this result.
Evaluation of Novel Design Strategies for Developing Zinc Finger Nucleases Tools for Treating Human Diseases

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bach, Christian; Sherman, William; Pallis, Jani

Zinc finger nucleases (ZFNs) are associated with cell death and apoptosis by binding at countless undesired locations. This cytotoxicity is associated with the binding ability of engineered zinc finger domains to bind dissimilar DNA sequences with high affinity. In general, binding preferences of transcription factors are associated with significant degenerated diversity and complexity which convolutes the design and engineering of precise DNA binding domains. Evolutionary success of natural zinc finger proteins, however, evinces that nature created specific evolutionary traits and strategies, such as modularity and rank-specific recognition to cope with binding complexity that are critical for creating clinical viable toolsmore » to precisely modify the human genome. Our findings indicate preservation of general modularity and significant alteration of the rank-specific binding preferences of the three-finger binding domain of transcription factor SP1 when exchanging amino acids in the 2nd finger.« less
Evaluation of Novel Design Strategies for Developing Zinc Finger Nucleases Tools for Treating Human Diseases

DOE PAGES

Bach, Christian; Sherman, William; Pallis, Jani; ...

2014-01-01

Zinc finger nucleases (ZFNs) are associated with cell death and apoptosis by binding at countless undesired locations. This cytotoxicity is associated with the binding ability of engineered zinc finger domains to bind dissimilar DNA sequences with high affinity. In general, binding preferences of transcription factors are associated with significant degenerated diversity and complexity which convolutes the design and engineering of precise DNA binding domains. Evolutionary success of natural zinc finger proteins, however, evinces that nature created specific evolutionary traits and strategies, such as modularity and rank-specific recognition to cope with binding complexity that are critical for creating clinical viable toolsmore » to precisely modify the human genome. Our findings indicate preservation of general modularity and significant alteration of the rank-specific binding preferences of the three-finger binding domain of transcription factor SP1 when exchanging amino acids in the 2nd finger.« less

Transcription Factors Bind Thousands of Active and InactiveRegions in the Drosophila Blastoderm

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Xiao-Yong; MacArthur, Stewart; Bourgon, Richard

2008-01-10

Identifying the genomic regions bound by sequence-specific regulatory factors is central both to deciphering the complex DNA cis-regulatory code that controls transcription in metazoans and to determining the range of genes that shape animal morphogenesis. Here, we use whole-genome tiling arrays to map sequences bound in Drosophila melanogaster embryos by the six maternal and gap transcription factors that initiate anterior-posterior patterning. We find that these sequence-specific DNA binding proteins bind with quantitatively different specificities to highly overlapping sets of several thousand genomic regions in blastoderm embryos. Specific high- and moderate-affinity in vitro recognition sequences for each factor are enriched inmore » bound regions. This enrichment, however, is not sufficient to explain the pattern of binding in vivo and varies in a context-dependent manner, demonstrating that higher-order rules must govern targeting of transcription factors. The more highly bound regions include all of the over forty well-characterized enhancers known to respond to these factors as well as several hundred putative new cis-regulatory modules clustered near developmental regulators and other genes with patterned expression at this stage of embryogenesis. The new targets include most of the microRNAs (miRNAs) transcribed in the blastoderm, as well as all major zygotically transcribed dorsal-ventral patterning genes, whose expression we show to be quantitatively modulated by anterior-posterior factors. In addition to these highly bound regions, there are several thousand regions that are reproducibly bound at lower levels. However, these poorly bound regions are, collectively, far more distant from genes transcribed in the blastoderm than highly bound regions; are preferentially found in protein-coding sequences; and are less conserved than highly bound regions. Together these observations suggest that many of these poorly-bound regions are not involved in early-embryonic transcriptional regulation, and a significant proportion may be nonfunctional. Surprisingly, for five of the six factors, their recognition sites are not unambiguously more constrained evolutionarily than the immediate flanking DNA, even in more highly bound and presumably functional regions, indicating that comparative DNA sequence analysis is limited in its ability to identify functional transcription factor targets.« less
A divergent Pumilio repeat protein family for pre-rRNA processing and mRNA localization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Qiu, Chen; McCann, Kathleen L.; Wine, Robert N.

Pumilio/feminization of XX and XO animals (fem)-3 mRNA-binding factor (PUF) proteins bind sequence specifically to mRNA targets using a single-stranded RNA-binding domain comprising eight Pumilio (PUM) repeats. PUM repeats have now been identified in proteins that function in pre-rRNA processing, including human Puf-A and yeast Puf6. This is a role not previously ascribed to PUF proteins. In this paper we present crystal structures of human Puf-A that reveal a class of nucleic acid-binding proteins with 11 PUM repeats arranged in an “L”-like shape. In contrast to classical PUF proteins, Puf-A forms sequence-independent interactions with DNA or RNA, mediated by conservedmore » basic residues. We demonstrate that equivalent basic residues in yeast Puf6 are important for RNA binding, pre-rRNA processing, and mRNA localization. Finally, PUM repeats can be assembled into alternative folds that bind to structured nucleic acids in addition to forming canonical eight-repeat crescent-shaped RNA-binding domains found in classical PUF proteins.« less
A divergent Pumilio repeat protein family for pre-rRNA processing and mRNA localization

DOE PAGES

Qiu, Chen; McCann, Kathleen L.; Wine, Robert N.; ...

2014-12-15

Pumilio/feminization of XX and XO animals (fem)-3 mRNA-binding factor (PUF) proteins bind sequence specifically to mRNA targets using a single-stranded RNA-binding domain comprising eight Pumilio (PUM) repeats. PUM repeats have now been identified in proteins that function in pre-rRNA processing, including human Puf-A and yeast Puf6. This is a role not previously ascribed to PUF proteins. In this paper we present crystal structures of human Puf-A that reveal a class of nucleic acid-binding proteins with 11 PUM repeats arranged in an “L”-like shape. In contrast to classical PUF proteins, Puf-A forms sequence-independent interactions with DNA or RNA, mediated by conservedmore » basic residues. We demonstrate that equivalent basic residues in yeast Puf6 are important for RNA binding, pre-rRNA processing, and mRNA localization. Finally, PUM repeats can be assembled into alternative folds that bind to structured nucleic acids in addition to forming canonical eight-repeat crescent-shaped RNA-binding domains found in classical PUF proteins.« less
Sliding of proteins non-specifically bound to DNA: Brownian dynamics studies with coarse-grained protein and DNA models.

PubMed

Ando, Tadashi; Skolnick, Jeffrey

2014-12-01

DNA binding proteins efficiently search for their cognitive sites on long genomic DNA by combining 3D diffusion and 1D diffusion (sliding) along the DNA. Recent experimental results and theoretical analyses revealed that the proteins show a rotation-coupled sliding along DNA helical pitch. Here, we performed Brownian dynamics simulations using newly developed coarse-grained protein and DNA models for evaluating how hydrodynamic interactions between the protein and DNA molecules, binding affinity of the protein to DNA, and DNA fluctuations affect the one dimensional diffusion of the protein on the DNA. Our results indicate that intermolecular hydrodynamic interactions reduce 1D diffusivity by 30%. On the other hand, structural fluctuations of DNA give rise to steric collisions between the CG-proteins and DNA, resulting in faster 1D sliding of the protein. Proteins with low binding affinities consistent with experimental estimates of non-specific DNA binding show hopping along the CG-DNA. This hopping significantly increases sliding speed. These simulation studies provide additional insights into the mechanism of how DNA binding proteins find their target sites on the genome.
Nanopore sensing of individual transcription factors bound to DNA

PubMed Central

Squires, Allison; Atas, Evrim; Meller, Amit

2015-01-01

Transcription factor (TF)-DNA interactions are the primary control point in regulation of gene expression. Characterization of these interactions is essential for understanding genetic regulation of biological systems and developing novel therapies to treat cellular malfunctions. Solid-state nanopores are a highly versatile class of single-molecule sensors that can provide rich information about local properties of long charged biopolymers using the current blockage patterns generated during analyte translocation, and provide a novel platform for characterization of TF-DNA interactions. The DNA-binding domain of the TF Early Growth Response Protein 1 (EGR1), a prototypical zinc finger protein known as zif268, is used as a model system for this study. zif268 adopts two distinct bound conformations corresponding to specific and nonspecific binding, according to the local DNA sequence. Here we implement a solid-state nanopore platform for direct, label- and tether-free single-molecule detection of zif268 bound to DNA. We demonstrate detection of single zif268 TFs bound to DNA according to current blockage sublevels and duration of translocation through the nanopore. We further show that the nanopore can detect and discriminate both specific and nonspecific binding conformations of zif268 on DNA via the distinct current blockage patterns corresponding to each of these two known binding modes. PMID:26109509
Nanopore sensing of individual transcription factors bound to DNA

NASA Astrophysics Data System (ADS)

Squires, Allison; Atas, Evrim; Meller, Amit

2015-06-01

Transcription factor (TF)-DNA interactions are the primary control point in regulation of gene expression. Characterization of these interactions is essential for understanding genetic regulation of biological systems and developing novel therapies to treat cellular malfunctions. Solid-state nanopores are a highly versatile class of single-molecule sensors that can provide rich information about local properties of long charged biopolymers using the current blockage patterns generated during analyte translocation, and provide a novel platform for characterization of TF-DNA interactions. The DNA-binding domain of the TF Early Growth Response Protein 1 (EGR1), a prototypical zinc finger protein known as zif268, is used as a model system for this study. zif268 adopts two distinct bound conformations corresponding to specific and nonspecific binding, according to the local DNA sequence. Here we implement a solid-state nanopore platform for direct, label- and tether-free single-molecule detection of zif268 bound to DNA. We demonstrate detection of single zif268 TFs bound to DNA according to current blockage sublevels and duration of translocation through the nanopore. We further show that the nanopore can detect and discriminate both specific and nonspecific binding conformations of zif268 on DNA via the distinct current blockage patterns corresponding to each of these two known binding modes.
Titration of DnaA protein by oriC DnaA-boxes increases dnaA gene expression in Escherichia coli.

PubMed Central

Hansen, F G; Koefoed, S; Sørensen, L; Atlung, T

1987-01-01

Binding of the DnaA protein to its binding sites, the DnaA-boxes (TTATCCACA), was measured by a simple physiological approach. The presence of extra DnaA-boxes in growing cells leads to a derepression of dnaA gene expression, measured as beta-galactosidase activity of a dnaA-lacZ fusion polypeptide. Different DnaA-boxes caused different degrees of derepression indicating that the DnaA protein requires sequences in addition to the DnaA-box for efficient binding. The DnaA-boxes in oriC might act cooperatively in binding of the DnaA protein. The derepressed levels of DnaA protein obtained in a strain carrying an oriC+-pBR322 chimera were very high and sufficient to activate oriC on the chimeric plasmid, which was maintained at a copy number more than three times that of pBR322. PMID:3034578
Evolution of I-SceI Homing Endonucleases with Increased DNA Recognition Site Specificity

DOE Office of Scientific and Technical Information (OSTI.GOV)

Joshi, Rakesh; Ho, Kwok Ki; Tenney, Kristen

2013-09-18

Elucidating how homing endonucleases undergo changes in recognition site specificity will facilitate efforts to engineer proteins for gene therapy applications. I-SceI is a monomeric homing endonuclease that recognizes and cleaves within an 18-bp target. It tolerates limited degeneracy in its target sequence, including substitution of a C:G{sub +4} base pair for the wild-type A:T{sub +4} base pair. Libraries encoding randomized amino acids at I-SceI residue positions that contact or are proximal to A:T{sub +4} were used in conjunction with a bacterial one-hybrid system to select I-SceI derivatives that bind to recognition sites containing either the A:T{sub +4} or the C:G{submore » +4} base pairs. As expected, isolates encoding wild-type residues at the randomized positions were selected using either target sequence. All I-SceI proteins isolated using the C:G{sub +4} recognition site included small side-chain substitutions at G100 and either contained (K86R/G100T, K86R/G100S and K86R/G100C) or lacked (G100A, G100T) a K86R substitution. Interestingly, the binding affinities of the selected variants for the wild-type A:T{sub +4} target are 4- to 11-fold lower than that of wild-type I-SceI, whereas those for the C:G{sub +4} target are similar. The increased specificity of the mutant proteins is also evident in binding experiments in vivo. These differences in binding affinities account for the observed -36-fold difference in target preference between the K86R/G100T and wild-type proteins in DNA cleavage assays. An X-ray crystal structure of the K86R/G100T mutant protein bound to a DNA duplex containing the C:G{sub +4} substitution suggests how sequence specificity of a homing enzyme can increase. This biochemical and structural analysis defines one pathway by which site specificity is augmented for a homing endonuclease.« less
In vitro Selection and Interaction Studies of a DNA Aptamer Targeting Protein A

PubMed Central

Stoltenburg, Regina; Schubert, Thomas; Strehlitz, Beate

2015-01-01

A new DNA aptamer targeting Protein A is presented. The aptamer was selected by use of the FluMag-SELEX procedure. The SELEX technology (Systematic Evolution of Ligands by EXponential enrichment) is widely applied as an in vitro selection and amplification method to generate target-specific aptamers and exists in various modified variants. FluMag-SELEX is one of them and is characterized by the use of magnetic beads for target immobilization and fluorescently labeled oligonucleotides for monitoring the aptamer selection progress. Structural investigations and sequence truncation experiments of the selected aptamer for Protein A led to the conclusion, that a stem-loop structure at its 5’-end including the 5’-primer binding site is essential for aptamer-target binding. Extensive interaction analyses between aptamer and Protein A were performed by methods like surface plasmon resonance, MicroScale Thermophoresis and bead-based binding assays using fluorescence measurements. The binding of the aptamer to its target was thus investigated in assays with immobilization of one of the binding partners each, and with both binding partners in solution. Affinity constants were determined in the low micromolar to submicromolar range, increasing to the nanomolar range under the assumption of avidity. Protein A provides more than one binding site for the aptamer, which may overlap with the known binding sites for immunoglobulins. The aptamer binds specifically to both native and recombinant Protein A, but not to other immunoglobulin-binding proteins like Protein G and L. Cross specificity to other proteins was not found. The application of the aptamer is directed to Protein A detection or affinity purification. Moreover, whole cells of Staphylococcus aureus, presenting Protein A on the cell surface, could also be bound by the aptamer. PMID:26221730
In vitro Selection and Interaction Studies of a DNA Aptamer Targeting Protein A.

PubMed

Stoltenburg, Regina; Schubert, Thomas; Strehlitz, Beate

2015-01-01

A new DNA aptamer targeting Protein A is presented. The aptamer was selected by use of the FluMag-SELEX procedure. The SELEX technology (Systematic Evolution of Ligands by EXponential enrichment) is widely applied as an in vitro selection and amplification method to generate target-specific aptamers and exists in various modified variants. FluMag-SELEX is one of them and is characterized by the use of magnetic beads for target immobilization and fluorescently labeled oligonucleotides for monitoring the aptamer selection progress. Structural investigations and sequence truncation experiments of the selected aptamer for Protein A led to the conclusion, that a stem-loop structure at its 5'-end including the 5'-primer binding site is essential for aptamer-target binding. Extensive interaction analyses between aptamer and Protein A were performed by methods like surface plasmon resonance, MicroScale Thermophoresis and bead-based binding assays using fluorescence measurements. The binding of the aptamer to its target was thus investigated in assays with immobilization of one of the binding partners each, and with both binding partners in solution. Affinity constants were determined in the low micromolar to submicromolar range, increasing to the nanomolar range under the assumption of avidity. Protein A provides more than one binding site for the aptamer, which may overlap with the known binding sites for immunoglobulins. The aptamer binds specifically to both native and recombinant Protein A, but not to other immunoglobulin-binding proteins like Protein G and L. Cross specificity to other proteins was not found. The application of the aptamer is directed to Protein A detection or affinity purification. Moreover, whole cells of Staphylococcus aureus, presenting Protein A on the cell surface, could also be bound by the aptamer.
Sequence-dependent modelling of local DNA bending phenomena: curvature prediction and vibrational analysis.

PubMed

Vlahovicek, K; Munteanu, M G; Pongor, S

1999-01-01

Bending is a local conformational micropolymorphism of DNA in which the original B-DNA structure is only distorted but not extensively modified. Bending can be predicted by simple static geometry models as well as by a recently developed elastic model that incorporate sequence dependent anisotropic bendability (SDAB). The SDAB model qualitatively explains phenomena including affinity of protein binding, kinking, as well as sequence-dependent vibrational properties of DNA. The vibrational properties of DNA segments can be studied by finite element analysis of a model subjected to an initial bending moment. The frequency spectrum is obtained by applying Fourier analysis to the displacement values in the time domain. This analysis shows that the spectrum of the bending vibrations quite sensitively depends on the sequence, for example the spectrum of a curved sequence is characteristically different from the spectrum of straight sequence motifs of identical basepair composition. Curvature distributions are genome-specific, and pronounced differences are found between protein-coding and regulatory regions, respectively, that is, sites of extreme curvature and/or bendability are less frequent in protein-coding regions. A WWW server is set up for the prediction of curvature and generation of 3D models from DNA sequences (http:@www.icgeb.trieste.it/dna).
Metal Ion Binding at the Catalytic Site Induces Widely Distributed Changes in a Sequence Specific Protein–DNA Complex

PubMed Central

2016-01-01

Metal ion cofactors can alter the energetics and specificity of sequence specific protein–DNA interactions, but it is unknown if the underlying effects on structure and dynamics are local or dispersed throughout the protein–DNA complex. This work uses EcoRV endonuclease as a model, and catalytically inactive lanthanide ions, which replace the Mg2+ cofactor. Nuclear magnetic resonance (NMR) titrations indicate that four Lu3+ or two La3+ cations bind, and two new crystal structures confirm that Lu3+ binding is confined to the active sites. NMR spectra show that the metal-free EcoRV complex with cognate (GATATC) DNA is structurally distinct from the nonspecific complex, and that metal ion binding sites are not assembled in the nonspecific complex. NMR chemical shift perturbations were determined for 1H–15N amide resonances, for 1H–13C Ile-δ-CH3 resonances, and for stereospecifically assigned Leu-δ-CH3 and Val-γ-CH3 resonances. Many chemical shifts throughout the cognate complex are unperturbed, so metal binding does not induce major conformational changes. However, some large perturbations of amide and side chain methyl resonances occur as far as 34 Å from the metal ions. Concerted changes in specific residues imply that local effects of metal binding are propagated via a β-sheet and an α-helix. Both amide and methyl resonance perturbations indicate changes in the interface between subunits of the EcoRV homodimer. Bound metal ions also affect amide hydrogen exchange rates for distant residues, including a distant subdomain that contacts DNA phosphates and promotes DNA bending, showing that metal ions in the active sites, which relieve electrostatic repulsion between protein and DNA, cause changes in slow dynamics throughout the complex. PMID:27786446
Nectinepsin: a new extracellular matrix protein of the pexin family. Characterization of a novel cDNA encoding a protein with an RGD cell binding motif.

PubMed

Blancher, C; Omri, B; Bidou, L; Pessac, B; Crisanti, P

1996-10-18

We report the isolation and characterization of a novel cDNA from quail neuroretina encoding a putative protein named nectinepsin. The nectinepsin cDNA identifies a major 2.2-kilobase mRNA that is detected from ED 5 in neuroretina and is increasingly abundant during embryonic development. A nectinepsin mRNA is also found in quail liver, brain, and intestine and in mouse retina. The deduced nectinepsin amino acid sequence contains the RGD cell binding motif of integrin ligands. Furthermore, nectinepsin shares substantial homologies with vitronectin and structural protein similarities with most of the matricial metalloproteases. However, the presence of a specific sequence and the lack of heparin and collagen binding domains of the vitronectin indicate that nectinepsin is a new extracellular matrix protein. Furthermore, genomic Southern blot studies suggest that nectinepsin and vitronectin are encoded by different genes. Western blot analysis with an anti-human vitronectin antiserum revealed, in addition to the 65- and 70-kDa vitronectin bands, an immunoreactive protein of about 54 kDa in all tissues containing nectinepsin mRNA. It seems likely that the form of vitronectin found in chick egg yolk plasma by Nagano et al. ((1992) J. Biol. Chem. 267, 24863-24870) is the protein that corresponds to the nectinepsin cDNA. This new protein could be an important molecule involved in the early steps of the development.
Isolation and Characterization of a Sex-Specific Lectin in a Marine Red Alga, Aglaothamnion oosumiense Itono

PubMed Central

Han, Jong Won; Klochkova, Tatyana A.; Shim, Jun Bo; Yoon, Kangsup

2012-01-01

In red algae, spermatial binding to female trichogynes is mediated by a lectin-carbohydrate complementary system. Aglaothamnion oosumiense is a microscopic filamentous red alga. The gamete recognition and binding occur at the surface of the hairlike trichogyne on the female carpogonium. Male spermatia are nonmotile. Previous studies suggested the presence of a lectin responsible for gamete recognition on the surface of female trychogynes. A novel N-acetyl-d-galactosamine-specific protein was isolated from female plants of A. oosumiense by affinity chromatography and named AOL1. The lectin was monomeric and did not agglutinate horse blood or human erythrocytes. The N-terminal amino acid sequence of the protein was analyzed, and degenerate primers were designed. A full-length cDNA encoding the lectin was obtained using rapid amplification of cDNA ends-PCR (RACE-PCR). The cDNA was 1,095 bp in length and coded for a protein of 259 amino acids with a deduced molecular mass of 21.4 kDa, which agreed well with the protein data. PCR analysis using genomic DNA showed that both male and female plants have this gene. However, Northern blotting and two-dimensional electrophoresis showed that this protein was expressed 12 to 15 times more in female plants. The lectin inhibited spermatial binding to the trichogynes when preincubated with spermatia, suggesting its involvement in gamete binding. PMID:22865077
Onco-Regulon: an integrated database and software suite for site specific targeting of transcription factors of cancer genes

PubMed Central

Tomar, Navneet; Mishra, Akhilesh; Mrinal, Nirotpal; Jayaram, B.

2016-01-01

Transcription factors (TFs) bind at multiple sites in the genome and regulate expression of many genes. Regulating TF binding in a gene specific manner remains a formidable challenge in drug discovery because the same binding motif may be present at multiple locations in the genome. Here, we present Onco-Regulon (http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm), an integrated database of regulatory motifs of cancer genes clubbed with Unique Sequence-Predictor (USP) a software suite that identifies unique sequences for each of these regulatory DNA motifs at the specified position in the genome. USP works by extending a given DNA motif, in 5′→3′, 3′ →5′ or both directions by adding one nucleotide at each step, and calculates the frequency of each extended motif in the genome by Frequency Counter programme. This step is iterated till the frequency of the extended motif becomes unity in the genome. Thus, for each given motif, we get three possible unique sequences. Closest Sequence Finder program predicts off-target drug binding in the genome. Inclusion of DNA-Protein structural information further makes Onco-Regulon a highly informative repository for gene specific drug development. We believe that Onco-Regulon will help researchers to design drugs which will bind to an exclusive site in the genome with no off-target effects, theoretically. Database URL: http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm PMID:27515825
Molecular coevolution of mammalian ribosomal gene terminator sequences and the transcription termination factor TTF-I.

PubMed Central

Evers, R; Grummt, I

1995-01-01

Both the DNA elements and the nuclear factors that direct termination of ribosomal gene transcription exhibit species-specific differences. Even between mammals--e.g., human and mouse--the termination signals are not identical and the respective transcription termination factors (TTFs) which bind to the terminator sequence are not fully interchangeable. To elucidate the molecular basis for this species-specificity, we have cloned TTF-I from human and mouse cells and compared their structural and functional properties. Recombinant TTF-I exhibits species-specific DNA binding and terminates transcription both in cell-free transcription assays and in transfection experiments. Chimeric constructs of mouse TTF-I and human TTF-I reveal that the major determinant for species-specific DNA binding resides within the C terminus of TTF-I. Replacing 31 C-terminal amino acids of mouse TTF-I with the homologous human sequences relaxes the DNA-binding specificity and, as a consequence, allows the chimeric factor to bind the human terminator sequence and to specifically stop rDNA transcription. Images Fig. 2 Fig. 3 Fig. 4 PMID:7597036
Flexible DNA binding of the BTB/POZ-domain protein FBI-1.

PubMed

Pessler, Frank; Hernandez, Nouria

2003-08-01

POZ-domain transcription factors are characterized by the presence of a protein-protein interaction domain called the POZ or BTB domain at their N terminus and zinc fingers at their C terminus. Despite the large number of POZ-domain transcription factors that have been identified to date and the significant insights that have been gained into their cellular functions, relatively little is known about their DNA binding properties. FBI-1 is a BTB/POZ-domain protein that has been shown to modulate HIV-1 Tat trans-activation and to repress transcription of some cellular genes. We have used various viral and cellular FBI-1 binding sites to characterize the interaction of a POZ-domain protein with DNA in detail. We find that FBI-1 binds to inverted sequence repeats downstream of the HIV-1 transcription start site. Remarkably, it binds efficiently to probes carrying these repeats in various orientations and spacings with no particular rotational alignment, indicating that its interaction with DNA is highly flexible. Indeed, FBI-1 binding sites in the adenovirus 2 major late promoter, the c-fos gene, and the c-myc P1 and P2 promoters reveal variously spaced direct, inverted, and everted sequence repeats with the consensus sequence G(A/G)GGG(T/C)(C/T)(T/C)(C/T) for each repeat.
Novel Strategy for Discrimination of Transcription Factor Binding Motifs Employing Mathematical Neural Network

NASA Astrophysics Data System (ADS)

Sugimoto, Asuka; Sumi, Takuya; Kang, Jiyoung; Tateno, Masaru

2017-07-01

Recognition in biological macromolecular systems, such as DNA-protein recognition, is one of the most crucial problems to solve toward understanding the fundamental mechanisms of various biological processes. Since specific base sequences of genome DNA are discriminated by proteins, such as transcription factors (TFs), finding TF binding motifs (TFBMs) in whole genome DNA sequences is currently a central issue in interdisciplinary biophysical and information sciences. In the present study, a novel strategy to create a discriminant function for discrimination of TFBMs by constituting mathematical neural networks (NNs) is proposed, together with a method to determine the boundary of signals (TFBMs) and noise in the NN-score (output) space. This analysis also leads to the mathematical limitation of discrimination in the recognition of features representing TFBMs, in an information geometrical manifold. Thus, the present strategy enables the identification of the whole space of TFBMs, right up to the noise boundary.
Identification of Human Lineage-Specific Transcriptional Coregulators Enabled by a Glossary of Binding Modules and Tunable Genomic Backgrounds.

PubMed

Mariani, Luca; Weinand, Kathryn; Vedenko, Anastasia; Barrera, Luis A; Bulyk, Martha L

2017-09-27

Transcription factors (TFs) control cellular processes by binding specific DNA motifs to modulate gene expression. Motif enrichment analysis of regulatory regions can identify direct and indirect TF binding sites. Here, we created a glossary of 108 non-redundant TF-8mer "modules" of shared specificity for 671 metazoan TFs from publicly available and new universal protein binding microarray data. Analysis of 239 ENCODE TF chromatin immunoprecipitation sequencing datasets and associated RNA sequencing profiles suggest the 8mer modules are more precise than position weight matrices in identifying indirect binding motifs and their associated tethering TFs. We also developed GENRE (genomically equivalent negative regions), a tunable tool for construction of matched genomic background sequences for analysis of regulatory regions. GENRE outperformed four state-of-the-art approaches to background sequence construction. We used our TF-8mer glossary and GENRE in the analysis of the indirect binding motifs for the co-occurrence of tethering factors, suggesting novel TF-TF interactions. We anticipate that these tools will aid in elucidating tissue-specific gene-regulatory programs. Copyright © 2017 Elsevier Inc. All rights reserved.
Quantifying the Effect of DNA Packaging on Gene Expression Level

NASA Astrophysics Data System (ADS)

Kim, Harold

2010-10-01

Gene expression, the process by which the genetic code comes alive in the form of proteins, is one of the most important biological processes in living cells, and begins when transcription factors bind to specific DNA sequences in the promoter region upstream of a gene. The relationship between gene expression output and transcription factor input which is termed the gene regulation function is specific to each promoter, and predicting this gene regulation function from the locations of transcription factor binding sites is one of the challenges in biology. In eukaryotic organisms (for example, animals, plants, fungi etc), DNA is highly compacted into nucleosomes, 147-bp segments of DNA tightly wrapped around histone protein core, and therefore, the accessibility of transcription factor binding sites depends on their locations with respect to nucleosomes - sites inside nucleosomes are less accessible than those outside nucleosomes. To understand how transcription factor binding sites contribute to gene expression in a quantitative manner, we obtain gene regulation functions of promoters with various configurations of transcription factor binding sites by using fluorescent protein reporters to measure transcription factor input and gene expression output in single yeast cells. In this talk, I will show that the affinity of a transcription factor binding site inside and outside the nucleosome controls different aspects of the gene regulation function, and explain this finding based on a mass-action kinetic model that includes competition between nucleosomes and transcription factors.

Comprehensive analysis of RNA-protein interactions by high-throughput sequencing-RNA affinity profiling.

PubMed

Tome, Jacob M; Ozer, Abdullah; Pagano, John M; Gheba, Dan; Schroth, Gary P; Lis, John T

2014-06-01

RNA-protein interactions play critical roles in gene regulation, but methods to quantitatively analyze these interactions at a large scale are lacking. We have developed a high-throughput sequencing-RNA affinity profiling (HiTS-RAP) assay by adapting a high-throughput DNA sequencer to quantify the binding of fluorescently labeled protein to millions of RNAs anchored to sequenced cDNA templates. Using HiTS-RAP, we measured the affinity of mutagenized libraries of GFP-binding and NELF-E-binding aptamers to their respective targets and identified critical regions of interaction. Mutations additively affected the affinity of the NELF-E-binding aptamer, whose interaction depended mainly on a single-stranded RNA motif, but not that of the GFP aptamer, whose interaction depended primarily on secondary structure.
Determining the Specificity of Cascade Binding, Interference, and Primed Adaptation In Vivo in the Escherichia coli Type I-E CRISPR-Cas System.

PubMed

Cooper, Lauren A; Stringer, Anne M; Wade, Joseph T

2018-04-17

In clustered regularly interspaced short palindromic repeat (CRISPR)-Cas (CRISPR-associated) immunity systems, short CRISPR RNAs (crRNAs) are bound by Cas proteins, and these complexes target invading nucleic acid molecules for degradation in a process known as interference. In type I CRISPR-Cas systems, the Cas protein complex that binds DNA is known as Cascade. Association of Cascade with target DNA can also lead to acquisition of new immunity elements in a process known as primed adaptation. Here, we assess the specificity determinants for Cascade-DNA interaction, interference, and primed adaptation in vivo , for the type I-E system of Escherichia coli Remarkably, as few as 5 bp of crRNA-DNA are sufficient for association of Cascade with a DNA target. Consequently, a single crRNA promotes Cascade association with numerous off-target sites, and the endogenous E. coli crRNAs direct Cascade binding to >100 chromosomal sites. In contrast to the low specificity of Cascade-DNA interactions, >18 bp are required for both interference and primed adaptation. Hence, Cascade binding to suboptimal, off-target sites is inert. Our data support a model in which the initial Cascade association with DNA targets requires only limited sequence complementarity at the crRNA 5' end whereas recruitment and/or activation of the Cas3 nuclease, a prerequisite for interference and primed adaptation, requires extensive base pairing. IMPORTANCE Many bacterial and archaeal species encode CRISPR-Cas immunity systems that protect against invasion by foreign DNA. In the Escherichia coli CRISPR-Cas system, a protein complex, Cascade, binds 61-nucleotide (nt) CRISPR RNAs (crRNAs). The Cascade complex is directed to invading DNA molecules through base pairing between the crRNA and target DNA. This leads to recruitment of the Cas3 nuclease, which destroys the invading DNA molecule and promotes acquisition of new immunity elements. We made the first in vivo measurements of Cascade binding to DNA targets. Thus, we show that Cascade binding to DNA is highly promiscuous; endogenous E. coli crRNAs can direct Cascade binding to >100 chromosomal locations. In contrast, we show that targeted degradation and acquisition of new immunity elements require highly specific association of Cascade with DNA, limiting CRISPR-Cas function to the appropriate targets. Copyright © 2018 Cooper et al.
LexA Binds to Transcription Regulatory Site of Cell Division Gene ftsZ in Toxic Cyanobacterium Microcystis aeruginosa.

PubMed

Honda, Takashi; Morimoto, Daichi; Sako, Yoshihiko; Yoshida, Takashi

2018-05-17

Previously, we showed that DNA replication and cell division in toxic cyanobacterium Microcystis aeruginosa are coordinated by transcriptional regulation of cell division gene ftsZ and that an unknown protein specifically bound upstream of ftsZ (BpFz; DNA-binding protein to an upstream site of ftsZ) during successful DNA replication and cell division. Here, we purified BpFz from M. aeruginosa strain NIES-298 using DNA-affinity chromatography and gel-slicing combined with gel electrophoresis mobility shift assay (EMSA). The N-terminal amino acid sequence of BpFz was identified as TNLESLTQ, which was identical to that of transcription repressor LexA from NIES-843. EMSA analysis using mutant probes showed that the sequence GTACTAN 3 GTGTTC was important in LexA binding. Comparison of the upstream regions of lexA in the genomes of closely related cyanobacteria suggested that the sequence TASTRNNNNTGTWC could be a putative LexA recognition sequence (LexA box). Searches for TASTRNNNNTGTWC as a transcriptional regulatory site (TRS) in the genome of M. aeruginosa NIES-843 showed that it was present in genes involved in cell division, photosynthesis, and extracellular polysaccharide biosynthesis. Considering that BpFz binds to the TRS of ftsZ during normal cell division, LexA may function as a transcriptional activator of genes related to cell reproduction in M. aeruginosa, including ftsZ. This may be an example of informality in the control of bacterial cell division.
Single-molecule FRET studies of the cooperative and non-cooperative binding kinetics of the bacteriophage T4 single-stranded DNA binding protein (gp32) to ssDNA lattices at replication fork junctions

PubMed Central

Lee, Wonbae; Gillies, John P.; Jose, Davis; Israels, Brett A.; von Hippel, Peter H.; Marcus, Andrew H.

2016-01-01

Gene 32 protein (gp32) is the single-stranded (ss) DNA binding protein of the bacteriophage T4. It binds transiently and cooperatively to ssDNA sequences exposed during the DNA replication process and regulates the interactions of the other sub-assemblies of the replication complex during the replication cycle. We here use single-molecule FRET techniques to build on previous thermodynamic studies of gp32 binding to initiate studies of the dynamics of the isolated and cooperative binding of gp32 molecules within the replication complex. DNA primer/template (p/t) constructs are used as models to determine the effects of ssDNA lattice length, gp32 concentration, salt concentration, binding cooperativity and binding polarity at p/t junctions. Hidden Markov models (HMMs) and transition density plots (TDPs) are used to characterize the dynamics of the multi-step assembly pathway of gp32 at p/t junctions of differing polarity, and show that isolated gp32 molecules bind to their ssDNA targets weakly and dissociate quickly, while cooperatively bound dimeric or trimeric clusters of gp32 bind much more tightly, can ‘slide’ on ssDNA sequences, and exhibit binding dynamics that depend on p/t junction polarities. The potential relationships of these binding dynamics to interactions with other components of the T4 DNA replication complex are discussed. PMID:27694621
The Staphylococcus aureus pSK41 plasmid-encoded ArtA protein is a master regulator of plasmid transmission genes and contains a RHH motif used in alternate DNA-binding modes.

PubMed

Ni, Lisheng; Jensen, Slade O; Ky Tonthat, Nam; Berg, Tracey; Kwong, Stephen M; Guan, Fiona H X; Brown, Melissa H; Skurray, Ronald A; Firth, Neville; Schumacher, Maria A

2009-11-01

Plasmids harbored by Staphylococcus aureus are a major contributor to the spread of bacterial multi-drug resistance. Plasmid conjugation and partition are critical to the dissemination and inheritance of such plasmids. Here, we demonstrate that the ArtA protein encoded by the S. aureus multi-resistance plasmid pSK41 is a global transcriptional regulator of pSK41 genes, including those involved in conjugation and segregation. ArtA shows no sequence homology to any structurally characterized DNA-binding protein. To elucidate the mechanism by which it specifically recognizes its DNA site, we obtained the structure of ArtA bound to its cognate operator, ACATGACATG. The structure reveals that ArtA is representative of a new family of ribbon-helix-helix (RHH) DNA-binding proteins that contain extended, N-terminal basic motifs. Strikingly, unlike most well-studied RHH proteins ArtA binds its cognate operators as a dimer. However, we demonstrate that it is also able to recognize an atypical operator site by binding as a dimer-of-dimers and the extended N-terminal regions of ArtA were shown to be essential for this dimer-of-dimer binding mode. Thus, these data indicate that ArtA is a master regulator of genes critical for both horizontal and vertical transmission of pSK41 and that it can recognize DNA utilizing alternate binding modes.
The Staphylococcus aureus pSK41 plasmid-encoded ArtA protein is a master regulator of plasmid transmission genes and contains a RHH motif used in alternate DNA-binding modes

PubMed Central

Ni, Lisheng; Jensen, Slade O.; Ky Tonthat, Nam; Berg, Tracey; Kwong, Stephen M.; Guan, Fiona H. X.; Brown, Melissa H.; Skurray, Ronald A.; Firth, Neville; Schumacher, Maria A.

2009-01-01

Plasmids harbored by Staphylococcus aureus are a major contributor to the spread of bacterial multi-drug resistance. Plasmid conjugation and partition are critical to the dissemination and inheritance of such plasmids. Here, we demonstrate that the ArtA protein encoded by the S. aureus multi-resistance plasmid pSK41 is a global transcriptional regulator of pSK41 genes, including those involved in conjugation and segregation. ArtA shows no sequence homology to any structurally characterized DNA-binding protein. To elucidate the mechanism by which it specifically recognizes its DNA site, we obtained the structure of ArtA bound to its cognate operator, ACATGACATG. The structure reveals that ArtA is representative of a new family of ribbon–helix–helix (RHH) DNA-binding proteins that contain extended, N-terminal basic motifs. Strikingly, unlike most well-studied RHH proteins ArtA binds its cognate operators as a dimer. However, we demonstrate that it is also able to recognize an atypical operator site by binding as a dimer-of-dimers and the extended N-terminal regions of ArtA were shown to be essential for this dimer-of-dimer binding mode. Thus, these data indicate that ArtA is a master regulator of genes critical for both horizontal and vertical transmission of pSK41 and that it can recognize DNA utilizing alternate binding modes. PMID:19759211
DNA condensing effects and sequence selectivity of DNA binding of antitumor noncovalent polynuclear platinum complexes.

PubMed

Malina, Jaroslav; Farrell, Nicholas P; Brabec, Viktor

2014-02-03

The noncovalent analogues of antitumor polynuclear platinum complexes represent a structurally discrete class of platinum drugs. Their chemical and biological properties differ significantly from those of most platinum chemotherapeutics, which bind to DNA in a covalent manner by formation of Pt-DNA adducts. In spite of the fact that these noncovalent polynuclear platinum complexes contain no leaving groups, they have been shown to bind to DNA with high affinity. We report here on the DNA condensation properties of a series of noncovalent analogues of antitumor polynuclear platinum complexes described by biophysical and biochemical methods. The results demonstrate that these polynuclear platinum compounds are capable of inducing DNA condensation at more than 1 order of magnitude lower concentrations than conventional spermine. Atomic force microscopy studies of DNA condensation confined to a mica substrate have revealed that the DNA morphologies become more compact with increasing concentration of the platinum complexes. Moreover, we also found that the noncovalent polynuclear platinum complex [{Pt(NH3)3}2-μ-{trans-Pt(NH3)2(NH2(CH2)6NH2)2}](6+) (TriplatinNC-A) binds to DNA in a sequence-dependent manner, namely, to A/T-rich sequences and A-tract regions, and that noncovalent polynuclear platinum complexes protect DNA from enzymatic cleavage by DNase I. The results suggest that mechanisms of antitumor and cytotoxic activities of these complexes may be associated with their unique ability to condense DNA along with their sequence-specific DNA binding. Owing to their high cellular accumulation, it is also reasonable to suggest that their mechanism of action is based on the competition with naturally occurring DNA condensing agents, such as polyamines spermine, spermidine, and putrescine, for intracellular binding sites, resulting in the disturbance of the correct binding of regulatory proteins initiating the onset of apoptosis.
Saccharomyces cerevisiae MSH2-MSH3 and MSH2-MSH6 complexes display distinct requirements for DNA binding Domain I in mismatch recognition.

PubMed Central

Lee, Susan D.; Surtees, Jennifer A.; Alani, Eric

2007-01-01

In eukaryotic mismatch repair (MMR) MSH2-MSH6 initiates the repair of base-base and small insertion/deletion mismatches while MSH2-MSH3 repairs larger insertion/deletion mismatches. In this study we showed that the msh2Δ1 mutation, containing a complete deletion of the conserved mismatch recognition Domain I of MSH2, conferred a separation of function phenotype with respect to MSH2-MSH3 and MSH2-MSH6 functions. Strains bearing the msh2Δ1 mutation were nearly wild-type in MSH2-MSH6-mediated MMR and in suppressing recombination between DNA sequences predicted to form mismatches recognized by MSH2-MSH6. However, these strains were completely defective in MSH2-MSH3-mediated MMR and recombination functions. This information encouraged us to analyze the contributions of Domain I to the mismatch binding specificity of MSH2-MSH3 in genetic and biochemical assays. We found that Domain I in MSH2 contributed a non-specific DNA binding activity while Domain I of MSH3 appeared important for mismatch binding specificity and for suppressing non-specific DNA-binding. These observations reveal distinct requirements for the MSH2 DNA binding Domain I in the repair of DNA mismatches and suggest that the binding of MSH2-MSH3 to mismatch DNA involves protein-DNA contacts that appear very different from those required for MSH2-MSH6 mismatch binding. PMID:17157869
Saccharomyces cerevisiae MSH2-MSH3 and MSH2-MSH6 complexes display distinct requirements for DNA binding domain I in mismatch recognition.

PubMed

Lee, Susan D; Surtees, Jennifer A; Alani, Eric

2007-02-09

In eukaryotic mismatch repair (MMR) MSH2-MSH6 initiates the repair of base-base and small insertion/deletion mismatches while MSH2-MSH3 repairs larger insertion/deletion mismatches. Here, we show that the msh2Delta1 mutation, containing a complete deletion of the conserved mismatch recognition domain I of MSH2, conferred a separation of function phenotype with respect to MSH2-MSH3 and MSH2-MSH6 functions. Strains bearing the msh2Delta1 mutation were nearly wild-type in MSH2-MSH6-mediated MMR and in suppressing recombination between DNA sequences predicted to form mismatches recognized by MSH2-MSH6. However, these strains were completely defective in MSH2-MSH3-mediated MMR and recombination functions. This information encouraged us to analyze the contributions of domain I to the mismatch binding specificity of MSH2-MSH3 in genetic and biochemical assays. We found that domain I in MSH2 contributed a non-specific DNA binding activity while domain I of MSH3 appeared important for mismatch binding specificity and for suppressing non-specific DNA binding. These observations reveal distinct requirements for the MSH2 DNA binding domain I in the repair of DNA mismatches and suggest that the binding of MSH2-MSH3 to mismatch DNA involves protein-DNA contacts that appear very different from those required for MSH2-MSH6 mismatch binding.
Circadian clock protein KaiC forms ATP-dependent hexameric rings and binds DNA

PubMed Central

Mori, Tetsuya; Saveliev, Sergei V.; Xu, Yao; Stafford, Walter F.; Cox, Michael M.; Inman, Ross B.; Johnson, Carl H.

2002-01-01

KaiC from Synechococcus elongatus PCC 7942 (KaiC) is an essential circadian clock protein in cyanobacteria. Previous sequence analyses suggested its inclusion in the RecA/DnaB superfamily. A characteristic of the proteins of this superfamily is that they form homohexameric complexes that bind DNA. We show here that KaiC also forms ring complexes with a central pore that can be visualized by electron microscopy. A combination of analytical ultracentrifugation and chromatographic analyses demonstrates that these complexes are hexameric. The association of KaiC molecules into hexamers depends on the presence of ATP. The KaiC sequence does not include the obvious DNA-binding motifs found in RecA or DnaB. Nevertheless, KaiC binds forked DNA substrates. These data support the inclusion of KaiC into the RecA/DnaB superfamily and have important implications for enzymatic activity of KaiC in the circadian clock mechanism that regulates global changes in gene expression patterns. PMID:12477935
Genome-wide profiling of DNA-binding proteins using barcode-based multiplex Solexa sequencing.

PubMed

Raghav, Sunil Kumar; Deplancke, Bart

2012-01-01

Chromatin immunoprecipitation (ChIP) is a commonly used technique to detect the in vivo binding of proteins to DNA. ChIP is now routinely paired to microarray analysis (ChIP-chip) or next-generation sequencing (ChIP-Seq) to profile the DNA occupancy of proteins of interest on a genome-wide level. Because ChIP-chip introduces several biases, most notably due to the use of a fixed number of probes, ChIP-Seq has quickly become the method of choice as, depending on the sequencing depth, it is more sensitive, quantitative, and provides a greater binding site location resolution. With the ever increasing number of reads that can be generated per sequencing run, it has now become possible to analyze several samples simultaneously while maintaining sufficient sequence coverage, thus significantly reducing the cost per ChIP-Seq experiment. In this chapter, we provide a step-by-step guide on how to perform multiplexed ChIP-Seq analyses. As a proof-of-concept, we focus on the genome-wide profiling of RNA Polymerase II as measuring its DNA occupancy at different stages of any biological process can provide insights into the gene regulatory mechanisms involved. However, the protocol can also be used to perform multiplexed ChIP-Seq analyses of other DNA-binding proteins such as chromatin modifiers and transcription factors.
Redesigning the specificity of protein-DNA interactions with Rosetta.

PubMed

Thyme, Summer; Baker, David

2014-01-01

Building protein tools that can selectively bind or cleave specific DNA sequences requires efficient technologies for modifying protein-DNA interactions. Computational design is one method for accomplishing this goal. In this chapter, we present the current state of protein-DNA interface design with the Rosetta macromolecular modeling program. The LAGLIDADG endonuclease family of DNA-cleaving enzymes, under study as potential gene therapy reagents, has been the main testing ground for these in silico protocols. At this time, the computational methods are most useful for designing endonuclease variants that can accommodate small numbers of target site substitutions. Attempts to engineer for more extensive interface changes will likely benefit from an approach that uses the computational design results in conjunction with a high-throughput directed evolution or screening procedure. The family of enzymes presents an engineering challenge because their interfaces are highly integrated and there is significant coordination between the binding and catalysis events. Future developments in the computational algorithms depend on experimental feedback to improve understanding and modeling of these complex enzymatic features. This chapter presents both the basic method of design that has been successfully used to modulate specificity and more advanced procedures that incorporate DNA flexibility and other properties that are likely necessary for reliable modeling of more extensive target site changes.
Regulation of Active DNA Demethylation by a Methyl-CpG-Binding Domain Protein in Arabidopsis thaliana

PubMed Central

Sun, Han; Zeng, Jun; Cao, Zhendong; Li, Yan; Qian, Weiqiang

2015-01-01

Active DNA demethylation plays crucial roles in the regulation of gene expression in both plants and animals. In Arabidopsis thaliana, active DNA demethylation is initiated by the ROS1 subfamily of 5-methylcytosine-specific DNA glycosylases via a base excision repair mechanism. Recently, IDM1 and IDM2 were shown to be required for the recruitment of ROS1 to some of its target loci. However, the mechanism(s) by which IDM1 is targeted to specific genomic loci remains to be determined. Affinity purification of IDM1- and IDM2- associating proteins demonstrated that IDM1 and IDM2 copurify together with two novel components, methyl-CpG-binding domain protein 7 (MBD7) and IDM2-like protein 1 (IDL1). IDL1 encodes an α-crystallin domain protein that shows high sequence similarity with IDM2. MBD7 interacts with IDM2 and IDL1 in vitro and in vivo and they form a protein complex associating with IDM1 in vivo. MBD7 directly binds to the target loci and is required for the H3K18 and H3K23 acetylation in planta. MBD7 dysfunction causes DNA hypermethylation and silencing of reporter genes and a subset of endogenous genes. Our results suggest that a histone acetyltransferase complex functions in active DNA demethylation and in suppression of gene silencing at some loci in Arabidopsis. PMID:25933434
Deciphering the genomic targets of alkylating polyamide conjugates using high-throughput sequencing

PubMed Central

Chandran, Anandhakumar; Syed, Junetha; Taylor, Rhys D.; Kashiwazaki, Gengo; Sato, Shinsuke; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi

2016-01-01

Chemically engineered small molecules targeting specific genomic sequences play an important role in drug development research. Pyrrole-imidazole polyamides (PIPs) are a group of molecules that can bind to the DNA minor-groove and can be engineered to target specific sequences. Their biological effects rely primarily on their selective DNA binding. However, the binding mechanism of PIPs at the chromatinized genome level is poorly understood. Herein, we report a method using high-throughput sequencing to identify the DNA-alkylating sites of PIP-indole-seco-CBI conjugates. High-throughput sequencing analysis of conjugate 2 showed highly similar DNA-alkylating sites on synthetic oligos (histone-free DNA) and on human genomes (chromatinized DNA context). To our knowledge, this is the first report identifying alkylation sites across genomic DNA by alkylating PIP conjugates using high-throughput sequencing. PMID:27098039
An Evolutionary/Biochemical Connection Between Promoter- and Primer-Dependent Polymerases Revealed by Selective Evolution of Ligands by Exponential Enrichment (SELEX).

PubMed

Fenstermacher, Katherine J; Achuthan, Vasudevan; Schneider, Thomas D; DeStefano, Jeffrey J

2018-01-16

DNA polymerases (DNAPs) recognize 3' recessed termini on duplex DNA and carry out nucleotide catalysis. Unlike promoter-specific RNA polymerases (RNAPs), no sequence specificity is required for binding or initiation of catalysis. Despite this, previous results indicate that viral reverse transcriptases bind much more tightly to DNA primers that mimic the polypurine tract. In the current report, primer sequences that bind with high affinity to Taq and Klenow polymerases were identified using a modified Selective Evolution of Ligands by Exponential Enrichment (SELEX) approach. Two Taq -specific primers that bound ∼10 (Taq1) and over 100 (Taq2) times more stably than controls to Taq were identified. Taq1 contained 8 nucleotides (5' -CACTAAAG-3') that matched the phage T3 RNAP "core" promoter. Both primers dramatically outcompeted primers with similar binding thermodynamics in PCR reactions. Similarly, exonuclease minus Klenow polymerase also selected a high affinity primer that contained a related core promoter sequence from phage T7 RNAP (5' -ACTATAG-3'). For both Taq and Klenow, even small modifications to the sequence resulted in large losses in binding affinity suggesting that binding was highly sequence-specific. The results are discussed in the context of possible effects on multi-primer (multiplex) PCR assays, molecular information theory, and the evolution of RNAPs and DNAPs. Importance This work further demonstrates that primer-dependent DNA polymerases can have strong sequence biases leading to dramatically tighter binding to specific sequences. These may be related to biological function, or be a consequences of the structural architecture of the enzyme. New sequence specificity for Taq and Klenow polymerases were uncovered and among them were sequences that contained the core promoter elements from T3 and T7 phage RNA polymerase promoters. This suggests the intriguing possibility that phage RNA polymerases exploited intrinsic binding affinities of ancestral DNA polymerases to develop their promotors. Conversely, DNA polymerases could have evolved from related RNA polymerases and retained the intrinsic binding preference despite there being no clear function for such a preference in DNA biology. Copyright © 2018 American Society for Microbiology.
A map of human PRDM9 binding provides evidence for novel behaviors of PRDM9 and other zinc-finger proteins in meiosis

PubMed Central

Noor, Nudrat; Bitoun, Emmanuelle; Tumian, Afidalina; Imbeault, Michael; Chapman, J Ross; Aricescu, A Radu

2017-01-01

PRDM9 binding localizes almost all meiotic recombination sites in humans and mice. However, most PRDM9-bound loci do not become recombination hotspots. To explore factors that affect binding and subsequent recombination outcomes, we mapped human PRDM9 binding sites in a transfected human cell line and measured PRDM9-induced histone modifications. These data reveal varied DNA-binding modalities of PRDM9. We also find that human PRDM9 frequently binds promoters, despite their low recombination rates, and it can activate expression of a small number of genes including CTCFL and VCX. Furthermore, we identify specific sequence motifs that predict consistent, localized meiotic recombination suppression around a subset of PRDM9 binding sites. These motifs strongly associate with KRAB-ZNF protein binding, TRIM28 recruitment, and specific histone modifications. Finally, we demonstrate that, in addition to binding DNA, PRDM9's zinc fingers also mediate its multimerization, and we show that a pair of highly diverged alleles preferentially form homo-multimers. PMID:29072575
Interactions of Ku70/80 with Double-Strand DNA: Energetic, Dynamics, and Functional Implications

NASA Technical Reports Server (NTRS)

Hu, Shaowen; Cucinotta, Francis A.

2010-01-01

Space radiation is a proficient inducer of DNA damage leading to mutation, aberrant cell signaling, and cancer formation. Ku is among the first responding proteins in nucleus to recognize and bind the DNA double strand breaks (DSBs) whenever they are introduced. Once loaded Ku works as a scaffold to recruit other repair factors of non-homologous end joining and facilitates the following repair processes. The crystallographic study of the Ku70/80 heterodimer indicate the core structure of this protein shows virtually no conformational change after binding with DNA. To investigate the dynamical features as well as the energetic characteristics of Ku-DNA binding, we conduct multi-nanosecond molecular dynamics simulations of a modeled Ku70/80 structure and several complexes with two 24-bp DNA duplexes. Free energy calculations show significant energy differences between the complexes with Ku bound at DSBs and those with Ku associated at an internal site of a chromosome. The results also reveal detailed interactions between different nucleotides and the amino acids along the DNA-binding cradle of Ku, indicating subtle binding preference of Ku at specific DNA sequences. The covariance matrix analyses along the trajectories demonstrate the protein is stimulated to undergo correlated motions of different domains once bound to DNA ends. Additionally, principle component analyses identify these low frequency collective motions suitable for binding with and translocation along duplex DNA. It is proposed that the modification of dynamical properties of Ku upon binding with DSBs may provide a signal for the further recruitment of other repair factors such as DNA-PKcs, XLF, and XRCC4.
An SRY mutation causing human sex reversal resolves a general mechanism of structure-specific DNA recognition: application to the four-way DNA junction.

PubMed

Peters, R; King, C Y; Ukiyama, E; Falsafi, S; Donahoe, P K; Weiss, M A

1995-04-11

SRY, a genetic "master switch" for male development in mammals, exhibits two biochemical activities: sequence-specific recognition of duplex DNA and sequence-independent binding to the sharp angles of four-way DNA junctions. Here, we distinguish between these activities by analysis of a mutant SRY associated with human sex reversal (46, XY female with pure gonadal dysgenesis). The substitution (168T in human SRY) alters a nonpolar side chain in the minor-groove DNA recognition alpha-helix of the HMG box [Haqq, C.M., King, C.-Y., Ukiyama, E., Haqq, T.N., Falsalfi, S., Donahoe, P.K., & Weiss, M.A. (1994) Science 266, 1494-1500]. The native (but not mutant) side chain inserts between specific base pairs in duplex DNA, interrupting base stacking at a site of induced DNA bending. Isotope-aided 1H-NMR spectroscopy demonstrates that analogous side-chain insertion occurs on binding of SRY to a four-way junction, establishing a shared mechanism of sequence- and structure-specific DNA binding. Although the mutant DNA-binding domain exhibits > 50-fold reduction in sequence-specific DNA recognition, near wild-type affinity for four-way junctions is retained. Our results (i) identify a shared SRY-DNA contact at a site of either induced or intrinsic DNA bending, (ii) demonstrate that this contact is not required to bind an intrinsically bent DNA target, and (iii) rationalize patterns of sequence conservation or diversity among HMG boxes. Clinical association of the I68T mutation with human sex reversal supports the hypothesis that specific DNA recognition by SRY is required for male sex determination.
Drosophila Uri, a PP1α binding protein, is essential for viability, maintenance of DNA integrity and normal transcriptional activity

PubMed Central

Kirchner, Jasmin; Vissi, Emese; Gross, Sascha; Szoor, Balazs; Rudenko, Andrey; Alphey, Luke; White-Cooper, Helen

2008-01-01

Background Protein phosphatase 1 (PP1) is involved in diverse cellular processes, and is targeted to substrates via interaction with many different protein binding partners. PP1 catalytic subunits (PP1c) fall into PP1α and PP1β subfamilies based on sequence analysis, however very few PP1c binding proteins have been demonstrated to discriminate between PP1α and PP1β. Results URI (unconventional prefoldin RPB5 interactor) is a conserved molecular chaperone implicated in a variety of cellular processes, including the transcriptional response to nutrient signalling and maintenance of DNA integrity. We show that Drosophila Uri binds PP1α with much higher affinity than PP1β, and that this ability to discriminate between PP1c forms is conserved to humans. Most Uri is cytoplasmic, however we found some protein associated with active RNAPII on chromatin. We generated a uri loss of function allele, and show that uri is essential for viability in Drosophila. uri mutants have transcriptional defects, reduced cell viability and differentiation in the germline, and accumulate DNA damage in their nuclei. Conclusion Uri is the first PP1α specific binding protein to be described in Drosophila. Uri protein plays a role in transcriptional regulation. Activity of uri is required to maintain DNA integrity and cell survival in normal development. PMID:18412953
The Reconstruction of Condition-Specific Transcriptional Modules Provides New Insights in the Evolution of Yeast AP-1 Proteins

PubMed Central

Goudot, Christel; Etchebest, Catherine

2011-01-01

AP-1 proteins are transcription factors (TFs) that belong to the basic leucine zipper family, one of the largest families of TFs in eukaryotic cells. Despite high homology between their DNA binding domains, these proteins are able to recognize diverse DNA motifs. In yeasts, these motifs are referred as YRE (Yap Response Element) and are either seven (YRE-Overlap) or eight (YRE-Adjacent) base pair long. It has been proposed that the AP-1 DNA binding motif preference relies on a single change in the amino acid sequence of the yeast AP-1 TFs (an arginine in the YRE-O binding factors being replaced by a lysine in the YRE-A binding Yaps). We developed a computational approach to infer condition-specific transcriptional modules associated to the orthologous AP-1 protein Yap1p, Cgap1p and Cap1p, in three yeast species: the model yeast Saccharomyces cerevisiae and two pathogenic species Candida glabrata and Candida albicans. Exploitation of these modules in terms of predictions of the protein/DNA regulatory interactions changed our vision of AP-1 protein evolution. Cis-regulatory motif analyses revealed the presence of a conserved adenine in 5′ position of the canonical YRE sites. While Yap1p, Cgap1p and Cap1p shared a remarkably low number of target genes, an impressive conservation was observed in the YRE sequences identified by Yap1p and Cap1p. In Candida glabrata, we found that Cgap1p, unlike Yap1p and Cap1p, recognizes YRE-O and YRE-A motifs. These findings were supported by structural data available for the transcription factor Pap1p (Schizosaccharomyces pombe). Thus, whereas arginine and lysine substitutions in Cgap1p and Yap1p proteins were reported as responsible for a specific YRE-O or YRE-A preference, our analyses rather suggest that the ancestral yeast AP-1 protein could recognize both YRE-O and YRE-A motifs and that the arginine/lysine exchange is not the only determinant of the specialization of modern Yaps for one motif or another. PMID:21695268

Sequence-specific DNA binding Pyrrole-imidazole polyamides and their applications.

PubMed

Kawamoto, Yusuke; Bando, Toshikazu; Sugiyama, Hiroshi

2018-05-01

Pyrrole-imidazole polyamides (Py-Im polyamides) are cell-permeable compounds that bind to the minor groove of double-stranded DNA in a sequence-specific manner without causing denaturation of the DNA. These compounds can be used to control gene expression and to stain specific sequences in cells. Here, we review the history, structural variations, and functional investigations of Py-Im polyamides. Copyright © 2018 Elsevier Ltd. All rights reserved.
DNA Photo Lithography with Cinnamate-based Photo-Bio-Nano-Glue

NASA Astrophysics Data System (ADS)

Feng, Lang; Li, Minfeng; Romulus, Joy; Sha, Ruojie; Royer, John; Wu, Kun-Ta; Xu, Qin; Seeman, Nadrian; Weck, Marcus; Chaikin, Paul

2013-03-01

We present a technique to make patterned functional surfaces, using a cinnamate photo cross-linker and photolithography. We have designed and modified a complementary set of single DNA strands to incorporate a pair of opposing cinnamate molecules. On exposure to 360nm UV, the cinnamate makes a highly specific covalent bond permanently linking only the complementary strands containing the cinnamates. We have studied this specific and efficient crosslinking with cinnamate-containing DNA in solution and on particles. UV addressability allows us to pattern surfaces functionally. The entire surface is coated with a DNA sequence A incorporating cinnamate. DNA strands A'B with one end containing a complementary cinnamated sequence A' attached to another sequence B, are then hybridized to the surface. UV photolithography is used to bind the A'B strand in a specific pattern. The system is heated and the unbound DNA is washed away. The pattern is then observed by thermo-reversibly hybridizing either fluorescently dyed B' strands complementary to B, or colloids coated with B' strands. Our techniques can be used to reversibly and/or permanently bind, via DNA linkers, an assortment of molecules, proteins and nanostructures. Potential applications range from advanced self-assembly, such as templated self-replication schemes recently reported, to designed physical and chemical patterns, to high-resolution multi-functional DNA surfaces for genetic detection or DNA computing.
A TATA binding protein mutant with increased affinity for DNA directs transcription from a reversed TATA sequence in vivo.

PubMed

Spencer, J Vaughn; Arndt, Karen M

2002-12-01

The TATA-binding protein (TBP) nucleates the assembly and determines the position of the preinitiation complex at RNA polymerase II-transcribed genes. We investigated the importance of two conserved residues on the DNA binding surface of Saccharomyces cerevisiae TBP to DNA binding and sequence discrimination. Because they define a significant break in the twofold symmetry of the TBP-TATA interface, Ala100 and Pro191 have been proposed to be key determinants of TBP binding orientation and transcription directionality. In contrast to previous predictions, we found that substitution of an alanine for Pro191 did not allow recognition of a reversed TATA box in vivo; however, the reciprocal change, Ala100 to proline, resulted in efficient utilization of this and other variant TATA sequences. In vitro assays demonstrated that TBP mutants with the A100P and P191A substitutions have increased and decreased affinity for DNA, respectively. The TATA binding defect of TBP with the P191A mutation could be intragenically suppressed by the A100P substitution. Our results suggest that Ala100 and Pro191 are important for DNA binding and sequence recognition by TBP, that the naturally occurring asymmetry of Ala100 and Pro191 is not essential for function, and that a single amino acid change in TBP can lead to elevated DNA binding affinity and recognition of a reversed TATA sequence.
Recognition of Local DNA Structures by p53 Protein

PubMed Central

Brázda, Václav; Coufal, Jan

2017-01-01

p53 plays critical roles in regulating cell cycle, apoptosis, senescence and metabolism and is commonly mutated in human cancer. These roles are achieved by interaction with other proteins, but particularly by interaction with DNA. As a transcription factor, p53 is well known to bind consensus target sequences in linear B-DNA. Recent findings indicate that p53 binds with higher affinity to target sequences that form cruciform DNA structure. Moreover, p53 binds very tightly to non-B DNA structures and local DNA structures are increasingly recognized to influence the activity of wild-type and mutant p53. Apart from cruciform structures, p53 binds to quadruplex DNA, triplex DNA, DNA loops, bulged DNA and hemicatenane DNA. In this review, we describe local DNA structures and summarize information about interactions of p53 with these structural DNA motifs. These recent data provide important insights into the complexity of the p53 pathway and the functional consequences of wild-type and mutant p53 activation in normal and tumor cells. PMID:28208646
May the Best Molecule Win: Competition ESI Mass Spectrometry

PubMed Central

Laughlin, Sarah; Wilson, W. David

2015-01-01

Electrospray ionization mass spectrometry has become invaluable in the characterization of macromolecular biological systems such as nucleic acids and proteins. Recent advances in the field of mass spectrometry and the soft conditions characteristic of electrospray ionization allow for the investigation of non-covalent interactions among large biomolecules and ligands. Modulation of genetic processes through the use of small molecule inhibitors with the DNA minor groove is gaining attention as a potential therapeutic approach. In this review, we discuss the development of a competition method using electrospray ionization mass spectrometry to probe the interactions of multiple DNA sequences with libraries of minor groove binding molecules. Such an approach acts as a high-throughput screening method to determine important information including the stoichiometry, binding mode, cooperativity, and relative binding affinity. In addition to small molecule-DNA complexes, we highlight other applications in which competition mass spectrometry has been used. A competitive approach to simultaneously investigate complex interactions promises to be a powerful tool in the discovery of small molecule inhibitors with high specificity and for specific, important DNA sequences. PMID:26501262
Bullied no more:when and how DNA shoves proteins around

PubMed Central

Pettitt, B. Montgomery; Sumners, De Witt L.; Harris, Sarah A.; Zechiedrich, Lynn

2016-01-01

The predominant protein-centric perspective in protein–DNA-binding studies assumes that the protein drives the interaction. Research focuses on protein structural motifs, electrostatic surfaces and contact potentials, while DNA is often ignored as a passive polymer to be manipulated. Recent studies of DNA topology, the supercoiling, knotting, and linking of the helices, have shown that DNA has the capability to be an active participant in its transactions. DNA topology-induced structural and geometric changes can drive, or at least strongly influence, the interactions between protein and DNA. Deformations of the B-form structure arise from both the considerable elastic energy arising from supercoiling and from the electrostatic energy. Here, we discuss how these energies are harnessed for topology-driven, sequence-specific deformations that can allow DNA to direct its own metabolism. PMID:22850561
Cloning of a cDNA encoding bovine mitochondrial NADP(+)-specific isocitrate dehydrogenase and structural comparison with its isoenzymes from different species.

PubMed Central

Huh, T L; Ryu, J H; Huh, J W; Sung, H C; Oh, I U; Song, B J; Veech, R L

1993-01-01

Mitochondrial NADP(+)-specific isocitrate dehydrogenase (IDP) was co-purified with the pyruvate dehydrogenase complex from bovine kidney mitochondria. The determination of its N-terminal 16-amino-acid sequence revealed that it is highly similar to the IDP from yeast. A cDNA clone (1.8 kb long) encoding this protein was isolated from a bovine kidney lambda gt11 cDNA library using a synthetic oligodeoxynucleotide. The deduced protein sequence of this cDNA clone rendered a precursor protein of 452 amino-acid residues (50,830 Da) and a mature protein of 413 amino-acid residues (46,519 Da). It is 100% identical to the internal tryptic peptide sequences of the autologous form from pig heart and 62% similar to that from yeast. However, it shares little similarity with the mitochondrial NAD(+)-specific isoenzyme from yeast. Structural analyses of the deduced proteins of IDP isoenzymes from different species indicated that similarity exists in certain regions, which may represent the common domains for the active sites or coenzyme-binding sites. In Northern-blot analysis, one species of mRNA (about 2.2 kb for both bovine and human) was hybridized with a 32P-labelled cDNA probe. Southern-blot analysis of genomic DNAs verified simple patterns of hybridization with this cDNA. These results strongly indicate that the mitochondrial IDP may be derived from a single gene family which does not appear to be closely related to that of the NAD(+)-specific isoenzyme. Images Figure 1 Figure 3 Figure 4 Figure 5 PMID:8318002
Thermodynamics-Based Models of Transcriptional Regulation by Enhancers: The Roles of Synergistic Activation, Cooperative Binding and Short-Range Repression

PubMed Central

He, Xin; Samee, Md. Abul Hassan; Blatti, Charles; Sinha, Saurabh

2010-01-01

Quantitative models of cis-regulatory activity have the potential to improve our mechanistic understanding of transcriptional regulation. However, the few models available today have been based on simplistic assumptions about the sequences being modeled, or heuristic approximations of the underlying regulatory mechanisms. We have developed a thermodynamics-based model to predict gene expression driven by any DNA sequence, as a function of transcription factor concentrations and their DNA-binding specificities. It uses statistical thermodynamics theory to model not only protein-DNA interaction, but also the effect of DNA-bound activators and repressors on gene expression. In addition, the model incorporates mechanistic features such as synergistic effect of multiple activators, short range repression, and cooperativity in transcription factor-DNA binding, allowing us to systematically evaluate the significance of these features in the context of available expression data. Using this model on segmentation-related enhancers in Drosophila, we find that transcriptional synergy due to simultaneous action of multiple activators helps explain the data beyond what can be explained by cooperative DNA-binding alone. We find clear support for the phenomenon of short-range repression, where repressors do not directly interact with the basal transcriptional machinery. We also find that the binding sites contributing to an enhancer's function may not be conserved during evolution, and a noticeable fraction of these undergo lineage-specific changes. Our implementation of the model, called GEMSTAT, is the first publicly available program for simultaneously modeling the regulatory activities of a given set of sequences. PMID:20862354
DNA Shape Dominates Sequence Affinity in Nucleosome Formation

NASA Astrophysics Data System (ADS)

Freeman, Gordon S.; Lequieu, Joshua P.; Hinckley, Daniel M.; Whitmer, Jonathan K.; de Pablo, Juan J.

2014-10-01

Nucleosomes provide the basic unit of compaction in eukaryotic genomes, and the mechanisms that dictate their position at specific locations along a DNA sequence are of central importance to genetics. In this Letter, we employ molecular models of DNA and proteins to elucidate various aspects of nucleosome positioning. In particular, we show how DNA's histone affinity is encoded in its sequence-dependent shape, including subtle deviations from the ideal straight B-DNA form and local variations of minor groove width. By relying on high-precision simulations of the free energy of nucleosome complexes, we also demonstrate that, depending on DNA's intrinsic curvature, histone binding can be dominated by bending interactions or electrostatic interactions. More generally, the results presented here explain how sequence, manifested as the shape of the DNA molecule, dominates molecular recognition in the problem of nucleosome positioning.
Crystal structure of MboIIA methyltransferase.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Osipiuk, J.; Walsh, M. A.; Joachimiak, A.

2003-09-15

DNA methyltransferases (MTases) are sequence-specific enzymes which transfer a methyl group from S-adenosyl-L-methionine (AdoMet) to the amino group of either cytosine or adenine within a recognized DNA sequence. Methylation of a base in a specific DNA sequence protects DNA from nucleolytic cleavage by restriction enzymes recognizing the same DNA sequence. We have determined at 1.74 {angstrom} resolution the crystal structure of a {beta}-class DNA MTase MboIIA (M {center_dot} MboIIA) from the bacterium Moraxella bovis, the smallest DNA MTase determined to date. M {center_dot} MboIIA methylates the 3' adenine of the pentanucleotide sequence 5'-GAAGA-3'. The protein crystallizes with two molecules inmore » the asymmetric unit which we propose to resemble the dimer when M {center_dot} MboIIA is not bound to DNA. The overall structure of the enzyme closely resembles that of M {center_dot} RsrI. However, the cofactor-binding pocket in M {center_dot} MboIIA forms a closed structure which is in contrast to the open-form structures of other known MTases.« less
Overproduction, purification, and ATPase activity of the Escherichia coli RuvB protein involved in DNA repair.

PubMed Central

Iwasaki, H; Shiba, T; Makino, K; Nakata, A; Shinagawa, H

1989-01-01

The ruvA and ruvB genes of Escherichia coli constitute an operon which belongs to the SOS regulon. Genetic evidence suggests that the products of the ruv operon are involved in DNA repair and recombination. To begin biochemical characterization of these proteins, we developed a plasmid system that overproduced RuvB protein to 20% of total cell protein. Starting from the overproducing system, we purified RuvB protein. The purified RuvB protein behaved like a monomer in gel filtration chromatography and had an apparent relative molecular mass of 38 kilodaltons in sodium dodecyl sulfate-polyacrylamide gel electrophoresis, which agrees with the value predicted from the DNA sequence. The amino acid sequence of the amino-terminal region of the purified protein was analyzed, and the sequence agreed with the one deduced from the DNA sequence. Since the deduced sequence of RuvB protein contained the consensus sequence for ATP-binding proteins, we examined the ATP-binding and ATPase activities of the purified RuvB protein. RuvB protein had a stronger affinity to ADP than to ATP and weak ATPase activity. The results suggest that the weak ATPase activity of RuvB protein is at least partly due to end product inhibition by ADP. Images PMID:2529252
Solution structure of telomere binding domain of AtTRB2 derived from Arabidopsis thaliana

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yun, Ji-Hye; Lee, Won Kyung; Kim, Heeyoun

Highlights: • We have determined solution structure of Myb domain of AtTRB2. • The Myb domain of AtTRB2 is located in the N-terminal region. • The Myb domain of AtTRB2 binds to plant telomeric DNA without fourth helix. • Helix 2 and 3 of the Myb domain of AtTRB2 are involved in DNA recognition. • AtTRB2 is a novel protein distinguished from other known plant TBP. - Abstract: Telomere homeostasis is regulated by telomere-associated proteins, and the Myb domain is well conserved for telomere binding. AtTRB2 is a member of the SMH (Single-Myb-Histone)-like family in Arabidopsis thaliana, having an N-terminalmore » Myb domain, which is responsible for DNA binding. The Myb domain of AtTRB2 contains three α-helices and loops for DNA binding, which is unusual given that other plant telomere-binding proteins have an additional fourth helix that is essential for DNA binding. To understand the structural role for telomeric DNA binding of AtTRB2, we determined the solution structure of the Myb domain of AtTRB2 (AtTRB2{sub 1–64}) using nuclear magnetic resonance (NMR) spectroscopy. In addition, the inter-molecular interaction between AtTRB2{sub 1–64} and telomeric DNA has been characterized by the electrophoretic mobility shift assay (EMSA) and NMR titration analyses for both plant (TTTAGGG)n and human (TTAGGG)n telomere sequences. Data revealed that Trp28, Arg29, and Val47 residues located in Helix 2 and Helix 3 are crucial for DNA binding, which are well conserved among other plant telomere binding proteins. We concluded that although AtTRB2 is devoid of the additional fourth helix in the Myb-extension domain, it is able to bind to plant telomeric repeat sequences as well as human telomeric repeat sequences.« less
Double-stranded telomeric DNA binding proteins: Diversity matters.

PubMed

Červenák, Filip; Juríková, Katarína; Sepšiová, Regina; Neboháčová, Martina; Nosek, Jozef; Tomáška, L'ubomír

2017-01-01

Telomeric sequences constitute only a small fraction of the whole genome yet they are crucial for ensuring genomic stability. This function is in large part mediated by protein complexes recruited to telomeric sequences by specific telomere-binding proteins (TBPs). Although the principal tasks of nuclear telomeres are the same in all eukaryotes, TBPs in various taxa exhibit a surprising diversity indicating their distinct evolutionary origin. This diversity is especially pronounced in ascomycetous yeasts where they must have co-evolved with rapidly diversifying sequences of telomeric repeats. In this article we (i) provide a historical overview of the discoveries leading to the current list of TBPs binding to double-stranded (ds) regions of telomeres, (ii) describe examples of dsTBPs highlighting their diversity in even closely related species, and (iii) speculate about possible evolutionary trajectories leading to a long list of various dsTBPs fulfilling the same general role(s) in their own unique ways.
Microfluidic affinity and ChIP-seq analyses converge on a conserved FOXP2-binding motif in chimp and human, which enables the detection of evolutionarily novel targets.

PubMed

Nelson, Christopher S; Fuller, Chris K; Fordyce, Polly M; Greninger, Alexander L; Li, Hao; DeRisi, Joseph L

2013-07-01

The transcription factor forkhead box P2 (FOXP2) is believed to be important in the evolution of human speech. A mutation in its DNA-binding domain causes severe speech impairment. Humans have acquired two coding changes relative to the conserved mammalian sequence. Despite intense interest in FOXP2, it has remained an open question whether the human protein's DNA-binding specificity and chromatin localization are conserved. Previous in vitro and ChIP-chip studies have provided conflicting consensus sequences for the FOXP2-binding site. Using MITOMI 2.0 microfluidic affinity assays, we describe the binding site of FOXP2 and its affinity profile in base-specific detail for all substitutions of the strongest binding site. We find that human and chimp FOXP2 have similar binding sites that are distinct from previously suggested consensus binding sites. Additionally, through analysis of FOXP2 ChIP-seq data from cultured neurons, we find strong overrepresentation of a motif that matches our in vitro results and identifies a set of genes with FOXP2 binding sites. The FOXP2-binding sites tend to be conserved, yet we identified 38 instances of evolutionarily novel sites in humans. Combined, these data present a comprehensive portrait of FOXP2's-binding properties and imply that although its sequence specificity has been conserved, some of its genomic binding sites are newly evolved.
Proliferating cell nuclear antigen (Pcna) as a direct downstream target gene of Hoxc8

DOE Office of Scientific and Technical Information (OSTI.GOV)

Min, Hyehyun; Lee, Ji-Yeon; Bok, Jinwoong

2010-02-19

Hoxc8 is a member of Hox family transcription factors that play crucial roles in spatiotemporal body patterning during embryogenesis. Hox proteins contain a conserved 61 amino acid homeodomain, which is responsible for recognition and binding of the proteins onto Hox-specific DNA binding motifs and regulates expression of their target genes. Previously, using proteome analysis, we identified Proliferating cell nuclear antigen (Pcna) as one of the putative target genes of Hoxc8. Here, we asked whether Hoxc8 regulates Pcna expression by directly binding to the regulatory sequence of Pcna. In mouse embryos at embryonic day 11.5, the expression pattern of Pcna wasmore » similar to that of Hoxc8 along the anteroposterior body axis. Moreover, Pcna transcript levels as well as cell proliferation rate were increased by overexpression of Hoxc8 in C3H10T1/2 mouse embryonic fibroblast cells. Characterization of 2.3 kb genomic sequence upstream of Pcna coding region revealed that the upstream sequence contains several Hox core binding sequences and one Hox-Pbx binding sequence. Direct binding of Hoxc8 proteins to the Pcna regulatory sequence was verified by chromatin immunoprecipitation assay. Taken together, our data suggest that Pcna is a direct downstream target of Hoxc8.« less
Binding of sulphonated indigo derivatives to RepA-WH1 inhibits DNA-induced protein amyloidogenesis

PubMed Central

Gasset-Rosa, Fátima; Maté, María Jesús; Dávila-Fajardo, Cristina; Bravo, Jerónimo; Giraldo, Rafael

2008-01-01

The quest for inducers and inhibitors of protein amyloidogenesis is of utmost interest, since they are key tools to understand the molecular bases of proteinopathies such as Alzheimer, Parkinson, Huntington and Creutzfeldt–Jakob diseases. It is also expected that such molecules could lead to valid therapeutic agents. In common with the mammalian prion protein (PrP), the N-terminal Winged-Helix (WH1) domain of the pPS10 plasmid replication protein (RepA) assembles in vitro into a variety of amyloid nanostructures upon binding to different specific dsDNA sequences. Here we show that di- (S2) and tetra-sulphonated (S4) derivatives of indigo stain dock at the DNA recognition interface in the RepA-WH1 dimer. They compete binding of RepA to its natural target dsDNA repeats, found at the repA operator and at the origin of replication of the plasmid. Calorimetry points to the existence of a major site, with micromolar affinity, for S4-indigo in RepA-WH1 dimers. As revealed by electron microscopy, in the presence of inducer dsDNA, both S2/S4 stains inhibit the assembly of RepA-WH1 into fibres. These results validate the concept that DNA can promote protein assembly into amyloids and reveal that the binding sites of effector molecules can be targeted to inhibit amyloidogenesis. PMID:18285361
Radiation-induced oxidative damage to the DNA-binding domain of the lactose repressor

PubMed Central

Gillard, Nathalie; Goffinont, Stephane; Buré, Corinne; Davidkova, Marie; Maurizot, Jean-Claude; Cadene, Martine; Spotheim-Maurizot, Melanie

2007-01-01

Understanding the cellular effects of radiation-induced oxidation requires the unravelling of key molecular events, particularly damage to proteins with important cellular functions. The Escherichia coli lactose operon is a classical model of gene regulation systems. Its functional mechanism involves the specific binding of a protein, the repressor, to a specific DNA sequence, the operator. We have shown previously that upon irradiation with γ-rays in solution, the repressor loses its ability to bind the operator. Water radiolysis generates hydroxyl radicals (OH· radicals) which attack the protein. Damage of the repressor DNA-binding domain, called the headpiece, is most likely to be responsible of this loss of function. Using CD, fluorescence spectroscopy and a combination of proteolytic cleavage with MS, we have examined the state of the irradiated headpiece. CD measurements revealed a dose-dependent conformational change involving metastable intermediate states. Fluorescence measurements showed a gradual degradation of tyrosine residues. MS was used to count the number of oxidations in different regions of the headpiece and to narrow down the parts of the sequence bearing oxidized residues. By calculating the relative probabilities of reaction of each amino acid with OH· radicals, we can predict the most probable oxidation targets. By comparing the experimental results with the predictions we conclude that Tyr7, Tyr12, Tyr17, Met42 and Tyr47 are the most likely hotspots of oxidation. The loss of repressor function is thus correlated with chemical modifications and conformational changes of the headpiece. PMID:17263689
hPDI: a database of experimental human protein-DNA interactions.

PubMed

Xie, Zhi; Hu, Shaohui; Blackshaw, Seth; Zhu, Heng; Qian, Jiang

2010-01-15

The human protein DNA Interactome (hPDI) database holds experimental protein-DNA interaction data for humans identified by protein microarray assays. The unique characteristics of hPDI are that it contains consensus DNA-binding sequences not only for nearly 500 human transcription factors but also for >500 unconventional DNA-binding proteins, which are completely uncharacterized previously. Users can browse, search and download a subset or the entire data via a web interface. This database is freely accessible for any academic purposes. http://bioinfo.wilmer.jhu.edu/PDI/.
The region of CQQQKPQRRP of PGC-1{alpha} interacts with the DNA-binding complex of FXR/RXR{alpha}

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kanaya, Eiko; Jingami, Hisato

2006-04-14

PGC-1{alpha} co-activates transcription by several nuclear receptors. To study the interaction among PGC-1{alpha}, RXR{alpha}/FXR, and DNA, we performed electrophoresis mobility shift assays. The RXR{alpha}/FXR proteins specifically bound to DNA containing the IR-1 sequence in the absence of ligand. When the fusion protein of GST-PGC-1{alpha} was added to the mixture of RXR{alpha}/FXR/DNA, the ligand-influenced retardation of the mobility was observed. The ligand for RXR{alpha} (9-cis-retinoic acid) was necessary for this retardation, whereas, the ligand for FXR, chenodeoxycholic acid, barely had an effect. The results obtained using truncated PGC-1{alpha} proteins suggested that two regions are necessary for PGC-1{alpha} to interact with themore » DNA-binding complex of RXR{alpha}/FXR. One is the region of the second leucine-rich motif, and the other is that of the amino acid sequence CQQQKPQRRP, present between the second and third leucine-rich motifs. The results obtained with the SPQSS mutation for KPQRR suggested that the basic amino acids are important for the interaction.« less
The evolutionary turnover of recombination hot spots contributes to speciation in mice.

PubMed

Smagulova, Fatima; Brick, Kevin; Pu, Yongmei; Camerini-Otero, R Daniel; Petukhova, Galina V

2016-02-01

Meiotic recombination is required for the segregation of homologous chromosomes and is essential for fertility. In most mammals, the DNA double-strand breaks (DSBs) that initiate meiotic recombination are directed to a subset of genomic loci (hot spots) by sequence-specific binding of the PRDM9 protein. Rapid evolution of the DNA-binding specificity of PRDM9 and gradual erosion of PRDM9-binding sites by gene conversion will alter the recombination landscape over time. To better understand the evolutionary turnover of recombination hot spots and its consequences, we mapped DSB hot spots in four major subspecies of Mus musculus with different Prdm9 alleles and in their F1 hybrids. We found that hot spot erosion governs the preferential usage of some Prdm9 alleles over others in hybrid mice and increases sequence diversity specifically at hot spots that become active in the hybrids. As crossovers are disfavored at such hot spots, we propose that sequence divergence generated by hot spot turnover may create an impediment for recombination in hybrids, potentially leading to reduced fertility and, eventually, speciation. Published by Cold Spring Harbor Laboratory Press.

The evolutionary turnover of recombination hot spots contributes to speciation in mice

PubMed Central

Smagulova, Fatima; Brick, Kevin; Pu, Yongmei; Camerini-Otero, R. Daniel; Petukhova, Galina V.

2016-01-01

Meiotic recombination is required for the segregation of homologous chromosomes and is essential for fertility. In most mammals, the DNA double-strand breaks (DSBs) that initiate meiotic recombination are directed to a subset of genomic loci (hot spots) by sequence-specific binding of the PRDM9 protein. Rapid evolution of the DNA-binding specificity of PRDM9 and gradual erosion of PRDM9-binding sites by gene conversion will alter the recombination landscape over time. To better understand the evolutionary turnover of recombination hot spots and its consequences, we mapped DSB hot spots in four major subspecies of Mus musculus with different Prdm9 alleles and in their F1 hybrids. We found that hot spot erosion governs the preferential usage of some Prdm9 alleles over others in hybrid mice and increases sequence diversity specifically at hot spots that become active in the hybrids. As crossovers are disfavored at such hot spots, we propose that sequence divergence generated by hot spot turnover may create an impediment for recombination in hybrids, potentially leading to reduced fertility and, eventually, speciation. PMID:26833728
Sequence-dependent DNA flexibility mediates DNase I cleavage.

PubMed

Heddi, Brahim; Abi-Ghanem, Josephine; Lavigne, Marc; Hartmann, Brigitte

2010-01-08

Understanding the preference of nonspecific proteins for certain DNA structural features requires an accurate description of the properties of free DNA, especially regarding their possible predisposition to adopt a conformation that favors the formation of a complex. Exploiting previous exhaustive NMR studies performed on free DNA oligomers, we investigated the molecular basis of DNase I sensitivity under conditions where DNase I binding limits the probability of cleavage. We showed that cleavage intensity was correlated with adjacent 3' phosphate linkage flexibility, monitored by (31)P chemical shifts. Examining NMR-refined DNA structures highlighted that sequence-dependent flexible phosphates were associated with large minor groove variations that may promote the affinity of DNase I, according to relevant DNA-protein complexes. In sum, this work demonstrates that specificity in DNA-DNase I interaction is mediated by DNA flexibility, which influences the induced-fit transitions required to form productive complexes.
Real-Time Analysis of Specific Protein-DNA Interactions with Surface Plasmon Resonance

PubMed Central

Ritzefeld, Markus; Sewald, Norbert

2012-01-01

Several proteins, like transcription factors, bind to certain DNA sequences, thereby regulating biochemical pathways that determine the fate of the corresponding cell. Due to these key positions, it is indispensable to analyze protein-DNA interactions and to identify their mode of action. Surface plasmon resonance is a label-free method that facilitates the elucidation of real-time kinetics of biomolecular interactions. In this article, we focus on this biosensor-based method and provide a detailed guide how SPR can be utilized to study binding of proteins to oligonucleotides. After a description of the physical phenomenon and the instrumental realization including fiber-optic-based SPR and SPR imaging, we will continue with a survey of immobilization methods. Subsequently, we will focus on the optimization of the experiment, expose pitfalls, and introduce how data should be analyzed and published. Finally, we summarize several interesting publications of the last decades dealing with protein-DNA and RNA interaction analysis by SPR. PMID:22500214
Motif discovery with data mining in 3D protein structure databases: discovery, validation and prediction of the U-shape zinc binding ("Huf-Zinc") motif.

PubMed

Maurer-Stroh, Sebastian; Gao, He; Han, Hao; Baeten, Lies; Schymkowitz, Joost; Rousseau, Frederic; Zhang, Louxin; Eisenhaber, Frank

2013-02-01

Data mining in protein databases, derivatives from more fundamental protein 3D structure and sequence databases, has considerable unearthed potential for the discovery of sequence motif--structural motif--function relationships as the finding of the U-shape (Huf-Zinc) motif, originally a small student's project, exemplifies. The metal ion zinc is critically involved in universal biological processes, ranging from protein-DNA complexes and transcription regulation to enzymatic catalysis and metabolic pathways. Proteins have evolved a series of motifs to specifically recognize and bind zinc ions. Many of these, so called zinc fingers, are structurally independent globular domains with discontinuous binding motifs made up of residues mostly far apart in sequence. Through a systematic approach starting from the BRIX structure fragment database, we discovered that there exists another predictable subset of zinc-binding motifs that not only have a conserved continuous sequence pattern but also share a characteristic local conformation, despite being included in totally different overall folds. While this does not allow general prediction of all Zn binding motifs, a HMM-based web server, Huf-Zinc, is available for prediction of these novel, as well as conventional, zinc finger motifs in protein sequences. The Huf-Zinc webserver can be freely accessed through this URL (http://mendel.bii.a-star.edu.sg/METHODS/hufzinc/).
Binding-induced DNA walker for signal amplification in highly selective electrochemical detection of protein.

PubMed

Ji, Yuhang; Zhang, Lei; Zhu, Longyi; Lei, Jianping; Wu, Jie; Ju, Huangxian

2017-10-15

A binding-induced DNA walker-assisted signal amplification was developed for highly selective electrochemical detection of protein. Firstly, the track of DNA walker was constructed by self-assembly of the high density ferrocene (Fc)-labeled anchor DNA and aptamer 1 on the gold electrode surface. Sequentially, a long swing-arm chain containing aptamer 2 and walking strand DNA was introduced onto gold electrode through aptamers-target specific recognition, and thus initiated walker strand sequences to hybridize with anchor DNA. Then, the DNA walker was activated by the stepwise cleavage of the hybridized anchor DNA by nicking endonuclease to release multiple Fc molecules for signal amplification. Taking thrombin as the model target, the Fc-generated electrochemical signal decreased linearly with logarithm value of thrombin concentration ranging from 10pM to 100nM with a detection limit of 2.5pM under the optimal conditions. By integrating the specific recognition of aptamers to target with the enzymatic cleavage of nicking endonuclease, the aptasensor showed the high selectivity. The binding-induced DNA walker provides a promising strategy for signal amplification in electrochemical biosensor, and has the extensive applications in sensitive and selective detection of the various targets. Copyright © 2017 Elsevier B.V. All rights reserved.
Molecular sled sequences are common in mammalian proteins.

PubMed

Xiong, Kan; Blainey, Paul C

2016-03-18

Recent work revealed a new class of molecular machines called molecular sleds, which are small basic molecules that bind and slide along DNA with the ability to carry cargo along DNA. Here, we performed biochemical and single-molecule flow stretching assays to investigate the basis of sliding activity in molecular sleds. In particular, we identified the functional core of pVIc, the first molecular sled characterized; peptide functional groups that control sliding activity; and propose a model for the sliding activity of molecular sleds. We also observed widespread DNA binding and sliding activity among basic polypeptide sequences that implicate mammalian nuclear localization sequences and many cell penetrating peptides as molecular sleds. These basic protein motifs exhibit weak but physiologically relevant sequence-nonspecific DNA affinity. Our findings indicate that many mammalian proteins contain molecular sled sequences and suggest the possibility that substantial undiscovered sliding activity exists among nuclear mammalian proteins. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genomic structure of the human D-site binding protein (DBP) gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shutler, G.; Glassco, T.; Kang, Xiaolin

1996-06-15

The human gene for the D-Site Binding Protein (DBP) has been sequenced and characterized. This gene is a member of the b/ZIP family of transcription factors and is one of three genes forming the PAR sub-family. DBP has been implicated in the diurnal regulation of a variety of liver-specific genes. Examination of the genomic structure of DBP reveals that the gene is divided into four exons and is contained within a relatively compact region of approximately 6 kb. These exons appear to correspond to functional divisions the DBP protein. Exon 1 contains a long 5{prime} UTR, and conservation between themore » rat and the human genes of the presence of small open reading frames within this region suggests that is may play a role in translational control. Exon 2 contains a limited region of similarity to the other PAR domain genes, which may be part of a potential activation domain. Exon 3 contains the PAR domain and differs by only 1 of 71 amino acids between rat and human. Exon 4, containing both the basic and the leucine zipper domains, is likewise highly conserved. The overall degree of homology between the rat and the human cDNA sequences is 82% for the nucleic acid sequence and 92% for the protein sequence. comparison of the rat and human proximal promoters reveals extensive sequence conservation, with two previously characterized DNA binding sites being conserved at the functional and sequence levels. 31 refs., 4 figs.« less
Transposable Elements and DNA Methylation Create in Embryonic Stem Cells Human-Specific Regulatory Sequences Associated with Distal Enhancers and Noncoding RNAs

PubMed Central

Glinsky, Gennadi V.

2015-01-01

Despite significant progress in the structural and functional characterization of the human genome, understanding of the mechanisms underlying the genetic basis of human phenotypic uniqueness remains limited. Here, I report that transposable element-derived sequences, most notably LTR7/HERV-H, LTR5_Hs, and L1HS, harbor 99.8% of the candidate human-specific regulatory loci (HSRL) with putative transcription factor-binding sites in the genome of human embryonic stem cells (hESC). A total of 4,094 candidate HSRL display selective and site-specific binding of critical regulators (NANOG [Nanog homeobox], POU5F1 [POU class 5 homeobox 1], CCCTC-binding factor [CTCF], Lamin B1), and are preferentially located within the matrix of transcriptionally active DNA segments that are hypermethylated in hESC. hESC-specific NANOG-binding sites are enriched near the protein-coding genes regulating brain size, pluripotency long noncoding RNAs, hESC enhancers, and 5-hydroxymethylcytosine-harboring regions immediately adjacent to binding sites. Sequences of only 4.3% of hESC-specific NANOG-binding sites are present in Neanderthals’ genome, suggesting that a majority of these regulatory elements emerged in Modern Humans. Comparisons of estimated creation rates of novel TF-binding sites revealed that there was 49.7-fold acceleration of creation rates of NANOG-binding sites in genomes of Chimpanzees compared with the mouse genomes and further 5.7-fold acceleration in genomes of Modern Humans compared with the Chimpanzees genomes. Preliminary estimates suggest that emergence of one novel NANOG-binding site detectable in hESC required 466 years of evolution. Pathway analysis of coding genes that have hESC-specific NANOG-binding sites within gene bodies or near gene boundaries revealed their association with physiological development and functions of nervous and cardiovascular systems, embryonic development, behavior, as well as development of a diverse spectrum of pathological conditions such as cancer, diseases of cardiovascular and reproductive systems, metabolic diseases, multiple neurological and psychological disorders. A proximity placement model is proposed explaining how a 33–47% excess of NANOG, CTCF, and POU5F1 proteins immobilized on a DNA scaffold may play a functional role at distal regulatory elements. PMID:25956794
Preliminary crystallographic analysis of mouse Elf3 C-terminal DNA-binding domain in complex with type II TGF-[beta] receptor promoter DNA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Agarkar, Vinod B.; Babayeva, Nigar D.; Rizzino, Angie

2010-10-08

Ets proteins are transcription factors that activate or repress the expression of genes that are involved in various biological processes, including cellular proliferation, differentiation, development, transformation and apoptosis. Like other Ets-family members, Elf3 functions as a sequence-specific DNA-binding transcriptional factor. A mouse Elf3 C-terminal fragment (amino-acid residues 269-371) containing the DNA-binding domain has been crystallized in complex with mouse type II TGF-{beta} receptor promoter (TR-II) DNA. The crystals belonged to space group P2{sub 1}2{sub 1}2{sub 1}, with unit-cell parameters a = 42.66, b = 52, c = 99.78 {angstrom}, and diffracted to a resolution of 2.2 {angstrom}.
Yeast one-hybrid system used to identify the binding proteins for rat glutathione S-transferase P enhancer I.

PubMed

Liao, Ming-Xiang; Liu, Dong-Yuan; Zuo, Jin; Fang, Fu-De

2002-03-01

To detect the trans-factors specifically binding to the strong enhancer element (GPEI) in the upstream of rat glutathione S-transferase P (GST-P) gene. Yeast one-hybrid system was used to screen rat lung MATCHMAKER cDNA library to identify potential trans-factors that can interact with core sequence of GPEI(cGPEI). Electrophoresis mobility shift assay (EMSA) was used to analyze the binding of transfactors to cGPEI. cDNA fragments coding for the C-terminal part of the transcription factor c-Jun and rat adenine nucleotide translocator (ANT) were isolated. The binding of c-Jun and ANT to GPEI core sequence were confirmed. Rat c-jun transcriptional factor and ANT may interact with cGPEI. They could play an important role in the induced expression of GST-P gene.
Replication Protein A-1 Has a Preference for the Telomeric G-rich Sequence in Trypanosoma cruzi.

PubMed

Pavani, Raphael Souza; Vitarelli, Marcela O; Fernandes, Carlos A H; Mattioli, Fabio F; Morone, Mariana; Menezes, Milene C; Fontes, Marcos R M; Cano, Maria Isabel N; Elias, Maria Carolina

2018-05-01

Replication protein A (RPA), the major eukaryotic single-stranded binding protein, is a heterotrimeric complex formed by RPA-1, RPA-2, and RPA-3. RPA is a fundamental player in replication, repair, recombination, and checkpoint signaling. In addition, increasing evidences have been adding functions to RPA in telomere maintenance, such as interaction with telomerase to facilitate its activity and also involvement in telomere capping in some conditions. Trypanosoma cruzi, the etiological agent of Chagas disease is a protozoa parasite that appears early in the evolution of eukaryotes. Recently, we have showed that T. cruziRPA presents canonical functions being involved with DNA replication and DNA damage response. Here, we found by FISH/IF assays that T. cruziRPA localizes at telomeres even outside replication (S) phase. In vitro analysis showed that one telomeric repeat is sufficient to bind RPA-1. Telomeric DNA induces different secondary structural modifications on RPA-1 in comparison with other types of DNA. In addition, RPA-1 presents a higher affinity for telomeric sequence compared to randomic sequence, suggesting that RPA may play specific roles in T. cruzi telomeric region. © 2017 The Author(s) Journal of Eukaryotic Microbiology © 2017 International Society of Protistologists.
Role of sequence encoded κB DNA geometry in gene regulation by Dorsal

PubMed Central

Mrinal, Nirotpal; Tomar, Archana; Nagaraju, Javaregowda

2011-01-01

Many proteins of the Rel family can act as both transcriptional activators and repressors. However, mechanism that discerns the ‘activator/repressor’ functions of Rel-proteins such as Dorsal (Drosophila homologue of mammalian NFκB) is not understood. Using genomic, biophysical and biochemical approaches, we demonstrate that the underlying principle of this functional specificity lies in the ‘sequence-encoded structure’ of the κB-DNA. We show that Dorsal-binding motifs exist in distinct activator and repressor conformations. Molecular dynamics of DNA-Dorsal complexes revealed that repressor κB-motifs typically have A-tract and flexible conformation that facilitates interaction with co-repressors. Deformable structure of repressor motifs, is due to changes in the hydrogen bonding in A:T pair in the ‘A-tract’ core. The sixth nucleotide in the nonameric κB-motif, ‘A’ (A6) in the repressor motifs and ‘T’ (T6) in the activator motifs, is critical to confer this functional specificity as A6 → T6 mutation transformed flexible repressor conformation into a rigid activator conformation. These results highlight that ‘sequence encoded κB DNA-geometry’ regulates gene expression by exerting allosteric effect on binding of Rel proteins which in turn regulates interaction with co-regulators. Further, we identified and characterized putative repressor motifs in Dl-target genes, which can potentially aid in functional annotation of Dorsal gene regulatory network. PMID:21890896
Sequence-specific binding of counterions to B-DNA

PubMed Central

Denisov, Vladimir P.; Halle, Bertil

2000-01-01

Recent studies by x-ray crystallography, NMR, and molecular simulations have suggested that monovalent counterions can penetrate deeply into the minor groove of B form DNA. Such groove-bound ions potentially could play an important role in AT-tract bending and groove narrowing, thereby modulating DNA function in vivo. To address this issue, we report here 23Na magnetic relaxation dispersion measurements on oligonucleotides, including difference experiments with the groove-binding drug netropsin. The exquisite sensitivity of this method to ions in long-lived and intimate association with DNA allows us to detect sequence-specific sodium ion binding in the minor groove AT tract of three B-DNA dodecamers. The sodium ion occupancy is only a few percent, however, and therefore is not likely to contribute importantly to the ensemble of B-DNA structures. We also report results of ion competition experiments, indicating that potassium, rubidium, and cesium ions bind to the minor groove with similarly weak affinity as sodium ions, whereas ammonium ion binding is somewhat stronger. The present findings are discussed in the light of previous NMR and diffraction studies of sequence-specific counterion binding to DNA. PMID:10639130
Kinetics of interaction of Cotton Leaf Curl Kokhran Virus-Dabawali (CLCuKV-Dab) coat protein and its mutants with ssDNA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Priyadarshini, C.G. Poornima; Savithri, H.S., E-mail: bchss@biochem.iisc.ernet.i

Gemini viral assembly and transport of viral DNA into nucleus for replication, essentially involve DNA-coat protein interactions. The kinetics of interaction of Cotton Leaf Curl Kokhran Virus-Dabawali recombinant coat protein (rCP) with DNA was studied by electrophoretic mobility shift assay (EMSA) and surface plasmon resonance (SPR). The rCP interacted with ssDNA with a K{sub A}, of 2.6 +- 0.29 x 10{sup 8} M{sup -1} in a sequence non-specific manner. The CP has a conserved C2H2 type zinc finger motif composed of residues C68, C72, H81 and H85. Mutation of these residues to alanine resulted in reduced binding to DNA probes.more » The H85A mutant rCP showed the least binding with approximately 756 fold loss in the association rate and a three order magnitude decrease in the binding affinity as compared to rCP. The CP-DNA interactions via the zinc finger motif could play a crucial role in virus assembly and in nuclear transport.« less
MNDA binds NPM/B23 and the NPM-MLF1 chimera generated by the t(3;5) associated with myelodysplastic syndrome and acute myeloid leukemia.

PubMed

Xie, J; Briggs, J A; Morris, S W; Olson, M O; Kinney, M C; Briggs, R C

1997-10-01

The myeloid cell nuclear differentiation antigen (MNDA) is a nuclear protein expressed specifically in developing cells of the human myelomonocytic lineage, including the end-stage monocytes/macrophages and granulocytes. Nuclear localization, lineage- and stage-specific expression, association with chromatin, and regulation by interferon alpha indicate that this protein is involved in regulating gene expression uniquely associated with the differentiation process and/or function of the monocyte/macrophage. MNDA does not bind specific DNA sequences, but rather a set of nuclear proteins that includes nucleolin (C23). Both in vitro binding assays and co-immunoprecipitation were used to demonstrate that MNDA also binds protein B23 (nucleophosmin/NPM). Three reciprocal chromosome translocations found in certain cases of leukemia/lymphoma involve fusions with the NPM/B23 gene, t(5;17) NPM-RARalpha, t(2;5) NPM-ALK, and the t(3;5) NPM-MLF1. In the current study, MNDA was not able to bind the NPM-ALK chimera originating from the t(2;5) and containing residues 1-117 of NPM. However, MNDA did bind the NPM-MLF1 product of the t(3;5) that contains the N-terminal 175 residues of NPM. The additional 58 amino acids (amino acids 117-175) of the NPM sequence that are contained in the product of the NPM-MLF1 fusion gene relative to the product of the NPM-ALK fusion appear responsible for MNDA binding. This additional NPM sequence contains a nuclear localization signal and clusters of acidic residues believed to bind nuclear localization signals of other proteins. Whereas NPM and nucleolin are primarily localized within the nucleolus, MNDA is distributed throughout the nucleus including the nucleolus, suggesting that additional interactions define overall MNDA localization.
Genome-wide inference of transcription factor-DNA binding specificity in cell regeneration using a combination strategy.

PubMed

Wang, Xiaofeng; Zhang, Aiqun; Ren, Weizheng; Chen, Caiyu; Dong, Jiahong

2012-11-01

The cell growth, development, and regeneration of tissue and organ are associated with a large number of gene regulation events, which are mediated in part by transcription factors (TFs) binding to cis-regulatory elements involved in the genome. Predicting the binding affinity and inferring the binding specificity of TF-DNA interactions at the genomic level would be fundamentally helpful for our understanding of the molecular mechanism and biological implication underlying sequence-specific TF-DNA recognition. In this study, we report the development of a combination method to characterize the interaction behavior of a 11-mer oligonucleotide segment and its mutations with the Gcn4p protein, a homodimeric, basic leucine zipper TF, and to predict the binding affinity and specificity of potential Gcn4p binders in the genome-wide scale. In this procedure, a position-mutated energy matrix is created based on molecular modeling analysis of native and mutated Gcn4p-DNA complex structures to describe the position-independent interaction energy profile of Gcn4p with different nucleotide types at each position of the oligonucleotide, and the energy terms extracted from the matrix and their interactives are then correlated with experimentally measured affinities of 19268 distinct oligonucleotides using statistical modeling methodology. Subsequently, the best one of built regression models is successfully applied to screen those of potential high-affinity Gcn4p binders from the complete genome. The findings arising from this study are briefly listed below: (i) The 11 positions of oligonucleotides are highly interactive and non-additive in contribution to Gcn4p-DNA binding affinity; (ii) Indirect conformational effects upon nucleotide mutations as well as associated subtle changes in interfacial atomic contacts, but not the direct nonbonded interactions, are primarily responsible for the sequence-specific recognition; (iii) The intrinsic synergistic effects among the sequence positions of oligonucleotides determine Gcn4p-DNA binding affinity and specificity; (iv) Linear regression models in conjunction with variable selection seem to perform fairly well in capturing the internal dependences hidden in the Gcn4p-DNA system, albeit ignoring nonlinear factors may lead the models to systematically underestimate and overestimate high- and low-affinity samples, respectively. © 2012 John Wiley & Sons A/S.
Effective DNA Inhibitors of Cathepsin G by In Vitro Selection

PubMed Central

Gatto, Barbara; Vianini, Elena; Lucatello, Lorena; Sissi, Claudia; Moltrasio, Danilo; Pescador, Rodolfo; Porta, Roberto; Palumbo, Manlio

2008-01-01

Cathepsin G (CatG) is a chymotrypsin-like protease released upon degranulation of neutrophils. In several inflammatory and ischaemic diseases the impaired balance between CatG and its physiological inhibitors leads to tissue destruction and platelet aggregation. Inhibitors of CatG are suitable for the treatment of inflammatory diseases and procoagulant conditions. DNA released upon the death of neutrophils at injury sites binds CatG. Moreover, short DNA fragments are more inhibitory than genomic DNA. Defibrotide, a single stranded polydeoxyribonucleotide with antithrombotic effect is also a potent CatG inhibitor. Given the above experimental evidences we employed a selection protocol to assess whether DNA inhibition of CatG may be ascribed to specific sequences present in defibrotide DNA. A Selex protocol was applied to identify the single-stranded DNA sequences exhibiting the highest affinity for CatG, the diversity of a combinatorial pool of oligodeoxyribonucleotides being a good representation of the complexity found in defibrotide. Biophysical and biochemical studies confirmed that the selected sequences bind tightly to the target enzyme and also efficiently inhibit its catalytic activity. Sequence analysis carried out to unveil a motif responsible for CatG recognition showed a recurrence of alternating TG repeats in the selected CatG binders, adopting an extended conformation that grants maximal interaction with the highly charged protein surface. This unprecedented finding is validated by our results showing high affinity and inhibition of CatG by specific DNA sequences of variable length designed to maximally reduce pairing/folding interactions. PMID:19325843
Adrenocortical nuclear progesterone-binding protein: Identification by photoaffinity labeling and evidence for deoxyribonucleic acid binding and stimulation by adrenocorticotropin

DOE Office of Scientific and Technical Information (OSTI.GOV)

Demura, T.; Driscoll, W.J.; Lee, Y.C.

1991-01-01

Nuclei of the guinea pig adrenal cortex contain a protein that specifically binds progesterone and that, biochemically, is clearly distinct from the classical progesterone receptor. The adrenocortical nuclear progesterone-binding protein has now been purified more than 2000-fold by steroid-affinity chromatography with a 75% yield. The purified protein preparation demonstrated three major bands on sodium dodecyl sulfate-polyacrylamide gel of 79K, 74K, and 50K. To determine which of the three might represent the progesterone-binding protein, steroid photoaffinity labeling was performed which resulted in the specific and exclusive labeling of a 50K band. Thus, the adrenocortical nuclear progesterone-binding protein appears to be distinctmore » from the classical progesterone receptor not only biochemically, but also on the basis of molecular size. To test whether the adrenocortical nuclear progesterone-binding protein can be hormonally stimulated, guinea pigs were treated with ACTH. The chronic administration of ACTH caused a 4- to 6-fold increase in the specific progesterone binding capacity without a change in the binding affinity. There appeared to be no significant difference in nuclear progesterone binding between the zona fasciculata and zona reticularis. This finding suggests a mediating role for the progesterone-binding protein in ACTH action. In addition, the nuclear progesterone-binding protein bound to nonspecific DNA sequences, further suggesting a possible transcriptional regulatory role.« less
Crystal structure of the Msx-1 homeodomain/DNA complex.

PubMed

Hovde, S; Abate-Shen, C; Geiger, J H

2001-10-09

The Msx-1 homeodomain protein plays a crucial role in craniofacial, limb, and nervous system development. Homeodomain DNA-binding domains are comprised of 60 amino acids that show a high degree of evolutionary conservation. We have determined the structure of the Msx-1 homeodomain complexed to DNA at 2.2 A resolution. The structure has an unusually well-ordered N-terminal arm with a unique trajectory across the minor groove of the DNA. DNA specificity conferred by bases flanking the core TAAT sequence is explained by well ordered water-mediated interactions at Q50. Most interactions seen at the TAAT sequence are typical of the interactions seen in other homeodomain structures. Comparison of the Msx-1-HD structure to all other high resolution HD-DNA complex structures indicate a remarkably well-conserved sphere of hydration between the DNA and protein in these complexes.
Determining the Specificity of Cascade Binding, Interference, and Primed Adaptation In Vivo in the Escherichia coli Type I-E CRISPR-Cas System

PubMed Central

Cooper, Lauren A.; Stringer, Anne M.

2018-01-01

ABSTRACT In clustered regularly interspaced short palindromic repeat (CRISPR)-Cas (CRISPR-associated) immunity systems, short CRISPR RNAs (crRNAs) are bound by Cas proteins, and these complexes target invading nucleic acid molecules for degradation in a process known as interference. In type I CRISPR-Cas systems, the Cas protein complex that binds DNA is known as Cascade. Association of Cascade with target DNA can also lead to acquisition of new immunity elements in a process known as primed adaptation. Here, we assess the specificity determinants for Cascade-DNA interaction, interference, and primed adaptation in vivo, for the type I-E system of Escherichia coli. Remarkably, as few as 5 bp of crRNA-DNA are sufficient for association of Cascade with a DNA target. Consequently, a single crRNA promotes Cascade association with numerous off-target sites, and the endogenous E. coli crRNAs direct Cascade binding to >100 chromosomal sites. In contrast to the low specificity of Cascade-DNA interactions, >18 bp are required for both interference and primed adaptation. Hence, Cascade binding to suboptimal, off-target sites is inert. Our data support a model in which the initial Cascade association with DNA targets requires only limited sequence complementarity at the crRNA 5′ end whereas recruitment and/or activation of the Cas3 nuclease, a prerequisite for interference and primed adaptation, requires extensive base pairing. PMID:29666291

Specific labeling of zinc finger proteins using noncanonical amino acids and copper-free click chemistry.

PubMed

Kim, Younghoon; Kim, Sung Hoon; Ferracane, Dean; Katzenellenbogen, John A; Schroeder, Charles M

2012-09-19

Zinc finger proteins (ZFPs) play a key role in transcriptional regulation and serve as invaluable tools for gene modification and genetic engineering. Development of efficient strategies for labeling metalloproteins such as ZFPs is essential for understanding and controlling biological processes. In this work, we engineered ZFPs containing cysteine-histidine (Cys2-His2) motifs by metabolic incorporation of the unnatural amino acid azidohomoalanine (AHA), followed by specific protein labeling via click chemistry. We show that cyclooctyne promoted [3 + 2] dipolar cycloaddition with azides, known as copper-free click chemistry, provides rapid and specific labeling of ZFPs at high yields as determined by mass spectrometry analysis. We observe that the DNA-binding activity of ZFPs labeled by conventional copper-mediated click chemistry was completely abolished, whereas ZFPs labeled by copper-free click chemistry retain their sequence-specific DNA-binding activity under native conditions, as determined by electrophoretic mobility shift assays, protein microarrays, and kinetic binding assays based on Förster resonance energy transfer (FRET). Our work provides a general framework to label metalloproteins such as ZFPs by metabolic incorporation of unnatural amino acids followed by copper-free click chemistry.
Specific Labeling of Zinc Finger Proteins using Non-canonical Amino Acids and Copper-free Click Chemistry

PubMed Central

Kim, Younghoon; Kim, Sung Hoon; Ferracane, Dean; Katzenellenbogen, John A.

2012-01-01

Zinc finger proteins (ZFPs) play a key role in transcriptional regulation and serve as invaluable tools for gene modification and genetic engineering. Development of efficient strategies for labeling metalloproteins such as ZFPs is essential for understanding and controlling biological processes. In this work, we engineered ZFPs containing cysteine-histidine (Cys2-His2) motifs by metabolic incorporation of the unnatural amino acid azidohomoalanine (AHA), followed by specific protein labeling via click chemistry. We show that cyclooctyne promoted [3 + 2] dipolar cycloaddition with azides, known as copper-free click chemistry, provides rapid and specific labeling of ZFPs at high yields as determined by mass spectrometry analysis. We observe that the DNA-binding activity of ZFPs labeled by conventional copper-mediated click chemistry was completely abolished, whereas ZFPs labeled by copper-free click chemistry retain their sequence-specific DNA-binding activity under native conditions, as determined by electrophoretic mobility shift assays, protein microarrays and kinetic binding assays based on Förster resonance energy transfer (FRET). Our work provides a general framework to label metalloproteins such as ZFPs by metabolic incorporation of unnatural amino acids followed by copper-free click chemistry. PMID:22871171
The structures of non-CG-repeat Z-DNAs co-crystallized with the Z-DNA-binding domain, hZ alpha(ADAR1).

PubMed

Ha, Sung Chul; Choi, Jongkeun; Hwang, Hye-Yeon; Rich, Alexander; Kim, Yang-Gyun; Kim, Kyeong Kyu

2009-02-01

The Z-DNA conformation preferentially occurs at alternating purine-pyrimidine repeats, and is specifically recognized by Z alpha domains identified in several Z-DNA-binding proteins. The binding of Z alpha to foreign or chromosomal DNA in various sequence contexts is known to influence various biological functions, including the DNA-mediated innate immune response and transcriptional modulation of gene expression. For these reasons, understanding its binding mode and the conformational diversity of Z alpha bound Z-DNAs is of considerable importance. However, structural studies of Z alpha bound Z-DNA have been mostly limited to standard CG-repeat DNAs. Here, we have solved the crystal structures of three representative non-CG repeat DNAs, d(CACGTG)(2), d(CGTACG)(2) and d(CGGCCG)(2) complexed to hZ alpha(ADAR1) and compared those structures with that of hZ alpha(ADAR1)/d(CGCGCG)(2) and the Z alpha-free Z-DNAs. hZ alpha(ADAR1) bound to each of the three Z-DNAs showed a well conserved binding mode with very limited structural deviation irrespective of the DNA sequence, although varying numbers of residues were in contact with Z-DNA. Z-DNAs display less structural alterations in the Z alpha-bound state than in their free form, thereby suggesting that conformational diversities of Z-DNAs are restrained by the binding pocket of Z alpha. These data suggest that Z-DNAs are recognized by Z alpha through common conformational features regardless of the sequence and structural alterations.
HMG I(Y) interferes with the DNA binding of NF-AT factors and the induction of the interleukin 4 promoter in T cells

PubMed Central

Klein-Hessling, Stefan; Schneider, Günter; Heinfling, Annette; Chuvpilo, Sergei; Serfling, Edgar

1996-01-01

HMG I(Y) proteins bind to double-stranded A+T oligonucleotides longer than three base pairs. Such motifs form part of numerous NF-AT-binding sites of lymphokine promoters, including the interleukin 4 (IL-4) promoter. NF-AT factors share short homologous peptide sequences in their DNA-binding domain with NF-κB factors and bind to certain NF-κB sites. It has been shown that HMG I(Y) proteins enhance NF-κB binding to the interferon β promoter and virus-mediated interferon β promoter induction. We show that HMG I(Y) proteins exert an opposite effect on the DNA binding of NF-AT factors and the induction of the IL-4 promoter in T lymphocytes. Introduction of mutations into a high-affinity HMG I(Y)-binding site of the IL-4 promoter, which decreased HMG I(Y)-binding to a NF-AT-binding sequence, the Pu-bB (or P) site, distinctly increased the induction of the IL-4 promoter in Jurkat T leukemia cells. High concentrations of HMG I(Y) proteins are able to displace NF-ATp from its binding to the Pu-bB site. High HMG I(Y) concentrations are typical for Jurkat cells and peripheral blood T lymphocytes, whereas El4 T lymphoma cells and certain T helper type 2 cell clones contain relatively low HMG I(Y) concentrations. Our results indicate that HMG I(Y) proteins do not cooperate, but instead compete with NF-AT factors for the binding to DNA even though NF-AT factors share some DNA-binding properties with NF-kB factors. This competition between HMG I(Y) and NF-AT proteins for DNA binding might be due to common contacts with minor groove nucleotides of DNA and may be one mechanism contributing to the selective IL-4 expression in certain T lymphocyte populations, such as T helper type 2 cells. PMID:8986808
The human haptoglobin gene promoter: interleukin-6-responsive elements interact with a DNA-binding protein induced by interleukin-6.

PubMed Central

Oliviero, S; Cortese, R

1989-01-01

Transcription of the human haptoglobin (Hp) gene is induced by interleukin-6 (IL-6) in the human hepatoma cell line Hep3B. Cis-acting elements responsible for this response are localized within the first 186 bp of the 5'-flanking region. Site-specific mutants of the Hp promoter fused to the chloramphenicol acetyl transferase (CAT) gene were analysed by transient transfection into uninduced and IL-6-treated Hep3B cells. We identified three regions, A, B and C, defined by mutation, which are important for the IL-6 response. Band shift experiments using nuclear extracts from untreated or IL-6-treated cells revealed the presence of IL-6-inducible DNA binding activities when DNA fragments containing the A or the C sequences were used. Competition experiments showed that both sequences bind to the same nuclear factors. Polymers of oligonucleotides containing either the A or the C regions confer IL-6 responsiveness to a truncated SV40 promoter. The B region forms several complexes with specific DNA-binding proteins different from those which bind to the A and C region. The B region complexes are identical in nuclear extracts from IL-6-treated and untreated cells. While important for IL-6 induction in the context of the haptoglobin promoter, the B site does not confer IL-6 inducibility to the SV40 promoter. Our results indicate that the IL-6 response of the haptoglobin promoter is dependent on the presence of multiple, partly redundant, cis-acting elements. Images PMID:2787245
Widespread evidence of cooperative DNA binding by transcription factors in Drosophila development

PubMed Central

Kazemian, Majid; Pham, Hannah; Wolfe, Scot A.; Brodsky, Michael H.; Sinha, Saurabh

2013-01-01

Regulation of eukaryotic gene transcription is often combinatorial in nature, with multiple transcription factors (TFs) regulating common target genes, often through direct or indirect mutual interactions. Many individual examples of cooperative binding by directly interacting TFs have been identified, but it remains unclear how pervasive this mechanism is during animal development. Cooperative TF binding should be manifest in genomic sequences as biased arrangements of TF-binding sites. Here, we explore the extent and diversity of such arrangements related to gene regulation during Drosophila embryogenesis. We used the DNA-binding specificities of 322 TFs along with chromatin accessibility information to identify enriched spacing and orientation patterns of TF-binding site pairs. We developed a new statistical approach for this task, specifically designed to accurately assess inter-site spacing biases while accounting for the phenomenon of homotypic site clustering commonly observed in developmental regulatory regions. We observed a large number of short-range distance preferences between TF-binding site pairs, including examples where the preference depends on the relative orientation of the binding sites. To test whether these binding site patterns reflect physical interactions between the corresponding TFs, we analyzed 27 TF pairs whose binding sites exhibited short distance preferences. In vitro protein–protein binding experiments revealed that >65% of these TF pairs can directly interact with each other. For five pairs, we further demonstrate that they bind cooperatively to DNA if both sites are present with the preferred spacing. This study demonstrates how DNA-binding motifs can be used to produce a comprehensive map of sequence signatures for different mechanisms of combinatorial TF action. PMID:23847101
The adenovirus L4-22K protein regulates transcription and RNA splicing via a sequence-specific single-stranded RNA binding.

PubMed

Lan, Susan; Kamel, Wael; Punga, Tanel; Akusjärvi, Göran

2017-02-28

The adenovirus L4-22K protein both activates and suppresses transcription from the adenovirus major late promoter (MLP) by binding to DNA elements located downstream of the MLP transcriptional start site: the so-called DE element (positive) and the R1 region (negative). Here we show that L4-22K preferentially binds to the RNA form of the R1 region, both to the double-stranded RNA and the single-stranded RNA of the same polarity as the nascent MLP transcript. Further, L4-22K binds to a 5΄-CAAA-3΄ motif in the single-stranded RNA, which is identical to the sequence motif characterized for L4-22K DNA binding. L4-22K binding to single-stranded RNA results in an enhancement of U1 snRNA recruitment to the major late first leader 5΄ splice site. This increase in U1 snRNA binding results in a suppression of MLP transcription and a concurrent stimulation of major late first intron splicing. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Structural and functional analysis of an enhancer GPEI having a phorbol 12-O-tetradecanoate 13-acetate responsive element-like sequence found in the rat glutathione transferase P gene.

PubMed

Okuda, A; Imagawa, M; Maeda, Y; Sakai, M; Muramatsu, M

1989-10-05

We have recently identified a typical enhancer, termed GPEI, located about 2.5 kilobases upstream from the transcription initiation site of the rat glutathione transferase P gene. Analyses of 5' and 3' deletion mutants revealed that the cis-acting sequence of GPEI contained the phorbol 12-O-tetradecanoate 13-acetate responsive element (TRE)-like sequence in it. For the maximal activity, however, GPEI required an adjacent upstream sequence of about 19 base pairs in addition to the TRE-like sequence. With the DNA binding gel-shift assay, we could detect protein(s) that specifically binds to the TRE-like sequence of GPEI fragment, which was possibly c-jun.c-fos complex or a similar protein complex. The sequence immediately upstream of the TRE-like sequence did not have any activity by itself, but augmented the latter activity by about 5-fold.
The cellular transcription factor CREB corresponds to activating transcription factor 47 (ATF-47) and forms complexes with a group of polypeptides related to ATF-43.

PubMed

Hurst, H C; Masson, N; Jones, N C; Lee, K A

1990-12-01

Promoter elements containing the sequence motif CGTCA are important for a variety of inducible responses at the transcriptional level. Multiple cellular factors specifically bind to these elements and are encoded by a multigene family. Among these factors, polypeptides termed activating transcription factor 43 (ATF-43) and ATF-47 have been purified from HeLa cells and a factor referred to as cyclic AMP response element-binding protein (CREB) has been isolated from PC12 cells and rat brain. We demonstrated that CREB and ATF-47 are identical and that CREB and ATF-43 form protein-protein complexes. We also found that the cis requirements for stable DNA binding by ATF-43 and CREB are different. Using antibodies to ATF-43 we have identified a group of polypeptides (ATF-43) in the size range from 40 to 43 kDa. ATF-43 polypeptides are related by their reactivity with anti-ATF-43, DNA-binding specificity, complex formation with CREB, heat stability, and phosphorylation by protein kinase A. Certain cell types vary in their ATF-43 complement, suggesting that CREB activity is modulated in a cell-type-specific manner through interaction with ATF-43. ATF-43 polypeptides do not appear simply to correspond to the gene products of the ATF multigene family, suggesting that the size of the ATF family at the protein level is even larger than predicted from cDNA-cloning studies.
HMMBinder: DNA-Binding Protein Prediction Using HMM Profile Based Features.

PubMed

Zaman, Rianon; Chowdhury, Shahana Yasmin; Rashid, Mahmood A; Sharma, Alok; Dehzangi, Abdollah; Shatabda, Swakkhar

2017-01-01

DNA-binding proteins often play important role in various processes within the cell. Over the last decade, a wide range of classification algorithms and feature extraction techniques have been used to solve this problem. In this paper, we propose a novel DNA-binding protein prediction method called HMMBinder. HMMBinder uses monogram and bigram features extracted from the HMM profiles of the protein sequences. To the best of our knowledge, this is the first application of HMM profile based features for the DNA-binding protein prediction problem. We applied Support Vector Machines (SVM) as a classification technique in HMMBinder. Our method was tested on standard benchmark datasets. We experimentally show that our method outperforms the state-of-the-art methods found in the literature.
Selection of homeotic proteins for binding to a human DNA replication origin.

PubMed

de Stanchina, E; Gabellini, D; Norio, P; Giacca, M; Peverali, F A; Riva, S; Falaschi, A; Biamonti, G

2000-06-09

We have previously shown that a cell cycle-dependent nucleoprotein complex assembles in vivo on a 74 bp sequence within the human DNA replication origin associated to the Lamin B2 gene. Here, we report the identification, using a one-hybrid screen in yeast, of three proteins interacting with the 74 bp sequence. All of them, namely HOXA13, HOXC10 and HOXC13, are orthologues of the Abdominal-B gene of Drosophila melanogaster and are members of the homeogene family of developmental regulators. We describe the complete open reading frame sequence of HOXC10 and HOXC13 along with the structure of the HoxC13 gene. The specificity of binding of these two proteins to the Lamin B2 origin is confirmed by both band-shift and in vitro footprinting assays. In addition, the ability of HOXC10 and HOXC13 to increase the activity of a promoter containing the 74 bp sequence, as assayed by CAT-assay experiments, demonstrates a direct interaction of these homeoproteins with the origin sequence in mammalian cells. We also show that HOXC10 expression is cell-type-dependent and positively correlates with cell proliferation. Copyright 2000 Academic Press.
Simulation studies of DNA at the nanoscale: Interactions with proteins, polycations, and surfaces

NASA Astrophysics Data System (ADS)

Elder, Robert M.

Understanding the nanoscale interactions of DNA, a multifunctional biopolymer with sequence-dependent properties, with other biological and synthetic substrates and molecules is essential to advancing these technologies. This doctoral thesis research is aimed at understanding the thermodynamics and molecular-level structure when DNA interacts with proteins, polycations, and functionalized surfaces. First, we investigate the ability of a DNA damage recognition protein (HMGB1a) to bind to anti-cancer drug-induced DNA damage, seeking to explain how HMGB1a differentiates between the drugs in vivo. Using atomistic molecular dynamics simulations, we show that the structure of the drug-DNA molecule exhibits drug- and base sequence-dependence that explains some of the experimentally observed differential recognition of the drugs in various sequence contexts. Then, we show how steric hindrance from the drug decreases the deformability of the drug-DNA molecule, which decreases recognition by the protein, a concept that can be applied to rational drug design. Second, we study how polycation architecture and chemistry affect polycation-DNA binding so as to design optimal polycations for high efficiency gene (DNA) delivery. Using a multiscale computational approach involving atomistic and coarse-grained simulations, we examine how rearranging polylysine from a linear to a grafted architecture, and several aspects of the grafted architecture, affect polycation-DNA binding and the structure of polycation-DNA complexes. Next, going beyond lysine we examine how oligopeptide chemistry and sequence in the grafted architecture affects polycation-DNA binding and find that strategic placement of hydrophobic peptides might be used to tailor binding strength. Third, we study the adsorption and conformations of single-stranded DNA (an amphiphilic biopolymer) on model hydrophilic and hydrophobic surfaces. Short ssDNA oligomers adsorb to both surfaces with similar strength, with the strength of adsorption to the hydrophobic surface depending on the composition of the DNA strands, i.e. purine or pyrimidine bases. Additionally, DNA-surface and DNA-water interactions near the surfaces govern the adsorption. For longer ssDNA oligomers, the effects of surface chemistry and temperature on ssDNA conformations are rather small, but either the hydrophilic surface or increased temperature favor slightly more compact conformations due to energetic and entropic effects, respectively.
Recognition of platinum-DNA adducts by HMGB1a.

PubMed

Ramachandran, Srinivas; Temple, Brenda; Alexandrova, Anastassia N; Chaney, Stephen G; Dokholyan, Nikolay V

2012-09-25

Cisplatin (CP) and oxaliplatin (OX), platinum-based drugs used widely in chemotherapy, form adducts on intrastrand guanines (5'GG) in genomic DNA. DNA damage recognition proteins, transcription factors, mismatch repair proteins, and DNA polymerases discriminate between CP- and OX-GG DNA adducts, which could partly account for differences in the efficacy, toxicity, and mutagenicity of CP and OX. In addition, differential recognition of CP- and OX-GG adducts is highly dependent on the sequence context of the Pt-GG adduct. In particular, DNA binding protein domain HMGB1a binds to CP-GG DNA adducts with up to 53-fold greater affinity than to OX-GG adducts in the TGGA sequence context but shows much smaller differences in binding in the AGGC or TGGT sequence contexts. Here, simulations of the HMGB1a-Pt-DNA complex in the three sequence contexts revealed a higher number of interface contacts for the CP-DNA complex in the TGGA sequence context than in the OX-DNA complex. However, the number of interface contacts was similar in the TGGT and AGGC sequence contexts. The higher number of interface contacts in the CP-TGGA sequence context corresponded to a larger roll of the Pt-GG base pair step. Furthermore, geometric analysis of stacking of phenylalanine 37 in HMGB1a (Phe37) with the platinated guanines revealed more favorable stacking modes correlated with a larger roll of the Pt-GG base pair step in the TGGA sequence context. These data are consistent with our previous molecular dynamics simulations showing that the CP-TGGA complex was able to sample larger roll angles than the OX-TGGA complex or either CP- or OX-DNA complexes in the AGGC or TGGT sequences. We infer that the high binding affinity of HMGB1a for CP-TGGA is due to the greater flexibility of CP-TGGA compared to OX-TGGA and other Pt-DNA adducts. This increased flexibility is reflected in the ability of CP-TGGA to sample larger roll angles, which allows for a higher number of interface contacts between the Pt-DNA adduct and HMGB1a.
An isoleucine to leucine mutation that switches the cofactor requirement of the EcoRV restriction endonuclease from magnesium to manganese.

PubMed

Vipond, I B; Moon, B J; Halford, S E

1996-02-13

The EcoRV restriction endonuclease cleaves DNA at its recognition sequence more readily with Mg2+ as the cofactor than with Mn2+ but, at noncognate sequences that differ from the EcoRV site by one base pair, Mn2+ gives higher rates than Mg2+. A mutant of EcoRV, in which an isoleucine near the active site was replaced by leucine, showed the opposite behavior. It had low activity with Mg2+, but, in the presence of Mn2+ ions, it cleaved the recognition site faster than wild-type EcoRV with either Mn2+ or Mg2+. The mutant was also more specific for the recognition sequence than the native enzyme: the noncognate DNA cleavages by wild-type EcoRV and Mn2+ were not detected with the mutant. Further mutagenesis showed that the protein required the same acidic residues at its active site as wild-type EcoRV. The Ile-->Leu mutation seems to perturb the configuration of the metal-binding ligands at the active site so that the protein has virtually no affinity for Mg2+ yet it can still bind Mn2+ ions, though the latter only occurs when the protein is at the recognition site. This contrasts to wild-type EcoRV, where Mn2+ ions bind readily to complexes with either cognate and noncognate DNA and only Mg2+ shows the discrimination between the complexes. The structural perturbation is a specific consequence of leucine in place of isoleucine, since mutants with valine or alanine were similar to wild-type EcoRV.
Circadian clock protein KaiC forms ATP-dependent hexameric rings and binds DNA.

PubMed

Mori, Tetsuya; Saveliev, Sergei V; Xu, Yao; Stafford, Walter F; Cox, Michael M; Inman, Ross B; Johnson, Carl H

2002-12-24

KaiC from Synechococcus elongatus PCC 7942 (KaiC) is an essential circadian clock protein in cyanobacteria. Previous sequence analyses suggested its inclusion in the RecADnaB superfamily. A characteristic of the proteins of this superfamily is that they form homohexameric complexes that bind DNA. We show here that KaiC also forms ring complexes with a central pore that can be visualized by electron microscopy. A combination of analytical ultracentrifugation and chromatographic analyses demonstrates that these complexes are hexameric. The association of KaiC molecules into hexamers depends on the presence of ATP. The KaiC sequence does not include the obvious DNA-binding motifs found in RecA or DnaB. Nevertheless, KaiC binds forked DNA substrates. These data support the inclusion of KaiC into the RecADnaB superfamily and have important implications for enzymatic activity of KaiC in the circadian clock mechanism that regulates global changes in gene expression patterns.
Structural Determinants of DNA Binding by a P. falciparum ApiAP2 Transcriptional Regulator

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lindner, Scott E.; De Silva, Erandi K.; Keck, James L.

2010-11-05

Putative transcription factors have only recently been identified in the Plasmodium spp., with the major family of regulators comprising the Apicomplexan Apetala2 (AP2) proteins. To better understand the DNA-binding mechanisms of these transcriptional regulators, we characterized the structure and in vitro function of an AP2 DNA-binding domain from a prototypical Apicomplexan AP2 protein, PF14{_}0633 from Plasmodium falciparum. The X-ray crystal structure of the PF14{_}0633 AP2 domain bound to DNA reveals a {beta}-sheet fold that binds the DNA major groove through base-specific and backbone contacts; a prominent {alpha}-helix supports the {beta}-sheet structure. Substitution of predicted DNA-binding residues with alanine weakened ormore » eliminated DNA binding in solution. In contrast to plant AP2 domains, the PF14{_}0633 AP2 domain dimerizes upon binding to DNA through a domain-swapping mechanism in which the {alpha}-helices of the AP2 domains pack against the {beta}-sheets of the dimer mates. DNA-induced dimerization of PF14{_}0633 may be important for tethering two distal DNA loci together in the nucleus and/or for inducing functional rearrangements of its domains to facilitate transcriptional regulation. Consistent with a multisite binding mode, at least two copies of the consensus sequence recognized by PF14{_}0633 are present upstream of a previously identified group of sporozoite-stage genes. Taken together, these findings illustrate how Plasmodium has adapted the AP2 DNA-binding domain for genome-wide transcriptional regulation.« less
Inhibition of HMGA2 binding to DNA by netropsin

PubMed Central

Miao, Yi; Cui, Tengjiao; Leng, Fenfei; Wilson, W. David

2008-01-01

The design of small synthetic molecules that can be used to affect gene expression is an area of active interest for development of agents in therapeutic and biotechnology applications. Many compounds that target the minor groove in AT sequences in DNA are well characterized and are promising reagents for use as modulators of protein-DNA complexes. The mammalian high mobility group transcriptional factor, HMGA2, also targets the DNA minor groove and plays critical roles in disease processes from cancer to obesity. Biosensor-surface plasmon resonance methods were used to monitor HMGA2 binding to target sites on immobilized DNA and a competition assay for inhibition of the HMGA2-DNA complex was designed. HMGA2 binds strongly to the DNA through AT hook domains with KD values of 20 - 30 nM depending on the DNA sequence. The well-characterized minor groove binder, netropsin, was used to develop and test the assay. The compound has two binding sites in the protein-DNA interaction sequence and this provides an advantage for inhibition. An equation for analysis of results when the inhibitor has two binding sites in the biopolymer recognition surface is presented with the results. The assay provides a platform for discovery of HMGA2 inhibitors. PMID:18023407
PDNAsite: Identification of DNA-binding Site from Protein Sequence by Incorporating Spatial and Sequence Context

PubMed Central

Zhou, Jiyun; Xu, Ruifeng; He, Yulan; Lu, Qin; Wang, Hongpeng; Kong, Bing

2016-01-01

Protein-DNA interactions are involved in many fundamental biological processes essential for cellular function. Most of the existing computational approaches employed only the sequence context of the target residue for its prediction. In the present study, for each target residue, we applied both the spatial context and the sequence context to construct the feature space. Subsequently, Latent Semantic Analysis (LSA) was applied to remove the redundancies in the feature space. Finally, a predictor (PDNAsite) was developed through the integration of the support vector machines (SVM) classifier and ensemble learning. Results on the PDNA-62 and the PDNA-224 datasets demonstrate that features extracted from spatial context provide more information than those from sequence context and the combination of them gives more performance gain. An analysis of the number of binding sites in the spatial context of the target site indicates that the interactions between binding sites next to each other are important for protein-DNA recognition and their binding ability. The comparison between our proposed PDNAsite method and the existing methods indicate that PDNAsite outperforms most of the existing methods and is a useful tool for DNA-binding site identification. A web-server of our predictor (http://hlt.hitsz.edu.cn:8080/PDNAsite/) is made available for free public accessible to the biological research community. PMID:27282833
Diversity in Requirement of Genetic and Epigenetic Factors for Centromere Function in Fungi ▿

PubMed Central

Roy, Babhrubahan; Sanyal, Kaustuv

2011-01-01

A centromere is a chromosomal region on which several proteins assemble to form the kinetochore. The centromere-kinetochore complex helps in the attachment of chromosomes to spindle microtubules to mediate segregation of chromosomes to daughter cells during mitosis and meiosis. In several budding yeast species, the centromere forms in a DNA sequence-dependent manner, whereas in most other fungi, factors other than the DNA sequence also determine the centromere location, as centromeres were able to form on nonnative sequences (neocentromeres) when native centromeres were deleted in engineered strains. Thus, in the absence of a common DNA sequence, the cues that have facilitated centromere formation on a specific DNA sequence for millions of years remain a mystery. Kinetochore formation is facilitated by binding of a centromere-specific histone protein member of the centromeric protein A (CENP-A) family that replaces a canonical histone H3 to form a specialized centromeric chromatin structure. However, the process of kinetochore formation on the rapidly evolving and seemingly diverse centromere DNAs in different fungal species is largely unknown. More interestingly, studies in various yeasts suggest that the factors required for de novo centromere formation (establishment) may be different from those required for maintenance (propagation) of an already established centromere. Apart from the DNA sequence and CENP-A, many other factors, such as posttranslational modification (PTM) of histones at centric and pericentric chromatin, RNA interference, and DNA methylation, are also involved in centromere formation, albeit in a species-specific manner. In this review, we discuss how several genetic and epigenetic factors influence the evolution of structure and function of centromeres in fungal species. PMID:21908596
DNA Binding of Centromere Protein C (CENPC) Is Stabilized by Single-Stranded RNA

PubMed Central

Du, Yaqing; Topp, Christopher N.; Dawe, R. Kelly

2010-01-01

Centromeres are the attachment points between the genome and the cytoskeleton: centromeres bind to kinetochores, which in turn bind to spindles and move chromosomes. Paradoxically, the DNA sequence of centromeres has little or no role in perpetuating kinetochores. As such they are striking examples of genetic information being transmitted in a manner that is independent of DNA sequence (epigenetically). It has been found that RNA transcribed from centromeres remains bound within the kinetochore region, and this local population of RNA is thought to be part of the epigenetic marking system. Here we carried out a genetic and biochemical study of maize CENPC, a key inner kinetochore protein. We show that DNA binding is conferred by a localized region 122 amino acids long, and that the DNA-binding reaction is exquisitely sensitive to single-stranded RNA. Long, single-stranded nucleic acids strongly promote the binding of CENPC to DNA, and the types of RNAs that stabilize DNA binding match in size and character the RNAs present on kinetochores in vivo. Removal or replacement of the binding module with HIV integrase binding domain causes a partial delocalization of CENPC in vivo. The data suggest that centromeric RNA helps to recruit CENPC to the inner kinetochore by altering its DNA binding characteristics. PMID:20140237

Genome-wide Expression Profiling, In Vivo DNA Binding Analysis, and Probabilistic Motif Prediction Reveal Novel Abf1 Target Genes during Fermentation, Respiration, and Sporulation in Yeast

PubMed Central

Schlecht, Ulrich; Erb, Ionas; Demougin, Philippe; Robine, Nicolas; Borde, Valérie; van Nimwegen, Erik; Nicolas, Alain

2008-01-01

The autonomously replicating sequence binding factor 1 (Abf1) was initially identified as an essential DNA replication factor and later shown to be a component of the regulatory network controlling mitotic and meiotic cell cycle progression in budding yeast. The protein is thought to exert its functions via specific interaction with its target site as part of distinct protein complexes, but its roles during mitotic growth and meiotic development are only partially understood. Here, we report a comprehensive approach aiming at the identification of direct Abf1-target genes expressed during fermentation, respiration, and sporulation. Computational prediction of the protein's target sites was integrated with a genome-wide DNA binding assay in growing and sporulating cells. The resulting data were combined with the output of expression profiling studies using wild-type versus temperature-sensitive alleles. This work identified 434 protein-coding loci as being transcriptionally dependent on Abf1. More than 60% of their putative promoter regions contained a computationally predicted Abf1 binding site and/or were bound by Abf1 in vivo, identifying them as direct targets. The present study revealed numerous loci previously unknown to be under Abf1 control, and it yielded evidence for the protein's variable DNA binding pattern during mitotic growth and meiotic development. PMID:18305101
An ethylene-responsive enhancer element is involved in the senescence-related expression of the carnation glutathione-S-transferase (GST1) gene.

PubMed

Itzhaki, H; Maxson, J M; Woodson, W R

1994-09-13

The increased production of ethylene during carnation petal senescence regulates the transcription of the GST1 gene encoding a subunit of glutathione-S-transferase. We have investigated the molecular basis for this ethylene-responsive transcription by examining the cis elements and trans-acting factors involved in the expression of the GST1 gene. Transient expression assays following delivery of GST1 5' flanking DNA fused to a beta-glucuronidase receptor gene were used to functionally define sequences responsible for ethylene-responsive expression. Deletion analysis of the 5' flanking sequences of GST1 identified a single positive regulatory element of 197 bp between -667 and -470 necessary for ethylene-responsive expression. The sequences within this ethylene-responsive region were further localized to 126 bp between -596 and -470. The ethylene-responsive element (ERE) within this region conferred ethylene-regulated expression upon a minimal cauliflower mosaic virus-35S TATA-box promoter in an orientation-independent manner. Gel electrophoresis mobility-shift assays and DNase I footprinting were used to identify proteins that bind to sequences within the ERE. Nuclear proteins from carnation petals were shown to specifically interact with the 126-bp ERE and the presence and binding of these proteins were independent of ethylene or petal senescence. DNase I footprinting defined DNA sequences between -510 and -488 within the ERE specifically protected by bound protein. An 8-bp sequence (ATTTCAAA) within the protected region shares significant homology with promoter sequences required for ethylene responsiveness from the tomato fruit-ripening E4 gene.
Binding and Fusion of Extracellular Vesicles to the Plasma Membrane of Their Cell Targets.

PubMed

Prada, Ilaria; Meldolesi, Jacopo

2016-08-09

Exosomes and ectosomes, extracellular vesicles of two types generated by all cells at multivesicular bodies and the plasma membrane, respectively, play critical roles in physiology and pathology. A key mechanism of their function, analogous for both types of vesicles, is the fusion of their membrane to the plasma membrane of specific target cells, followed by discharge to the cytoplasm of their luminal cargo containing proteins, RNAs, and DNA. Here we summarize the present knowledge about the interactions, binding and fusions of vesicles with the cell plasma membrane. The sequence initiates with dynamic interactions, during which vesicles roll over the plasma membrane, followed by the binding of specific membrane proteins to their cell receptors. Membrane binding is then converted rapidly into fusion by mechanisms analogous to those of retroviruses. Specifically, proteins of the extracellular vesicle membranes are structurally rearranged, and their hydrophobic sequences insert into the target cell plasma membrane which undergoes lipid reorganization, protein restructuring and membrane dimpling. Single fusions are not the only process of vesicle/cell interactions. Upon intracellular reassembly of their luminal cargoes, vesicles can be regenerated, released and fused horizontally to other target cells. Fusions of extracellular vesicles are relevant also for specific therapy processes, now intensely investigated.
Recognition and Binding of Human Telomeric G-Quadruplex DNA by Unfolding Protein 1

PubMed Central

2015-01-01

The specific recognition by proteins of G-quadruplex structures provides evidence of a functional role for in vivo G-quadruplex structures. As previously reported, the ribonucleoprotein, hnRNP Al, and it is proteolytic derivative, unwinding protein 1 (UP1), bind to and destabilize G-quadruplex structures formed by the human telomeric repeat d(TTAGGG)n. UP1 has been proposed to be involved in the recruitment of telomerase to telomeres for chain extension. In this study, a detailed thermodynamic characterization of the binding of UP1 to a human telomeric repeat sequence, the d[AGGG(TTAGGG)3] G-quadruplex, is presented and reveals key insights into the UP1-induced unfolding of the G-quadruplex structure. The UP1–G-quadruplex interactions are shown to be enthalpically driven, exhibiting large negative enthalpy changes for the formation of both the Na+ and K+ G-quadruplex–UP1 complexes (ΔH values of −43 and −19 kcal/mol, respectively). These data reveal three distinct enthalpic contributions from the interactions of UP1 with the Na+ form of G-quadruplex DNA. The initial interaction is characterized by a binding affinity of 8.5 × 108 M–1 (strand), 200 times stronger than the binding of UP1 to a single-stranded DNA with a comparable but non-quadruplex-forming sequence [4.1 × 106 M–1 (strand)]. Circular dichroism spectroscopy reveals the Na+ form of the G-quadruplex to be completely unfolded by UP1 at a binding ratio of 2:1 (UP1:G-quadruplex DNA). The data presented here demonstrate that the favorable energetics of the initial binding event are closely coupled with and drive the unfolding of the G-quadruplex structure. PMID:24831962
Yeast aconitase binds and provides metabolically coupled protection to mitochondrial DNA.

PubMed

Chen, Xin Jie; Wang, Xiaowen; Butow, Ronald A

2007-08-21

Aconitase (Aco1p) is a multifunctional protein: It is an enzyme of the tricarboxylic acid cycle. In animal cells, Aco1p also is a cytosolic protein binding to mRNAs to regulate iron metabolism. In yeast, Aco1p was identified as a component of mtDNA nucleoids. Here we show that yeast Aco1p protects mtDNA from excessive accumulation of point mutations and ssDNA breaks and suppresses reductive recombination of mtDNA. Aconitase binds to both ds- and ssDNA, with a preference for GC-containing sequences. Therefore, mitochondria are opportunistic organelles that seize proteins, such as metabolic enzymes, for construction of the nucleoid, an mtDNA maintenance/segregation apparatus.
Combinatorial interactions of two amino acids with a single base pair define target site specificity in plant dimeric homeodomain proteins

PubMed Central

Tron, Adriana E.; Bertoncini, Carlos W.; Palena, Claudia M.; Chan, Raquel L.; Gonzalez, Daniel H.

2001-01-01

Four groups of plant homeodomain proteins contain a dimerization motif closely linked to the homeodomain. We here show that two sunflower homeodomain proteins, Hahb-4 and HAHR1, which belong to the Hd-Zip I and GL2/Hd-Zip IV groups, respectively, show different binding preferences at a defined position of a pseudopalindromic DNA-binding site used as a target. HAHR1 shows a preference for the sequence 5′-CATT(A/T)AATG-3′, rather than 5′-CAAT(A/T)ATTG-3′, recognized by Hahb-4. To analyze the molecular basis of this behavior, we have constructed a set of mutants with exchanged residues (Phe→Ile and Ile→Phe) at position 47 of the homeodomain, together with chimeric proteins between HAHR1 and Hahb-4. The results obtained indicate that Phe47, but not Ile47, allows binding to 5′-CATT(A/T)AATG-3′. However, the preference for this sequence is determined, in addition, by amino acids located C-terminal to residue 53 of the HAHR1 homeodomain. A double mutant of Hahb-4 (Ile47→Phe/Ala54→Thr) shows the same binding behavior as HAHR1, suggesting that combinatorial interactions of amino acid residues at positions 47 and 54 of the homeodomain are involved in establishing the affinity and selectivity of plant dimeric homeodomain proteins with different DNA target sequences. PMID:11726696
Two high-mobility group box domains act together to underwind and kink DNA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sánchez-Giraldo, R.; Acosta-Reyes, F. J.; Malarkey, C. S.

The crystal structure of HMGB1 box A bound to an unmodified AT-rich DNA fragment is reported at a resolution of 2 Å. A new mode of DNA recognition for HMG box proteins is found in which two box A domains bind in an unusual configuration generating a highly kinked DNA structure. High-mobility group protein 1 (HMGB1) is an essential and ubiquitous DNA architectural factor that influences a myriad of cellular processes. HMGB1 contains two DNA-binding domains, box A and box B, which have little sequence specificity but have remarkable abilities to underwind and bend DNA. Although HMGB1 box A ismore » thought to be responsible for the majority of HMGB1–DNA interactions with pre-bent or kinked DNA, little is known about how it recognizes unmodified DNA. Here, the crystal structure of HMGB1 box A bound to an AT-rich DNA fragment is reported at a resolution of 2 Å. Two box A domains of HMGB1 collaborate in an unusual configuration in which the Phe37 residues of both domains stack together and intercalate the same CG base pair, generating highly kinked DNA. This represents a novel mode of DNA recognition for HMGB proteins and reveals a mechanism by which structure-specific HMG boxes kink linear DNA.« less
Interference between Triplex and Protein Binding to Distal Sites on Supercoiled DNA.

PubMed

Noy, Agnes; Maxwell, Anthony; Harris, Sarah A

2017-02-07

We have explored the interdependence of the binding of a DNA triplex and a repressor protein to distal recognition sites on supercoiled DNA minicircles using MD simulations. We observe that the interaction between the two ligands through their influence on their DNA template is determined by a subtle interplay of DNA mechanics and electrostatics, that the changes in flexibility induced by ligand binding play an important role and that supercoiling can instigate additional ligand-DNA contacts that would not be possible in simple linear DNA sequences. Copyright © 2017. Published by Elsevier Inc.
Template-directed covalent conjugation of DNA to native antibodies, transferrin and other metal-binding proteins

NASA Astrophysics Data System (ADS)

Rosen, Christian B.; Kodal, Anne L. B.; Nielsen, Jesper S.; Schaffert, David H.; Scavenius, Carsten; Okholm, Anders H.; Voigt, Niels V.; Enghild, Jan J.; Kjems, Jørgen; Tørring, Thomas; Gothelf, Kurt V.

2014-09-01

DNA-protein conjugates are important in bioanalytical chemistry, molecular diagnostics and bionanotechnology, as the DNA provides a unique handle to identify, functionalize or otherwise manipulate proteins. To maintain protein activity, conjugation of a single DNA handle to a specific location on the protein is often needed. However, preparing such high-quality site-specific conjugates often requires genetically engineered proteins, which is a laborious and technically challenging approach. Here we demonstrate a simpler method to create site-selective DNA-protein conjugates. Using a guiding DNA strand modified with a metal-binding functionality, we directed a second DNA strand to the vicinity of a metal-binding site of His6-tagged or wild-type metal-binding proteins, such as serotransferrin, where it subsequently reacted with lysine residues at that site. This method, DNA-templated protein conjugation, facilitates the production of site-selective protein conjugates, and also conjugation to IgG1 antibodies via a histidine cluster in the constant domain.
Presynaptic Filament Dynamics in Homologous Recombination and DNA Repair

PubMed Central

Liu, Jie; Ehmsen, Kirk T.; Heyer, Wolf-Dietrich; Morrical, Scott W.

2014-01-01

Homologous Recombination (HR) is an essential genome stability mechanism used for high-fidelity repair of DNA double-strand breaks and for the recovery of stalled or collapsed DNA replication forks. The crucial homology search and DNA strand exchange steps of HR are catalyzed by presynaptic filaments—helical filaments of a recombinase enzyme bound to single-stranded DNA. Presynaptic filaments are fundamentally dynamic structures, the assembly, catalytic turnover, and disassembly of which must be closely coordinated with other elements of the DNA recombination, repair, and replication machinery in order for genome maintenance functions to be effective. Here, we review the major dynamic elements controlling the assembly, activity, and disassembly of presynaptic filaments: some intrinsic such as recombinase ATP binding and hydrolytic activities, others extrinsic such as ssDNA-binding proteins, mediator proteins, and DNA motor proteins. We examine dynamic behavior on multiple levels, including atomic- and filament-level structural changes associated with ATP binding and hydrolysis as evidenced in crystal structures, as well as subunit binding and dissociation events driven by intrinsic and extrinsic factors. We examine the biochemical properties of recombination proteins from four model systems (T4 phage, E. coli, S. cerevisiae, and H. sapiens), demonstrating how their properties are tailored for the context-specific requirements in these diverse species. We propose that the presynaptic filament has evolved to rely on multiple external factors for increased multi-level regulation of HR processes in genomes with greater structural and sequence complexity. PMID:21599536
The discovery of zinc fingers and their development for practical applications in gene regulation and genome manipulation.

PubMed

Klug, Aaron

2010-02-01

A long-standing goal of molecular biologists has been to construct DNA-binding proteins for the control of gene expression. The classical Cys2His2 (C2H2) zinc finger design is ideally suited for such purposes. Discriminating between closely related DNA sequences both in vitro and in vivo, this naturally occurring design was adopted for engineering zinc finger proteins (ZFPs) to target genes specifically. Zinc fingers were discovered in 1985, arising from the interpretation of our biochemical studies on the interaction of the Xenopus protein transcription factor IIIA (TFIIIA) with 5S RNA. Subsequent structural studies revealed its three-dimensional structure and its interaction with DNA. Each finger constitutes a self-contained domain stabilized by a zinc (Zn) ion ligated to a pair of cysteines and a pair of histidines and also by an inner structural hydrophobic core. This discovery showed not only a new protein fold but also a novel principle of DNA recognition. Whereas other DNA-binding proteins generally make use of the 2-fold symmetry of the double helix, functioning as homo- or heterodimers, zinc fingers can be linked linearly in tandem to recognize nucleic acid sequences of varying lengths. This modular design offers a large number of combinatorial possibilities for the specific recognition of DNA (or RNA). It is therefore not surprising that the zinc finger is found widespread in nature, including 3% of the genes of the human genome. The zinc finger design can be used to construct DNA-binding proteins for specific intervention in gene expression. By fusing selected zinc finger peptides to repression or activation domains, genes can be selectively switched off or on by targeting the peptide to the desired gene target. It was also suggested that by combining an appropriate zinc finger peptide with other effector or functional domains, e.g. from nucleases or integrases to form chimaeric proteins, genomes could be modified or manipulated. The first example of the power of the method was published in 1994 when a three-finger protein was constructed to block the expression of a human oncogene transformed into a mouse cell line. The same paper also described how a reporter gene was activated by targeting an inserted 9-base pair (bp) sequence, which acts as the promoter. Thus, by fusing zinc finger peptides to repression or activation domains, genes can be selectively switched off or on. It was also suggested that, by combining zinc fingers with other effector or functional domains, e.g. from nucleases or integrases, to form chimaeric proteins, genomes could be manipulated or modified. Several applications of such engineered ZFPs are described here, including some of therapeutic importance, and also their adaptation for breeding improved crop plants.
Reactivation of mutant p53: Constraints on mechanism highlighted by principal component analysis of the DNA binding domain.

PubMed

Ouaray, Zahra; ElSawy, Karim M; Lane, David P; Essex, Jonathan W; Verma, Chandra

2016-10-01

Most p53 mutations associated with cancer are located in its DNA binding domain (DBD). Many structures (X-ray and NMR) of this domain are available in the protein data bank (PDB) and a vast conformational heterogeneity characterizes the various free and complexed states. The major difference between the apo and the holo-complexed states appears to lie in the L1 loop. In particular, the conformations of this loop appear to depend intimately on the sequence of DNA to which it binds. This conclusion builds upon recent observations that implicate the tetramerization and the C-terminal domains (respectively TD and Cter) in DNA binding specificity. Detailed PCA analysis of the most recent collection of DBD structures from the PDB have been carried out. In contrast to recommendations that small molecules/drugs stabilize the flexible L1 loop to rescue mutant p53, our study highlights a need to retain the flexibility of the p53 DNA binding surface (DBS). It is the adaptability of this region that enables p53 to engage in the diverse interactions responsible for its functionality. Proteins 2016; 84:1443-1461. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Crystallization of bFGF-DNA Aptamer Complexes Using a Sparse Matrix Designed for Protein-Nucleic Acid Complexes

NASA Technical Reports Server (NTRS)

Cannone, Jaime J.; Barnes, Cindy L.; Achari, Aniruddha; Kundrot, Craig E.; Whitaker, Ann F. (Technical Monitor)

2001-01-01

The Sparse Matrix approach for obtaining lead crystallization conditions has proven to be very fruitful for the crystallization of proteins and nucleic acids. Here we report a Sparse Matrix developed specifically for the crystallization of protein-DNA complexes. This method is rapid and economical, typically requiring 2.5 mg of complex to test 48 conditions. The method was originally developed to crystallize basic fibroblast growth factor (bFGF) complexed with DNA sequences identified through in vitro selection, or SELEX, methods. Two DNA aptamers that bind with approximately nanomolar affinity and inhibit the angiogenic properties of bFGF were selected for co-crystallization. The Sparse Matrix produced lead crystallization conditions for both bFGF-DNA complexes.
Role of the Adenovirus DNA-Binding Protein in In Vitro Adeno-Associated Virus DNA Replication

PubMed Central

Ward, Peter; Dean, Frank B.; O’Donnell, Michael E.; Berns, Kenneth I.

1998-01-01

A basic question in adeno-associated virus (AAV) biology has been whether adenovirus (Ad) infection provided any function which directly promoted replication of AAV DNA. Previously in vitro assays for AAV DNA replication, using linear duplex AAV DNA as the template, uninfected or Ad-infected HeLa cell extracts, and exogenous AAV Rep protein, demonstrated that Ad infection provides a direct helper effect for AAV DNA replication. It was shown that the nature of this helper effect was to increase the processivity of AAV DNA replication. Left unanswered was the question of whether this effect was the result of cellular factors whose activity was enhanced by Ad infection or was the result of direct participation of Ad proteins in AAV DNA replication. In this report, we show that in the in vitro assay, enhancement of processivity occurs with the addition of either the Ad DNA-binding protein (Ad-DBP) or the human single-stranded DNA-binding protein (replication protein A [RPA]). Clearly Ad-DBP is present after Ad infection but not before, whereas the cellular level of RPA is not apparently affected by Ad infection. However, we have not measured possible modifications of RPA which might occur after Ad infection and affect AAV DNA replication. When the substrate for replication was an AAV genome inserted into a plasmid vector, RPA was not an effective substitute for Ad-DBP. Extracts supplemented with Ad-DBP preferentially replicated AAV sequences rather than adjacent vector sequences; in contrast, extracts supplemented with RPA preferentially replicated vector sequences. PMID:9420241
Elevated expression of ribosomal protein genes L37, RPP-1, and S2 in the presence of mutant p53.

PubMed

Loging, W T; Reisman, D

1999-11-01

The wild-type p53 protein is a DNA-binding transcription factor that activates genes such as p21, MDM2, GADD45, and Bax that are required for the regulation of cell cycle progression or apoptosis in response to DNA damage. Mutant forms of p53, which are transforming oncogenes and are expressed at high levels in tumor cells, generally have a reduced binding affinity for the consensus DNA sequence. Interestingly, some p53 mutants that are no longer effective at binding to the consensus DNA sequence and transactivating promoters containing this target site have acquired the ability to transform cells in culture, in part through their ability to transactivate promoters of a number of genes that are not targets of the wild-type protein. Certain p53 mutants are therefore considered to be gain-of-function mutants and appear to be promoting proliferation or transforming cells through their ability to alter the expression of novel sets of genes. Our goal is to identify genes that have altered expression in the presence of a specific mutant p53 (Arg to Trp mutation at codon 248) protein. Through examining differential gene expression in cells devoid of p53 expression and in cells that express high levels of mutant p53 protein, we have identified three ribosomal protein genes that have elevated expression in response to mutant p53. Consistent with these findings, the overexpression of a number of ribosomal protein genes in human tumors and evidence for their contribution to oncogenic transformation have been reported previously, although the mechanism leading to this overexpression has remained elusive. We show results that indicate that expression of these specific ribosomal protein genes is increased in the presence of the R248W p53 mutant, which provides a mechanism for their overexpression in human tumors.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Damian, Luminita, E-mail: luminitadamian@microcal.eu.com; Universite de Toulouse, UPS, IPBS, F-31077 Toulouse; IUB, School of Engineering and Science, D-28727 Bremen

Is single-strand DNA translatable? Since the 60s, the question still remains whether or not DNA could be directly translated into protein. Some discrepancies in the results were reported about functional translation of single-strand DNA but all results converged on a similar behavior of RNA and ssDNA in the initiation step. Isothermal Titration Calorimetry method was used to determine thermodynamic constants of interaction between single-strand DNA and S30 extract of Escherichia coli. Our results showed that the binding was not affected by the nature of the template tested and the dissociation constants were in the same range when ssDNA (K{sub d}more » = 3.62 {+-} 2.1 x 10{sup -8} M) or the RNA corresponding sequence (K{sub d} = 2.7 {+-} 0.82 x 10{sup -8} M) bearing SD/ATG sequences were used. The binding specificity was confirmed by antibiotic interferences which block the initiation complex formation. These results suggest that the limiting step in translation of ssDNA is the elongation process.« less
Interactions of HIPPI, a molecular partner of Huntingtin interacting protein HIP1, with the specific motif present at the putative promoter sequence of the caspase-1, caspase-8 and caspase-10 genes.

PubMed

Majumder, P; Choudhury, A; Banerjee, M; Lahiri, A; Bhattacharyya, N P

2007-08-01

To investigate the mechanism of increased expression of caspase-1 caused by exogenous Hippi, observed earlier in HeLa and Neuro2A cells, in this work we identified a specific motif AAAGACATG (- 101 to - 93) at the caspase-1 gene upstream sequence where HIPPI could bind. Various mutations in this specific sequence compromised the interaction, showing the specificity of the interactions. In the luciferase reporter assay, when the reporter gene was driven by caspase-1 gene upstream sequences (- 151 to - 92) with the mutation G to T at position - 98, luciferase activity was decreased significantly in green fluorescent protein-Hippi-expressing HeLa cells in comparison to that obtained with the wild-type caspase-1 gene 60 bp upstream sequence, indicating the biological significance of such binding. It was observed that the C-terminal 'pseudo' death effector domain of HIPPI interacted with the 60 bp (- 151 to - 92) upstream sequence of the caspase-1 gene containing the motif. We further observed that expression of caspase-8 and caspase-10 was increased in green fluorescent protein-Hippi-expressing HeLa cells. In addition, HIPPI interacted in vitro with putative promoter sequences of these genes, containing a similar motif. In summary, we identified a novel function of HIPPI; it binds to specific upstream sequences of the caspase-1, caspase-8 and caspase-10 genes and alters the expression of the genes. This result showed the motif-specific interaction of HIPPI with DNA, and indicates that it could act as transcription regulator.
Finding the target sites of RNA-binding proteins

PubMed Central

Li, Xiao; Kazan, Hilal; Lipshitz, Howard D; Morris, Quaid D

2014-01-01

RNA–protein interactions differ from DNA–protein interactions because of the central role of RNA secondary structure. Some RNA-binding domains (RBDs) recognize their target sites mainly by their shape and geometry and others are sequence-specific but are sensitive to secondary structure context. A number of small- and large-scale experimental approaches have been developed to measure RNAs associated in vitro and in vivo with RNA-binding proteins (RBPs). Generalizing outside of the experimental conditions tested by these assays requires computational motif finding. Often RBP motif finding is done by adapting DNA motif finding methods; but modeling secondary structure context leads to better recovery of RBP-binding preferences. Genome-wide assessment of mRNA secondary structure has recently become possible, but these data must be combined with computational predictions of secondary structure before they add value in predicting in vivo binding. There are two main approaches to incorporating structural information into motif models: supplementing primary sequence motif models with preferred secondary structure contexts (e.g., MEMERIS and RNAcontext) and directly modeling secondary structure recognized by the RBP using stochastic context-free grammars (e.g., CMfinder and RNApromo). The former better reconstruct known binding preferences for sequence-specific RBPs but are not suitable for modeling RBPs that recognize shape and geometry of RNAs. Future work in RBP motif finding should incorporate interactions between multiple RBDs and multiple RBPs in binding to RNA. WIREs RNA 2014, 5:111–130. doi: 10.1002/wrna.1201 PMID:24217996
Re-visiting protein-centric two-tier classification of existing DNA-protein complexes

PubMed Central

2012-01-01

Background Precise DNA-protein interactions play most important and vital role in maintaining the normal physiological functioning of the cell, as it controls many high fidelity cellular processes. Detailed study of the nature of these interactions has paved the way for understanding the mechanisms behind the biological processes in which they are involved. Earlier in 2000, a systematic classification of DNA-protein complexes based on the structural analysis of the proteins was proposed at two tiers, namely groups and families. With the advancement in the number and resolution of structures of DNA-protein complexes deposited in the Protein Data Bank, it is important to revisit the existing classification. Results On the basis of the sequence analysis of DNA binding proteins, we have built upon the protein centric, two-tier classification of DNA-protein complexes by adding new members to existing families and making new families and groups. While classifying the new complexes, we also realised the emergence of new groups and families. The new group observed was where β-propeller was seen to interact with DNA. There were 34 SCOP folds which were observed to be present in the complexes of both old and new classifications, whereas 28 folds are present exclusively in the new complexes. Some new families noticed were NarL transcription factor, Z-α DNA binding proteins, Forkhead transcription factor, AP2 protein, Methyl CpG binding protein etc. Conclusions Our results suggest that with the increasing number of availability of DNA-protein complexes in Protein Data Bank, the number of families in the classification increased by approximately three fold. The folds present exclusively in newly classified complexes is suggestive of inclusion of proteins with new function in new classification, the most populated of which are the folds responsible for DNA damage repair. The proposed re-visited classification can be used to perform genome-wide surveys in the genomes of interest for the presence of DNA-binding proteins. Further analysis of these complexes can aid in developing algorithms for identifying DNA-binding proteins and their family members from mere sequence information. PMID:22800292
Re-visiting protein-centric two-tier classification of existing DNA-protein complexes.

PubMed

Malhotra, Sony; Sowdhamini, Ramanathan

2012-07-16

Precise DNA-protein interactions play most important and vital role in maintaining the normal physiological functioning of the cell, as it controls many high fidelity cellular processes. Detailed study of the nature of these interactions has paved the way for understanding the mechanisms behind the biological processes in which they are involved. Earlier in 2000, a systematic classification of DNA-protein complexes based on the structural analysis of the proteins was proposed at two tiers, namely groups and families. With the advancement in the number and resolution of structures of DNA-protein complexes deposited in the Protein Data Bank, it is important to revisit the existing classification. On the basis of the sequence analysis of DNA binding proteins, we have built upon the protein centric, two-tier classification of DNA-protein complexes by adding new members to existing families and making new families and groups. While classifying the new complexes, we also realised the emergence of new groups and families. The new group observed was where β-propeller was seen to interact with DNA. There were 34 SCOP folds which were observed to be present in the complexes of both old and new classifications, whereas 28 folds are present exclusively in the new complexes. Some new families noticed were NarL transcription factor, Z-α DNA binding proteins, Forkhead transcription factor, AP2 protein, Methyl CpG binding protein etc. Our results suggest that with the increasing number of availability of DNA-protein complexes in Protein Data Bank, the number of families in the classification increased by approximately three fold. The folds present exclusively in newly classified complexes is suggestive of inclusion of proteins with new function in new classification, the most populated of which are the folds responsible for DNA damage repair. The proposed re-visited classification can be used to perform genome-wide surveys in the genomes of interest for the presence of DNA-binding proteins. Further analysis of these complexes can aid in developing algorithms for identifying DNA-binding proteins and their family members from mere sequence information.

Crystallization and preliminary X-ray diffraction analysis of the Bacillus subtilis replication termination protein in complex with the 37-base-pair TerI-binding site

DOE Office of Scientific and Technical Information (OSTI.GOV)

Vivian, J. P.; Porter, C.; Wilce, J. A.

2006-11-01

A preparation of replication terminator protein (RTP) of B. subtilis and a 37-base-pair TerI sequence (comprising two binding sites for RTP) has been purified and crystallized. The replication terminator protein (RTP) of Bacillus subtilis binds to specific DNA sequences that halt the progression of the replisome in a polar manner. These terminator complexes flank a defined region of the chromosome into which they allow replication forks to enter but not exit. Forcing the fusion of replication forks in a specific zone is thought to allow the coordination of post-replicative processes. The functional terminator complex comprises two homodimers each of 29more » kDa bound to overlapping binding sites. A preparation of RTP and a 37-base-pair TerI sequence (comprising two binding sites for RTP) has been purified and crystallized. A data set to 3.9 Å resolution with 97.0% completeness and an R{sub sym} of 12% was collected from a single flash-cooled crystal using synchrotron radiation. The diffraction data are consistent with space group P622, with unit-cell parameters a = b = 118.8, c = 142.6 Å.« less
Transcriptional control of the tissue-specific, developmentally regulated osteocalcin gene requires a binding motif for the Msx family of homeodomain proteins.

PubMed

Hoffmann, H M; Catron, K M; van Wijnen, A J; McCabe, L R; Lian, J B; Stein, G S; Stein, J L

1994-12-20

The OC box of the rat osteocalcin promoter (nt -99 to -76) is the principal proximal regulatory element contributing to both tissue-specific and developmental control of osteocalcin gene expression. The central motif of the OC box includes a perfect consensus DNA binding site for certain homeodomain proteins. Homeodomain proteins are transcription factors that direct proper development by regulating specific temporal and spatial patterns of gene expression. We therefore addressed the role of the homeodomain binding motif in the activity of the OC promoter. In this study, by the combined application of mutagenesis and site-specific protein recognition analysis, we examined interactions of ROS 17/2.8 osteosarcoma cell nuclear proteins and purified Msx-1 homeodomain protein with the OC box. We detected a series of related specific protein-DNA interactions, a subset of which were inhibited by antibodies directed against the Msx-1 homeodomain but which also recognize the Msx-2 homeodomain. Our results show that the sequence requirements for binding the Msx-1 or Msx-2 homeodomain closely parallel those necessary for osteocalcin gene promoter activity in vivo. This functional relationship was demonstrated by transient expression in ROS 17/2.8 osteosarcoma cells of a series of osteocalcin promoter (nt -1097 to +24)-reporter gene constructs containing mutations within and flanking the homeodomain binding site of the OC box. Northern blot analysis of several bone-related cell types showed that all of the cells expressed msx-1, whereas msx-2 expression was restricted to cells transcribing osteocalcin. Taken together, our results suggest a role for Msx-1 and -2 or related homeodomain proteins in transcription of the osteocalcin gene.
Interaction of the Sliding Clamp β-Subunit and Hda, a DnaA-Related Protein

PubMed Central

Kurz, Mareike; Dalrymple, Brian; Wijffels, Gene; Kongsuwan, Kritaya

2004-01-01

In Escherichia coli, interactions between the replication initiation protein DnaA, the β subunit of DNA polymerase III (the sliding clamp protein), and Hda, the recently identified DnaA-related protein, are required to convert the active ATP-bound form of DnaA to an inactive ADP-bound form through the accelerated hydrolysis of ATP. This rapid hydrolysis of ATP is proposed to be the main mechanism that blocks multiple initiations during cell cycle and acts as a molecular switch from initiation to replication. However, the biochemical mechanism for this crucial step in DNA synthesis has not been resolved. Using purified Hda and β proteins in a plate binding assay and Ni-nitrilotriacetic acid pulldown analysis, we show for the first time that Hda directly interacts with β in vitro. A new β-binding motif, a hexapeptide with the consensus sequence QL[SP]LPL, related to the previously identified β-binding pentapeptide motif (QL[SD]LF) was found in the amino terminus of the Hda protein. Mutants of Hda with amino acid changes in the hexapeptide motif are severely defective in their ability to bind β. A 10-amino-acid peptide containing the E. coli Hda β-binding motif was shown to compete with Hda for binding to β in an Hda-β interaction assay. These results establish that the interaction of Hda with β is mediated through the hexapeptide sequence. We propose that this interaction may be crucial to the events that lead to the inactivation of DnaA and the prevention of excess initiation of rounds of replication. PMID:15150238
Interaction of the sliding clamp beta-subunit and Hda, a DnaA-related protein.

PubMed

Kurz, Mareike; Dalrymple, Brian; Wijffels, Gene; Kongsuwan, Kritaya

2004-06-01

In Escherichia coli, interactions between the replication initiation protein DnaA, the beta subunit of DNA polymerase III (the sliding clamp protein), and Hda, the recently identified DnaA-related protein, are required to convert the active ATP-bound form of DnaA to an inactive ADP-bound form through the accelerated hydrolysis of ATP. This rapid hydrolysis of ATP is proposed to be the main mechanism that blocks multiple initiations during cell cycle and acts as a molecular switch from initiation to replication. However, the biochemical mechanism for this crucial step in DNA synthesis has not been resolved. Using purified Hda and beta proteins in a plate binding assay and Ni-nitrilotriacetic acid pulldown analysis, we show for the first time that Hda directly interacts with beta in vitro. A new beta-binding motif, a hexapeptide with the consensus sequence QL[SP]LPL, related to the previously identified beta-binding pentapeptide motif (QL[SD]LF) was found in the amino terminus of the Hda protein. Mutants of Hda with amino acid changes in the hexapeptide motif are severely defective in their ability to bind beta. A 10-amino-acid peptide containing the E. coli Hda beta-binding motif was shown to compete with Hda for binding to beta in an Hda-beta interaction assay. These results establish that the interaction of Hda with beta is mediated through the hexapeptide sequence. We propose that this interaction may be crucial to the events that lead to the inactivation of DnaA and the prevention of excess initiation of rounds of replication.
Surface shapes and surrounding environment analysis of single- and double-stranded DNA-binding proteins in protein-DNA interface.

PubMed

Wang, Wei; Liu, Juan; Sun, Lin

2016-07-01

Protein-DNA bindings are critical to many biological processes. However, the structural mechanisms underlying these interactions are not fully understood. Here, we analyzed the residues shape (peak, flat, or valley) and the surrounding environment of double-stranded DNA-binding proteins (DSBs) and single-stranded DNA-binding proteins (SSBs) in protein-DNA interfaces. In the results, we found that the interface shapes, hydrogen bonds, and the surrounding environment present significant differences between the two kinds of proteins. Built on the investigation results, we constructed a random forest (RF) classifier to distinguish DSBs and SSBs with satisfying performance. In conclusion, we present a novel methodology to characterize protein interfaces, which will deepen our understanding of the specificity of proteins binding to ssDNA (single-stranded DNA) or dsDNA (double-stranded DNA). Proteins 2016; 84:979-989. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Zuotin, a putative Z-DNA binding protein in Saccharomyces cerevisiae

NASA Technical Reports Server (NTRS)

Zhang, S.; Lockshin, C.; Herbert, A.; Winter, E.; Rich, A.

1992-01-01

A putative Z-DNA binding protein, named zuotin, was purified from a yeast nuclear extract by means of a Z-DNA binding assay using [32P]poly(dG-m5dC) and [32P]oligo(dG-Br5dC)22 in the presence of B-DNA competitor. Poly(dG-Br5dC) in the Z-form competed well for the binding of a zuotin containing fraction, but salmon sperm DNA, poly(dG-dC) and poly(dA-dT) were not effective. Negatively supercoiled plasmid pUC19 did not compete, whereas an otherwise identical plasmid pUC19(CG), which contained a (dG-dC)7 segment in the Z-form was an excellent competitor. A Southwestern blot using [32P]poly(dG-m5dC) as a probe in the presence of MgCl2 identified a protein having a molecular weight of 51 kDa. The 51 kDa zuotin was partially sequenced at the N-terminal and the gene, ZUO1, was cloned, sequenced and expressed in Escherichia coli; the expressed zuotin showed similar Z-DNA binding activity, but with lower affinity than zuotin that had been partially purified from yeast. Zuotin was deduced to have a number of potential phosphorylation sites including two CDC28 (homologous to the human and Schizosaccharomyces pombe cdc2) phosphorylation sites. The hexapeptide motif KYHPDK was found in zuotin as well as in several yeast proteins, DnaJ of E.coli, csp29 and csp32 proteins of Drosophila and the small t and large T antigens of the polyoma virus. A 60 amino acid segment of zuotin has similarity to several histone H1 sequences. Disruption of ZUO1 in yeast resulted in a slow growth phenotype.
Studies of Xenopus laevis mitochondrial DNA: D-loop mapping and characterization of DNA-binding proteins

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cairns, S.S.

1987-01-01

In X. laevis oocytes, mitochondrial DNA accumulates to 10/sup 5/ times the somatic cell complement, and is characterized by a high frequency of a triple-stranded displacement hoop structure at the origin of replication. To map the termini of the single strands, it was necessary to correct the nucleotide sequence of the D-loop region. The revised sequence of 2458 nucleotides contains 54 discrepancies in comparison to a previously published sequence. Radiolabeling of the nascent strands of the D-loop structure either at the 5' end or at the 3' end identifies a major species with a length of 1670 nucleotides. Cleavage ofmore » the 5' labeled strands reveals two families of ends located near several matches to an element, designated CSB-1, that is conserved in this location in several vertebrate genomes. Cleavage of 3' labeled strands produced one fragment. The unique 3' end maps to about 15 nucleotides preceding the tRNA/sup Pro/ gene. A search for proteins which may bind to mtDNA in this region to regulate nucleic acid synthesis has identified three activities in lysates of X. laevis mitochondria. The DNA-binding proteins were assayed by monitoring their ability to retard the migration of labeled double- or single-stranded DNA fragments in polyacrylamide gels. The DNA binding preference was determined by competition with an excess of either ds- or ssDNA.« less
A single-stranded DNA binding protein from mouse tumor cells specifically recognizes the C-rich strand of the (AGG:CCT)n repeats that can alter DNA conformation.

PubMed Central

Muraiso, T; Nomoto, S; Yamazaki, H; Mishima, Y; Kominami, R

1992-01-01

A protein that binds to a synthetic oligonucleotide of (CCT)12 has been purified from Ehrlich ascites tumor cells by a (CCT)12 affinity chromatography. The protein (p70) has an apparent molecular mass of 70 kDa, as assayed by Southwestern analysis. A competition experiment revealed that p70 binds to (CCT)12, (CCCT)8 and (CCTCCCT)6, but not to (CTT)12, (CT)16 and (CCTGCCT)6, suggesting that p70 has a sequence-specificity. The complementary (AGG)12 and the double stranded DNA did not show the binding. It is also confirmed by S1 nuclease analysis that the (AGG:CCT)12 duplex takes a single-stranded conformation in the absence of the protein. This raises a possibility that the duplex forms two single-stranded loops in chromosomes, the C-rich strand being bound to p70. Structural analysis of the resulting (AGG)12 strand by non-denaturing polyacrylamide gel electrophoresis demonstrated the presence of slower and faster migrated conformers in a neutral pH buffer containing 50 mM NaCl at 5 degrees C. The ratio was dependent on the DNA concentration. Both conformers disappeared in the absence of NaCl. This suggests that (AGG)12 can form intra- and inter-molecular complexes by non-Watson-Crick, guanine:guanine base-pairing. The possible biological function of the (AGG:CCT)n duplex and the p70 is discussed. Images PMID:1480484
Changes in solvation during DNA binding and cleavage are critical to altered specificity of the EcoRI endonuclease

PubMed Central

Robinson, Clifford R.; Sligar, Stephen G.

1998-01-01

Restriction endonucleases such as EcoRI bind and cleave DNA with great specificity and represent a paradigm for protein–DNA interactions and molecular recognition. Using osmotic pressure to induce water release, we demonstrate the participation of bound waters in the sequence discrimination of substrate DNA by EcoRI. Changes in solvation can play a critical role in directing sequence-specific DNA binding by EcoRI and are also crucial in assisting site discrimination during catalysis. By measuring the volume change for complex formation, we show that at the cognate sequence (GAATTC) EcoRI binding releases about 70 fewer water molecules than binding at an alternate DNA sequence (TAATTC), which differs by a single base pair. EcoRI complexation with nonspecific DNA releases substantially less water than either of these specific complexes. In cognate substrates (GAATTC) kcat decreases as osmotic pressure is increased, indicating the binding of about 30 water molecules accompanies the cleavage reaction. For the alternate substrate (TAATTC), release of about 40 water molecules accompanies the reaction, indicated by a dramatic acceleration of the rate when osmotic pressure is raised. These large differences in solvation effects demonstrate that water molecules can be key players in the molecular recognition process during both association and catalytic phases of the EcoRI reaction, acting to change the specificity of the enzyme. For both the protein–DNA complex and the transition state, there may be substantial conformational differences between cognate and alternate sites, accompanied by significant alterations in hydration and solvent accessibility. PMID:9482860
The electrostatic role of the Zn-Cys2His2 complex in binding of operator DNA with transcription factors: mouse EGR-1 from the Cys2His2 family.

PubMed

Chirgadze, Y N; Boshkova, E A; Polozov, R V; Sivozhelezov, V S; Dzyabchenko, A V; Kuzminsky, M B; Stepanenko, V A; Ivanov, V V

2018-01-07

The mouse factor Zif268, known also as early growth response protein EGR-1, is a classical representative for the Cys2His2 transcription factor family. It is required for binding the RNA polymerase with operator dsDNA to initialize the transcription process. We have shown that only in this family of total six Zn-finger protein families the Zn complex plays a significant role in the protein-DNA binding. Electrostatic feature of this complex in the binding of factor Zif268 from Mus musculus with operator DNA has been considered. The factor consists of three similar Zn-finger units which bind with triplets of coding DNA. Essential contacts of the factor with the DNA phosphates are formed by three conservative His residues, one in each finger. We describe here the results of calculations of the electrostatic potentials for the Zn-Cys2His2 complex, Zn-finger unit 1, and the whole transcription factor. The potential of Zif268 has a positive area on the factor surface, and it corresponds exactly to the binding sites of each of Zn-finger units. The main part of these areas is determined by conservative His residues, which form contacts with the DNA phosphate groups. Our result shows that the electrostatic positive potential of this histidine residue is enhanced due to the Zn complex. The other contacts of the Zn-finger with DNA are related to nucleotide bases, and they are responsible for the sequence-specific binding with DNA. This result may be extended to all other members of the Cys2His2 transcription factor family.
Inhibition of host cell RNA polymerase III-mediated transcription by poliovirus: Inactivation of specific transcription factors

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fradkin, L.G.; Yoshinaga, S.K.; Berk, A.J.

1987-11-01

The inhibition of transcription by RNA polymerase III in poliovirus-infected cells was studied. Experiments utilizing two different cell lines showed that the initiation step of transcription by RNA polymerase III was impaired by infection of these cells with the virus. The observed inhibition of transcription was not due to shut-off of host cell protein synthesis by poliovirus. Among four distinct components required for accurate transcription in vitro from cloned DNA templates, activities of RNA polymerase III and transcription factor TFIIIA were not significantly affected by virus infection. The activity of transcription factor TFIIIC, the limiting component required for transcription ofmore » RNA polymerase III genes, was severely inhibited in infected cells, whereas that of transcription factor TFIIIB was inhibited to a lesser extent. The sequence-specific DNA-binding of TFIIIC to the adenovirus VA1 gene internal promoted, however, was not altered by infection of cells with the virus. The authors conclude that (i) at least two transcription factors, TFIIIB and TFIIIC, are inhibited by infection of cells with poliovirtus, (ii) inactivation of TFIIIC does not involve destruction of its DNA-binding domain, and (iii) sequence-specific DNA binding by TFIIIC may be necessary but is not sufficient for the formation of productive transcription complexes.« less
Experimental identification of specificity determinants in the domain linker of a LacI/GalR protein: bioinformatics-based predictions generate true positives and false negatives.

PubMed

Meinhardt, Sarah; Swint-Kruse, Liskin

2008-12-01

In protein families, conserved residues often contribute to a common general function, such as DNA-binding. However, unique attributes for each homolog (e.g. recognition of alternative DNA sequences) must arise from variation in other functionally-important positions. The locations of these "specificity determinant" positions are obscured amongst the background of varied residues that do not make significant contributions to either structure or function. To isolate specificity determinants, a number of bioinformatics algorithms have been developed. When applied to the LacI/GalR family of transcription regulators, several specificity determinants are predicted in the 18 amino acids that link the DNA-binding and regulatory domains. However, results from alternative algorithms are only in partial agreement with each other. Here, we experimentally evaluate these predictions using an engineered repressor comprising the LacI DNA-binding domain, the LacI linker, and the GalR regulatory domain (LLhG). "Wild-type" LLhG has altered DNA specificity and weaker lacO(1) repression compared to LacI or a similar LacI:PurR chimera. Next, predictions of linker specificity determinants were tested, using amino acid substitution and in vivo repression assays to assess functional change. In LLhG, all predicted sites are specificity determinants, as well as three sites not predicted by any algorithm. Strategies are suggested for diminishing the number of false negative predictions. Finally, individual substitutions at LLhG specificity determinants exhibited a broad range of functional changes that are not predicted by bioinformatics algorithms. Results suggest that some variants have altered affinity for DNA, some have altered allosteric response, and some appear to have changed specificity for alternative DNA ligands.
Two synthetic Sp1-binding sites functionally substitute for the 21-base-pair repeat region to activate simian virus 40 growth in CV-1 cells.

PubMed Central

Lednicky, J; Folk, W R

1992-01-01

The 21-bp repeat region of simian virus 40 (SV40) activates viral transcription and DNA replication and contains binding sites for many cellular proteins, including Sp1, LSF, ETF, Ap2, Ap4, GT-1B, H16, and p53, and for the SV40 large tumor antigen. We have attempted to reduce the complexity of this region while maintaining its growth-promoting capacity. Deletion of the 21-bp repeat region from the SV40 genome delays the expression of viral early proteins and DNA replication and reduces virus production in CV-1 cells. Replacement of the 21-bp repeat region with two copies of DNA sequence motifs bound with high affinities by Sp1 promotes SV40 growth in CV-1 cells to nearly wild-type levels, but substitution by motifs bound less avidly by Sp1 or bound by other activator proteins does not restore growth. This indicates that Sp1 or a protein with similar sequence specificity is primarily responsible for the function of the 21-bp repeat region. We speculate about how Sp1 activates both SV40 transcription and DNA replication. Images PMID:1328672
Cooperative heteroassembly of the adenoviral L4-22K and IVa2 proteins onto the viral packaging sequence DNA.

PubMed

Yang, Teng-Chieh; Maluf, Nasib Karl

2012-02-21

Human adenovirus (Ad) is an icosahedral, double-stranded DNA virus. Viral DNA packaging refers to the process whereby the viral genome becomes encapsulated by the viral particle. In Ad, activation of the DNA packaging reaction requires at least three viral components: the IVa2 and L4-22K proteins and a section of DNA within the viral genome, called the packaging sequence. Previous studies have shown that the IVa2 and L4-22K proteins specifically bind to conserved elements within the packaging sequence and that these interactions are absolutely required for the observation of DNA packaging. However, the equilibrium mechanism for assembly of IVa2 and L4-22K onto the packaging sequence has not been determined. Here we characterize the assembly of the IVa2 and L4-22K proteins onto truncated packaging sequence DNA by analytical sedimentation velocity and equilibrium methods. At limiting concentrations of L4-22K, we observe a species with two IVa2 monomers and one L4-22K monomer bound to the DNA. In this species, the L4-22K monomer is promoting positive cooperative interactions between the two bound IVa2 monomers. As L4-22K levels are increased, we observe a species with one IVa2 monomer and three L4-22K monomers bound to the DNA. To explain this result, we propose a model in which L4-22K self-assembly on the DNA competes with IVa2 for positive heterocooperative interactions, destabilizing binding of the second IVa2 monomer. Thus, we propose that L4-22K levels control the extent of cooperativity observed between adjacently bound IVa2 monomers. We have also determined the hydrodynamic properties of all observed stoichiometric species; we observe that species with three L4-22K monomers bound have more extended conformations than species with a single L4-22K bound. We suggest this might reflect a molecular switch that controls insertion of the viral DNA into the capsid.
Structural and Thermodynamic Signatures of DNA Recognition by Mycobacterium tuberculosis DnaA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tsodikov, Oleg V.; Biswas, Tapan

An essential protein, DnaA, binds to 9-bp DNA sites within the origin of replication oriC. These binding events are prerequisite to forming an enigmatic nucleoprotein scaffold that initiates replication. The number, sequences, positions, and orientations of these short DNA sites, or DnaA boxes, within the oriCs of different bacteria vary considerably. To investigate features of DnaA boxes that are important for binding Mycobacterium tuberculosis DnaA (MtDnaA), we have determined the crystal structures of the DNA binding domain (DBD) of MtDnaA bound to a cognate MtDnaA-box (at 2.0 {angstrom} resolution) and to a consensus Escherichia coli DnaA-box (at 2.3 {angstrom}). Thesemore » structures, complemented by calorimetric equilibrium binding studies of MtDnaA DBD in a series of DnaA-box variants, reveal the main determinants of DNA recognition and establish the [T/C][T/A][G/A]TCCACA sequence as a high-affinity MtDnaA-box. Bioinformatic and calorimetric analyses indicate that DnaA-box sequences in mycobacterial oriCs generally differ from the optimal binding sequence. This sequence variation occurs commonly at the first 2 bp, making an in vivo mycobacterial DnaA-box effectively a 7-mer and not a 9-mer. We demonstrate that the decrease in the affinity of these MtDnaA-box variants for MtDnaA DBD relative to that of the highest-affinity box TTGTCCACA is less than 10-fold. The understanding of DnaA-box recognition by MtDnaA and E. coli DnaA enables one to map DnaA-box sequences in the genomes of M. tuberculosis and other eubacteria.« less
cgDNAweb: a web interface to the cgDNA sequence-dependent coarse-grain model of double-stranded DNA.

PubMed

De Bruin, Lennart; Maddocks, John H

2018-06-14

The sequence-dependent statistical mechanical properties of fragments of double-stranded DNA is believed to be pertinent to its biological function at length scales from a few base pairs (or bp) to a few hundreds of bp, e.g. indirect read-out protein binding sites, nucleosome positioning sequences, phased A-tracts, etc. In turn, the equilibrium statistical mechanics behaviour of DNA depends upon its ground state configuration, or minimum free energy shape, as well as on its fluctuations as governed by its stiffness (in an appropriate sense). We here present cgDNAweb, which provides browser-based interactive visualization of the sequence-dependent ground states of double-stranded DNA molecules, as predicted by the underlying cgDNA coarse-grain rigid-base model of fragments with arbitrary sequence. The cgDNAweb interface is specifically designed to facilitate comparison between ground state shapes of different sequences. The server is freely available at cgDNAweb.epfl.ch with no login requirement.
Proteomics to study DNA-bound and chromatin-associated gene regulatory complexes

PubMed Central

Wierer, Michael; Mann, Matthias

2016-01-01

High-resolution mass spectrometry (MS)-based proteomics is a powerful method for the identification of soluble protein complexes and large-scale affinity purification screens can decode entire protein interaction networks. In contrast, protein complexes residing on chromatin have been much more challenging, because they are difficult to purify and often of very low abundance. However, this is changing due to recent methodological and technological advances in proteomics. Proteins interacting with chromatin marks can directly be identified by pulldowns with synthesized histone tails containing posttranslational modifications (PTMs). Similarly, pulldowns with DNA baits harbouring single nucleotide polymorphisms or DNA modifications reveal the impact of those DNA alterations on the recruitment of transcription factors. Accurate quantitation – either isotope-based or label free – unambiguously pinpoints proteins that are significantly enriched over control pulldowns. In addition, protocols that combine classical chromatin immunoprecipitation (ChIP) methods with mass spectrometry (ChIP-MS) target gene regulatory complexes in their in-vivo context. Similar to classical ChIP, cells are crosslinked with formaldehyde and chromatin sheared by sonication or nuclease digested. ChIP-MS baits can be proteins in tagged or endogenous form, histone PTMs, or lncRNAs. Locus-specific ChIP-MS methods would allow direct purification of a single genomic locus and the proteins associated with it. There, loci can be targeted either by artificial DNA-binding sites and corresponding binding proteins or via proteins with sequence specificity such as TAL or nuclease deficient Cas9 in combination with a specific guide RNA. We predict that advances in MS technology will soon make such approaches generally applicable tools in epigenetics. PMID:27402878
Sequence walkers: a graphical method to display how binding proteins interact with DNA or RNA sequences | Center for Cancer Research

Cancer.gov

A graphical method is presented for displaying how binding proteins and other macromolecules interact with individual bases of nucleotide sequences. Characters representing the sequence are either oriented normally and placed above a line indicating favorable contact, or upside-down and placed below the line indicating unfavorable contact. The positive or negative height of
Interaction of the replication terminator protein of Bacillus subtilis with DNA probed by NMR spectroscopy

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hastings, Adam F.; Otting, Gottfried; Folmer, Rutger H.A.

2005-09-23

Termination of DNA replication in Bacillus subtilis involves the polar arrest of replication forks by a specific complex formed between the dimeric 29 kDa replication terminator protein (RTP) and DNA terminator sites. We have used NMR spectroscopy to probe the changes in {sup 1}H-{sup 15}N correlation spectra of a {sup 15}N-labelled RTP.C110S mutant upon the addition of a 21 base pair symmetrical DNA binding site. Assignment of the {sup 1}H-{sup 15}N correlations was achieved using a suite of triple resonance NMR experiments with {sup 15}N,{sup 13}C,70% {sup 2}H enriched protein recorded at 800 MHz and using TROSY pulse sequences. Perturbationsmore » to {sup 1}H-{sup 15}N spectra revealed that the N-termini, {alpha}3-helices and several loops are affected by the binding interaction. An analysis of this data in light of the crystallographically determined apo- and DNA-bound forms of RTP.C110S revealed that the NMR spectral perturbations correlate more closely to protein structural changes upon complex formation rather than to interactions at the protein-DNA interface.« less
Effect of Base Sequence "Defects" on the Electrostatic Potential of Dissolved DNA

NASA Astrophysics Data System (ADS)

Adams, Scott V.; Wagner, Katrina; Kephart, Thomas S.; Edwards, Glenn

1997-11-01

An analytical model of the electrostatic potential surrounding dissolved DNA has been developed. The model consists of an all-atom, mathematically helical structure for DNA, in which the atoms are arranged in infinite lines of discrete point charges on concentric cylindrical surfaces. The surrounding solvent and counterions are treated with the Debye-Huckel approximation (Wagner et al., Biophysical Journal 73, 21-30, 1997). Variation in the electrostatic potential due to structural differences between A, B, and Z conformations and homopolymer base sequence is apparent. The most recent modification to the model exploits the principle of superposition to calculate the potential of DNA with a base sequence containing `defects.' That is, the base sequence is no longer uniform along the polymer. Differences between the potential of homopolymer DNA and the potential of DNA containing base `defects' are immediately obvious. These results may aid in understanding the role of electrostatics in base-sequence specificity exhibited by DNA-binding proteins.

Molecular Cloning of Drebrin: Progress and Perspectives.

PubMed

Kojima, Nobuhiko

2017-01-01

Chicken drebrin isoforms were first identified in the optic tectum of developing brain. Although the time course of protein expression was different in each drebrin isoform, the similarity between their protein structures was suggested by biochemical analysis of purified protein. To determine their protein structures, the cloning of drebrin cDNAs was conducted. Comparison between the cDNA sequences shows that all drebrin cDNAs are identical except that the internal insertion sequences are present or absent in their sequences. Chicken drebrin are now classified into three isoforms, namely, drebrins E1, E2, and A. Genomic cloning demonstrated that the three isoforms are generated by an alternative splicing of individual exons encoding the insertion sequences from single drebrin gene. The mechanism should be precisely regulated in cell-type-specific and developmental stage-specific fashion. Drebrin protein, which is well conserved in various vertebrate species, although mammalian drebrin has only two isoforms, namely, drebrin E and drebrin A, is different from chicken drebrin that has three isoforms. Drebrin belongs to an actin-depolymerizing factor homology (ADF-H) domain protein family. Besides the ADF-H domain, drebrin has other domains, including the actin-binding domain and Homer-binding motifs. Diversity of protein isoform and multiple domains of drebrin could interact differentially with the actin cytoskeleton and other intracellular proteins and regulate diverse cellular processes.
Expression, purification and biochemical characterization of a single-stranded DNA binding protein from Herbaspirillum seropedicae.

PubMed

Vernal, Javier; Serpa, Viviane I; Tavares, Carolina; Souza, Emanuel M; Pedrosa, Fábio O; Terenzi, Hernán

2007-05-01

An open reading frame encoding a protein similar in size and sequence to the Escherichia coli single-stranded DNA binding protein (SSB protein) was identified in the Herbaspirillum seropedicae genome. This open reading frame was cloned into the expression plasmid pET14b. The SSB protein from H. seropedicae, named Hs_SSB, was overexpressed in E. coli strain BL21(DE3) and purified to homogeneity. Mass spectrometry data confirmed the identity of this protein. The apparent molecular mass of the native Hs_SSB was estimated by gel filtration, suggesting that the native protein is a tetramer made up of four similar subunits. The purified protein binds to single-stranded DNA (ssDNA) in a similar manner to other SSB proteins. The production of this recombinant protein in good yield opens up the possibility of obtaining its 3D-structure and will help further investigations into DNA metabolism.
The identification of FANCD2 DNA binding domains reveals nuclear localization sequences.

PubMed

Niraj, Joshi; Caron, Marie-Christine; Drapeau, Karine; Bérubé, Stéphanie; Guitton-Sert, Laure; Coulombe, Yan; Couturier, Anthony M; Masson, Jean-Yves

2017-08-21

Fanconi anemia (FA) is a recessive genetic disorder characterized by congenital abnormalities, progressive bone-marrow failure, and cancer susceptibility. The FA pathway consists of at least 21 FANC genes (FANCA-FANCV), and the encoded protein products interact in a common cellular pathway to gain resistance against DNA interstrand crosslinks. After DNA damage, FANCD2 is monoubiquitinated and accumulates on chromatin. FANCD2 plays a central role in the FA pathway, using yet unidentified DNA binding regions. By using synthetic peptide mapping and DNA binding screen by electromobility shift assays, we found that FANCD2 bears two major DNA binding domains predominantly consisting of evolutionary conserved lysine residues. Furthermore, one domain at the N-terminus of FANCD2 bears also nuclear localization sequences for the protein. Mutations in the bifunctional DNA binding/NLS domain lead to a reduction in FANCD2 monoubiquitination and increase in mitomycin C sensitivity. Such phenotypes are not fully rescued by fusion with an heterologous NLS, which enable separation of DNA binding and nuclear import functions within this domain that are necessary for FANCD2 functions. Collectively, our results enlighten the importance of DNA binding and NLS residues in FANCD2 to activate an efficient FA pathway. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Architecture of the 99 bp DNA-six-protein regulatory complex of the lambda att site.

PubMed

Sun, Xingmin; Mierke, Dale F; Biswas, Tapan; Lee, Sang Yeol; Landy, Arthur; Radman-Livaja, Marta

2006-11-17

The highly directional and tightly regulated recombination reaction used to site-specifically excise the bacteriophage lambda chromosome out of its E. coli host chromosome requires the binding of six sequence-specific proteins to a 99 bp segment of the phage att site. To gain structural insights into this recombination pathway, we measured 27 FRET distances between eight points on the 99 bp regulatory DNA bound with all six proteins. Triangulation of these distances using a metric matrix distance-geometry algorithm provided coordinates for these eight points. The resulting path for the protein-bound regulatory DNA, which fits well with the genetics, biochemistry, and X-ray crystal structures describing the individual proteins and their interactions with DNA, provides a new structural perspective into the molecular mechanism and regulation of the recombination reaction and illustrates a design by which different families of higher-order complexes can be assembled from different numbers and combinations of the same few proteins.
Regulation of expression of the ada gene controlling the adaptive response. Interactions with the ada promoter of the Ada protein and RNA polymerase.

PubMed

Sakumi, K; Sekiguchi, M

1989-01-20

The Ada protein of Escherichia coli catalyzes transfer of methyl groups from methylated DNA to its own molecule, and the methylated form of Ada protein promotes transcription of its own gene, ada. Using an in vitro reconstituted system, we found that both the sigma factor and the methylated Ada protein are required for transcription of the ada gene. To elucidate molecular mechanisms involved in the regulation of the ada transcription, we investigated interactions of the non-methylated and methylated forms of Ada protein and the RNA polymerase holo enzyme (the core enzyme and sigma factor) with a DNA fragment carrying the ada promoter region. Footprinting analyses revealed that the methylated Ada protein binds to a region from positions -63 to -31, which includes the ada regulatory sequence AAAGCGCA. No firm binding was observed with the non-methylated Ada protein, although some DNase I-hypersensitive sites were produced in the promoter by both types of Ada protein. RNA polymerase did bind to the promoter once the methylated Ada protein had bound to the upstream sequence. To correlate these phenomena with the process in vivo, we used the DNAs derived from promoter-defective mutants. No binding of Ada protein nor of RNA polymerase occurred with a mutant DNA having a C to G substitution at position -47 within the ada regulatory sequence. In the case of a -35 box mutant with a T to A change at position -34, the methylated Ada protein did bind to the ada regulatory sequence, yet there was no RNA polymerase binding. Thus, the binding of the methylated Ada protein to the upstream region apparently facilitates binding of the RNA polymerase to the proper region of the promoter. The Ada protein possesses two known methyl acceptor sites, Cys69 and Cys321. The role of methylation of each cysteine residue was investigated using mutant forms of the Ada protein. The Ada protein with the cysteine residue at position 69 replaced by alanine was incapable of binding to the ada promoter even when the cysteine residue at position 321 of the protein was methylated. When the Ada protein with alanine at position 321 was methylated, it acquired the potential to bind to the ada promoter. These results are compatible with the notion that methylation of the cysteine residue at position 69 causes a conformational change of the Ada protein, thereby facilitating binding of the protein to the upstream regulatory sequence.
Screening the sequence selectivity of DNA-binding molecules using a gold nanoparticle-based colorimetric approach.

PubMed

Hurst, Sarah J; Han, Min Su; Lytton-Jean, Abigail K R; Mirkin, Chad A

2007-09-15

We have developed a novel competition assay that uses a gold nanoparticle (Au NP)-based, high-throughput colorimetric approach to screen the sequence selectivity of DNA-binding molecules. This assay hinges on the observation that the melting behavior of DNA-functionalized Au NP aggregates is sensitive to the concentration of the DNA-binding molecule in solution. When short, oligomeric hairpin DNA sequences were added to a reaction solution consisting of DNA-functionalized Au NP aggregates and DNA-binding molecules, these molecules may either bind to the Au NP aggregate interconnects or the hairpin stems based on their relative affinity for each. This relative affinity can be measured as a change in the melting temperature (Tm) of the DNA-modified Au NP aggregates in solution. As a proof of concept, we evaluated the selectivity of 4',6-diamidino-2-phenylindone (an AT-specific binder), ethidium bromide (a nonspecific binder), and chromomycin A (a GC-specific binder) for six sequences of hairpin DNA having different numbers of AT pairs in a five-base pair variable stem region. Our assay accurately and easily confirmed the known trends in selectivity for the DNA binders in question without the use of complicated instrumentation. This novel assay will be useful in assessing large libraries of potential drug candidates that work by binding DNA to form a drug/DNA complex.
Specific interaction of the nonstructural protein NS1 of minute virus of mice (MVM) with [ACCA](2) motifs in the centre of the right-end MVM DNA palindrome induces hairpin-primed viral DNA replication.

PubMed

Willwand, Kurt; Moroianu, Adela; Hörlein, Rita; Stremmel, Wolfgang; Rommelaere, Jean

2002-07-01

The linear single-stranded DNA genome of minute virus of mice (MVM) is replicated via a double-stranded replicative form (RF) intermediate DNA. Amplification of viral RF DNA requires the structural transition of the right-end palindrome from a linear duplex into a double-hairpin structure, which serves for the repriming of unidirectional DNA synthesis. This conformational transition was found previously to be induced by the MVM nonstructural protein NS1. Elimination of the cognate NS1-binding sites, [ACCA](2), from the central region of the right-end palindrome next to the axis of symmetry was shown to markedly reduce the efficiency of hairpin-primed DNA replication, as measured in a reconstituted in vitro replication system. Thus, [ACCA](2) sequence motifs are essential as NS1-binding elements in the context of the structural transition of the right-end MVM palindrome.
Structure and Function of Lipopolysaccharide Binding Protein

NASA Astrophysics Data System (ADS)

Schumann, Ralf R.; Leong, Steven R.; Flaggs, Gail W.; Gray, Patrick W.; Wright, Samuel D.; Mathison, John C.; Tobias, Peter S.; Ulevitch, Richard J.

1990-09-01

The primary structure of lipopolysaccharide binding protein (LBP), a trace plasma protein that binds to the lipid A moiety of bacterial lipopolysaccharides (LPSs), was deduced by sequencing cloned complementary DNA. LBP shares sequence identity with another LPS binding protein found in granulocytes, bactericidal/permeability-increasing protein, and with cholesterol ester transport protein of the plasma. LBP may control the response to LPS under physiologic conditions by forming high-affinity complexes with LPS that bind to monocytes and macrophages, which then secrete tumor necrosis factor. The identification of this pathway for LPS-induced monocyte stimulation may aid in the development of treatments for diseases in which Gram-negative sepsis or endotoxemia are involved.
Alteration of gene expression in human hepatocellular carcinoma with integrated hepatitis B virus DNA.

PubMed

Tamori, Akihiro; Yamanishi, Yoshihiro; Kawashima, Shuichi; Kanehisa, Minoru; Enomoto, Masaru; Tanaka, Hiromu; Kubo, Shoji; Shiomi, Susumu; Nishiguchi, Shuhei

2005-08-15

Integration of hepatitis B virus (HBV) DNA into the human genome is one of the most important steps in HBV-related carcinogenesis. This study attempted to find the link between HBV DNA, the adjoining cellular sequence, and altered gene expression in hepatocellular carcinoma (HCC) with integrated HBV DNA. We examined 15 cases of HCC infected with HBV by cassette ligation-mediated PCR. The human DNA adjacent to the integrated HBV DNA was sequenced. Protein coding sequences were searched for in the human sequence. In five cases with HBV DNA integration, from which good quality RNA was extracted, gene expression was examined by cDNA microarray analysis. The human DNA sequence successive to integrated HBV DNA was determined in the 15 HCCs. Eight protein-coding regions were involved: ras-responsive element binding protein 1, calmodulin 1, mixed lineage leukemia 2 (MLL2), FLJ333655, LOC220272, LOC255345, LOC220220, and LOC168991. The MLL2 gene was expressed in three cases with HBV DNA integrated into exon 3 of MLL2 and in one case with HBV DNA integrated into intron 3 of MLL2. Gene expression analysis suggested that two HCCs with HBV integrated into MLL2 had similar patterns of gene expression compared with three HCCs with HBV integrated into other loci of human chromosomes. HBV DNA was integrated at random sites of human DNA, and the MLL2 gene was one of the targets for integration. Our results suggest that HBV DNA might modulate human genes near integration sites, followed by integration site-specific expression of such genes during hepatocarcinogenesis.
Toward a General Approach for RNA-Templated Hierarchical Assembly of Split-Proteins

PubMed Central

Furman, Jennifer L.; Badran, Ahmed H.; Ajulo, Oluyomi; Porter, Jason R.; Stains, Cliff I.; Segal, David J.; Ghosh, Indraneel

2010-01-01

The ability to conditionally turn on a signal or induce a function in the presence of a user-defined RNA target has potential applications in medicine and synthetic biology. Although sequence-specific pumilio repeat proteins can target a limited set of ssRNA sequences, there are no general methods for targeting ssRNA with designed proteins. As a first step toward RNA recognition, we utilized the RNA binding domain of argonaute, implicated in RNA interference, for specifically targeting generic 2-nucleotide, 3' overhangs of any dsRNA. We tested the reassembly of a split-luciferase enzyme guided by argonaute-mediated recognition of newly generated nucleotide overhangs when ssRNA is targeted by a designed complementary guide sequence. This approach was successful when argonaute was utilized in conjunction with a pumilio repeat and expanded the scope of potential ssRNA targets. However, targeting any desired ssRNA remained elusive as two argonaute domains provided minimal reassembled split-luciferase. We next designed and tested a second hierarchical assembly, wherein ssDNA guides are appended to DNA hairpins that serve as a scaffold for high affinity zinc fingers attached to split-luciferase. In the presence of a ssRNA target containing adjacent sequences complementary to the guides, the hairpins are brought into proximity, allowing for zinc finger binding and concomitant reassembly of the fragmented luciferase. The scope of this new approach was validated by specifically targeting RNA encoding VEGF, hDM2, and HER2. These approaches provide potentially general design paradigms for the conditional reassembly of fragmented proteins in the presence of any desired ssRNA target. PMID:20681585
TALE: a tale of genome editing.

PubMed

Zhang, Mingjie; Wang, Feng; Li, Shifei; Wang, Yan; Bai, Yun; Xu, Xueqing

2014-01-01

Transcription activator-like effectors (TALEs), first identified in Xanthomonas bacteria, are naturally occurring or artificially designed proteins that modulate gene transcription. These proteins recognize and bind DNA sequences based on a variable numbers of tandem repeats. Each repeat is comprised of a set of ∼ 34 conserved amino acids; within this conserved domain, there are usually two amino acids that distinguish one TALE from another. Interestingly, TALEs have revealed a simple cipher for the one-to-one recognition of proteins for DNA bases. Synthetic TALEs have been used to successfully target genes in a variety of species, including humans. Depending on the type of functional domain that is fused to the TALE of interest, these proteins can have diverse biological effects. For example, after binding DNA, TALEs fused to transcriptional activation domains can function as robust transcription factors (TALE-TFs), while fused to restriction endonucleases (TALENs) can cut DNA. Targeted genome editing, in theory, is capable of modifying any endogenous gene sequence of interest; this can be performed in cells or organisms, and may be applied to clinical gene-based therapies in the future. With current technologies, highly accurate, specific, and reliable gene editing cannot be achieved. Thus, recognition and binding mechanisms governing TALE biology are currently hot research areas. In this review, we summarize the major advances in TALE technology over the past several years with a focus on the interaction between TALEs and DNA, TALE design and construction, potential applications for this technology, and unique characteristics that make TALEs superior to zinc finger endonucleases. Copyright © 2013 Elsevier Ltd. All rights reserved.
Proteopedia: 3D Visualization and Annotation of Transcription Factor-DNA Readout Modes

ERIC Educational Resources Information Center

Dantas Machado, Ana Carolina; Saleebyan, Skyler B.; Holmes, Bailey T.; Karelina, Maria; Tam, Julia; Kim, Sharon Y.; Kim, Keziah H.; Dror, Iris; Hodis, Eran; Martz, Eric; Compeau, Patricia A.; Rohs, Remo

2012-01-01

3D visualization assists in identifying diverse mechanisms of protein-DNA recognition that can be observed for transcription factors and other DNA binding proteins. We used Proteopedia to illustrate transcription factor-DNA readout modes with a focus on DNA shape, which can be a function of either nucleotide sequence (Hox proteins) or base pairing…
Method for promoting specific alignment of short oligonucleotides on nucleic acids

DOEpatents

Studier, F. William; Kieleczawa, Jan; Dunn, John J.

1996-01-01

Disclosed is a method for promoting specific alignment of short oligonucleotides on a nucleic acid polymer. The nucleic acid polymer is incubated in a solution containing a single-stranded DNA-binding protein and a plurality of oligonucleotides which are perfectly complementary to distinct but adjacent regions of a predetermined contiguous nucleotide sequence in the nucleic acid polymer. The plurality of oligonucleotides anneal to the nucleic acid polymer to form a contiguous region of double stranded nucleic acid. Specific application of the methods disclosed include priming DNA synthesis and template-directed ligation.
Proteomic analysis of the nuclear matrix in the early stages of rat liver carcinogenesis: Identification of differentially expressed and MAR-binding proteins

DOE Office of Scientific and Technical Information (OSTI.GOV)

Barboro, Paola; D'Arrigo, Cristina; Repaci, Erica

Tumor progression is characterized by definite changes in the protein composition of the nuclear matrix (NM). The interactions of chromatin with the NM occur via specific DNA sequences called MARs (matrix attachment regions). In the present study, we applied a proteomic approach along with a Southwestern assay to detect both differentially expressed and MAR-binding NM proteins, in persistent hepatocyte nodules (PHN) in respect with normal hepatocytes (NH). In PHN, the NM undergoes changes both in morphology and in protein composition. We detected over 500 protein spots in each two dimensional map and 44 spots were identified. Twenty-three proteins were differentiallymore » expressed; among these, 15 spots were under-expressed and 8 spots were over-expressed in PHN compared to NH. These changes were synchronous with several modifications in both NM morphology and the ability of NM proteins to bind nuclear RNA and/or DNA containing MARs sequences. In PHN, we observed a general decrease in the expression of the basic proteins that bound nuclear RNA and the over-expression of two species of Mw 135 kDa and 81 kDa and pI 6.7-7.0 and 6.2-7.4, respectively, which exclusively bind to MARs. These results suggest that the deregulated expression of these species might be related to large-scale chromatin reorganization observed in the process of carcinogenesis by modulating the interaction between MARs and the scaffold structure.« less
Selecting Fully-Modified XNA Aptamers Using Synthetic Genetics.

PubMed

Taylor, Alexander I; Holliger, Philipp

2018-06-01

This unit describes the application of "synthetic genetics," i.e., the replication of xeno nucleic acids (XNAs), artificial analogs of DNA and RNA bearing alternative backbone or sugar congeners, to the directed evolution of synthetic oligonucleotide ligands (XNA aptamers) specific for target proteins or nucleic acid motifs, using a cross-chemistry selective exponential enrichment (X-SELEX) approach. Protocols are described for synthesis of diverse-sequence XNA repertoires (typically 10 14 molecules) using DNA templates, isolation and panning for functional XNA sequences using targets immobilized on solid phase or gel shift induced by target binding in solution, and XNA reverse transcription to allow cDNA amplification or sequencing. The method may be generally applied to select fully-modified XNA aptamers specific for a wide range of target molecules. © 2018 by John Wiley & Sons, Inc. Copyright © 2018 John Wiley & Sons, Inc.
The LINE-1 DNA sequences in four mammalian orders predict proteins that conserve homologies to retrovirus proteins.

PubMed Central

Fanning, T; Singer, M

1987-01-01

Recent work suggests that one or more members of the highly repeated LINE-1 (L1) DNA family found in all mammals may encode one or more proteins. Here we report the sequence of a portion of an L1 cloned from the domestic cat (Felis catus). These data permit comparison of the L1 sequences in four mammalian orders (Carnivore, Lagomorph, Rodent and Primate) and the comparison supports the suggested coding potential. In two separate, noncontiguous regions in the carboxy terminal half of the proteins predicted from the DNA sequences, there are several strongly conserved segments. In one region, these share homology with known or suspected reverse transcriptases, as described by others in rodents and primates. In the second region, closer to the carboxy terminus, the strongly conserved segments are over 90% homologous among the four orders. One of the latter segments is cysteine rich and resembles the putative metal binding domains of nucleic acid binding proteins, including those of TFIIIA and retroviruses. PMID:3562227
Regulation of DNA Replication Timing on Human Chromosome by a Cell-Type Specific DNA Binding Protein SATB1

PubMed Central

Oda, Masako; Kanoh, Yutaka; Watanabe, Yoshihisa; Masai, Hisao

2012-01-01

Background Replication timing of metazoan DNA during S-phase may be determined by many factors including chromosome structures, nuclear positioning, patterns of histone modifications, and transcriptional activity. It may be determined by Mb-domain structures, termed as “replication domains”, and recent findings indicate that replication timing is under developmental and cell type-specific regulation. Methodology/Principal Findings We examined replication timing on the human 5q23/31 3.5-Mb segment in T cells and non-T cells. We used two independent methods to determine replication timing. One is quantification of nascent replicating DNA in cell cycle-fractionated stage-specific S phase populations. The other is FISH analyses of replication foci. Although the locations of early- and late-replicating domains were common between the two cell lines, the timing transition region (TTR) between early and late domains were offset by 200-kb. We show that Special AT-rich sequence Binding protein 1 (SATB1), specifically expressed in T-cells, binds to the early domain immediately adjacent to TTR and delays the replication timing of the TTR. Measurement of the chromosome copy number along the TTR during synchronized S phase suggests that the fork movement may be slowed down by SATB1. Conclusions Our results reveal a novel role of SATB1 in cell type-specific regulation of replication timing along the chromosome. PMID:22879953
Regulation of DNA replication timing on human chromosome by a cell-type specific DNA binding protein SATB1.

PubMed

Oda, Masako; Kanoh, Yutaka; Watanabe, Yoshihisa; Masai, Hisao

2012-01-01

Replication timing of metazoan DNA during S-phase may be determined by many factors including chromosome structures, nuclear positioning, patterns of histone modifications, and transcriptional activity. It may be determined by Mb-domain structures, termed as "replication domains", and recent findings indicate that replication timing is under developmental and cell type-specific regulation. We examined replication timing on the human 5q23/31 3.5-Mb segment in T cells and non-T cells. We used two independent methods to determine replication timing. One is quantification of nascent replicating DNA in cell cycle-fractionated stage-specific S phase populations. The other is FISH analyses of replication foci. Although the locations of early- and late-replicating domains were common between the two cell lines, the timing transition region (TTR) between early and late domains were offset by 200-kb. We show that Special AT-rich sequence Binding protein 1 (SATB1), specifically expressed in T-cells, binds to the early domain immediately adjacent to TTR and delays the replication timing of the TTR. Measurement of the chromosome copy number along the TTR during synchronized S phase suggests that the fork movement may be slowed down by SATB1. Our results reveal a novel role of SATB1 in cell type-specific regulation of replication timing along the chromosome.
Bombyx mori Nucleopolyhedrovirus Encodes a DNA-Binding Protein Capable of Destabilizing Duplex DNA

PubMed Central

Mikhailov, Victor S.; Mikhailova, Alla L.; Iwanaga, Masashi; Gomi, Sumiko; Maeda, Susumu

1998-01-01

A DNA-binding protein (designated DBP) with an apparent molecular mass of 38 kDa was purified to homogeneity from BmN cells (derived from Bombyx mori) infected with the B. mori nucleopolyhedrovirus (BmNPV). Six peptides obtained after digestion of the isolated protein with Achromobacter protease I were partially or completely sequenced. The determined amino acid sequences indicated that DBP was encoded by an open reading frame (ORF16) located at nucleotides (nt) 16189 to 17139 in the BmNPV genome (GenBank accession no. L33180). This ORF (designated dbp) is a homolog of Autographa californica multicapsid NPV ORF25, whose product has not been identified. BmNPV DBP is predicted to contain 317 amino acids (calculated molecular mass of 36.7 kDa) and to have an isoelectric point of 7.8. DBP showed a tendency to multimerization in the course of purification and was found to bind preferentially to single-stranded DNA. When bound to oligonucleotides, DBP protected them from hydrolysis by phage T4 DNA polymerase-associated 3′→5′ exonuclease. The sizes of the protected fragments indicated that a binding site size for DBP is about 30 nt per protein monomer. DBP, but not BmNPV LEF-3, was capable of unwinding partial DNA duplexes in an in vitro system. This helix-destabilizing ability is consistent with the prediction that DBP functions as a single-stranded DNA binding protein in virus replication. PMID:9525636
Information analysis of sequences that bind the replication initiator RepA | Center for Cancer Research

Cancer.gov

The tall letters represent the highly conserved bases in DNA binding sites of several prokaryotic repressors and activators. Conservation is strongest where major grooves of the double helical DNA (represented by crests of a cosine wave) face the protein. This shows that conservation analysis alone can be used to predict the face of DNA that contacts the proteins.

An improved SELEX technique for selection of DNA aptamers binding to M-type 11 of Streptococcus pyogenes.

PubMed

Hamula, Camille L A; Peng, Hanyong; Wang, Zhixin; Tyrrell, Gregory J; Li, Xing-Fang; Le, X Chris

2016-03-15

Streptococcus pyogenes is a clinically important pathogen consisting of various serotypes determined by different M proteins expressed on the cell surface. The M type is therefore a useful marker to monitor the spread of invasive S. pyogenes in a population. Serotyping and nucleic acid amplification/sequencing methods for the identification of M types are laborious, inconsistent, and usually confined to reference laboratories. The primary objective of this work is to develop a technique that enables generation of aptamers binding to specific M-types of S. pyogenes. We describe here an in vitro technique that directly used live bacterial cells and the Systematic Evolution of Ligands by Exponential Enrichment (SELEX) strategy. Live S. pyogenes cells were incubated with DNA libraries consisting of 40-nucleotides randomized sequences. Those sequences that bound to the cells were separated, amplified using polymerase chain reaction (PCR), purified using gel electrophoresis, and served as the input DNA pool for the next round of SELEX selection. A specially designed forward primer containing extended polyA20/5Sp9 facilitated gel electrophoresis purification of ssDNA after PCR amplification. A counter-selection step using non-target cells was introduced to improve selectivity. DNA libraries of different starting sequence diversity (10(16) and 10(14)) were compared. Aptamer pools from each round of selection were tested for their binding to the target and non-target cells using flow cytometry. Selected aptamer pools were then cloned and sequenced. Individual aptamer sequences were screened on the basis of their binding to the 10 M-types that were used as targets. Aptamer pools obtained from SELEX rounds 5-8 showed high affinity to the target S. pyogenes cells. Tests against non-target Streptococcus bovis, Streptococcus pneumoniae, and Enterococcus species demonstrated selectivity of these aptamers for binding to S. pyogenes. Several aptamer sequences were found to bind preferentially to the M11 M-type of S. pyogenes. Estimated binding dissociation constants (Kd) were in the low nanomolar range for the M11 specific sequences; for example, sequence E-CA20 had a Kd of 7±1 nM. These affinities are comparable to those of a monoclonal antibody. The improved bacterial cell-SELEX technique is successful in generating aptamers selective for S. pyogenes and some of its M-types. These aptamers are potentially useful for detecting S. pyogenes, achieving binding profiles of the various M-types, and developing new M-typing technologies for non-specialized laboratories or point-of-care testing. Copyright © 2015 Elsevier Inc. All rights reserved.
DNA sequence analysis of a 10 624 bp fragment of the left arm of chromosome XV from Saccharomyces cerevisiae reveals a RNA binding protein, a mitochondrial protein, two ribosomal proteins and two new open reading frames.

PubMed

Lafuente, M J; Gamo, F J; Gancedo, C

1996-09-01

We have determined the sequence of a 10624 bp DNA segment located in the left arm of chromosome XV of Saccharomyces cerevisiae. The sequence contains eight open reading frames (ORFs) longer than 100 amino acids. Two of them do not present significant homology with sequences found in the databases. The product of ORF o0553 is identical to the protein encoded by the gene SMF1. Internal to it there is another ORF, o0555 that is apparently expressed. The proteins encoded by ORFs o0559 and o0565 are identical to ribosomal proteins S19.e and L18 respectively. ORF o0550 encodes a protein with an RNA binding signature including RNP motifs and stretches rich in asparagine, glutamine and arginine.
Detection of Z DNA binding proteins in tissue culture cells.

PubMed Central

Leith, I R; Hay, R T; Russell, W C

1988-01-01

A gel electrophoresis DNA binding assay to detect Z DNA binding proteins has been developed utilising [32P] labelled poly [d(G-C)] which was converted to the Z form by incubation in 100 microM Co(NH3)6Cl3. The parameters of the assay were established using a Z DNA antibody as a model system and then applied to extracts of Hela and BHK21 cells. Using an anti-Z DNA antibody conditions were established which allowed resolution of antibody-DNA complexes and free DNA in the presence of 100 microM Co(NH3)6Cl3. The inclusion of unlabelled complementary homopolymers eliminated non-specific binding to the labelled Z-DNA probe. Competition experiments demonstrated that the assay was highly specific for double stranded non-B DNA. Application of the technique to extracts of mammalian cells demonstrated that human and hamster cells contain Z-DNA binding proteins; further characterisation by a blotting technique indicated that a 56,000 molecular weight cell protein preferentially binds Z-DNA. Images PMID:3419919
Diversification of transcription factor-DNA interactions and the evolution of gene regulatory networks.

PubMed

Rogers, Julia M; Bulyk, Martha L

2018-04-25

Sequence-specific transcription factors (TFs) bind short DNA sequences in the genome to regulate the expression of target genes. In the last decade, numerous technical advances have enabled the determination of the DNA-binding specificities of many of these factors. Large-scale screens of many TFs enabled the creation of databases of TF DNA-binding specificities, typically represented as position weight matrices (PWMs). Although great progress has been made in determining and predicting binding specificities systematically, there are still many surprises to be found when studying a particular TF's interactions with DNA in detail. Paralogous TFs' binding specificities can differ in subtle ways, in a manner that is not immediately apparent from looking at their PWMs. These differences affect gene regulatory outputs and enable TFs to rewire transcriptional networks over evolutionary time. This review discusses recent observations made in the study of TF-DNA interactions that highlight the importance of continued in-depth analysis of TF-DNA interactions and their inherent complexity. This article is categorized under: Biological Mechanisms > Regulatory Biology. © 2018 Wiley Periodicals, Inc.
The Replication Focus Targeting Sequence (RFTS) Domain Is a DNA-competitive Inhibitor of Dnmt1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Syeda, Farisa; Fagan, Rebecca L.; Wean, Matthew

Dnmt1 (DNA methyltransferase 1) is the principal enzyme responsible for maintenance of cytosine methylation at CpG dinucleotides in the mammalian genome. The N-terminal replication focus targeting sequence (RFTS) domain of Dnmt1 has been implicated in subcellular localization, protein association, and catalytic function. However, progress in understanding its function has been limited by the lack of assays for and a structure of this domain. Here, we show that the naked DNA- and polynucleosome-binding activities of Dnmt1 are inhibited by the RFTS domain, which functions by virtue of binding the catalytic domain to the exclusion of DNA. Kinetic analysis with a fluorogenicmore » DNA substrate established the RFTS domain as a 600-fold inhibitor of Dnmt1 enzymatic activity. The crystal structure of the RFTS domain reveals a novel fold and supports a mechanism in which an RFTS-targeted Dnmt1-binding protein, such as Uhrf1, may activate Dnmt1 for DNA binding.« less
Detection of First-Line Drug Resistance Mutations and Drug-Protein Interaction Dynamics from Tuberculosis Patients in South India.

PubMed

Nachappa, Somanna Ajjamada; Neelambike, Sumana M; Amruthavalli, Chokkanna; Ramachandra, Nallur B

2018-05-01

Diagnosis of drug-resistant tuberculosis predominantly relies on culture-based drug susceptibility testing, which take weeks to produce a result and a more time-efficient alternative method is multiplex allele-specific PCR (MAS-PCR). Also, understanding the role of mutations in causing resistance helps better drug designing. To evaluate the ability of MAS-PCR in the detection of drug resistance and to understand the mechanism of interaction of drugs with mutant proteins in Mycobacterium tuberculosis. Detection of drug-resistant mutations using MAS-PCR and validation through DNA sequencing. MAS-PCR targeted five loci on three genes, katG 315 and inhA -15 for the drug isoniazid (INH), and rpoB 516, 526, and 531 for rifampicin (RIF). Furthermore, the sequence data were analyzed to study the effect on interaction of the anti-TB drug molecule with the target protein using in silico docking. We identified drug-resistant mutations in 8 out of 114 isolates with 2 of them as multidrug-resistant TB using MAS-PCR. DNA sequencing confirmed only six of these, recording a sensitivity of 85.7% and specificity of 99.3% for MAS-PCR. Molecular docking showed estimated free energy of binding (ΔG) being higher for RIF binding with RpoB S531L mutant. Codon 315 in KatG does not directly interact with INH but blocks the drug access to active site. We propose DNA sequencing-based drug resistance detection for TB, which is more accurate than MAS-PCR. Understanding the action of resistant mutations in disrupting the normal drug-protein interaction aids in designing effective drug alternatives.
Tissue specificity of the hormonal response in sex accessory tissues is associated with nuclear matrix protein patterns.

PubMed

Getzenberg, R H; Coffey, D S

1990-09-01

The DNA of interphase nuclei have very specific three-dimensional organizations that are different in different cell types, and it is possible that this varying DNA organization is responsible for the tissue specificity of gene expression. The nuclear matrix organizes the three-dimensional structure of the DNA and is believed to be involved in the control of gene expression. This study compares the nuclear structural proteins between two sex accessory tissues in the same animal responding to the same androgen stimulation by the differential expression of major tissue-specific secretory proteins. We demonstrate here that the nuclear matrix is tissue specific in the rat ventral prostate and seminal vesicle, and undergoes characteristic alterations in its protein composition upon androgen withdrawal. Three types of nuclear matrix proteins were observed: 1) nuclear matrix proteins that are different and tissue specific in the rat ventral prostate and seminal vesicle, 2) a set of nuclear matrix proteins that either appear or disappear upon androgen withdrawal, and 3) a set of proteins that are common to both the ventral prostate and seminal vesicle and do not change with the hormonal state of the animal. Since the nuclear matrix is known to bind androgen receptors in a tissue- and steroid-specific manner, we propose that the tissue specificity of the nuclear matrix arranges the DNA in a unique conformation, which may be involved in the specific interaction of transcription factors with DNA sequences, resulting in tissue-specific patterns of secretory protein expression.
Trigger Factor and DnaK possess overlapping substrate pools and binding specificities.

PubMed

Deuerling, Elke; Patzelt, Holger; Vorderwülbecke, Sonja; Rauch, Thomas; Kramer, Günter; Schaffitzel, Elke; Mogk, Axel; Schulze-Specking, Agnes; Langen, Hanno; Bukau, Bernd

2003-03-01

Ribosome-associated Trigger Factor (TF) and the DnaK chaperone system assist the folding of newly synthesized proteins in Escherichia coli. Here, we show that DnaK and TF share a common substrate pool in vivo. In TF-deficient cells, deltatig, depleted for DnaK and DnaJ the amount of aggregated proteins increases with increasing temperature, amounting to 10% of total soluble protein (approximately 340 protein species) at 37 degrees C. A similar population of proteins aggregated in DnaK depleted tig+ cells, albeit to a much lower extent. Ninety-four aggregated proteins isolated from DnaK- and DnaJ-depleted deltatig cells were identified by mass spectrometry and found to include essential cytosolic proteins. Four potential in vivo substrates were screened for chaperone binding sites using peptide libraries. Although TF and DnaK recognize different binding motifs, 77% of TF binding peptides also associated with DnaK. In the case of the nascent polypeptides TF and DnaK competed for binding, however, with competitive advantage for TF. In vivo, the loss of TF is compensated by the induction of the heat shock response and thus enhanced levels of DnaK. In summary, our results demonstrate that the co-operation of the two mechanistically distinct chaperones in protein folding is based on their overlap in substrate specificities.
DIVERSITY in binding, regulation, and evolution revealed from high-throughput ChIP.

PubMed

Mitra, Sneha; Biswas, Anushua; Narlikar, Leelavati

2018-04-01

Genome-wide in vivo protein-DNA interactions are routinely mapped using high-throughput chromatin immunoprecipitation (ChIP). ChIP-reported regions are typically investigated for enriched sequence-motifs, which are likely to model the DNA-binding specificity of the profiled protein and/or of co-occurring proteins. However, simple enrichment analyses can miss insights into the binding-activity of the protein. Note that ChIP reports regions making direct contact with the protein as well as those binding through intermediaries. For example, consider a ChIP experiment targeting protein X, which binds DNA at its cognate sites, but simultaneously interacts with four other proteins. Each of these proteins also binds to its own specific cognate sites along distant parts of the genome, a scenario consistent with the current view of transcriptional hubs and chromatin loops. Since ChIP will pull down all X-associated regions, the final reported data will be a union of five distinct sets of regions, each containing binding sites of one of the five proteins, respectively. Characterizing all five different motifs and the corresponding sets is important to interpret the ChIP experiment and ultimately, the role of X in regulation. We present diversity which attempts exactly this: it partitions the data so that each partition can be characterized with its own de novo motif. Diversity uses a Bayesian approach to identify the optimal number of motifs and the associated partitions, which together explain the entire dataset. This is in contrast to standard motif finders, which report motifs individually enriched in the data, but do not necessarily explain all reported regions. We show that the different motifs and associated regions identified by diversity give insights into the various complexes that may be forming along the chromatin, something that has so far not been attempted from ChIP data. Webserver at http://diversity.ncl.res.in/; standalone (Mac OS X/Linux) from https://github.com/NarlikarLab/DIVERSITY/releases/tag/v1.0.0.
Transcription activation mediated by a cyclic AMP receptor protein from Thermus thermophilus HB8.

PubMed

Shinkai, Akeo; Kira, Satoshi; Nakagawa, Noriko; Kashihara, Aiko; Kuramitsu, Seiki; Yokoyama, Shigeyuki

2007-05-01

The extremely thermophilic bacterium Thermus thermophilus HB8, which belongs to the phylum Deinococcus-Thermus, has an open reading frame encoding a protein belonging to the cyclic AMP (cAMP) receptor protein (CRP) family present in many bacteria. The protein named T. thermophilus CRP is highly homologous to the CRP family proteins from the phyla Firmicutes, Actinobacteria, and Cyanobacteria, and it forms a homodimer and interacts with cAMP. CRP mRNA and intracellular cAMP were detected in this strain, which did not drastically fluctuate during cultivation in a rich medium. The expression of several genes was altered upon disruption of the T. thermophilus CRP gene. We found six CRP-cAMP-dependent promoters in in vitro transcription assays involving DNA fragments containing the upstream regions of the genes exhibiting decreased expression in the CRP disruptant, indicating that the CRP is a transcriptional activator. The consensus T. thermophilus CRP-binding site predicted upon nucleotide sequence alignment is 5'-(C/T)NNG(G/T)(G/T)C(A/C)N(A/T)NNTCACAN(G/C)(G/C)-3'. This sequence is unique compared with the known consensus binding sequences of CRP family proteins. A putative -10 hexamer sequence resides at 18 to 19 bp downstream of the predicted T. thermophilus CRP-binding site. The CRP-regulated genes found in this study comprise clustered regularly interspaced short palindromic repeat (CRISPR)-associated (cas) ones, and the genes of a putative transcriptional regulator, a protein containing the exonuclease III-like domain of DNA polymerase, a GCN5-related acetyltransferase homolog, and T. thermophilus-specific proteins of unknown function. These results suggest a role for cAMP signal transduction in T. thermophilus and imply the T. thermophilus CRP is a cAMP-responsive regulator.
Amino acids 16-275 of minute virus of mice NS1 include a domain that specifically binds (ACCA)2-3-containing DNA.

PubMed

Mouw, M; Pintel, D J

1998-11-10

GST-NS1 purified from Escherichia coli and insect cells binds double-strand DNA in an (ACCA)2-3-dependent fashion under similar ionic conditions, independent of the presence of anti-NS1 antisera or exogenously supplied ATP and interacts with single-strand DNA and RNA in a sequence-independent manner. An amino-terminal domain (amino acids 1-275) of NS1 [GST-NS1(1-275)], representing 41% of the full-length NS1 molecule, includes a domain that binds double-strand DNA in a sequence-specific manner at levels comparable to full-length GST-NS1, as well as single-strand DNA and RNA in a sequence-independent manner. The deletion of 15 additional amino-terminal amino acids yielded a molecule [GST-NS1(1-275)] that maintained (ACCA)2-3-specific double-strand DNA binding; however, this molecule was more sensitive to increasing ionic conditions than full-length GST-NS1 and GST-NS1(1-275) and could not be demonstrated to bind single-strand nucleic acids. A quantitative filter binding assay showed that E. coli- and baculovirus-expressed GST-NS1 and E. coli GST-NS1(1-275) specifically bound double-strand DNA with similar equilibrium kinetics [as measured by their apparent equilibrium DNA binding constants (KD)], whereas GST-NS1(16-275) bound 4- to 8-fold less well. Copyright 1998 Academic Press.
Two copies of mthmg1, encoding a novel mitochondrial HMG-like protein, delay accumulation of mitochondrial DNA deletions in Podospora anserina.

PubMed

Dequard-Chablat, Michelle; Allandt, Cynthia

2002-08-01

In the filamentous fungus Podospora anserina, two degenerative processes which result in growth arrest are associated with mitochondrial genome (mitochondrial DNA [mtDNA]) instability. Senescence is correlated with mtDNA rearrangements and amplification of specific regions (senDNAs). Premature death syndrome is characterized by the accumulation of specific mtDNA deletions. This accumulation is due to indirect effects of the AS1-4 mutation, which alters a cytosolic ribosomal protein gene. The mthmg1 gene has been identified as a double-copy suppressor of premature death. It greatly delays premature death and the accumulation of deletions when it is present in two copies in an ASI-4 context. The duplication of mthmg1 has no significant effect on the wild-type life span or on senDNA patterns. In anAS1+ context, deletion of the mthmg1 gene alters germination, growth, and fertility and reduces the life span. The deltamthmg1 senescent strains display a particular senDNA pattern. This deletion is lethal in an AS1-4 context. According to its physical properties (very basic protein with putative mitochondrial targeting sequence and HMG-type DNA-binding domains) and the cellular localization of an mtHMG1-green fluorescent protein fusion, mtHMG1 appears to be a mitochondrial protein possibly associated with mtDNA. It is noteworthy that it is the first example of a protein combining the two DNA-binding domains, AT-hook motif and HMG-1 boxes. It may be involved in the stability and/or transmission of the mitochondrial genome. To date, no structural homologues have been found in other organisms. However, mtHMG1 displays functional similarities with the Saccharomyces cerevisiae mitochondrial HMG-box protein Abf2.
Detecting Coevolution in and among Protein Domains

PubMed Central

Yeang, Chen-Hsiang; Haussler, David

2007-01-01

Correlated changes of nucleic or amino acids have provided strong information about the structures and interactions of molecules. Despite the rich literature in coevolutionary sequence analysis, previous methods often have to trade off between generality, simplicity, phylogenetic information, and specific knowledge about interactions. Furthermore, despite the evidence of coevolution in selected protein families, a comprehensive screening of coevolution among all protein domains is still lacking. We propose an augmented continuous-time Markov process model for sequence coevolution. The model can handle different types of interactions, incorporate phylogenetic information and sequence substitution, has only one extra free parameter, and requires no knowledge about interaction rules. We employ this model to large-scale screenings on the entire protein domain database (Pfam). Strikingly, with 0.1 trillion tests executed, the majority of the inferred coevolving protein domains are functionally related, and the coevolving amino acid residues are spatially coupled. Moreover, many of the coevolving positions are located at functionally important sites of proteins/protein complexes, such as the subunit linkers of superoxide dismutase, the tRNA binding sites of ribosomes, the DNA binding region of RNA polymerase, and the active and ligand binding sites of various enzymes. The results suggest sequence coevolution manifests structural and functional constraints of proteins. The intricate relations between sequence coevolution and various selective constraints are worth pursuing at a deeper level. PMID:17983264
Viral nanoparticle-encapsidated enzyme and restructured DNA for cell delivery and gene expression

PubMed Central

Liu, Jinny L.; Dixit, Aparna Banerjee; Robertson, Kelly L.; Qiao, Eric; Black, Lindsay W.

2014-01-01

Packaging specific exogenous active proteins and DNAs together within a single viral-nanocontainer is challenging. The bacteriophage T4 capsid (100 × 70 nm) is well suited for this purpose, because it can hold a single long DNA or multiple short pieces of DNA up to 170 kb packed together with more than 1,000 protein molecules. Any linear DNA can be packaged in vitro into purified procapsids. The capsid-targeting sequence (CTS) directs virtually any protein into the procapsid. Procapsids are assembled with specific CTS-directed exogenous proteins that are encapsidated before the DNA. The capsid also can display on its surface high-affinity eukaryotic cell-binding peptides or proteins that are in fusion with small outer capsid and head outer capsid surface-decoration proteins that can be added in vivo or in vitro. In this study, we demonstrate that the site-specific recombinase cyclic recombination (Cre) targeted into the procapsid is enzymatically active within the procapsid and recircularizes linear plasmid DNA containing two terminal loxP recognition sites when packaged in vitro. mCherry expression driven by a cytomegalovirus promoter in the capsid containing Cre-circularized DNA is enhanced over linear DNA, as shown in recipient eukaryotic cells. The efficient and specific packaging into capsids and the unpackaging of both DNA and protein with release of the enzymatically altered protein–DNA complexes from the nanoparticles into cells have potential in numerous downstream drug and gene therapeutic applications. PMID:25161284
Identification of a GTP-binding protein. cap alpha. subunit that lacks an apparent ADP-ribosylation site for pertussis toxin

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fong, H.K.W.; Yoshimoto, K.K.; Eversole-Cire, P.

1988-05-01

Recent molecular cloning of cDNA for the ..cap alpha.. subunit of bovine transducin (a guanine nucleotide-binding regulatory protein, or G protein) has revealed the presence of two retinal-specific transducins, called T/sub r/ and T/sub c/, which are expressed in rod or cone photoreceptor cells. In a further study of G-protein diversity and signal transduction in the retina, the authors have identified a G-protein ..cap alpha.. subunit, which they refer to as G/sub z/..cap alpha.., by isolating a human retinal cDNA clone that cross-hybridizes at reduced stringency with bovine T/sub r/ ..cap alpha..-subunit cDNA. The deduced amino acid sequence of G/submore » z/..cap alpha.. is 41-67% identical with those of other known G-protein ..cap alpha.. subunits. However, the 355-residue G/sub z/..cap alpha.. lacks a consensus site for ADP-ribosylation by pertussis toxin, and its amino acid sequence varies within a number of regions that are strongly conserved among all of the other G-protein ..cap alpha.. subunits. They suggest that G/sub z/..cap alpha.., which appears to be highly expressed in neural tissues, represents a member of a subfamily of G proteins that mediate signal transduction in pertussis toxin-insensitive systems.« less
Helix Unwinding and Base Flipping Enable Human MTERF1 to Terminate Mitochondrial Transcription

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yakubovskaya, E.; Mejia, E; Byrnes, J

2010-01-01

Defects in mitochondrial gene expression are associated with aging and disease. Mterf proteins have been implicated in modulating transcription, replication and protein synthesis. We have solved the structure of a member of this family, the human mitochondrial transcriptional terminator MTERF1, bound to dsDNA containing the termination sequence. The structure indicates that upon sequence recognition MTERF1 unwinds the DNA molecule, promoting eversion of three nucleotides. Base flipping is critical for stable binding and transcriptional termination. Additional structural and biochemical results provide insight into the DNA binding mechanism and explain how MTERF1 recognizes its target sequence. Finally, we have demonstrated that themore » mitochondrial pathogenic G3249A and G3244A mutations interfere with key interactions for sequence recognition, eliminating termination. Our results provide insight into the role of mterf proteins and suggest a link between mitochondrial disease and the regulation of mitochondrial transcription.« less
Hydrogen/Deuterium Exchange Reflects Binding of Human Centrin 2 to Ca2+ and Xeroderma Pigmentosum Group C Peptide: An Example of EX1 Kinetics

PubMed Central

Sperry, Justin B.; Ryan, Zachary C.; Kumar, Rajiv; Gross, Michael L.

2012-01-01

Xeroderma pigmentosum (XP) is a genetic disease affecting 1 in 10,000-100,000 and predisposes people to early-age skin cancer, a disease that is increasing. Those with XP have decreased ability to repair UV-induced DNA damage, leading to increased susceptibility of cancerous non-melanomas and melanomas. A vital, heterotrimeric protein complex is linked to the nucleotide excision repair pathway for the damaged DNA. The complex consists of XPC protein, human centrin 2, and RAD23B. One of the members, human centrin 2, is a ubiquitous, acidic, Ca2+-binding protein belonging to the calmodulin superfamily. The XPC protein contains a sequence motif specific for binding to human centrin 2. We report here the Ca2+-binding properties of human centrin 2 and its interaction with the XPC peptide motif. We utilized a region-specific H/D exchange protocol to localize the interaction of the XPC peptide with the C-terminal domain of centrin, the binding of which is different than that of calmodulin complexes. The binding dynamics of human centrin 2 to the XPC peptide in the absence and presence of Ca2+ are revealed by the observation of EX1 H/D exchange regime, indicating that a locally unfolded population exists in solution and undergoes fast H/D exchange. PMID:23439742
A calmodulin binding protein from Arabidopsis is induced by ethylene and contains a DNA-binding motif

NASA Technical Reports Server (NTRS)

Reddy, A. S.; Reddy, V. S.; Golovkin, M.

2000-01-01

Calmodulin (CaM), a key calcium sensor in all eukaryotes, regulates diverse cellular processes by interacting with other proteins. To isolate CaM binding proteins involved in ethylene signal transduction, we screened an expression library prepared from ethylene-treated Arabidopsis seedlings with 35S-labeled CaM. A cDNA clone, EICBP (Ethylene-Induced CaM Binding Protein), encoding a protein that interacts with activated CaM was isolated in this screening. The CaM binding domain in EICBP was mapped to the C-terminus of the protein. These results indicate that calcium, through CaM, could regulate the activity of EICBP. The EICBP is expressed in different tissues and its expression in seedlings is induced by ethylene. The EICBP contains, in addition to a CaM binding domain, several features that are typical of transcription factors. These include a DNA-binding domain at the N terminus, an acidic region at the C terminus, and nuclear localization signals. In database searches a partial cDNA (CG-1) encoding a DNA-binding motif from parsley and an ethylene up-regulated partial cDNA from tomato (ER66) showed significant similarity to EICBP. In addition, five hypothetical proteins in the Arabidopsis genome also showed a very high sequence similarity with EICBP, indicating that there are several EICBP-related proteins in Arabidopsis. The structural features of EICBP are conserved in all EICBP-related proteins in Arabidopsis, suggesting that they may constitute a new family of DNA binding proteins and are likely to be involved in modulating gene expression in the presence of ethylene.
Cell-penetrating DNA-binding protein as a safe and efficient naked DNA delivery carrier in vitro and in vivo

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kim, Eun-Sung; Yang, Seung-Woo; Hong, Dong-Ki

Non-viral gene delivery is a safe and suitable alternative to viral vector-mediated delivery to overcome the immunogenicity and tumorigenesis associated with viral vectors. Using the novel, human-origin Hph-1 protein transduction domain that can facilitate the transduction of protein into cells, we developed a new strategy to deliver naked DNA in vitro and in vivo. The new DNA delivery system contains Hph-1-GAL4 DNA-binding domain (DBD) fusion protein and enhanced green fluorescent protein (EGFP) reporter plasmid that includes the five repeats of GAL4 upstream activating sequence (UAS). Hph-1-GAL4-DBD protein formed complex with plasmid DNA through the specific interaction between GAL4-DBD and UAS,more » and delivered into the cells via the Hph-1-PTD. The pEGFP DNA was successfully delivered by the Hph-1-GAL4 system, and the EGFP was effectively expressed in mammalian cells such as HeLa and Jurkat, as well as in Bright Yellow-2 (BY-2) plant cells. When 10 {mu}g of pEGFP DNA was intranasally administered to mice using Hph-1-GAL4 protein, a high level of EGFP expression was detected throughout the lung tissue for 7 days. These results suggest that an Hph-1-PTD-mediated DNA delivery strategy may be an useful non-viral DNA delivery system for gene therapy and DNA vaccines.« less
Cloning and sequence analysis of Galleria mellonella juvenile hormone binding protein--a search for ancestors and relatives.

PubMed

Rodriguez Parkitna, Jan M; Ozyhar, Andrzej; Wiśniewski, Jacek R; Kochman, Marian

2002-09-01

Juvenile hormone binding proteins (JHBPs) serve as specific carriers of juvenile hormone (JH) in insect hemolymph. As shown in this report, Galleria mellonella JHBP is encoded by a cDNA of 1063 nucleotides. The pre-protein consists of 245 amino acids with a 20 amino acid leader sequence. The concentration of the JHBP mRNA reaches a maximum on the third day of the last larval instar, and decreases five-fold towards pupation. Comparison of amino acid sequences of JHBPs from Bombyx mori, Heliothis virescens, Manduca sexta and G. mellonella shows that 57 positions out of 226 are occupied by identical amino acids. A phylogeny tree was constructed from 32 proteins, which function could be associated to JH. It has three major branches: (i) ligand binding domains of nuclear receptors, (ii) JHBPs and JH esterases (JHEs), and (iii) hypothetical proteins found in Drosophila melanogaster genome. Despite the close positioning of JHEs and JHBPs on the tree, which probably arises from the presence of a common JH binding motif, these proteins are unlikely to belong to the same family. Detailed analysis of the secondary structure modeling shows that JHBPs may contain a beta-barrel motif flanked by alpha-helices and thus be evolutionary related to the same superfamily as calycins.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.