Yousaf, Nasim; Gould, David
2017-01-01
Confirming the binding of a transcription factor with a particular DNA sequence may be important in characterizing interactions with a synthetic promoter. Electrophoretic mobility shift assay is a powerful approach to demonstrate the specific DNA sequence that is bound by a transcription factor and also to confirm the specific transcription factor involved in the interaction. In this chapter we describe a method we have successfully used to demonstrate interactions of endogenous transcription factors with sequences derived from endogenous and synthetic promoters.
Evers, R; Grummt, I
1995-01-01
Both the DNA elements and the nuclear factors that direct termination of ribosomal gene transcription exhibit species-specific differences. Even between mammals--e.g., human and mouse--the termination signals are not identical and the respective transcription termination factors (TTFs) which bind to the terminator sequence are not fully interchangeable. To elucidate the molecular basis for this species-specificity, we have cloned TTF-I from human and mouse cells and compared their structural and functional properties. Recombinant TTF-I exhibits species-specific DNA binding and terminates transcription both in cell-free transcription assays and in transfection experiments. Chimeric constructs of mouse TTF-I and human TTF-I reveal that the major determinant for species-specific DNA binding resides within the C terminus of TTF-I. Replacing 31 C-terminal amino acids of mouse TTF-I with the homologous human sequences relaxes the DNA-binding specificity and, as a consequence, allows the chimeric factor to bind the human terminator sequence and to specifically stop rDNA transcription. Images Fig. 2 Fig. 3 Fig. 4 PMID:7597036
SivaRaman, L; Subramanian, S; Thimmappaya, B
1986-01-01
Utilizing the gel electrophoresis/DNA binding assay, a factor specific for the upstream transcriptional control sequence of the EIA-inducible adenovirus EIIA-early promoter has been detected in HeLa cell nuclear extract. Analysis of linker-scanning mutants of the promoter by DNA binding assays and methylation-interference experiments show that the factor binds to the 17-nucleotide sequence 5' TGGAGATGACGTAGTTT 3' located between positions -66 and -82 upstream from the cap site. This sequence has been shown to be essential for transcription of this promoter. The EIIA-early-promoter specific factor was found to be present at comparable levels in uninfected HeLa cells and in cells infected with either wild-type adenovirus or the EIA-deletion mutant dl312 under conditions in which the EIA proteins are induced to high levels [7 or 20 hr after infection in the presence of arabinonucleoside (cytosine arabinoside)]. Based on the quantitation in DNA binding assays, it appears that the mechanism of EIA-activated transcription of the EIIA-early promoter does not involve a net change in the amounts of this factor. Images PMID:2942943
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fradkin, L.G.; Yoshinaga, S.K.; Berk, A.J.
1987-11-01
The inhibition of transcription by RNA polymerase III in poliovirus-infected cells was studied. Experiments utilizing two different cell lines showed that the initiation step of transcription by RNA polymerase III was impaired by infection of these cells with the virus. The observed inhibition of transcription was not due to shut-off of host cell protein synthesis by poliovirus. Among four distinct components required for accurate transcription in vitro from cloned DNA templates, activities of RNA polymerase III and transcription factor TFIIIA were not significantly affected by virus infection. The activity of transcription factor TFIIIC, the limiting component required for transcription ofmore » RNA polymerase III genes, was severely inhibited in infected cells, whereas that of transcription factor TFIIIB was inhibited to a lesser extent. The sequence-specific DNA-binding of TFIIIC to the adenovirus VA1 gene internal promoted, however, was not altered by infection of cells with the virus. The authors conclude that (i) at least two transcription factors, TFIIIB and TFIIIC, are inhibited by infection of cells with poliovirtus, (ii) inactivation of TFIIIC does not involve destruction of its DNA-binding domain, and (iii) sequence-specific DNA binding by TFIIIC may be necessary but is not sufficient for the formation of productive transcription complexes.« less
Position specific variation in the rate of evolution in transcription factor binding sites
Moses, Alan M; Chiang, Derek Y; Kellis, Manolis; Lander, Eric S; Eisen, Michael B
2003-01-01
Background The binding sites of sequence specific transcription factors are an important and relatively well-understood class of functional non-coding DNAs. Although a wide variety of experimental and computational methods have been developed to characterize transcription factor binding sites, they remain difficult to identify. Comparison of non-coding DNA from related species has shown considerable promise in identifying these functional non-coding sequences, even though relatively little is known about their evolution. Results Here we analyse the genome sequences of the budding yeasts Saccharomyces cerevisiae, S. bayanus, S. paradoxus and S. mikatae to study the evolution of transcription factor binding sites. As expected, we find that both experimentally characterized and computationally predicted binding sites evolve slower than surrounding sequence, consistent with the hypothesis that they are under purifying selection. We also observe position-specific variation in the rate of evolution within binding sites. We find that the position-specific rate of evolution is positively correlated with degeneracy among binding sites within S. cerevisiae. We test theoretical predictions for the rate of evolution at positions where the base frequencies deviate from background due to purifying selection and find reasonable agreement with the observed rates of evolution. Finally, we show how the evolutionary characteristics of real binding motifs can be used to distinguish them from artefacts of computational motif finding algorithms. Conclusion As has been observed for protein sequences, the rate of evolution in transcription factor binding sites varies with position, suggesting that some regions are under stronger functional constraint than others. This variation likely reflects the varying importance of different positions in the formation of the protein-DNA complex. The characterization of the pattern of evolution in known binding sites will likely contribute to the effective use of comparative sequence data in the identification of transcription factor binding sites and is an important step toward understanding the evolution of functional non-coding DNA. PMID:12946282
Narad, Priyanka; Kumar, Abhishek; Chakraborty, Amlan; Patni, Pranav; Sengupta, Abhishek; Wadhwa, Gulshan; Upadhyaya, K C
2017-09-01
Transcription factors are trans-acting proteins that interact with specific nucleotide sequences known as transcription factor binding site (TFBS), and these interactions are implicated in regulation of the gene expression. Regulation of transcriptional activation of a gene often involves multiple interactions of transcription factors with various sequence elements. Identification of these sequence elements is the first step in understanding the underlying molecular mechanism(s) that regulate the gene expression. For in silico identification of these sequence elements, we have developed an online computational tool named transcription factor information system (TFIS) for detecting TFBS for the first time using a collection of JAVA programs and is mainly based on TFBS detection using position weight matrix (PWM). The database used for obtaining position frequency matrices (PFM) is JASPAR and HOCOMOCO, which is an open-access database of transcription factor binding profiles. Pseudo-counts are used while converting PFM to PWM, and TFBS detection is carried out on the basis of percent score taken as threshold value. TFIS is equipped with advanced features such as direct sequence retrieving from NCBI database using gene identification number and accession number, detecting binding site for common TF in a batch of gene sequences, and TFBS detection after generating PWM from known raw binding sequences in addition to general detection methods. TFIS can detect the presence of potential TFBSs in both the directions at the same time. This feature increases its efficiency. And the results for this dual detection are presented in different colors specific to the orientation of the binding site. Results obtained by the TFIS are more detailed and specific to the detected TFs as integration of more informative links from various related web servers are added in the result pages like Gene Ontology, PAZAR database and Transcription Factor Encyclopedia in addition to NCBI and UniProt. Common TFs like SP1, AP1 and NF-KB of the Amyloid beta precursor gene is easily detected using TFIS along with multiple binding sites. In another scenario of embryonic developmental process, TFs of the FOX family (FOXL1 and FOXC1) were also identified. TFIS is platform-independent which is publicly available along with its support and documentation at http://tfistool.appspot.com and http://www.bioinfoplus.com/tfis/ . TFIS is licensed under the GNU General Public License, version 3 (GPL-3.0).
Reading of the non-template DNA by transcription elongation factors.
Svetlov, Vladimir; Nudler, Evgeny
2018-05-14
Unlike transcription initiation and termination, which have easily discernable signals such as promoters and terminators, elongation is regulated through a dynamic network involving RNA/DNA pause signals and states- rather than sequence-specific protein interactions. A report by Nedialkov et al. (in press) provides experimental evidence for sequence-specific recruitment of elongation factor RfaH to transcribing RNA polymerase (RNAP) and outlines the mechanism of gene expression regulation by restraint ("locking") of the DNA non-template strand. According to this model, the elongation complex pauses at the so called "operon polarity sequence" (found in some long bacterial operons coding for virulence genes), when the usually flexible non-template DNA strand adopts a distinct hairpin-loop conformation on the surface of transcribing RNAP. Sequence-specific binding of RfaH to this DNA segment facilitates conversion of RfaH from its inactive closed to its active open conformation. The interaction network formed between RfaH, non-template DNA, and RNAP locks DNA in a conformation that renders the elongation complex resistant to pausing and termination. The effects of such locking on transcript elongation can be mimicked by restraint of the non-template strand due to its shortening. This work advances our understanding of regulation of transcript elongation and has important implications for the action of general transcription factors, such as NusG, which lack apparent sequence-specificity, as well as for the mechanisms of other processes linked to transcription such as transcription-coupled DNA repair. This article is protected by copyright. All rights reserved. © 2018 John Wiley & Sons Ltd.
Transcription Factors Bind Thousands of Active and InactiveRegions in the Drosophila Blastoderm
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Xiao-Yong; MacArthur, Stewart; Bourgon, Richard
2008-01-10
Identifying the genomic regions bound by sequence-specific regulatory factors is central both to deciphering the complex DNA cis-regulatory code that controls transcription in metazoans and to determining the range of genes that shape animal morphogenesis. Here, we use whole-genome tiling arrays to map sequences bound in Drosophila melanogaster embryos by the six maternal and gap transcription factors that initiate anterior-posterior patterning. We find that these sequence-specific DNA binding proteins bind with quantitatively different specificities to highly overlapping sets of several thousand genomic regions in blastoderm embryos. Specific high- and moderate-affinity in vitro recognition sequences for each factor are enriched inmore » bound regions. This enrichment, however, is not sufficient to explain the pattern of binding in vivo and varies in a context-dependent manner, demonstrating that higher-order rules must govern targeting of transcription factors. The more highly bound regions include all of the over forty well-characterized enhancers known to respond to these factors as well as several hundred putative new cis-regulatory modules clustered near developmental regulators and other genes with patterned expression at this stage of embryogenesis. The new targets include most of the microRNAs (miRNAs) transcribed in the blastoderm, as well as all major zygotically transcribed dorsal-ventral patterning genes, whose expression we show to be quantitatively modulated by anterior-posterior factors. In addition to these highly bound regions, there are several thousand regions that are reproducibly bound at lower levels. However, these poorly bound regions are, collectively, far more distant from genes transcribed in the blastoderm than highly bound regions; are preferentially found in protein-coding sequences; and are less conserved than highly bound regions. Together these observations suggest that many of these poorly-bound regions are not involved in early-embryonic transcriptional regulation, and a significant proportion may be nonfunctional. Surprisingly, for five of the six factors, their recognition sites are not unambiguously more constrained evolutionarily than the immediate flanking DNA, even in more highly bound and presumably functional regions, indicating that comparative DNA sequence analysis is limited in its ability to identify functional transcription factor targets.« less
2014-01-01
Background Retroviral elements are pervasively transcribed and dynamically regulated during development. While multiple histone- and DNA-modifying enzymes have broadly been associated with their global silencing, little is known about how the many diverse retroviral families are each selectively recognized. Results Here we show that the zinc finger protein Krüppel-like Factor 3 (KLF3) specifically silences transcription from the ORR1A0 long terminal repeat in murine fetal and adult erythroid cells. In the absence of KLF3, we detect widespread transcription from ORR1A0 elements driven by the master erythroid regulator KLF1. In several instances these aberrant transcripts are spliced to downstream genic exons. One such chimeric transcript produces a novel, dominant negative isoform of PU.1 that can induce erythroid differentiation. Conclusions We propose that KLF3 ensures the integrity of the murine erythroid transcriptome through the selective repression of a particular retroelement and is likely one of multiple sequence-specific factors that cooperate to achieve global silencing. PMID:24946810
Faivre-Rampant, Odile; Bryan, Glenn J; Roberts, Alison G; Milbourne, Daniel; Viola, Roberto; Taylor, Mark A
2004-04-01
In this study, the aim was to determine whether TCP transcription factors are implicated in meristem activation in potato (Solanum tuberosum). By searching a database of potato EST sequences, with a sequence characteristically conserved in TCP domains, a potato tcp gene was identified. A BAC clone containing the tcp sequence was isolated and the genomic sequence was determined. Using a CAPS marker assay, the potato tcp gene (sttcp1) was mapped to chromosome 8. In dormant buds, relatively high levels of sttcp1-specific transcript were detected by in situ hybridization. By contrast, in sprouting buds, no expression of the sttcp1 could be detected. Furthermore, an inverse relationship between axillary bud size and the steady-state level of the sstcp1 transcript was demonstrated. In non-growing buds exhibiting correlative inhibition, sttcpI-specific transcript levels were also relatively high, but rapidly decreased when apical dominance was removed by excision of the apical bud.
The physical size of transcription factors is key to transcriptional regulation in chromatin domains
NASA Astrophysics Data System (ADS)
Maeshima, Kazuhiro; Kaizu, Kazunari; Tamura, Sachiko; Nozaki, Tadasu; Kokubo, Tetsuro; Takahashi, Koichi
2015-02-01
Genetic information, which is stored in the long strand of genomic DNA as chromatin, must be scanned and read out by various transcription factors. First, gene-specific transcription factors, which are relatively small (˜50 kDa), scan the genome and bind regulatory elements. Such factors then recruit general transcription factors, Mediators, RNA polymerases, nucleosome remodellers, and histone modifiers, most of which are large protein complexes of 1-3 MDa in size. Here, we propose a new model for the functional significance of the size of transcription factors (or complexes) for gene regulation of chromatin domains. Recent findings suggest that chromatin consists of irregularly folded nucleosome fibres (10 nm fibres) and forms numerous condensed domains (e.g., topologically associating domains). Although the flexibility and dynamics of chromatin allow repositioning of genes within the condensed domains, the size exclusion effect of the domain may limit accessibility of DNA sequences by transcription factors. We used Monte Carlo computer simulations to determine the physical size limit of transcription factors that can enter condensed chromatin domains. Small gene-specific transcription factors can penetrate into the chromatin domains and search their target sequences, whereas large transcription complexes cannot enter the domain. Due to this property, once a large complex binds its target site via gene-specific factors it can act as a ‘buoy’ to keep the target region on the surface of the condensed domain and maintain transcriptional competency. This size-dependent specialization of target-scanning and surface-tethering functions could provide novel insight into the mechanisms of various DNA transactions, such as DNA replication and repair/recombination.
Kamenova, Ivanka; Warfield, Linda
2014-01-01
Most RNA polymerase (Pol) II promoters lack a TATA element, yet nearly all Pol II transcription requires TATA binding protein (TBP). While the TBP-TATA interaction is critical for transcription at TATA-containing promoters, it has been unclear whether TBP sequence-specific DNA contacts are required for transcription at TATA-less genes. Transcription factor IID (TFIID), the TBP-containing coactivator that functions at most TATA-less genes, recognizes short sequence-specific promoter elements in metazoans, but analogous promoter elements have not been identified in Saccharomyces cerevisiae. We generated a set of mutations in the yeast TBP DNA binding surface and found that most support growth of yeast. Both in vivo and in vitro, many of these mutations are specifically defective for transcription of two TATA-containing genes with only minor defects in transcription of two TATA-less, TFIID-dependent genes. TBP binds several TATA-less promoters with apparent high affinity, but our results suggest that this binding is not important for transcription activity. Our results are consistent with the model that sequence-specific TBP-DNA contacts are not important at yeast TATA-less genes and suggest that other general transcription factors or coactivator subunits are responsible for recognition of TATA-less promoters. Our results also explain why yeast TBP derivatives defective for TATA binding appear defective in activated transcription. PMID:24865972
Kamenova, Ivanka; Warfield, Linda; Hahn, Steven
2014-08-01
Most RNA polymerase (Pol) II promoters lack a TATA element, yet nearly all Pol II transcription requires TATA binding protein (TBP). While the TBP-TATA interaction is critical for transcription at TATA-containing promoters, it has been unclear whether TBP sequence-specific DNA contacts are required for transcription at TATA-less genes. Transcription factor IID (TFIID), the TBP-containing coactivator that functions at most TATA-less genes, recognizes short sequence-specific promoter elements in metazoans, but analogous promoter elements have not been identified in Saccharomyces cerevisiae. We generated a set of mutations in the yeast TBP DNA binding surface and found that most support growth of yeast. Both in vivo and in vitro, many of these mutations are specifically defective for transcription of two TATA-containing genes with only minor defects in transcription of two TATA-less, TFIID-dependent genes. TBP binds several TATA-less promoters with apparent high affinity, but our results suggest that this binding is not important for transcription activity. Our results are consistent with the model that sequence-specific TBP-DNA contacts are not important at yeast TATA-less genes and suggest that other general transcription factors or coactivator subunits are responsible for recognition of TATA-less promoters. Our results also explain why yeast TBP derivatives defective for TATA binding appear defective in activated transcription. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Interactions between the R2R3-MYB Transcription Factor, AtMYB61, and Target DNA Binding Sites
Prouse, Michael B.; Campbell, Malcolm M.
2013-01-01
Despite the prominent roles played by R2R3-MYB transcription factors in the regulation of plant gene expression, little is known about the details of how these proteins interact with their DNA targets. For example, while Arabidopsis thaliana R2R3-MYB protein AtMYB61 is known to alter transcript abundance of a specific set of target genes, little is known about the specific DNA sequences to which AtMYB61 binds. To address this gap in knowledge, DNA sequences bound by AtMYB61 were identified using cyclic amplification and selection of targets (CASTing). The DNA targets identified using this approach corresponded to AC elements, sequences enriched in adenosine and cytosine nucleotides. The preferred target sequence that bound with the greatest affinity to AtMYB61 recombinant protein was ACCTAC, the AC-I element. Mutational analyses based on the AC-I element showed that ACC nucleotides in the AC-I element served as the core recognition motif, critical for AtMYB61 binding. Molecular modelling predicted interactions between AtMYB61 amino acid residues and corresponding nucleotides in the DNA targets. The affinity between AtMYB61 and specific target DNA sequences did not correlate with AtMYB61-driven transcriptional activation with each of the target sequences. CASTing-selected motifs were found in the regulatory regions of genes previously shown to be regulated by AtMYB61. Taken together, these findings are consistent with the hypothesis that AtMYB61 regulates transcription from specific cis-acting AC elements in vivo. The results shed light on the specifics of DNA binding by an important family of plant-specific transcriptional regulators. PMID:23741471
Kemme, Catherine A; Esadze, Alexandre; Iwahara, Junji
2015-11-10
Functions of transcription factors require formation of specific complexes at particular sites in cis-regulatory elements of genes. However, chromosomal DNA contains numerous sites that are similar to the target sequences recognized by transcription factors. The influence of such "quasi-specific" sites on functions of the transcription factors is not well understood at present by experimental means. In this work, using fluorescence methods, we have investigated the influence of quasi-specific DNA sites on the efficiency of target location by the zinc finger DNA-binding domain of the inducible transcription factor Egr-1, which recognizes a 9 bp sequence. By stopped-flow assays, we measured the kinetics of Egr-1's association with a target site on 143 bp DNA in the presence of various competitor DNAs, including nonspecific and quasi-specific sites. The presence of quasi-specific sites on competitor DNA significantly decelerated the target association by the Egr-1 protein. The impact of the quasi-specific sites depended strongly on their affinity, their concentration, and the degree of their binding to the protein. To quantitatively describe the kinetic impact of the quasi-specific sites, we derived an analytical form of the apparent kinetic rate constant for the target association and used it for fitting to the experimental data. Our kinetic data with calf thymus DNA as a competitor suggested that there are millions of high-affinity quasi-specific sites for Egr-1 among the 3 billion bp of genomic DNA. This study quantitatively demonstrates that naturally abundant quasi-specific sites on DNA can considerably impede the target search processes of sequence-specific DNA-binding proteins.
Posse, Viktor; Hoberg, Emily; Dierckx, Anke; Shahzad, Saba; Koolmeister, Camilla; Larsson, Nils-Göran; Wilhelmsson, L. Marcus; Hällberg, B. Martin; Gustafsson, Claes M.
2014-01-01
Mammalian mitochondrial transcription is executed by a single subunit mitochondrial RNA polymerase (Polrmt) and its two accessory factors, mitochondrial transcription factors A and B2 (Tfam and Tfb2m). Polrmt is structurally related to single-subunit phage RNA polymerases, but it also contains a unique N-terminal extension (NTE) of unknown function. We here demonstrate that the NTE functions together with Tfam to ensure promoter-specific transcription. When the NTE is deleted, Polrmt can initiate transcription in the absence of Tfam, both from promoters and non-specific DNA sequences. Additionally, when in presence of Tfam and a mitochondrial promoter, the NTE-deleted mutant has an even higher transcription activity than wild-type polymerase, indicating that the NTE functions as an inhibitory domain. Our studies lead to a model according to which Tfam specifically recruits wild-type Polrmt to promoter sequences, relieving the inhibitory effect of the NTE, as a first step in transcription initiation. In the second step, Tfb2m is recruited into the complex and transcription is initiated. PMID:24445803
2015-01-01
Functions of transcription factors require formation of specific complexes at particular sites in cis-regulatory elements of genes. However, chromosomal DNA contains numerous sites that are similar to the target sequences recognized by transcription factors. The influence of such “quasi-specific” sites on functions of the transcription factors is not well understood at present by experimental means. In this work, using fluorescence methods, we have investigated the influence of quasi-specific DNA sites on the efficiency of target location by the zinc finger DNA-binding domain of the inducible transcription factor Egr-1, which recognizes a 9 bp sequence. By stopped-flow assays, we measured the kinetics of Egr-1’s association with a target site on 143 bp DNA in the presence of various competitor DNAs, including nonspecific and quasi-specific sites. The presence of quasi-specific sites on competitor DNA significantly decelerated the target association by the Egr-1 protein. The impact of the quasi-specific sites depended strongly on their affinity, their concentration, and the degree of their binding to the protein. To quantitatively describe the kinetic impact of the quasi-specific sites, we derived an analytical form of the apparent kinetic rate constant for the target association and used it for fitting to the experimental data. Our kinetic data with calf thymus DNA as a competitor suggested that there are millions of high-affinity quasi-specific sites for Egr-1 among the 3 billion bp of genomic DNA. This study quantitatively demonstrates that naturally abundant quasi-specific sites on DNA can considerably impede the target search processes of sequence-specific DNA-binding proteins. PMID:26502071
2013-01-01
Background Sequence-specific DNA-binding proteins, with their paramount importance in the regulation of expression of the genetic material, are encoded by approximately 5% of the genes in an animal’s genome. But it is unclear to what extent alternative transcripts from these genes may further increase the complexity of the transcription factor complement. Results Of the 938 potential C. elegans transcription factor genes, 197 were annotated in WormBase as encoding at least two distinct isoforms. Evaluation of prior evidence identified, with different levels of confidence, 50 genes with alternative transcript starts, 23 with alternative transcript ends, 35 with alternative splicing and 34 with alternative transcripts generated by a combination of mechanisms, leaving 55 that were discounted. Expression patterns were determined for transcripts for a sample of 29 transcription factor genes, concentrating on those with alternative transcript starts for which the evidence was strongest. Seamless fosmid recombineering was used to generate reporter gene fusions with minimal modification to assay expression of specific transcripts while maintaining the broad genomic DNA context and alternative transcript production. Alternative transcription factor gene transcripts were typically expressed with identical or substantially overlapping distributions rather than in distinct domains. Conclusions Increasingly sensitive sequencing technologies will reveal rare transcripts but many of these are clearly non-productive. The majority of the transcription factor gene alternative transcripts that are productive may represent tolerable noise rather than encoding functionally distinct isoforms. PMID:23586691
Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.
Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G
2000-12-15
The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.
Ewulonu, U K; Snyder, L; Silver, L M; Schimenti, J C
1996-03-01
Transgenic mice were generated to localize essential promoter elements in the mouse testis-expressed Tcp-10 genes. These genes are expressed exclusively in male germ cells, and exhibit a diffuse range of transcriptional start sites, possibly due to the absence of a TATA box. A series of transgene constructs containing different amounts of 5' flanking DNA revealed that all sequences necessary for appropriate temporal and tissue-specific transcription of Tcp-10 reside between positions -1 to -973. All transgenic animals containing these sequences expressed a chimeric transgene at high levels, in a pattern that paralleled the endogenous genes. These experiments further defined a 227 bp fragment from -746 to -973 that was absolutely essential for expression. In a gel-shift assay, this 227-bp fragment bound nuclear protein from testis, but not other tissues, to yield two retarded bands. Sequence analysis of this fragment revealed a half-site for the AP-2 transcription factor recognition sequence. Gel shift assays using native or mutant oligonucleotides demonstrated that the putative AP-2 recognition sequence was essential for generating the retarded bands. Since the binding activity is testis-specific, but AP-2 expression is not exclusive to male germ cells, it is possible that transcription of Tcp-10 requires interaction between AP-2 and a germ cell-specific transcription factor.
Terrados, Gloria; Finkernagel, Florian; Stielow, Bastian; Sadic, Dennis; Neubert, Juliane; Herdt, Olga; Krause, Michael; Scharfe, Maren; Jarek, Michael; Suske, Guntram
2012-01-01
The transcription factor Sp2 is essential for early mouse development and for proliferation of mouse embryonic fibroblasts in culture. Yet its mechanisms of action and its target genes are largely unknown. In this study, we have combined RNA interference, in vitro DNA binding, chromatin immunoprecipitation sequencing and global gene-expression profiling to investigate the role of Sp2 for cellular functions, to define target sites and to identify genes regulated by Sp2. We show that Sp2 is important for cellular proliferation that it binds to GC-boxes and occupies proximal promoters of genes essential for vital cellular processes including gene expression, replication, metabolism and signalling. Moreover, we identified important key target genes and cellular pathways that are directly regulated by Sp2. Most significantly, Sp2 binds and activates numerous sequence-specific transcription factor and co-activator genes, and represses the whole battery of cholesterol synthesis genes. Our results establish Sp2 as a sequence-specific regulator of vitally important genes. PMID:22684502
He, Qiye; Johnston, Jeff; Zeitlinger, Julia
2014-01-01
Understanding how eukaryotic enhancers are bound and regulated by specific combinations of transcription factors is still a major challenge. To better map transcription factor binding genome-wide at nucleotide resolution in vivo, we have developed a robust ChIP-exo protocol called ChIP experiments with nucleotide resolution through exonuclease, unique barcode and single ligation (ChIP-nexus), which utilizes an efficient DNA self-circularization step during library preparation. Application of ChIP-nexus to four proteins—human TBP and Drosophila NFkB, Twist and Max— demonstrates that it outperforms existing ChIP protocols in resolution and specificity, pinpoints relevant binding sites within enhancers containing multiple binding motifs and allows the analysis of in vivo binding specificities. Notably, we show that Max frequently interacts with DNA sequences next to its motif, and that this binding pattern correlates with local DNA sequence features such as DNA shape. ChIP-nexus will be broadly applicable to studying in vivo transcription factor binding specificity and its relationship to cis-regulatory changes in humans and model organisms. PMID:25751057
Diehl, Adam G
2018-01-01
Abstract The mouse is widely used as system to study human genetic mechanisms. However, extensive rewiring of transcriptional regulatory networks often confounds translation of findings between human and mouse. Site-specific gain and loss of individual transcription factor binding sites (TFBS) has caused functional divergence of orthologous regulatory loci, and so we must look beyond this positional conservation to understand common themes of regulatory control. Fortunately, transcription factor co-binding patterns shared across species often perform conserved regulatory functions. These can be compared to ‘regulatory sentences’ that retain the same meanings regardless of sequence and species context. By analyzing TFBS co-occupancy patterns observed in four human and mouse cell types, we learned a regulatory grammar: the rules by which TFBS are combined into meaningful regulatory sentences. Different parts of this grammar associate with specific sets of functional annotations regardless of sequence conservation and predict functional signatures more accurately than positional conservation. We further show that both species-specific and conserved portions of this grammar are involved in gene expression divergence and human disease risk. These findings expand our understanding of transcriptional regulatory mechanisms, suggesting that phenotypic divergence and disease risk are driven by a complex interplay between deeply conserved and species-specific transcriptional regulatory pathways. PMID:29361190
TFBSshape: a motif database for DNA shape features of transcription factor binding sites.
Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W; Gordân, Raluca; Rohs, Remo
2014-01-01
Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein-DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone.
TFBSshape: a motif database for DNA shape features of transcription factor binding sites
Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W.; Gordân, Raluca; Rohs, Remo
2014-01-01
Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein–DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone. PMID:24214955
Cong, Le; Zhou, Ruhong; Kuo, Yu-chi; Cunniff, Margaret; Zhang, Feng
2012-01-01
Transcription activator-like effectors (TALE) are sequence-specific DNA binding proteins that harbor modular, repetitive DNA binding domains. TALEs have enabled the creation of customizable designer transcriptional factors and sequence-specific nucleases for genome engineering. Here we report two improvements of the TALE toolbox for achieving efficient activation and repression of endogenous gene expression in mammalian cells. We show that the naturally occurring repeat variable diresidue (RVD) Asn-His (NH) has high biological activity and specificity for guanine, a highly prevalent base in mammalian genomes. We also report an effective TALE transcriptional repressor architecture for targeted inhibition of transcription in mammalian cells. These findings will improve the precision and effectiveness of genome engineering that can be achieved using TALEs. PMID:22828628
Tokuhiro, Keizo; Miyagawa, Yasushi; Yamada, Shuichi; Hirose, Mika; Ohta, Hiroshi; Nishimune, Yoshitake; Tanaka, Hiromitsu
2007-03-01
Haspin is a unique protein kinase expressed predominantly in haploid male germ cells. The genomic structure of haspin (Gsg2) has revealed it to be intronless, and the entire transcription unit is in an intron of the integrin alphaE (Itgae) gene. Transcription occurs from a bidirectional promoter that also generates an alternatively spliced integrin alphaE-derived mRNA (Aed). In mice, the testis-specific alternative splicing of Aed is expressed bidirectionally downstream from the Gsg2 transcription initiation site, and a segment consisting of 26 bp transcribes both genomic DNA strands between Gsg2 and the Aed transcription initiation sites. To investigate the mechanisms for this unique gene regulation, we cloned and characterized the Gsg2 promoter region. The 193-bp genomic fragment from the 5' end of the Gsg2 and Aed genes, fused with EGFP and DsRed genes, drove the expression of both proteins in haploid germ cells of transgenic mice. This promoter element contained only a GC-rich sequence, and not the previously reported DNA sequences known to bind various transcription factors--with the exception of E2F1, TCFAP2A1 (AP2), and SP1. Here, we show that the 193-bp DNA sequence is sufficient for the specific, bidirectional, and synchronous expression in germ cells in the testis. We also demonstrate the existence of germ cell nuclear factors specifically bound to the promoter sequence. This activity may be regulated by binding to the promoter sequence with germ cell-specific nuclear complex(es) without regulation via DNA methylation.
Functional specificity of a Hox protein mediated by the recognition of minor groove structure.
Joshi, Rohit; Passner, Jonathan M; Rohs, Remo; Jain, Rinku; Sosinsky, Alona; Crickmore, Michael A; Jacob, Vinitha; Aggarwal, Aneel K; Honig, Barry; Mann, Richard S
2007-11-02
The recognition of specific DNA-binding sites by transcription factors is a critical yet poorly understood step in the control of gene expression. Members of the Hox family of transcription factors bind DNA by making nearly identical major groove contacts via the recognition helices of their homeodomains. In vivo specificity, however, often depends on extended and unstructured regions that link Hox homeodomains to a DNA-bound cofactor, Extradenticle (Exd). Using a combination of structure determination, computational analysis, and in vitro and in vivo assays, we show that Hox proteins recognize specific Hox-Exd binding sites via residues located in these extended regions that insert into the minor groove but only when presented with the correct DNA sequence. Our results suggest that these residues, which are conserved in a paralog-specific manner, confer specificity by recognizing a sequence-dependent DNA structure instead of directly reading a specific DNA sequence.
In vitro fluorescence studies of transcription factor IIB-DNA interaction.
Górecki, Andrzej; Figiel, Małgorzata; Dziedzicka-Wasylewska, Marta
2015-01-01
General transcription factor TFIIB is one of the basal constituents of the preinitiation complex of eukaryotic RNA polymerase II, acting as a bridge between the preinitiation complex and the polymerase, and binding promoter DNA in an asymmetric manner, thereby defining the direction of the transcription. Methods of fluorescence spectroscopy together with circular dichroism spectroscopy were used to observe conformational changes in the structure of recombinant human TFIIB after binding to specific DNA sequence. To facilitate the exploration of the structural changes, several site-directed mutations have been introduced altering the fluorescence properties of the protein. Our observations showed that binding of specific DNA sequences changed the protein structure and dynamics, and TFIIB may exist in two conformational states, which can be described by a different microenvironment of W52. Fluorescence studies using both intrinsic and exogenous fluorophores showed that these changes significantly depended on the recognition sequence and concerned various regions of the protein, including those interacting with other transcription factors and RNA polymerase II. DNA binding can cause rearrangements in regions of proteins interacting with the polymerase in a manner dependent on the recognized sequences, and therefore, influence the gene expression.
Bouallaga, I; Massicard, S; Yaniv, M; Thierry, F
2000-11-01
Recent studies have reported new mechanisms that mediate the transcriptional synergy of strong tissue-specific enhancers, involving the cooperative assembly of higher-order nucleoprotein complexes called enhanceosomes. Here we show that the HPV18 enhancer, which controls the epithelial-specific transcription of the E6 and E7 transforming genes, exhibits characteristic features of these structures. We used deletion experiments to show that a core enhancer element cooperates, in a specific helical phasing, with distant essential factors binding to the ends of the enhancer. This core sequence, binding a Jun B/Fra-2 heterodimer, cooperatively recruits the architectural protein HMG-I(Y) in a nucleoprotein complex, where they interact with each other. Therefore, in HeLa cells, HPV18 transcription seems to depend upon the assembly of an enhanceosome containing multiple cellular factors recruited by a core sequence interacting with AP1 and HMG-I(Y).
A purified transcription factor (TIF-IB) binds to essential sequences of the mouse rDNA promoter.
Clos, J; Buttgereit, D; Grummt, I
1986-01-01
A transcription factor that is specific for mouse rDNA has been partially purified from Ehrlich ascites cells. This factor [designated transcription initiation factor (TIF)-IB] is required for accurate in vitro synthesis of mouse rRNA in addition to RNA polymerase I and another regulatory factor, TIF-IA. TIF-IB activity is present in extracts both from growing and nongrowing cells in comparable amounts. Prebinding competition experiments with wild-type and mutant templates suggest that TIF-IB interacts with the core control element of the rDNA promoter, which is located immediately upstream of the initiation site. The specific binding of TIF-IB to the RNA polymerase I promoter is demonstrated by exonuclease III protection experiments. The 3' border of the sequences protected by TIF-IB is shown to be on the coding strand at position -21 and on the noncoding strand at position -7. The results suggest that direct binding of TIF-IB to sequences in the core promoter element is the mechanism by which this factor imparts promoter selectivity to RNA polymerase I. Images PMID:3456157
Ashworth, Justin; Plaisier, Christopher L.; Lo, Fang Yin; Reiss, David J.; Baliga, Nitin S.
2014-01-01
Widespread microbial genome sequencing presents an opportunity to understand the gene regulatory networks of non-model organisms. This requires knowledge of the binding sites for transcription factors whose DNA-binding properties are unknown or difficult to infer. We adapted a protein structure-based method to predict the specificities and putative regulons of homologous transcription factors across diverse species. As a proof-of-concept we predicted the specificities and transcriptional target genes of divergent archaeal feast/famine regulatory proteins, several of which are encoded in the genome of Halobacterium salinarum. This was validated by comparison to experimentally determined specificities for transcription factors in distantly related extremophiles, chromatin immunoprecipitation experiments, and cis-regulatory sequence conservation across eighteen related species of halobacteria. Through this analysis we were able to infer that Halobacterium salinarum employs a divergent local trans-regulatory strategy to regulate genes (carA and carB) involved in arginine and pyrimidine metabolism, whereas Escherichia coli employs an operon. The prediction of gene regulatory binding sites using structure-based methods is useful for the inference of gene regulatory relationships in new species that are otherwise difficult to infer. PMID:25255272
Ashworth, Justin; Plaisier, Christopher L; Lo, Fang Yin; Reiss, David J; Baliga, Nitin S
2014-01-01
Widespread microbial genome sequencing presents an opportunity to understand the gene regulatory networks of non-model organisms. This requires knowledge of the binding sites for transcription factors whose DNA-binding properties are unknown or difficult to infer. We adapted a protein structure-based method to predict the specificities and putative regulons of homologous transcription factors across diverse species. As a proof-of-concept we predicted the specificities and transcriptional target genes of divergent archaeal feast/famine regulatory proteins, several of which are encoded in the genome of Halobacterium salinarum. This was validated by comparison to experimentally determined specificities for transcription factors in distantly related extremophiles, chromatin immunoprecipitation experiments, and cis-regulatory sequence conservation across eighteen related species of halobacteria. Through this analysis we were able to infer that Halobacterium salinarum employs a divergent local trans-regulatory strategy to regulate genes (carA and carB) involved in arginine and pyrimidine metabolism, whereas Escherichia coli employs an operon. The prediction of gene regulatory binding sites using structure-based methods is useful for the inference of gene regulatory relationships in new species that are otherwise difficult to infer.
Mazzoni, Esteban O; Mahony, Shaun; Closser, Michael; Morrison, Carolyn A; Nedelec, Stephane; Williams, Damian J; An, Disi; Gifford, David K; Wichterle, Hynek
2013-01-01
Efficient transcriptional programming promises to open new frontiers in regenerative medicine. However, mechanisms by which programming factors transform cell fate are unknown, preventing more rational selection of factors to generate desirable cell types. Three transcription factors, Ngn2, Isl1 and Lhx3, were sufficient to program rapidly and efficiently spinal motor neuron identity when expressed in differentiating mouse embryonic stem cells. Replacement of Lhx3 by Phox2a led to specification of cranial, rather than spinal, motor neurons. Chromatin immunoprecipitation–sequencing analysis of Isl1, Lhx3 and Phox2a binding sites revealed that the two cell fates were programmed by the recruitment of Isl1-Lhx3 and Isl1-Phox2a complexes to distinct genomic locations characterized by a unique grammar of homeodomain binding motifs. Our findings suggest that synergistic interactions among transcription factors determine the specificity of their recruitment to cell type–specific binding sites and illustrate how a single transcription factor can be repurposed to program different cell types. PMID:23872598
Mink, S; Härtig, E; Jennewein, P; Doppler, W; Cato, A C
1992-01-01
Mouse mammary tumor virus (MMTV) is a milk-transmitted retrovirus involved in the neoplastic transformation of mouse mammary gland cells. The expression of this virus is regulated by mammary cell type-specific factors, steroid hormones, and polypeptide growth factors. Sequences for mammary cell-specific expression are located in an enhancer element in the extreme 5' end of the long terminal repeat region of this virus. This enhancer, when cloned in front of the herpes simplex thymidine kinase promoter, endows the promoter with mammary cell-specific response. Using functional and DNA-protein-binding studies with constructs mutated in the MMTV long terminal repeat enhancer, we have identified two main regulatory elements necessary for the mammary cell-specific response. These elements consist of binding sites for a transcription factor in the family of CTF/NFI proteins and the transcription factor mammary cell-activating factor (MAF) that recognizes the sequence G Pu Pu G C/G A A G G/T. Combinations of CTF/NFI- and MAF-binding sites or multiple copies of either one of these binding sites but not solitary binding sites mediate mammary cell-specific expression. The functional activities of these two regulatory elements are enhanced by another factor that binds to the core sequence ACAAAG. Interdigitated binding sites for CTF/NFI, MAF, and/or the ACAAAG factor are also found in the 5' upstream regions of genes encoding whey milk proteins from different species. These findings suggest that mammary cell-specific regulation is achieved by a concerted action of factors binding to multiple regulatory sites. Images PMID:1328867
Novel mechanism and factor for regulation by HIV-1 Tat.
Zhou, Q; Sharp, P A
1995-01-01
Tat regulation of human immunodeficiency virus (HIV) transcription is unique because of its specificity for an RNA target, TAR, and its ability to increase the efficiency of elongation by polymerase. A reconstituted reaction that is Tat-specific and TAR-dependent for activation of HIV transcription has been used to identify and partially purify a cellular activity that is required for trans-activation by Tat, but not by other activators. In the reaction, Tat stimulates the efficiency of elongation by polymerase, whereas Sp1 and other DNA sequence-specific transcription factors activate the rate of initiation. Furthermore, while TATA binding protein (TBP)-associated factors (TAFs) in the TFIID complex are required for activation by transcription factors, they are dispensable for Tat function. Thus, Tat acts through a novel mechanism, which is mediated by a specific host cellular factor, to stimulate HIV-1 gene expression. Images PMID:7835343
Functional Analysis of Maize Silk-Specific ZmbZIP25 Promoter.
Li, Wanying; Yu, Dan; Yu, Jingjuan; Zhu, Dengyun; Zhao, Qian
2018-03-12
ZmbZIP25 ( Zea mays bZIP (basic leucine zipper) transcription factor 25) is a function-unknown protein that belongs to the D group of the bZIP transcription factor family. RNA-seq data showed that the expression of ZmbZIP25 was tissue-specific in maize silks, and this specificity was confirmed by RT-PCR (reverse transcription-polymerase chain reaction). In situ RNA hybridization showed that ZmbZIP25 was expressed exclusively in the xylem of maize silks. A 5' RACE (rapid amplification of cDNA ends) assay identified an adenine residue as the transcription start site of the ZmbZIP25 gene. To characterize this silk-specific promoter, we isolated and analyzed a 2450 bp (from -2083 to +367) and a 2600 bp sequence of ZmbZIP25 (from -2083 to +517, the transcription start site was denoted +1). Stable expression assays in Arabidopsis showed that the expression of the reporter gene GUS driven by the 2450 bp ZmbZIP25 5'-flanking fragment occurred exclusively in the papillae of Arabidopsis stigmas. Furthermore, transient expression assays in maize indicated that GUS and GFP expression driven by the 2450 bp ZmbZIP25 5'-flanking sequences occurred only in maize silks and not in other tissues. However, no GUS or GFP expression was driven by the 2600 bp ZmbZIP25 5'-flanking sequences in either stable or transient expression assays. A series of deletion analyses of the 2450 bp ZmbZIP25 5'-flanking sequence was performed in transgenic Arabidopsis plants, and probable elements prediction analysis revealed the possible presence of negative regulatory elements within the 161 bp region from -1117 to -957 that were responsible for the specificity of the ZmbZIP25 5'-flanking sequence.
Functional Analysis of Maize Silk-Specific ZmbZIP25 Promoter
Li, Wanying; Yu, Dan; Yu, Jingjuan; Zhu, Dengyun; Zhao, Qian
2018-01-01
ZmbZIP25 (Zea mays bZIP (basic leucine zipper) transcription factor 25) is a function-unknown protein that belongs to the D group of the bZIP transcription factor family. RNA-seq data showed that the expression of ZmbZIP25 was tissue-specific in maize silks, and this specificity was confirmed by RT-PCR (reverse transcription-polymerase chain reaction). In situ RNA hybridization showed that ZmbZIP25 was expressed exclusively in the xylem of maize silks. A 5′ RACE (rapid amplification of cDNA ends) assay identified an adenine residue as the transcription start site of the ZmbZIP25 gene. To characterize this silk-specific promoter, we isolated and analyzed a 2450 bp (from −2083 to +367) and a 2600 bp sequence of ZmbZIP25 (from −2083 to +517, the transcription start site was denoted +1). Stable expression assays in Arabidopsis showed that the expression of the reporter gene GUS driven by the 2450 bp ZmbZIP25 5′-flanking fragment occurred exclusively in the papillae of Arabidopsis stigmas. Furthermore, transient expression assays in maize indicated that GUS and GFP expression driven by the 2450 bp ZmbZIP25 5′-flanking sequences occurred only in maize silks and not in other tissues. However, no GUS or GFP expression was driven by the 2600 bp ZmbZIP25 5′-flanking sequences in either stable or transient expression assays. A series of deletion analyses of the 2450 bp ZmbZIP25 5′-flanking sequence was performed in transgenic Arabidopsis plants, and probable elements prediction analysis revealed the possible presence of negative regulatory elements within the 161 bp region from −1117 to −957 that were responsible for the specificity of the ZmbZIP25 5′-flanking sequence. PMID:29534529
Wang, Guohua; Wang, Fang; Huang, Qian; Li, Yu; Liu, Yunlong; Wang, Yadong
2015-01-01
Transcription factors are proteins that bind to DNA sequences to regulate gene transcription. The transcription factor binding sites are short DNA sequences (5-20 bp long) specifically bound by one or more transcription factors. The identification of transcription factor binding sites and prediction of their function continue to be challenging problems in computational biology. In this study, by integrating the DNase I hypersensitive sites with known position weight matrices in the TRANSFAC database, the transcription factor binding sites in gene regulatory region are identified. Based on the global gene expression patterns in cervical cancer HeLaS3 cell and HelaS3-ifnα4h cell (interferon treatment on HeLaS3 cell for 4 hours), we present a model-based computational approach to predict a set of transcription factors that potentially cause such differential gene expression. Significantly, 6 out 10 predicted functional factors, including IRF, IRF-2, IRF-9, IRF-1 and IRF-3, ICSBP, belong to interferon regulatory factor family and upregulate the gene expression levels responding to the interferon treatment. Another factor, ISGF-3, is also a transcriptional activator induced by interferon alpha. Using the different transcription factor binding sites selected criteria, the prediction result of our model is consistent. Our model demonstrated the potential to computationally identify the functional transcription factors in gene regulation.
Specific Inhibition of the transcription factor Ci by a Cobalt(III)-Schiff base-DNA conjugate
Hurtado, Ryan R.; Harney, Allison S.; Heffern, Marie C.; Holbrook, Robert J.; Holmgren, Robert A.; Meade, Thomas J.
2012-01-01
We describe the use of Co(III) Schiff base-DNA conjugates, a versatile class of research tools that target C2H2 transcription factors, to inhibit the Hedgehog (Hh) pathway. In developing mammalian embryos, Hh signaling is critical for the formation and development of many tissues and organs. Inappropriate activation of the Hedgehog (Hh) pathway has been implicated in a variety of cancers including medulloblastomas and basal cell carcinomas. It is well known that Hh regulates the activity of the Gli family of C2H2 zinc finger transcription factors in mammals. In Drosophila the function of the Gli proteins is performed by a single transcription factor with an identical DNA binding consensus sequence, Cubitus Interruptus (Ci). We have demonstrated previously that conjugation of a specific 17 base-pair oligonucleotide to a Co(III) Schiff base complex results in a targeted inhibitor of the Snail family C2H2 zinc finger transcription factors. Modification of the oligonucleotide sequence in the Co(III) Schiff base-DNA conjugate to that of Ci’s consensus sequence (Co(III)-Ci) generates an equally selective inhibitor of Ci. Co(III)-Ci irreversibly binds the Ci zinc finger domain and prevents it from binding DNA in vitro. In a Ci responsive tissue culture reporter gene assay, Co(III)-Ci reduces the transcriptional activity of Ci in a concentration dependent manner. In addition, injection of wild-type Drosophila embryos with Co(III)-Ci phenocopies a Ci loss of function phenotype, demonstrating effectiveness in vivo. This study provides evidence that Co(III) Schiff base-DNA conjugates are a versatile class of specific and potent tools for studying zinc finger domain proteins and have potential applications as customizable anti-cancer therapeutics. PMID:22214326
Watada, Hirotaka; Mirmira, Raghavendra G.; Kalamaras, Julie; German, Michael S.
2000-01-01
The developmentally important homeodomain transcription factors of the NK-2 class contain a highly conserved region, the NK2-specific domain (NK2-SD). The function of this domain, however, remains unknown. The primary structure of the NK2-SD suggests that it might function as an accessory DNA-binding domain or as a protein–protein interaction interface. To assess the possibility that the NK2-SD may contribute to DNA-binding specificity, we used a PCR-based approach to identify a consensus DNA-binding sequences for Nkx2.2, an NK-2 family member involved in pancreas and central nervous system development. The consensus sequence (TCTAAGTGAGCTT) is similar to the known binding sequences for other NK-2 homeodomain proteins, but we show that the NK2-SD does not contribute significantly to specific DNA binding to this sequence. To determine whether the NK2-SD contributes to transactivation, we used GAL4-Nkx2.2 fusion constructs to map a powerful transcriptional activation domain in the C-terminal region beyond the conserved NK2-SD. Interestingly, this C-terminal region functions as a transcriptional activator only in the absence of an intact NK2-SD. The NK2-SD also can mask transactivation from the paired homeodomain transcription factor Pax6, but it has no effect on transcription by itself. These results demonstrate that the NK2-SD functions as an intramolecular regulator of the C-terminal activation domain in Nkx2.2 and support a model in which interactions through the NK2-SD regulate the ability of NK-2-class proteins to activate specific genes during development. PMID:10944215
Trapnell, Cole; Davidson, Stuart; Pachter, Lior; Chu, Hou Cheng; Tonkin, Leath A.; Biggin, Mark D.; Eisen, Michael B.
2010-01-01
Changes in gene expression play an important role in evolution, yet the molecular mechanisms underlying regulatory evolution are poorly understood. Here we compare genome-wide binding of the six transcription factors that initiate segmentation along the anterior-posterior axis in embryos of two closely related species: Drosophila melanogaster and Drosophila yakuba. Where we observe binding by a factor in one species, we almost always observe binding by that factor to the orthologous sequence in the other species. Levels of binding, however, vary considerably. The magnitude and direction of the interspecies differences in binding levels of all six factors are strongly correlated, suggesting a role for chromatin or other factor-independent forces in mediating the divergence of transcription factor binding. Nonetheless, factor-specific quantitative variation in binding is common, and we show that it is driven to a large extent by the gain and loss of cognate recognition sequences for the given factor. We find only a weak correlation between binding variation and regulatory function. These data provide the first genome-wide picture of how modest levels of sequence divergence between highly morphologically similar species affect a system of coordinately acting transcription factors during animal development, and highlight the dominant role of quantitative variation in transcription factor binding over short evolutionary distances. PMID:20351773
Joshi, Anagha
2014-12-30
Transcriptional hotspots are defined as genomic regions bound by multiple factors. They have been identified recently as cell type specific enhancers regulating developmentally essential genes in many species such as worm, fly and humans. The in-depth analysis of hotspots across multiple cell types in same species still remains to be explored and can bring new biological insights. We therefore collected 108 transcription-related factor (TF) ChIP sequencing data sets in ten murine cell types and classified the peaks in each cell type in three groups according to binding occupancy as singletons (low-occupancy), combinatorials (mid-occupancy) and hotspots (high-occupancy). The peaks in the three groups clustered largely according to the occupancy, suggesting priming of genomic loci for mid occupancy irrespective of cell type. We then characterized hotspots for diverse structural functional properties. The genes neighbouring hotspots had a small overlap with hotspot genes in other cell types and were highly enriched for cell type specific function. Hotspots were enriched for sequence motifs of key TFs in that cell type and more than 90% of hotspots were occupied by pioneering factors. Though we did not find any sequence signature in the three groups, the H3K4me1 binding profile had bimodal peaks at hotspots, distinguishing hotspots from mono-modal H3K4me1 singletons. In ES cells, differentially expressed genes after perturbation of activators were enriched for hotspot genes suggesting hotspots primarily act as transcriptional activator hubs. Finally, we proposed that ES hotspots might be under control of SetDB1 and not DNMT for silencing. Transcriptional hotspots are enriched for tissue specific enhancers near cell type specific highly expressed genes. In ES cells, they are predicted to act as transcriptional activator hubs and might be under SetDB1 control for silencing.
Global Analysis of Transcription Factor-Binding Sites in Yeast Using ChIP-Seq
Lefrançois, Philippe; Gallagher, Jennifer E. G.; Snyder, Michael
2016-01-01
Transcription factors influence gene expression through their ability to bind DNA at specific regulatory elements. Specific DNA-protein interactions can be isolated through the chromatin immunoprecipitation (ChIP) procedure, in which DNA fragments bound by the protein of interest are recovered. ChIP is followed by high-throughput DNA sequencing (Seq) to determine the genomic provenance of ChIP DNA fragments and their relative abundance in the sample. This chapter describes a ChIP-Seq strategy adapted for budding yeast to enable the genome-wide characterization of binding sites of transcription factors (TFs) and other DNA-binding proteins in an efficient and cost-effective way. Yeast strains with epitope-tagged TFs are most commonly used for ChIP-Seq, along with their matching untagged control strains. The initial step of ChIP involves the cross-linking of DNA and proteins. Next, yeast cells are lysed and sonicated to shear chromatin into smaller fragments. An antibody against an epitope-tagged TF is used to pull down chromatin complexes containing DNA and the TF of interest. DNA is then purified and proteins degraded. Specific barcoded adapters for multiplex DNA sequencing are ligated to ChIP DNA. Short DNA sequence reads (28–36 base pairs) are parsed according to the barcode and aligned against the yeast reference genome, thus generating a nucleotide-resolution map of transcription factor-binding sites and their occupancy. PMID:25213249
Hashimoto, H; Toide, K; Kitamura, R; Fujita, M; Tagawa, S; Itoh, S; Kamataki, T
1993-12-01
CYP3 A4 is the adult-specific form of cytochrome P450 in human livers [Komori, M., Nishio, K., Kitada, M., Shiramatsu, K., Muroya, K., Soma, M., Nagashima, K. & Kamataki, T. (1990) Biochemistry 29, 4430-4433]. The sequences of three genomic clones for CYP3A4 were analyzed for all exons, exon-intron junctions and the 5'-flanking region from the major transcription site to nucleotide position -1105, and compared with those of the CYP3A7 gene, a fetal-specific form of cytochrome P450 in humans. The results showed that the identity of 5'-flanking sequences between CYP3A4 and CYP3A7 genes was 91%, and that each 5'-flanking region had characteristic sequences termed as NFSE (P450NF-specific element) and HFLaSE (P450HFLa specific element), respectively. A basic transcription element (BTE) also lay in the 5'-flanking region of the CYP3A4 gene as seen in many CYP genes [Yanagida, A., Sogawa, K., Yasumoto, K. & Fujii-Kuriyama, Y. (1990) Mol. Cell. Biol. 10, 1470-1475]. The BTE binding factor (BTEB) was present in both adult and fetal human livers. To examine the transcriptional activity of the CYP3A4 gene, DNA fragments in the 5'-flanking region of the gene were inserted in front of the simian virus 40 promoter and the chloramphenicol acetyltransferase structural gene, and the constructs were transfected in HepG2 cells. The analysis of the chloramphenicol acetyltransferase activity indicated that (a) specific element(s) which could bind with a factor(s) in livers was present in the 5'-flanking region of the CYP3A4 gene to show the transcriptional activity.
Properties of a U1 RNA enhancer-like sequence.
Ciliberto, G; Palla, F; Tebb, G; Mattaj, I W; Philipson, L
1987-01-01
The properties of a X.laevis U1B snRNA gene enhancer have been studied by microinjection in Xenopus oocytes. The enhancer-like sequence, defined as a short DNA stretch that is able to activate transcription in an orientation independent manner, is interchangeable between different U snRNA genes. The enhancer sequence alone does not, however, efficiently activate transcription from an SV40 pol II promoter but regains its activity when combined with the U-gene specific proximal sequence element. DNase I protection experiments show that the X.laevis U1B enhancer can interact specifically with a nuclear factor present in mammalian cells. Images PMID:3031597
Rogers, Julia M; Bulyk, Martha L
2018-04-25
Sequence-specific transcription factors (TFs) bind short DNA sequences in the genome to regulate the expression of target genes. In the last decade, numerous technical advances have enabled the determination of the DNA-binding specificities of many of these factors. Large-scale screens of many TFs enabled the creation of databases of TF DNA-binding specificities, typically represented as position weight matrices (PWMs). Although great progress has been made in determining and predicting binding specificities systematically, there are still many surprises to be found when studying a particular TF's interactions with DNA in detail. Paralogous TFs' binding specificities can differ in subtle ways, in a manner that is not immediately apparent from looking at their PWMs. These differences affect gene regulatory outputs and enable TFs to rewire transcriptional networks over evolutionary time. This review discusses recent observations made in the study of TF-DNA interactions that highlight the importance of continued in-depth analysis of TF-DNA interactions and their inherent complexity. This article is categorized under: Biological Mechanisms > Regulatory Biology. © 2018 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
MacArthur, Stewart; Li, Xiao-Yong; Li, Jingyi
2009-05-15
BACKGROUND: We previously established that six sequence-specific transcription factors that initiate anterior/posterior patterning in Drosophila bind to overlapping sets of thousands of genomic regions in blastoderm embryos. While regions bound at high levels include known and probable functional targets, more poorly bound regions are preferentially associated with housekeeping genes and/or genes not transcribed in the blastoderm, and are frequently found in protein coding sequences or in less conserved non-coding DNA, suggesting that many are likely non-functional. RESULTS: Here we show that an additional 15 transcription factors that regulate other aspects of embryo patterning show a similar quantitative continuum of functionmore » and binding to thousands of genomic regions in vivo. Collectively, the 21 regulators show a surprisingly high overlap in the regions they bind given that they belong to 11 DNA binding domain families, specify distinct developmental fates, and can act via different cis-regulatory modules. We demonstrate, however, that quantitative differences in relative levels of binding to shared targets correlate with the known biological and transcriptional regulatory specificities of these factors. CONCLUSIONS: It is likely that the overlap in binding of biochemically and functionally unrelated transcription factors arises from the high concentrations of these proteins in nuclei, which, coupled with their broad DNA binding specificities, directs them to regions of open chromatin. We suggest that most animal transcription factors will be found to show a similar broad overlapping pattern of binding in vivo, with specificity achieved by modulating the amount, rather than the identity, of bound factor.« less
Comparative Analysis of Transcription Factors Families across Fungal Tree of Life
DOE Office of Scientific and Technical Information (OSTI.GOV)
Salamov, Asaf; Grigoriev, Igor
2015-03-19
Transcription factors (TFs) are proteins that regulate the transcription of genes, by binding to specific DNA sequences. Based on literature (Shelest, 2008; Weirauch and Hughes,2011) collected and manually curated list of DBD Pfam domains (in total 62 DBD domains) We looked for distribution of TFs in 395 fungal genomes plus additionally in plant genomes (Phytozome), prokaryotes(IMG), some animals/metazoans and protists genomes
The expanding universe of p53 targets.
Menendez, Daniel; Inga, Alberto; Resnick, Michael A
2009-10-01
The p53 tumour suppressor is modified through mutation or changes in expression in most cancers, leading to the altered regulation of hundreds of genes that are directly influenced by this sequence-specific transcription factor. Central to the p53 master regulatory network are the target response element (RE) sequences. The extent of p53 transactivation and transcriptional repression is influenced by many factors, including p53 levels, cofactors and the specific RE sequences, all of which contribute to the role that p53 has in the aetiology of cancer. This Review describes the identification and functionality of REs and highlights the inclusion of non-canonical REs that expand the universe of genes and regulation mechanisms in the p53 tumour suppressor network.
Chen, Zhen-Yong; Guo, Xiao-Jiang; Chen, Zhong-Xu; Chen, Wei-Ying; Wang, Ji-Rui
2017-06-01
The binding sites of transcription factors (TFs) in upstream DNA regions are called transcription factor binding sites (TFBSs). TFBSs are important elements for regulating gene expression. To date, there have been few studies on the profiles of TFBSs in plants. In total, 4,873 sequences with 5' upstream regions from 8530 wheat fl-cDNA sequences were used to predict TFBSs. We found 4572 TFBSs for the MADS TF family, which was twice as many as for bHLH (1951), B3 (1951), HB superfamily (1914), ERF (1820), and AP2/ERF (1725) TFs, and was approximately four times higher than the remaining TFBS types. The percentage of TFBSs and TF members showed a distinct distribution in different tissues. Overall, the distribution of TFBSs in the upstream regions of wheat fl-cDNA sequences had significant difference. Meanwhile, high frequencies of some types of TFBSs were found in specific regions in the upstream sequences. Both TFs and fl-cDNA with TFBSs predicted in the same tissues exhibited specific distribution preferences for regulating gene expression. The tissue-specific analysis of TFs and fl-cDNA with TFBSs provides useful information for functional research, and can be used to identify relationships between tissue-specific TFs and fl-cDNA with TFBSs. Moreover, the positional distribution of TFBSs indicates that some types of wheat TFBS have different positional distribution preferences in the upstream regions of genes.
Schnapp, A; Clos, J; Hädelt, W; Schreck, R; Cvekl, A; Grummt, I
1990-03-25
The murine ribosomal gene promoter contains two cis-acting control elements which operate in concert to promote efficient and accurate transcription initiation by RNA polymerase I. The start site proximal core element which is indispensable for promoter recognition by RNA polymerase I (pol I) encompasses sequences from position -39 to -1. An upstream control element (UCE) which is located between nucleotides -142 and -112 stimulates the efficiency of transcription initiation both in vivo and in vitro. Here we report the isolation and functional characterization of a specific rDNA binding protein, the transcription initiation factor TIF-IB, which specifically interacts with the core region of the mouse ribosomal RNA gene promoter. Highly purified TIF-IB complements transcriptional activity in the presence of two other essential initiation factors TIF-IA and TIF-IC. We demonstrate that the binding efficiency of purified TIF-IB to the core promoter is strongly enhanced by the presence in cis of the UCE. This positive effect of upstream sequences on TIF-IB binding is observed throughout the purification procedure suggesting that the synergistic action of the two distant promoter elements is not mediated by a protein different from TIF-IB. Increasing the distance between both control elements still facilitates stable factor binding but eliminates transcriptional activation. The results demonstrate that TIF-IB binding to the rDNA promoter is an essential early step in the assembly of a functional transcription initiation complex. The subsequent interaction of TIF-IB with other auxiliary transcription initiation factors, however, requires the correct spacing between the UCE and the core promoter element.
Methods and compositions for regulating gene expression in plant cells
NASA Technical Reports Server (NTRS)
Dai, Shunhong (Inventor); Beachy, Roger N. (Inventor); Luis, Maria Isabel Ordiz (Inventor)
2010-01-01
Novel chimeric plant promoter sequences are provided, together with plant gene expression cassettes comprising such sequences. In certain preferred embodiments, the chimeric plant promoters comprise the BoxII cis element and/or derivatives thereof. In addition, novel transcription factors are provided, together with nucleic acid sequences encoding such transcription factors and plant gene expression cassettes comprising such nucleic acid sequences. In certain preferred embodiments, the novel transcription factors comprise the acidic domain, or fragments thereof, of the RF2a transcription factor. Methods for using the chimeric plant promoter sequences and novel transcription factors in regulating the expression of at least one gene of interest are provided, together with transgenic plants comprising such chimeric plant promoter sequences and novel transcription factors.
Sun, D X; Cabrera-Martinez, R M; Setlow, P
1991-05-01
The Bacillus subtilis spoIIIG gene codes for a sigma factor termed sigma G which directs transcription of genes expressed only in the forespore compartment of the sporulating cell. Use of spoIIIG-lacZ transcriptional fusions showed that spoIIIG is cotranscribed with the spoIIG operon beginning at t0.5-1 of sporulation. However, this large mRNA produced little if any sigma G, and transferring the spoIIIG gene without the spoIIG promoter into the amyE locus resulted in a Spo+ phenotype. Significant translation of spoIIIG began at t2.5-3 with use of an mRNA whose 5' end is just upstream of the spoIIIG coding sequence. Synthesis of this spoIIIG-specific mRNA was not abolished by a deletion in spoIIIG itself. Similar results were obtained when a spoIIIG-lacZ translational fusion lacking the spoIIG promoter was integrated at the amyE locus. These data suggest that synthesis of sigma G is dependent neither on transcription from the spoIIG promoter nor on sigma G itself but can be due to another transcription factor. This transcription factor may be sigma F, the product of the spoIIAC locus, since a spoIIAC mutation blocked spoIIIG expression, and sequences upstream of the 5' end of the spoIIIG-specific mRNA agree well with the recognition sequence for sigma F. RNA polymerase containing sigma F (E sigma F) initiated transcription in vitro on a spoIIIG template at the 5' end found in vivo, as did E sigma G. However, E sigma F showed a greater than 20-fold preference for spoIIIG over a known sigma G-dependent gene compared with the activity of E sigma G.
Itoh, S; Yanagimoto, T; Tagawa, S; Hashimoto, H; Kitamura, R; Nakajima, Y; Okochi, T; Fujimoto, S; Uchino, J; Kamataki, T
1992-03-24
P-450IIIA7 is a form of cytochrome P-450 which was isolated from human fetal livers and termed P-450HFLa. This form has been clarified to be expressed during fetal life specifically (Komori, M., Nishio, K., Kitada, M., Shiramatsu, K., Muroya, K., Soma, M., Nagashima, K. and Kamataki, T. (1990) Biochemistry 29, 4430-4433). In the present study, we isolated five independent clones which probably corresponded to the human P-450IIIA7 gene. These clones were completely sequenced, all exons, exon-intron junctions and the 5' flanking region from the cap site to-869. Although the sequences in the coding region were completely identical to P-450IIIA7, it is possible that genomic fragments sequenced in this study encode portions of other P-450IIIA7-related genes since we could not obtain a complete overlapping set of genomic clones. Within its 5' flanking sequence, the putative binding sites of several transcriptional regulatory factors existed. Among them, it was shown that a basic transcription element binding factor (BTEB) actually interacted with the 5' flanking region of this gene.
Arimbasseri, Aneeshkumar G.; Maraia, Richard J.
2015-01-01
SUMMARY Understanding the mechanism of transcription termination by a eukaryotic RNA polymerase (RNAP) has been limited by lack of a characterizable intermediate that reflects transition from an elongation complex to a true termination event. While other multisubunit RNAPs require multipartite cis-signals and/or ancillary factors to mediate pausing and release of the nascent transcript from the clutches of these enzymes, RNAP III does so with precision and efficiency on a simple oligo(dT) tract, independent of other cis-elements or trans-factors. We report a RNAP III pre-termination complex that reveals termination mechanisms controlled by sequence-specific elements in the non-template strand. Furthermore, the TFIIF-like, RNAP III subunit, C37 is required for this function of the non-template strand signal. The results reveal the RNAP III terminator as an information-rich control element. While the template strand promotes destabilization via a weak oligo(rU:dA) hybrid, the non-template strand provides distinct sequence-specific destabilizing information through interactions with the C37 subunit. PMID:25959395
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mangelsen, Elke; Kilian, Joachim; Berendzen, Kenneth W.
2008-02-01
WRKY proteins belong to the WRKY-GCM1 superfamily of zinc finger transcription factors that have been subject to a large plant-specific diversification. For the cereal crop barley (Hordeum vulgare), three different WRKY proteins have been characterized so far, as regulators in sucrose signaling, in pathogen defense, and in response to cold and drought, respectively. However, their phylogenetic relationship remained unresolved. In this study, we used the available sequence information to identify a minimum number of 45 barley WRKY transcription factor (HvWRKY) genes. According to their structural features the HvWRKY factors were classified into the previously defined polyphyletic WRKY subgroups 1 tomore » 3. Furthermore, we could assign putative orthologs of the HvWRKY proteins in Arabidopsis and rice. While in most cases clades of orthologous proteins were formed within each group or subgroup, other clades were composed of paralogous proteins for the grasses and Arabidopsis only, which is indicative of specific gene radiation events. To gain insight into their putative functions, we examined expression profiles of WRKY genes from publicly available microarray data resources and found group specific expression patterns. While putative orthologs of the HvWRKY transcription factors have been inferred from phylogenetic sequence analysis, we performed a comparative expression analysis of WRKY genes in Arabidopsis and barley. Indeed, highly correlative expression profiles were found between some of the putative orthologs. HvWRKY genes have not only undergone radiation in monocot or dicot species, but exhibit evolutionary traits specific to grasses. HvWRKY proteins exhibited not only sequence similarities between orthologs with Arabidopsis, but also relatedness in their expression patterns. This correlative expression is indicative for a putative conserved function of related WRKY proteins in mono- and dicot species.« less
Turatsinze, Jean-Valery; Thomas-Chollier, Morgane; Defrance, Matthieu; van Helden, Jacques
2008-01-01
This protocol shows how to detect putative cis-regulatory elements and regions enriched in such elements with the regulatory sequence analysis tools (RSAT) web server (http://rsat.ulb.ac.be/rsat/). The approach applies to known transcription factors, whose binding specificity is represented by position-specific scoring matrices, using the program matrix-scan. The detection of individual binding sites is known to return many false predictions. However, results can be strongly improved by estimating P value, and by searching for combinations of sites (homotypic and heterotypic models). We illustrate the detection of sites and enriched regions with a study case, the upstream sequence of the Drosophila melanogaster gene even-skipped. This protocol is also tested on random control sequences to evaluate the reliability of the predictions. Each task requires a few minutes of computation time on the server. The complete protocol can be executed in about one hour.
Perdomo-Sabogal, Alvaro; Nowick, Katja; Piccini, Ilaria; Sudbrak, Ralf; Lehrach, Hans; Yaspo, Marie-Laure; Warnatz, Hans-Jörg; Querfurth, Robert
2016-01-01
A substantial fraction of phenotypic differences between closely related species are likely caused by differences in gene regulation. While this has already been postulated over 30 years ago, only few examples of evolutionary changes in gene regulation have been verified. Here, we identified and investigated binding sites of the transcription factor GA-binding protein alpha (GABPa) aiming to discover cis-regulatory adaptations on the human lineage. By performing chromatin immunoprecipitation-sequencing experiments in a human cell line, we found 11,619 putative GABPa binding sites. Through sequence comparisons of the human GABPa binding regions with orthologous sequences from 34 mammals, we identified substitutions that have resulted in 224 putative human-specific GABPa binding sites. To experimentally assess the transcriptional impact of those substitutions, we selected four promoters for promoter-reporter gene assays using human and African green monkey cells. We compared the activities of wild-type promoters to mutated forms, where we have introduced one or more substitutions to mimic the ancestral state devoid of the GABPa consensus binding sequence. Similarly, we introduced the human-specific substitutions into chimpanzee and macaque promoter backgrounds. Our results demonstrate that the identified substitutions are functional, both in human and nonhuman promoters. In addition, we performed GABPa knock-down experiments and found 1,215 genes as strong candidates for primary targets. Further analyses of our data sets link GABPa to cognitive disorders, diabetes, KRAB zinc finger (KRAB-ZNF), and human-specific genes. Thus, we propose that differences in GABPa binding sites played important roles in the evolution of human-specific phenotypes. PMID:26814189
Peixoto, Paul; Liu, Yang; Depauw, Sabine; Hildebrand, Marie-Paule; Boykin, David W; Bailly, Christian; Wilson, W David; David-Cordonnier, Marie-Hélène
2008-06-01
The development of small molecules to control gene expression could be the spearhead of future-targeted therapeutic approaches in multiple pathologies. Among heterocyclic dications developed with this aim, a phenyl-furan-benzimidazole dication DB293 binds AT-rich sites as a monomer and 5'-ATGA sequence as a stacked dimer, both in the minor groove. Here, we used a protein/DNA array approach to evaluate the ability of DB293 to specifically inhibit transcription factors DNA-binding in a single-step, competitive mode. DB293 inhibits two POU-domain transcription factors Pit-1 and Brn-3 but not IRF-1, despite the presence of an ATGA and AT-rich sites within all three consensus sequences. EMSA, DNase I footprinting and surface-plasmon-resonance experiments determined the precise binding site, affinity and stoichiometry of DB293 interaction to the consensus targets. Binding of DB293 occurred as a cooperative dimer on the ATGA part of Brn-3 site but as two monomers on AT-rich sites of IRF-1 sequence. For Pit-1 site, ATGA or AT-rich mutated sequences identified the contribution of both sites for DB293 recognition. In conclusion, DB293 is a strong inhibitor of two POU-domain transcription factors through a cooperative binding to ATGA. These findings are the first to show that heterocyclic dications can inhibit major groove transcription factors and they open the door to the control of transcription factors activity by those compounds.
van Verk, Marcel C; Pappaioannou, Dimitri; Neeleman, Lyda; Bol, John F; Linthorst, Huub J M
2008-04-01
PR-1a is a salicylic acid-inducible defense gene of tobacco (Nicotiana tabacum). One-hybrid screens identified a novel tobacco WRKY transcription factor (NtWRKY12) with specific binding sites in the PR-1a promoter at positions -564 (box WK(1)) and -859 (box WK(2)). NtWRKY12 belongs to the class of transcription factors in which the WRKY sequence is followed by a GKK rather than a GQK sequence. The binding sequence of NtWRKY12 (WK box TTTTCCAC) deviated significantly from the consensus sequence (W box TTGAC[C/T]) shown to be recognized by WRKY factors with the GQK sequence. Mutation of the GKK sequence in NtWRKY12 into GQK or GEK abolished binding to the WK box. The WK(1) box is in close proximity to binding sites in the PR-1a promoter for transcription factors TGA1a (as-1 box) and Myb1 (MBSII box). Expression studies with PR-1a promoterbeta-glucuronidase (GUS) genes in stably and transiently transformed tobacco indicated that NtWRKY12 and TGA1a act synergistically in PR-1a expression induced by salicylic acid and bacterial elicitors. Cotransfection of Arabidopsis thaliana protoplasts with 35SNtWRKY12 and PR-1aGUS promoter fusions showed that overexpression of NtWRKY12 resulted in a strong increase in GUS expression, which required functional WK boxes in the PR-1a promoter.
Mariani, Luca; Weinand, Kathryn; Vedenko, Anastasia; Barrera, Luis A; Bulyk, Martha L
2017-09-27
Transcription factors (TFs) control cellular processes by binding specific DNA motifs to modulate gene expression. Motif enrichment analysis of regulatory regions can identify direct and indirect TF binding sites. Here, we created a glossary of 108 non-redundant TF-8mer "modules" of shared specificity for 671 metazoan TFs from publicly available and new universal protein binding microarray data. Analysis of 239 ENCODE TF chromatin immunoprecipitation sequencing datasets and associated RNA sequencing profiles suggest the 8mer modules are more precise than position weight matrices in identifying indirect binding motifs and their associated tethering TFs. We also developed GENRE (genomically equivalent negative regions), a tunable tool for construction of matched genomic background sequences for analysis of regulatory regions. GENRE outperformed four state-of-the-art approaches to background sequence construction. We used our TF-8mer glossary and GENRE in the analysis of the indirect binding motifs for the co-occurrence of tethering factors, suggesting novel TF-TF interactions. We anticipate that these tools will aid in elucidating tissue-specific gene-regulatory programs. Copyright © 2017 Elsevier Inc. All rights reserved.
Kovacs, A; Kandala, J C; Weber, K T; Guntaka, R V
1996-01-19
Type I and III fibrillar collagens are the major structural proteins of the extracellular matrix found in various organs including the myocardium. Abnormal and progressive accumulation of fibrillar type I collagen in the interstitial spaces compromises organ function and therefore, the study of transcriptional regulation of this gene and specific targeting of its expression is of major interest. Transient transfection of adult cardiac fibroblasts indicate that the polypurine-polypyrimidine sequence of alpha 1(I) collagen promoter between nucleotides - 200 and -140 represents an overall positive regulatory element. DNase I footprinting and electrophoretic mobility shift assays suggest that multiple factors bind to different elements of this promoter region. We further demonstrate that the unique polypyrimidine sequence between -172 and -138 of the promoter represents a suitable target for a single-stranded polypurine oligonucleotide (TFO) to form a triple helix DNA structure. Modified electrophoretic mobility shift assays show that this TFO specifically inhibits the protein-DNA interaction within the target region. In vitro transcription assays and transient transfection experiments demonstrate that the transcriptional activity of the promoter is inhibited by this oligonucleotide. We propose that TFOs represent a therapeutic potential to specifically influence the expression of alpha 1(I) collagen gene in various disease states where abnormal type I collagen accumulation is known to occur.
The RNA Export Factor, Nxt1, Is Required for Tissue Specific Transcriptional Regulation
Jiang, Jianqiao; White-Cooper, Helen
2013-01-01
The highly conserved, Nxf/Nxt (TAP/p15) RNA nuclear export pathway is important for export of most mRNAs from the nucleus, by interacting with mRNAs and promoting their passage through nuclear pores. Nxt1 is essential for viability; using a partial loss of function allele, we reveal a role for this gene in tissue specific transcription. We show that many Drosophila melanogaster testis-specific mRNAs require Nxt1 for their accumulation. The transcripts that require Nxt1 also depend on a testis-specific transcription complex, tMAC. We show that loss of Nxt1 leads to reduced transcription of tMAC targets. A reporter transcript from a tMAC-dependent promoter is under-expressed in Nxt1 mutants, however the same transcript accumulates in mutants if driven by a tMAC-independent promoter. Thus, in Drosophila primary spermatocytes, the transcription factor used to activate expression of a transcript, rather than the RNA sequence itself or the core transcription machinery, determines whether this expression requires Nxt1. We additionally find that transcripts from intron-less genes are more sensitive to loss of Nxt1 function than those from intron-containing genes and propose a mechanism in which transcript processing feeds back to increase activity of a tissue specific transcription complex. PMID:23754955
Identifying transcription factor functions and targets by phenotypic activation
Chua, Gordon; Morris, Quaid D.; Sopko, Richelle; Robinson, Mark D.; Ryan, Owen; Chan, Esther T.; Frey, Brendan J.; Andrews, Brenda J.; Boone, Charles; Hughes, Timothy R.
2006-01-01
Mapping transcriptional regulatory networks is difficult because many transcription factors (TFs) are activated only under specific conditions. We describe a generic strategy for identifying genes and pathways induced by individual TFs that does not require knowledge of their normal activation cues. Microarray analysis of 55 yeast TFs that caused a growth phenotype when overexpressed showed that the majority caused increased transcript levels of genes in specific physiological categories, suggesting a mechanism for growth inhibition. Induced genes typically included established targets and genes with consensus promoter motifs, if known, indicating that these data are useful for identifying potential new target genes and binding sites. We identified the sequence 5′-TCACGCAA as a binding sequence for Hms1p, a TF that positively regulates pseudohyphal growth and previously had no known motif. The general strategy outlined here presents a straightforward approach to discovery of TF activities and mapping targets that could be adapted to any organism with transgenic technology. PMID:16880382
Luo, Shengzhan D.; Baker, Bruce S.
2015-01-01
“Regulatory evolution,” that is, changes in a gene’s expression pattern through changes at its regulatory sequence, rather than changes at the coding sequence of the gene or changes of the upstream transcription factors, has been increasingly recognized as a pervasive evolution mechanism. Many somatic sexually dimorphic features of Drosophila melanogaster are the results of gene expression regulated by the doublesex (dsx) gene, which encodes sex-specific transcription factors (DSXF in females and DSXM in males). Rapid changes in such sexually dimorphic features are likely a result of changes at the regulatory sequence of the target genes. We focused on the Flavin-containing monooxygenase-2 (Fmo-2) gene, a likely direct dsx target, to elucidate how sexually dimorphic expression and its evolution are brought about. We found that dsx is deployed to regulate the Fmo-2 transcription both in the midgut and in fat body cells of the spermatheca (a female-specific tissue), through a canonical DSX-binding site in the Fmo-2 regulatory sequence. In the melanogaster group, Fmo-2 transcription in the midgut has evolved rapidly, in contrast to the conserved spermathecal transcription. We identified two cis-regulatory modules (CRM-p and CRM-d) that direct sexually monomorphic or dimorphic Fmo-2 transcription, respectively, in the midguts of these species. Changes of Fmo-2 transcription in the midgut from sexually dimorphic to sexually monomorphic in some species are caused by the loss of CRM-d function, but not the loss of the canonical DSX-binding site. Thus, conferring transcriptional regulation on a CRM level allows the regulation to evolve rapidly in one tissue while evading evolutionary constraints posed by other tissues. PMID:25675536
Walker, M D; Park, C W; Rosen, A; Aronheim, A
1990-01-01
Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. Images PMID:2181401
Deciphering the combinatorial architecture of a Drosophila homeotic gene enhancer
Drewell, Robert A.; Nevarez, Michael J.; Kurata, Jessica S.; Winkler, Lauren N.; Li, Lily; Dresch, Jacqueline M.
2013-01-01
Summary In Drosophila, the 330 kb bithorax complex regulates cellular differentiation along the anterio-posterior axis during development in the thorax and abdomen and is comprised of three homeotic genes: Ultrabithorax, abdominal-A, and Abdominal-B. The expression of each of these genes is in turn controlled through interactions between transcription factors and a number of cis-regulatory modules in the neighboring intergenic regions. In this study, we examine how the sequence architecture of transcription factor binding sites mediates the functional activity of one of these cis-regulatory modules. Using computational, mathematical modeling and experimental molecular genetic approaches we investigate the IAB7b enhancer, which regulates Abdominal-B expression specifically in the presumptive seventh and ninth abdominal segments of the early embryo. A cross-species comparison of the IAB7b enhancer reveals an evolutionarily conserved signature motif containing two FUSHI-TARAZU activator transcription factor binding sites. We find that the transcriptional repressors KNIRPS, KRUPPEL and GIANT are able to restrict reporter gene expression to the posterior abdominal segments, using different molecular mechanisms including short-range repression and competitive binding. Additionally, we show the functional importance of the spacing between the two FUSHI-TARAZU binding sites and discuss the potential importance of cooperativity for transcriptional activation. Our results demonstrate that the transcriptional output of the IAB7b cis-regulatory module relies on a complex set of combinatorial inputs mediated by specific transcription factor binding and that the sequence architecture at this enhancer is critical to maintain robust regulatory function. PMID:24514265
kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic data sets
Fletez-Brant, Christopher; Lee, Dongwon; McCallion, Andrew S.; Beer, Michael A.
2013-01-01
Massively parallel sequencing technologies have made the generation of genomic data sets a routine component of many biological investigations. For example, Chromatin immunoprecipitation followed by sequence assays detect genomic regions bound (directly or indirectly) by specific factors, and DNase-seq identifies regions of open chromatin. A major bottleneck in the interpretation of these data is the identification of the underlying DNA sequence code that defines, and ultimately facilitates prediction of, these transcription factor (TF) bound or open chromatin regions. We have recently developed a novel computational methodology, which uses a support vector machine (SVM) with kmer sequence features (kmer-SVM) to identify predictive combinations of short transcription factor-binding sites, which determine the tissue specificity of these genomic assays (Lee, Karchin and Beer, Discriminative prediction of mammalian enhancers from DNA sequence. Genome Res. 2011; 21:2167–80). This regulatory information can (i) give confidence in genomic experiments by recovering previously known binding sites, and (ii) reveal novel sequence features for subsequent experimental testing of cooperative mechanisms. Here, we describe the development and implementation of a web server to allow the broader research community to independently apply our kmer-SVM to analyze and interpret their genomic datasets. We analyze five recently published data sets and demonstrate how this tool identifies accessory factors and repressive sequence elements. kmer-SVM is available at http://kmersvm.beerlab.org. PMID:23771147
kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic data sets.
Fletez-Brant, Christopher; Lee, Dongwon; McCallion, Andrew S; Beer, Michael A
2013-07-01
Massively parallel sequencing technologies have made the generation of genomic data sets a routine component of many biological investigations. For example, Chromatin immunoprecipitation followed by sequence assays detect genomic regions bound (directly or indirectly) by specific factors, and DNase-seq identifies regions of open chromatin. A major bottleneck in the interpretation of these data is the identification of the underlying DNA sequence code that defines, and ultimately facilitates prediction of, these transcription factor (TF) bound or open chromatin regions. We have recently developed a novel computational methodology, which uses a support vector machine (SVM) with kmer sequence features (kmer-SVM) to identify predictive combinations of short transcription factor-binding sites, which determine the tissue specificity of these genomic assays (Lee, Karchin and Beer, Discriminative prediction of mammalian enhancers from DNA sequence. Genome Res. 2011; 21:2167-80). This regulatory information can (i) give confidence in genomic experiments by recovering previously known binding sites, and (ii) reveal novel sequence features for subsequent experimental testing of cooperative mechanisms. Here, we describe the development and implementation of a web server to allow the broader research community to independently apply our kmer-SVM to analyze and interpret their genomic datasets. We analyze five recently published data sets and demonstrate how this tool identifies accessory factors and repressive sequence elements. kmer-SVM is available at http://kmersvm.beerlab.org.
Architecture of the human regulatory network derived from ENCODE data.
Gerstein, Mark B; Kundaje, Anshul; Hariharan, Manoj; Landt, Stephen G; Yan, Koon-Kiu; Cheng, Chao; Mu, Xinmeng Jasmine; Khurana, Ekta; Rozowsky, Joel; Alexander, Roger; Min, Renqiang; Alves, Pedro; Abyzov, Alexej; Addleman, Nick; Bhardwaj, Nitin; Boyle, Alan P; Cayting, Philip; Charos, Alexandra; Chen, David Z; Cheng, Yong; Clarke, Declan; Eastman, Catharine; Euskirchen, Ghia; Frietze, Seth; Fu, Yao; Gertz, Jason; Grubert, Fabian; Harmanci, Arif; Jain, Preti; Kasowski, Maya; Lacroute, Phil; Leng, Jing Jane; Lian, Jin; Monahan, Hannah; O'Geen, Henriette; Ouyang, Zhengqing; Partridge, E Christopher; Patacsil, Dorrelyn; Pauli, Florencia; Raha, Debasish; Ramirez, Lucia; Reddy, Timothy E; Reed, Brian; Shi, Minyi; Slifer, Teri; Wang, Jing; Wu, Linfeng; Yang, Xinqiong; Yip, Kevin Y; Zilberman-Schapira, Gili; Batzoglou, Serafim; Sidow, Arend; Farnham, Peggy J; Myers, Richard M; Weissman, Sherman M; Snyder, Michael
2012-09-06
Transcription factors bind in a combinatorial fashion to specify the on-and-off states of genes; the ensemble of these binding events forms a regulatory network, constituting the wiring diagram for a cell. To examine the principles of the human transcriptional regulatory network, we determined the genomic binding information of 119 transcription-related factors in over 450 distinct experiments. We found the combinatorial, co-association of transcription factors to be highly context specific: distinct combinations of factors bind at specific genomic locations. In particular, there are significant differences in the binding proximal and distal to genes. We organized all the transcription factor binding into a hierarchy and integrated it with other genomic information (for example, microRNA regulation), forming a dense meta-network. Factors at different levels have different properties; for instance, top-level transcription factors more strongly influence expression and middle-level ones co-regulate targets to mitigate information-flow bottlenecks. Moreover, these co-regulations give rise to many enriched network motifs (for example, noise-buffering feed-forward loops). Finally, more connected network components are under stronger selection and exhibit a greater degree of allele-specific activity (that is, differential binding to the two parental alleles). The regulatory information obtained in this study will be crucial for interpreting personal genome sequences and understanding basic principles of human biology and disease.
Martin, Elizabeth M.; Fry, Rebecca C.
2016-01-01
Abstract A biological mechanism by which exposure to environmental contaminants results in gene-specific CpG methylation patterning is currently unknown. We hypothesize that gene-specific CpG methylation is related to environmentally perturbed transcription factor occupancy. To test this hypothesis, a database of 396 genes with altered CpG methylation either in cord blood leukocytes or placental tissue was compiled from 14 studies representing assessments of six environmental contaminants. Subsequently, an in silico approach was used to identify transcription factor binding sites enriched among the genes with altered CpG methylation in relationship to the suite of environmental contaminants. For each study, the sequences of the promoter regions (representing −1000 to +500 bp from the transcription start site) of all genes with altered CpG methylation were analyzed for enrichment of transcription factor binding sites. Binding sites for a total of 56 unique transcription factors were identified to be enriched within the promoter regions of the genes. Binding sites for the Kidney-Enriched Krupple-like Factor 15, a known responder to endogenous stress, were enriched ( P < 0.001–0.041) among the genes with altered CpG methylation associated for five of the six environmental contaminants. These data support the transcription factor occupancy theory as a potential mechanism underlying environmentally-induced gene-specific CpG methylation. PMID:27066266
Devaux, B; Albrecht, G; Kedinger, C
1987-01-01
Genomic DNase I footprinting was used to compare specific protein binding to the adenovirus type 5 early, EIa-inducible, EIIa promoter. Identical protection patterns of the promoter region were observed whether EIIa transcription was undetectable or fully induced. These results suggest that EIa-mediated transcriptional induction does not increase binding of limiting transcription factors to the promoter but rather that transactivation results from the proper interactions between factors already bound to their cognate sequences. Images PMID:2963956
Kim, Taehyung; Tyndel, Marc S; Huang, Haiming; Sidhu, Sachdev S; Bader, Gary D; Gfeller, David; Kim, Philip M
2012-03-01
Peptide recognition domains and transcription factors play crucial roles in cellular signaling. They bind linear stretches of amino acids or nucleotides, respectively, with high specificity. Experimental techniques that assess the binding specificity of these domains, such as microarrays or phage display, can retrieve thousands of distinct ligands, providing detailed insight into binding specificity. In particular, the advent of next-generation sequencing has recently increased the throughput of such methods by several orders of magnitude. These advances have helped reveal the presence of distinct binding specificity classes that co-exist within a set of ligands interacting with the same target. Here, we introduce a software system called MUSI that can rapidly analyze very large data sets of binding sequences to determine the relevant binding specificity patterns. Our pipeline provides two major advances. First, it can detect previously unrecognized multiple specificity patterns in any data set. Second, it offers integrated processing of very large data sets from next-generation sequencing machines. The results are visualized as multiple sequence logos describing the different binding preferences of the protein under investigation. We demonstrate the performance of MUSI by analyzing recent phage display data for human SH3 domains as well as microarray data for mouse transcription factors.
High-resolution mapping of transcription factor binding sites on native chromatin
Kasinathan, Sivakanthan; Orsi, Guillermo A.; Zentner, Gabriel E.; Ahmad, Kami; Henikoff, Steven
2014-01-01
Sequence-specific DNA-binding proteins including transcription factors (TFs) are key determinants of gene regulation and chromatin architecture. Formaldehyde cross-linking and sonication followed by Chromatin ImmunoPrecipitation (X-ChIP) is widely used for profiling of TF binding, but is limited by low resolution and poor specificity and sensitivity. We present a simple protocol that starts with micrococcal nuclease-digested uncross-linked chromatin and is followed by affinity purification of TFs and paired-end sequencing. The resulting ORGANIC (Occupied Regions of Genomes from Affinity-purified Naturally Isolated Chromatin) profiles of Saccharomyces cerevisiae Abf1 and Reb1 provide highly accurate base-pair resolution maps that are not biased toward accessible chromatin, and do not require input normalization. We also demonstrate the high specificity of our method when applied to larger genomes by profiling Drosophila melanogaster GAGA Factor and Pipsqueak. Our results suggest that ORGANIC profiling is a widely applicable high-resolution method for sensitive and specific profiling of direct protein-DNA interactions. PMID:24336359
Predicting the binding preference of transcription factors to individual DNA k-mers.
Alleyne, Trevis M; Peña-Castillo, Lourdes; Badis, Gwenael; Talukder, Shaheynoor; Berger, Michael F; Gehrke, Andrew R; Philippakis, Anthony A; Bulyk, Martha L; Morris, Quaid D; Hughes, Timothy R
2009-04-15
Recognition of specific DNA sequences is a central mechanism by which transcription factors (TFs) control gene expression. Many TF-binding preferences, however, are unknown or poorly characterized, in part due to the difficulty associated with determining their specificity experimentally, and an incomplete understanding of the mechanisms governing sequence specificity. New techniques that estimate the affinity of TFs to all possible k-mers provide a new opportunity to study DNA-protein interaction mechanisms, and may facilitate inference of binding preferences for members of a given TF family when such information is available for other family members. We employed a new dataset consisting of the relative preferences of mouse homeodomains for all eight-base DNA sequences in order to ask how well we can predict the binding profiles of homeodomains when only their protein sequences are given. We evaluated a panel of standard statistical inference techniques, as well as variations of the protein features considered. Nearest neighbour among functionally important residues emerged among the most effective methods. Our results underscore the complexity of TF-DNA recognition, and suggest a rational approach for future analyses of TF families.
Chen, Dana; Orenstein, Yaron; Golodnitsky, Rada; Pellach, Michal; Avrahami, Dorit; Wachtel, Chaim; Ovadia-Shochat, Avital; Shir-Shapira, Hila; Kedmi, Adi; Juven-Gershon, Tamar; Shamir, Ron; Gerber, Doron
2016-01-01
Transcription factors (TFs) alter gene expression in response to changes in the environment through sequence-specific interactions with the DNA. These interactions are best portrayed as a landscape of TF binding affinities. Current methods to study sequence-specific binding preferences suffer from limited dynamic range, sequence bias, lack of specificity and limited throughput. We have developed a microfluidic-based device for SELEX Affinity Landscape MAPping (SELMAP) of TF binding, which allows high-throughput measurement of 16 proteins in parallel. We used it to measure the relative affinities of Pho4, AtERF2 and Btd full-length proteins to millions of different DNA binding sites, and detected both high and low-affinity interactions in equilibrium conditions, generating a comprehensive landscape of the relative TF affinities to all possible DNA 6-mers, and even DNA10-mers with increased sequencing depth. Low quantities of both the TFs and DNA oligomers were sufficient for obtaining high-quality results, significantly reducing experimental costs. SELMAP allows in-depth screening of hundreds of TFs, and provides a means for better understanding of the regulatory processes that govern gene expression. PMID:27628341
Quantifying the Effect of DNA Packaging on Gene Expression Level
NASA Astrophysics Data System (ADS)
Kim, Harold
2010-10-01
Gene expression, the process by which the genetic code comes alive in the form of proteins, is one of the most important biological processes in living cells, and begins when transcription factors bind to specific DNA sequences in the promoter region upstream of a gene. The relationship between gene expression output and transcription factor input which is termed the gene regulation function is specific to each promoter, and predicting this gene regulation function from the locations of transcription factor binding sites is one of the challenges in biology. In eukaryotic organisms (for example, animals, plants, fungi etc), DNA is highly compacted into nucleosomes, 147-bp segments of DNA tightly wrapped around histone protein core, and therefore, the accessibility of transcription factor binding sites depends on their locations with respect to nucleosomes - sites inside nucleosomes are less accessible than those outside nucleosomes. To understand how transcription factor binding sites contribute to gene expression in a quantitative manner, we obtain gene regulation functions of promoters with various configurations of transcription factor binding sites by using fluorescent protein reporters to measure transcription factor input and gene expression output in single yeast cells. In this talk, I will show that the affinity of a transcription factor binding site inside and outside the nucleosome controls different aspects of the gene regulation function, and explain this finding based on a mass-action kinetic model that includes competition between nucleosomes and transcription factors.
Nuclear factor ETF specifically stimulates transcription from promoters without a TATA box.
Kageyama, R; Merlino, G T; Pastan, I
1989-09-15
Transcription factor ETF stimulates the expression of the epidermal growth factor receptor (EGFR) gene which does not have a TATA box in the promoter region. Here, we show that ETF recognizes various GC-rich sequences including stretches of deoxycytidine or deoxyguanosine residues and GC boxes with similar affinities. ETF also binds to TATA boxes but with a lower affinity. ETF stimulated in vitro transcription from several promoters without TATA boxes but had little or no effect on TATA box-containing promoters even though they had strong ETF-binding sites. These inactive ETF-binding sites became functional when placed upstream of the EGFR promoter whose own ETF-binding sites were removed. Furthermore, when a TATA box was introduced into the EGFR promoter, the responsiveness to ETF was abolished. These results indicate that ETF is a specific transcription factor for promoters which do not contain TATA elements.
Overview Article: Identifying transcriptional cis-regulatory modules in animal genomes
Suryamohan, Kushal; Halfon, Marc S.
2014-01-01
Gene expression is regulated through the activity of transcription factors and chromatin modifying proteins acting on specific DNA sequences, referred to as cis-regulatory elements. These include promoters, located at the transcription initiation sites of genes, and a variety of distal cis-regulatory modules (CRMs), the most common of which are transcriptional enhancers. Because regulated gene expression is fundamental to cell differentiation and acquisition of new cell fates, identifying, characterizing, and understanding the mechanisms of action of CRMs is critical for understanding development. CRM discovery has historically been challenging, as CRMs can be located far from the genes they regulate, have few readily-identifiable sequence characteristics, and for many years were not amenable to high-throughput discovery methods. However, the recent availability of complete genome sequences and the development of next-generation sequencing methods has led to an explosion of both computational and empirical methods for CRM discovery in model and non-model organisms alike. Experimentally, CRMs can be identified through chromatin immunoprecipitation directed against transcription factors or histone post-translational modifications, identification of nucleosome-depleted “open” chromatin regions, or sequencing-based high-throughput functional screening. Computational methods include comparative genomics, clustering of known or predicted transcription factor binding sites, and supervised machine-learning approaches trained on known CRMs. All of these methods have proven effective for CRM discovery, but each has its own considerations and limitations, and each is subject to a greater or lesser number of false-positive identifications. Experimental confirmation of predictions is essential, although shortcomings in current methods suggest that additional means of validation need to be developed. PMID:25704908
Liu, Jun-Jun; Xiang, Yu
2011-01-01
WRKY transcription factors are key regulators of numerous biological processes in plant growth and development, as well as plant responses to abiotic and biotic stresses. Research on biological functions of plant WRKY genes has focused in the past on model plant species or species with largely characterized transcriptomes. However, a variety of non-model plants, such as forest conifers, are essential as feed, biofuel, and wood or for sustainable ecosystems. Identification of WRKY genes in these non-model plants is equally important for understanding the evolutionary and function-adaptive processes of this transcription factor family. Because of limited genomic information, the rarity of regulatory gene mRNAs in transcriptomes, and the sequence divergence to model organism genes, identification of transcription factors in non-model plants using methods similar to those generally used for model plants is difficult. This chapter describes a gene family discovery strategy for identification of WRKY transcription factors in conifers by a combination of in silico-based prediction and PCR-based experimental approaches. Compared to traditional cDNA library screening or EST sequencing at transcriptome scales, this integrated gene discovery strategy provides fast, simple, reliable, and specific methods to unveil the WRKY gene family at both genome and transcriptome levels in non-model plants.
Evers, R; Smid, A; Rudloff, U; Lottspeich, F; Grummt, I
1995-03-15
Termination of mouse ribosomal gene transcription by RNA polymerase I (Pol I) requires the specific interaction of a DNA binding protein, mTTF-I, with an 18 bp sequence element located downstream of the rRNA coding region. Here we describe the molecular cloning and functional characterization of the cDNA encoding this transcription termination factor. Recombinant mTTF-I binds specifically to the murine terminator elements and terminates Pol I transcription in a reconstituted in vitro system. Deletion analysis has defined a modular structure of mTTF-I comprising a dispensable N-terminal half, a large C-terminal DNA binding region and an internal domain which is required for transcription termination. Significantly, the C-terminal region of mTTF-I reveals striking homology to the DNA binding domains of the proto-oncogene c-Myb and the yeast transcription factor Reb1p. Site-directed mutagenesis of one of the tryptophan residues that is conserved in the homology region of c-Myb, Reb1p and mTTF-I abolishes specific DNA binding, a finding which underscores the functional relevance of these residues in DNA-protein interactions.
Evers, R; Smid, A; Rudloff, U; Lottspeich, F; Grummt, I
1995-01-01
Termination of mouse ribosomal gene transcription by RNA polymerase I (Pol I) requires the specific interaction of a DNA binding protein, mTTF-I, with an 18 bp sequence element located downstream of the rRNA coding region. Here we describe the molecular cloning and functional characterization of the cDNA encoding this transcription termination factor. Recombinant mTTF-I binds specifically to the murine terminator elements and terminates Pol I transcription in a reconstituted in vitro system. Deletion analysis has defined a modular structure of mTTF-I comprising a dispensable N-terminal half, a large C-terminal DNA binding region and an internal domain which is required for transcription termination. Significantly, the C-terminal region of mTTF-I reveals striking homology to the DNA binding domains of the proto-oncogene c-Myb and the yeast transcription factor Reb1p. Site-directed mutagenesis of one of the tryptophan residues that is conserved in the homology region of c-Myb, Reb1p and mTTF-I abolishes specific DNA binding, a finding which underscores the functional relevance of these residues in DNA-protein interactions. Images PMID:7720715
An RNA motif advances transcription by preventing Rho-dependent termination
Sevostyanova, Anastasia; Groisman, Eduardo A.
2015-01-01
The transcription termination factor Rho associates with most nascent bacterial RNAs as they emerge from RNA polymerase. However, pharmacological inhibition of Rho derepresses only a small fraction of these transcripts. What, then, determines the specificity of Rho-dependent transcription termination? We now report the identification of a Rho-antagonizing RNA element (RARE) that hinders Rho-dependent transcription termination. We establish that RARE traps Rho in an inactive complex but does not prevent Rho binding to its recruitment sites. Although translating ribosomes normally block Rho access to an mRNA, inefficient translation of an open reading frame in the leader region of the Salmonella mgtCBR operon actually enables transcription of its associated coding region by favoring an RNA conformation that sequesters RARE. The discovery of an RNA element that inactivates Rho signifies that the specificity of nucleic-acid binding proteins is defined not only by the sequences that recruit these proteins but also by sequences that antagonize their activity. PMID:26630006
Cistrome of the aldosterone-activated mineralocorticoid receptor in human renal cells.
Le Billan, Florian; Khan, Junaid A; Lamribet, Khadija; Viengchareun, Say; Bouligand, Jérôme; Fagart, Jérôme; Lombès, Marc
2015-09-01
Aldosterone exerts its effects mainly by activating the mineralocorticoid receptor (MR), a transcription factor that regulates gene expression through complex and dynamic interactions with coregulators and transcriptional machinery, leading to fine-tuned control of vectorial ionic transport in the distal nephron. To identify genome-wide aldosterone-regulated MR targets in human renal cells, we set up a chromatin immunoprecipitation (ChIP) assay by using a specific anti-MR antibody in a differentiated human renal cell line expressing green fluorescent protein (GFP)-MR. This approach, coupled with high-throughput sequencing, allowed identification of 974 genomic MR targets. Computational analysis identified an MR response element (MRE) including single or multiple half-sites and palindromic motifs in which the AGtACAgxatGTtCt sequence was the most prevalent motif. Most genomic MR-binding sites (MBSs) are located >10 kb from the transcriptional start sites of target genes (84%). Specific aldosterone-induced recruitment of MR on the first most relevant genomic sequences was further validated by ChIP-quantitative (q)PCR and correlated with concomitant and positive aldosterone-activated transcriptional regulation of the corresponding gene, as assayed by RT-qPCR. It was notable that most MBSs lacked MREs but harbored DNA recognition motifs for other transcription factors (FOX, EGR1, AP1, PAX5) suggesting functional interaction. This work provides new insights into aldosterone MR-mediated renal signaling and opens relevant perspectives for mineralocorticoid-related pathophysiology. © FASEB.
Dynamic maps of UV damage formation and repair for the human genome
Hu, Jinchuan; Adebali, Ogun; Adar, Sheera; Sancar, Aziz
2017-01-01
Formation and repair of UV-induced DNA damage in human cells are affected by cellular context. To study factors influencing damage formation and repair genome-wide, we developed a highly sensitive single-nucleotide resolution damage mapping method [high-sensitivity damage sequencing (HS–Damage-seq)]. Damage maps of both cyclobutane pyrimidine dimers (CPDs) and pyrimidine-pyrimidone (6-4) photoproducts [(6-4)PPs] from UV-irradiated cellular and naked DNA revealed that the effect of transcription factor binding on bulky adducts formation varies, depending on the specific transcription factor, damage type, and strand. We also generated time-resolved UV damage maps of both CPDs and (6-4)PPs by HS–Damage-seq and compared them to the complementary repair maps of the human genome obtained by excision repair sequencing to gain insight into factors that affect UV-induced DNA damage and repair and ultimately UV carcinogenesis. The combination of the two methods revealed that, whereas UV-induced damage is virtually uniform throughout the genome, repair is affected by chromatin states, transcription, and transcription factor binding, in a manner that depends on the type of DNA damage. PMID:28607063
Dynamic maps of UV damage formation and repair for the human genome.
Hu, Jinchuan; Adebali, Ogun; Adar, Sheera; Sancar, Aziz
2017-06-27
Formation and repair of UV-induced DNA damage in human cells are affected by cellular context. To study factors influencing damage formation and repair genome-wide, we developed a highly sensitive single-nucleotide resolution damage mapping method [high-sensitivity damage sequencing (HS-Damage-seq)]. Damage maps of both cyclobutane pyrimidine dimers (CPDs) and pyrimidine-pyrimidone (6-4) photoproducts [(6-4)PPs] from UV-irradiated cellular and naked DNA revealed that the effect of transcription factor binding on bulky adducts formation varies, depending on the specific transcription factor, damage type, and strand. We also generated time-resolved UV damage maps of both CPDs and (6-4)PPs by HS-Damage-seq and compared them to the complementary repair maps of the human genome obtained by excision repair sequencing to gain insight into factors that affect UV-induced DNA damage and repair and ultimately UV carcinogenesis. The combination of the two methods revealed that, whereas UV-induced damage is virtually uniform throughout the genome, repair is affected by chromatin states, transcription, and transcription factor binding, in a manner that depends on the type of DNA damage.
Shozu, M; Zhao, Y; Simpson, E R
1997-12-01
The expression of aromatase, the enzyme responsible for estrogen biosynthesis, has been studied in THP-1 cells of human mononuclear leukemic origin, which exhibit high rates of aromatase activity. These cells have the capacity to differentiate in the presence of vitamin D into cells with osteoclast-like properties. Differentiated cells displayed higher rates of aromatase than undifferentiated cells, and, in both cases, activity was stimulated 10- to 20-fold by dexamethasone. Phorbol esters also increased aromatase activity, but the effect was the same in differentiated as in undifferentiated cells. In a similar fashion to adipose stromal cells, serum potentiated the response to dexamethasone but had no effect on phorbol ester-stimulated activity. By contrast to its action in adipose stromal cells, (Bu)2cAMP markedly inhibited aromatase activity of THP-1 cells, as did factors whose actions are mediated by cAMP, such as PTH and PTH-related peptide. This was true of control cells, as well as of dexamethasone- and phorbol ester-stimulated cells. Previously we have shown that type 1 cytokines as well as tumor necrosis factor-alpha stimulate aromatase activity of adipose stromal cells in the presence of dexamethasone. By contrast, interleukin-6, interleukin-11, and leukemia-inhibitory factor had no effect on aromatase activity of THP-1 cells, whereas tumor necrosis factor-alpha, oncostatin M, and platelet-derived growth factor were slightly inhibitory of aromatase activity. Exon-specific Southern analysis of rapid amplification of cDNA ends-amplified transcripts was employed to examine the distribution of the various 5'-termini of aromatase transcripts. In the control group, most of the clones contained transcripts specific for the proximal promoter II, whereas in dexamethasone-treated cells, most transcripts contained exon I.4. In the phorbol ester-treated cells, a broader spectrum of transcripts was present, with equal proportions of I.4, II, and I.3-containing clones. Additionally, one clone containing a new sequence, exon I.6, was found. This was shown to be located about 1 kb upstream of exon II. By contrast, all clones from cells treated with (Bu)2cAMP contained promoter II-specific sequences. In addition to these transcripts, two clones in the library from the dexamethasone-treated cells contained the sequence previously defined as the brain-specific sequence, 1f. In one of these, the 1f sequence was fused downstream of exon I.4, indicative that its expression likely employed promoter I.4. These results point to similarities and important differences between aromatase expression in THP-1 cells and other cells such as adipose stromal cells, indicative of unique regulatory pathways governing aromatase expression in these cells.
Wong, S W; Schaffer, P A
1991-05-01
Like other DNA-containing viruses, the three origins of herpes simplex virus type 1 (HSV-1) DNA replication are flanked by sequences containing transcriptional regulatory elements. In a transient plasmid replication assay, deletion of sequences comprising the transcriptional regulatory elements of ICP4 and ICP22/47, which flank oriS, resulted in a greater than 80-fold decrease in origin function compared with a plasmid, pOS-822, which retains these sequences. In an effort to identify specific cis-acting elements responsible for this effect, we conducted systematic deletion analysis of the flanking region with plasmid pOS-822 and tested the resulting mutant plasmids for origin function. Stimulation by cis-acting elements was shown to be both distance and orientation dependent, as changes in either parameter resulted in a decrease in oriS function. Additional evidence for the stimulatory effect of flanking sequences on origin function was demonstrated by replacement of these sequences with the cytomegalovirus immediate-early promoter, resulting in nearly wild-type levels of oriS function. In competition experiments, cotransfection of cells with the test plasmid, pOS-822, and increasing molar concentrations of a competitor plasmid which contained the ICP4 and ICP22/47 transcriptional regulatory regions but lacked core origin sequences resulted in a significant reduction in the replication efficiency of pOS-822, demonstrating that factors which bind specifically to the oriS-flanking sequences are likely involved as auxiliary proteins in oriS function. Together, these studies demonstrate that trans-acting factors and the sites to which they bind play a critical role in the efficiency of HSV-1 DNA replication from oriS in transient-replication assays.
Heix, J; Zomerdijk, J C; Ravanpay, A; Tjian, R; Grummt, I
1997-03-04
Promoter selectivity for all three classes of eukaryotic RNA polymerases is brought about by multimeric protein complexes containing TATA box binding protein (TBP) and specific TBP-associated factors (TAFs). Unlike class II- and III-specific TBP-TAF complexes, the corresponding murine and human class I-specific transcription initiation factor TIF-IB/SL1 exhibits a pronounced selectivity for its homologous promoter. As a first step toward understanding the molecular basis of species-specific promoter recognition, we cloned the cDNAs encoding the three mouse pol I-specific TBP-associated factors (TAFIs) and compared the amino acid sequences of the murine TAFIs with their human counterparts. The four subunits from either species can form stable chimeric complexes that contain stoichiometric amounts of TBP and TAFIs, demonstrating that differences in the primary structure of human and mouse TAFIs do not dramatically alter the network of protein-protein contacts responsible for assembly of the multimeric complex. Thus, primate vs. rodent promoter selectivity mediated by the TBP-TAFI complex is likely to be the result of cumulative subtle differences between individual subunits that lead to species-specific properties of RNA polymerase I transcription.
2014-01-01
Background Deciphering of the information content of eukaryotic promoters has remained confined to universal landmarks and conserved sequence elements such as enhancers and transcription factor binding motifs, which are considered sufficient for gene activation and regulation. Gene-specific sequences, interspersed between the canonical transacting factor binding sites or adjoining them within a promoter, are generally taken to be devoid of any regulatory information and have therefore been largely ignored. An unanswered question therefore is, do gene-specific sequences within a eukaryotic promoter have a role in gene activation? Here, we present an exhaustive experimental analysis of a gene-specific sequence adjoining the heat shock element (HSE) in the proximal promoter of the small heat shock protein gene, αB-crystallin (cryab). These sequences are highly conserved between the rodents and the humans. Results Using human retinal pigment epithelial cells in culture as the host, we have identified a 10-bp gene-specific promoter sequence (GPS), which, unlike an enhancer, controls expression from the promoter of this gene, only when in appropriate position and orientation. Notably, the data suggests that GPS in comparison with the HSE works in a context-independent fashion. Additionally, when moved upstream, about a nucleosome length of DNA (−154 bp) from the transcription start site (TSS), the activity of the promoter is markedly inhibited, suggesting its involvement in local promoter access. Importantly, we demonstrate that deletion of the GPS results in complete loss of cryab promoter activity in transgenic mice. Conclusions These data suggest that gene-specific sequences such as the GPS, identified here, may have critical roles in regulating gene-specific activity from eukaryotic promoters. PMID:24589182
Detection of the CLOCK/BMAL1 heterodimer using a nucleic acid probe with cycling probe technology.
Nakagawa, Kazuhiro; Yamamoto, Takuro; Yasuda, Akio
2010-09-15
An isothermal signal amplification technique for specific DNA sequences, known as cycling probe technology (CPT), has enabled rapid acquisition of genomic information. Here we report an analogous technique for the detection of an activated transcription factor, a transcription element-binding assay with fluorescent amplification by apurinic/apyrimidinic (AP) site lysis cycle (TEFAL). This simple amplification assay can detect activated transcription factors by using a unique nucleic acid probe containing a consensus binding sequence and an AP site, which enables the CPT reaction with AP endonuclease. In this article, we demonstrate that this method detects the functional CLOCK/BMAL1 heterodimer via the TEFAL probe containing the E-box consensus sequence to which the CLOCK/BMAL1 heterodimer binds. Using TEFAL combined with immunoassays, we measured oscillations in the amount of CLOCK/BMAL1 heterodimer in serum-stimulated HeLa cells. Furthermore, we succeeded in measuring the circadian accumulation of the functional CLOCK/BMAL1 heterodimer in human buccal mucosa cells. TEFAL contributes greatly to the study of transcription factor activation in mammalian tissues and cell extracts and is a powerful tool for less invasive investigation of human circadian rhythms. 2010 Elsevier Inc. All rights reserved.
The zebrafish dorsal axis is apparent at the four-cell stage.
Gore, Aniket V; Maegawa, Shingo; Cheong, Albert; Gilligan, Patrick C; Weinberg, Eric S; Sampath, Karuna
2005-12-15
A central question in the development of multicellular organisms pertains to the timing and mechanisms of specification of the embryonic axes. In many organisms, specification of the dorsoventral axis requires signalling by proteins of the Transforming growth factor-beta and Wnt families. Here we show that maternal transcripts of the zebrafish Nodal-related morphogen, Squint (Sqt), can localize to two blastomeres at the four-cell stage and predict the dorsal axis. Removal of cells containing sqt transcripts from four-to-eight-cell embryos or injection of antisense morpholino oligonucleotides targeting sqt into oocytes can cause a loss of dorsal structures. Localization of sqt transcripts is independent of maternal Wnt pathway function and requires a highly conserved sequence in the 3' untranslated region. Thus, the dorsoventral axis is apparent by early cleavage stages and may require the maternally encoded morphogen Sqt and its associated factors. Because the 3' untranslated region of the human nodal gene can also localize exogenous sequences to dorsal cells, this mechanism may be evolutionarily conserved.
Dai, Hanjun; Umarov, Ramzan; Kuwahara, Hiroyuki; Li, Yu; Song, Le; Gao, Xin
2017-11-15
An accurate characterization of transcription factor (TF)-DNA affinity landscape is crucial to a quantitative understanding of the molecular mechanisms underpinning endogenous gene regulation. While recent advances in biotechnology have brought the opportunity for building binding affinity prediction methods, the accurate characterization of TF-DNA binding affinity landscape still remains a challenging problem. Here we propose a novel sequence embedding approach for modeling the transcription factor binding affinity landscape. Our method represents DNA binding sequences as a hidden Markov model which captures both position specific information and long-range dependency in the sequence. A cornerstone of our method is a novel message passing-like embedding algorithm, called Sequence2Vec, which maps these hidden Markov models into a common nonlinear feature space and uses these embedded features to build a predictive model. Our method is a novel combination of the strength of probabilistic graphical models, feature space embedding and deep learning. We conducted comprehensive experiments on over 90 large-scale TF-DNA datasets which were measured by different high-throughput experimental technologies. Sequence2Vec outperforms alternative machine learning methods as well as the state-of-the-art binding affinity prediction methods. Our program is freely available at https://github.com/ramzan1990/sequence2vec. xin.gao@kaust.edu.sa or lsong@cc.gatech.edu. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
Pramila, Tata; Wu, Wei; Miles, Shawna; Noble, William Stafford; Breeden, Linda L
2006-08-15
Transcription patterns shift dramatically as cells transit from one phase of the cell cycle to another. To better define this transcriptional circuitry, we collected new microarray data across the cell cycle of budding yeast. The combined analysis of these data with three other cell cycle data sets identifies hundreds of new highly periodic transcripts and provides a weighted average peak time for each transcript. Using these data and phylogenetic comparisons of promoter sequences, we have identified a late S-phase-specific promoter element. This element is the binding site for the forkhead protein Hcm1, which is required for its cell cycle-specific activity. Among the cell cycle-regulated genes that contain conserved Hcm1-binding sites, there is a significant enrichment of genes involved in chromosome segregation, spindle dynamics, and budding. This may explain why Hcm1 mutants show 10-fold elevated rates of chromosome loss and require the spindle checkpoint for viability. Hcm1 also induces the M-phase-specific transcription factors FKH1, FKH2, and NDD1, and two cell cycle-specific transcriptional repressors, WHI5 and YHP1. As such, Hcm1 fills a significant gap in our understanding of the transcriptional circuitry that underlies the cell cycle.
Passamaneck, Yale J; Katikala, Lavanya; Perrone, Lorena; Dunn, Matthew P; Oda-Ishii, Izumi; Di Gregorio, Anna
2009-11-01
The notochord is a defining feature of the chordate body plan. Experiments in ascidian, frog and mouse embryos have shown that co-expression of Brachyury and FoxA class transcription factors is required for notochord development. However, studies on the cis-regulatory sequences mediating the synergistic effects of these transcription factors are complicated by the limited knowledge of notochord genes and cis-regulatory modules (CRMs) that are directly targeted by both. We have identified an easily testable model for such investigations in a 155-bp notochord-specific CRM from the ascidian Ciona intestinalis. This CRM contains functional binding sites for both Ciona Brachyury (Ci-Bra) and FoxA (Ci-FoxA-a). By combining point mutation analysis and misexpression experiments, we demonstrate that binding of both transcription factors to this CRM is necessary and sufficient to activate transcription. To gain insights into the cis-regulatory criteria controlling its activity, we investigated the organization of the transcription factor binding sites within the 155-bp CRM. The 155-bp sequence contains two Ci-Bra binding sites with identical core sequences but opposite orientations, only one of which is required for enhancer activity. Changes in both orientation and spacing of these sites substantially affect the activity of the CRM, as clusters of identical sites found in the Ciona genome with different arrangements are unable to activate transcription in notochord cells. This work presents the first evidence of a synergistic interaction between Brachyury and FoxA in the activation of an individual notochord CRM, and highlights the importance of transcription factor binding site arrangement for its function.
Differential pleiotropy and HOX functional organization.
Sivanantharajah, Lovesha; Percival-Smith, Anthony
2015-02-01
Key studies led to the idea that transcription factors are composed of defined modular protein motifs or domains, each with separable, unique function. During evolution, the recombination of these modular domains could give rise to transcription factors with new properties, as has been shown using recombinant molecules. This archetypic, modular view of transcription factor organization is based on the analyses of a few transcription factors such as GAL4, which may represent extreme exemplars rather than an archetype or the norm. Recent work with a set of Homeotic selector (HOX) proteins has revealed differential pleiotropy: the observation that highly-conserved HOX protein motifs and domains make small, additive, tissue specific contributions to HOX activity. Many of these differentially pleiotropic HOX motifs may represent plastic sequence elements called short linear motifs (SLiMs). The coupling of differential pleiotropy with SLiMs, suggests that protein sequence changes in HOX transcription factors may have had a greater impact on morphological diversity during evolution than previously believed. Furthermore, differential pleiotropy may be the genetic consequence of an ensemble nature of HOX transcription factor allostery, where HOX proteins exist as an ensemble of states with the capacity to integrate an extensive array of developmental information. Given a new structural model for HOX functional domain organization, the properties of the archetypic TF may require reassessment. Copyright © 2014 Elsevier Inc. All rights reserved.
Zandvakili, Arya; Campbell, Ian; Weirauch, Matthew T.
2018-01-01
Cells use thousands of regulatory sequences to recruit transcription factors (TFs) and produce specific transcriptional outcomes. Since TFs bind degenerate DNA sequences, discriminating functional TF binding sites (TFBSs) from background sequences represents a significant challenge. Here, we show that a Drosophila regulatory element that activates Epidermal Growth Factor signaling requires overlapping, low-affinity TFBSs for competing TFs (Pax2 and Senseless) to ensure cell- and segment-specific activity. Testing available TF binding models for Pax2 and Senseless, however, revealed variable accuracy in predicting such low-affinity TFBSs. To better define parameters that increase accuracy, we developed a method that systematically selects subsets of TFBSs based on predicted affinity to generate hundreds of position-weight matrices (PWMs). Counterintuitively, we found that degenerate PWMs produced from datasets depleted of high-affinity sequences were more accurate in identifying both low- and high-affinity TFBSs for the Pax2 and Senseless TFs. Taken together, these findings reveal how TFBS arrangement can be constrained by competition rather than cooperativity and that degenerate models of TF binding preferences can improve identification of biologically relevant low affinity TFBSs. PMID:29617378
The punctilious RNA polymerase II core promoter
Vo ngoc, Long; Wang, Yuan-Liang; Kassavetis, George A.; Kadonaga, James T.
2017-01-01
The signals that direct the initiation of transcription ultimately converge at the core promoter, which is the gateway to transcription. Here we provide an overview of the RNA polymerase II core promoter in bilateria (bilaterally symmetric animals). The core promoter is diverse in terms of its composition and function yet is also punctilious, as it acts with strict rules and precision. We additionally describe an expanded view of the core promoter that comprises the classical DNA sequence motifs, sequence-specific DNA-binding transcription factors, chromatin signals, and DNA structure. This model may eventually lead to a more unified conceptual understanding of the core promoter. PMID:28808065
Genome-scale cold stress response regulatory networks in ten Arabidopsis thaliana ecotypes
2013-01-01
Background Low temperature leads to major crop losses every year. Although several studies have been conducted focusing on diversity of cold tolerance level in multiple phenotypically divergent Arabidopsis thaliana (A. thaliana) ecotypes, genome-scale molecular understanding is still lacking. Results In this study, we report genome-scale transcript response diversity of 10 A. thaliana ecotypes originating from different geographical locations to non-freezing cold stress (10°C). To analyze the transcriptional response diversity, we initially compared transcriptome changes in all 10 ecotypes using Arabidopsis NimbleGen ATH6 microarrays. In total 6061 transcripts were significantly cold regulated (p < 0.01) in 10 ecotypes, including 498 transcription factors and 315 transposable elements. The majority of the transcripts (75%) showed ecotype specific expression pattern. By using sequence data available from Arabidopsis thaliana 1001 genome project, we further investigated sequence polymorphisms in the core cold stress regulon genes. Significant numbers of non-synonymous amino acid changes were observed in the coding region of the CBF regulon genes. Considering the limited knowledge about regulatory interactions between transcription factors and their target genes in the model plant A. thaliana, we have adopted a powerful systems genetics approach- Network Component Analysis (NCA) to construct an in-silico transcriptional regulatory network model during response to cold stress. The resulting regulatory network contained 1,275 nodes and 7,720 connections, with 178 transcription factors and 1,331 target genes. Conclusions A. thaliana ecotypes exhibit considerable variation in transcriptome level responses to non-freezing cold stress treatment. Ecotype specific transcripts and related gene ontology (GO) categories were identified to delineate natural variation of cold stress regulated differential gene expression in the model plant A. thaliana. The predicted regulatory network model was able to identify new ecotype specific transcription factors and their regulatory interactions, which might be crucial for their local geographic adaptation to cold temperature. Additionally, since the approach presented here is general, it could be adapted to study networks regulating biological process in any biological systems. PMID:24148294
Conserved noncoding sequences (CNSs) in higher plants.
Freeling, Michael; Subramaniam, Shabarinath
2009-04-01
Plant conserved noncoding sequences (CNSs)--a specific category of phylogenetic footprint--have been shown experimentally to function. No plant CNS is conserved to the extent that ultraconserved noncoding sequences are conserved in vertebrates. Plant CNSs are enriched in known transcription factor or other cis-acting binding sites, and are usually clustered around genes. Genes that encode transcription factors and/or those that respond to stimuli are particularly CNS-rich. Only rarely could this function involve small RNA binding. Some transcribed CNSs encode short translation products as a form of negative control. Approximately 4% of Arabidopsis gene content is estimated to be both CNS-rich and occupies a relatively long stretch of chromosome: Bigfoot genes (long phylogenetic footprints). We discuss a 'DNA-templated protein assembly' idea that might help explain Bigfoot gene CNSs.
Zhang, Qi; Zeng, Xin; Younkin, Sam; Kawli, Trupti; Snyder, Michael P; Keleş, Sündüz
2016-02-24
Chromatin immunoprecipitation followed by sequencing (ChIP-seq) experiments revolutionized genome-wide profiling of transcription factors and histone modifications. Although maturing sequencing technologies allow these experiments to be carried out with short (36-50 bps), long (75-100 bps), single-end, or paired-end reads, the impact of these read parameters on the downstream data analysis are not well understood. In this paper, we evaluate the effects of different read parameters on genome sequence alignment, coverage of different classes of genomic features, peak identification, and allele-specific binding detection. We generated 101 bps paired-end ChIP-seq data for many transcription factors from human GM12878 and MCF7 cell lines. Systematic evaluations using in silico variations of these data as well as fully simulated data, revealed complex interplay between the sequencing parameters and analysis tools, and indicated clear advantages of paired-end designs in several aspects such as alignment accuracy, peak resolution, and most notably, allele-specific binding detection. Our work elucidates the effect of design on the downstream analysis and provides insights to investigators in deciding sequencing parameters in ChIP-seq experiments. We present the first systematic evaluation of the impact of ChIP-seq designs on allele-specific binding detection and highlights the power of pair-end designs in such studies.
He, Xin; Samee, Md. Abul Hassan; Blatti, Charles; Sinha, Saurabh
2010-01-01
Quantitative models of cis-regulatory activity have the potential to improve our mechanistic understanding of transcriptional regulation. However, the few models available today have been based on simplistic assumptions about the sequences being modeled, or heuristic approximations of the underlying regulatory mechanisms. We have developed a thermodynamics-based model to predict gene expression driven by any DNA sequence, as a function of transcription factor concentrations and their DNA-binding specificities. It uses statistical thermodynamics theory to model not only protein-DNA interaction, but also the effect of DNA-bound activators and repressors on gene expression. In addition, the model incorporates mechanistic features such as synergistic effect of multiple activators, short range repression, and cooperativity in transcription factor-DNA binding, allowing us to systematically evaluate the significance of these features in the context of available expression data. Using this model on segmentation-related enhancers in Drosophila, we find that transcriptional synergy due to simultaneous action of multiple activators helps explain the data beyond what can be explained by cooperative DNA-binding alone. We find clear support for the phenomenon of short-range repression, where repressors do not directly interact with the basal transcriptional machinery. We also find that the binding sites contributing to an enhancer's function may not be conserved during evolution, and a noticeable fraction of these undergo lineage-specific changes. Our implementation of the model, called GEMSTAT, is the first publicly available program for simultaneously modeling the regulatory activities of a given set of sequences. PMID:20862354
Jaeger, Alex M.; Makley, Leah N.; Gestwicki, Jason E.; Thiele, Dennis J.
2014-01-01
The heat shock transcription factor 1 (HSF1) activates expression of a variety of genes involved in cell survival, including protein chaperones, the protein degradation machinery, anti-apoptotic proteins, and transcription factors. Although HSF1 activation has been linked to amelioration of neurodegenerative disease, cancer cells exhibit a dependence on HSF1 for survival. Indeed, HSF1 drives a program of gene expression in cancer cells that is distinct from that activated in response to proteotoxic stress, and HSF1 DNA binding activity is elevated in cycling cells as compared with arrested cells. Active HSF1 homotrimerizes and binds to a DNA sequence consisting of inverted repeats of the pentameric sequence nGAAn, known as heat shock elements (HSEs). Recent comprehensive ChIP-seq experiments demonstrated that the architecture of HSEs is very diverse in the human genome, with deviations from the consensus sequence in the spacing, orientation, and extent of HSE repeats that could influence HSF1 DNA binding efficacy and the kinetics and magnitude of target gene expression. To understand the mechanisms that dictate binding specificity, HSF1 was purified as either a monomer or trimer and used to evaluate DNA-binding site preferences in vitro using fluorescence polarization and thermal denaturation profiling. These results were compared with quantitative chromatin immunoprecipitation assays in vivo. We demonstrate a role for specific orientations of extended HSE sequences in driving preferential HSF1 DNA binding to target loci in vivo. These studies provide a biochemical basis for understanding differential HSF1 target gene recognition and transcription in neurodegenerative disease and in cancer. PMID:25204655
Hematopoietic transcriptional mechanisms: from locus-specific to genome-wide vantage points.
DeVilbiss, Andrew W; Sanalkumar, Rajendran; Johnson, Kirby D; Keles, Sunduz; Bresnick, Emery H
2014-08-01
Hematopoiesis is an exquisitely regulated process in which stem cells in the developing embryo and the adult generate progenitor cells that give rise to all blood lineages. Master regulatory transcription factors control hematopoiesis by integrating signals from the microenvironment and dynamically establishing and maintaining genetic networks. One of the most rudimentary aspects of cell type-specific transcription factor function, how they occupy a highly restricted cohort of cis-elements in chromatin, remains poorly understood. Transformative technologic advances involving the coupling of next-generation DNA sequencing technology with the chromatin immunoprecipitation assay (ChIP-seq) have enabled genome-wide mapping of factor occupancy patterns. However, formidable problems remain; notably, ChIP-seq analysis yields hundreds to thousands of chromatin sites occupied by a given transcription factor, and only a fraction of the sites appear to be endowed with critical, non-redundant function. It has become en vogue to map transcription factor occupancy patterns genome-wide, while using powerful statistical tools to establish correlations to inform biology and mechanisms. With the advent of revolutionary genome editing technologies, one can now reach beyond correlations to conduct definitive hypothesis testing. This review focuses on key discoveries that have emerged during the path from single loci to genome-wide analyses, specifically in the context of hematopoietic transcriptional mechanisms. Copyright © 2014 ISEH - International Society for Experimental Hematology. Published by Elsevier Inc. All rights reserved.
Qiao, Huan; May, James M.
2012-01-01
Transcription of the ascorbate transporter, SVCT2, is driven by two distinct promoters in exon 1 of the transporter sequence. The exon 1a promoter lacks a classical transcription start site and little is known about regulation of promoter activity in the transcription start site core (TSSC) region. Here we present evidence that the TSSC binds the multifunctional initiator-binding protein YY1. Electrophoresis shift assays using YY1 antibody showed that YY1 is present as one of two major complexes that specifically bind to the TSSC. The other complex contains the transcription factor NF-Y. Mutations in the TSSC that decreased YY1 binding also impaired the exon 1a promoter activity despite the presence of an upstream activating NF-Y/USF complex, suggesting that YY1 is involved in the regulation of the exon 1a transcription. Furthermore, YY1 interaction with NF-Y and/or USF synergistically enhanced the exon 1a promoter activity in transient transfections and co-activator p300 enhanced their synergistic activation. We propose that the TSSC plays a vital role in the exon 1a transcription and that this function is partially carried out by the transcription factor YY1. Moreover, co-activator p300 might be able to synergistically enhance the TSSC function via a “bridge” mechanism with upstream sequences. PMID:22532872
Kim, Chun Sung; Choi, Hack Sun; Hwang, Cheol Kyu; Song, Kyu Young; Lee, Byung-Kwon; Law, Ping-Yee; Wei, Li-Na; Loh, Horace H.
2006-01-01
Previously, we reported that the neuron-restrictive silencer element (NRSE) of mu opioid receptor (MOR) functions as a critical regulator to repress the MOR transcription in specific neuronal cells, depending on neuron-restriction silence factor (NRSF) expression levels [C.S.Kim, C.K.Hwang, H.S.Choi, K.Y.Song, P.Y.Law, L.N.Wei and H.H.Loh (2004) J. Biol. Chem., 279, 46464–46473]. Herein, we identify a conserved GC sequence next to NRSE region in the mouse MOR gene. The inhibition of Sp family factors binding to this GC box by mithramycin A led to a significant increase in the endogenous MOR transcription. In the co-immunoprecipitation experiment, NRSF interacted with the full-length Sp3 factor, but not with Sp1 or two short Sp3 isoforms. The sequence specific and functional binding by Sp3 at this GC box was confirmed by in vitro gel-shift assays using either in vitro translated proteins or nuclear extract, and by in vivo chromatin immunoprecipitation assays. Transient transfection assays showed that Sp3-binding site of the MOR gene is a functionally synergic repressor element with NRSE in NS20Y cells, but not in the NRSF negative PC12 cells. The results suggest that the synergic interaction between NRSF and Sp3 is required to negatively regulate MOR gene transcription and that transcription of MOR gene would be governed by the context of available transcription factors rather than by a master regulator. PMID:17130167
Antisense Transcription Is Pervasive but Rarely Conserved in Enteric Bacteria
Raghavan, Rahul; Sloan, Daniel B.; Ochman, Howard
2012-01-01
ABSTRACT Noncoding RNAs, including antisense RNAs (asRNAs) that originate from the complementary strand of protein-coding genes, are involved in the regulation of gene expression in all domains of life. Recent application of deep-sequencing technologies has revealed that the transcription of asRNAs occurs genome-wide in bacteria. Although the role of the vast majority of asRNAs remains unknown, it is often assumed that their presence implies important regulatory functions, similar to those of other noncoding RNAs. Alternatively, many antisense transcripts may be produced by chance transcription events from promoter-like sequences that result from the degenerate nature of bacterial transcription factor binding sites. To investigate the biological relevance of antisense transcripts, we compared genome-wide patterns of asRNA expression in closely related enteric bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, by performing strand-specific transcriptome sequencing. Although antisense transcripts are abundant in both species, less than 3% of asRNAs are expressed at high levels in both species, and only about 14% appear to be conserved among species. And unlike the promoters of protein-coding genes, asRNA promoters show no evidence of sequence conservation between, or even within, species. Our findings suggest that many or even most bacterial asRNAs are nonadaptive by-products of the cell’s transcription machinery. PMID:22872780
Cenci, Albero; Guignon, Valentin; Roux, Nicolas; Rouard, Mathieu
2014-05-01
Identifying the molecular mechanisms underlying tolerance to abiotic stresses is important in crop breeding. A comprehensive understanding of the gene families associated with drought tolerance is therefore highly relevant. NAC transcription factors form a large plant-specific gene family involved in the regulation of tissue development and responses to biotic and abiotic stresses. The main goal of this study was to set up a framework of orthologous groups determined by an expert sequence comparison of NAC genes from both monocots and dicots. In order to clarify the orthologous relationships among NAC genes of different species, we performed an in-depth comparative study of four divergent taxa, in dicots and monocots, whose genomes have already been completely sequenced: Arabidopsis thaliana, Vitis vinifera, Musa acuminata and Oryza sativa. Due to independent evolution, NAC copy number is highly variable in these plant genomes. Based on an expert NAC sequence comparison, we propose forty orthologous groups of NAC sequences that were probably derived from an ancestor gene present in the most recent common ancestor of dicots and monocots. These orthologous groups provide a curated resource for large-scale protein sequence annotation of NAC transcription factors. The established orthology relationships also provide a useful reference for NAC function studies in newly sequenced genomes such as M. acuminata and other plant species.
Analysis of tissue-specific region in sericin 1 gene promoter of Bombyx mori
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu Yan; Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031; Yu Lian
The gene encoding sericin 1 (Ser1) of silkworm (Bombyx mori) is specifically expressed in the middle silk gland cells. To identify element involved in this transcription-dependent spatial restriction, truncation of the 5' terminal from the sericin 1 (Ser1) promoter is studied in vivo. A 209 bp DNA sequence upstream of the transcriptional start site (-586 to -378) is found to be responsible for promoting tissue-specific transcription. Analysis of this 209 bp region by overlapping deletion studies showed that a 25 bp region (-500 to -476) suppresses the ectopic expression of the Ser1 promoter. An unknown factor abundant in fat bodymore » nuclear extracts is shown to bind to this 25 bp fragment. These results suggest that this 25 bp region and the unknown factor are necessary for determining the tissue-specificity of the Ser1 promoter.« less
Cheng, Christine S.; Feldman, Kristyn E.; Lee, James; Verma, Shilpi; Huang, De-Bin; Huynh, Kim; Chang, Mikyoung; Ponomarenko, Julia V.; Sun, Shao-Cong; Benedict, Chris A.; Ghosh, Gourisankar; Hoffmann, Alexander
2011-01-01
The specific binding of transcription factors to cognate sequence elements is thought to be critical for the generation of specific gene expression programs. Members of the nuclear factor κB (NF-κB) and interferon (IFN) regulatory factor (IRF) transcription factor families bind to the κB site and the IFN response element (IRE), respectively, of target genes, and they are activated in macrophages after exposure to pathogens. However, how these factors produce pathogen-specific inflammatory and immune responses remains poorly understood. Combining top-down and bottom-up systems biology approaches, we have identified the NF-κB p50 homodimer as a regulator of IRF responses. Unbiased genome-wide expression and biochemical and structural analyses revealed that the p50 homodimer repressed a subset of IFN-inducible genes through a previously uncharacterized subclass of guanine-rich IRE (G-IRE) sequences. Mathematical modeling predicted that the p50 homodimer might enforce the stimulus specificity of composite promoters. Indeed, the production of the antiviral regulator IFN-β was rendered stimulus-specific by the binding of the p50 homodimer to the G-IRE–containing IFNβ enhancer to suppress cytotoxic IFN signaling. Specifically, a deficiency in p50 resulted in the inappropriate production of IFN-β in response to bacterial DNA sensed by Toll-like receptor 9. This role for the NF-κB p50 homodimer in enforcing the specificity of the cellular response to pathogens by binding to a subset of IRE sequences alters our understanding of how the NF-κB and IRF signaling systems cooperate to regulate antimicrobial immunity. PMID:21343618
Suciu, Maria C.; Telenius, Jelena
2017-01-01
In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k-mer-based analysis of DNase footprints to determine any k-mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome. PMID:28904015
TFIIS-Dependent Non-coding Transcription Regulates Developmental Genome Rearrangements
Maliszewska-Olejniczak, Kamila; Gruchota, Julita; Gromadka, Robert; Denby Wilkes, Cyril; Arnaiz, Olivier; Mathy, Nathalie; Duharcourt, Sandra; Bétermier, Mireille; Nowak, Jacek K.
2015-01-01
Because of their nuclear dimorphism, ciliates provide a unique opportunity to study the role of non-coding RNAs (ncRNAs) in the communication between germline and somatic lineages. In these unicellular eukaryotes, a new somatic nucleus develops at each sexual cycle from a copy of the zygotic (germline) nucleus, while the old somatic nucleus degenerates. In the ciliate Paramecium tetraurelia, the genome is massively rearranged during this process through the reproducible elimination of repeated sequences and the precise excision of over 45,000 short, single-copy Internal Eliminated Sequences (IESs). Different types of ncRNAs resulting from genome-wide transcription were shown to be involved in the epigenetic regulation of genome rearrangements. To understand how ncRNAs are produced from the entire genome, we have focused on a homolog of the TFIIS elongation factor, which regulates RNA polymerase II transcriptional pausing. Six TFIIS-paralogs, representing four distinct families, can be found in P. tetraurelia genome. Using RNA interference, we showed that TFIIS4, which encodes a development-specific TFIIS protein, is essential for the formation of a functional somatic genome. Molecular analyses and high-throughput DNA sequencing upon TFIIS4 RNAi demonstrated that TFIIS4 is involved in all kinds of genome rearrangements, including excision of ~48% of IESs. Localization of a GFP-TFIIS4 fusion revealed that TFIIS4 appears specifically in the new somatic nucleus at an early developmental stage, before IES excision. RT-PCR experiments showed that TFIIS4 is necessary for the synthesis of IES-containing non-coding transcripts. We propose that these IES+ transcripts originate from the developing somatic nucleus and serve as pairing substrates for germline-specific short RNAs that target elimination of their homologous sequences. Our study, therefore, connects the onset of zygotic non coding transcription to the control of genome plasticity in Paramecium, and establishes for the first time a specific role of TFIIS in non-coding transcription in eukaryotes. PMID:26177014
von Dassow, Peter; Ogata, Hiroyuki; Probert, Ian; Wincker, Patrick; Da Silva, Corinne; Audic, Stéphane; Claverie, Jean-Michel; de Vargas, Colomban
2009-01-01
Eukaryotes are classified as either haplontic, diplontic, or haplo-diplontic, depending on which ploidy levels undergo mitotic cell division in the life cycle. Emiliania huxleyi is one of the most abundant phytoplankton species in the ocean, playing an important role in global carbon fluxes, and represents haptophytes, an enigmatic group of unicellular organisms that diverged early in eukaryotic evolution. This species is haplo-diplontic. Little is known about the haploid cells, but they have been hypothesized to allow persistence of the species between the yearly blooms of diploid cells. We sequenced over 38,000 expressed sequence tags from haploid and diploid E. huxleyi normalized cDNA libraries to identify genes involved in important processes specific to each life phase (2N calcification or 1N motility), and to better understand the haploid phase of this prominent haplo-diplontic organism. The haploid and diploid transcriptomes showed a dramatic differentiation, with approximately 20% greater transcriptome richness in diploid cells than in haploid cells and only
2009-01-01
Background Eukaryotes are classified as either haplontic, diplontic, or haplo-diplontic, depending on which ploidy levels undergo mitotic cell division in the life cycle. Emiliania huxleyi is one of the most abundant phytoplankton species in the ocean, playing an important role in global carbon fluxes, and represents haptophytes, an enigmatic group of unicellular organisms that diverged early in eukaryotic evolution. This species is haplo-diplontic. Little is known about the haploid cells, but they have been hypothesized to allow persistence of the species between the yearly blooms of diploid cells. We sequenced over 38,000 expressed sequence tags from haploid and diploid E. huxleyi normalized cDNA libraries to identify genes involved in important processes specific to each life phase (2N calcification or 1N motility), and to better understand the haploid phase of this prominent haplo-diplontic organism. Results The haploid and diploid transcriptomes showed a dramatic differentiation, with approximately 20% greater transcriptome richness in diploid cells than in haploid cells and only ≤ 50% of transcripts estimated to be common between the two phases. The major functional category of transcripts differentiating haploids included signal transduction and motility genes. Diploid-specific transcripts included Ca2+, H+, and HCO3- pumps. Potential factors differentiating the transcriptomes included haploid-specific Myb transcription factor homologs and an unusual diploid-specific histone H4 homolog. Conclusions This study permitted the identification of genes likely involved in diploid-specific biomineralization, haploid-specific motility, and transcriptional control. Greater transcriptome richness in diploid cells suggests they may be more versatile for exploiting a diversity of rich environments whereas haploid cells are intrinsically more streamlined. PMID:19832986
Deletion of transcription factor binding motifs using the CRISPR/spCas9 system in the β-globin LCR.
Kim, Yea Woon; Kim, AeRi
2017-07-20
Transcription factors play roles in gene transcription through direct binding to their motifs in genome, and inhibiting this binding provides an effective strategy for studying their roles. Here we applied the CRISPR/spCas9 system to mutate the binding motifs of transcription factors. Binding motifs for erythroid specific transcription factors were mutated in the locus control region hypersensitive sites of the human β-globin locus. Guide RNAs targeting binding motifs were cloned into lentiviral CRISPR vector containing the spCas9 gene, and transduced into MEL/ch11 cells carrying a human chromosome 11. DNA mutations in clonal cells were initially screened by quantitative PCR in genomic DNA and then clarified by sequencing. Mutations in binding motifs reduced occupancy by transcription factors in a chromatin environment. Characterization of mutations revealed that the CRISPR/spCas9 system mainly induced deletions in short regions of <20 bp and preferentially deleted nucleotides around the fifth nucleotide upstream of Protospacer adjacent motifs. These results indicate that the CRISPR/Cas9 system is suitable for mutating the binding motifs of transcription factors, and, consequently, would contribute to elucidate the direct roles of transcription factors. ©2017 The Author(s).
DOE Office of Scientific and Technical Information (OSTI.GOV)
Xiao, Xiao; Gang, Yi; Department of Infectious Diseases, Tangdu Hospital, Fourth Military Medical University, Xi’an 710038, Shaanxi Province
2015-02-06
Highlights: • A shRNA vector based transcription factor decoy, VB-ODN, was designed. • VB-ODN for NF-κB inhibited cell viability in HEK293 cells. • VB-ODN inhibited expression of downstream genes of target transcription factors. • VB-ODN may enhance nuclear entry ratio for its feasibility of virus production. - Abstract: In this study, we designed a short hairpin RNA vector-based oligodeoxynucleotide (VB-ODN) carrying transcription factor (TF) consensus sequence which could function as a decoy to block TF activity. Specifically, VB-ODN for Nuclear factor-κB (NF-κB) could inhibit cell viability and decrease downstream gene expression in HEK293 cells without affecting expression of NF-κB itself.more » The specific binding between VB-ODN produced double-stranded RNA and NF-κB was evidenced by electrophoretic mobility shift assay. Moreover, similar VB-ODNs designed for three other TFs also inhibit their downstream gene expression but not that of themselves. Our study provides a new design of decoy for blocking TF activity.« less
Satapathy, Lopamudra; Singh, Dharmendra; Ranjan, Prashant; Kumar, Dhananjay; Kumar, Manish; Prabhu, Kumble Vinod; Mukhopadhyay, Kunal
2014-12-01
WRKY, a plant-specific transcription factor family, has important roles in pathogen defense, abiotic cues and phytohormone signaling, yet little is known about their roles and molecular mechanism of function in response to rust diseases in wheat. We identified 100 TaWRKY sequences using wheat Expressed Sequence Tag database of which 22 WRKY sequences were novel. Identified proteins were characterized based on their zinc finger motifs and phylogenetic analysis clustered them into six clades consisting of class IIc and class III WRKY proteins. Functional annotation revealed major functions in metabolic and cellular processes in control plants; whereas response to stimuli, signaling and defense in pathogen inoculated plants, their major molecular function being binding to DNA. Tag-based expression analysis of the identified genes revealed differential expression between mock and Puccinia triticina inoculated wheat near isogenic lines. Gene expression was also performed with six rust-related microarray experiments at Gene Expression Omnibus database. TaWRKY10, 15, 17 and 56 were common in both tag-based and microarray-based differential expression analysis and could be representing rust specific WRKY genes. The obtained results will bestow insight into the functional characterization of WRKY transcription factors responsive to leaf rust pathogenesis that can be used as candidate genes in molecular breeding programs to improve biotic stress tolerance in wheat.
Augustin, Regina; Lichtenthaler, Stefan F.; Greeff, Michael; Hansen, Jens; Wurst, Wolfgang; Trümbach, Dietrich
2011-01-01
The molecular mechanisms and genetic risk factors underlying Alzheimer's disease (AD) pathogenesis are only partly understood. To identify new factors, which may contribute to AD, different approaches are taken including proteomics, genetics, and functional genomics. Here, we used a bioinformatics approach and found that distinct AD-related genes share modules of transcription factor binding sites, suggesting a transcriptional coregulation. To detect additional coregulated genes, which may potentially contribute to AD, we established a new bioinformatics workflow with known multivariate methods like support vector machines, biclustering, and predicted transcription factor binding site modules by using in silico analysis and over 400 expression arrays from human and mouse. Two significant modules are composed of three transcription factor families: CTCF, SP1F, and EGRF/ZBPF, which are conserved between human and mouse APP promoter sequences. The specific combination of in silico promoter and multivariate analysis can identify regulation mechanisms of genes involved in multifactorial diseases. PMID:21559189
An overview of transcriptional regulation in response to toxicological insult.
Jennings, Paul; Limonciel, Alice; Felice, Luca; Leonard, Martin O
2013-01-01
The completion of the human genome project and the subsequent advent of DNA microarray and high-throughput sequencing technologies have led to a renaissance in molecular toxicology. Toxicogenomic data sets, from both in vivo and in vitro studies, are growing exponentially, providing a wealth of information on regulation of stress pathways at the transcriptome level. Through such studies, we are now beginning to appreciate the diversity and complexity of biological responses to xenobiotics. In this review, we aim to consolidate and summarise the major toxicologically relevant transcription factor-governed molecular pathways. It is becoming clear that different chemical entities can cause oxidative, genotoxic and proteotoxic stress, which induce cellular responses in an effort to restore homoeostasis. Primary among the response pathways involved are NFE2L2 (Nrf2), NFE2L1 (Nrf1), p53, heat shock factor and the unfolded protein response. Additionally, more specific mechanisms exist where xenobiotics act as ligands, including the aryl hydrocarbon receptor, metal-responsive transcription factor-1 and the nuclear receptor family of transcription factors. Other pathways including the immunomodulatory transcription factors NF-κB and STAT together with the hypoxia-inducible transcription factor HIF are also implicated in cellular responses to xenobiotic exposure. A less specific but equally important aspect to cellular injury controlled by transcriptional activity is loss of tissue-specific gene expression, resulting in dedifferentiation of target cells and compromise of tissue function. Here, we review these pathways and the genes they regulate in order to provide an overview of this growing field of molecular toxicology.
Eguchi, Asuka; Lee, Garrett O.; Wan, Fang; Erwin, Graham S.; Ansari, Aseem Z.
2014-01-01
Transcription factors control the fate of a cell by regulating the expression of genes and regulatory networks. Recent successes in inducing pluripotency in terminally differentiated cells as well as directing differentiation with natural transcription factors has lent credence to the efforts that aim to direct cell fate with rationally designed transcription factors. Because DNA-binding factors are modular in design, they can be engineered to target specific genomic sequences and perform pre-programmed regulatory functions upon binding. Such precision-tailored factors can serve as molecular tools to reprogramme or differentiate cells in a targeted manner. Using different types of engineered DNA binders, both regulatory transcriptional controls of gene networks, as well as permanent alteration of genomic content, can be implemented to study cell fate decisions. In the present review, we describe the current state of the art in artificial transcription factor design and the exciting prospect of employing artificial DNA-binding factors to manipulate the transcriptional networks as well as epigenetic landscapes that govern cell fate. PMID:25145439
Liao, Ming-Xiang; Liu, Dong-Yuan; Zuo, Jin; Fang, Fu-De
2002-03-01
To detect the trans-factors specifically binding to the strong enhancer element (GPEI) in the upstream of rat glutathione S-transferase P (GST-P) gene. Yeast one-hybrid system was used to screen rat lung MATCHMAKER cDNA library to identify potential trans-factors that can interact with core sequence of GPEI(cGPEI). Electrophoresis mobility shift assay (EMSA) was used to analyze the binding of transfactors to cGPEI. cDNA fragments coding for the C-terminal part of the transcription factor c-Jun and rat adenine nucleotide translocator (ANT) were isolated. The binding of c-Jun and ANT to GPEI core sequence were confirmed. Rat c-jun transcriptional factor and ANT may interact with cGPEI. They could play an important role in the induced expression of GST-P gene.
The siRNA Non-seed Region and Its Target Sequences Are Auxiliary Determinants of Off-Target Effects.
Kamola, Piotr J; Nakano, Yuko; Takahashi, Tomoko; Wilson, Paul A; Ui-Tei, Kumiko
2015-12-01
RNA interference (RNAi) is a powerful tool for post-transcriptional gene silencing. However, the siRNA guide strand may bind unintended off-target transcripts via partial sequence complementarity by a mechanism closely mirroring micro RNA (miRNA) silencing. To better understand these off-target effects, we investigated the correlation between sequence features within various subsections of siRNA guide strands, and its corresponding target sequences, with off-target activities. Our results confirm previous reports that strength of base-pairing in the siRNA seed region is the primary factor determining the efficiency of off-target silencing. However, the degree of downregulation of off-target transcripts with shared seed sequence is not necessarily similar, suggesting that there are additional auxiliary factors that influence the silencing potential. Here, we demonstrate that both the melting temperature (Tm) in a subsection of siRNA non-seed region, and the GC contents of its corresponding target sequences, are negatively correlated with the efficiency of off-target effect. Analysis of experimentally validated miRNA targets demonstrated a similar trend, indicating a putative conserved mechanistic feature of seed region-dependent targeting mechanism. These observations may prove useful as parameters for off-target prediction algorithms and improve siRNA 'specificity' design rules.
Evolution of a tissue-specific splicing network
Taliaferro, J. Matthew; Alvarez, Nehemiah; Green, Richard E.; Blanchette, Marco; Rio, Donald C.
2011-01-01
Alternative splicing of precursor mRNA (pre-mRNA) is a strategy employed by most eukaryotes to increase transcript and proteomic diversity. Many metazoan splicing factors are members of multigene families, with each member having different functions. How these highly related proteins evolve unique properties has been unclear. Here we characterize the evolution and function of a new Drosophila splicing factor, termed LS2 (Large Subunit 2), that arose from a gene duplication event of dU2AF50, the large subunit of the highly conserved heterodimeric general splicing factor U2AF (U2-associated factor). The quickly evolving LS2 gene has diverged from the splicing-promoting, ubiquitously expressed dU2AF50 such that it binds a markedly different RNA sequence, acts as a splicing repressor, and is preferentially expressed in testes. Target transcripts of LS2 are also enriched for performing testes-related functions. We therefore propose a path for the evolution of a new splicing factor in Drosophila that regulates specific pre-mRNAs and contributes to transcript diversity in a tissue-specific manner. PMID:21406555
Bouard, Charlotte; Terreux, Raphael; Honorat, Mylène; Manship, Brigitte; Ansieau, Stéphane; Vigneron, Arnaud M.; Puisieux, Alain; Payen, Léa
2016-01-01
Abstract The TWIST1 bHLH transcription factor controls embryonic development and cancer processes. Although molecular and genetic analyses have provided a wealth of data on the role of bHLH transcription factors, very little is known on the molecular mechanisms underlying their binding affinity to the E-box sequence of the promoter. Here, we used an in silico model of the TWIST1/E12 (TE) heterocomplex and performed molecular dynamics (MD) simulations of its binding to specific (TE-box) and modified E-box sequences. We focused on (i) active E-box and inactive E-box sequences, on (ii) modified active E-box sequences, as well as on (iii) two box sequences with modified adjacent bases the AT- and TA-boxes. Our in silico models were supported by functional in vitro binding assays. This exploration highlighted the predominant role of protein side-chain residues, close to the heart of the complex, at anchoring the dimer to DNA sequences, and unveiled a shift towards adjacent ((-1) and (-1*)) bases and conserved bases of modified E-box sequences. In conclusion, our study provides proof of the predictive value of these MD simulations, which may contribute to the characterization of specific inhibitors by docking approaches, and their use in pharmacological therapies by blocking the tumoral TWIST1/E12 function in cancers. PMID:27151200
Zandarashvili, Levani; White, Mark A; Esadze, Alexandre; Iwahara, Junji
2015-07-08
The inducible transcription factor Egr-1 binds specifically to 9-bp target sequences containing two CpG sites that can potentially be methylated at four cytosine bases. Although it appears that complete CpG methylation would make an unfavorable steric clash in the previous crystal structures of the complexes with unmethylated or partially methylated DNA, our affinity data suggest that DNA recognition by Egr-1 is insensitive to CpG methylation. We have determined, at a 1.4-Å resolution, the crystal structure of the Egr-1 zinc-finger complex with completely methylated target DNA. Structural comparison of the three different methylation states reveals why Egr-1 can recognize the target sequences regardless of CpG methylation. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities
Narasimhan, Kamesh; Lambert, Samuel A; Yang, Ally WH; Riddell, Jeremy; Mnaimneh, Sanie; Zheng, Hong; Albu, Mihai; Najafabadi, Hamed S; Reece-Hoyes, John S; Fuxman Bass, Juan I; Walhout, Albertha JM; Weirauch, Matthew T; Hughes, Timothy R
2015-01-01
Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs. DOI: http://dx.doi.org/10.7554/eLife.06967.001 PMID:25905672
Regulation of endogenous human gene expression by ligand-inducible TALE transcription factors.
Mercer, Andrew C; Gaj, Thomas; Sirk, Shannon J; Lamb, Brian M; Barbas, Carlos F
2014-10-17
The construction of increasingly sophisticated synthetic biological circuits is dependent on the development of extensible tools capable of providing specific control of gene expression in eukaryotic cells. Here, we describe a new class of synthetic transcription factors that activate gene expression in response to extracellular chemical stimuli. These inducible activators consist of customizable transcription activator-like effector (TALE) proteins combined with steroid hormone receptor ligand-binding domains. We demonstrate that these ligand-responsive TALE transcription factors allow for tunable and conditional control of gene activation and can be used to regulate the expression of endogenous genes in human cells. Since TALEs can be designed to recognize any contiguous DNA sequence, the conditional gene regulatory system described herein will enable the design of advanced synthetic gene networks.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Boo-Ja; Park, Chang-Jin; Kim, Sung-Kyu
2006-05-26
We find that salicylic acid and ethephon treatment in hot pepper increases the expression of a putative basic/leucine zipper (bZIP) transcription factor gene, CabZIP1. CabZIP1 mRNA is expressed ubiquitously in various organs. The green fluorescent protein-fused transcription factor, CabZIP1::GFP, can be specifically localized to the nucleus, an action that is consistent with the presence of a nuclear localization signal in its protein sequence. Transient overexpression of the CabZIP1 transcription factor results in an increase in PR-1 transcripts level in Nicotiana benthamiana leaves. Using chromatin immunoprecipitation, we demonstrate that CabZIP1 binds to the G-box elements in native promoter of the hotmore » pepper pathogenesis-related protein 1 (CaPR-1) gene in vivo. Taken together, our results suggest that CabZIP1 plays a role as a transcriptional regulator of the CaPR-1 gene.« less
The yeast genome may harbor hypoxia response elements (HRE).
Ferreira, Túlio César; Hertzberg, Libi; Gassmann, Max; Campos, Elida Geralda
2007-01-01
The hypoxia-inducible factor-1 (HIF-1) is a heterodimeric transcription factor activated when cells are submitted to hypoxia. The heterodimer is composed of two subunits, HIF-1alpha and the constitutively expressed HIF-1beta. During normoxia, HIF-1alpha is degraded by the 26S proteasome, but hypoxia causes HIF-1alpha to be stabilized, enter the nucleus and bind to HIF-1beta, thus forming the active complex. The complex then binds to the regulatory sequences of various genes involved in physiological and pathological processes. The specific regulatory sequence recognized by HIF-1 is the hypoxia response element (HRE) that has the consensus sequence 5'BRCGTGVBBB3'. Although the basic transcriptional regulation machinery is conserved between yeast and mammals, Saccharomyces cerevisiae does not express HIF-1 subunits. However, we hypothesized that baker's yeast has a protein analogous to HIF-1 which participates in the response to changes in oxygen levels by binding to HRE sequences. In this study we screened the yeast genome for HREs using probabilistic motif search tools. We described 24 yeast genes containing motifs with high probability of being HREs (p-value<0.1) and classified them according to biological function. Our results show that S. cerevisiae may harbor HREs and indicate that a transcription factor analogous to HIF-1 may exist in this organism.
Ozawa, Tatsuhiko; Kondo, Masato; Isobe, Masaharu
2004-01-01
The 3' rapid amplification of cDNA ends (3' RACE) is widely used to isolate the cDNA of unknown 3' flanking sequences. However, the conventional 3' RACE often fails to amplify cDNA from a large transcript if there is a long distance between the 5' gene-specific primer and poly(A) stretch, since the conventional 3' RACE utilizes 3' oligo-dT-containing primer complementary to the poly(A) tail of mRNA at the first strand cDNA synthesis. To overcome this problem, we have developed an improved 3' RACE method suitable for the isolation of cDNA derived from very large transcripts. By using the oligonucleotide-containing random 9mer together with the GC-rich sequence for the suppression PCR technology at the first strand of cDNA synthesis, we have been able to amplify the cDNA from a very large transcript, such as the microtubule-actin crosslinking factor 1 (MACF1) gene, which codes a transcript of 20 kb in size. When there is no splicing variant, our highly specific amplification allows us to perform the direct sequencing of 3' RACE products without requiring cloning in bacterial hosts. Thus, this stepwise 3' RACE walking will help rapid characterization of the 3' structure of a gene, even when it encodes a very large transcript.
Prakash, Celine; Haeseler, Arndt Von
2017-03-01
RNA sequencing (RNA-seq) has emerged as the method of choice for measuring the expression of RNAs in a given cell population. In most RNA-seq technologies, sequencing the full length of RNA molecules requires fragmentation into smaller pieces. Unfortunately, the issue of nonuniform sequencing coverage across a genomic feature has been a concern in RNA-seq and is attributed to biases for certain fragments in RNA-seq library preparation and sequencing. To investigate the expected coverage obtained from fragmentation, we develop a simple fragmentation model that is independent of bias from the experimental method and is not specific to the transcript sequence. Essentially, we enumerate all configurations for maximal placement of a given fragment length, F, on transcript length, T, to represent every possible fragmentation pattern, from which we compute the expected coverage profile across a transcript. We extend this model to incorporate general empirical attributes such as read length, fragment length distribution, and number of molecules of the transcript. We further introduce the fragment starting-point, fragment coverage, and read coverage profiles. We find that the expected profiles are not uniform and that factors such as fragment length to transcript length ratio, read length to fragment length ratio, fragment length distribution, and number of molecules influence the variability of coverage across a transcript. Finally, we explore a potential application of the model where, with simulations, we show that it is possible to correctly estimate the transcript copy number for any transcript in the RNA-seq experiment.
Haeseler, Arndt Von
2017-01-01
Abstract RNA sequencing (RNA-seq) has emerged as the method of choice for measuring the expression of RNAs in a given cell population. In most RNA-seq technologies, sequencing the full length of RNA molecules requires fragmentation into smaller pieces. Unfortunately, the issue of nonuniform sequencing coverage across a genomic feature has been a concern in RNA-seq and is attributed to biases for certain fragments in RNA-seq library preparation and sequencing. To investigate the expected coverage obtained from fragmentation, we develop a simple fragmentation model that is independent of bias from the experimental method and is not specific to the transcript sequence. Essentially, we enumerate all configurations for maximal placement of a given fragment length, F, on transcript length, T, to represent every possible fragmentation pattern, from which we compute the expected coverage profile across a transcript. We extend this model to incorporate general empirical attributes such as read length, fragment length distribution, and number of molecules of the transcript. We further introduce the fragment starting-point, fragment coverage, and read coverage profiles. We find that the expected profiles are not uniform and that factors such as fragment length to transcript length ratio, read length to fragment length ratio, fragment length distribution, and number of molecules influence the variability of coverage across a transcript. Finally, we explore a potential application of the model where, with simulations, we show that it is possible to correctly estimate the transcript copy number for any transcript in the RNA-seq experiment. PMID:27661099
SRY, like HMG1, recognizes sharp angles in DNA.
Ferrari, S; Harley, V R; Pontiggia, A; Goodfellow, P N; Lovell-Badge, R; Bianchi, M E
1992-01-01
HMG boxes are DNA binding domains present in chromatin proteins, general transcription factors for nucleolar and mitochondrial RNA polymerases, and gene- and tissue-specific transcriptional regulators. The HMG boxes of HMG1, an abundant component of chromatin, interact specifically with four-way junctions, DNA structures that are cross-shaped and contain angles of approximately 60 and 120 degrees between their arms. We show here also that the HMG box of SRY, the protein that determines the expression of male-specific genes in humans, recognizes four-way junction DNAs irrespective of their sequence. In addition, when SRY binds to linear duplex DNA containing its specific target AACAAAG, it produces a sharp bend. Therefore, the interaction between HMG boxes and DNA appears to be predominantly structure-specific. The production of the recognition of a kink in DNA can serve several distinct functions, such as the repair of DNA lesions, the folding of DNA segments with bound transcriptional factors into productive complexes or the wrapping of DNA in chromatin. Images PMID:1425584
Heix, Jutta; Zomerdijk, Joost C. B. M.; Ravanpay, Ali; Tjian, Robert; Grummt, Ingrid
1997-01-01
Promoter selectivity for all three classes of eukaryotic RNA polymerases is brought about by multimeric protein complexes containing TATA box binding protein (TBP) and specific TBP-associated factors (TAFs). Unlike class II- and III-specific TBP–TAF complexes, the corresponding murine and human class I-specific transcription initiation factor TIF-IB/SL1 exhibits a pronounced selectivity for its homologous promoter. As a first step toward understanding the molecular basis of species-specific promoter recognition, we cloned the cDNAs encoding the three mouse pol I-specific TBP-associated factors (TAFIs) and compared the amino acid sequences of the murine TAFIs with their human counterparts. The four subunits from either species can form stable chimeric complexes that contain stoichiometric amounts of TBP and TAFIs, demonstrating that differences in the primary structure of human and mouse TAFIs do not dramatically alter the network of protein–protein contacts responsible for assembly of the multimeric complex. Thus, primate vs. rodent promoter selectivity mediated by the TBP–TAFI complex is likely to be the result of cumulative subtle differences between individual subunits that lead to species-specific properties of RNA polymerase I transcription. PMID:9050847
Llorens-Bobadilla, Enric; Zhao, Sheng; Baser, Avni; Saiz-Castro, Gonzalo; Zwadlo, Klara; Martin-Villalba, Ana
2015-09-03
Heterogeneous pools of adult neural stem cells (NSCs) contribute to brain maintenance and regeneration after injury. The balance of NSC activation and quiescence, as well as the induction of lineage-specific transcription factors, may contribute to diversity of neuronal and glial fates. To identify molecular hallmarks governing these characteristics, we performed single-cell sequencing of an unbiased pool of adult subventricular zone NSCs. This analysis identified a discrete, dormant NSC subpopulation that already expresses distinct combinations of lineage-specific transcription factors during homeostasis. Dormant NSCs enter a primed-quiescent state before activation, which is accompanied by downregulation of glycolytic metabolism, Notch, and BMP signaling and a concomitant upregulation of lineage-specific transcription factors and protein synthesis. In response to brain ischemia, interferon gamma signaling induces dormant NSC subpopulations to enter the primed-quiescent state. This study unveils general principles underlying NSC activation and lineage priming and opens potential avenues for regenerative medicine in the brain. Copyright © 2015 Elsevier Inc. All rights reserved.
Alexandrov, Boian S; Fukuyo, Yayoi; Lange, Martin; Horikoshi, Nobuo; Gelev, Vladimir; Rasmussen, Kim Ø; Bishop, Alan R; Usheva, Anny
2012-11-01
The genome-wide mapping of the major gene expression regulators, the transcription factors (TFs) and their DNA binding sites, is of great importance for describing cellular behavior and phenotypic diversity. Presently, the methods for prediction of genomic TF binding produce a large number of false positives, most likely due to insufficient description of the physiochemical mechanisms of protein-DNA binding. Growing evidence suggests that, in the cell, the double-stranded DNA (dsDNA) is subject to local transient strands separations (breathing) that contribute to genomic functions. By using site-specific chromatin immunopecipitations, gel shifts, BIOBASE data, and our model that accurately describes the melting behavior and breathing dynamics of dsDNA we report a specific DNA breathing profile found at YY1 binding sites in cells. We find that the genomic flanking sequence variations and SNPs, may exert long-range effects on DNA dynamics and predetermine YY1 binding. The ubiquitous TF YY1 has a fundamental role in essential biological processes by activating, initiating or repressing transcription depending upon the sequence context it binds. We anticipate that consensus binding sequences together with the related DNA dynamics profile may significantly improve the accuracy of genomic TF binding sites and TF binding-related functional SNPs.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sollome, James; Martin, Elizabeth
MicroRNAs (miRNAs) regulate gene expression by binding mRNA and inhibiting translation and/or inducing degradation of the associated transcripts. Expression levels of miRNAs have been shown to be altered in response to environmental toxicants, thus impacting cellular function and influencing disease risk. Transcription factors (TFs) are known to be altered in response to environmental toxicants and play a critical role in the regulation of miRNA expression. To date, environmentally-responsive TFs that are important for regulating miRNAs remain understudied. In a state-of-the-art analysis, we utilized an in silico bioinformatic approach to characterize potential transcriptional regulators of environmentally-responsive miRNAs. Using the miRStart database,more » genomic sequences of promoter regions for all available human miRNAs (n = 847) were identified and promoter regions were defined as − 1000/+500 base pairs from the transcription start site. Subsequently, the promoter region sequences of environmentally-responsive miRNAs (n = 128) were analyzed using enrichment analysis to determine overrepresented TF binding sites (TFBS). While most (56/73) TFs differed across environmental contaminants, a set of 17 TFs was enriched for promoter binding among miRNAs responsive to numerous environmental contaminants. Of these, one TF was common to miRNAs altered by the majority of environmental contaminants, namely SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin, subfamily A, member 3 (SMARCA3). These identified TFs represent candidate common transcriptional regulators of miRNAs perturbed by environmental toxicants. - Highlights: • Transcription factors that regulate environmentally-modulated miRNA expression are understudied • Transcription factor binding sites (TFBS) located within DNA promoter regions of miRNAs were identified. • Specific transcription factors may serve as master regulators of environmentally-mediated microRNA expression.« less
Kulakovskiy, Ivan V; Vorontsov, Ilya E; Yevshin, Ivan S; Sharipov, Ruslan N; Fedorova, Alla D; Rumynskiy, Eugene I; Medvedeva, Yulia A; Magana-Mora, Arturo; Bajic, Vladimir B; Papatsenko, Dmitry A; Kolpakov, Fedor A; Makeev, Vsevolod J
2018-01-04
We present a major update of the HOCOMOCO collection that consists of patterns describing DNA binding specificities for human and mouse transcription factors. In this release, we profited from a nearly doubled volume of published in vivo experiments on transcription factor (TF) binding to expand the repertoire of binding models, replace low-quality models previously based on in vitro data only and cover more than a hundred TFs with previously unknown binding specificities. This was achieved by systematic motif discovery from more than five thousand ChIP-Seq experiments uniformly processed within the BioUML framework with several ChIP-Seq peak calling tools and aggregated in the GTRD database. HOCOMOCO v11 contains binding models for 453 mouse and 680 human transcription factors and includes 1302 mononucleotide and 576 dinucleotide position weight matrices, which describe primary binding preferences of each transcription factor and reliable alternative binding specificities. An interactive interface and bulk downloads are available on the web: http://hocomoco.autosome.ru and http://www.cbrc.kaust.edu.sa/hocomoco11. In this release, we complement HOCOMOCO by MoLoTool (Motif Location Toolbox, http://molotool.autosome.ru) that applies HOCOMOCO models for visualization of binding sites in short DNA sequences. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Research Resource: Aorta- and Liver-Specific ERα-Binding Patterns and Gene Regulation by Estrogen
Gordon, Francesca K.; Vallaster, Caroline S.; Westerling, Thomas; Iyer, Lakshmanan K.; Brown, Myles
2014-01-01
Estrogen has vascular protective effects in premenopausal women and in women younger than 60 years who are receiving hormone replacement therapy. However, estrogen also increases the risks of breast and uterine cancers and of venous thromboses linked to up-regulation of coagulation factors in the liver. In mouse models, the vasculoprotective effects of estrogen are mediated by the estrogen receptor α (ERα) transcription factor. Here, through next-generation sequencing approaches, we show that almost all of the genes regulated by 17β-estradiol (E2) differ between mouse aorta and mouse liver, ex vivo, and that this difference is associated with a distinct genomewide distribution of ERα on chromatin. Bioinformatic analysis of E2-regulated promoters and ERα binding site sequences identify several transcription factors that may determine the tissue specificity of ERα binding and E2-regulated genes, including the enrichment of NF-κB, AML1, and AP1 sites in the promoters of E2 down-regulated inflammatory genes in aorta but not liver. The possible vascular-specific functions of these factors suggest ways in which the protective effects of estrogen could be promoted in the vasculature without incurring negative effects in other tissues. PMID:24992180
The zinc fingers of YY1 bind single-stranded RNA with low sequence specificity.
Wai, Dorothy C C; Shihab, Manar; Low, Jason K K; Mackay, Joel P
2016-11-02
Classical zinc fingers (ZFs) are traditionally considered to act as sequence-specific DNA-binding domains. More recently, classical ZFs have been recognised as potential RNA-binding modules, raising the intriguing possibility that classical-ZF transcription factors are involved in post-transcriptional gene regulation via direct RNA binding. To date, however, only one classical ZF-RNA complex, that involving TFIIIA, has been structurally characterised. Yin Yang-1 (YY1) is a multi-functional transcription factor involved in many regulatory processes, and binds DNA via four classical ZFs. Recent evidence suggests that YY1 also interacts with RNA, but the molecular nature of the interaction remains unknown. In the present work, we directly assess the ability of YY1 to bind RNA using in vitro assays. Systematic Evolution of Ligands by EXponential enrichment (SELEX) was used to identify preferred RNA sequences bound by the YY1 ZFs from a randomised library over multiple rounds of selection. However, a strong motif was not consistently recovered, suggesting that the RNA sequence selectivity of these domains is modest. YY1 ZF residues involved in binding to single-stranded RNA were identified by NMR spectroscopy and found to be largely distinct from the set of residues involved in DNA binding, suggesting that interactions between YY1 and ssRNA constitute a separate mode of nucleic acid binding. Our data are consistent with recent reports that YY1 can bind to RNA in a low-specificity, yet physiologically relevant manner. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Specificity determinants for the abscisic acid response element.
Sarkar, Aditya Kumar; Lahiri, Ansuman
2013-01-01
Abscisic acid (ABA) response elements (ABREs) are a group of cis-acting DNA elements that have been identified from promoter analysis of many ABA-regulated genes in plants. We are interested in understanding the mechanism of binding specificity between ABREs and a class of bZIP transcription factors known as ABRE binding factors (ABFs). In this work, we have modeled the homodimeric structure of the bZIP domain of ABRE binding factor 1 from Arabidopsis thaliana (AtABF1) and studied its interaction with ACGT core motif-containing ABRE sequences. We have also examined the variation in the stability of the protein-DNA complex upon mutating ABRE sequences using the protein design algorithm FoldX. The high throughput free energy calculations successfully predicted the ability of ABF1 to bind to alternative core motifs like GCGT or AAGT and also rationalized the role of the flanking sequences in determining the specificity of the protein-DNA interaction.
Measurements of nonlinear Hall-driven reconnection in the reversed field pinch
NASA Astrophysics Data System (ADS)
Tharp, Timothy D.
Complex organisms are able to develop because of the complex regulatory systems that control their gene expression. The first step in this regulation, transcription initiation, is controlled by transcription factors. Transcription factors are modular proteins composed of two distinct domains, the DNA binding domain and the regulatory domain. These molecules are involved in a plethora of important biological processes including embryogenesis, development, cell health, and cancer. Tissue enriched transcription factors Nkx-2.5 and Gata4 are involved in cardiac development and cardiac health. In this thesis the DNA binding specificity of Nkx-2.5 will be analyzed using a high throughput double stranded DNA platform called Cognate Site Identifier (CSI) arrays (Chapter 2). The full DNA binding specificity of Nkx-2.5 and Nkx-2.5 mutants will be visualized using Sequence Specificity Landscapes (SSLs). In Chapter 3, the definition of binding specificity will be investigated by evaluating a number of different DNA binding folds by CSI and SSLs. CSI and SSLs will also be used to evaluate different pyrrole/imidazole hairpin polyamides in order to better characterize these small molecule DNA binding domains. CSI and SSL data will be applied to the genome in order to explain the biological function an artificial transcription factor. Chapter 4 will discuss the mechanism of nonspecific DNA binding. The historical means of predicting DNA binding will be challenged by utilizing high throughput experiments. The effect of salt concentration on both specific and nonspecific binding will also be investigated. Finally, in Chapter 5, a generation of Protein DNA Dimerizer will be discussed. A PDD that regulates transcription on genomic DNA by binding cooperatively with the heart IF Gata4 will be characterized. These studies provide understanding of, and a means to control, how transcription factors sample the endless sea of DNA in the genome in order to regulate gene expression with such wonderful specificity.
Fonseca, Dora Janeth; Ortega-Recalde, Oscar; Esteban-Perez, Clara; Moreno-Ortiz, Harold; Patiño, Liliana Catherine; Bermúdez, Olga María; Ortiz, Angela María; Restrepo, Carlos M; Lucena, Elkin; Laissue, Paul
2014-11-01
BMP15 has drawn particular attention in the pathophysiology of reproduction, as its mutations in mammalian species have been related to different reproductive phenotypes. In humans, BMP15 coding regions have been sequenced in large panels of women with premature ovarian failure (POF), but only some mutations have been definitely validated as causing the phenotype. A functional association between the BMP15 c.-9C>G promoter polymorphism and cause of POF have been reported. The aim of this study was to determine the potential functional effect of this sequence variant on specific BMP15 promoter transactivation disturbances. Bioinformatics was used to identify transcription factor binding sites located on the promoter region of BMP15. Reverse transcription polymerase chain reaction was used to study specific gene expression in ovarian tissue. Luciferase reporter assays were used to establish transactivation disturbances caused by the BMP15 c.-9C>G variant. The c.-9C>G variant was found to modify the PITX1 transcription factor binding site. PITX1 and BMP15 co-expressed in human and mouse ovarian tissue, and PITX1 transactivated both BMP15 promoter versions (-9C and -9G). It was found that the BMP15 c.-9G allele was related to BMP15 increased transcription, supporting c.-9C>G as a causal agent of POF. Copyright © 2014 Reproductive Healthcare Ltd. Published by Elsevier Ltd. All rights reserved.
Peoples, R J; Cisco, M J; Kaplan, P; Francke, U
1998-01-01
We have identified a novel gene (WBSCR9) within the common Williams-Beuren syndrome (WBS) deletion by interspecies sequence conservation. The WBSCR9 gene encodes a roughly 7-kb transcript with an open reading frame of 1483 amino acids and a predicted protein product size of 170.8 kDa. WBSCR9 is comprised of at least 20 exons extending over 60 kb. The transcript is expressed ubiquitously throughout development and is subject to alternative splicing. Functional motifs identified by sequence homology searches include a bromodomain; a PHD, or C4HC3, finger; several putative nuclear localization signals; four nuclear receptor binding motifs; a polyglutamate stretch and two PEST sequences. Bromodomains, PHD motifs and nuclear receptor binding motifs are cardinal features of proteins that are involved in chromatin remodeling and modulation of transcription. Haploinsufficiency for WBSCR9 gene products may contribute to the complex phenotype of WBS by interacting with tissue-specific regulatory factors during development.
Lu, Wanxiang; Yang, Li; Jiang, Yuanzhong; Luo, Keming
2013-01-01
Because of the importance of wood in many industrial applications, tremendous studies have been performed on wood formation, especially in lignin biosynthesis. MYB transcription factors (TFs), which consist of a large family of plant TFs, have been reported to directly regulate lignin biosynthetic genes in a number of plants. In this study, we describe the cloning and functional characterization of PtoMYB216, a cDNA isolated from Chinese white poplar (Populus tomentosa Carr.). PtoMYB216 encodes a protein belonging to the R2R3-MYB family and displays significant similarity with other MYB factors shown to regulate lignin synthesis in Arabidopsis. Gene expression profiling studies showed that PtoMYB216 mRNA is specifically expressed during secondary wall formation in wood. The 1.8-kb promoter sequence of PtoMYB216 was fused to the GUS coding sequence and introduced into wild-type A. thaliana. GUS expression was shown to be restricted to tissues undergoing secondary cell wall formation. Overexpression of PtoMYB216 specifically activated the expression of the upstream genes in the lignin biosynthetic pathway and resulted in ectopic deposition of lignin in cells that are normally unligninified. These results suggest that PtoMYB216 is specific transcriptional activators of lignin biosynthesis and involved in the regulation of wood formation in poplar. PMID:24204619
Schwessinger, Ron; Suciu, Maria C; McGowan, Simon J; Telenius, Jelena; Taylor, Stephen; Higgs, Doug R; Hughes, Jim R
2017-10-01
In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k -mer-based analysis of DNase footprints to determine any k -mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome. © 2017 Schwessinger et al.; Published by Cold Spring Harbor Laboratory Press.
Long noncoding RNAs responsive to Fusarium oxysporum infection in Arabidopsis thaliana.
Zhu, Qian-Hao; Stephen, Stuart; Taylor, Jennifer; Helliwell, Chris A; Wang, Ming-Bo
2014-01-01
Short noncoding RNAs have been demonstrated to play important roles in regulation of gene expression and stress responses, but the repertoire and functions of long noncoding RNAs (lncRNAs) remain largely unexplored, particularly in plants. To explore the role of lncRNAs in disease resistance, we used a strand-specific RNA-sequencing approach to identify lncRNAs responsive to Fusarium oxysporum infection in Arabidopsis thaliana. Antisense transcription was found in c. 20% of the annotated A. thaliana genes. Several noncoding natural antisense transcripts responsive to F. oxysporum infection were found in genes implicated in disease defense. While the majority of the novel transcriptionally active regions (TARs) were adjacent to annotated genes and could be an extension of the annotated transcripts, 159 novel intergenic TARs, including 20 F. oxysporum-responsive lncTARs, were identified. Ten F. oxysporum-induced lncTARs were functionally characterized using T-DNA insertion or RNA-interference knockdown lines, and five were demonstrated to be related to disease development. Promoter analysis suggests that some of the F. oxysporum-induced lncTARs are direct targets of transcription factor(s) responsive to pathogen attack. Our results demonstrated that strand-specific RNA sequencing is a powerful tool for uncovering hidden levels of transcriptome and that IncRNAs are important components of the antifungal networks in A. thaliana. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.
Nascent-Seq reveals novel features of mouse circadian transcriptional regulation
Menet, Jerome S; Rodriguez, Joseph; Abruzzi, Katharine C; Rosbash, Michael
2012-01-01
A substantial fraction of the metazoan transcriptome undergoes circadian oscillations in many cells and tissues. Based on the transcription feedback loops important for circadian timekeeping, it is commonly assumed that this mRNA cycling reflects widespread transcriptional regulation. To address this issue, we directly measured the circadian dynamics of mouse liver transcription using Nascent-Seq (genome-wide sequencing of nascent RNA). Although many genes are rhythmically transcribed, many rhythmic mRNAs manifest poor transcriptional rhythms, indicating a prominent contribution of post-transcriptional regulation to circadian mRNA expression. This analysis of rhythmic transcription also showed that the rhythmic DNA binding profile of the transcription factors CLOCK and BMAL1 does not determine the transcriptional phase of most target genes. This likely reflects gene-specific collaborations of CLK:BMAL1 with other transcription factors. These insights from Nascent-Seq indicate that it should have broad applicability to many other gene expression regulatory issues. DOI: http://dx.doi.org/10.7554/eLife.00011.001 PMID:23150795
An Autonomous BMP2 Regulatory Element in Mesenchymal Cells
Kruithof, Boudewijn P.T.; Fritz, David T.; Liu, Yijun; Garsetti, Diane E.; Frank, David B.; Pregizer, Steven K.; Gaussin, Vinciane; Mortlock, Douglas P.; Rogers, Melissa B.
2014-01-01
BMP2 is a morphogen that controls mesenchymal cell differentiation and behavior. For example, BMP2 concentration controls the differentiation of mesenchymal precursors into myocytes, adipocytes, chondrocytes, and osteoblasts. Sequences within the 3′untranslated region (UTR) of the Bmp2 mRNA mediate a post-transcriptional block of protein synthesis. Interaction of cell and developmental stage-specific trans-regulatory factors with the 3′UTR is a nimble and versatile mechanism for modulating this potent morphogen in different cell types. We show here, that an ultra-conserved sequence in the 3′UTR functions independently of promoter, coding region, and 3′UTR context in primary and immortalized tissue culture cells and in transgenic mice. Our findings indicate that the ultra-conserved sequence is an autonomously functioning post-transcriptional element that may be used to modulate the level of BMP2 and other proteins while retaining tissue specific regulatory elements. PMID:21268088
Murthy, S C; Bhat, G P; Thimmappaya, B
1985-01-01
A molecular dissection of the adenovirus EIIA early (E) promoter was undertaken to study the sequence elements required for transcription and to examine the nucleotide sequences, if any, specific for its trans-activation by the viral pre-early EIA gene product. A chimeric gene in which the EIIA-E promoter region fused to the coding sequences of the bacterial chloramphenicol acetyltransferase (CAT) gene was used in transient assays to identify the transcriptional control regions. Deletion mapping studies revealed that the upstream DNA sequences up to -86 were sufficient for the optimal basal level transcription in HeLa cells and also for the EIA-induced transcription. A series of linker-scanning (LS) mutants were constructed to precisely identify the nucleotide sequences that control transcription. Analysis of these LS mutants allowed us to identify two regions of the promoter that are critical for the EIIA-E transcription. These regions are located between -29 and -21 (region I) and between -82 and -66 (region II). Mutations in region I affected initiation and appeared functionally similar to the "TATA" sequence of the commonly studied promoters. To examine whether or not the EIIA-E promoter contained DNA sequences specific for the trans-activation by the EIA, the LS mutants were analyzed in a cotransfection assay containing a plasmid carrying the EIA gene. CAT activity of all of the LS mutants was induced by the EIA gene in this assay, suggesting that the induction of transcription of the EIIA-E promoter by the EIA gene is not sequence-specific. Images PMID:3857577
Keilwagen, Jens; Grau, Jan; Paponov, Ivan A; Posch, Stefan; Strickert, Marc; Grosse, Ivo
2011-02-10
Transcription factors are a main component of gene regulation as they activate or repress gene expression by binding to specific binding sites in promoters. The de-novo discovery of transcription factor binding sites in target regions obtained by wet-lab experiments is a challenging problem in computational biology, which has not been fully solved yet. Here, we present a de-novo motif discovery tool called Dispom for finding differentially abundant transcription factor binding sites that models existing positional preferences of binding sites and adjusts the length of the motif in the learning process. Evaluating Dispom, we find that its prediction performance is superior to existing tools for de-novo motif discovery for 18 benchmark data sets with planted binding sites, and for a metazoan compendium based on experimental data from micro-array, ChIP-chip, ChIP-DSL, and DamID as well as Gene Ontology data. Finally, we apply Dispom to find binding sites differentially abundant in promoters of auxin-responsive genes extracted from Arabidopsis thaliana microarray data, and we find a motif that can be interpreted as a refined auxin responsive element predominately positioned in the 250-bp region upstream of the transcription start site. Using an independent data set of auxin-responsive genes, we find in genome-wide predictions that the refined motif is more specific for auxin-responsive genes than the canonical auxin-responsive element. In general, Dispom can be used to find differentially abundant motifs in sequences of any origin. However, the positional distribution learned by Dispom is especially beneficial if all sequences are aligned to some anchor point like the transcription start site in case of promoter sequences. We demonstrate that the combination of searching for differentially abundant motifs and inferring a position distribution from the data is beneficial for de-novo motif discovery. Hence, we make the tool freely available as a component of the open-source Java framework Jstacs and as a stand-alone application at http://www.jstacs.de/index.php/Dispom.
Regulation of cell fate determination by single-repeat R3 MYB transcription factors in Arabidopsis
Wang, Shucai; Chen, Jin-Gui
2014-01-01
MYB transcription factors regulate multiple aspects of plant growth and development. Among the large family of MYB transcription factors, single-repeat R3 MYBs are characterized by their short sequence (<120 amino acids) consisting largely of the single MYB DNA-binding repeat. In the model plant Arabidopsis, R3 MYBs mediate lateral inhibition during epidermal patterning and are best characterized for their regulatory roles in trichome and root hair development. R3 MYBs act as negative regulators for trichome formation but as positive regulators for root hair development. In this article, we provide a comprehensive review on the role of R3 MYBs in the regulation of cell type specification in the model plant Arabidopsis. PMID:24782874
Regulation of Cell Fate Determination by Single-Repeat R3 MYB Transcription Factors in Arabidopsis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Shucai; Chen, Jay
2014-01-01
MYB transcription factors regulate multiple aspects of plant growth and development. Among the large family of MYB transcription factors, single-repeat R3 MYB are characterized by their short sequence (<120 amino acids) consisting largely of the single MYB DNA-binding repeat. In the model plant Arabidopsis, R3 MYBs mediate lateral inhibition during epidermal patterning and are best characterized for their regulatory roles in trichome and root hair development. R3 MYBs act as negative regulators for trichome formation but as positive regulators for root hair development. In this article, we provide a comprehensive review on the role of R3 MYBs in the regulationmore » of cell type specification in the model plant Arabidopsis.« less
In silico modeling of epigenetic-induced changes in photoreceptor cis-regulatory elements.
Hossain, Reafa A; Dunham, Nicholas R; Enke, Raymond A; Berndsen, Christopher E
2018-01-01
DNA methylation is a well-characterized epigenetic repressor of mRNA transcription in many plant and vertebrate systems. However, the mechanism of this repression is not fully understood. The process of transcription is controlled by proteins that regulate recruitment and activity of RNA polymerase by binding to specific cis-regulatory sequences. Cone-rod homeobox (CRX) is a well-characterized mammalian transcription factor that controls photoreceptor cell-specific gene expression. Although much is known about the functions and DNA binding specificity of CRX, little is known about how DNA methylation modulates CRX binding affinity to genomic cis-regulatory elements. We used bisulfite pyrosequencing of human ocular tissues to measure DNA methylation levels of the regulatory regions of RHO , PDE6B, PAX6 , and LINE1 retrotransposon repeats. To describe the molecular mechanism of repression, we used molecular modeling to illustrate the effect of DNA methylation on human RHO regulatory sequences. In this study, we demonstrate an inverse correlation between DNA methylation in regulatory regions adjacent to the human RHO and PDE6B genes and their subsequent transcription in human ocular tissues. Docking of CRX to the DNA models shows that CRX interacts with the grooves of these sequences, suggesting changes in groove structure could regulate binding. Molecular dynamics simulations of the RHO promoter and enhancer regions show changes in the flexibility and groove width upon epigenetic modification. Models also demonstrate changes in the local dynamics of CRX binding sites within RHO regulatory sequences which may account for the repression of CRX-dependent transcription. Collectively, these data demonstrate epigenetic regulation of CRX binding sites in human retinal tissue and provide insight into the mechanism of this mode of epigenetic regulation to be tested in future experiments.
Estimation of pairwise sequence similarity of mammalian enhancers with word neighbourhood counts.
Göke, Jonathan; Schulz, Marcel H; Lasserre, Julia; Vingron, Martin
2012-03-01
The identity of cells and tissues is to a large degree governed by transcriptional regulation. A major part is accomplished by the combinatorial binding of transcription factors at regulatory sequences, such as enhancers. Even though binding of transcription factors is sequence-specific, estimating the sequence similarity of two functionally similar enhancers is very difficult. However, a similarity measure for regulatory sequences is crucial to detect and understand functional similarities between two enhancers and will facilitate large-scale analyses like clustering, prediction and classification of genome-wide datasets. We present the standardized alignment-free sequence similarity measure N2, a flexible framework that is defined for word neighbourhoods. We explore the usefulness of adding reverse complement words as well as words including mismatches into the neighbourhood. On simulated enhancer sequences as well as functional enhancers in mouse development, N2 is shown to outperform previous alignment-free measures. N2 is flexible, faster than competing methods and less susceptible to single sequence noise and the occurrence of repetitive sequences. Experiments on the mouse enhancers reveal that enhancers active in different tissues can be separated by pairwise comparison using N2. N2 represents an improvement over previous alignment-free similarity measures without compromising speed, which makes it a good candidate for large-scale sequence comparison of regulatory sequences. The software is part of the open-source C++ library SeqAn (www.seqan.de) and a compiled version can be downloaded at http://www.seqan.de/projects/alf.html. Supplementary data are available at Bioinformatics online.
CicerTransDB 1.0: a resource for expression and functional study of chickpea transcription factors.
Gayali, Saurabh; Acharya, Shankar; Lande, Nilesh Vikram; Pandey, Aarti; Chakraborty, Subhra; Chakraborty, Niranjan
2016-07-29
Transcription factor (TF) databases are major resource for systematic studies of TFs in specific species as well as related family members. Even though there are several publicly available multi-species databases, the information on the amount and diversity of TFs within individual species is fragmented, especially for newly sequenced genomes of non-model species of agricultural significance. We constructed CicerTransDB (Cicer Transcription Factor Database), the first database of its kind, which would provide a centralized putatively complete list of TFs in a food legume, chickpea. CicerTransDB, available at www.cicertransdb.esy.es , is based on chickpea (Cicer arietinum L.) annotation v 1.0. The database is an outcome of genome-wide domain study and manual classification of TF families. This database not only provides information of the gene, but also gene ontology, domain and motif architecture. CicerTransDB v 1.0 comprises information of 1124 genes of chickpea and enables the user to not only search, browse and download sequences but also retrieve sequence features. CicerTransDB also provides several single click interfaces, transconnecting to various other databases to ease further analysis. Several webAPI(s) integrated in the database allow end-users direct access of data. A critical comparison of CicerTransDB with PlantTFDB (Plant Transcription Factor Database) revealed 68 novel TFs in the chickpea genome, hitherto unexplored. Database URL: http://www.cicertransdb.esy.es.
Petit, F G; Métivier, R; Valotaire, Y; Pakdel, F
1999-01-01
In all oviparous, liver represents one of the main E2-target tissues where estrogen receptor (ER) constitutes the key mediator of estrogen action. The rainbow trout estrogen receptor (rtER) gene expression is markedly up-regulated by estrogens and the sequences responsible for this autoregulation have been located in a 0.2 kb upstream transcription start site within - 40/- 248 enhancer region. Absence of interference with steroid hormone receptors and tissue-specific factors and a conserved basal transcriptional machinery between yeast and higher eukaryotes, make yeast a simple assay system that will enable determination of important cis-acting regulatory sequences within rtER gene promoter and identification of transcription factors implicated in the regulation of this gene. Deletion analysis allowed to show a synergistic effect between an imperfect estrogen-responsive element (ERE) and a consensus half-ERE to achieve a high hormone-dependent transcriptional activation of the rtER gene promoter in the presence of stably expressed rtER. As in mammalian cells, here we observed a positive regulation of the rtER gene promoter by the chicken ovalbumin upstream promoter-transcription factor I (COUP-TFI) through enhancing autoregulation. Using a point mutation COUP-TFI mutant unable to bind DNA demonstrates that enhancement of rtER gene autoregulation requires the interaction of COUP-TFI to the DNA. Moreover, this enhancement of transcriptional activation by COUP-TFI requires specifically the AF-1 transactivation function of ER and can be observed in the presence of E2 or 4-hydroxytamoxifen but not ICI 164384. Thus, this paper describes the reconstitution of a hormone-responsive transcription unit in yeast in which the regulation of rtER gene promoter could be enhanced by the participation of cis-elements and/or trans-acting factors, such as ER itself or COUP-TF.
Protein-protein interactions in the regulation of WRKY transcription factors.
Chi, Yingjun; Yang, Yan; Zhou, Yuan; Zhou, Jie; Fan, Baofang; Yu, Jing-Quan; Chen, Zhixiang
2013-03-01
It has been almost 20 years since the first report of a WRKY transcription factor, SPF1, from sweet potato. Great progress has been made since then in establishing the diverse biological roles of WRKY transcription factors in plant growth, development, and responses to biotic and abiotic stress. Despite the functional diversity, almost all analyzed WRKY proteins recognize the TTGACC/T W-box sequences and, therefore, mechanisms other than mere recognition of the core W-box promoter elements are necessary to achieve the regulatory specificity of WRKY transcription factors. Research over the past several years has revealed that WRKY transcription factors physically interact with a wide range of proteins with roles in signaling, transcription, and chromatin remodeling. Studies of WRKY-interacting proteins have provided important insights into the regulation and mode of action of members of the important family of transcription factors. It has also emerged that the slightly varied WRKY domains and other protein motifs conserved within each of the seven WRKY subfamilies participate in protein-protein interactions and mediate complex functional interactions between WRKY proteins and between WRKY and other regulatory proteins in the modulation of important biological processes. In this review, we summarize studies of protein-protein interactions for WRKY transcription factors and discuss how the interacting partners contribute, at different levels, to the establishment of the complex regulatory and functional network of WRKY transcription factors.
Regulating the dorsal neural tube expression of Ptf1a through a distal 3' enhancer.
Mona, Bishakha; Avila, John M; Meredith, David M; Kollipara, Rahul K; Johnson, Jane E
2016-10-01
Generating the correct balance of inhibitory and excitatory neurons in a neural network is essential for normal functioning of a nervous system. The neural network in the dorsal spinal cord functions in somatosensation where it modulates and relays sensory information from the periphery. PTF1A is a key transcriptional regulator present in a specific subset of neural progenitor cells in the dorsal spinal cord, cerebellum and retina that functions to specify an inhibitory neuronal fate while suppressing excitatory neuronal fates. Thus, the regulation of Ptf1a expression is critical for determining mechanisms controlling neuronal diversity in these regions of the nervous system. Here we identify a sequence conserved, tissue-specific enhancer located 10.8kb 3' of the Ptf1a coding region that is sufficient to direct expression to dorsal neural tube progenitors that give rise to neurons in the dorsal spinal cord in chick and mouse. DNA binding motifs for Paired homeodomain (Pd-HD) and zinc finger (ZF) transcription factors are required for enhancer activity. Mutations in these sequences implicate the Pd-HD motif for activator function and the ZF motif for repressor function. Although no repressor transcription factor was identified, both PAX6 and SOX3 can increase enhancer activity in reporter assays. Thus, Ptf1a is regulated by active and repressive inputs integrated through multiple sequence elements within a highly conserved sequence downstream of the Ptf1a gene. Copyright © 2016 Elsevier Inc. All rights reserved.
Searching for transcription factor binding sites in vector spaces
2012-01-01
Background Computational approaches to transcription factor binding site identification have been actively researched in the past decade. Learning from known binding sites, new binding sites of a transcription factor in unannotated sequences can be identified. A number of search methods have been introduced over the years. However, one can rarely find one single method that performs the best on all the transcription factors. Instead, to identify the best method for a particular transcription factor, one usually has to compare a handful of methods. Hence, it is highly desirable for a method to perform automatic optimization for individual transcription factors. Results We proposed to search for transcription factor binding sites in vector spaces. This framework allows us to identify the best method for each individual transcription factor. We further introduced two novel methods, the negative-to-positive vector (NPV) and optimal discriminating vector (ODV) methods, to construct query vectors to search for binding sites in vector spaces. Extensive cross-validation experiments showed that the proposed methods significantly outperformed the ungapped likelihood under positional background method, a state-of-the-art method, and the widely-used position-specific scoring matrix method. We further demonstrated that motif subtypes of a TF can be readily identified in this framework and two variants called the k NPV and k ODV methods benefited significantly from motif subtype identification. Finally, independent validation on ChIP-seq data showed that the ODV and NPV methods significantly outperformed the other compared methods. Conclusions We conclude that the proposed framework is highly flexible. It enables the two novel methods to automatically identify a TF-specific subspace to search for binding sites. Implementations are available as source code at: http://biogrid.engr.uconn.edu/tfbs_search/. PMID:23244338
Directing an artificial zinc finger protein to new targets by fusion to a non-DNA-binding domain.
Lim, Wooi F; Burdach, Jon; Funnell, Alister P W; Pearson, Richard C M; Quinlan, Kate G R; Crossley, Merlin
2016-04-20
Transcription factors are often regarded as having two separable components: a DNA-binding domain (DBD) and a functional domain (FD), with the DBD thought to determine target gene recognition. While this holds true for DNA bindingin vitro, it appears thatin vivoFDs can also influence genomic targeting. We fused the FD from the well-characterized transcription factor Krüppel-like Factor 3 (KLF3) to an artificial zinc finger (AZF) protein originally designed to target the Vascular Endothelial Growth Factor-A (VEGF-A) gene promoter. We compared genome-wide occupancy of the KLF3FD-AZF fusion to that observed with AZF. AZF bound to theVEGF-Apromoter as predicted, but was also found to occupy approximately 25,000 other sites, a large number of which contained the expected AZF recognition sequence, GCTGGGGGC. Interestingly, addition of the KLF3 FD re-distributes the fusion protein to new sites, with total DNA occupancy detected at around 50,000 sites. A portion of these sites correspond to known KLF3-bound regions, while others contained sequences similar but not identical to the expected AZF recognition sequence. These results show that FDs can influence and may be useful in directing AZF DNA-binding proteins to specific targets and provide insights into how natural transcription factors operate. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Crinelli, Rita; Carloni, Elisa; Menotta, Michele; Giacomini, Elisa; Bianchi, Marzia; Ambrosi, Gianluca; Giorgi, Luca; Magnani, Mauro
2010-05-25
Oligonucleotide (ODN) decoys are synthetic ODNs containing the DNA binding sequence of a transcription factor. When delivered to cells, these molecules can compete with endogenous sequences for binding the transcription factor, thus inhibiting its ability to activate the expression of target genes. Modulation of gene expression by decoy ODNs against nuclear factor-kappaB (NF-kappaB), a transcription factor regulating many genes involved in immunity, has been achieved in a variety of immune/inflammatory disorders. However, the successful use of transcription factor decoys depends on an efficient means to bring the synthetic DNA to target cells. It is known that single-walled carbon nanotubes (SWCNTs), under certain conditions, are able to cross the cell membrane. Thus, we have evaluated the possibility to functionalize SWCNTs with decoy ODNs against NF-kappaB in order to improve their intracellular delivery. To couple ODNs to CNTs, we have exploited the carbodiimide chemistry which allows covalent binding of amino-modified ODNs to carboxyl groups introduced onto SWCNTs through oxidation. The effective binding of ODNs to nanotubes has been demonstrated by a combination of microscopic, spectroscopic, and electrophoretic techniques. The uptake and subcellular distribution of ODN decoys bound to SWCNTs was analyzed by fluorescence microscopy. ODNs were internalized into macrophages and accumulated in the cytosol. Moreover, no cytotoxicity associated with SWCNT administration was observed. Finally, NF-kappaB-dependent gene expression was significantly reduced in cells receiving nanomolar concentrations of SWCNT-NF-kappaB decoys compared to cells receiving SWCNTs or SWCNTs functionalized with a nonspecific ODN sequence, demonstrating both efficacy and specificity of the approach.
Anish, Ramakrishnan; Hossain, Mohammad B.; Jacobson, Raymond H.; Takada, Shinako
2009-01-01
Background More than 80% of mammalian protein-coding genes are driven by TATA-less promoters which often show multiple transcriptional start sites (TSSs). However, little is known about the core promoter DNA sequences or mechanisms of transcriptional initiation for this class of promoters. Methodology/Principal Findings Here we identify a new core promoter element XCPE2 (X core promoter element 2) (consensus sequence: A/C/G-C-C/T-C-G/A-T-T-G/A-C-C/A+1-C/T) that can direct specific transcription from the second TSS of hepatitis B virus X gene mRNA. XCPE2 sequences can also be found in human promoter regions and typically appear to drive one of the start sites within multiple TSS-containing TATA-less promoters. To gain insight into mechanisms of transcriptional initiation from this class of promoters, we examined requirements of several general transcription factors by in vitro transcription experiments using immunodepleted nuclear extracts and purified factors. Our results show that XCPE2-driven transcription uses at least TFIIB, either TFIID or free TBP, RNA polymerase II (RNA pol II) and the MED26-containing mediator complex but not Gcn5. Therefore, XCPE2-driven transcription can be carried out by a mechanism which differs from previously described TAF-dependent mechanisms for initiator (Inr)- or downstream promoter element (DPE)-containing promoters, the TBP- and SAGA (Spt-Ada-Gcn5-acetyltransferase)-dependent mechanism for yeast TATA-containing promoters, or the TFTC (TBP-free-TAF-containing complex)-dependent mechanism for certain Inr-containing TATA-less promoters. EMSA assays using XCPE2 promoter and purified factors further suggest that XCPE2 promoter recognition requires a set of factors different from those for TATA box, Inr, or DPE promoter recognition. Conclusions/Significance We identified a new core promoter element XCPE2 that are found in multiple TSS-containing TATA-less promoters. Mechanisms of promoter recognition and transcriptional initiation for XCPE2-driven promoters appear different from previously shown mechanisms for classical promoters that show single “focused” TSSs. Our studies provide insight into novel mechanisms of RNA Pol II transcription from multiple TSS-containing TATA-less promoters. PMID:19337366
Ciolkowski, Ingo; Wanke, Dierk; Birkenbihl, Rainer P; Somssich, Imre E
2008-09-01
WRKY transcription factors have been shown to play a major role in regulating, both positively and negatively, the plant defense transcriptome. Nearly all studied WRKY factors appear to have a stereotypic binding preference to one DNA element termed the W-box. How specificity for certain promoters is accomplished therefore remains completely unknown. In this study, we tested five distinct Arabidopsis WRKY transcription factor subfamily members for their DNA binding selectivity towards variants of the W-box embedded in neighboring DNA sequences. These studies revealed for the first time differences in their binding site preferences, which are partly dependent on additional adjacent DNA sequences outside of the TTGACY-core motif. A consensus WRKY binding site derived from these studies was used for in silico analysis to identify potential target genes within the Arabidopsis genome. Furthermore, we show that even subtle amino acid substitutions within the DNA binding region of AtWRKY11 strongly impinge on its binding activity. Additionally, all five factors were found localized exclusively to the plant cell nucleus and to be capable of trans-activating expression of a reporter gene construct in vivo.
Clifford, Jacob; Adami, Christoph
2015-09-02
Transcription factor binding to the surface of DNA regulatory regions is one of the primary causes of regulating gene expression levels. A probabilistic approach to model protein-DNA interactions at the sequence level is through position weight matrices (PWMs) that estimate the joint probability of a DNA binding site sequence by assuming positional independence within the DNA sequence. Here we construct conditional PWMs that depend on the motif signatures in the flanking DNA sequence, by conditioning known binding site loci on the presence or absence of additional binding sites in the flanking sequence of each site's locus. Pooling known sites with similar flanking sequence patterns allows for the estimation of the conditional distribution function over the binding site sequences. We apply our model to the Dorsal transcription factor binding sites active in patterning the Dorsal-Ventral axis of Drosophila development. We find that those binding sites that cooperate with nearby Twist sites on average contain about 0.5 bits of information about the presence of Twist transcription factor binding sites in the flanking sequence. We also find that Dorsal binding site detectors conditioned on flanking sequence information make better predictions about what is a Dorsal site relative to background DNA than detection without information about flanking sequence features.
SM-TF: A structural database of small molecule-transcription factor complexes.
Xu, Xianjin; Ma, Zhiwei; Sun, Hongmin; Zou, Xiaoqin
2016-06-30
Transcription factors (TFs) are the proteins involved in the transcription process, ensuring the correct expression of specific genes. Numerous diseases arise from the dysfunction of specific TFs. In fact, over 30 TFs have been identified as therapeutic targets of about 9% of the approved drugs. In this study, we created a structural database of small molecule-transcription factor (SM-TF) complexes, available online at http://zoulab.dalton.missouri.edu/SM-TF. The 3D structures of the co-bound small molecule and the corresponding binding sites on TFs are provided in the database, serving as a valuable resource to assist structure-based drug design related to TFs. Currently, the SM-TF database contains 934 entries covering 176 TFs from a variety of species. The database is further classified into several subsets by species and organisms. The entries in the SM-TF database are linked to the UniProt database and other sequence-based TF databases. Furthermore, the druggable TFs from human and the corresponding approved drugs are linked to the DrugBank. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Santillán, Orlando; Ramírez-Romero, Miguel A; Lozano, Luis; Checa, Alberto; Encarnación, Sergio M; Dávila, Guillermo
2016-01-01
Sigma factors are RNA polymerase subunits engaged in promoter recognition and DNA strand separation during transcription initiation in bacteria. Primary sigma factors are responsible for the expression of housekeeping genes and are essential for survival. RpoD, the primary sigma factor of Escherichia coli, a γ-proteobacteria, recognizes consensus promoter sequences highly similar to those of some α-proteobacteria species. Despite this resemblance, RpoD is unable to sustain transcription from most of the α-proteobacterial promoters tested so far. In contrast, we have found that SigA, the primary sigma factor of Rhizobium etli, an α-proteobacteria, is able to transcribe E. coli promoters, although it exhibits only 48% identity (98% coverage) to RpoD. We have called this the transcriptional laxity phenomenon. Here, we show that SigA partially complements the thermo-sensitive deficiency of RpoD285 from E. coli strain UQ285 and that the SigA region σ4 is responsible for this phenotype. Sixteen out of 74 residues (21.6%) within region σ4 are variable between RpoD and SigA. Mutating these residues significantly improves SigA ability to complement E. coli UQ285. Only six of these residues fall into positions already known to interact with promoter DNA and to comprise a helix-turn-helix motif. The remaining variable positions are located on previously unexplored sites inside region σ4, specifically into the first two α-helices of the region. Neither of the variable positions confined to these helices seem to interact directly with promoter sequence; instead, we adduce that these residues participate allosterically by contributing to correct region folding and/or positioning of the HTH motif. We propose that transcriptional laxity is a mechanism for ensuring transcription in spite of naturally occurring mutations from endogenous promoters and/or horizontally transferred DNA sequences, allowing survival and fast environmental adaptation of α-proteobacteria.
Santillán, Orlando; Ramírez-Romero, Miguel A.; Lozano, Luis; Checa, Alberto; Encarnación, Sergio M.; Dávila, Guillermo
2016-01-01
Sigma factors are RNA polymerase subunits engaged in promoter recognition and DNA strand separation during transcription initiation in bacteria. Primary sigma factors are responsible for the expression of housekeeping genes and are essential for survival. RpoD, the primary sigma factor of Escherichia coli, a γ-proteobacteria, recognizes consensus promoter sequences highly similar to those of some α-proteobacteria species. Despite this resemblance, RpoD is unable to sustain transcription from most of the α-proteobacterial promoters tested so far. In contrast, we have found that SigA, the primary sigma factor of Rhizobium etli, an α-proteobacteria, is able to transcribe E. coli promoters, although it exhibits only 48% identity (98% coverage) to RpoD. We have called this the transcriptional laxity phenomenon. Here, we show that SigA partially complements the thermo-sensitive deficiency of RpoD285 from E. coli strain UQ285 and that the SigA region σ4 is responsible for this phenotype. Sixteen out of 74 residues (21.6%) within region σ4 are variable between RpoD and SigA. Mutating these residues significantly improves SigA ability to complement E. coli UQ285. Only six of these residues fall into positions already known to interact with promoter DNA and to comprise a helix-turn-helix motif. The remaining variable positions are located on previously unexplored sites inside region σ4, specifically into the first two α-helices of the region. Neither of the variable positions confined to these helices seem to interact directly with promoter sequence; instead, we adduce that these residues participate allosterically by contributing to correct region folding and/or positioning of the HTH motif. We propose that transcriptional laxity is a mechanism for ensuring transcription in spite of naturally occurring mutations from endogenous promoters and/or horizontally transferred DNA sequences, allowing survival and fast environmental adaptation of α-proteobacteria. PMID:27468278
Zhao, A; Guo, A; Liu, Z; Pape, L
1997-01-01
The coding sequences for a Schizosaccharomyces pombe sequence-specific DNA binding protein, Reb1p, have been cloned. The predicted S. pombe Reb1p is 24-29% identical to mouse TTF-1 (transcription termination factor-1) and Saccharomyces cerevisiae REB1 protein, both of which direct termination of RNA polymerase I catalyzed transcripts. The S.pombe Reb1 cDNA encodes a predicted polypeptide of 504 amino acids with a predicted molecular weight of 58.4 kDa. The S. pombe Reb1p is unusual in that the bipartite DNA binding motif identified originally in S.cerevisiae and Klyveromyces lactis REB1 proteins is uninterrupted and thus S.pombe Reb1p may contain the smallest natural REB1 homologous DNA binding domain. Its genomic coding sequences were shown to be interrupted by two introns. A recombinant histidine-tagged Reb1 protein bearing the rDNA binding domain has two homologous, sequence-specific binding sites in the S. pomber DNA intergenic spacer, located between 289 and 480 nt downstream of the end of the approximately 25S rRNA coding sequences. Each binding site is 13-14 bp downstream of two of the three proposed in vivo termination sites. The core of this 17 bp site, AGGTAAGGGTAATGCAC, is specifically protected by Reb1p in footprinting analysis. PMID:9016645
A Comparative Encyclopedia of DNA Elements in the Mouse Genome
Yue, Feng; Cheng, Yong; Breschi, Alessandra; Vierstra, Jeff; Wu, Weisheng; Ryba, Tyrone; Sandstrom, Richard; Ma, Zhihai; Davis, Carrie; Pope, Benjamin D.; Shen, Yin; Pervouchine, Dmitri D.; Djebali, Sarah; Thurman, Bob; Kaul, Rajinder; Rynes, Eric; Kirilusha, Anthony; Marinov, Georgi K.; Williams, Brian A.; Trout, Diane; Amrhein, Henry; Fisher-Aylor, Katherine; Antoshechkin, Igor; DeSalvo, Gilberto; See, Lei-Hoon; Fastuca, Meagan; Drenkow, Jorg; Zaleski, Chris; Dobin, Alex; Prieto, Pablo; Lagarde, Julien; Bussotti, Giovanni; Tanzer, Andrea; Denas, Olgert; Li, Kanwei; Bender, M. A.; Zhang, Miaohua; Byron, Rachel; Groudine, Mark T.; McCleary, David; Pham, Long; Ye, Zhen; Kuan, Samantha; Edsall, Lee; Wu, Yi-Chieh; Rasmussen, Matthew D.; Bansal, Mukul S.; Keller, Cheryl A.; Morrissey, Christapher S.; Mishra, Tejaswini; Jain, Deepti; Dogan, Nergiz; Harris, Robert S.; Cayting, Philip; Kawli, Trupti; Boyle, Alan P.; Euskirchen, Ghia; Kundaje, Anshul; Lin, Shin; Lin, Yiing; Jansen, Camden; Malladi, Venkat S.; Cline, Melissa S.; Erickson, Drew T.; Kirkup, Vanessa M; Learned, Katrina; Sloan, Cricket A.; Rosenbloom, Kate R.; de Sousa, Beatriz Lacerda; Beal, Kathryn; Pignatelli, Miguel; Flicek, Paul; Lian, Jin; Kahveci, Tamer; Lee, Dongwon; Kent, W. James; Santos, Miguel Ramalho; Herrero, Javier; Notredame, Cedric; Johnson, Audra; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Canfield, Theresa; Sabo, Peter J.; Wilken, Matthew S.; Reh, Thomas A.; Giste, Erika; Shafer, Anthony; Kutyavin, Tanya; Haugen, Eric; Dunn, Douglas; Reynolds, Alex P.; Neph, Shane; Humbert, Richard; Hansen, R. Scott; De Bruijn, Marella; Selleri, Licia; Rudensky, Alexander; Josefowicz, Steven; Samstein, Robert; Eichler, Evan E.; Orkin, Stuart H.; Levasseur, Dana; Papayannopoulou, Thalia; Chang, Kai-Hsin; Skoultchi, Arthur; Gosh, Srikanta; Disteche, Christine; Treuting, Piper; Wang, Yanli; Weiss, Mitchell J.; Blobel, Gerd A.; Good, Peter J.; Lowdon, Rebecca F.; Adams, Leslie B.; Zhou, Xiao-Qiao; Pazin, Michael J.; Feingold, Elise A.; Wold, Barbara; Taylor, James; Kellis, Manolis; Mortazavi, Ali; Weissman, Sherman M.; Stamatoyannopoulos, John; Snyder, Michael P.; Guigo, Roderic; Gingeras, Thomas R.; Gilbert, David M.; Hardison, Ross C.; Beer, Michael A.; Ren, Bing
2014-01-01
Summary As the premier model organism in biomedical research, the laboratory mouse shares the majority of protein-coding genes with humans, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications, and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of other sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases. PMID:25409824
A comparative encyclopedia of DNA elements in the mouse genome.
Yue, Feng; Cheng, Yong; Breschi, Alessandra; Vierstra, Jeff; Wu, Weisheng; Ryba, Tyrone; Sandstrom, Richard; Ma, Zhihai; Davis, Carrie; Pope, Benjamin D; Shen, Yin; Pervouchine, Dmitri D; Djebali, Sarah; Thurman, Robert E; Kaul, Rajinder; Rynes, Eric; Kirilusha, Anthony; Marinov, Georgi K; Williams, Brian A; Trout, Diane; Amrhein, Henry; Fisher-Aylor, Katherine; Antoshechkin, Igor; DeSalvo, Gilberto; See, Lei-Hoon; Fastuca, Meagan; Drenkow, Jorg; Zaleski, Chris; Dobin, Alex; Prieto, Pablo; Lagarde, Julien; Bussotti, Giovanni; Tanzer, Andrea; Denas, Olgert; Li, Kanwei; Bender, M A; Zhang, Miaohua; Byron, Rachel; Groudine, Mark T; McCleary, David; Pham, Long; Ye, Zhen; Kuan, Samantha; Edsall, Lee; Wu, Yi-Chieh; Rasmussen, Matthew D; Bansal, Mukul S; Kellis, Manolis; Keller, Cheryl A; Morrissey, Christapher S; Mishra, Tejaswini; Jain, Deepti; Dogan, Nergiz; Harris, Robert S; Cayting, Philip; Kawli, Trupti; Boyle, Alan P; Euskirchen, Ghia; Kundaje, Anshul; Lin, Shin; Lin, Yiing; Jansen, Camden; Malladi, Venkat S; Cline, Melissa S; Erickson, Drew T; Kirkup, Vanessa M; Learned, Katrina; Sloan, Cricket A; Rosenbloom, Kate R; Lacerda de Sousa, Beatriz; Beal, Kathryn; Pignatelli, Miguel; Flicek, Paul; Lian, Jin; Kahveci, Tamer; Lee, Dongwon; Kent, W James; Ramalho Santos, Miguel; Herrero, Javier; Notredame, Cedric; Johnson, Audra; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Canfield, Theresa; Sabo, Peter J; Wilken, Matthew S; Reh, Thomas A; Giste, Erika; Shafer, Anthony; Kutyavin, Tanya; Haugen, Eric; Dunn, Douglas; Reynolds, Alex P; Neph, Shane; Humbert, Richard; Hansen, R Scott; De Bruijn, Marella; Selleri, Licia; Rudensky, Alexander; Josefowicz, Steven; Samstein, Robert; Eichler, Evan E; Orkin, Stuart H; Levasseur, Dana; Papayannopoulou, Thalia; Chang, Kai-Hsin; Skoultchi, Arthur; Gosh, Srikanta; Disteche, Christine; Treuting, Piper; Wang, Yanli; Weiss, Mitchell J; Blobel, Gerd A; Cao, Xiaoyi; Zhong, Sheng; Wang, Ting; Good, Peter J; Lowdon, Rebecca F; Adams, Leslie B; Zhou, Xiao-Qiao; Pazin, Michael J; Feingold, Elise A; Wold, Barbara; Taylor, James; Mortazavi, Ali; Weissman, Sherman M; Stamatoyannopoulos, John A; Snyder, Michael P; Guigo, Roderic; Gingeras, Thomas R; Gilbert, David M; Hardison, Ross C; Beer, Michael A; Ren, Bing
2014-11-20
The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.
Imbert, J; Zafarullah, M; Culotta, V C; Gedamu, L; Hamer, D
1989-01-01
Metallothionein (MT) gene promoters in higher eucaryotes contain multiple metal regulatory elements (MREs) that are responsible for the metal induction of MT gene transcription. We identified and purified to near homogeneity a 74-kilodalton mouse nuclear protein that specifically binds to certain MRE sequences. This protein, MBF-I, was purified employing as an affinity reagent a trout MRE that is shown to be functional in mouse cells but which lacks the G+C-rich and SP1-like sequences found in many mammalian MT gene promoters. Using point-mutated MREs, we showed that there is a strong correlation between DNA binding in vitro and MT gene regulation in vivo, suggesting a direct role of MBF-I in MT gene transcription. We also showed that MBF-I can induce MT gene transcription in vitro in a mouse extract and that this stimulation requires zinc. Images PMID:2586522
Novel transcriptional networks regulated by CLOCK in human neurons.
Fontenot, Miles R; Berto, Stefano; Liu, Yuxiang; Werthmann, Gordon; Douglas, Connor; Usui, Noriyoshi; Gleason, Kelly; Tamminga, Carol A; Takahashi, Joseph S; Konopka, Genevieve
2017-11-01
The molecular mechanisms underlying human brain evolution are not fully understood; however, previous work suggested that expression of the transcription factor CLOCK in the human cortex might be relevant to human cognition and disease. In this study, we investigated this novel transcriptional role for CLOCK in human neurons by performing chromatin immunoprecipitation sequencing for endogenous CLOCK in adult neocortices and RNA sequencing following CLOCK knockdown in differentiated human neurons in vitro. These data suggested that CLOCK regulates the expression of genes involved in neuronal migration, and a functional assay showed that CLOCK knockdown increased neuronal migratory distance. Furthermore, dysregulation of CLOCK disrupts coexpressed networks of genes implicated in neuropsychiatric disorders, and the expression of these networks is driven by hub genes with human-specific patterns of expression. These data support a role for CLOCK-regulated transcriptional cascades involved in human brain evolution and function. © 2017 Fontenot et al.; Published by Cold Spring Harbor Laboratory Press.
Structural basis for genome wide recognition of 5-bp GC motifs by SMAD transcription factors.
Martin-Malpartida, Pau; Batet, Marta; Kaczmarska, Zuzanna; Freier, Regina; Gomes, Tiago; Aragón, Eric; Zou, Yilong; Wang, Qiong; Xi, Qiaoran; Ruiz, Lidia; Vea, Angela; Márquez, José A; Massagué, Joan; Macias, Maria J
2017-12-12
Smad transcription factors activated by TGF-β or by BMP receptors form trimeric complexes with Smad4 to target specific genes for cell fate regulation. The CAGAC motif has been considered as the main binding element for Smad2/3/4, whereas Smad1/5/8 have been thought to preferentially bind GC-rich elements. However, chromatin immunoprecipitation analysis in embryonic stem cells showed extensive binding of Smad2/3/4 to GC-rich cis-regulatory elements. Here, we present the structural basis for specific binding of Smad3 and Smad4 to GC-rich motifs in the goosecoid promoter, a nodal-regulated differentiation gene. The structures revealed a 5-bp consensus sequence GGC(GC)|(CG) as the binding site for both TGF-β and BMP-activated Smads and for Smad4. These 5GC motifs are highly represented as clusters in Smad-bound regions genome-wide. Our results provide a basis for understanding the functional adaptability of Smads in different cellular contexts, and their dependence on lineage-determining transcription factors to target specific genes in TGF-β and BMP pathways.
SP and KLF Transcription Factors in Digestive Physiology and Diseases.
Kim, Chang-Kyung; He, Ping; Bialkowska, Agnieszka B; Yang, Vincent W
2017-06-01
Specificity proteins (SPs) and Krüppel-like factors (KLFs) belong to the family of transcription factors that contain conserved zinc finger domains involved in binding to target DNA sequences. Many of these proteins are expressed in different tissues and have distinct tissue-specific activities and functions. Studies have shown that SPs and KLFs regulate not only physiological processes such as growth, development, differentiation, proliferation, and embryogenesis, but pathogenesis of many diseases, including cancer and inflammatory disorders. Consistently, these proteins have been shown to regulate normal functions and pathobiology in the digestive system. We review recent findings on the tissue- and organ-specific functions of SPs and KLFs in the digestive system including the oral cavity, esophagus, stomach, small and large intestines, pancreas, and liver. We provide a list of agents under development to target these proteins. Copyright © 2017 AGA Institute. Published by Elsevier Inc. All rights reserved.
Cellular and molecular mechanisms of HIV-1 integration targeting.
Engelman, Alan N; Singh, Parmit K
2018-07-01
Integration is central to HIV-1 replication and helps mold the reservoir of cells that persists in AIDS patients. HIV-1 interacts with specific cellular factors to target integration to interior regions of transcriptionally active genes within gene-dense regions of chromatin. The viral capsid interacts with several proteins that are additionally implicated in virus nuclear import, including cleavage and polyadenylation specificity factor 6, to suppress integration into heterochromatin. The viral integrase protein interacts with transcriptional co-activator lens epithelium-derived growth factor p75 to principally position integration within gene bodies. The integrase additionally senses target DNA distortion and nucleotide sequence to help fine-tune the specific phosphodiester bonds that are cleaved at integration sites. Research into virus-host interactions that underlie HIV-1 integration targeting has aided the development of a novel class of integrase inhibitors and may help to improve the safety of viral-based gene therapy vectors.
Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P
2017-03-01
Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5 ' proximal- i ntron- m inus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N 1 -methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N 1 -methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Gillotin, Sébastien; Guillemot, François
2016-06-20
Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) is an important strategy to study gene regulation. When availability of cells is limited, however, it can be useful to focus on specific genes to investigate in depth the role of transcription factors or histone marks. Unfortunately, performing ChIP experiments to study transcription factors' binding to DNA can be difficult when biological material is restricted. This protocol describes a robust method to perform μChIP for over-expressed or endogenous transcription factors using ~100,000 cells per ChIP experiment (Masserdotti et al ., 2015). We also describe optimization steps, which we think are critical for this protocol to work and which can be used to further reduce the number of cells.
Robinson, Lois; Panayiotakis, Alexandra; Papas, Takis S.; Kola, Ismail; Seth, Arun
1997-01-01
ETS transcription factors play important roles in hematopoiesis, angiogenesis, and organogenesis during murine development. The ETS genes also have a role in neoplasia, for example in Ewing’s sarcomas and retrovirally induced cancers. The ETS genes encode transcription factors that bind to specific DNA sequences and activate transcription of various cellular and viral genes. To isolate novel ETS target genes, we used two approaches. In the first approach, we isolated genes by the RNA differential display technique. Previously, we have shown that the overexpression of ETS1 and ETS2 genes effects transformation of NIH 3T3 cells and specific transformants produce high levels of the ETS proteins. To isolate ETS1 and ETS2 responsive genes in these transformed cells, we prepared RNA from ETS1, ETS2 transformants, and normal NIH 3T3 cell lines and converted it into cDNA. This cDNA was amplified by PCR and displayed on sequencing gels. The differentially displayed bands were subcloned into plasmid vectors. By Northern blot analysis, several clones showed differential patterns of mRNA expression in the NIH 3T3-, ETS1-, and ETS2-expressing cell lines. Sixteen clones were analyzed by DNA sequence analysis, and 13 of them appeared to be unique because their DNA sequences did not match with any of the known genes present in the gene bank. Three known genes were found to be identical to the CArG box binding factor, phospholipase A2-activating protein, and early growth response 1 (Egr1) genes. In the second approach, to isolate ETS target promoters directly, we performed ETS1 binding with MboI-cleaved genomic DNA in the presence of a specific mAb followed by whole genome PCR. The immune complex-bound ETS binding sites containing DNA fragments were amplified and subcloned into pBluescript and subjected to DNA sequence and computer analysis. We found that, of a large number of clones isolated, 43 represented unique sequences not previously identified. Three clones turned out to contain regulatory sequences derived from human serglycin, preproapolipoprotein C II, and Egr1 genes. The ETS binding sites derived from these three regulatory sequences showed specific binding with recombinant ETS proteins. Of interest, Egr1 was identified by both of these techniques, suggesting strongly that it is indeed an ETS target gene. PMID:9207063
Xu, Yue; Li, Song Feng; Parish, Roger W
2017-07-01
Targeted gene manipulation is a central strategy for studying gene function and identifying related biological processes. However, a methodology for manipulating the regulatory motifs of transcription factors is lacking as these factors commonly possess multiple motifs (e.g. repression and activation motifs) which collaborate with each other to regulate multiple biological processes. We describe a novel approach designated conserved sequence-guided repressor inhibition (CoSRI) that can specifically reduce or abolish the repressive activities of transcription factors in vivo. The technology was evaluated using the chimeric MYB80-EAR transcription factor and subsequently the endogenous WUS transcription factor. The technology was employed to develop a reversible male sterility system applicable to hybrid seed production. In order to determine the capacity of the technology to regulate the activity of endogenous transcription factors, the WUS repressor was chosen. The WUS repression motif could be inhibited in vivo and the transformed plants exhibited the wus-1 phenotype. Consequently, the technology can be used to manipulate the activities of transcriptional repressor motifs regulating beneficial traits in crop plants and other eukaryotic organisms. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Sequence specificity of single-stranded DNA-binding proteins: a novel DNA microarray approach
Morgan, Hugh P.; Estibeiro, Peter; Wear, Martin A.; Max, Klaas E.A.; Heinemann, Udo; Cubeddu, Liza; Gallagher, Maurice P.; Sadler, Peter J.; Walkinshaw, Malcolm D.
2007-01-01
We have developed a novel DNA microarray-based approach for identification of the sequence-specificity of single-stranded nucleic-acid-binding proteins (SNABPs). For verification, we have shown that the major cold shock protein (CspB) from Bacillus subtilis binds with high affinity to pyrimidine-rich sequences, with a binding preference for the consensus sequence, 5′-GTCTTTG/T-3′. The sequence was modelled onto the known structure of CspB and a cytosine-binding pocket was identified, which explains the strong preference for a cytosine base at position 3. This microarray method offers a rapid high-throughput approach for determining the specificity and strength of ss DNA–protein interactions. Further screening of this newly emerging family of transcription factors will help provide an insight into their cellular function. PMID:17488853
Lamatsch, Dunja K.; Adolfsson, Sofia; Senior, Alistair M.; Christiansen, Guntram; Pichler, Maria; Ozaki, Yuichi; Smeds, Linnea; Schartl, Manfred; Nakagawa, Shinichi
2015-01-01
Sex-specific markers are a prerequisite for understanding reproductive biology, genetic factors involved in sex differences, mechanisms of sex determination, and ultimately the evolution of sex chromosomes. The Western mosquitofish, Gambusia affinis, may be considered a model species for sex-chromosome evolution, as it displays female heterogamety (ZW/ZZ), and is also ecologically interesting as a worldwide invasive species. Here, de novo RNA-sequencing on the gonads of sexually mature G. affinis was used to identify contigs that were highly transcribed in females but not in males (i.e., transcripts with ovary-specific expression). Subsequently, 129 primer pairs spanning 79 contigs were tested by PCR to identify sex-specific transcripts. Of those primer pairs, one female-specific DNA marker was identified, Sanger sequenced and subsequently validated in 115 fish. Sequence analyses revealed a high similarity between the identified sex-specific marker and the 3´ UTR of the aminomethyl transferase (amt) gene of the closely related platyfish (Xiphophorus maculatus). This is the first time that RNA-seq has been used to successfully characterize a sex-specific marker in a fish species in the absence of a genome map. Additionally, the identified sex-specific marker represents one of only a handful of such markers in fishes. PMID:25707007
Yang, Yuping; Yan, Pengcheng; Yi, Che; Li, Wenzheng; Chai, Yuhui; Fei, Lingling; Gao, Ping; Zhao, Heping; Wang, Yingdian; Timko, Michael P; Wang, Bingwu; Han, Shengcheng
2017-08-01
Jasmonates (JAs) are well-known regulators of stress, defence, and secondary metabolism in plants, with JA perception triggering extensive transcriptional reprogramming, including both activation and/or repression of entire metabolic pathways. We performed RNA sequencing based transcriptomic profiling of tobacco BY-2 cells before and after treatment with methyl jasmonate (MeJA) to identify novel transcriptional regulators associated with alkaloid formation. A total of 107,140 unigenes were obtained through de novo assembly, and at least 33,213 transcripts (31%) encode proteins, in which 3419 transcription factors (TFs) were identified, representing 72 gene families, as well as 840 transcriptional regulators (TRs) distributed among 19 gene families. After MeJA treatment BY-2 cells, 7260 differentially expressed transcripts were characterised, which include 4443 MeJA-upregulated and 2817 MeJA-downregulated genes. Of these, 227 TFs/TRs in 36 families were specifically upregulated, and 102 TFs/TRs in 38 families were downregulated in MeJA-treated BY-2 cells. We further showed that the expression of 12 ethylene response factors and four basic helix-loop-helix factors increased at the transcriptional level after MeJA treatment in BY-2 cells and displayed specific expression patterns in nic mutants with or without MeJA treatments. Our data provide a catalogue of transcripts of tobacco BY-2 cells and benefit future study of JA-modulated regulation of secondary metabolism in tobacco. Copyright © 2017 Elsevier GmbH. All rights reserved.
TEMPLE: analysing population genetic variation at transcription factor binding sites.
Litovchenko, Maria; Laurent, Stefan
2016-11-01
Genetic variation occurring at the level of regulatory sequences can affect phenotypes and fitness in natural populations. This variation can be analysed in a population genetic framework to study how genetic drift and selection affect the evolution of these functional elements. However, doing this requires a good understanding of the location and nature of regulatory regions and has long been a major hurdle. The current proliferation of genomewide profiling experiments of transcription factor occupancies greatly improves our ability to identify genomic regions involved in specific DNA-protein interactions. Although software exists for predicting transcription factor binding sites (TFBS), and the effects of genetic variants on TFBS specificity, there are no tools currently available for inferring this information jointly with the genetic variation at TFBS in natural populations. We developed the software Transcription Elements Mapping at the Population LEvel (TEMPLE), which predicts TFBS, evaluates the effects of genetic variants on TFBS specificity and summarizes the genetic variation occurring at TFBS in intraspecific sequence alignments. We demonstrate that TEMPLE's TFBS prediction algorithms gives identical results to PATSER, a software distribution commonly used in the field. We also illustrate the unique features of TEMPLE by analysing TFBS diversity for the TF Senseless (SENS) in one ancestral and one cosmopolitan population of the fruit fly Drosophila melanogaster. TEMPLE can be used to localize TFBS that are characterized by strong genetic differentiation across natural populations. This will be particularly useful for studies aiming to identify adaptive mutations. TEMPLE is a java-based cross-platform software that easily maps the genetic diversity at predicted TFBSs using a graphical interface, or from the Unix command line. © 2016 John Wiley & Sons Ltd.
Goldie, Belinda J; Fitzsimmons, Chantel; Weidenhofer, Judith; Atkins, Joshua R; Wang, Dan O; Cairns, Murray J
2017-01-01
While the cytoplasmic function of microRNA (miRNA) as post-transcriptional regulators of mRNA has been the subject of significant research effort, their activity in the nucleus is less well characterized. Here we use a human neuronal cell model to show that some mature miRNA are preferentially enriched in the nucleus. These molecules were predominantly primate-specific and contained a sequence motif with homology to the consensus MAZ transcription factor binding element. Precursor miRNA containing this motif were shown to have affinity for MAZ protein in nuclear extract. We then used Ago1/2 RIP-Seq to explore nuclear miRNA-associated mRNA targets. Interestingly, the genes for Ago2-associated transcripts were also significantly enriched with MAZ binding sites and neural function, whereas Ago1-transcripts were associated with general metabolic processes and localized with SC35 spliceosomes. These findings suggest the MAZ transcription factor is associated with miRNA in the nucleus and may influence the regulation of neuronal development through Ago2-associated miRNA induced silencing complexes. The MAZ transcription factor may therefore be important for organizing higher order integration of transcriptional and post-transcriptional processes in primate neurons.
Transcription factor trapping by RNA in gene regulatory elements.
Sigova, Alla A; Abraham, Brian J; Ji, Xiong; Molinie, Benoit; Hannett, Nancy M; Guo, Yang Eric; Jangi, Mohini; Giallourakis, Cosmas C; Sharp, Phillip A; Young, Richard A
2015-11-20
Transcription factors (TFs) bind specific sequences in promoter-proximal and -distal DNA elements to regulate gene transcription. RNA is transcribed from both of these DNA elements, and some DNA binding TFs bind RNA. Hence, RNA transcribed from regulatory elements may contribute to stable TF occupancy at these sites. We show that the ubiquitously expressed TF Yin-Yang 1 (YY1) binds to both gene regulatory elements and their associated RNA species across the entire genome. Reduced transcription of regulatory elements diminishes YY1 occupancy, whereas artificial tethering of RNA enhances YY1 occupancy at these elements. We propose that RNA makes a modest but important contribution to the maintenance of certain TFs at gene regulatory elements and suggest that transcription of regulatory elements produces a positive-feedback loop that contributes to the stability of gene expression programs. Copyright © 2015, American Association for the Advancement of Science.
Brady, J; Radonovich, M; Thoren, M; Das, G; Salzman, N P
1984-01-01
We have previously identified an 11-base DNA sequence, 5'-G-G-T-A-C-C-T-A-A-C-C-3' (simian virus 40 [SV40] map position 294 to 304), which is important in the control of SV40 late RNA expression in vitro and in vivo (Brady et al., Cell 31:625-633, 1982). We report here the identification of another domain of the SV40 late promoter. A series of mutants with deletions extending from SV40 map position 0 to 300 was prepared by nuclease BAL 31 treatment. The cloned templates were then analyzed for efficiency and accuracy of late SV40 RNA expression in the Manley in vitro transcription system. Our studies showed that, in addition to the promoter domain near map position 300, there are essential DNA sequences between nucleotide positions 74 and 95 that are required for efficient expression of late SV40 RNA. Included in this SV40 DNA sequence were two of the six GGGCGG SV40 repeat sequences and an 11-nucleotide segment which showed strong homology with the upstream sequences required for the efficient in vitro and in vivo expression of the histone H2A gene. This upstream promoter sequence supported transcription with the same efficiency even when it was moved 72 nucleotides closer to the major late cap site. In vitro promoter competition analysis demonstrated that the upstream promoter sequence, independent of the 294 to 304 promoter element, is capable of binding polymerase-transcription factors required for SV40 late gene transcription. Finally, we show that DNA sequences which control the specificity of RNA initiation at nucleotide 325 lie downstream of map position 294. Images PMID:6321950
Cell type-specific termination of transcription by transposable element sequences.
Conley, Andrew B; Jordan, I King
2012-09-30
Transposable elements (TEs) encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS) genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3' UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are evolutionarily young. The extent of transcription termination by TEs seen here, along with the preference for sense-oriented TE insertions to provide TTS, is consistent with the observed antisense orientation bias of human TEs.
EBR1 genomic expansion and its role in virulence of Fusarium species
USDA-ARS?s Scientific Manuscript database
Genome sequencing of Fusarium oxysporum revealed that pathogenic forms of this fungus harbor supernumerary chromosomes with a wide variety of genes, many of which likely encode traits required for pathogenicity or niche specialization. Specific transcription factor (TF) gene families are expanded on...
PWMScan: a fast tool for scanning entire genomes with a position-specific weight matrix.
Ambrosini, Giovanna; Groux, Romain; Bucher, Philipp
2018-03-05
Transcription factors (TFs) regulate gene expression by binding to specific short DNA sequences of 5 to 20-bp to regulate the rate of transcription of genetic information from DNA to messenger RNA. We present PWMScan, a fast web-based tool to scan server-resident genomes for matches to a user-supplied PWM or TF binding site model from a public database. The web server and source code are available at http://ccg.vital-it.ch/pwmscan and https://sourceforge.net/projects/pwmscan, respectively. giovanna.ambrosini@epfl.ch. SUPPLEMENTARY DATA ARE AVAILABLE AT BIOINFORMATICS ONLINE.
Ceccarelli, A; Zhukovskaya, N; Kawata, T; Bozzaro, S; Williams, J
2000-12-01
The ecmB gene of Dictyostelium is expressed at culmination both in the prestalk cells that enter the stalk tube and in ancillary stalk cell structures such as the basal disc. Stalk tube-specific expression is regulated by sequence elements within the cap-site proximal part of the promoter, the stalk tube (ST) promoter region. Dd-STATa, a member of the STAT transcription factor family, binds to elements present in the ST promoter-region and represses transcription prior to entry into the stalk tube. We have characterised an activatory DNA sequence element, that lies distal to the repressor elements and that is both necessary and sufficient for expression within the stalk tube. We have mapped this activator to a 28 nucleotide region (the 28-mer) within which we have identified a GA-containing sequence element that is required for efficient gene transcription. The Dd-STATa protein binds to the 28-mer in an in vitro binding assay, and binding is dependent upon the GA-containing sequence. However, the ecmB gene is expressed in a Dd-STATa null mutant, therefore Dd-STATa cannot be responsible for activating the 28-mer in vivo. Instead, we identified a distinct 28-mer binding activity in nuclear extracts from the Dd-STATa null mutant, the activity of this GA binding activity being largely masked in wild type extracts by the high affinity binding of the Dd-STATa protein. We suggest, that in addition to the long range repression exerted by binding to the two known repressor sites, Dd-STATa inhibits transcription by direct competition with this putative activator for binding to the GA sequence.
GENOME-ENABLED DISCOVERY OF CARBON SEQUESTRATION GENES IN POPLAR
DOE Office of Scientific and Technical Information (OSTI.GOV)
DAVIS J M
2007-10-11
Plants utilize carbon by partitioning the reduced carbon obtained through photosynthesis into different compartments and into different chemistries within a cell and subsequently allocating such carbon to sink tissues throughout the plant. Since the phytohormones auxin and cytokinin are known to influence sink strength in tissues such as roots (Skoog & Miller 1957, Nordstrom et al. 2004), we hypothesized that altering the expression of genes that regulate auxin-mediated (e.g., AUX/IAA or ARF transcription factors) or cytokinin-mediated (e.g., RR transcription factors) control of root growth and development would impact carbon allocation and partitioning belowground (Fig. 1 - Renewal Proposal). Specifically, themore » ARF, AUX/IAA and RR transcription factor gene families mediate the effects of the growth regulators auxin and cytokinin on cell expansion, cell division and differentiation into root primordia. Invertases (IVR), whose transcript abundance is enhanced by both auxin and cytokinin, are critical components of carbon movement and therefore of carbon allocation. Thus, we initiated comparative genomic studies to identify the AUX/IAA, ARF, RR and IVR gene families in the Populus genome that could impact carbon allocation and partitioning. Bioinformatics searches using Arabidopsis gene sequences as queries identified regions with high degrees of sequence similarities in the Populus genome. These Populus sequences formed the basis of our transgenic experiments. Transgenic modification of gene expression involving members of these gene families was hypothesized to have profound effects on carbon allocation and partitioning.« less
Avram, Dorina; Fields, Andrew; Senawong, Thanaset; Topark-Ngarm, Acharawan; Leid, Mark
2002-01-01
Chicken ovalbumin upstream promoter transcription factor (COUP-TF)-interacting proteins 1 and 2 [CTIP1/Evi9/B cell leukaemia (Bcl) l1a and CTIP2/Bcl11b respectively] are highly related C(2)H(2) zinc finger proteins that are abundantly expressed in brain and the immune system, and are associated with immune system malignancies. A selection procedure was employed to isolate high-affinity DNA binding sites for CTIP1. The core binding site on DNA identified in these studies, 5'-GGCCGG-3' (upper strand), is highly related to the canonical GC box and was bound by a CTIP1 oligomeric complex(es) in vitro. Furthermore, both CTIP1 and CTIP2 repressed transcription of a reporter gene harbouring a multimerized CTIP binding site, and this repression was neither reversed by trichostatin A (an inhibitor of known class I and II histone deacetylases) nor stimulated by co-transfection of a COUP-TF family member. These results demonstrate that CTIP1 is a sequence-specific DNA binding protein and a bona fide transcriptional repressor that is capable of functioning independently of COUP-TF family members. These findings may be relevant to the physiological and/or pathological action(s) of CTIPs in cells that do not express COUP-TF family members, such as cells of the haematopoietic and immune systems. PMID:12196208
Wang, Hsiu-Yu; Chang, Hao-Teng; Pai, Tun-Wen; Wu, Chung-I; Lee, Yuan-Hung; Chang, Yen-Hsin; Tai, Hsiu-Ling; Tang, Chuan-Yi; Chou, Wei-Yao; Chang, Margaret Dah-Tsyr
2007-01-01
Background Human eosinophil-derived neurotoxin (edn) and eosinophil cationic protein (ecp) are members of a subfamily of primate ribonuclease (rnase) genes. Although they are generated by gene duplication event, distinct edn and ecp expression profile in various tissues have been reported. Results In this study, we obtained the upstream promoter sequences of several representative primate eosinophil rnases. Bioinformatic analysis revealed the presence of a shared 34-nucleotide (nt) sequence stretch located at -81 to -48 in all edn promoters and macaque ecp promoter. Such a unique sequence motif constituted a region essential for transactivation of human edn in hepatocellular carcinoma cells. Gel electrophoretic mobility shift assay, transient transfection and scanning mutagenesis experiments allowed us to identify binding sites for two transcription factors, Myc-associated zinc finger protein (MAZ) and SV-40 protein-1 (Sp1), within the 34-nt segment. Subsequent in vitro and in vivo binding assays demonstrated a direct molecular interaction between this 34-nt region and MAZ and Sp1. Interestingly, overexpression of MAZ and Sp1 respectively repressed and enhanced edn promoter activity. The regulatory transactivation motif was mapped to the evolutionarily conserved -74/-65 region of the edn promoter, which was guanidine-rich and critical for recognition by both transcription factors. Conclusion Our results provide the first direct evidence that MAZ and Sp1 play important roles on the transcriptional activation of the human edn promoter through specific binding to a 34-nt segment present in representative primate eosinophil rnase promoters. PMID:17927842
In Silico Detection of Sequence Variations Modifying Transcriptional Regulation
Andersen, Malin C; Engström, Pär G; Lithwick, Stuart; Arenillas, David; Eriksson, Per; Lenhard, Boris; Wasserman, Wyeth W; Odeberg, Jacob
2008-01-01
Identification of functional genetic variation associated with increased susceptibility to complex diseases can elucidate genes and underlying biochemical mechanisms linked to disease onset and progression. For genes linked to genetic diseases, most identified causal mutations alter an encoded protein sequence. Technological advances for measuring RNA abundance suggest that a significant number of undiscovered causal mutations may alter the regulation of gene transcription. However, it remains a challenge to separate causal genetic variations from linked neutral variations. Here we present an in silico driven approach to identify possible genetic variation in regulatory sequences. The approach combines phylogenetic footprinting and transcription factor binding site prediction to identify variation in candidate cis-regulatory elements. The bioinformatics approach has been tested on a set of SNPs that are reported to have a regulatory function, as well as background SNPs. In the absence of additional information about an analyzed gene, the poor specificity of binding site prediction is prohibitive to its application. However, when additional data is available that can give guidance on which transcription factor is involved in the regulation of the gene, the in silico binding site prediction improves the selection of candidate regulatory polymorphisms for further analyses. The bioinformatics software generated for the analysis has been implemented as a Web-based application system entitled RAVEN (regulatory analysis of variation in enhancers). The RAVEN system is available at http://www.cisreg.ca for all researchers interested in the detection and characterization of regulatory sequence variation. PMID:18208319
rVISTA 2.0: Evolutionary Analysis of Transcription Factor Binding Sites
DOE Office of Scientific and Technical Information (OSTI.GOV)
Loots, G G; Ovcharenko, I
2004-01-28
Identifying and characterizing the patterns of DNA cis-regulatory modules represents a challenge that has the potential to reveal the regulatory language the genome uses to dictate transcriptional dynamics. Several studies have demonstrated that regulatory modules are under positive selection and therefore are often conserved between related species. Using this evolutionary principle we have created a comparative tool, rVISTA, for analyzing the regulatory potential of noncoding sequences. The rVISTA tool combines transcription factor binding site (TFBS) predictions, sequence comparisons and cluster analysis to identify noncoding DNA regions that are highly conserved and present in a specific configuration within an alignment. Heremore » we present the newly developed version 2.0 of the rVISTA tool that can process alignments generated by both zPicture and PipMaker alignment programs or use pre-computed pairwise alignments of seven vertebrate genomes available from the ECR Browser. The rVISTA web server is closely interconnected with the TRANSFAC database, allowing users to either search for matrices present in the TRANSFAC library collection or search for user-defined consensus sequences. rVISTA tool is publicly available at http://rvista.dcode.org/.« less
Jacobs, Jelle; Atkins, Mardelle; Davie, Kristofer; Imrichova, Hana; Romanelli, Lucia; Christiaens, Valerie; Hulselmans, Gert; Potier, Delphine; Wouters, Jasper; Taskiran, Ibrahim I; Paciello, Giulia; González-Blas, Carmen B; Koldere, Duygu; Aibar, Sara; Halder, Georg; Aerts, Stein
2018-06-04
Transcriptional enhancers function as docking platforms for combinations of transcription factors (TFs) to control gene expression. How enhancer sequences determine nucleosome occupancy, TF recruitment and transcriptional activation in vivo remains unclear. Using ATAC-seq across a panel of Drosophila inbred strains, we found that SNPs affecting binding sites of the TF Grainy head (Grh) causally determine the accessibility of epithelial enhancers. We show that deletion and ectopic expression of Grh cause loss and gain of DNA accessibility, respectively. However, although Grh binding is necessary for enhancer accessibility, it is insufficient to activate enhancers. Finally, we show that human Grh homologs-GRHL1, GRHL2 and GRHL3-function similarly. We conclude that Grh binding is necessary and sufficient for the opening of epithelial enhancers but not for their activation. Our data support a model positing that complex spatiotemporal expression patterns are controlled by regulatory hierarchies in which pioneer factors, such as Grh, establish tissue-specific accessible chromatin landscapes upon which other factors can act.
Sites of instability in the human TCF3 (E2A) gene adopt G-quadruplex DNA structures in vitro
Williams, Jonathan D.; Fleetwood, Sara; Berroyer, Alexandra; Kim, Nayun; Larson, Erik D.
2015-01-01
The formation of highly stable four-stranded DNA, called G-quadruplex (G4), promotes site-specific genome instability. G4 DNA structures fold from repetitive guanine sequences, and increasing experimental evidence connects G4 sequence motifs with specific gene rearrangements. The human transcription factor 3 (TCF3) gene (also termed E2A) is subject to genetic instability associated with severe disease, most notably a common translocation event t(1;19) associated with acute lymphoblastic leukemia. The sites of instability in TCF3 are not randomly distributed, but focused to certain sequences. We asked if G4 DNA formation could explain why TCF3 is prone to recombination and mutagenesis. Here we demonstrate that sequences surrounding the major t(1;19) break site and a region associated with copy number variations both contain G4 sequence motifs. The motifs identified readily adopt G4 DNA structures that are stable enough to interfere with DNA synthesis in physiological salt conditions in vitro. When introduced into the yeast genome, TCF3 G4 motifs promoted gross chromosomal rearrangements in a transcription-dependent manner. Our results provide a molecular rationale for the site-specific instability of human TCF3, suggesting that G4 DNA structures contribute to oncogenic DNA breaks and recombination. PMID:26029241
Hurst, H C; Masson, N; Jones, N C; Lee, K A
1990-12-01
Promoter elements containing the sequence motif CGTCA are important for a variety of inducible responses at the transcriptional level. Multiple cellular factors specifically bind to these elements and are encoded by a multigene family. Among these factors, polypeptides termed activating transcription factor 43 (ATF-43) and ATF-47 have been purified from HeLa cells and a factor referred to as cyclic AMP response element-binding protein (CREB) has been isolated from PC12 cells and rat brain. We demonstrated that CREB and ATF-47 are identical and that CREB and ATF-43 form protein-protein complexes. We also found that the cis requirements for stable DNA binding by ATF-43 and CREB are different. Using antibodies to ATF-43 we have identified a group of polypeptides (ATF-43) in the size range from 40 to 43 kDa. ATF-43 polypeptides are related by their reactivity with anti-ATF-43, DNA-binding specificity, complex formation with CREB, heat stability, and phosphorylation by protein kinase A. Certain cell types vary in their ATF-43 complement, suggesting that CREB activity is modulated in a cell-type-specific manner through interaction with ATF-43. ATF-43 polypeptides do not appear simply to correspond to the gene products of the ATF multigene family, suggesting that the size of the ATF family at the protein level is even larger than predicted from cDNA-cloning studies.
Chow, Chi-Nga; Zheng, Han-Qin; Wu, Nai-Yun; Chien, Chia-Hung; Huang, Hsien-Da; Lee, Tzong-Yi; Chiang-Hsieh, Yi-Fan; Hou, Ping-Fu; Yang, Tien-Yi; Chang, Wen-Chi
2016-01-04
Transcription factors (TFs) are sequence-specific DNA-binding proteins acting as critical regulators of gene expression. The Plant Promoter Analysis Navigator (PlantPAN; http://PlantPAN2.itps.ncku.edu.tw) provides an informative resource for detecting transcription factor binding sites (TFBSs), corresponding TFs, and other important regulatory elements (CpG islands and tandem repeats) in a promoter or a set of plant promoters. Additionally, TFBSs, CpG islands, and tandem repeats in the conserve regions between similar gene promoters are also identified. The current PlantPAN release (version 2.0) contains 16 960 TFs and 1143 TF binding site matrices among 76 plant species. In addition to updating of the annotation information, adding experimentally verified TF matrices, and making improvements in the visualization of transcriptional regulatory networks, several new features and functions are incorporated. These features include: (i) comprehensive curation of TF information (response conditions, target genes, and sequence logos of binding motifs, etc.), (ii) co-expression profiles of TFs and their target genes under various conditions, (iii) protein-protein interactions among TFs and their co-factors, (iv) TF-target networks, and (v) downstream promoter elements. Furthermore, a dynamic transcriptional regulatory network under various conditions is provided in PlantPAN 2.0. The PlantPAN 2.0 is a systematic platform for plant promoter analysis and reconstructing transcriptional regulatory networks. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Mohamad Ishak, Nur Syafiqah; Nong, Quang Dang; Matsuura, Tomoaki; Kato, Yasuhiko; Watanabe, Hajime
2017-11-01
Divergence of upstream regulatory pathways of the transcription factor Doublesex (Dsx) serves as a basis for evolution of sex-determining mechanisms in animals. However, little is known about the regulation of Dsx in environmental sex determination. In the crustacean Daphnia magna, environmental sex determination is implemented by male-specific expression of the Dsx ortholog, Dsx1. Transcriptional regulation of Dsx1 comprises at least three phases during embryogenesis: non-sex-specific initiation, male-specific up-regulation, and its maintenance. Herein, we demonstrate that the male-specific up-regulation is controlled by the bZIP transcription factor, Vrille (Vri), an ortholog of the circadian clock genes-Drosophila Vri and mammalian E4BP4/NFIL3. Sequence analysis of the Dsx1 promoter/enhancer revealed a conserved element among two Daphnia species (D. magna and D. pulex), which contains a potential enhancer harboring a consensus Vri binding site overlapped with a consensus Dsx binding site. Besides non-sex-specific expression of Vri in late embryos, we found male-specific expression in early gastrula before the Dsx1 up-regulation phase begins. Knockdown of Vri in male embryos showed reduction of Dsx1 expression. In addition, transient overexpression of Vri in early female embryos up-regulated the expression of Dsx1 and induced male-specific trait. Targeted mutagenesis using CRISPR/Cas9 disrupted the enhancer on genome in males, which led to the reduction of Dsx1 expression. These results indicate that Vri was co-opted as a transcriptional activator of Dsx1 in environmental sex determination of D. magna. The data suggests the remarkably plastic nature of gene regulatory network in sex determination.
Nong, Quang Dang; Matsuura, Tomoaki; Watanabe, Hajime
2017-01-01
Divergence of upstream regulatory pathways of the transcription factor Doublesex (Dsx) serves as a basis for evolution of sex-determining mechanisms in animals. However, little is known about the regulation of Dsx in environmental sex determination. In the crustacean Daphnia magna, environmental sex determination is implemented by male-specific expression of the Dsx ortholog, Dsx1. Transcriptional regulation of Dsx1 comprises at least three phases during embryogenesis: non-sex-specific initiation, male-specific up-regulation, and its maintenance. Herein, we demonstrate that the male-specific up-regulation is controlled by the bZIP transcription factor, Vrille (Vri), an ortholog of the circadian clock genes—Drosophila Vri and mammalian E4BP4/NFIL3. Sequence analysis of the Dsx1 promoter/enhancer revealed a conserved element among two Daphnia species (D. magna and D. pulex), which contains a potential enhancer harboring a consensus Vri binding site overlapped with a consensus Dsx binding site. Besides non-sex-specific expression of Vri in late embryos, we found male-specific expression in early gastrula before the Dsx1 up-regulation phase begins. Knockdown of Vri in male embryos showed reduction of Dsx1 expression. In addition, transient overexpression of Vri in early female embryos up-regulated the expression of Dsx1 and induced male-specific trait. Targeted mutagenesis using CRISPR/Cas9 disrupted the enhancer on genome in males, which led to the reduction of Dsx1 expression. These results indicate that Vri was co-opted as a transcriptional activator of Dsx1 in environmental sex determination of D. magna. The data suggests the remarkably plastic nature of gene regulatory network in sex determination. PMID:29095827
Zhou, Jia; Sears, Renee L; Xing, Xiaoyun; Zhang, Bo; Li, Daofeng; Rockweiler, Nicole B; Jang, Hyo Sik; Choudhary, Mayank N K; Lee, Hyung Joo; Lowdon, Rebecca F; Arand, Jason; Tabers, Brianne; Gu, C Charles; Cicero, Theodore J; Wang, Ting
2017-09-12
Uncovering mechanisms of epigenome evolution is an essential step towards understanding the evolution of different cellular phenotypes. While studies have confirmed DNA methylation as a conserved epigenetic mechanism in mammalian development, little is known about the conservation of tissue-specific genome-wide DNA methylation patterns. Using a comparative epigenomics approach, we identified and compared the tissue-specific DNA methylation patterns of rat against those of mouse and human across three shared tissue types. We confirmed that tissue-specific differentially methylated regions are strongly associated with tissue-specific regulatory elements. Comparisons between species revealed that at a minimum 11-37% of tissue-specific DNA methylation patterns are conserved, a phenomenon that we define as epigenetic conservation. Conserved DNA methylation is accompanied by conservation of other epigenetic marks including histone modifications. Although a significant amount of locus-specific methylation is epigenetically conserved, the majority of tissue-specific DNA methylation is not conserved across the species and tissue types that we investigated. Examination of the genetic underpinning of epigenetic conservation suggests that primary sequence conservation is a driving force behind epigenetic conservation. In contrast, evolutionary dynamics of tissue-specific DNA methylation are best explained by the maintenance or turnover of binding sites for important transcription factors. Our study extends the limited literature of comparative epigenomics and suggests a new paradigm for epigenetic conservation without genetic conservation through analysis of transcription factor binding sites.
An Approach to Greater Specificity for Glucocorticoids
Chow, Carson C.; Simons, S. Stoney
2018-01-01
Glucocorticoid steroids are among the most prescribed drugs each year. Nonetheless, the many undesirable side effects, and lack of selectivity, restrict their greater usage. Research to increase glucocorticoid specificity has spanned many years. These efforts have been hampered by the ability of glucocorticoids to both induce and repress gene transcription and also by the lack of success in defining any predictable properties that control glucocorticoid specificity. Correlations of transcriptional specificity have been observed with changes in steroid structure, receptor and chromatin conformation, DNA sequence for receptor binding, and associated cofactors. However, none of these studies have progressed to the point of being able to offer guidance for increased specificity. We summarize here a mathematical theory that allows a novel and quantifiable approach to increase selectivity. The theory applies to all three major actions of glucocorticoid receptors: induction by agonists, induction by antagonists, and repression by agonists. Simple graphical analysis of competition assays involving any two factors (steroid, chemical, peptide, protein, DNA, etc.) yields information (1) about the kinetically described mechanism of action for each factor at that step where the factor acts in the overall reaction sequence and (2) about the relative position of that step where each factor acts. These two pieces of information uniquely provide direction for increasing the specificity of glucocorticoid action. Consideration of all three modes of action indicate that the most promising approach for increased specificity is to vary the concentrations of those cofactors/pharmaceuticals that act closest to the observed end point. The potential for selectivity is even greater when varying cofactors/pharmaceuticals in conjunction with a select class of antagonists. PMID:29593646
Technical Advance: Transcription factor, promoter, and enhancer utilization in human myeloid cells.
Joshi, Anagha; Pooley, Christopher; Freeman, Tom C; Lennartsson, Andreas; Babina, Magda; Schmidl, Christian; Geijtenbeek, Teunis; Michoel, Tom; Severin, Jessica; Itoh, Masayoshi; Lassmann, Timo; Kawaji, Hideya; Hayashizaki, Yoshihide; Carninci, Piero; Forrest, Alistair R R; Rehli, Michael; Hume, David A
2015-05-01
The generation of myeloid cells from their progenitors is regulated at the level of transcription by combinatorial control of key transcription factors influencing cell-fate choice. To unravel the global dynamics of this process at the transcript level, we generated transcription profiles for 91 human cell types of myeloid origin by use of CAGE profiling. The CAGE sequencing of these samples has allowed us to investigate diverse aspects of transcription control during myelopoiesis, such as identification of novel transcription factors, miRNAs, and noncoding RNAs specific to the myeloid lineage. We further reconstructed a transcription regulatory network by clustering coexpressed transcripts and associating them with enriched cis-regulatory motifs. With the use of the bidirectional expression as a proxy for enhancers, we predicted over 2000 novel enhancers, including an enhancer 38 kb downstream of IRF8 and an intronic enhancer in the KIT gene locus. Finally, we highlighted relevance of these data to dissect transcription dynamics during progressive maturation of granulocyte precursors. A multifaceted analysis of the myeloid transcriptome is made available (www.myeloidome.roslin.ed.ac.uk). This high-quality dataset provides a powerful resource to study transcriptional regulation during myelopoiesis and to infer the likely functions of unannotated genes in human innate immunity. © The Author(s).
Lee, Mei-Ling Ting; Bulyk, Martha L; Whitmore, G A; Church, George M
2002-12-01
There is considerable scientific interest in knowing the probability that a site-specific transcription factor will bind to a given DNA sequence. Microarray methods provide an effective means for assessing the binding affinities of a large number of DNA sequences as demonstrated by Bulyk et al. (2001, Proceedings of the National Academy of Sciences, USA 98, 7158-7163) in their study of the DNA-binding specificities of Zif268 zinc fingers using microarray technology. In a follow-up investigation, Bulyk, Johnson, and Church (2002, Nucleic Acid Research 30, 1255-1261) studied the interdependence of nucleotides on the binding affinities of transcription proteins. Our article is motivated by this pair of studies. We present a general statistical methodology for analyzing microarray intensity measurements reflecting DNA-protein interactions. The log probability of a protein binding to a DNA sequence on an array is modeled using a linear ANOVA model. This model is convenient because it employs familiar statistical concepts and procedures and also because it is effective for investigating the probability structure of the binding mechanism.
DNA Recognition by a σ 54 Transcriptional Activator from Aquifex aeolicus
Vidangos, Natasha K.; Heideker, Johanna; Lyubimov, Artem; ...
2014-08-23
Transcription initiation by bacterial σ 54-polymerase requires the action of a transcriptional activator protein. Activators bind sequence-specifically upstream of the transcription initiation site via a DNA-binding domain. The structurally characterized DNA-binding domains from activators all belong to the Factor for Inversion Stimulation (Fis) family of helix-turn-helix DNA-binding proteins. We report here structures of the free and DNA-bound forms of the DNA-binding domain of NtrC4 (4DBD) from Aquifex aeolicus, a member of the NtrC family of σ 54 activators. Two NtrC4 binding sites were identified upstream (-145 and -85 base pairs) from the start of the lpxC gene, which is responsiblemore » for the first committed step in Lipid A biosynthesis. This is the first experimental evidence for σ 54 regulation in lpxC expression. 4DBD was crystallized both without DNA and in complex with the -145 binding site. The structures, together with biochemical data, indicate that NtrC4 binds to DNA in a manner that is similar to that of its close homologue, Fis. Ultimately, the greater sequence specificity for the binding of 4DBD relative to Fis seems to arise from a larger number of base specific contacts contributing to affinity than for Fis.« less
Information Propagation in Developmental Enhancers
NASA Astrophysics Data System (ADS)
Jena, Siddhartha; Levine, Michael
Rather than encoding information about protein sequence, certain lengths of noncoding DNA, called enhancers, interact with protein machinery such as transcription factors to precisely regulate gene expression. Enhancers have been studied extensively in the fruit fly Drosophila melanogaster, where they regulate the expression of developmental genes that establish the blueprint of the adult fly. It has been suggested that enhancer sequences possess a specific but unknown syntax with regards to the placement and strength of transcription factor binding sites. Moreover, studies in divergent fly species have shown that compensatory evolution allows for maintenance of enhancer functionality despite considerable variation in primary DNA sequence. Here, the possible role of enhancers as signal processing modules is studied as a way of explaining these two findings. We first demonstrate how this framework can be used to explain the fine-tuned spatiotemporal dynamics of gene expression. We then explore the evolutionary pressure on enhancer sequences and the resulting emergence of enhancers that are linked by compensatory mutations. This study provides a possible mechanism for the function of multiple enhancers linked to a single gene.
Koshino-Kimura, Yoshihiro; Wada, Takuji; Tachibana, Tatsuhiko; Tsugeki, Ryuji; Ishiguro, Sumie; Okada, Kiyotaka
2005-06-01
Epidermal cell differentiation in Arabidopsis root is studied as a model system for understanding cell fate specification. Two types of MYB-related transcription factors are involved in this cell differentiation. One of these, CAPRICE (CPC), encoding an R3-type MYB protein, is a positive regulator of hair cell differentiation and is preferentially transcribed in hairless cells. We analyzed the regulatory mechanism of CPC transcription. Deletion analyses of the CPC promoter revealed that hairless cell-specific transcription of the CPC gene required a 69 bp sequence, and a tandem repeat of this region was sufficient for its expression in epidermis. This region includes two MYB-binding sites, and the epidermis-specific transcription of CPC was abolished when base substitutions were introduced in these sites. We showed by gel mobility shift experiments and by yeast one-hybrid assay that WEREWOLF (WER), which is an R2R3-type MYB protein, directly binds to this region. We showed that WER also binds to the GL2 promoter region, indicating that WER directly regulates CPC and GL2 transcription by binding to their promoter regions.
Pancreatic islet enhancer clusters enriched in type 2 diabetes risk-associated variants.
Pasquali, Lorenzo; Gaulton, Kyle J; Rodríguez-Seguí, Santiago A; Mularoni, Loris; Miguel-Escalada, Irene; Akerman, İldem; Tena, Juan J; Morán, Ignasi; Gómez-Marín, Carlos; van de Bunt, Martijn; Ponsa-Cobas, Joan; Castro, Natalia; Nammo, Takao; Cebola, Inês; García-Hurtado, Javier; Maestro, Miguel Angel; Pattou, François; Piemonti, Lorenzo; Berney, Thierry; Gloyn, Anna L; Ravassard, Philippe; Skarmeta, José Luis Gómez; Müller, Ferenc; McCarthy, Mark I; Ferrer, Jorge
2014-02-01
Type 2 diabetes affects over 300 million people, causing severe complications and premature death, yet the underlying molecular mechanisms are largely unknown. Pancreatic islet dysfunction is central in type 2 diabetes pathogenesis, and understanding islet genome regulation could therefore provide valuable mechanistic insights. We have now mapped and examined the function of human islet cis-regulatory networks. We identify genomic sequences that are targeted by islet transcription factors to drive islet-specific gene activity and show that most such sequences reside in clusters of enhancers that form physical three-dimensional chromatin domains. We find that sequence variants associated with type 2 diabetes and fasting glycemia are enriched in these clustered islet enhancers and identify trait-associated variants that disrupt DNA binding and islet enhancer activity. Our studies illustrate how islet transcription factors interact functionally with the epigenome and provide systematic evidence that the dysregulation of islet enhancers is relevant to the mechanisms underlying type 2 diabetes.
Paca-Uccaralertkun, S; Zhao, L J; Adya, N; Cross, J V; Cullen, B R; Boros, I M; Giam, C Z
1994-01-01
The human T-cell lymphotropic virus type I (HTLV-I) transactivator, Tax, the ubiquitous transcriptional factor cyclic AMP (cAMP) response element-binding protein (CREB protein), and the 21-bp repeats in the HTLV-I transcriptional enhancer form a ternary nucleoprotein complex (L. J. Zhao and C. Z. Giam, Proc. Natl. Acad. Sci. USA 89:7070-7074, 1992). Using an antibody directed against the COOH-terminal region of Tax along with purified Tax and CREB proteins, we selected DNA elements bound specifically by the Tax-CREB complex in vitro. Two distinct but related groups of sequences containing the cAMP response element (CRE) flanked by long runs of G and C residues in the 5' and 3' regions, respectively, were preferentially recognized by Tax-CREB. In contrast, CREB alone binds only to CRE motifs (GNTGACG[T/C]) without neighboring G- or C-rich sequences. The Tax-CREB-selected sequences bear a striking resemblance to the 5' or 3' two-thirds of the HTLV-I 21-bp repeats and are highly inducible by Tax. Gel electrophoretic mobility shift assays, DNA transfection, and DNase I footprinting analyses indicated that the G- and C-rich sequences flanking the CRE motif are crucial for Tax-CREB-DNA ternary complex assembly and Tax transactivation but are not in direct contact with the Tax-CREB complex. These data show that Tax recruits CREB to form a multiprotein complex that specifically recognizes the viral 21-bp repeats. The expanded DNA binding specificity of Tax-CREB and the obligatory role the ternary Tax-CREB-DNA complex plays in transactivation reveal a novel mechanism for regulating the transcriptional activity of leucine zipper proteins like CREB.
Identification and characterization of cell-specific enhancer elements for the mouse ETF/Tead2 gene.
Tanoue, Y; Yasunami, M; Suzuki, K; Ohkubo, H
2001-12-21
We have identified and characterized by transient transfection assays the cell-specific 117-bp enhancer sequence in the first intron of the mouse ETF (Embryonic TEA domain-containing factor)/Tead2 gene required for transcriptional activation in ETF/Tead2 gene-expressing cells, such as P19 cells. The 117-bp enhancer contains one GC-rich sequence (5'-GGGGCGGGG-3'), termed the GC box, and two tandemly repeated GA-rich sequences (5'-GGGGGAGGGG-3'), termed the proximal and distal GA elements. Further analyses, including transfection studies and electrophoretic mobility shift assays using a series of deletion and mutation constructs, indicated that Sp1, a putative activator, may be required to predominate over its competition with another unknown putative repressor, termed the GA element-binding factor, for binding to both the GC box, which overlapped with the proximal GA element, and the distal GA element in the 117-bp sequence in order to achieve a full enhancer activity. We also discuss a possible mechanism underlying the cell-specific enhancer activity of the 117-bp sequence.
The alpha1-fetoprotein locus is activated by a nuclear receptor of the Drosophila FTZ-F1 family.
Galarneau, L; Paré, J F; Allard, D; Hamel, D; Levesque, L; Tugwood, J D; Green, S; Bélanger, L
1996-07-01
The alpha1-fetoprotein (AFP) gene is located between the albumin and alpha-albumin genes and is activated by transcription factor FTF (fetoprotein transcription factor), presumed to transduce early developmental signals to the albumin gene cluster. We have identified FTF as an orphan nuclear receptor of the Drosophila FTZ-F1 family. FTF recognizes the DNA sequence 5'-TCAAGGTCA-3', the canonical recognition motif for FTZ-F1 receptors. cDNA sequence homologies indicate that rat FTF is the ortholog of mouse LRH-1 and Xenopus xFF1rA. Rodent FTF is encoded by a single-copy gene, related to the gene encoding steroidogenic factor 1 (SF-1). The 5.2-kb FTF transcript is translated from several in-frame initiator codons into FTF isoforms (54 to 64 kDa) which appear to bind DNA as monomers, with no need for a specific ligand, similar KdS (approximately equal 3 x 10(-10) M), and similar transcriptional effects. FTF activates the AFP promoter without the use of an amino-terminal activation domain; carboxy-terminus-truncated FTF exerts strong dominant negative effects. In the AFP promoter, FTF recruits an accessory trans-activator which imparts glucocorticoid reactivity upon the AFP gene. FTF binding sites are found in the promoters of other liver-expressed genes, some encoding liver transcription factors; FTF, liver alpha1-antitrypsin promoter factor LFB2, and HNF-3beta promoter factor UF2-H3beta are probably the same factor. FTF is also abundantly expressed in the pancreas and may exert differentiation functions in endodermal sublineages, similar to SF-1 in steroidogenic tissues. HepG2 hepatoma cells seem to express a mutated form of FTF.
Atomistic molecular dynamics simulations of bioactive engrailed 1 interference peptides (EN1-iPeps).
Gandhi, Neha S; Blancafort, Pilar; Mancera, Ricardo L
2018-04-27
The neural-specific transcription factor Engrailed 1 - is overexpressed in basal-like breast tumours. Synthetic interference peptides - comprising a cell-penetrating peptide/nuclear localisation sequence and the Engrailed 1-specific sequence from the N-terminus have been engineered to produce a strong apoptotic response in tumour cells overexpressing EN1, with no toxicity to normal or non Engrailed 1-expressing cells. Here scaled molecular dynamics simulations were used to study the conformational dynamics of these interference peptides in aqueous solution to characterise their structure and dynamics. Transitions from disordered to α-helical conformation, stabilised by hydrogen bonds and proline-aromatic interactions, were observed throughout the simulations. The backbone of the wild-type peptide folds to a similar conformation as that found in ternary complexes of anterior Hox proteins with conserved hexapeptide motifs important for recognition of pre-B-cell leukemia Homeobox 1, indicating that the motif may possess an intrinsic preference for helical structure. The predicted NMR chemical shifts of these peptides are consistent with the Hox hexapeptides in solution and Engrailed 2 NMR data. These findings highlight the importance of aromatic residues in determining the structure of Engrailed 1 interference peptides, shedding light on the rational design strategy of molecules that could be adopted to inhibit other transcription factors overexpressed in other cancer types, potentially including other transcription factor families that require highly conserved and cooperative protein-protein partnerships for biological activity.
On the Concept of Cis-regulatory Information: From Sequence Motifs to Logic Functions
NASA Astrophysics Data System (ADS)
Tarpine, Ryan; Istrail, Sorin
The regulatory genome is about the “system level organization of the core genomic regulatory apparatus, and how this is the locus of causality underlying the twin phenomena of animal development and animal evolution” (E.H. Davidson. The Regulatory Genome: Gene Regulatory Networks in Development and Evolution, Academic Press, 2006). Information processing in the regulatory genome is done through regulatory states, defined as sets of transcription factors (sequence-specific DNA binding proteins which determine gene expression) that are expressed and active at the same time. The core information processing machinery consists of modular DNA sequence elements, called cis-modules, that interact with transcription factors. The cis-modules “read” the information contained in the regulatory state of the cell through transcription factor binding, “process” it, and directly or indirectly communicate with the basal transcription apparatus to determine gene expression. This endowment of each gene with the information-receiving capacity through their cis-regulatory modules is essential for the response to every possible regulatory state to which it might be exposed during all phases of the life cycle and in all cell types. We present here a set of challenges addressed by our CYRENE research project aimed at studying the cis-regulatory code of the regulatory genome. The CYRENE Project is devoted to (1) the construction of a database, the cis-Lexicon, containing comprehensive information across species about experimentally validated cis-regulatory modules; and (2) the software development of a next-generation genome browser, the cis-Browser, specialized for the regulatory genome. The presentation is anchored on three main computational challenges: the Gene Naming Problem, the Consensus Sequence Bottleneck Problem, and the Logic Function Inference Problem.
Cis-regulatory landscapes of four cell types of the retina
Hartl, Dominik; Jüttner, Josephine
2017-01-01
Abstract The retina is composed of ∼50 cell-types with specific functions for the process of vision. Identification of the cis-regulatory elements active in retinal cell-types is key to elucidate the networks controlling this diversity. Here, we combined transcriptome and epigenome profiling to map the regulatory landscape of four cell-types isolated from mouse retinas including rod and cone photoreceptors as well as rare inter-neuron populations such as horizontal and starburst amacrine cells. Integration of this information reveals sequence determinants and candidate transcription factors for controlling cellular specialization. Additionally, we refined parallel reporter assays to enable studying the transcriptional activity of large collection of sequences in individual cell-types isolated from a tissue. We provide proof of concept for this approach and its scalability by characterizing the transcriptional capacity of several hundred putative regulatory sequences within individual retinal cell-types. This generates a catalogue of cis-regulatory regions active in retinal cell types and we further demonstrate their utility as potential resource for cellular tagging and manipulation. PMID:29059322
Transcription factor clusters regulate genes in eukaryotic cells
Hedlund, Erik G; Friemann, Rosmarie; Hohmann, Stefan
2017-01-01
Transcription is regulated through binding factors to gene promoters to activate or repress expression, however, the mechanisms by which factors find targets remain unclear. Using single-molecule fluorescence microscopy, we determined in vivo stoichiometry and spatiotemporal dynamics of a GFP tagged repressor, Mig1, from a paradigm signaling pathway of Saccharomyces cerevisiae. We find the repressor operates in clusters, which upon extracellular signal detection, translocate from the cytoplasm, bind to nuclear targets and turnover. Simulations of Mig1 configuration within a 3D yeast genome model combined with a promoter-specific, fluorescent translation reporter confirmed clusters are the functional unit of gene regulation. In vitro and structural analysis on reconstituted Mig1 suggests that clusters are stabilized by depletion forces between intrinsically disordered sequences. We observed similar clusters of a co-regulatory activator from a different pathway, supporting a generalized cluster model for transcription factors that reduces promoter search times through intersegment transfer while stabilizing gene expression. PMID:28841133
Dissecting protein:protein interactions between transcription factors with an RNA aptamer.
Tian, Y; Adya, N; Wagner, S; Giam, C Z; Green, M R; Ellington, A D
1995-01-01
Nucleic acid aptamers isolated from random sequence pools have generally proven useful at inhibiting the interactions of nucleic acid binding proteins with their cognate nucleic acids. In order to develop reagents that could also be used to study protein:protein interactions, we have used in vitro selection to search for RNA aptamers that could interact with the transactivating protein Tax from human T-cell leukemia virus. Tax does not normally bind to nucleic acids, but instead stimulates transcription by interacting with a variety of cellular transcription factors, including the cyclic AMP-response element binding protein (CREB), NF-kappa B, and the serum response factor (SRF). Starting from a pool of greater than 10(13) different RNAs with a core of 120 random sequence positions, RNAs were selected for their ability to be co-retained on nitrocellulose filters with Tax. After five cycles of selection and amplification, a single nucleic acid species remained. This aptamer was found to bind Tax with high affinity and specificity, and could disrupt complex formation between Tax and NF-kappa B, but not with SRF. The differential effects of our aptamer probe on protein:protein interactions suggest a model for how the transcription factor binding sites on the surface of the Tax protein are organized. This model is consistent with data from a variety of other studies. PMID:7489503
Lampronti, Ilaria; Khan, Mahmud T.H.; Borgatti, Monica; Bianchi, Nicoletta
2008-01-01
Several transcription factors (TFs) play crucial roles in governing the expression of different genes involved in the immune response, embryo or cell lineage development, cell apoptosis, cell cycle progression, oncogenesis, repair and fibrosis processes and inflammation. As far as inflammation, TFs playing pivotal roles are nuclear factor kappa B (NF-kB), activator protein (AP-1), signal transducer and activator of transcription (STATs), cAMP response element binding protein (CREB) and GATA-1 factors. All these TFs regulate the expression of pro-inflammatory cytokines and are involved in the pathogenesis of a number of human disorders, particularly those with an inflammatory component. Since several medicinal plants can be employed to produce extracts exhibiting biological effects and because alteration of gene transcription represents a very interesting approach to control the expression of selected genes, this study sought to verify the ability of several extracts derived from Bangladeshi medicinal plants in interfering with molecular interactions between different TFs and specific DNA sequences. We first analyzed the antiproliferative activity of 19 medicinal plants on different human cell lines, including erythroleukemia K562, B lymphoid Raji and T lymphoid Jurkat cell lines. Secondly, we employed the electrophoretic mobility shift assay as a suitable technique for a fast screening of plant extracts altering the binding between NF-kB, AP-1, GATA-1, STAT-3, CREB and the relative target DNA elements. PMID:18830455
Wang, XinGang; Ono, Yosuke; Tan, Swee Chuan; Chai, Ruth JinFen; Parkin, Caroline; Ingham, Philip W
2011-10-01
Sox6 has been proposed to play a conserved role in vertebrate skeletal muscle fibre type specification. In zebrafish, sox6 transcription is repressed in slow-twitch progenitors by the Prdm1a transcription factor. Here we identify sox6 cis-regulatory sequences that drive fast-twitch-specific expression in a Prdm1a-dependent manner. We show that sox6 transcription subsequently becomes derepressed in slow-twitch fibres, whereas Sox6 protein remains restricted to fast-twitch fibres. We find that translational repression of sox6 is mediated by miR-499, the slow-twitch-specific expression of which is in turn controlled by Prdm1a, forming a regulatory loop that initiates and maintains the slow-twitch muscle lineage.
Tomar, Navneet; Mishra, Akhilesh; Mrinal, Nirotpal; Jayaram, B.
2016-01-01
Transcription factors (TFs) bind at multiple sites in the genome and regulate expression of many genes. Regulating TF binding in a gene specific manner remains a formidable challenge in drug discovery because the same binding motif may be present at multiple locations in the genome. Here, we present Onco-Regulon (http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm), an integrated database of regulatory motifs of cancer genes clubbed with Unique Sequence-Predictor (USP) a software suite that identifies unique sequences for each of these regulatory DNA motifs at the specified position in the genome. USP works by extending a given DNA motif, in 5′→3′, 3′ →5′ or both directions by adding one nucleotide at each step, and calculates the frequency of each extended motif in the genome by Frequency Counter programme. This step is iterated till the frequency of the extended motif becomes unity in the genome. Thus, for each given motif, we get three possible unique sequences. Closest Sequence Finder program predicts off-target drug binding in the genome. Inclusion of DNA-Protein structural information further makes Onco-Regulon a highly informative repository for gene specific drug development. We believe that Onco-Regulon will help researchers to design drugs which will bind to an exclusive site in the genome with no off-target effects, theoretically. Database URL: http://www.scfbio-iitd.res.in/software/onco/NavSite/index.htm PMID:27515825
Chromatin immunoprecipitation of mouse embryos.
Voss, Anne K; Dixon, Mathew P; McLennan, Tamara; Kueh, Andrew J; Thomas, Tim
2012-01-01
During prenatal development, a large number of different cell types are formed, the vast majority of which contain identical genetic material. The basis of the great variety in cell phenotype and function is the differential expression of the approximately 25,000 genes in the mammalian genome. Transcriptional activity is regulated at many levels by proteins, including members of the basal transcriptional apparatus, DNA-binding transcription factors, and chromatin-binding proteins. Importantly, chromatin structure dictates the availability of a specific genomic locus for transcriptional activation as well as the efficiency, with which transcription can occur. Chromatin immunoprecipitation (ChIP) is a method to assess if chromatin modifications or proteins are present at a specific locus. ChIP involves the cross linking of DNA and associated proteins and immunoprecipitation using specific antibodies to DNA-associated proteins followed by examination of the co-precipitated DNA sequences or proteins. In the last few years, ChIP has become an essential technique for scientists studying transcriptional regulation and chromatin structure. Using ChIP on mouse embryos, we can document the presence or absence of specific proteins and chromatin modifications at genomic loci in vivo during mammalian development. Here, we describe a ChIP technique adapted for mouse embryos.
[Regulation of heat shock gene expression in response to stress].
Garbuz, D G
2017-01-01
Heat shock (HS) genes, or stress genes, code for a number of proteins that collectively form the most ancient and universal stress defense system. The system determines the cell capability of adaptation to various adverse factors and performs a variety of auxiliary functions in normal physiological conditions. Common stress factors, such as higher temperatures, hypoxia, heavy metals, and others, suppress transcription and translation for the majority of genes, while HS genes are upregulated. Transcription of HS genes is controlled by transcription factors of the HS factor (HSF) family. Certain HSFs are activated on exposure to higher temperatures or other adverse factors to ensure stress-induced HS gene expression, while other HSFs are specifically activated at particular developmental stages. The regulation of the main mammalian stress-inducible factor HSF1 and Drosophila melanogaster HSF includes many components, such as a variety of early warning signals indicative of abnormal cell activity (e.g., increases in intracellular ceramide, cytosolic calcium ions, or partly denatured proteins); protein kinases, which phosphorylate HSFs at various Ser residues; acetyltransferases; and regulatory proteins, such as SUMO and HSBP1. Transcription factors other than HSFs are also involved in activating HS gene transcription; the set includes D. melanogaster GAF, mammalian Sp1 and NF-Y, and other factors. Transcription of several stress genes coding for molecular chaperones of the glucose-regulated protein (GRP) family is predominantly regulated by another stress-detecting system, which is known as the unfolded protein response (UPR) system and is activated in response to massive protein misfolding in the endoplasmic reticulum and mitochondrial matrix. A translational fine tuning of HS protein expression occurs via changing the phosphorylation status of several proteins involved in translation initiation. In addition, specific signal sequences in the 5'-UTRs of some HS protein mRNAs ensure their preferential translation in stress.
1994-01-01
Elevation of cAMP can cause gene-specific inhibition of interleukin 2 (IL-2) expression. To investigate the mechanism of this effect, we have combined electrophoretic mobility shift assays and in vivo genomic footprinting to assess both the availability of putative IL-2 transcription factors in forskolin-treated cells and the functional capacity of these factors to engage their sites in vivo. All observed effects of forskolin depended upon protein kinase A, for they were blocked by introduction of a dominant negative mutant subunit of protein kinase A. In the EL4.E1 cell line, we report specific inhibitory effects of cAMP elevation both on NF-kappa B/Rel family factors binding at -200 bp, and on a novel, biochemically distinct "TGGGC" factor binding at -225 bp with respect to the IL-2 transcriptional start site. Neither NF-AT nor AP-1 binding activities are detectably inhibited in gel mobility shift assays. Elevation of cAMP inhibits NF-kappa B activity with delayed kinetics in association with a delayed inhibition of IL-2 RNA accumulation. Activation of cells in the presence of forskolin prevents the maintenance of stable protein- DNA interactions in vivo, not only at the NF-kappa B and TGGGC sites of the IL-2 enhancer, but also at the NF-AT, AP-1, and other sites. This result, and similar results in cyclosporin A-treated cells, imply that individual IL-2 transcription factors cannot stably bind their target sequences in vivo without coengagement of all other distinct factors at neighboring sites. It is proposed that nonhierarchical, cooperative enhancement of binding is a structural basis of combinatorial transcription factor action at the IL-2 locus. PMID:8113685
Sauvé, Simon; Tremblay, Luc; Lavigne, Pierre
2004-09-17
Basic region-helix1-loop-helix2-leucine zipper (b/H(1)LH(2)/LZ) transcription factors bind specific DNA sequence in their target gene promoters as dimers. Max, a b/H(1)LH(2)/LZ transcription factor, is the obligate heterodimeric partner of the related b/H(1)LH(2)/LZ proteins of the Myc and Mad families. These heterodimers specifically bind E-box DNA sequence (CACGTG) to activate (e.g. c-Myc/Max) and repress (e.g. Mad1/Max) transcription. Max can also homodimerize and bind E-box sequences in c-Myc target gene promoters. While the X-ray structure of the Max b/H(1)LH(2)/LZ/DNA complex and that of others have been reported, the precise sequence of events leading to the reversible and specific binding of these important transcription factors is still largely unknown. In order to provide insights into the DNA binding mechanism, we have solved the NMR solution structure of a covalently homodimerized version of a Max b/H(1)LH(2)/LZ protein with two stabilizing mutations in the LZ, and characterized its backbone dynamics from (15)N spin-relaxation measurements in the absence of DNA. Apart from minor differences in the pitch of the LZ, possibly resulting from the mutations in the construct, we observe that the packing of the helices in the H(1)LH(2) domain is almost identical to that of the two crystal structures, indicating that no important conformational change in these helices occurs upon DNA binding. Conversely to the crystal structures of the DNA complexes, the first 14 residues of the basic region are found to be mostly unfolded while the loop is observed to be flexible. This indicates that these domains undergo conformational changes upon DNA binding. On the other hand, we find the last four residues of the basic region form a persistent helical turn contiguous to H(1). In addition, we provide evidence of the existence of internal motions in the backbone of H(1) that are of larger amplitude and longer time-scale (nanoseconds) than the ones in the H(2) and LZ domain. Most interestingly, we note that conformers in the ensemble of calculated structures have highly conserved basic residues (located in the persistent helical turn of the basic region and in the loop) known to be important for specific binding in a conformation that matches that of the DNA-bound state. These partially prefolded conformers can directly fit into the major groove of DNA and as such are proposed to lie on the pathway leading to the reversible and specific DNA binding. In these conformers, the conserved basic side-chains form a cluster that elevates the local electrostatic potential and could provide the necessary driving force for the generation of the internal motions localized in the H(1) and therefore link structural determinants with the DNA binding function. Overall, our results suggests that the Max homodimeric b/H(1)LH(2)/LZ can rapidly and preferentially bind DNA sequence through transient and partially prefolded states and subsequently, adopt the fully helical bound state in a DNA-assisted mechanism or induced-fit.
Smith, Robin P; Riesenfeld, Samantha J; Holloway, Alisha K; Li, Qiang; Murphy, Karl K; Feliciano, Natalie M; Orecchia, Lorenzo; Oksenberg, Nir; Pollard, Katherine S; Ahituv, Nadav
2013-07-18
Large-scale annotation efforts have improved our ability to coarsely predict regulatory elements throughout vertebrate genomes. However, it is unclear how complex spatiotemporal patterns of gene expression driven by these elements emerge from the activity of short, transcription factor binding sequences. We describe a comprehensive promoter extension assay in which the regulatory potential of all 6 base-pair (bp) sequences was tested in the context of a minimal promoter. To enable this large-scale screen, we developed algorithms that use a reverse-complement aware decomposition of the de Bruijn graph to design a library of DNA oligomers incorporating every 6-bp sequence exactly once. Our library multiplexes all 4,096 unique 6-mers into 184 double-stranded 15-bp oligomers, which is sufficiently compact for in vivo testing. We injected each multiplexed construct into zebrafish embryos and scored GFP expression in 15 tissues at two developmental time points. Twenty-seven constructs produced consistent expression patterns, with the majority doing so in only one tissue. Functional sequences are enriched near biologically relevant genes, match motifs for developmental transcription factors, and are required for enhancer activity. By concatenating tissue-specific functional sequences, we generated completely synthetic enhancers for the notochord, epidermis, spinal cord, forebrain and otic lateral line, and show that short regulatory sequences do not always function modularly. This work introduces a unique in vivo catalog of short, functional regulatory sequences and demonstrates several important principles of regulatory element organization. Furthermore, we provide resources for designing compact, reverse-complement aware k-mer libraries.
2013-01-01
Background The ACVR1 gene encodes a type I receptor for bone morphogenetic proteins (BMPs). Mutations in the ACVR1 gene are associated with Fibrodysplasia Ossificans Progressiva (FOP), a rare and extremely disabling disorder characterized by congenital malformation of the great toes and progressive heterotopic endochondral ossification in muscles and other non-skeletal tissues. Several aspects of FOP pathophysiology are still poorly understood, including mechanisms regulating ACVR1 expression. This work aimed to identify regulatory elements that control ACVR1 gene transcription. Methods and results We first characterized the structure and composition of human ACVR1 gene transcripts by identifying the transcription start site, and then characterized a 2.9 kb upstream region. This region showed strong activating activity when tested by reporter gene assays in transfected cells. We identified specific elements within the 2.9 kb region that are important for transcription factor binding using deletion constructs, co-transfection experiments with plasmids expressing selected transcription factors, site-directed mutagenesis of consensus binding-site sequences, and by protein/DNA binding assays. We also characterized a GC-rich minimal promoter region containing binding sites for the Sp1 transcription factor. Conclusions Our results showed that several transcription factors such as Egr-1, Egr-2, ZBTB7A/LRF, and Hey1, regulate the ACVR1 promoter by binding to the -762/-308 region, which is essential to confer maximal transcriptional activity. The Sp1 transcription factor acts at the most proximal promoter segment upstream of the transcription start site. We observed significant differences in different cell types suggesting tissue specificity of transcriptional regulation. These findings provide novel insights into the molecular mechanisms that regulate expression of the ACVR1 gene and that could be targets of new strategies for future therapeutic treatments. PMID:24047559
Liang, C L; Tsai, C N; Chung, P J; Chen, J L; Sun, C M; Chen, R H; Hong, J H; Chang, Y S
2000-11-10
In Epstein-Barr virus (EBV)-infected BL cells, the oncogenic EBV-encoded nuclear antigen 1 (EBNA 1) gene is directed from the latent promoter Qp. Yeast one-hybrid screen analysis using the -50 to -37 sequence of Qp as the bait was carried out to identify transcriptional factors that may control Qp activity. Results showed that Smad4 binds the -50 to -37 sequence of Qp, indicating that this promoter is potentially regulated by TGF-beta. The association of Smad4 with Qp was further confirmed by supershift of EMSA complexes using Smad4-specific antibody. The transfection of a Qp reporter construct in two EBV(+) BL cell lines, Rael and WW2, showed that Qp activity is repressed in response to the TGF-beta treatment. This repression involves the interaction of a Smad3/Smad4 complex and the transcriptional repressor TGIF, as determined by cotransfection assay and coimmunoprecipitation analysis. Results suggest that TGF-beta may transcriptionally repress Qp through the Smad4-binding site in human BL cells. Copyright 2000 Academic Press.
Poley, Jordan D; Sutherland, Ben J G; Jones, Simon R M; Koop, Ben F; Fast, Mark D
2016-07-04
Salmon lice, Lepeophtheirus salmonis (Copepoda: Caligidae), are highly important ectoparasites of farmed and wild salmonids, and cause multi-million dollar losses to the salmon aquaculture industry annually. Salmon lice display extensive sexual dimorphism in ontogeny, morphology, physiology, behavior, and more. Therefore, the identification of transcripts with differential expression between males and females (sex-biased transcripts) may help elucidate the relationship between sexual selection and sexually dimorphic characteristics. Sex-biased transcripts were identified from transcriptome analyses of three L. salmonis populations, including both Atlantic and Pacific subspecies. A total of 35-43 % of all quality-filtered transcripts were sex-biased in L. salmonis, with male-biased transcripts exhibiting higher fold change than female-biased transcripts. For Gene Ontology and functional analyses, a consensus-based approach was used to identify concordantly differentially expressed sex-biased transcripts across the three populations. A total of 127 male-specific transcripts (i.e. those without detectable expression in any female) were identified, and were enriched with reproductive functions (e.g. seminal fluid and male accessory gland proteins). Other sex-biased transcripts involved in morphogenesis, feeding, energy generation, and sensory and immune system development and function were also identified. Interestingly, as observed in model systems, male-biased L. salmonis transcripts were more frequently without annotation compared to female-biased or unbiased transcripts, suggesting higher rates of sequence divergence in male-biased transcripts. Transcriptome differences between male and female L. salmonis described here provide key insights into the molecular mechanisms controlling sexual dimorphism in L. salmonis. This analysis offers targets for parasite control and provides a foundation for further analyses exploring critical topics such as the interaction between sex and drug resistance, sex-specific factors in host-parasite relationships, and reproductive roles within L. salmonis.
Li, S; Zhang, P; Zhang, M; Fu, C; Yu, L
2013-01-01
Although the regulation of taxol biosynthesis at the transcriptional level remains unclear, 10-deacetylbaccatin III-10 β-O-acetyl transferase (DBAT) is a critical enzyme in the biosynthesis of taxol. The 1740 bp fragment 5'-flanking sequence of the dbat gene was cloned from Taxus chinensis cells. Important regulatory elements needed for activity of the dbat promoter were located by deletion analyses in T. chinensis cells. A novel WRKY transcription factor, TcWRKY1, was isolated with the yeast one-hybrid system from a T. chinensis cell cDNA library using the important regulatory elements as bait. The gene expression of TcWRKY1 in T. chinensis suspension cells was specifically induced by methyl jasmonate (MeJA). Biochemical analysis indicated that TcWRKY1 protein specifically interacts with the two W-box (TGAC) cis-elements among the important regulatory elements. Overexpression of TcWRKY1 enhanced dbat expression in T. chinensis suspension cells, and RNA interference (RNAi) reduced the level of transcripts of dbat. These results suggest that TcWRKY1 participates in regulation of taxol biosynthesis in T. chinensis cells, and that dbat is a target gene of this transcription factor. This research also provides a potential candidate gene for engineering increased taxol accumulation in Taxus cell cultures. © 2012 German Botanical Society and The Royal Botanical Society of the Netherlands.
O'Connell, T D; Rokosh, D G; Simpson, P C
2001-05-01
alpha1-Adrenergic receptor (AR) subtypes in the heart are expressed by myocytes but not by fibroblasts, a feature that distinguishes alpha1-ARs from beta-ARs. Here we studied myocyte-specific expression of alpha1-ARs, focusing on the subtype alpha1C (also called alpha1A), a subtype implicated in cardiac hypertrophic signaling in rat models. We first cloned the mouse alpha1C-AR gene, which consisted of two exons with an 18 kb intron, similar to the alpha1B-AR gene. The receptor coding sequence was >90% homologous to that of rat and human. alpha1C-AR transcription in mouse heart was initiated from a single Inr consensus sequence at -588 from the ATG; this and a putative polyadenylation sequence 8.5 kb 3' could account for the predominant 11 kb alpha1C mRNA in mouse heart. A 5'-nontranscribed fragment of 4.4 kb was active as a promoter in cardiac myocytes but not in fibroblasts. Promoter activity in myocytes required a single muscle CAT (MCAT) element, and this MCAT bound in vitro to recombinant and endogenous transcriptional enhancer factor-1. Thus, alpha1C-AR transcription in cardiac myocytes shares MCAT dependence with other cardiac-specific genes, including the alpha- and beta-myosin heavy chains, skeletal alpha-actin, and brain natriuretic peptide. However, the mouse alpha1C gene was not transcribed in the neonatal heart and was not activated by alpha1-AR and other hypertrophic agonists in rat myocytes, and thus differed from other MCAT-dependent genes and the rat alpha1C gene.
Melia, Tisha; Hao, Pengying; Yilmaz, Feyza
2015-01-01
Long intergenic noncoding RNAs (lincRNAs) are increasingly recognized as key chromatin regulators, yet few studies have characterized lincRNAs in a single tissue under diverse conditions. Here, we analyzed 45 mouse liver RNA sequencing (RNA-Seq) data sets collected under diverse conditions to systematically characterize 4,961 liver lincRNAs, 59% of them novel, with regard to gene structures, species conservation, chromatin accessibility, transcription factor binding, and epigenetic states. To investigate the potential for functionality, we focused on the responses of the liver lincRNAs to growth hormone stimulation, which imparts clinically relevant sex differences to hepatic metabolism and liver disease susceptibility. Sex-biased expression characterized 247 liver lincRNAs, with many being nuclear RNA enriched and regulated by growth hormone. The sex-biased lincRNA genes are enriched for nearby and correspondingly sex-biased accessible chromatin regions, as well as sex-biased binding sites for growth hormone-regulated transcriptional activators (STAT5, hepatocyte nuclear factor 6 [HNF6], FOXA1, and FOXA2) and transcriptional repressors (CUX2 and BCL6). Repression of female-specific lincRNAs in male liver, but not that of male-specific lincRNAs in female liver, was associated with enrichment of H3K27me3-associated inactive states and poised (bivalent) enhancer states. Strikingly, we found that liver-specific lincRNA gene promoters are more highly species conserved and have a significantly higher frequency of proximal binding by liver transcription factors than liver-specific protein-coding gene promoters. Orthologs for many liver lincRNAs were identified in one or more supraprimates, including two rat lincRNAs showing the same growth hormone-regulated, sex-biased expression as their mouse counterparts. This integrative analysis of liver lincRNA chromatin states, transcription factor occupancy, and growth hormone regulation provides novel insights into the expression of sex-specific lincRNAs and their potential for regulation of sex differences in liver physiology and disease. PMID:26459762
Hucl, Tomas; Brody, Jonathan R; Gallmeier, Eike; Iacobuzio-Donahue, Christine A; Farrance, Iain K; Kern, Scott E
2007-10-01
Identification of genes with cancer-specific overexpression offers the potential to efficiently discover cancer-specific activities in an unbiased manner. We apply this paradigm to study mesothelin (MSLN) overexpression, a nearly ubiquitous, diagnostically and therapeutically useful characteristic of pancreatic cancer. We identified an 18-bp upstream enhancer, termed CanScript, strongly activating transcription from an otherwise weak tissue-nonspecific promoter and operating selectively in cells having aberrantly elevated cancer-specific MSLN transcription. Introducing mutations into CanScript showed two functionally distinct sites: an Sp1-like site and an MCAT element. Gel retardation and chromatin immunoprecipitation assays showed the MCAT element to be bound by transcription enhancer factor (TEF)-1 (TEAD1) in vitro and in vivo. The presence of TEF-1 was required for MSLN protein overexpression as determined by TEF-1 knockdown experiments. The cancer specificity seemed to be provided by a putative limiting cofactor of TEF-1 that could be outcompeted by exogenous TEF-1 only in a MSLN-overexpressing cell line. A CanScript concatemer offered enhanced activity. These results identify a TEF family member as a major regulator of MSLN overexpression, a fundamental characteristic of pancreatic and other cancers, perhaps due to an upstream and highly frequent aberrant cellular activity. The CanScript sequence represents a modular element for cancer-specific targeting, potentially suitable for nearly a third of human malignancies.
Chertkova, Aleksandra A; Schiffman, Joshua S; Nuzhdin, Sergey V; Kozlov, Konstantin N; Samsonova, Maria G; Gursky, Vitaly V
2017-02-07
Cis-regulatory sequences are often composed of many low-affinity transcription factor binding sites (TFBSs). Determining the evolutionary and functional importance of regulatory sequence composition is impeded without a detailed knowledge of the genotype-phenotype map. We simulate the evolution of regulatory sequences involved in Drosophila melanogaster embryo segmentation during early development. Natural selection evaluates gene expression dynamics produced by a computational model of the developmental network. We observe a dramatic decrease in the total number of transcription factor binding sites through the course of evolution. Despite a decrease in average sequence binding energies through time, the regulatory sequences tend towards organisations containing increased high affinity transcription factor binding sites. Additionally, the binding energies of separate sequence segments demonstrate ubiquitous mutual correlations through time. Fewer than 10% of initial TFBSs are maintained throughout the entire simulation, deemed 'core' sites. These sites have increased functional importance as assessed under wild-type conditions and their binding energy distributions are highly conserved. Furthermore, TFBSs within close proximity of core sites exhibit increased longevity, reflecting functional regulatory interactions with core sites. In response to elevated mutational pressure, evolution tends to sample regulatory sequence organisations with fewer, albeit on average, stronger functional transcription factor binding sites. These organisations are also shaped by the regulatory interactions among core binding sites with sites in their local vicinity.
Identification of MicroRNA Targets of Capsicum spp. Using MiRTrans—a Trans-Omics Approach
Zhang, Lu; Qin, Cheng; Mei, Junpu; Chen, Xiaocui; Wu, Zhiming; Luo, Xirong; Cheng, Jiaowen; Tang, Xiangqun; Hu, Kailin; Li, Shuai C.
2017-01-01
The microRNA (miRNA) can regulate the transcripts that are involved in eukaryotic cell proliferation, differentiation, and metabolism. Especially for plants, our understanding of miRNA targets, is still limited. Early attempts of prediction on sequence alignments have been plagued by enormous false positives. It is helpful to improve target prediction specificity by incorporating the other data sources such as the dependency between miRNA and transcript expression or even cleaved transcripts by miRNA regulations, which are referred to as trans-omics data. In this paper, we developed MiRTrans (Prediction of MiRNA targets by Trans-omics data) to explore miRNA targets by incorporating miRNA sequencing, transcriptome sequencing, and degradome sequencing. MiRTrans consisted of three major steps. First, the target transcripts of miRNAs were predicted by scrutinizing their sequence characteristics and collected as an initial potential targets pool. Second, false positive targets were eliminated if the expression of miRNA and its targets were weakly correlated by lasso regression. Third, degradome sequencing was utilized to capture the miRNA targets by examining the cleaved transcripts that regulated by miRNAs. Finally, the predicted targets from the second and third step were combined by Fisher's combination test. MiRTrans was applied to identify the miRNA targets for Capsicum spp. (i.e., pepper). It can generate more functional miRNA targets than sequence-based predictions by evaluating functional enrichment. MiRTrans identified 58 miRNA-transcript pairs with high confidence from 18 miRNA families conserved in eudicots. Most of these targets were transcription factors; this lent support to the role of miRNA as key regulator in pepper. To our best knowledge, this work is the first attempt to investigate the miRNA targets of pepper, as well as their regulatory networks. Surprisingly, only a small proportion of miRNA-transcript pairs were shared between degradome sequencing and expression dependency predictions, suggesting that miRNA targets predicted by a single technology alone may be prone to report false negatives. PMID:28443105
McClay, David R
2016-01-01
In the sea urchin morphogenesis follows extensive molecular specification. The specification controls the many morphogenetic events and these, in turn, precede patterning steps that establish the larval body plan. To understand how the embryo is built it was necessary to understand those series of molecular steps. Here an example of the historical sequence of those discoveries is presented as it unfolded over the last 50 years, the years during which major progress in understanding development of many animals and plants was documented by CTDB. In sea urchin development a rich series of experimental studies first established many of the phenomenological components of skeletal morphogenesis and patterning without knowledge of the molecular components. The many discoveries of transcription factors, signals, and structural proteins that contribute to the shape of the endoskeleton of the sea urchin larva then followed as molecular tools became available. A number of transcription factors and signals were discovered that were necessary for specification, morphogenesis, and patterning. Perturbation of the transcription factors and signals provided the means for assembling models of the gene regulatory networks used for specification and controlled the subsequent morphogenetic events. The earlier experimental information informed perturbation experiments that asked how patterning worked. As a consequence it was learned that ectoderm provides a series of patterning signals to the skeletogenic cells and as a consequence the skeletogenic cells secrete a highly patterned skeleton based on their ability to genotypically decode the localized reception of several signals. We still do not understand the complexity of the signals received by the skeletogenic cells, nor do we understand in detail how the genotypic information shapes the secreted skeletal biomineral, but the current knowledge at least outlines the sequence of events and provides a useful template for future discoveries. © 2016 Elsevier Inc. All rights reserved.
Regulation of gene expression mediating indeterminate muscle growth in teleosts.
Ahammad, A K Shakur; Asaduzzaman, Md; Asakawa, Shuichi; Watabe, Shugo; Kinoshita, Shigeharu
2015-08-01
Teleosts are unique among vertebrates due to their indeterminate muscle growth, i.e., continued production of neonatal muscle fibers until death. However, the molecular mechanism(s) underlying this property is unknown. Here, we focused on the torafugu (Takifugu rubripes) myosin heavy chain gene, MYHM2528-1, which is specifically expressed in neonatal muscle fibers produced by indeterminate muscle growth. We examined the flanking region of MYHM2528-1 through an in vivo reporter assay using zebrafish (Danio rerio) and identified a 2100 bp 5'-flanking sequence that contained sufficient promoter activity to allow specific gene expression. The effects of enhanced promoter activity were observed at the outer region of the fast muscle and the dorsal edge of slow muscle in zebrafish larvae. At the juvenile stage, the promoter was specifically activated in small diameter muscle fibers scattered throughout fast muscle and in slow muscle near the septum separating slow and fast muscles. This spatio-temporal promoter activity overlapped with known myogenic zones involved in teleost indeterminate muscle growth. A deletion mutant analysis revealed that the -2100 to -600 bp 5'flanking sequence of MYHM2528-1 is essential for promoter activity. This region contains putative binding sites for several representative myogenesis-related transcription factors and nuclear factor of activated T-cell (NFAT), a transcription activator involved in regeneration of mammalian adult skeletal muscle. A significant reduction in the promoter activity of the MYHM2528-1 deletion constructs was observed in accordance with a reduction in the number of these binding sites, suggesting the involvement of specific transcription factors in indeterminate muscle growth. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Bohr, Stefan; Patel, Suraj J; Vasko, Radovan; Shen, Keyue; Iracheta-Vellve, Arvin; Lee, Jungwoo; Bale, Shyam Sundhar; Chakraborty, Nilay; Brines, Michael; Cerami, Anthony; Berthiaume, Francois; Yarmush, Martin L
2014-01-01
Tissue protective properties of erythropoietin (EPO) have let to the discovery of an alternative EPO-signaling via an EPO-R/CD131 receptor complex which can now be specifically targeted through pharmaceutically designed short sequence peptides such as ARA290. However, little is still known about specific functions of alternative EPO-signaling in defined cell populations. In this study we investigated effects of signaling through EPO-R/CD131 complex on cellular stress responses and pro-inflammatory activation in different mesenchymal-derived phenotypes. We show that anti-apoptotic, anti-inflammatory effects of ARA290 and EPO coincide with the externalization of CD131 receptor component as an immediate response to cellular stress. In addition, alternative EPO-signaling strongly modulated transcriptional, translational or metabolic responses after stressor removal. Specifically, we saw that ARA290 was able overcome a TNFα-mediated inhibition of transcription factor activation related to cell stress responses, most notably of serum response factor (SRF), heat shock transcription factor protein 1 (HSF1) and activator protein 1 (AP1). We conclude that alternative EPO-signaling acts as a modulator of pro-inflammatory signaling pathways and likely plays a role in restoring tissue homeostasis. PMID:25373867
Bridgewater, Laura C.; Walker, Marlan D.; Miller, Gwen C.; Ellison, Trevor A.; Holsinger, L. Daniel; Potter, Jennifer L.; Jackson, Todd L.; Chen, Reuben K.; Winkel, Vicki L.; Zhang, Zhaoping; McKinney, Sandra; de Crombrugghe, Benoit
2003-01-01
Expression of the type XI collagen gene Col11a2 is directed to cartilage by at least three chondrocyte-specific enhancer elements, two in the 5′ region and one in the first intron of the gene. The three enhancers each contain two heptameric sites with homology to the Sox protein-binding consensus sequence. The two sites are separated by 3 or 4 bp and arranged in opposite orientation to each other. Targeted mutational analyses of these three enhancers showed that in the intronic enhancer, as in the other two enhancers, both Sox sites in a pair are essential for enhancer activity. The transcription factor Sox9 binds as a dimer at the paired sites, and the introduction of insertion mutations between the sites demonstrated that physical interactions between the adjacently bound proteins are essential for enhancer activity. Additional mutational analyses demonstrated that although Sox9 binding at the paired Sox sites is necessary for enhancer activity, it alone is not sufficient. Adjacent DNA sequences in each enhancer are also required, and mutation of those sequences can eliminate enhancer activity without preventing Sox9 binding. The data suggest a new model in which adjacently bound proteins affect the DNA bend angle produced by Sox9, which in turn determines whether an active transcriptional enhancer complex is assembled. PMID:12595563
Li, Meng-Yao; Tan, Hua-Wei; Wang, Feng; Jiang, Qian; Xu, Zhi-Sheng; Tian, Chang; Xiong, Ai-Sheng
2014-01-01
Parsley is an important biennial Apiaceae species that is widely cultivated as herb, spice, and vegetable. Previous studies on parsley principally focused on its physiological and biochemical properties, including phenolic compound and volatile oil contents. However, little is known about the molecular and genetic properties of parsley. In this study, 23,686,707 high-quality reads were obtained and assembled into 81,852 transcripts and 50,161 unigenes for the first time. Functional annotation showed that 30,516 unigenes had sequence similarity to known genes. In addition, 3,244 putative simple sequence repeats were detected in curly parsley. Finally, 1,569 of the identified unigenes belonged to 58 transcription factor families. Various abiotic stresses have a strong detrimental effect on the yield and quality of parsley. AP2/ERF transcription factors have important functions in plant development, hormonal regulation, and abiotic response. A total of 88 putative AP2/ERF factors were identified from the transcriptome sequence of parsley. Seven AP2/ERF transcription factors were selected in this study to analyze the expression profiles of parsley under different abiotic stresses. Our data provide a potentially valuable resource that can be used for intensive parsley research.
Wang, Feng; Jiang, Qian; Xu, Zhi-Sheng; Tian, Chang; Xiong, Ai-Sheng
2014-01-01
Parsley is an important biennial Apiaceae species that is widely cultivated as herb, spice, and vegetable. Previous studies on parsley principally focused on its physiological and biochemical properties, including phenolic compound and volatile oil contents. However, little is known about the molecular and genetic properties of parsley. In this study, 23,686,707 high-quality reads were obtained and assembled into 81,852 transcripts and 50,161 unigenes for the first time. Functional annotation showed that 30,516 unigenes had sequence similarity to known genes. In addition, 3,244 putative simple sequence repeats were detected in curly parsley. Finally, 1,569 of the identified unigenes belonged to 58 transcription factor families. Various abiotic stresses have a strong detrimental effect on the yield and quality of parsley. AP2/ERF transcription factors have important functions in plant development, hormonal regulation, and abiotic response. A total of 88 putative AP2/ERF factors were identified from the transcriptome sequence of parsley. Seven AP2/ERF transcription factors were selected in this study to analyze the expression profiles of parsley under different abiotic stresses. Our data provide a potentially valuable resource that can be used for intensive parsley research. PMID:25268141
Convergence of the transcriptional responses to heat shock and singlet oxygen stresses.
Dufour, Yann S; Imam, Saheed; Koo, Byoung-Mo; Green, Heather A; Donohue, Timothy J
2012-09-01
Cells often mount transcriptional responses and activate specific sets of genes in response to stress-inducing signals such as heat or reactive oxygen species. Transcription factors in the RpoH family of bacterial alternative σ factors usually control gene expression during a heat shock response. Interestingly, several α-proteobacteria possess two or more paralogs of RpoH, suggesting some functional distinction. We investigated the target promoters of Rhodobacter sphaeroides RpoH(I) and RpoH(II) using genome-scale data derived from gene expression profiling and the direct interactions of each protein with DNA in vivo. We found that the RpoH(I) and RpoH(II) regulons have both distinct and overlapping gene sets. We predicted DNA sequence elements that dictate promoter recognition specificity by each RpoH paralog. We found that several bases in the highly conserved TTG in the -35 element are important for activity with both RpoH homologs; that the T-9 position, which is over-represented in the RpoH(I) promoter sequence logo, is critical for RpoH(I)-dependent transcription; and that several bases in the predicted -10 element were important for activity with either RpoH(II) or both RpoH homologs. Genes that are transcribed by both RpoH(I) and RpoH(II) are predicted to encode for functions involved in general cell maintenance. The functions specific to the RpoH(I) regulon are associated with a classic heat shock response, while those specific to RpoH(II) are associated with the response to the reactive oxygen species, singlet oxygen. We propose that a gene duplication event followed by changes in promoter recognition by RpoH(I) and RpoH(II) allowed convergence of the transcriptional responses to heat and singlet oxygen stress in R. sphaeroides and possibly other bacteria.
The identification of cis-regulatory elements: A review from a machine learning perspective.
Li, Yifeng; Chen, Chih-Yu; Kaye, Alice M; Wasserman, Wyeth W
2015-12-01
The majority of the human genome consists of non-coding regions that have been called junk DNA. However, recent studies have unveiled that these regions contain cis-regulatory elements, such as promoters, enhancers, silencers, insulators, etc. These regulatory elements can play crucial roles in controlling gene expressions in specific cell types, conditions, and developmental stages. Disruption to these regions could contribute to phenotype changes. Precisely identifying regulatory elements is key to deciphering the mechanisms underlying transcriptional regulation. Cis-regulatory events are complex processes that involve chromatin accessibility, transcription factor binding, DNA methylation, histone modifications, and the interactions between them. The development of next-generation sequencing techniques has allowed us to capture these genomic features in depth. Applied analysis of genome sequences for clinical genetics has increased the urgency for detecting these regions. However, the complexity of cis-regulatory events and the deluge of sequencing data require accurate and efficient computational approaches, in particular, machine learning techniques. In this review, we describe machine learning approaches for predicting transcription factor binding sites, enhancers, and promoters, primarily driven by next-generation sequencing data. Data sources are provided in order to facilitate testing of novel methods. The purpose of this review is to attract computational experts and data scientists to advance this field. Crown Copyright © 2015. Published by Elsevier Ireland Ltd. All rights reserved.
2013-01-01
Background Ginger (Zingiber officinale) and turmeric (Curcuma longa) accumulate important pharmacologically active metabolites at high levels in their rhizomes. Despite their importance, relatively little is known regarding gene expression in the rhizomes of ginger and turmeric. Results In order to identify rhizome-enriched genes and genes encoding specialized metabolism enzymes and pathway regulators, we evaluated an assembled collection of expressed sequence tags (ESTs) from eight different ginger and turmeric tissues. Comparisons to publicly available sorghum rhizome ESTs revealed a total of 777 gene transcripts expressed in ginger/turmeric and sorghum rhizomes but apparently absent from other tissues. The list of rhizome-specific transcripts was enriched for genes associated with regulation of tissue growth, development, and transcription. In particular, transcripts for ethylene response factors and AUX/IAA proteins appeared to accumulate in patterns mirroring results from previous studies regarding rhizome growth responses to exogenous applications of auxin and ethylene. Thus, these genes may play important roles in defining rhizome growth and development. Additional associations were made for ginger and turmeric rhizome-enriched MADS box transcription factors, their putative rhizome-enriched homologs in sorghum, and rhizomatous QTLs in rice. Additionally, analysis of both primary and specialized metabolism genes indicates that ginger and turmeric rhizomes are primarily devoted to the utilization of leaf supplied sucrose for the production and/or storage of specialized metabolites associated with the phenylpropanoid pathway and putative type III polyketide synthase gene products. This finding reinforces earlier hypotheses predicting roles of this enzyme class in the production of curcuminoids and gingerols. Conclusion A significant set of genes were found to be exclusively or preferentially expressed in the rhizome of ginger and turmeric. Specific transcription factors and other regulatory genes were found that were common to the two species and that are excellent candidates for involvement in rhizome growth, differentiation and development. Large classes of enzymes involved in specialized metabolism were also found to have apparent tissue-specific expression, suggesting that gene expression itself may play an important role in regulating metabolite production in these plants. PMID:23410187
Large-scale collection of full-length cDNA and transcriptome analysis in Hevea brasiliensis
Makita, Yuko; Ng, Kiaw Kiaw; Veera Singham, G.; Kawashima, Mika; Hirakawa, Hideki; Sato, Shusei
2017-01-01
Abstract Natural rubber has unique physical properties that cannot be replaced by products from other latex-producing plants or petrochemically produced synthetic rubbers. Rubber from Hevea brasiliensis is the main commercial source for this natural rubber that has a cis-polyisoprene configuration. For sustainable production of enough rubber to meet demand elucidation of the molecular mechanisms involved in the production of latex is vital. To this end, we firstly constructed rubber full-length cDNA libraries of RRIM 600 cultivar and sequenced around 20,000 clones by the Sanger method and over 15,000 contigs by Illumina sequencer. With these data, we updated around 5,500 gene structures and newly annotated around 9,500 transcription start sites. Second, to elucidate the rubber biosynthetic pathways and their transcriptional regulation, we carried out tissue- and cultivar-specific RNA-Seq analysis. By using our recently published genome sequence, we confirmed the expression patterns of the rubber biosynthetic genes. Our data suggest that the cytoplasmic mevalonate (MVA) pathway is the main route for isoprenoid biosynthesis in latex production. In addition to the well-studied polymerization factors, we suggest that rubber elongation factor 8 (REF8) is a candidate factor in cis-polyisoprene biosynthesis. We have also identified 39 transcription factors that may be key regulators in latex production. Expression profile analysis using two additional cultivars, RRIM 901 and PB 350, via an RNA-Seq approach revealed possible expression differences between a high latex-yielding cultivar and a disease-resistant cultivar. PMID:28431015
The G-Box Transcriptional Regulatory Code in Arabidopsis1[OPEN
Shepherd, Samuel J.K.; Brestovitsky, Anna; Dickinson, Patrick; Biswas, Surojit
2017-01-01
Plants have significantly more transcription factor (TF) families than animals and fungi, and plant TF families tend to contain more genes; these expansions are linked to adaptation to environmental stressors. Many TF family members bind to similar or identical sequence motifs, such as G-boxes (CACGTG), so it is difficult to predict regulatory relationships. We determined that the flanking sequences near G-boxes help determine in vitro specificity but that this is insufficient to predict the transcription pattern of genes near G-boxes. Therefore, we constructed a gene regulatory network that identifies the set of bZIPs and bHLHs that are most predictive of the expression of genes downstream of perfect G-boxes. This network accurately predicts transcriptional patterns and reconstructs known regulatory subnetworks. Finally, we present Ara-BOX-cis (araboxcis.org), a Web site that provides interactive visualizations of the G-box regulatory network, a useful resource for generating predictions for gene regulatory relations. PMID:28864470
Regulation of rice root development by a retrotransposon acting as a microRNA sponge.
Cho, Jungnam; Paszkowski, Jerzy
2017-08-26
It is well documented that transposable elements (TEs) can regulate the expression of neighbouring genes. However, their ability to act in trans and influence ectopic loci has been reported rarely. We searched in rice transcriptomes for tissue-specific expression of TEs and found them to be regulated developmentally. They often shared sequence homology with co-expressed genes and contained potential microRNA-binding sites, which suggested possible contributions to gene regulation. In fact, we have identified a retrotransposon that is highly transcribed in roots and whose spliced transcript constitutes a target mimic for miR171. miR171 destabilizes mRNAs encoding the root-specific family of SCARECROW-Like transcription factors. We demonstrate that retrotransposon-derived transcripts act as decoys for miR171, triggering its degradation and thus results in the root-specific accumulation of SCARECROW-Like mRNAs. Such transposon-mediated post-transcriptional control of miR171 levels is conserved in diverse rice species.
Dunn, Matthew P; Di Gregorio, Anna
2009-04-15
In Ciona intestinalis, leprecan was identified as a target of the notochord-specific transcription factor Ciona Brachyury (Ci-Bra) (Takahashi, H., Hotta, K., Erives, A., Di Gregorio, A., Zeller, R.W., Levine, M., Satoh, N., 1999. Brachyury downstream notochord differentiation in the ascidian embryo. Genes Dev. 13, 1519-1523). By screening approximately 14 kb of the Ci-leprecan locus for cis-regulatory activity, we have identified a 581-bp minimal notochord-specific cis-regulatory module (CRM) whose activity depends upon T-box binding sites located at the 3'-end of its sequence. These sites are specifically bound in vitro by a GST-Ci-Bra fusion protein, and mutations that abolish binding in vitro result in loss or decrease of regulatory activity in vivo. Serial deletions of the 581-bp notochord CRM revealed that this sequence is also able to direct expression in muscle cells through the same T-box sites that are utilized by Ci-Bra in the notochord, which are also bound in vitro by the muscle-specific T-box activators Ci-Tbx6b and Ci-Tbx6c. Additionally, we created plasmids aimed to interfere with the function of Ci-leprecan and categorized the resulting phenotypes, which consist of variable dislocations of notochord cells along the anterior-posterior axis. Together, these observations provide mechanistic insights generally applicable to T-box transcription factors and their target sequences, as well as a first set of clues on the function of Leprecan in early chordate development.
NASA Astrophysics Data System (ADS)
Sugimoto, Asuka; Sumi, Takuya; Kang, Jiyoung; Tateno, Masaru
2017-07-01
Recognition in biological macromolecular systems, such as DNA-protein recognition, is one of the most crucial problems to solve toward understanding the fundamental mechanisms of various biological processes. Since specific base sequences of genome DNA are discriminated by proteins, such as transcription factors (TFs), finding TF binding motifs (TFBMs) in whole genome DNA sequences is currently a central issue in interdisciplinary biophysical and information sciences. In the present study, a novel strategy to create a discriminant function for discrimination of TFBMs by constituting mathematical neural networks (NNs) is proposed, together with a method to determine the boundary of signals (TFBMs) and noise in the NN-score (output) space. This analysis also leads to the mathematical limitation of discrimination in the recognition of features representing TFBMs, in an information geometrical manifold. Thus, the present strategy enables the identification of the whole space of TFBMs, right up to the noise boundary.
Kim, Sang Hee; Son, Geon Hui; Bhattacharjee, Saikat; Kim, Hye Jin; Nam, Ji Chul; Nguyen, Phuong Dung T; Hong, Jong Chan; Gassmann, Walter
2014-06-01
The plant immune system must be tightly controlled both positively and negatively to maintain normal plant growth and health. We previously identified SUPPRESSOR OF rps4-RLD1 (SRFR1) as a negative regulator specifically of effector-triggered immunity. SRFR1 is localized in both a cytoplasmic microsomal compartment and in the nucleus. Its TPR domain has sequence similarity to TPR domains of transcriptional repressors in other organisms, suggesting that SRFR1 may negatively regulate effector-triggered immunity via transcriptional control. We show here that excluding SRFR1 from the nucleus prevented complementation of the srfr1 phenotype. To identify transcription factors that interact with SRFR1, we screened an Arabidopsis transcription factor prey library by yeast two-hybrid assay and isolated six class I members of the TEOSINTE BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factor family. Specific interactions were verified in planta. Although single or double T-DNA mutant tcp8, tcp14 or tcp15 lines were not more susceptible to bacteria expressing AvrRps4, the triple tcp8 tcp14 tcp15 mutant displayed decreased effector-triggered immunity mediated by the resistance genes RPS2, RPS4, RPS6 and RPM1. In addition, expression of PATHOGENESIS-RELATED PROTEIN2 was attenuated in srfr1-4 tcp8-1 tcp14-5 tcp15-3 plants compared to srfr1-4 plants. To date, TCP transcription factors have been implicated mostly in developmental processes. Our data indicate that one function of a subset of TCP proteins is to regulate defense gene expression in antagonism to SRFR1, and suggest a mechanism for an intimate connection between plant development and immunity. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Lijun Liu; Trevor Ramsay; Matthew S. Zinkgraf; David Sundell; Nathaniel Robert Street; Vladimir Filkov; Andrew Groover
2015-01-01
Identifying transcription factor target genes is essential for modeling the transcriptional networks underlying developmental processes. Here we report a chromatin immunoprecipitation sequencing (ChIP-seq) resource consisting of genome-wide binding regions and associated putative target genes for four Populus homeodomain transcription factors...
Genome wide gene expression regulation by HIP1 Protein Interactor, HIPPI: prediction and validation.
Datta, Moumita; Choudhury, Ananyo; Lahiri, Ansuman; Bhattacharyya, Nitai P
2011-09-26
HIP1 Protein Interactor (HIPPI) is a pro-apoptotic protein that induces Caspase8 mediated apoptosis in cell. We have shown earlier that HIPPI could interact with a specific 9 bp sequence motif, defined as the HIPPI binding site (HBS), present in the upstream promoter of Caspase1 gene and regulate its expression. We also have shown that HIPPI, without any known nuclear localization signal, could be transported to the nucleus by HIP1, a NLS containing nucleo-cytoplasmic shuttling protein. Thus our present work aims at the investigation of the role of HIPPI as a global transcription regulator. We carried out genome wide search for the presence of HBS in the upstream sequences of genes. Our result suggests that HBS was predominantly located within 2 Kb upstream from transcription start site. Transcription factors like CREBP1, TBP, OCT1, EVI1 and P53 half site were significantly enriched in the 100 bp vicinity of HBS indicating that they might co-operate with HIPPI for transcription regulation. To illustrate the role of HIPPI on transcriptome, we performed gene expression profiling by microarray. Exogenous expression of HIPPI in HeLa cells resulted in up-regulation of 580 genes (p < 0.05) while 457 genes were down-regulated. Several transcription factors including CBP, REST, C/EBP beta were altered by HIPPI in this study. HIPPI also interacted with P53 in the protein level. This interaction occurred exclusively in the nuclear compartment and was absent in cells where HIP1 was knocked down. HIPPI-P53 interaction was necessary for HIPPI mediated up-regulation of Caspase1 gene. Finally, we analyzed published microarray data obtained with post mortem brains of Huntington's disease (HD) patients to investigate the possible involvement of HIPPI in HD pathogenesis. We observed that along with the transcription factors like CREB, P300, SREBP1, Sp1 etc. which are already known to be involved in HD, HIPPI binding site was also significantly over-represented in the upstream sequences of genes altered in HD. Taken together, the results suggest that HIPPI could act as an important transcription regulator in cell regulating a vast array of genes, particularly transcription factors and at least, in part, play a role in transcription deregulation observed in HD.
Genome wide gene expression regulation by HIP1 Protein Interactor, HIPPI: Prediction and validation
2011-01-01
Background HIP1 Protein Interactor (HIPPI) is a pro-apoptotic protein that induces Caspase8 mediated apoptosis in cell. We have shown earlier that HIPPI could interact with a specific 9 bp sequence motif, defined as the HIPPI binding site (HBS), present in the upstream promoter of Caspase1 gene and regulate its expression. We also have shown that HIPPI, without any known nuclear localization signal, could be transported to the nucleus by HIP1, a NLS containing nucleo-cytoplasmic shuttling protein. Thus our present work aims at the investigation of the role of HIPPI as a global transcription regulator. Results We carried out genome wide search for the presence of HBS in the upstream sequences of genes. Our result suggests that HBS was predominantly located within 2 Kb upstream from transcription start site. Transcription factors like CREBP1, TBP, OCT1, EVI1 and P53 half site were significantly enriched in the 100 bp vicinity of HBS indicating that they might co-operate with HIPPI for transcription regulation. To illustrate the role of HIPPI on transcriptome, we performed gene expression profiling by microarray. Exogenous expression of HIPPI in HeLa cells resulted in up-regulation of 580 genes (p < 0.05) while 457 genes were down-regulated. Several transcription factors including CBP, REST, C/EBP beta were altered by HIPPI in this study. HIPPI also interacted with P53 in the protein level. This interaction occurred exclusively in the nuclear compartment and was absent in cells where HIP1 was knocked down. HIPPI-P53 interaction was necessary for HIPPI mediated up-regulation of Caspase1 gene. Finally, we analyzed published microarray data obtained with post mortem brains of Huntington's disease (HD) patients to investigate the possible involvement of HIPPI in HD pathogenesis. We observed that along with the transcription factors like CREB, P300, SREBP1, Sp1 etc. which are already known to be involved in HD, HIPPI binding site was also significantly over-represented in the upstream sequences of genes altered in HD. Conclusions Taken together, the results suggest that HIPPI could act as an important transcription regulator in cell regulating a vast array of genes, particularly transcription factors and at least, in part, play a role in transcription deregulation observed in HD. PMID:21943362
Campbell, Elizabeth A.; Greenwell, Roger; Anthony, Jennifer R.; Wang, Sheng; Lim, Lionel; Das, Kalyan; Sofia, Heidi J.; Donohue, Timothy J.; Darst, Seth A.
2008-01-01
SUMMARY A transcriptional response to singlet oxygen in Rhodobacter sphaeroides is controlled by the group IV σ factor σE and its cognate anti-σ ChrR. Crystal structures of the σE/ChrR complex reveal a modular, two-domain architecture for ChrR. The ChrR N-terminal anti-σ domain (ASD) binds a Zn2+ ion, contacts σE, and is sufficient to inhibit σE-dependent transcription. The ChrR C-terminal domain adopts a cupin fold, can coordinate an additional Zn2+, and is required for the transcriptional response to singlet oxygen. Structure-based sequence analyses predict that the ASD defines a common structural fold among predicted group IV antiσs. These ASDs are fused to diverse C-terminal domains that are likely involved in responding to specific environmental signals that control the activity of their cognate σ factor. PMID:17803943
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bach, Christian; Sherman, William; Pallis, Jani
Zinc finger nucleases (ZFNs) are associated with cell death and apoptosis by binding at countless undesired locations. This cytotoxicity is associated with the binding ability of engineered zinc finger domains to bind dissimilar DNA sequences with high affinity. In general, binding preferences of transcription factors are associated with significant degenerated diversity and complexity which convolutes the design and engineering of precise DNA binding domains. Evolutionary success of natural zinc finger proteins, however, evinces that nature created specific evolutionary traits and strategies, such as modularity and rank-specific recognition to cope with binding complexity that are critical for creating clinical viable toolsmore » to precisely modify the human genome. Our findings indicate preservation of general modularity and significant alteration of the rank-specific binding preferences of the three-finger binding domain of transcription factor SP1 when exchanging amino acids in the 2nd finger.« less
Bach, Christian; Sherman, William; Pallis, Jani; ...
2014-01-01
Zinc finger nucleases (ZFNs) are associated with cell death and apoptosis by binding at countless undesired locations. This cytotoxicity is associated with the binding ability of engineered zinc finger domains to bind dissimilar DNA sequences with high affinity. In general, binding preferences of transcription factors are associated with significant degenerated diversity and complexity which convolutes the design and engineering of precise DNA binding domains. Evolutionary success of natural zinc finger proteins, however, evinces that nature created specific evolutionary traits and strategies, such as modularity and rank-specific recognition to cope with binding complexity that are critical for creating clinical viable toolsmore » to precisely modify the human genome. Our findings indicate preservation of general modularity and significant alteration of the rank-specific binding preferences of the three-finger binding domain of transcription factor SP1 when exchanging amino acids in the 2nd finger.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Long, Cong; Wang, Jingchao; Guo, Wei
Primary effusion lymphoma (PEL) is a rare and aggressive non-Hodgkin's lymphoma. Human telomerase reverse transcriptase (hTERT), a key component responsible for the regulation of telomerase activity, plays important roles in cellular immortalization and cancer development. Triptolide purified from Tripterygium extracts displays a broad-spectrum bioactivity profile, including immunosuppressive, anti-inflammatory, and anti-tumor. In this study, it is investigated whether triptolide reduces hTERT expression and suppresses its activity in PEL cells. The mRNA and protein levels of hTERT were examined by real time-PCR and Western blotting, respectively. The activity of hTERT promoter was determined by Dual luciferase reporter assay. Our results demonstrated thatmore » triptolide decreased expression of hTERT at both mRNA and protein levels. Further gene sequence analysis indicated that the activity of hTERT promoter was suppressed by triptolide. Triptolide also reduced the half-time of hTERT. Additionally, triptolide inhibited the expression of transcription factor specificity protein 1(Sp1) in PEL cells. Furthermore, knock-down of Sp1 by using specific shRNAs resulted in down-regulation of hTERT transcription and protein expression levels. Inhibition of Sp1 by specific shRNAs enhanced triptolide-induced cell growth inhibition and apoptosis. Collectively, our results demonstrate that the inhibitory effect of triptolide on hTERT transcription is possibly mediated by inhibition of transcription factor Sp1 in PEL cells. - Highlights: • Triptolide reduces expression of hTERT by decreasing its transcription level. • Triptolide reduces promoter activity and stability of hTERT. • Triptolide down-regulates expression of Sp1. • Special Sp1 shRNAs inhibit transcription and protein expression of hTERT. • Triptolide and Sp1 shRNA2 induce cell proliferation inhibition and apoptosis.« less
Identification of distal silencing elements in the murine interferon-A11 gene promoter.
Roffet, P; Lopez, S; Navarro, S; Bandu, M T; Coulombel, C; Vignal, M; Doly, J; Vodjdani, G
1996-08-01
The murine interferon-A11 (Mu IFN-A11) gene is a member of the IFN-A multigenic family. In mouse L929 cells, the weak response of the gene's promoter to viral induction is due to a combination of both a point mutation in the virus responsive element (VRE) and the presence of negatively regulating sequences surrounding the VRE. In the distal part of the promoter, the negatively acting E1E2 sequence was delimited. This sequence displays an inhibitory effect in either orientation or position on the inducibility of a virus-responsive heterologous promoter. It selectively represses VRE-dependent transcription but is not able to reduce the transcriptional activity of a VRE-lacking promoter. In a transient transfection assay, an E1E2-containing DNA competitor was able to derepress the native Mu IFN-A11 promoter. Specific nuclear factors bind to this sequence; thus the binding of trans-regulators participates in the repression of the Mu IFN-A11 gene. The E1E2 sequence contains an IFN regulatory factor (IRF)-binding site. Recombinant IRF2 binds this sequence and anti-IRF2 antibodies supershift a major complex formed with nuclear extracts. The protein composing the complex is 50 kDa in size, indicating the presence of IRF2 or antigenically related proteins in the complex. The Mu IFN-A11 gene is the first example within the murine IFN-A family, in which a distal promoter element has been identified that can negatively modulate the transcriptional response to viral induction.
Design of Conditionally Active STATs: Insights into STAT Activation and Gene Regulatory Function
Milocco, Lawrence H.; Haslam, Jennifer A.; Rosen, Jonathan; Seidel, H. Martin
1999-01-01
The STAT (signal transducer and activator of transcription) signaling pathway is activated by a large number of cytokines and growth factors. We sought to design a conditionally active STAT that could not only provide insight into basic questions about STAT function but also serve as a powerful tool to determine the precise biological role of STATs. To this end, we have developed a conditionally active STAT by fusing STATs with the ligand-binding domain of the estrogen receptor (ER). We have demonstrated that the resulting STAT-ER chimeras are estrogen-inducible transcription factors that retain the functional and biochemical characteristics of the cognate wild-type STATs. In addition, these tools have allowed us to evaluate separately the contribution of tyrosine phosphorylation and dimerization to STAT function. We have for the first time provided experimental data supporting the model that the only apparent role of STAT tyrosine phosphorylation is to drive dimerization, as dimerization alone is sufficient to unmask a latent STAT nuclear localization sequence and induce nuclear translocation, sequence-specific DNA binding, and transcriptional activity. PMID:10082558
Baumann, G; Geisse, S; Sullivan, M
1991-03-01
The structurally unrelated immunosuppressive drugs cyclosporin A (Sandimmun) and FK-506 both interfere with the process of T-cell proliferation by blocking the transcription of the T-cell growth factor interleukin-2 (IL-2). Here we demonstrate that the transcriptional activation of this gene requires the binding of regulatory nuclear proteins to a promoter element with sequence similarity to the consensus binding site for NF-kappa B-related transcription factors. We present evidence that the binding by regulatory nuclear proteins to the kappa B element of the IL-2 promoter is affected negatively by cyclosporin A and FK-506 at concentrations paralleling their immunosuppressive activity in vivo. The decrease in DNA-protein complex formation induced by the immunosuppressive drugs correlates with a decrease in IL-2 production. FK-506 is 10 to 100 times more potent than cyclosporin A in its ability to inhibit sequence-specific DNA binding and IL-2 production. Our findings suggest that the actions of both drugs converge at the level of DNA-protein interaction.
VirF-Independent Regulation of Shigella virB Transcription is Mediated by the Small RNA RyhB
Broach, William H.; Egan, Nicholas; Wing, Helen J.; Payne, Shelley M.; Murphy, Erin R.
2012-01-01
Infection of the human host by Shigella species requires the coordinated production of specific Shigella virulence factors, a process mediated largely by the VirF/VirB regulatory cascade. VirF promotes the transcription of virB, a gene encoding the transcriptional activator of several virulence-associated genes. This study reveals that transcription of virB is also regulated by the small RNA RyhB, and importantly, that this regulation is not achieved indirectly via modulation of VirF activity. These data are the first to demonstrate that the regulation of virB transcription can be uncoupled from the master regulator VirF. It is also established that efficient RyhB-dependent regulation of transcription is facilitated by specific nucleic acid sequences within virB. This study not only reveals RyhB-dependent regulation of virB transcription as a novel point of control in the central regulatory circuit modulating Shigella virulence, but also highlights the versatility of RyhB in controlling bacterial gene expression. PMID:22701677
Alten, Leonie; Schuster-Gossler, Karin; Eichenlaub, Michael P; Wittbrodt, Beate; Wittbrodt, Joachim; Gossler, Achim
2012-01-01
The vertebrate organizer and notochord have conserved, essential functions for embryonic development and patterning. The restricted expression of developmental regulators in these tissues is directed by specific cis-regulatory modules (CRMs) whose sequence conservation varies considerably. Some CRMs have been conserved throughout vertebrates and likely represent ancestral regulatory networks, while others have diverged beyond recognition but still function over a wide evolutionary range. Here we identify and characterize a mammalian-specific CRM required for node and notochord specific (NNC) expression of NOTO, a transcription factor essential for node morphogenesis, nodal cilia movement and establishment of laterality in mouse. A 523 bp enhancer region (NOCE) upstream the Noto promoter was necessary and sufficient for NNC expression from the endogenous Noto locus. Three subregions in NOCE together mediated full activity in vivo. Binding sites for known transcription factors in NOCE were functional in vitro but dispensable for NOCE activity in vivo. A FOXA2 site in combination with a novel motif was necessary for NOCE activity in vivo. Strikingly, syntenic regions in non-mammalian vertebrates showed no recognizable sequence similarities. In contrast to its activity in mouse NOCE did not drive NNC expression in transgenic fish. NOCE represents a novel, mammal-specific CRM required for the highly restricted Noto expression in the node and nascent notochord and thus regulates normal node development and function.
Identification of Androgen Receptor-Specific Enhancer RNAs
2016-06-01
Release; Distribution Unlimited The views, opinions and/or findings contained in this report are those of the author(s) and should not be construed as...There are two Tasks in this application. First, we will perform global run-on assay and deep sequencing to identify AR -specific enhancer RNAs. Second...1 Introduction The androgen receptor ( AR ) is a nuclear receptor transcription factor required for normal prostate development and prostate
Li, Shui-gen; Li, Wan-feng; Han, Su-ying; Yang, Wen-hua; Qi, Li-wang
2013-06-15
Polar auxin transport provides a developmental signal for cell fate specification during somatic embryogenesis. Some members of the HD-ZIP III transcription factors participate in regulation of auxin transport, but little is known about this regulation in somatic embryogenesis. Here, four HD-ZIP III homologues from Larix leptolepis were identified and designated LaHDZ31, 32, 33 and 34. The occurrence of a miR165/166 target sequence in all four cDNA sequences indicated that they might be targets of miR165/166. Identification of the cleavage products of LaHDZ31 and LaHDZ32 in vivo confirmed that they were regulated by miRNA. Their mRNA accumulation patterns during somatic embryogenesis and the effects of 1-N-naphthylphthalamic acid (NPA) on their transcript levels and somatic embryo maturation were investigated. The results showed that the four genes had higher transcript levels at mature stages than at the proliferation stage, and that NPA treatment down-regulated the mRNA abundance of LaHDZ31, 32 and 33 at cotyledonary embryo stages, but had no effect on the mRNA abundance of LaHDZ34. We concluded that these four members of Larix HD-ZIP III family might participate in polar auxin transport and the development of somatic embryos, providing new insights into the regulatory mechanisms of somatic embryogenesis. Copyright © 2013 Elsevier B.V. All rights reserved.
Genomic dissection of conserved transcriptional regulation in intestinal epithelial cells
Camp, J. Gray; Weiser, Matthew; Cocchiaro, Jordan L.; Kingsley, David M.; Furey, Terrence S.; Sheikh, Shehzad Z.; Rawls, John F.
2017-01-01
The intestinal epithelium serves critical physiologic functions that are shared among all vertebrates. However, it is unknown how the transcriptional regulatory mechanisms underlying these functions have changed over the course of vertebrate evolution. We generated genome-wide mRNA and accessible chromatin data from adult intestinal epithelial cells (IECs) in zebrafish, stickleback, mouse, and human species to determine if conserved IEC functions are achieved through common transcriptional regulation. We found evidence for substantial common regulation and conservation of gene expression regionally along the length of the intestine from fish to mammals and identified a core set of genes comprising a vertebrate IEC signature. We also identified transcriptional start sites and other putative regulatory regions that are differentially accessible in IECs in all 4 species. Although these sites rarely showed sequence conservation from fish to mammals, surprisingly, they drove highly conserved IEC expression in a zebrafish reporter assay. Common putative transcription factor binding sites (TFBS) found at these sites in multiple species indicate that sequence conservation alone is insufficient to identify much of the functionally conserved IEC regulatory information. Among the rare, highly sequence-conserved, IEC-specific regulatory regions, we discovered an ancient enhancer upstream from her6/HES1 that is active in a distinct population of Notch-positive cells in the intestinal epithelium. Together, these results show how combining accessible chromatin and mRNA datasets with TFBS prediction and in vivo reporter assays can reveal tissue-specific regulatory information conserved across 420 million years of vertebrate evolution. We define an IEC transcriptional regulatory network that is shared between fish and mammals and establish an experimental platform for studying how evolutionarily distilled regulatory information commonly controls IEC development and physiology. PMID:28850571
PlantTFDB: a comprehensive plant transcription factor database
Guo, An-Yuan; Chen, Xin; Gao, Ge; Zhang, He; Zhu, Qi-Hui; Liu, Xiao-Chuan; Zhong, Ying-Fu; Gu, Xiaocheng; He, Kun; Luo, Jingchu
2008-01-01
Transcription factors (TFs) play key roles in controlling gene expression. Systematic identification and annotation of TFs, followed by construction of TF databases may serve as useful resources for studying the function and evolution of transcription factors. We developed a comprehensive plant transcription factor database PlantTFDB (http://planttfdb.cbi.pku.edu.cn), which contains 26 402 TFs predicted from 22 species, including five model organisms with available whole genome sequence and 17 plants with available EST sequences. To provide comprehensive information for those putative TFs, we made extensive annotation at both family and gene levels. A brief introduction and key references were presented for each family. Functional domain information and cross-references to various well-known public databases were available for each identified TF. In addition, we predicted putative orthologs of those TFs among the 22 species. PlantTFDB has a simple interface to allow users to search the database by IDs or free texts, to make sequence similarity search against TFs of all or individual species, and to download TF sequences for local analysis. PMID:17933783
Liu, Dan; Wang, Qianqian; Ruan, Zengliang; He, Qian; Zhang, Liming
2015-01-01
Background Jellyfish contain diverse toxins and other bioactive components. However, large-scale identification of novel toxins and bioactive components from jellyfish has been hampered by the low efficiency of traditional isolation and purification methods. Results We performed de novo transcriptome sequencing of the tentacle tissue of the jellyfish Cyanea capillata. A total of 51,304,108 reads were obtained and assembled into 50,536 unigenes. Of these, 21,357 unigenes had homologues in public databases, but the remaining unigenes had no significant matches due to the limited sequence information available and species-specific novel sequences. Functional annotation of the unigenes also revealed general gene expression profile characteristics in the tentacle of C. capillata. A primary goal of this study was to identify putative toxin transcripts. As expected, we screened many transcripts encoding proteins similar to several well-known toxin families including phospholipases, metalloproteases, serine proteases and serine protease inhibitors. In addition, some transcripts also resembled molecules with potential toxic activities, including cnidarian CfTX-like toxins with hemolytic activity, plancitoxin-1, venom toxin-like peptide-6, histamine-releasing factor, neprilysin, dipeptidyl peptidase 4, vascular endothelial growth factor A, angiotensin-converting enzyme-like and endothelin-converting enzyme 1-like proteins. Most of these molecules have not been previously reported in jellyfish. Interestingly, we also characterized a number of transcripts with similarities to proteins relevant to several degenerative diseases, including Huntington’s, Alzheimer’s and Parkinson’s diseases. This is the first description of degenerative disease-associated genes in jellyfish. Conclusion We obtained a well-categorized and annotated transcriptome of C. capillata tentacle that will be an important and valuable resource for further understanding of jellyfish at the molecular level and information on the underlying molecular mechanisms of jellyfish stinging. The findings of this study may also be used in comparative studies of gene expression profiling among different jellyfish species. PMID:26551022
Liu, Guoyan; Zhou, Yonghong; Liu, Dan; Wang, Qianqian; Ruan, Zengliang; He, Qian; Zhang, Liming
2015-01-01
Jellyfish contain diverse toxins and other bioactive components. However, large-scale identification of novel toxins and bioactive components from jellyfish has been hampered by the low efficiency of traditional isolation and purification methods. We performed de novo transcriptome sequencing of the tentacle tissue of the jellyfish Cyanea capillata. A total of 51,304,108 reads were obtained and assembled into 50,536 unigenes. Of these, 21,357 unigenes had homologues in public databases, but the remaining unigenes had no significant matches due to the limited sequence information available and species-specific novel sequences. Functional annotation of the unigenes also revealed general gene expression profile characteristics in the tentacle of C. capillata. A primary goal of this study was to identify putative toxin transcripts. As expected, we screened many transcripts encoding proteins similar to several well-known toxin families including phospholipases, metalloproteases, serine proteases and serine protease inhibitors. In addition, some transcripts also resembled molecules with potential toxic activities, including cnidarian CfTX-like toxins with hemolytic activity, plancitoxin-1, venom toxin-like peptide-6, histamine-releasing factor, neprilysin, dipeptidyl peptidase 4, vascular endothelial growth factor A, angiotensin-converting enzyme-like and endothelin-converting enzyme 1-like proteins. Most of these molecules have not been previously reported in jellyfish. Interestingly, we also characterized a number of transcripts with similarities to proteins relevant to several degenerative diseases, including Huntington's, Alzheimer's and Parkinson's diseases. This is the first description of degenerative disease-associated genes in jellyfish. We obtained a well-categorized and annotated transcriptome of C. capillata tentacle that will be an important and valuable resource for further understanding of jellyfish at the molecular level and information on the underlying molecular mechanisms of jellyfish stinging. The findings of this study may also be used in comparative studies of gene expression profiling among different jellyfish species.
Saldarriaga, Omar A.; Travi, Bruno L.; Choudhury, Goutam Ghosh; Melby, Peter C.
2012-01-01
IFN-γ/LPS-activated hamster (Mesocricetus auratus) macrophages express significantly less iNOS (NOS2) than activated mouse macrophages, which contributes to the hamster's susceptibility to intracellular pathogens. We determined a mechanism responsible for differences in iNOS promoter activity in hamsters and mice. The HtPP (1.2 kb) showed low basal and inducible promoter activity when compared with the mouse, and sequences within a 100-bp region (−233 to −133) of the mouse and hamster promoters influenced this activity. Moreover, within this 100 bp, we identified a smaller region (44 bp) in the mouse promoter, which recovered basal promoter activity when swapped into the hamster promoter. The mouse homolog (100-bp region) contained a cis-element for NF-IL-6 (−153/−142), which was absent in the hamster counterpart. EMSA and supershift assays revealed that the hamster sequence did not support the binding of NF-IL-6. Introduction of a functional NF-IL-6 binding sequence into the hamster promoter or its alteration in the mouse promoter revealed the critical importance of this transcription factor for full iNOS promoter activity. Furthermore, the binding of NF-IL-6 to the iNOS promoter (−153/−142) in vivo was increased in mouse cells but was reduced in hamster cells after IFN-γ/LPS stimulation. Differences in the activity of the iNOS promoters were evident in mouse and hamster cells, so they were not merely a result of species-specific differences in transcription factors. Thus, we have identified unique DNA sequences and a critical transcription factor, NF-IL-6, which contribute to the overall basal and inducible expression of hamster iNOS. PMID:22517919
Kast, Brigitte; Schori, Christian; Grimm, Christian
2016-05-01
Hypoxic preconditioning protects photoreceptors against light-induced degeneration preserving retinal morphology and function. Although hypoxia inducible transcription factors 1 and 2 (HIF1, HIF2) are the main regulators of the hypoxic response, photoreceptor protection does not depend on HIF1 in rods. Here we used rod-specific Hif2a single and Hif1a;Hif2a double knockout mice to investigate the potential involvement of HIF2 in rods for protection after hypoxic preconditioning. To identify potential HIF2 target genes in rods we determined the retinal transcriptome of hypoxic control and rod-specific Hif2a knockouts by RNA sequencing. We show that rods do not need HIF2 for hypoxia-induced increased survival after light exposure. The transcriptomic analysis revealed a number of genes that are potentially regulated by HIF2 in rods; among those were Htra1, Timp3 and Hmox1, candidates that are interesting due to their connection to human degenerative diseases of the retina. We conclude that neither HIF1 nor HIF2 are required in photoreceptors for protection by hypoxic preconditioning. We hypothesize that HIF transcription factors may be needed in other cells to produce protective factors acting in a paracrine fashion on photoreceptor cells. Alternatively, hypoxic preconditioning induces a rod-intrinsic response that is independent of HIF transcription factors. Copyright © 2015 Elsevier Ltd. All rights reserved.
Hégarat, Nadia; Novopashina, Darya; Fokina, Alesya A; Boutorine, Alexandre S; Venyaminova, Alya G; Praseuth, Danièle; François, Jean-Christophe
2014-03-01
Inhibition of insulin-like growth factor I (IGF-I) signaling is a promising antitumor strategy and nucleic acid-based approaches have been investigated to target genes in the pathway. Here, we sought to modulate IGF-I transcriptional activity using triple helix formation. The IGF-I P1 promoter contains a purine/pyrimidine (R/Y) sequence that is pivotal for transcription as determined by deletion analysis and can be targeted with a triplex-forming oligonucleotide (TFO). We designed modified purine- and pyrimidine-rich TFOs to bind to the R/Y sequence. To monitor TFO binding, we developed a fluorescence-based gel-retardation assay that allowed independent detection of each strand in three-stranded complexes using end-labeling with Alexa 488, cyanine (Cy)3 and Cy5 fluorochromes. We characterized TFOs for their ability to inhibit restriction enzyme activity, compete with DNA-binding proteins and inhibit IGF-I transcription in reporter assays. TFOs containing modified nucleobases, 5-methyl-2'-deoxycytidine and 5-propynyl-2'-deoxyuridine, specifically inhibited restriction enzyme cleavage and formed triplexes on the P1 promoter fragment. In cells, deletion of the R/Y-rich sequence led to 48% transcriptional inhibition of a reporter gene. Transfection with TFOs inhibited reporter gene activity to a similar extent, whereas transcription from a mutant construct with an interrupted R/Y region was unaffected, strongly suggesting the involvement of triplex formation in the inhibitory mechanisms. Our results indicate that nuclease-resistant TFOs will likely inhibit endogenous IGF-I gene function in cells. © 2014 FEBS.
Transcriptome analysis by strand-specific sequencing of complementary DNA
Parkhomchuk, Dmitri; Borodina, Tatiana; Amstislavskiy, Vyacheslav; Banaru, Maria; Hallen, Linda; Krobitsch, Sylvia; Lehrach, Hans; Soldatov, Alexey
2009-01-01
High-throughput complementary DNA sequencing (RNA-Seq) is a powerful tool for whole-transcriptome analysis, supplying information about a transcript's expression level and structure. However, it is difficult to determine the polarity of transcripts, and therefore identify which strand is transcribed. Here, we present a simple cDNA sequencing protocol that preserves information about a transcript's direction. Using Saccharomyces cerevisiae and mouse brain transcriptomes as models, we demonstrate that knowing the transcript's orientation allows more accurate determination of the structure and expression of genes. It also helps to identify new genes and enables studying promoter-associated and antisense transcription. The transcriptional landscapes we obtained are available online. PMID:19620212
Transcriptome analysis by strand-specific sequencing of complementary DNA.
Parkhomchuk, Dmitri; Borodina, Tatiana; Amstislavskiy, Vyacheslav; Banaru, Maria; Hallen, Linda; Krobitsch, Sylvia; Lehrach, Hans; Soldatov, Alexey
2009-10-01
High-throughput complementary DNA sequencing (RNA-Seq) is a powerful tool for whole-transcriptome analysis, supplying information about a transcript's expression level and structure. However, it is difficult to determine the polarity of transcripts, and therefore identify which strand is transcribed. Here, we present a simple cDNA sequencing protocol that preserves information about a transcript's direction. Using Saccharomyces cerevisiae and mouse brain transcriptomes as models, we demonstrate that knowing the transcript's orientation allows more accurate determination of the structure and expression of genes. It also helps to identify new genes and enables studying promoter-associated and antisense transcription. The transcriptional landscapes we obtained are available online.
Matsuo, Noritaka; Yu-Hua, Wang; Sumiyoshi, Hideaki; Sakata-Takatani, Keiko; Nagato, Hitoshi; Sakai, Kumiko; Sakurai, Mami; Yoshioka, Hidekatsu
2003-08-29
We have characterized the proximal promoter region of the human COL11A1 gene. Transient transfection assays indicate that the segment from -199 to +1 is necessary for the activation of basal transcription. Electrophoretic mobility shift assays (EMSAs) demonstrated that the ATTGG sequence, within the -147 to -121 fragment, is critical to bind nuclear proteins in the proximal COL11A1 promoter. We demonstrated that the CCAAT binding factor (CBF/NF-Y) bound to this region using an interference assay with consensus oligonucleotides and a supershift assay with specific antibodies in an EMSA. In a chromatin immunoprecipitation assay and EMSA using DNA-affinity-purified proteins, CBF/NF-Y proteins directly bound this region in vitro and in vivo. We also showed that four tandem copies of the CBF/NF-Y-binding fragment produced higher transcriptional activity than one or two copies, whereas the absence of a CBF/NF-Y-binding fragment suppressed the COL11A1 promoter activity. Furthermore, overexpression of a dominant-negative CBF-B/NF-YA subunit significantly inhibited promoter activity in both transient and stable cells. These results indicate that the CBF/NF-Y proteins regulate the transcription of COL11A1 by directly binding to the ATTGG sequence in the proximal promoter region.
Dimeric PROP1 binding to diverse palindromic TAAT sequences promotes its transcriptional activity.
Nakayama, Michie; Kato, Takako; Susa, Takao; Sano, Akiko; Kitahara, Kousuke; Kato, Yukio
2009-08-13
Mutations in the Prop1 gene are responsible for murine Ames dwarfism and human combined pituitary hormone deficiency with hypogonadism. Recently, we reported that PROP1 is a possible transcription factor for gonadotropin subunit genes through plural cis-acting sites composed of AT-rich sequences containing a TAAT motif which differs from its consensus binding sequence known as PRDQ9 (TAATTGAATTA). This study aimed to verify the binding specificity and sequence of PROP1 by applying the method of SELEX (Systematic Evolution of Ligands by EXponential enrichment), EMSA (electrophoretic mobility shift assay) and transient transfection assay. SELEX, after 5, 7 and 9 generations of selection using a random sequence library, showed that nucleotides containing one or two TAAT motifs were accumulated and accounted for 98.5% at the 9th generation. Aligned sequences and EMSA demonstrated that PROP1 binds preferentially to 11 nucleotides composed of an inverted TAAT motif separated by 3 nucleotides with variation in the half site of palindromic TAAT motifs and with preferential requirement of T at the nucleotide number 5 immediately 3' to a TAAT motif. Transient transfection assay demonstrated first that dimeric binding of PROP1 to an inverted TAAT motif and its cognates resulted in transcriptional activation, whereas monomeric binding of PROP1 to a single TAAT motif and an inverted ATTA motif did not mediate activation. Thus, this study demonstrated that dimeric binding of PROP1 is able to recognize diverse palindromic TAAT sequences separated by 3 nucleotides and to exhibit its transcriptional activity.
Yokozaki, H; Tahara, H; Oue, N; Tahara, E
2000-01-01
A new transcription variant of hepatocyte growth factor/scatter factor (HGF/SF) was cloned from human gastric cancer cell line HSC-39. Northern blot analysis of eight human gastric cancer cell lines (TMK-1, MKN-1, MKN-7, MKN-28, MKN-45, MKN-74, KATO-III and HSC-39) demonstrated that HSC-39 cells expressed a 1.3 kb abnormal HGF/SF transcript. Screening of 1 x 10(6) colonies of cDNA library from HSC-39 constructed in pAP3neo mammalian expression vector selected four positive clones containing HGF/SF transcript. Among them, two contained a 1.3 kbp insert detecting the identical transcript to that obtained with HGF/SF probe by Northern blotting. Deoxynucleotide sequencing of the 1.3 kbp insert revealed that it was composed of a part of HGF/SF cDNA from exon 14 to exon 18, corresponding to the whole sequence of HGF/SF light chain, with 5' 75 nucleotides unrelated to any sequence involved in HGF/SF.
Direct non transcriptional role of NF-Y in DNA replication.
Benatti, Paolo; Belluti, Silvia; Miotto, Benoit; Neusiedler, Julia; Dolfini, Diletta; Drac, Marjorie; Basile, Valentina; Schwob, Etienne; Mantovani, Roberto; Blow, J Julian; Imbriano, Carol
2016-04-01
NF-Y is a heterotrimeric transcription factor, which plays a pioneer role in the transcriptional control of promoters containing the CCAAT-box, among which genes involved in cell cycle regulation, apoptosis and DNA damage response. The knock-down of the sequence-specific subunit NF-YA triggers defects in S-phase progression, which lead to apoptotic cell death. Here, we report that NF-Y has a critical function in DNA replication progression, independent from its transcriptional activity. NF-YA colocalizes with early DNA replication factories, its depletion affects the loading of replisome proteins to DNA, among which Cdc45, and delays the passage from early to middle-late S phase. Molecular combing experiments are consistent with a role for NF-Y in the control of fork progression. Finally, we unambiguously demonstrate a direct non-transcriptional role of NF-Y in the overall efficiency of DNA replication, specifically in the DNA elongation process, using a Xenopus cell-free system. Our findings broaden the activity of NF-Y on a DNA metabolism other than transcription, supporting the existence of specific TFs required for proper and efficient DNA replication. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Electrophoretic mobility shift scanning using an automated infrared DNA sequencer.
Sano, M; Ohyama, A; Takase, K; Yamamoto, M; Machida, M
2001-11-01
Electrophoretic mobility shift assay (EMSA) is widely used in the study of sequence-specific DNA-binding proteins, including transcription factors and mismatch binding proteins. We have established a non-radioisotope-based protocol for EMSA that features an automated DNA sequencer with an infrared fluorescent dye (IRDye) detection unit. Our modification of the elec- trophoresis unit, which includes cooling the gel plates with a reduced well-to-read length, has made it possible to detect shifted bands within 1 h. Further, we have developed a rapid ligation-based method for generating IRDye-labeled probes with an approximately 60% cost reduction. This method has the advantages of real-time scanning, stability of labeled probes, and better safety associated with nonradioactive methods of detection. Analysis of a promoter from an industrially important filamentous fungus, Aspergillus oryzae, in a prototype experiment revealed that the method we describe has potential for use in systematic scanning and identification of the functionally important elements to which cellular factors bind in a sequence-specific manner.
Activation of vitellogenin II gene expression by steroid hormones in the old Japanese quail.
Gupta, S; Upadhyay, R; Kanungo, M S
1998-11-01
Alterations in the basal transcription rates of eukaryotic genes are believed to involve the binding of trans-acting factor(s) with specific DNA sequences in the promoter. We show here two interrelated events for the VTGII gene of the old, non-egg laying Japanese quail: alterations in the structure of the chromatin encompassing the gene, and binding of trans-acting factors to the promoter of the gene. Estradiol/progesterone alone or together cause alterations in the conformation of the chromatin of the promoter region of the gene. This may allow free access of nuclear protein(s) to the cis-acting elements, ERE, PRE and NF1, in the promoter of the gene and cause activation of transcription.
de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas
2014-01-01
The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. PMID:24792163
Ectoderm gene activation in sea urchin embryos mediated by the CCAAT-binding factor.
Li, Xiaotao; Bhattacharya, Chitralekha; Dayal, Sandeep; Maity, Sankar; Klein, William H
2002-05-01
Transcriptional enhancers are short stretches of DNA that function to achieve highly specific patterns of gene expression. To identify the mechanisms by which enhancers achieve their specificity, we made use of an enhancer from the aboral ectoderm-specific spec2a gene of the sea urchin Strongylocentrotus purpuratus. The spec2a enhancer contains five cis-regulatory elements within 78 base pairs that interact with five distinct DNA-binding proteins to confer aboral ectoderm expression. Here, we present an analysis of the sea urchin CCAAT binding factor (CBF), which binds to a CCAAT motif within the spec2a enhancer. S. purpuratus CBF and SpOtx, a ubiquitously expressed factor, act together at closely placed cis-regulatory elements to mediate spec2a transcription in the ectoderm. SpCBF was the sole factor that bound to the spec2a CCAAT element, and two of the three subunits that make up the CBF holoprotein were cloned and shown to have high sequence conservation with their vertebrate orthologs. Based on its involvement in the regulation of several other sea urchin genes, SpCBF appears to be a major transcription factor in the sea urchin embryo for positive regulation of ectoderm gene expression. In addition to its role in vertebrate cell growth and proliferation, our results indicate that CBF also functions at the early stages of germ layer formation, namely ectoderm differentiation.
DOE Office of Scientific and Technical Information (OSTI.GOV)
O'Brien, D.A.
1992-11-01
A putative transcription factor in Rhodobactor capsulatus which binds upstream of the crt and bch pigment biosynthesis operons and appears to play a role in the adaptation of the organism from the aerobic to the anaerobic-photosynthetic growth mode was characterized. Chapter 2 describes the identification of this factor through an in vitro mobility shift assay, as well as the determination of its binding properties and sequence specificity. Chapter 3 focuses on the isolation of this factor. Biochemistry of later carotenoid biosynthesis enzymes derived from the non-photosynthetic bacterium, Erwinia herbicola. Chapter 4 describes the separate overexpression and in vitro analysis ofmore » two enzymes involved in the main sequence of the carotenoid biosynthesis pathway, lycopene cyclase and 5-carotene hydroxylase. Chapter 5 examines the overexpression and enzymology of functionally active zeaxanthin glucosyltransferase, an enzyme which carries out a more unusual transformation, converting a carotenoid into its more hydrophilic mono- and diglucoside derivatives. In addition, amino acid homology with other glucosyltransferases suggests a putative binding site for the UDP-activated glucose substrate.« less
Integrating and mining the chromatin landscape of cell-type specificity using self-organizing maps.
Mortazavi, Ali; Pepke, Shirley; Jansen, Camden; Marinov, Georgi K; Ernst, Jason; Kellis, Manolis; Hardison, Ross C; Myers, Richard M; Wold, Barbara J
2013-12-01
We tested whether self-organizing maps (SOMs) could be used to effectively integrate, visualize, and mine diverse genomics data types, including complex chromatin signatures. A fine-grained SOM was trained on 72 ChIP-seq histone modifications and DNase-seq data sets from six biologically diverse cell lines studied by The ENCODE Project Consortium. We mined the resulting SOM to identify chromatin signatures related to sequence-specific transcription factor occupancy, sequence motif enrichment, and biological functions. To highlight clusters enriched for specific functions such as transcriptional promoters or enhancers, we overlaid onto the map additional data sets not used during training, such as ChIP-seq, RNA-seq, CAGE, and information on cis-acting regulatory modules from the literature. We used the SOM to parse known transcriptional enhancers according to the cell-type-specific chromatin signature, and we further corroborated this pattern on the map by EP300 (also known as p300) occupancy. New candidate cell-type-specific enhancers were identified for multiple ENCODE cell types in this way, along with new candidates for ubiquitous enhancer activity. An interactive web interface was developed to allow users to visualize and custom-mine the ENCODE SOM. We conclude that large SOMs trained on chromatin data from multiple cell types provide a powerful way to identify complex relationships in genomic data at user-selected levels of granularity.
Integrating and mining the chromatin landscape of cell-type specificity using self-organizing maps
Mortazavi, Ali; Pepke, Shirley; Jansen, Camden; Marinov, Georgi K.; Ernst, Jason; Kellis, Manolis; Hardison, Ross C.; Myers, Richard M.; Wold, Barbara J.
2013-01-01
We tested whether self-organizing maps (SOMs) could be used to effectively integrate, visualize, and mine diverse genomics data types, including complex chromatin signatures. A fine-grained SOM was trained on 72 ChIP-seq histone modifications and DNase-seq data sets from six biologically diverse cell lines studied by The ENCODE Project Consortium. We mined the resulting SOM to identify chromatin signatures related to sequence-specific transcription factor occupancy, sequence motif enrichment, and biological functions. To highlight clusters enriched for specific functions such as transcriptional promoters or enhancers, we overlaid onto the map additional data sets not used during training, such as ChIP-seq, RNA-seq, CAGE, and information on cis-acting regulatory modules from the literature. We used the SOM to parse known transcriptional enhancers according to the cell-type-specific chromatin signature, and we further corroborated this pattern on the map by EP300 (also known as p300) occupancy. New candidate cell-type-specific enhancers were identified for multiple ENCODE cell types in this way, along with new candidates for ubiquitous enhancer activity. An interactive web interface was developed to allow users to visualize and custom-mine the ENCODE SOM. We conclude that large SOMs trained on chromatin data from multiple cell types provide a powerful way to identify complex relationships in genomic data at user-selected levels of granularity. PMID:24170599
Wang, Lu; Mariño-Ramírez, Leonardo
2017-01-01
Abstract Transposable element (TE) derived sequences are known to contribute to the regulation of the human genome. The majority of known TE-derived regulatory sequences correspond to relatively ancient insertions, which are fixed across human populations. The extent to which human genetic variation caused by recent TE activity leads to regulatory polymorphisms among populations has yet to be thoroughly explored. In this study, we searched for associations between polymorphic TE (polyTE) loci and human gene expression levels using an expression quantitative trait loci (eQTL) approach. We compared locus-specific polyTE insertion genotypes to B cell gene expression levels among 445 individuals from 5 human populations. Numerous human polyTE loci correspond to both cis and trans eQTL, and their regulatory effects are directly related to cell type-specific function in the immune system. PolyTE loci are associated with differences in expression between European and African population groups, and a single polyTE loci is indirectly associated with the expression of numerous genes via the regulation of the B cell-specific transcription factor PAX5. The polyTE-gene expression associations we found indicate that human TE genetic variation can have important phenotypic consequences. Our results reveal that TE-eQTL are involved in population-specific gene regulation as well as transcriptional network modification. PMID:27998931
RNA binding specificity of Ebola virus transcription factor VP30.
Schlereth, Julia; Grünweller, Arnold; Biedenkopf, Nadine; Becker, Stephan; Hartmann, Roland K
2016-09-01
The transcription factor VP30 of the non-segmented RNA negative strand Ebola virus balances viral transcription and replication. Here, we comprehensively studied RNA binding by VP30. Using a novel VP30:RNA electrophoretic mobility shift assay, we tested truncated variants of 2 potential natural RNA substrates of VP30 - the genomic Ebola viral 3'-leader region and its complementary antigenomic counterpart (each ∼155 nt in length) - and a series of other non-viral RNAs. Based on oligonucleotide interference, the major VP30 binding region on the genomic 3'-leader substrate was assigned to the internal expanded single-stranded region (∼ nt 125-80). Best binding to VP30 was obtained with ssRNAs of optimally ∼ 40 nt and mixed base composition; underrepresentation of purines or pyrimidines was tolerated, but homopolymeric sequences impaired binding. A stem-loop structure, particularly at the 3'-end or positioned internally, supports stable binding to VP30. In contrast, dsRNA or RNAs exposing large internal loops flanked by entirely helical arms on both sides are not bound. Introduction of a 5´-Cap(0) structure impaired VP30 binding. Also, ssDNAs bind substantially weaker than isosequential ssRNAs and heparin competes with RNA for binding to VP30, indicating that ribose 2'-hydroxyls and electrostatic contacts of the phosphate groups contribute to the formation of VP30:RNA complexes. Our results indicate a rather relaxed RNA binding specificity of filoviral VP30, which largely differs from that of the functionally related transcription factor of the Paramyxoviridae which binds to ssRNAs as short as 13 nt with a preference for oligo(A) sequences.
Antisense transcription is pervasive but rarely conserved in enteric bacteria.
Raghavan, Rahul; Sloan, Daniel B; Ochman, Howard
2012-01-01
Noncoding RNAs, including antisense RNAs (asRNAs) that originate from the complementary strand of protein-coding genes, are involved in the regulation of gene expression in all domains of life. Recent application of deep-sequencing technologies has revealed that the transcription of asRNAs occurs genome-wide in bacteria. Although the role of the vast majority of asRNAs remains unknown, it is often assumed that their presence implies important regulatory functions, similar to those of other noncoding RNAs. Alternatively, many antisense transcripts may be produced by chance transcription events from promoter-like sequences that result from the degenerate nature of bacterial transcription factor binding sites. To investigate the biological relevance of antisense transcripts, we compared genome-wide patterns of asRNA expression in closely related enteric bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, by performing strand-specific transcriptome sequencing. Although antisense transcripts are abundant in both species, less than 3% of asRNAs are expressed at high levels in both species, and only about 14% appear to be conserved among species. And unlike the promoters of protein-coding genes, asRNA promoters show no evidence of sequence conservation between, or even within, species. Our findings suggest that many or even most bacterial asRNAs are nonadaptive by-products of the cell's transcription machinery. IMPORTANCE Application of high-throughput methods has revealed the expression throughout bacterial genomes of transcripts encoded on the strand complementary to protein-coding genes. Because transcription is costly, it is usually assumed that these transcripts, termed antisense RNAs (asRNAs), serve some function; however, the role of most asRNAs is unclear, raising questions about their relevance in cellular processes. Because natural selection conserves functional elements, comparisons between related species provide a method for assessing functionality genome-wide. Applying such an approach, we assayed all transcripts in two closely related bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, and demonstrate that, although the levels of genome-wide antisense transcription are similarly high in both bacteria, only a small fraction of asRNAs are shared across species. Moreover, the promoters associated with asRNAs show no evidence of sequence conservation between, or even within, species. These findings indicate that despite the genome-wide transcription of asRNAs, many of these transcripts are likely nonfunctional.
Kemme, Catherine A.; Marquez, Rolando; Luu, Ross H.
2017-01-01
Abstract Eukaryotic genomes contain numerous non-functional high-affinity sequences for transcription factors. These sequences potentially serve as natural decoys that sequester transcription factors. We have previously shown that the presence of sequences similar to the target sequence could substantially impede association of the transcription factor Egr-1 with its targets. In this study, using a stopped-flow fluorescence method, we examined the kinetic impact of DNA methylation of decoys on the search process of the Egr-1 zinc-finger protein. We analyzed its association with an unmethylated target site on fluorescence-labeled DNA in the presence of competitor DNA duplexes, including Egr-1 decoys. DNA methylation of decoys alone did not affect target search kinetics. In the presence of the MeCP2 methyl-CpG-binding domain (MBD), however, DNA methylation of decoys substantially (∼10-30-fold) accelerated the target search process of the Egr-1 zinc-finger protein. This acceleration did not occur when the target was also methylated. These results suggest that when decoys are methylated, MBD proteins can block them and thereby allow Egr-1 to avoid sequestration in non-functional locations. This effect may occur in vivo for DNA methylation outside CpG islands (CGIs) and could facilitate localization of some transcription factors within regulatory CGIs, where DNA methylation is rare. PMID:28486614
Bashir, Ali; Bansal, Vikas; Bafna, Vineet
2010-06-18
Massively parallel DNA sequencing technologies have enabled the sequencing of several individual human genomes. These technologies are also being used in novel ways for mRNA expression profiling, genome-wide discovery of transcription-factor binding sites, small RNA discovery, etc. The multitude of sequencing platforms, each with their unique characteristics, pose a number of design challenges, regarding the technology to be used and the depth of sequencing required for a particular sequencing application. Here we describe a number of analytical and empirical results to address design questions for two applications: detection of structural variations from paired-end sequencing and estimating mRNA transcript abundance. For structural variation, our results provide explicit trade-offs between the detection and resolution of rearrangement breakpoints, and the optimal mix of paired-read insert lengths. Specifically, we prove that optimal detection and resolution of breakpoints is achieved using a mix of exactly two insert library lengths. Furthermore, we derive explicit formulae to determine these insert length combinations, enabling a 15% improvement in breakpoint detection at the same experimental cost. On empirical short read data, these predictions show good concordance with Illumina 200 bp and 2 Kbp insert length libraries. For transcriptome sequencing, we determine the sequencing depth needed to detect rare transcripts from a small pilot study. With only 1 Million reads, we derive corrections that enable almost perfect prediction of the underlying expression probability distribution, and use this to predict the sequencing depth required to detect low expressed genes with greater than 95% probability. Together, our results form a generic framework for many design considerations related to high-throughput sequencing. We provide software tools http://bix.ucsd.edu/projects/NGS-DesignTools to derive platform independent guidelines for designing sequencing experiments (amount of sequencing, choice of insert length, mix of libraries) for novel applications of next generation sequencing.
Position-specific binding of FUS to nascent RNA regulates mRNA length
Masuda, Akio; Takeda, Jun-ichi; Okuno, Tatsuya; Okamoto, Takaaki; Ohkawara, Bisei; Ito, Mikako; Ishigaki, Shinsuke; Sobue, Gen
2015-01-01
More than half of all human genes produce prematurely terminated polyadenylated short mRNAs. However, the underlying mechanisms remain largely elusive. CLIP-seq (cross-linking immunoprecipitation [CLIP] combined with deep sequencing) of FUS (fused in sarcoma) in neuronal cells showed that FUS is frequently clustered around an alternative polyadenylation (APA) site of nascent RNA. ChIP-seq (chromatin immunoprecipitation [ChIP] combined with deep sequencing) of RNA polymerase II (RNAP II) demonstrated that FUS stalls RNAP II and prematurely terminates transcription. When an APA site is located upstream of an FUS cluster, FUS enhances polyadenylation by recruiting CPSF160 and up-regulates the alternative short transcript. In contrast, when an APA site is located downstream from an FUS cluster, polyadenylation is not activated, and the RNAP II-suppressing effect of FUS leads to down-regulation of the alternative short transcript. CAGE-seq (cap analysis of gene expression [CAGE] combined with deep sequencing) and PolyA-seq (a strand-specific and quantitative method for high-throughput sequencing of 3' ends of polyadenylated transcripts) revealed that position-specific regulation of mRNA lengths by FUS is operational in two-thirds of transcripts in neuronal cells, with enrichment in genes involved in synaptic activities. PMID:25995189
The transcription factor Nerfin-1 prevents reversion of neurons into neural stem cells.
Froldi, Francesca; Szuperak, Milan; Weng, Chen-Fang; Shi, Wei; Papenfuss, Anthony T; Cheng, Louise Y
2015-01-15
Cellular dedifferentiation is the regression of a cell from a specialized state to a more multipotent state and is implicated in cancer. However, the transcriptional network that prevents differentiated cells from reacquiring stem cell fate is so far unclear. Neuroblasts (NBs), the Drosophila neural stem cells, are a model for the regulation of stem cell self-renewal and differentiation. Here we show that the Drosophila zinc finger transcription factor Nervous fingers 1 (Nerfin-1) locks neurons into differentiation, preventing their reversion into NBs. Following Prospero-dependent neuronal specification in the ganglion mother cell (GMC), a Nerfin-1-specific transcriptional program maintains differentiation in the post-mitotic neurons. The loss of Nerfin-1 causes reversion to multipotency and results in tumors in several neural lineages. Both the onset and rate of neuronal dedifferentiation in nerfin-1 mutant lineages are dependent on Myc- and target of rapamycin (Tor)-mediated cellular growth. In addition, Nerfin-1 is required for NB differentiation at the end of neurogenesis. RNA sequencing (RNA-seq) and chromatin immunoprecipitation (ChIP) analysis show that Nerfin-1 administers its function by repression of self-renewing-specific and activation of differentiation-specific genes. Our findings support the model of bidirectional interconvertibility between neural stem cells and their post-mitotic progeny and highlight the importance of the Nerfin-1-regulated transcriptional program in neuronal maintenance. © 2015 Froldi et al.; Published by Cold Spring Harbor Laboratory Press.
Fisher, R P; Topper, J N; Clayton, D A
1987-07-17
Selective transcription of human mitochondrial DNA requires a transcription factor (mtTF) in addition to an essentially nonselective RNA polymerase. Partially purified mtTF is able to sequester promoter-containing DNA in preinitiation complexes in the absence of mitochondrial RNA polymerase, suggesting a DNA-binding mechanism for factor activity. Functional domains, required for positive transcriptional regulation by mtTF, are identified within both major promoters of human mtDNA through transcription of mutant promoter templates in a reconstituted in vitro system. These domains are essentially coextensive with DNA sequences protected from nuclease digestion by mtTF-binding. Comparison of the sequences of the two mtTF-responsive elements reveals significant homology only when one sequence is inverted; the binding sites are in opposite orientations with respect to the predominant direction of transcription. Thus mtTF may function bidirectionally, requiring additional protein-DNA interactions to dictate transcriptional polarity. The mtTF-responsive elements are arrayed as direct repeats, separated by approximately 80 bp within the displacement-loop region of human mitochondrial DNA; this arrangement may reflect duplication of an ancestral bidirectional promoter, giving rise to separate, unidirectional promoters for each strand.
Identification and embryonic expression of a new AP-2 transcription factor, AP-2 epsilon.
Wang, Hao-Ven; Vaupel, Kristina; Buettner, Reinhard; Bosserhoff, Anja-Katrin; Moser, Markus
2004-09-01
AP-2 proteins comprise a family of highly related transcription factors, which are expressed during mouse embryogenesis in a variety of ectodermal, neuroectodermal, and mesenchymal tissues. AP-2 transcription factors were shown to be involved in morphogenesis of craniofacial, urogenital, neural crest-derived, and placental tissues. By means of a partial cDNA fragment identified during an expressed sequence tag search for AP-2 genes, we identified a fifth, previously unknown AP-2-related gene, AP-2 epsilon. AP-2 epsilon encodes an open reading frame of 434 amino acids, which reveals the typical modular structure of AP-2 transcription factors with highly conserved C-terminal DNA binding and dimerization domains. Although the N-terminally localized activation domain is less homologous, position and identity of amino acids essential for transcriptional transactivation are conserved. Reverse transcriptase-polymerase chain reaction analyses of murine embryos revealed AP-2 epsilon expression from gestational stage embryonic day 7.5 throughout all later embryonic stages until birth. Whole-mount in situ hybridization using a specific AP-2 epsilon cDNA fragment demonstrated that during embryogenesis, expression of AP-2 epsilon is mainly restricted to neural tissue, especially the midbrain, hindbrain, and olfactory bulb. This expression pattern was confirmed by immunohistochemistry with an AP-2 epsilon-specific antiserum. By using this antiserum, we could further localize AP-2 epsilon expression in a hypothalamic nucleus and the neuroepithelium of the vomeronasal organ, suggesting an important function of AP-2 epsilon for the development of the olfactory system.
Sex-specific differences in transcriptome profiles of brain and muscle tissue of the tropical gar.
Cribbin, Kayla M; Quackenbush, Corey R; Taylor, Kyle; Arias-Rodriguez, Lenin; Kelley, Joanna L
2017-04-07
The tropical gar (Atractosteus tropicus) is the southernmost species of the seven extant species of gar fishes in the world. In Mexico and Central America, the species is an important food source due to its nutritional quality and low price. Despite its regional importance and increasing concerns about overexploitation and habitat degradation, basic genetic information on the tropical gar is lacking. Determining genetic information on the tropical gar is important for the sustainable management of wild populations, implementation of best practices in aquaculture settings, evolutionary studies of ancient lineages, and an understanding of sex-specific gene expression. In this study, the transcriptome of the tropical gar was sequenced and assembled de novo using tissues from three males and three females using Illumina sequencing technology. Sex-specific and highly differentially expressed transcripts in brain and muscle tissues between adult males and females were subsequently identified. The transcriptome was assembled de novo resulting in 80,611 transcripts with a contig N50 of 3,355 base pairs and over 168 kilobases in total length. Male muscle, brain, and gonad as well as female muscle and brain were included in the assembly. The assembled transcriptome was annotated to identify the putative function of expressed transcripts using Trinotate and SwissProt, a database of well-annotated proteins. The brain and muscle datasets were then aligned to the assembled transcriptome to identify transcripts that were differentially expressed between males and females. The contrast between male and female brain identified 109 transcripts from 106 genes that were significantly differentially expressed. In the muscle comparison, 82 transcripts from 80 genes were identified with evidence for significant differential expression. Almost all genes identified as differentially expressed were sex-specific. The differentially expressed transcripts were enriched for genes involved in cellular functioning, signaling, immune response, and tissue-specific functions. This study identified differentially expressed transcripts between male and female gar in muscle and brain tissue. The majority of differentially expressed transcripts had sex-specific expression. Expanding on these findings to other developmental stages, populations, and species may lead to the identification of genetic factors contributing to the skewed sex ratio seen in the tropical gar and of sex-specific differences in expression in other species. Finally, the transcriptome assembly will open future research avenues on tropical gar development, cell function, environmental resistance, and evolution in the context of other early vertebrates.
Lewis type 1 antigen synthase (beta3Gal-T5) is transcriptionally regulated by homeoproteins.
Isshiki, Soichiro; Kudo, Takashi; Nishihara, Shoko; Ikehara, Yuzuru; Togayachi, Akira; Furuya, Akiko; Shitara, Kenya; Kubota, Tetsuro; Watanabe, Masahiko; Kitajima, Masaki; Narimatsu, Hisashi
2003-09-19
The type 1 carbohydrate chain, Galbeta1-3GlcNAc, is synthesized by UDP-galactose:beta-N-acetylglucosamine beta1,3-galactosyltransferase (beta3Gal-T). Among six beta3Gal-Ts cloned to date, beta3Gal-T5 is an essential enzyme for the synthesis of type 1 chain in epithelium of digestive tracts or pancreatic tissue. It forms the type 1 structure on glycoproteins produced from such tissues. In the present study, we found that the transcriptional regulation of the beta3Gal-T5 gene is controlled by homeoproteins, i.e. members of caudal-related homeobox protein (Cdx) and hepatocyte nuclear factor (HNF) families. We found an important region (-151 to -121 from the transcription initiation site), named the beta3Gal-T5 control element (GCE), for the promoter activity. GCE contained the consensus sequences for members of the Cdx and HNF families. Mutations introduced into this sequence abolished the transcriptional activity. Four factors, Cdx1, Cdx2, HNF1alpha, and HNF1beta, could bind to GCE and transcriptionally activate the beta3Gal-T5 gene. Transcriptional regulation of the beta3Gal-T5 gene was consistent with that of members of the Cdx and HNF1 families in two in vivo systems. 1) During in vitro differentiation of Caco-2 cells, transcriptional up-regulation of beta3Gal-T5 was observed in correlation with the increase in transcripts for Cdx2 and HNF1alpha. 2) Both transcript and protein levels of beta3Gal-T5 were determined to be significantly reduced in colon cancer. This down-regulation was correlated with the decrease of Cdx1 and HNF1beta expression in cancer tissue. This is the first finding that a glycosyltransferase gene is transcriptionally regulated under the control of homeoproteins in a tissue-specific manner. beta3Gal-T5, controlled by the intestinal homeoproteins, may play an important role in the specific function of intestinal cells by modifying the carbohydrate structure of glycoproteins.
Structural Basis for Sequence-specific DNA Recognition by an Arabidopsis WRKY Transcription Factor*
Yamasaki, Kazuhiko; Kigawa, Takanori; Watanabe, Satoru; Inoue, Makoto; Yamasaki, Tomoko; Seki, Motoaki; Shinozaki, Kazuo; Yokoyama, Shigeyuki
2012-01-01
The WRKY family transcription factors regulate plant-specific reactions that are mostly related to biotic and abiotic stresses. They share the WRKY domain, which recognizes a DNA element (TTGAC(C/T)) termed the W-box, in target genes. Here, we determined the solution structure of the C-terminal WRKY domain of Arabidopsis WRKY4 in complex with the W-box DNA by NMR. A four-stranded β-sheet enters the major groove of DNA in an atypical mode termed the β-wedge, where the sheet is nearly perpendicular to the DNA helical axis. Residues in the conserved WRKYGQK motif contact DNA bases mainly through extensive apolar contacts with thymine methyl groups. The importance of these contacts was verified by substituting the relevant T bases with U and by surface plasmon resonance analyses of DNA binding. PMID:22219184
CRX ChIP-seq reveals the cis-regulatory architecture of mouse photoreceptors
Corbo, Joseph C.; Lawrence, Karen A.; Karlstetter, Marcus; Myers, Connie A.; Abdelaziz, Musa; Dirkes, William; Weigelt, Karin; Seifert, Martin; Benes, Vladimir; Fritsche, Lars G.; Weber, Bernhard H.F.; Langmann, Thomas
2010-01-01
Approximately 98% of mammalian DNA is noncoding, yet we understand relatively little about the function of this enigmatic portion of the genome. The cis-regulatory elements that control gene expression reside in noncoding regions and can be identified by mapping the binding sites of tissue-specific transcription factors. Cone-rod homeobox (CRX) is a key transcription factor in photoreceptor differentiation and survival, but its in vivo targets are largely unknown. Here, we used chromatin immunoprecipitation with massively parallel sequencing (ChIP-seq) on CRX to identify thousands of cis-regulatory regions around photoreceptor genes in adult mouse retina. CRX directly regulates downstream photoreceptor transcription factors and their target genes via a network of spatially distributed regulatory elements around each locus. CRX-bound regions act in a synergistic fashion to activate transcription and contain multiple CRX binding sites which interact in a spacing- and orientation-dependent manner to fine-tune transcript levels. CRX ChIP-seq was also performed on Nrl−/− retinas, which represent an enriched source of cone photoreceptors. Comparison with the wild-type ChIP-seq data set identified numerous rod- and cone-specific CRX-bound regions as well as many shared elements. Thus, CRX combinatorially orchestrates the transcriptional networks of both rods and cones by coordinating the expression of photoreceptor genes including most retinal disease genes. In addition, this study pinpoints thousands of noncoding regions of relevance to both Mendelian and complex retinal disease. PMID:20693478
Cis-regulatory landscapes of four cell types of the retina.
Hartl, Dominik; Krebs, Arnaud R; Jüttner, Josephine; Roska, Botond; Schübeler, Dirk
2017-11-16
The retina is composed of ∼50 cell-types with specific functions for the process of vision. Identification of the cis-regulatory elements active in retinal cell-types is key to elucidate the networks controlling this diversity. Here, we combined transcriptome and epigenome profiling to map the regulatory landscape of four cell-types isolated from mouse retinas including rod and cone photoreceptors as well as rare inter-neuron populations such as horizontal and starburst amacrine cells. Integration of this information reveals sequence determinants and candidate transcription factors for controlling cellular specialization. Additionally, we refined parallel reporter assays to enable studying the transcriptional activity of large collection of sequences in individual cell-types isolated from a tissue. We provide proof of concept for this approach and its scalability by characterizing the transcriptional capacity of several hundred putative regulatory sequences within individual retinal cell-types. This generates a catalogue of cis-regulatory regions active in retinal cell types and we further demonstrate their utility as potential resource for cellular tagging and manipulation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
NASA Technical Reports Server (NTRS)
Marsh, T. L.; Reich, C. I.; Whitelock, R. B.; Olsen, G. J.; Woese, C. R. (Principal Investigator)
1994-01-01
The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archael and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer) it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archael protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archael protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.
Lee, Sooncheol; Nguyen, Huong Minh; Kang, Changwon
2010-10-01
No biological function has been identified for tiny RNA transcripts that are abortively and repetitiously released from initiation complexes of RNA polymerase in vitro and in vivo to date. In this study, we show that abortive initiation affects termination in transcription of bacteriophage T7 gene 10. Specifically, abortive transcripts produced from promoter phi 10 exert trans-acting antitermination activity on terminator T phi both in vitro and in vivo. Following abortive initiation cycling of T7 RNA polymerase at phi 10, short G-rich and oligo(G) RNAs were produced and both specifically sequestered 5- and 6-nt C + U stretch sequences, consequently interfering with terminator hairpin formation. This antitermination activity depended on sequence-specific hybridization of abortive transcripts with the 5' but not 3' half of T phi RNA. Antitermination was abolished when T phi was mutated to lack a C + U stretch, but restored when abortive transcript sequence was additionally modified to complement the mutation in T phi, both in vitro and in vivo. Antitermination was enhanced in vivo when the abortive transcript concentration was increased via overproduction of RNA polymerase or ribonuclease deficiency. Accordingly, antitermination activity exerted on T phi by abortive transcripts should facilitate expression of T phi-downstream promoter-less genes 11 and 12 in T7 infection of Escherichia coli.
Chen, Chih-Yu; Shi, Wenqiang; Balaton, Bradley P; Matthews, Allison M; Li, Yifeng; Arenillas, David J; Mathelier, Anthony; Itoh, Masayoshi; Kawaji, Hideya; Lassmann, Timo; Hayashizaki, Yoshihide; Carninci, Piero; Forrest, Alistair R R; Brown, Carolyn J; Wasserman, Wyeth W
2016-11-18
Sex differences in susceptibility and progression have been reported in numerous diseases. Female cells have two copies of the X chromosome with X-chromosome inactivation imparting mono-allelic gene silencing for dosage compensation. However, a subset of genes, named escapees, escape silencing and are transcribed bi-allelically resulting in sexual dimorphism. Here we conducted in silico analyses of the sexes using human datasets to gain perspectives into such regulation. We identified transcription start sites of escapees (escTSSs) based on higher transcription levels in female cells using FANTOM5 CAGE data. Significant over-representations of YY1 transcription factor binding motif and ChIP-seq peaks around escTSSs highlighted its positive association with escapees. Furthermore, YY1 occupancy is significantly biased towards the inactive X (Xi) at long non-coding RNA loci that are frequent contacts of Xi-specific superloops. Our study suggests a role for YY1 in transcriptional activity on Xi in general through sequence-specific binding, and its involvement at superloop anchors.
Chen, Chih-yu; Shi, Wenqiang; Balaton, Bradley P.; Matthews, Allison M.; Li, Yifeng; Arenillas, David J.; Mathelier, Anthony; Itoh, Masayoshi; Kawaji, Hideya; Lassmann, Timo; Hayashizaki, Yoshihide; Carninci, Piero; Forrest, Alistair R. R.; Brown, Carolyn J.; Wasserman, Wyeth W.
2016-01-01
Sex differences in susceptibility and progression have been reported in numerous diseases. Female cells have two copies of the X chromosome with X-chromosome inactivation imparting mono-allelic gene silencing for dosage compensation. However, a subset of genes, named escapees, escape silencing and are transcribed bi-allelically resulting in sexual dimorphism. Here we conducted in silico analyses of the sexes using human datasets to gain perspectives into such regulation. We identified transcription start sites of escapees (escTSSs) based on higher transcription levels in female cells using FANTOM5 CAGE data. Significant over-representations of YY1 transcription factor binding motif and ChIP-seq peaks around escTSSs highlighted its positive association with escapees. Furthermore, YY1 occupancy is significantly biased towards the inactive X (Xi) at long non-coding RNA loci that are frequent contacts of Xi-specific superloops. Our study suggests a role for YY1 in transcriptional activity on Xi in general through sequence-specific binding, and its involvement at superloop anchors. PMID:27857184
Expression regulation by a methyl-CpG binding domain in an E. coli based, cell-free TX-TL system
NASA Astrophysics Data System (ADS)
Schenkelberger, M.; Shanak, S.; Finkler, M.; Worst, E. G.; Noireaux, V.; Helms, V.; Ott, A.
2017-04-01
Cytosine methylation plays an important role in the epigenetic regulation of eukaryotic gene expression. The methyl-CpG binding domain (MBD) is common to a family of eukaryotic transcriptional regulators. How MBD, a stretch of about 80 amino acids, recognizes CpGs in a methylation dependent manner, and as a function of sequence, is only partly understood. Here we show, using an Escherichia coli cell-free expression system, that MBD from the human transcriptional regulator MeCP2 performs as a specific, methylation-dependent repressor in conjunction with the BDNF (brain-derived neurotrophic factor) promoter sequence. Mutation of either base flanking the central CpG pair changes the expression level of the target gene. However, the relative degree of repression as a function of MBD concentration remains unaltered. Molecular dynamics simulations that address the DNA B fiber ratio and the handedness reveal cooperative transitions in the promoter DNA upon MBD binding that correlate well with our experimental observations. We suggest that not only steric hindrance, but also conformational changes of the BDNF promoter as a result of MBD binding are required for MBD to act as a specific inhibitory element. Our work demonstrates that the prokaryotic transcription machinery can reproduce features of epigenetic mammalian transcriptional regulatory elements.
Purification and characterization of human mitochondrial transcription factor 1.
Fisher, R P; Clayton, D A
1988-01-01
We purified to near homogeneity a transcription factor from human KB cell mitochondria. This factor, designated mitochondrial transcription factor 1 (mtTF1), is required for the in vitro recognition of both major promoters of human mitochondrial DNA by the homologous mitochondrial RNA polymerase. Furthermore, it has been shown to bind upstream regulatory elements of the two major promoters. After separation from RNA polymerase by phosphocellulose chromatography, mtTF1 was chromatographed on a MonoQ anion-exchange fast-performance liquid chromatography column. Analysis of mtTF1-containing fractions by sodium dodecyl sulfate-polyacrylamide gel electrophoresis revealed a single major polypeptide with an Mr of approximately 25,000. Centrifugation in analytical glycerol gradients indicated a sedimentation coefficient of approximately 2.5 S, consistent with a monomeric 25-kilodalton protein. Finally, when the 25-kilodalton polypeptide was excised from a stained sodium dodecyl sulfate-polyacrylamide gel and allowed to renature, it regained DNA-binding and transcriptional stimulatory activities at both promoters. Although mtTF1 is the only mitochondrial DNA-binding transcription factor to be purified and characterized, its properties, such as a high affinity for random DNA and a weak specificity for one of its target sequences, may typify this class of regulatory proteins. Images PMID:3211148
A long natural-antisense RNA is accumulated in the conidia of Aspergillus oryzae.
Tsujii, Masaru; Okuda, Satoshi; Ishi, Kazutomo; Madokoro, Kana; Takeuchi, Michio; Yamagata, Youhei
2016-01-01
Analysis of expressed sequence tag libraries from various culture conditions revealed the existence of conidia-specific transcripts assembled to putative conidiation-specific reductase gene (csrA) in Aspergillus oryzae. However, the all transcripts were transcribed with opposite direction to the gene csrA. The sequence analysis of the transcript revealed that the RNA overlapped mRNA of csrA with 3'-end, and did not code protein longer than 60 amino acid residues. We designated the transcript Conidia Specific Long Natural-antisense RNA (CSLNR). The real-time PCR analysis demonstrated that the CSLNR is conidia-specific transcript, which cannot be transcribed in the absence of brlA, and the amount of CSLNR was much more than that of the transcript from csrA in conidia. Furthermore, the csrA deletion, also lacking coding region of CSLNR in A. oryzae reduced the number of conidia. Overexpression of CsrA demonstrated the inhibition of growth and conidiation, while CSLNR did not affect conidiation.
Sela, Dotan; Chen, Lu; Martin-Brown, Skylar; Washburn, Michael P; Florens, Laurence; Conaway, Joan Weliky; Conaway, Ronald C
2012-06-29
The basic leucine zipper transcription factor ATF6α functions as a master regulator of endoplasmic reticulum (ER) stress response genes. Previous studies have established that, in response to ER stress, ATF6α translocates to the nucleus and activates transcription of ER stress response genes upon binding sequence specifically to ER stress response enhancer elements in their promoters. In this study, we investigate the biochemical mechanism by which ATF6α activates transcription. By exploiting a combination of biochemical and multidimensional protein identification technology-based mass spectrometry approaches, we have obtained evidence that ATF6α functions at least in part by recruiting to the ER stress response enhancer elements of ER stress response genes a collection of RNA polymerase II coregulatory complexes, including the Mediator and multiple histone acetyltransferase complexes, among which are the Spt-Ada-Gcn5 acetyltransferase (SAGA) and Ada-Two-A-containing (ATAC) complexes. Our findings shed new light on the mechanism of action of ATF6α, and they outline a straightforward strategy for applying multidimensional protein identification technology mass spectrometry to determine which RNA polymerase II transcription factors and coregulators are recruited to promoters and other regulatory elements to control transcription.
Shkolnik, Doron; Bar-Zvi, Dudy
2008-05-01
The manipulation of transacting factors is commonly used to achieve a wide change in the expression of a large number of genes in transgenic plants as a result of a change in the expression of a single gene product. This is mostly achieved by the overexpression of transactivator or repressor proteins. In this study, it is demonstrated that the overexpression of an exogenous DNA-binding protein can be used to compete with the expression of an endogenous transcription factor sharing the same DNA-binding sequence. Arabidopsis was transformed with cDNA encoding tomato abscisic acid stress ripening 1 (ASR1), a sequence-specific DNA protein that has no orthologues in the Arabidopsis genome. ASR1-overexpressing (ASR1-OE) plants display an abscisic acid-insensitive 4 (abi4) phenotype: seed germination is not sensitive to inhibition by abscisic acid (ABA), glucose, NaCl and paclobutrazol. ASR1 binds coupling element 1 (CE1), a cis-acting element bound by the ABI4 transcription factor, located in the ABI4-regulated promoters, including that of the ABI4 gene. Chromatin immunoprecipitation demonstrates that ASR1 is bound in vivo to the promoter of the ABI4 gene in ASR1-OE plants, but not to promoters of genes known to be regulated by the transcription factors ABI3 or ABI5. Real-time polymerase chain reaction (PCR) analysis confirmed that the expression of ABI4 and ABI4-regulated genes is markedly reduced in ASR1-OE plants. Therefore, it is concluded that the abi4 phenotype of ASR1-OE plants is the result of competition between the foreign ASR1 and the endogenous ABI4 on specific promoter DNA sequences. The biotechnological advantage of using this approach in crop plants from the Brassicaceae family to reduce the transactivation activity of ABI4 is discussed.
Wood, David L. A.; Nones, Katia; Steptoe, Anita; Christ, Angelika; Harliwong, Ivon; Newell, Felicity; Bruxner, Timothy J. C.; Miller, David; Cloonan, Nicole; Grimmond, Sean M.
2015-01-01
Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual’s phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identifying such factors. Massively parallel DNA and RNA sequencing and advances in bioinformatic methodologies provide an outstanding opportunity to measure ASE genome-wide. In this study, matched DNA and RNA sequencing, genotyping arrays and computationally phased haplotypes were integrated to comprehensively and conservatively quantify ASE in a single human brain and liver tissue sample. We describe a methodological evaluation and assessment of common bioinformatic steps for ASE quantification, and recommend a robust approach to accurately measure SNP, gene and isoform ASE through the use of personalized haplotype genome alignment, strict alignment quality control and intragenic SNP aggregation. Our results indicate that accurate ASE quantification requires careful bioinformatic analyses and is adversely affected by sample specific alignment confounders and random sampling even at moderate sequence depths. We identified multiple known and several novel ASE genes in liver, including WDR72, DSP and UBD, as well as genes that contained ASE SNPs with imbalance direction discordant with haplotype phase, explainable by annotated transcript structure, suggesting isoform derived ASE. The methods evaluated in this study will be of use to researchers performing highly conservative quantification of ASE, and the genes and isoforms identified as ASE of interest to researchers studying those loci. PMID:25965996
Two short segments of Smad3 are important for specific interaction of Smad3 with c-Ski and SnoN.
Mizuide, Masafumi; Hara, Takane; Furuya, Toshio; Takeda, Masafumi; Kusanagi, Kiyoshi; Inada, Yuri; Mori, Masatomo; Imamura, Takeshi; Miyazawa, Keiji; Miyazono, Kohei
2003-01-03
c-Ski and SnoN are transcriptional co-repressors that inhibit transforming growth factor-beta signaling through interaction with Smad proteins. Among receptor-regulated Smads, c-Ski and SnoN bind more strongly to Smad2 and Smad3 than to Smad1. Here, we show that c-Ski and SnoN bind to the "SE" sequence in the C-terminal MH2 domain of Smad3, which is exposed on the N-terminal upper side of the toroidal structure of the MH2 oligomer. The "QPSMT" sequence, located in the vicinity of SE, supports the interaction with c-Ski and SnoN. Sequences similar to SE and QPSMT are found in Smad2, but not in Smad1. The N-terminal MH1 domain and linker region of Smad3 protrude from the N-terminal upper side of the MH2 oligomer toroid. Smurf2 induces ubiquitin-dependent degradation of SnoN, since it appears to be located close to SnoN through binding to the linker region of Smad2. In contrast, transcription factors Mixer and FoxH3 (FAST1) bind to the bottom side of the Smad3 MH2 toroid; therefore, c-Ski does not affect the interaction of Smads with these transcription factors. Our findings thus demonstrate the stoichiometry of how multiple molecules can associate with the Smad oligomers and how the Smad-interacting proteins functionally interact with each other.
The sheep growth hormone gene polymorphism and its effects on milk traits.
Dettori, Maria Luisa; Pazzola, Michele; Pira, Emanuela; Paschino, Pietro; Vacca, Giuseppe Massimo
2015-05-01
Growth hormone (GH) is encoded by the GH gene, which may be single copy or duplicate in sheep. The two copies of the sheep GH gene (GH1/GH2-N and GH2-Z) were entirely sequenced in one 106 ewes of Sarda breed, in order to highlight sequence polymorphisms and investigate possible association between genetic variants and milk traits. Milk traits included milk yield, fat, protein, casein and lactose percentage. We evidenced 75 nucleotide changes. Transcription factor binding site prediction revealed two sequences potentially recognised by the pituitary-specific transcription factor POU1FI at the GH1/GH2-N gene, which were lost at the promoter of GH2-Z, which might explain the different tissues of expression of GH1/GH2-N (pituitary) and GH2-Z (placenta). Significant differences in milk traits were observed among genotypes at polymorphic loci only for the GH2-Z gene. Sheep with homozygote genotype ss748770547 CC had higher fat percentage (P < 0.01) than TT. SNP ss748770547 was part of a potential transcription factor binding site for C/EBP alpha (CCAAT/Enhancer Binding Protein), which is involved in the regulation of adipogenesis and adipoblast differentiation. SNP ss748770547, located in the GH2-Z gene 5' flanking region, may be a causal mutation affecting milk fat content. These findings might contribute to the knowledge of the sheep GH locus and might be useful in selection processes in sheep.
Large-scale collection of full-length cDNA and transcriptome analysis in Hevea brasiliensis.
Makita, Yuko; Ng, Kiaw Kiaw; Veera Singham, G; Kawashima, Mika; Hirakawa, Hideki; Sato, Shusei; Othman, Ahmad Sofiman; Matsui, Minami
2017-04-01
Natural rubber has unique physical properties that cannot be replaced by products from other latex-producing plants or petrochemically produced synthetic rubbers. Rubber from Hevea brasiliensis is the main commercial source for this natural rubber that has a cis-polyisoprene configuration. For sustainable production of enough rubber to meet demand elucidation of the molecular mechanisms involved in the production of latex is vital. To this end, we firstly constructed rubber full-length cDNA libraries of RRIM 600 cultivar and sequenced around 20,000 clones by the Sanger method and over 15,000 contigs by Illumina sequencer. With these data, we updated around 5,500 gene structures and newly annotated around 9,500 transcription start sites. Second, to elucidate the rubber biosynthetic pathways and their transcriptional regulation, we carried out tissue- and cultivar-specific RNA-Seq analysis. By using our recently published genome sequence, we confirmed the expression patterns of the rubber biosynthetic genes. Our data suggest that the cytoplasmic mevalonate (MVA) pathway is the main route for isoprenoid biosynthesis in latex production. In addition to the well-studied polymerization factors, we suggest that rubber elongation factor 8 (REF8) is a candidate factor in cis-polyisoprene biosynthesis. We have also identified 39 transcription factors that may be key regulators in latex production. Expression profile analysis using two additional cultivars, RRIM 901 and PB 350, via an RNA-Seq approach revealed possible expression differences between a high latex-yielding cultivar and a disease-resistant cultivar. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Transcription Factor Map Alignment of Promoter Regions
Blanco, Enrique; Messeguer, Xavier; Smith, Temple F; Guigó, Roderic
2006-01-01
We address the problem of comparing and characterizing the promoter regions of genes with similar expression patterns. This remains a challenging problem in sequence analysis, because often the promoter regions of co-expressed genes do not show discernible sequence conservation. In our approach, thus, we have not directly compared the nucleotide sequence of promoters. Instead, we have obtained predictions of transcription factor binding sites, annotated the predicted sites with the labels of the corresponding binding factors, and aligned the resulting sequences of labels—to which we refer here as transcription factor maps (TF-maps). To obtain the global pairwise alignment of two TF-maps, we have adapted an algorithm initially developed to align restriction enzyme maps. We have optimized the parameters of the algorithm in a small, but well-curated, collection of human–mouse orthologous gene pairs. Results in this dataset, as well as in an independent much larger dataset from the CISRED database, indicate that TF-map alignments are able to uncover conserved regulatory elements, which cannot be detected by the typical sequence alignments. PMID:16733547
RNAi triggered by symmetrically transcribed transgenes in Drosophila melanogaster.
Giordano, Ennio; Rendina, Rosaria; Peluso, Ivana; Furia, Maria
2002-01-01
Specific silencing of target genes can be induced in a variety of organisms by providing homologous double-stranded RNA molecules. In vivo, these molecules can be generated either by transcription of sequences having an inverted-repeat (IR) configuration or by simultaneous transcription of sense-antisense strands. Since IR constructs are difficult to prepare and can stimulate genomic rearrangements, we investigated the silencing potential of symmetrically transcribed sequences. We report that Drosophila transgenes whose sense-antisense transcription was driven by two convergent arrays of Gal4-dependent UAS sequences can induce specific, dominant, and heritable repression of target genes. This effect is not dependent on a mechanism based on homology-dependent DNA/DNA interactions, but is directly triggered by transcriptional activation and is accompanied by specific depletion of the endogenous target RNA. Tissue-specific induction of these transgenes restricts the target gene silencing to selected body domains, and spreading phenomena described in other cases of post-transcriptional gene silencing (PTGS) were not observed. In addition to providing an additional tool useful for Drosophila functional genomic analysis, these results add further strength to the view that events of sense-antisense transcription may readily account for some, if not all, PTGS-cosuppression phenomena and can potentially play a relevant role in gene regulation. PMID:11861567
Link, Gerhard
1984-01-01
A nuclease-treated plastid extract from mustard (Sinapis alba L.) allows efficient transcription of cloned plastid DNA templates. In this in vitro system, the major runoff transcript of the truncated gene for the 32 000 mol. wt. photosystem II protein was accurately initiated from a site close to or identical with the in vivo start site. By using plasmids with deletions in the 5'-flanking region of this gene as templates, a DNA region required for efficient and selective initiation was detected ˜28-35 nucleotides upstream of the transcription start site. This region contains the sequence element TTGACA, which matches the consensus sequence for prokaryotic `−35' promoter elements. In the absence of this region, a region ˜13-27 nucleotides upstream of the start site still enables a basic level of specific transcription. This second region contains the sequence element TATATAA, which matches the consensus sequence for the `TATA' box of genes transcribed by RNA polymerase II (or B). The region between the `TATA'-like element and the transcription start site is not sufficient but may be required for specific transcription of the plastid gene. This latter region contains the sequence element TATACT, which resembles the prokaryotic `−10' (Pribnow) box. Based on the structural and transcriptional features of the 5' upstream region, a `promoter switch' mechanism is proposed, which may account for the developmentally regulated expression of this plastid gene. ImagesFig. 1.Fig. 2.Fig. 3.Fig. 4.Figure 5. PMID:16453540
Warfield, Linda; Tuttle, Lisa M; Pacheco, Derek; Klevit, Rachel E; Hahn, Steven
2014-08-26
Although many transcription activators contact the same set of coactivator complexes, the mechanism and specificity of these interactions have been unclear. For example, do intrinsically disordered transcription activation domains (ADs) use sequence-specific motifs, or do ADs of seemingly different sequence have common properties that encode activation function? We find that the central activation domain (cAD) of the yeast activator Gcn4 functions through a short, conserved sequence-specific motif. Optimizing the residues surrounding this short motif by inserting additional hydrophobic residues creates very powerful ADs that bind the Mediator subunit Gal11/Med15 with high affinity via a "fuzzy" protein interface. In contrast to Gcn4, the activity of these synthetic ADs is not strongly dependent on any one residue of the AD, and this redundancy is similar to that of some natural ADs in which few if any sequence-specific residues have been identified. The additional hydrophobic residues in the synthetic ADs likely allow multiple faces of the AD helix to interact with the Gal11 activator-binding domain, effectively forming a fuzzier interface than that of the wild-type cAD.
The evolution of WRKY transcription factors.
Rinerson, Charles I; Rabara, Roel C; Tripathi, Prateek; Shen, Qingxi J; Rushton, Paul J
2015-02-27
The availability of increasing numbers of sequenced genomes has necessitated a re-evaluation of the evolution of the WRKY transcription factor family. Modern day plants descended from a charophyte green alga that colonized the land between 430 and 470 million years ago. The first charophyte genome sequence from Klebsormidium flaccidum filled a gap in the available genome sequences in the plant kingdom between unicellular green algae that typically have 1-3 WRKY genes and mosses that contain 30-40. WRKY genes have been previously found in non-plant species but their occurrence has been difficult to explain. Only two WRKY genes are present in the Klebsormidium flaccidum genome and the presence of a Group IIb gene was unexpected because it had previously been thought that Group IIb WRKY genes first appeared in mosses. We found WRKY transcription factor genes outside of the plant lineage in some diplomonads, social amoebae, fungi incertae sedis, and amoebozoa. This patchy distribution suggests that lateral gene transfer is responsible. These lateral gene transfer events appear to pre-date the formation of the WRKY groups in flowering plants. Flowering plants contain proteins with domains typical for both resistance (R) proteins and WRKY transcription factors. R protein-WRKY genes have evolved numerous times in flowering plants, each type being restricted to specific flowering plant lineages. These chimeric proteins contain not only novel combinations of protein domains but also novel combinations and numbers of WRKY domains. Once formed, R protein WRKY genes may combine different components of signalling pathways that may either create new diversity in signalling or accelerate signalling by short circuiting signalling pathways. We propose that the evolution of WRKY transcription factors includes early lateral gene transfers to non-plant organisms and the occurrence of algal WRKY genes that have no counterparts in flowering plants. We propose two alternative hypotheses of WRKY gene evolution: The "Group I Hypothesis" sees all WRKY genes evolving from Group I C-terminal WRKY domains. The alternative "IIa + b Separate Hypothesis" sees Groups IIa and IIb evolving directly from a single domain algal gene separate from the Group I-derived lineage.
Kandpal, Raj P; Rajasimha, Harsha K; Brooks, Matthew J; Nellissery, Jacob; Wan, Jun; Qian, Jiang; Kern, Timothy S; Swaroop, Anand
2012-01-01
To define gene expression changes associated with diabetic retinopathy in a mouse model using next generation sequencing, and to utilize transcriptome signatures to assess molecular pathways by which pharmacological agents inhibit diabetic retinopathy. We applied a high throughput RNA sequencing (RNA-seq) strategy using Illumina GAIIx to characterize the entire retinal transcriptome from nondiabetic and from streptozotocin-treated mice 32 weeks after induction of diabetes. Some of the diabetic mice were treated with inhibitors of receptor for advanced glycation endproducts (RAGE) and p38 mitogen activated protein (MAP) kinase, which have previously been shown to inhibit diabetic retinopathy in rodent models. The transcripts and alternatively spliced variants were determined in all experimental groups. Next generation sequencing-based RNA-seq profiles provided comprehensive signatures of transcripts that are altered in early stages of diabetic retinopathy. These transcripts encoded proteins involved in distinct yet physiologically relevant disease-associated pathways such as inflammation, microvasculature formation, apoptosis, glucose metabolism, Wnt signaling, xenobiotic metabolism, and photoreceptor biology. Significant upregulation of crystallin transcripts was observed in diabetic animals, and the diabetes-induced upregulation of these transcripts was inhibited in diabetic animals treated with inhibitors of either RAGE or p38 MAP kinase. These two therapies also showed dissimilar regulation of some subsets of transcripts that included alternatively spliced versions of arrestin, neutral sphingomyelinase activation associated factor (Nsmaf), SH3-domain GRB2-like interacting protein 1 (Sgip1), and axin. Diabetes alters many transcripts in the retina, and two therapies that inhibit the vascular pathology similarly inhibit a portion of these changes, pointing to possible molecular mechanisms for their beneficial effects. These therapies also changed the abundance of various alternatively spliced versions of signaling transcripts, suggesting a possible role of alternative splicing in disease etiology. Our studies clearly demonstrate RNA-seq as a comprehensive strategy for identifying disease-specific transcripts, and for determining comparative profiles of molecular changes mediated by candidate drugs.
Large Scale Comparative Visualisation of Regulatory Networks with TRNDiff
Chua, Xin-Yi; Buckingham, Lawrence; Hogan, James M.; ...
2015-06-01
The advent of Next Generation Sequencing (NGS) technologies has seen explosive growth in genomic datasets, and dense coverage of related organisms, supporting study of subtle, strain-specific variations as a determinant of function. Such data collections present fresh and complex challenges for bioinformatics, those of comparing models of complex relationships across hundreds and even thousands of sequences. Transcriptional Regulatory Network (TRN) structures document the influence of regulatory proteins called Transcription Factors (TFs) on associated Target Genes (TGs). TRNs are routinely inferred from model systems or iterative search, and analysis at these scales requires simultaneous displays of multiple networks well beyond thosemore » of existing network visualisation tools [1]. In this paper we describe TRNDiff, an open source system supporting the comparative analysis and visualization of TRNs (and similarly structured data) from many genomes, allowing rapid identification of functional variations within species. The approach is demonstrated through a small scale multiple TRN analysis of the Fur iron-uptake system of Yersinia, suggesting a number of candidate virulence factors; and through a larger study exploiting integration with the RegPrecise database (http://regprecise.lbl.gov; [2]) - a collection of hundreds of manually curated and predicted transcription factor regulons drawn from across the entire spectrum of prokaryotic organisms.« less
Panagopoulos, Ioannis; Gorunova, Ludmila; Bjerkehagen, Bodil; Heim, Sverre
2014-01-01
Whole transcriptome sequencing was used to study a small round cell tumor in which a t(4;19)(q35;q13) was part of the complex karyotype but where the initial reverse transcriptase PCR (RT-PCR) examination did not detect a CIC-DUX4 fusion transcript previously described as the crucial gene-level outcome of this specific translocation. The RNA sequencing data were analysed using the FusionMap, FusionFinder, and ChimeraScan programs which are specifically designed to identify fusion genes. FusionMap, FusionFinder, and ChimeraScan identified 1017, 102, and 101 fusion transcripts, respectively, but CIC-DUX4 was not among them. Since the RNA sequencing data are in the fastq text-based format, we searched the files using the "grep" command-line utility. The "grep" command searches the text for specific expressions and displays, by default, the lines where matches occur. The "specific expression" was a sequence of 20 nucleotides from the coding part of the last exon 20 of CIC (Reference Sequence: NM_015125.3) chosen since all the so far reported CIC breakpoints have occurred here. Fifteen chimeric CIC-DUX4 cDNA sequences were captured and the fusion between the CIC and DUX4 genes was mapped precisely. New primer combinations were constructed based on these findings and were used together with a polymerase suitable for amplification of GC-rich DNA templates to amplify CIC-DUX4 cDNA fragments which had the same fusion point found with "grep". In conclusion, FusionMap, FusionFinder, and ChimeraScan generated a plethora of fusion transcripts but did not detect the biologically important CIC-DUX4 chimeric transcript; they are generally useful but evidently suffer from imperfect both sensitivity and specificity. The "grep" command is an excellent tool to capture chimeric transcripts from RNA sequencing data when the pathological and/or cytogenetic information strongly indicates the presence of a specific fusion gene.
Engineering Synthetic Gene Circuits in Living Cells with CRISPR Technology.
Jusiak, Barbara; Cleto, Sara; Perez-Piñera, Pablo; Lu, Timothy K
2016-07-01
One of the goals of synthetic biology is to build regulatory circuits that control cell behavior, for both basic research purposes and biomedical applications. The ability to build transcriptional regulatory devices depends on the availability of programmable, sequence-specific, and effective synthetic transcription factors (TFs). The prokaryotic clustered regularly interspaced short palindromic repeat (CRISPR) system, recently harnessed for transcriptional regulation in various heterologous host cells, offers unprecedented ease in designing synthetic TFs. We review how CRISPR can be used to build synthetic gene circuits and discuss recent advances in CRISPR-mediated gene regulation that offer the potential to build increasingly complex, programmable, and efficient gene circuits in the future. Copyright © 2016. Published by Elsevier Ltd.
Genome-wide uniformity of human ‘open’ pre-initiation complexes
Lai, William K.M.; Pugh, B. Franklin
2017-01-01
Transcription of protein-coding and noncoding DNA occurs pervasively throughout the mammalian genome. Their sites of initiation are generally inferred from transcript 5′ ends and are thought to be either locally dispersed or focused. How these two modes of initiation relate is unclear. Here, we apply permanganate treatment and chromatin immunoprecipitation (PIP-seq) of initiation factors to identify the precise location of melted DNA separately associated with the preinitiation complex (PIC) and the adjacent paused complex (PC). This approach revealed the two known modes of transcription initiation. However, in contrast to prevailing views, they co-occurred within the same promoter region: initiation originating from a focused PIC, and broad nucleosome-linked initiation. PIP-seq allowed transcriptional orientation of Pol II to be determined, which may be useful near promoters where sufficient sense/anti-sense transcript mapping information is lacking. PIP-seq detected divergently oriented Pol II at both coding and noncoding promoters, as well as at enhancers. Their occupancy levels were not necessarily coupled in the two orientations. DNA sequence and shape analysis of initiation complex sites suggest that both sequence and shape contribute to specificity, but in a context-restricted manner. That is, initiation sites have the locally “best” initiator (INR) sequence and/or shape. These findings reveal a common core to pervasive Pol II initiation throughout the human genome. PMID:27927716
Yamazaki, Yuji; Kubota, Hiroshi; Nozaki, Masami; Nagata, Kazuhiro
2003-08-15
The chaperonin-containing t-complex polypeptide 1 (CCT) is a molecular chaperone that facilitates protein folding in eukaryotic cytosol, and the expression of CCT is highly dependent on cell growth. We show here that transcription of the gene encoding the theta subunit of mouse CCT, Cctq, is regulated by the ternary complex factors (TCFs), Elk-1, Sap-1a, and Net (Sap-2). Reporter gene assay using HeLa cells indicated that the Cctq gene promoter contains a cis-acting element of the CCGGAAGT sequence (CQE1) at -36 bp. The major CQE1-binding proteins in HeLa cell nuclear extract was recognized by anti-Elk-1 or anti-Sap-1a antibodies in electrophoretic mobility shift assay, and recombinant Elk-1, Sap-1a, or Net specifically recognized CQE1. The CQE1-dependent transcriptional activity in HeLa cells was virtually abolished by overexpression of the DNA binding domains of TCFs. Overexpression of full-length TCFs with Ras indicated that exogenous TCFs can regulate the CQE1-dependent transcription in a Ras-dependent manner. PD98059, an inhibitor of MAPK, significantly repressed the CQE1-dependent transcription. However, no serum response factor was detected by electrophoretic mobility shift assay using the CQE1 element. These results indicate that transcription of the Cctq gene is regulated by TCFs under the control of the Ras/MAPK pathway, probably independently of serum response factor.
Proteopedia: 3D Visualization and Annotation of Transcription Factor-DNA Readout Modes
ERIC Educational Resources Information Center
Dantas Machado, Ana Carolina; Saleebyan, Skyler B.; Holmes, Bailey T.; Karelina, Maria; Tam, Julia; Kim, Sharon Y.; Kim, Keziah H.; Dror, Iris; Hodis, Eran; Martz, Eric; Compeau, Patricia A.; Rohs, Remo
2012-01-01
3D visualization assists in identifying diverse mechanisms of protein-DNA recognition that can be observed for transcription factors and other DNA binding proteins. We used Proteopedia to illustrate transcription factor-DNA readout modes with a focus on DNA shape, which can be a function of either nucleotide sequence (Hox proteins) or base pairing…
Qian, Jiang; Esumi, Noriko; Chen, Yangjian; Wang, Qingliang; Chowers, Itay; Zack, Donald J.
2005-01-01
Identification of tissue-specific gene regulatory networks can yield insights into the molecular basis of a tissue's development, function and pathology. Here, we present a computational approach designed to identify potential regulatory target genes of photoreceptor cell-specific transcription factors (TFs). The approach is based on the hypothesis that genes related to the retina in terms of expression, disease and/or function are more likely to be the targets of retina-specific TFs than other genes. A list of genes that are preferentially expressed in retina was obtained by integrating expressed sequence tag, SAGE and microarray datasets. The regulatory targets of retina-specific TFs are enriched in this set of retina-related genes. A Bayesian approach was employed to integrate information about binding site location relative to a gene's transcription start site. Our method was applied to three retina-specific TFs, CRX, NRL and NR2E3, and a number of potential targets were predicted. To experimentally assess the validity of the bioinformatic predictions, mobility shift, transient transfection and chromatin immunoprecipitation assays were performed with five predicted CRX targets, and the results were suggestive of CRX regulation in 5/5, 3/5 and 4/5 cases, respectively. Together, these experiments strongly suggest that RP1, GUCY2D, ABCA4 are novel targets of CRX. PMID:15967807
Putative Monofunctional Type I Polyketide Synthase Units: A Dinoflagellate-Specific Feature?
Eichholz, Karsten; Beszteri, Bánk; John, Uwe
2012-01-01
Marine dinoflagellates (alveolata) are microalgae of which some cause harmful algal blooms and produce a broad variety of most likely polyketide synthesis derived phycotoxins. Recently, novel polyketide synthesase (PKS) transcripts have been described from the Florida red tide dinoflagellate Karenia brevis (gymnodiniales) which are evolutionarily related to Type I PKS but were apparently expressed as monofunctional proteins, a feature typical of Type II PKS. Here, we investigated expression units of PKS I-like sequences in Alexandrium ostenfeldii (gonyaulacales) and Heterocapsa triquetra (peridiniales) at the transcript and protein level. The five full length transcripts we obtained were all characterized by polyadenylation, a 3′ UTR and the dinoflagellate specific spliced leader sequence at the 5′end. Each of the five transcripts encoded a single ketoacylsynthase (KS) domain showing high similarity to K. brevis KS sequences. The monofunctional structure was also confirmed using dinoflagellate specific KS antibodies in Western Blots. In a maximum likelihood phylogenetic analysis of KS domains from diverse PKSs, dinoflagellate KSs formed a clade placed well within the protist Type I PKS clade between apicomplexa, haptophytes and chlorophytes. These findings indicate that the atypical PKS I structure, i.e., expression as putative monofunctional units, might be a dinoflagellate specific feature. In addition, the sequenced transcripts harbored a previously unknown, apparently dinoflagellate specific conserved N-terminal domain. We discuss the implications of this novel region with regard to the putative monofunctional organization of Type I PKS in dinoflagellates. PMID:23139807
Hendrickson, Peter G; Doráis, Jessie A; Grow, Edward J; Whiddon, Jennifer L; Lim, Jong-Won; Wike, Candice L; Weaver, Bradley D; Pflueger, Christian; Emery, Benjamin R; Wilcox, Aaron L; Nix, David A; Peterson, C Matthew; Tapscott, Stephen J; Carrell, Douglas T; Cairns, Bradley R
2017-06-01
To better understand transcriptional regulation during human oogenesis and preimplantation development, we defined stage-specific transcription, which highlighted the cleavage stage as being highly distinctive. Here, we present multiple lines of evidence that a eutherian-specific multicopy retrogene, DUX4, encodes a transcription factor that activates hundreds of endogenous genes (for example, ZSCAN4, KDM4E and PRAMEF-family genes) and retroviral elements (MERVL/HERVL family) that define the cleavage-specific transcriptional programs in humans and mice. Remarkably, mouse Dux expression is both necessary and sufficient to convert mouse embryonic stem cells (mESCs) into 2-cell-embryo-like ('2C-like') cells, measured here by the reactivation of '2C' genes and repeat elements, the loss of POU5F1 (also known as OCT4) protein and chromocenters, and the conversion of the chromatin landscape (as assessed by transposase-accessible chromatin using sequencing (ATAC-seq)) to a state strongly resembling that of mouse 2C embryos. Thus, we propose mouse DUX and human DUX4 as major drivers of the cleavage or 2C state.
Damania, Blossom; Mital, Renu; Alwine, James C.
1998-01-01
The TATA-binding protein (TBP) is common to the basal transcription factors of all three RNA polymerases, being associated with polymerase-specific TBP-associated factors (TAFs). Simian virus 40 large T antigen has previously been shown to interact with the TBP-TAFII complexes, TFIID (B. Damania and J. C. Alwine, Genes Dev. 10:1369–1381, 1996), and the TBP-TAFI complex, SL1 (W. Zhai, J. Tuan, and L. Comai, Genes Dev. 11:1605–1617, 1997), and in both cases these interactions are critical for transcriptional activation. We show a similar mechanism for activation of the class 3 polymerase III (pol III) promoter for the U6 RNA gene. Large T antigen can activate this promoter, which contains a TATA box and an upstream proximal sequence element but cannot activate the TATA-less, intragenic VAI promoter (a class 2, pol III promoter). Mutants of large T antigen that cannot activate pol II promoters also fail to activate the U6 promoter. We provide evidence that large T antigen can interact with the TBP-containing pol III transcription factor human TFIIB-related factor (hBRF), as well as with at least two of the three TAFs in the pol III-specific small nuclear RNA-activating protein complex (SNAPc). In addition, we demonstrate that large T antigen can cofractionate and coimmunoprecipitate with the hBRF-containing complex TFIIIB derived from HeLa cells infected with a recombinant adenovirus which expresses large T antigen. Hence, similar to its function with pol I and pol II promoters, large T antigen interacts with TBP-containing, basal pol III transcription factors and appears to perform a TAF-like function. PMID:9488448
Kemme, Catherine A; Marquez, Rolando; Luu, Ross H; Iwahara, Junji
2017-07-27
Eukaryotic genomes contain numerous non-functional high-affinity sequences for transcription factors. These sequences potentially serve as natural decoys that sequester transcription factors. We have previously shown that the presence of sequences similar to the target sequence could substantially impede association of the transcription factor Egr-1 with its targets. In this study, using a stopped-flow fluorescence method, we examined the kinetic impact of DNA methylation of decoys on the search process of the Egr-1 zinc-finger protein. We analyzed its association with an unmethylated target site on fluorescence-labeled DNA in the presence of competitor DNA duplexes, including Egr-1 decoys. DNA methylation of decoys alone did not affect target search kinetics. In the presence of the MeCP2 methyl-CpG-binding domain (MBD), however, DNA methylation of decoys substantially (∼10-30-fold) accelerated the target search process of the Egr-1 zinc-finger protein. This acceleration did not occur when the target was also methylated. These results suggest that when decoys are methylated, MBD proteins can block them and thereby allow Egr-1 to avoid sequestration in non-functional locations. This effect may occur in vivo for DNA methylation outside CpG islands (CGIs) and could facilitate localization of some transcription factors within regulatory CGIs, where DNA methylation is rare. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Neymotin, Benjamin; Ettorre, Victoria; Gresham, David
2016-01-01
Degradation of mRNA contributes to variation in transcript abundance. Studies of individual mRNAs have shown that both cis and trans factors affect mRNA degradation rates. However, the factors underlying transcriptome-wide variation in mRNA degradation rates are poorly understood. We investigated the contribution of different transcript properties to transcriptome-wide degradation rate variation in the budding yeast, Saccharomyces cerevisiae, using multiple regression analysis. We find that multiple transcript properties are significantly associated with variation in mRNA degradation rates, and that a model incorporating these properties explains ∼50% of the genome-wide variance. Predictors of mRNA degradation rates include transcript length, ribosome density, biased codon usage, and GC content of the third position in codons. To experimentally validate these factors, we studied individual transcripts expressed from identical promoters. We find that decreasing ribosome density by mutating the first translational start site of a transcript increases its degradation rate. Using coding sequence variants of green fluorescent protein (GFP) that differ only at synonymous sites, we show that increased GC content of the third position of codons results in decreased rates of mRNA degradation. Thus, in steady-state conditions, a large fraction of genome-wide variation in mRNA degradation rates is determined by inherent properties of transcripts, many of which are related to translation, rather than specific regulatory mechanisms. PMID:27633789
Cyclin A and the retinoblastoma gene product complex with a common transcription factor.
Bandara, L R; Adamczewski, J P; Hunt, T; La Thangue, N B
1991-07-18
The retinoblastoma gene (Rb) product is a negative regulator of cellular proliferation, an effect that could be mediated in part at the transcriptional level through its ability to complex with the sequence-specific transcription factor DRTF1. This interaction is modulated by adenovirus E1a, which sequesters the Rb protein and several other cellular proteins, including cyclin A, a molecule that undergoes cyclical accumulation and destruction during each cell cycle and which is required for cell cycle progression. Cyclin A, which also complexes with DRTF1, facilitates the efficient assembly of the Rb protein into the complex. This suggests a role for cyclin A in regulating transcription and defines a transcription factor through which molecules that regulate the cell cycle in a negative fashion, such as Rb, and in a positive fashion, such as cyclin A, interact. Mutant loss-of-function Rb alleles, which occur in a variety of tumour cells, also fail to complex with E1a and large T antigen. Here we report on a naturally occurring loss-of-function Rb allele encoding a protein that fails to complex with DRTF1. This might explain how mutation in the Rb gene prevents negative growth control.
Schulz, Sebastian; Eckweiler, Denitsa; Bielecka, Agata; Nicolai, Tanja; Franke, Raimo; Dötsch, Andreas; Hornischer, Klaus; Bruchmann, Sebastian; Düvel, Juliane; Häussler, Susanne
2015-01-01
Sigma factors are essential global regulators of transcription initiation in bacteria which confer promoter recognition specificity to the RNA polymerase core enzyme. They provide effective mechanisms for simultaneously regulating expression of large numbers of genes in response to challenging conditions, and their presence has been linked to bacterial virulence and pathogenicity. In this study, we constructed nine his-tagged sigma factor expressing and/or deletion mutant strains in the opportunistic pathogen Pseudomonas aeruginosa. To uncover the direct and indirect sigma factor regulons, we performed mRNA profiling, as well as chromatin immunoprecipitation coupled to high-throughput sequencing. We furthermore elucidated the de novo binding motif of each sigma factor, and validated the RNA- and ChIP-seq results by global motif searches in the proximity of transcriptional start sites (TSS). Our integrated approach revealed a highly modular network architecture which is composed of insulated functional sigma factor modules. Analysis of the interconnectivity of the various sigma factor networks uncovered a limited, but highly function-specific, crosstalk which orchestrates complex cellular processes. Our data indicate that the modular structure of sigma factor networks enables P. aeruginosa to function adequately in its environment and at the same time is exploited to build up higher-level functions by specific interconnections that are dominated by a participation of RpoN. PMID:25780925
Chow, C W; Clark, M P; Rinaldo, J E; Chalkley, R
1996-03-01
In the present study, we have explored an unexpected observation in transcription initiation that is mediated by single-stranded oligonucleotides. Initially, our goal was to understand the function of different upstream regulatory elements/initiation sites in the rat xanthine dehydrogenase/oxidase (XDH/XO) promoter. We performed in vitro transcription with HeLa nuclear extracts in the presence of different double-stranded oligonucleotides against upstream elements as competitors. A new and unusual transcription initiation site was detected by primer extension. This new initiation site maps to the downstream region of the corresponding competitor. Subsequent analyses have indicated that the induction of a new transcription initiation site is anomalous which is due to the presence of a small amount of single-stranded oligonucleotide in the competitor. We found that this anomalous initiation site is insensitive to the orientation of the promoter and requires only a small amount of single-stranded oligonucleotide (< 2-fold molar excess relative to template). We surmise that a complementary interaction between the single-stranded oligonucleotide and transiently denatured promoter template may be responsible for this sequence-specific transcription initiation artifact. To study the regulation of transcription initiation by in vitro transcription approaches, we propose that one should probe the effect of removing transacting factors by adding an excess of a cognate oligonucleotide which does not bear exact sequence identity to the template.
Clare, Alison J.; Wicky, Hollie E.; Empson, Ruth M.; Hughes, Stephanie M.
2017-01-01
Forebrain embryonic zinc finger (Fezf2) encodes a transcription factor essential for the specification of layer 5 projection neurons (PNs) in the developing cerebral cortex. As with many developmental transcription factors, Fezf2 continues to be expressed into adulthood, suggesting it remains crucial to the maintenance of neuronal phenotypes. Despite the continued expression, a function has yet to be explored for Fezf2 in the PNs of the developed cortex. Here, we investigated the role of Fezf2 in mature neurons, using lentiviral-mediated delivery of a shRNA to conditionally knockdown the expression of Fezf2 in the mouse primary motor cortex (M1). RNA-sequencing analysis of Fezf2-reduced M1 revealed significant changes to the transcriptome, identifying a regulatory role for Fezf2 in the mature M1. Kyoto Encyclopedia Genes and Genomes (KEGG) pathway analyses of Fezf2-regulated genes indicated a role in neuronal signaling and plasticity, with significant enrichment of neuroactive ligand-receptor interaction, cell adhesion molecules and calcium signaling pathways. Gene Ontology analysis supported a functional role for Fezf2-regulated genes in neuronal transmission and additionally indicated an importance in the regulation of behavior. Using the mammalian phenotype ontology database, we identified a significant overrepresentation of Fezf2-regulated genes associated with specific behavior phenotypes, including associative learning, social interaction, locomotor activation and hyperactivity. These roles were distinct from that of Fezf2-regulated genes identified in development, indicating a dynamic transition in Fezf2 function. Together our findings demonstrate a regulatory role for Fezf2 in the mature brain, with Fezf2-regulated genes having functional roles in sustaining normal neuronal and behavioral phenotypes. These results support the hypothesis that developmental transcription factors are important for maintaining neuron transcriptomes and that disruption of their expression could contribute to the progression of disease phenotypes. PMID:28936162
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kwon, Deug-Nam; Park, Mi-Ryung; Park, Jong-Yi
Highlights: {yields} The sequences of -604 to -84 bp of the pUPII promoter contained the region of a putative negative cis-regulatory element. {yields} The core promoter was located in the 5F-1. {yields} Transcription factor HNF4 can directly bind in the pUPII core promoter region, which plays a critical role in controlling promoter activity. {yields} These features of the pUPII promoter are fundamental to development of a target-specific vector. -- Abstract: Uroplakin II (UPII) is a one of the integral membrane proteins synthesized as a major differentiation product of mammalian urothelium. UPII gene expression is bladder specific and differentiation dependent, butmore » little is known about its transcription response elements and molecular mechanism. To identify the cis-regulatory elements in the pig UPII (pUPII) gene promoter region, we constructed pUPII 5' upstream region deletion mutants and demonstrated that each of the deletion mutants participates in controlling the expression of the pUPII gene in human bladder carcinoma RT4 cells. We also identified a new core promoter region and putative negative cis-regulatory element within a minimal promoter region. In addition, we showed that hepatocyte nuclear factor 4 (HNF4) can directly bind in the pUPII core promoter (5F-1) region, which plays a critical role in controlling promoter activity. Transient cotransfection experiments showed that HNF4 positively regulates pUPII gene promoter activity. Thus, the binding element and its binding protein, HNF4 transcription factor, may be involved in the mechanism that specifically regulates pUPII gene transcription.« less
ERIC Educational Resources Information Center
Kugel, Jennifer F.
2008-01-01
An undergraduate biochemistry laboratory experiment that will teach the technique of fluorescence resonance energy transfer (FRET) while analyzing protein-induced DNA bending is described. The experiment uses the protein TATA binding protein (TBP), which is a general transcription factor that recognizes and binds specific DNA sequences known as…
Pérez-Quintero, Alvaro L.; Rodriguez-R, Luis M.; Dereeper, Alexis; López, Camilo; Koebnik, Ralf; Szurek, Boris; Cunnac, Sebastien
2013-01-01
Transcription Activators-Like Effectors (TALEs) belong to a family of virulence proteins from the Xanthomonas genus of bacterial plant pathogens that are translocated into the plant cell. In the nucleus, TALEs act as transcription factors inducing the expression of susceptibility genes. A code for TALE-DNA binding specificity and high-resolution three-dimensional structures of TALE-DNA complexes were recently reported. Accurate prediction of TAL Effector Binding Elements (EBEs) is essential to elucidate the biological functions of the many sequenced TALEs as well as for robust design of artificial TALE DNA-binding domains in biotechnological applications. In this work a program with improved EBE prediction performances was developed using an updated specificity matrix and a position weight correction function to account for the matching pattern observed in a validation set of TALE-DNA interactions. To gain a systems perspective on the large TALE repertoires from X. oryzae strains, this program was used to predict rice gene targets for 99 sequenced family members. Integrating predictions and available expression data in a TALE-gene network revealed multiple candidate transcriptional targets for many TALEs as well as several possible instances of functional convergence among TALEs. PMID:23869221
Wang, Lan; Ren, Shifang; Zhu, Haiyan; Zhang, Dongmei; Hao, Yuqing; Ruan, Yuanyuan; Zhou, Lei; Lee, Chiayu; Qiu, Lin; Yun, Xiaojing; Xie, Jianhui
2012-08-01
CLEC-2 was first identified by sequence similarity to C-type lectin-like molecules with immune functions and has been reported as a receptor for the platelet-aggregating snake venom toxin rhodocytin and the endogenous sialoglycoprotein podoplanin. Recent researches indicate that CLEC-2-deficient mice were lethal at the embryonic stage associated with disorganized and blood-filled lymphatic vessels and severe edema. In view of a necessary role of CLEC-2 in the individual development, it is of interest to investigate its phylogenetic homology and highly conserved functional regions. In this work, we reported that CLEC-2 from different species holds with an extraordinary conservation by sequence alignment and phylogenetic tree analysis. The functional structures including N-linked oligosaccharide sites and ligand-binding domain implement a structural and functional conservation in a variety of species. The glycosylation sites (N120 and N134) are necessary for the surface expression CLEC-2. CLEC-2 from different species possesses the binding activity of mouse podoplanin. Nevertheless, the expression of CLEC-2 is regulated with a species-specific manner. The alternative splicing of pre-mRNA, a regulatory mechanism of gene expression, and the binding sites on promoter for several key transcription factors vary between different species. Therefore, CLEC-2 shares high sequence homology and functional identity. However the transcript expression might be tightly regulated by different mechanisms in evolution.
Modrzynska, Katarzyna; Pfander, Claudia; Chappell, Lia; Yu, Lu; Suarez, Catherine; Dundas, Kirsten; Gomes, Ana Rita; Goulding, David; Rayner, Julian C; Choudhary, Jyoti; Billker, Oliver
2017-01-11
A family of apicomplexa-specific proteins containing AP2 DNA-binding domains (ApiAP2s) was identified in malaria parasites. This family includes sequence-specific transcription factors that are key regulators of development. However, functions for the majority of ApiAP2 genes remain unknown. Here, a systematic knockout screen in Plasmodium berghei identified ten ApiAP2 genes that were essential for mosquito transmission: four were critical for the formation of infectious ookinetes, and three were required for sporogony. We describe non-essential functions for AP2-O and AP2-SP proteins in blood stages, and identify AP2-G2 as a repressor active in both asexual and sexual stages. Comparative transcriptomics across mutants and developmental stages revealed clusters of co-regulated genes with shared cis promoter elements, whose expression can be controlled positively or negatively by different ApiAP2 factors. We propose that stage-specific interactions between ApiAP2 proteins on partly overlapping sets of target genes generate the complex transcriptional network that controls the Plasmodium life cycle. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
Bai, Yu; Iwasaki, Yuki; Kanaya, Shigehiko; Zhao, Yue; Ikemura, Toshimichi
2014-01-01
With remarkable increase of genomic sequence data of a wide range of species, novel tools are needed for comprehensive analyses of the big sequence data. Self-Organizing Map (SOM) is an effective tool for clustering and visualizing high-dimensional data such as oligonucleotide composition on one map. By modifying the conventional SOM, we have previously developed Batch-Learning SOM (BLSOM), which allows classification of sequence fragments according to species, solely depending on the oligonucleotide composition. In the present study, we introduce the oligonucleotide BLSOM used for characterization of vertebrate genome sequences. We first analyzed pentanucleotide compositions in 100 kb sequences derived from a wide range of vertebrate genomes and then the compositions in the human and mouse genomes in order to investigate an efficient method for detecting differences between the closely related genomes. BLSOM can recognize the species-specific key combination of oligonucleotide frequencies in each genome, which is called a "genome signature," and the specific regions specifically enriched in transcription-factor-binding sequences. Because the classification and visualization power is very high, BLSOM is an efficient powerful tool for extracting a wide range of information from massive amounts of genomic sequences (i.e., big sequence data).
Analysis of the regulation of viral transcription.
Gloss, Bernd; Kalantari, Mina; Bernard, Hans-Ulrich
2005-01-01
Despite the small genomes and number of genes of papillomaviruses, regulation of their transcription is very complex and governed by numerous transcription factors, cis-responsive elements, and epigenetic phenomena. This chapter describes the strategies of how one can approach a systematic analysis of these factors, elements, and mechanisms. From the numerous different techniques useful for studying transcription, we describe in detail three selected protocols of approaches that have been relevant in shaping our knowledge of human papillomavirus transcription. These are DNAse I protection ("footprinting") for location of transcription-factor binding sites, electrophoretic mobility shifts ("gelshifts") for analysis of bound transcription factors, and bisulfite sequencing for analysis of DNA methylation as a prerequisite for epigenetic transcriptional regulation.
Binder, Stefan; Stoll, Katrin; Stoll, Birgit
2013-01-01
It is well recognized that flowering plants maintain a particularly broad spectrum of factors to support gene expression in mitochondria. Many of these factors are pentatricopeptide repeat (PPR) proteins that participate in virtually all processes dealing with RNA. One of these processes is the post-transcriptional generation of mature 5′ termini of RNA. Several PPR proteins are required for efficient 5′ maturation of mitochondrial mRNA and rRNA. These so-called RNA PROCESSING FACTORs (RPF) exclusively represent P-class PPR proteins, mainly composed of canonical PPR motifs without any extra domains. Applying the recent PPR-nucleotide recognition code, binding sites of RPF are predicted on the 5′ leader sequences. The sequence-specific interaction of an RPF with one or a few RNA substrates probably directly or indirectly recruits an as-yet-unidentified endonuclease to the processing site(s). The identification and characterization of RPF is a major step toward the understanding of the role of 5′ end maturation in flowering plant mitochondria. PMID:24184847
Li, Richard Y.; Di Felice, Rosa; Rohs, Remo; Lidar, Daniel A.
2018-01-01
Transcription factors regulate gene expression, but how these proteins recognize and specifically bind to their DNA targets is still debated. Machine learning models are effective means to reveal interaction mechanisms. Here we studied the ability of a quantum machine learning approach to predict binding specificity. Using simplified datasets of a small number of DNA sequences derived from actual binding affinity experiments, we trained a commercially available quantum annealer to classify and rank transcription factor binding. The results were compared to state-of-the-art classical approaches for the same simplified datasets, including simulated annealing, simulated quantum annealing, multiple linear regression, LASSO, and extreme gradient boosting. Despite technological limitations, we find a slight advantage in classification performance and nearly equal ranking performance using the quantum annealer for these fairly small training data sets. Thus, we propose that quantum annealing might be an effective method to implement machine learning for certain computational biology problems. PMID:29652405
Chen, M; Hieng, S; Qian, X; Costa, R; Ou, J H
1994-11-15
Hepatitis B virus (HBV) ENI enhancer can activate the expression of HBV and non-HBV genes in a liver-specific manner. By performing the electrophoretic mobility-shift assays, we demonstrated that the three related, liver-enriched, transcription factors, HNF3 alpha, HNF3 beta, and HNF3 gamma could all bind to the 2c site of HBV ENI enhancer. Mutations introduced in the 2c site to abolish the binding by HNF3 reduced the enhancer activity approximately 15-fold. Moreover, expression of HNF3 antisense sequences to suppress the expression of HNF3 in Huh-7 hepatoma cells led to reduction of the ENI enhancer activity. These results indicate that HNF3 positively regulates the ENI enhancer activity and this regulation is most likely mediated through the 2c site. The requirement of HNF3 for the ENI enhancer activity could explain the liver specificity of this enhancer element.
Dumay-Odelot, Hélène; Durrieu-Gaillard, Stéphanie; El Ayoubi, Leyla; Parrot, Camila; Teichmann, Martin
2014-01-01
Human RNA polymerase III transcribes small untranslated RNAs that contribute to the regulation of essential cellular processes, including transcription, RNA processing and translation. Analysis of this transcription system by in vitro transcription techniques has largely contributed to the discovery of its transcription factors and to the understanding of the regulation of human RNA polymerase III transcription. Here we review some of the key steps that led to the identification of transcription factors and to the definition of minimal promoter sequences for human RNA polymerase III transcription. PMID:25764111
Fine-tuning the onset of myogenesis by homeobox proteins that interact with the Myf5 limb enhancer
Daubas, Philippe; Duval, Nathalie; Bajard, Lola; Langa Vives, Francina; Robert, Benoît; Mankoo, Baljinder S.; Buckingham, Margaret
2015-01-01
ABSTRACT Skeletal myogenesis in vertebrates is initiated at different sites of skeletal muscle formation during development, by activation of specific control elements of the myogenic regulatory genes. In the mouse embryo, Myf5 is the first myogenic determination gene to be expressed and its spatiotemporal regulation requires multiple enhancer sequences, extending over 120 kb upstream of the Mrf4-Myf5 locus. An enhancer, located at −57/−58 kb from Myf5, is responsible for its activation in myogenic cells derived from the hypaxial domain of the somite, that will form limb muscles. Pax3 and Six1/4 transcription factors are essential activators of this enhancer, acting on a 145-bp core element. Myogenic progenitor cells that will form the future muscle masses of the limbs express the factors necessary for Myf5 activation when they delaminate from the hypaxial dermomyotome and migrate into the forelimb bud, however they do not activate Myf5 and the myogenic programme until they have populated the prospective muscle masses. We show that Msx1 and Meox2 homeodomain-containing transcription factors bind in vitro and in vivo to specific sites in the 145-bp element, and are implicated in fine-tuning activation of Myf5 in the forelimb. Msx1, when bound between Pax and Six sites, prevents the binding of these key activators, thus inhibiting transcription of Myf5 and consequent premature myogenic differentiation. Meox2 is required for Myf5 activation at the onset of myogenesis via direct binding to other homeodomain sites in this sequence. Thus, these homeodomain factors, acting in addition to Pax3 and Six1/4, fine-tune the entry of progenitor cells into myogenesis at early stages of forelimb development. PMID:26538636
Algama, Manjula; Tasker, Edward; Williams, Caitlin; Parslow, Adam C; Bryson-Richardson, Robert J; Keith, Jonathan M
2017-03-27
Computational identification of non-coding RNAs (ncRNAs) is a challenging problem. We describe a genome-wide analysis using Bayesian segmentation to identify intronic elements highly conserved between three evolutionarily distant vertebrate species: human, mouse and zebrafish. We investigate the extent to which these elements include ncRNAs (or conserved domains of ncRNAs) and regulatory sequences. We identified 655 deeply conserved intronic sequences in a genome-wide analysis. We also performed a pathway-focussed analysis on genes involved in muscle development, detecting 27 intronic elements, of which 22 were not detected in the genome-wide analysis. At least 87% of the genome-wide and 70% of the pathway-focussed elements have existing annotations indicative of conserved RNA secondary structure. The expression of 26 of the pathway-focused elements was examined using RT-PCR, providing confirmation that they include expressed ncRNAs. Consistent with previous studies, these elements are significantly over-represented in the introns of transcription factors. This study demonstrates a novel, highly effective, Bayesian approach to identifying conserved non-coding sequences. Our results complement previous findings that these sequences are enriched in transcription factors. However, in contrast to previous studies which suggest the majority of conserved sequences are regulatory factor binding sites, the majority of conserved sequences identified using our approach contain evidence of conserved RNA secondary structures, and our laboratory results suggest most are expressed. Functional roles at DNA and RNA levels are not mutually exclusive, and many of our elements possess evidence of both. Moreover, ncRNAs play roles in transcriptional and post-transcriptional regulation, and this may contribute to the over-representation of these elements in introns of transcription factors. We attribute the higher sensitivity of the pathway-focussed analysis compared to the genome-wide analysis to improved alignment quality, suggesting that enhanced genomic alignments may reveal many more conserved intronic sequences.
Alu elements shape the primate transcriptome by cis-regulation of RNA editing
2014-01-01
Background RNA editing by adenosine to inosine deamination is a widespread phenomenon, particularly frequent in the human transcriptome, largely due to the presence of inverted Alu repeats and their ability to form double-stranded structures – a requisite for ADAR editing. While several hundred thousand editing sites have been identified within these primate-specific repeats, the function of Alu-editing has yet to be elucidated. Results We show that inverted Alu repeats, expressed in the primate brain, can induce site-selective editing in cis on sites located several hundred nucleotides from the Alu elements. Furthermore, a computational analysis, based on available RNA-seq data, finds that site-selective editing occurs significantly closer to edited Alu elements than expected. These targets are poorly edited upon deletion of the editing inducers, as well as in homologous transcripts from organisms lacking Alus. Sequences surrounding sites near edited Alus in UTRs, have been subjected to a lesser extent of evolutionary selection than those far from edited Alus, indicating that their editing generally depends on cis-acting Alus. Interestingly, we find an enrichment of primate-specific editing within encoded sequence or the UTRs of zinc finger-containing transcription factors. Conclusions We propose a model whereby primate-specific editing is induced by adjacent Alu elements that function as recruitment elements for the ADAR editing enzymes. The enrichment of site-selective editing with potentially functional consequences on the expression of transcription factors indicates that editing contributes more profoundly to the transcriptomic regulation and repertoire in primates than previously thought. PMID:24485196
Matrix Metalloproteinase (MMP) 9 Transcription in Mouse Brain Induced by Fear Learning*
Ganguly, Krishnendu; Rejmak, Emilia; Mikosz, Marta; Nikolaev, Evgeni; Knapska, Ewelina; Kaczmarek, Leszek
2013-01-01
Memory formation requires learning-based molecular and structural changes in neurons, whereas matrix metalloproteinase (MMP) 9 is involved in the synaptic plasticity by cleaving extracellular matrix proteins and, thus, is associated with learning processes in the mammalian brain. Because the mechanisms of MMP-9 transcription in the brain are poorly understood, this study aimed to elucidate regulation of MMP-9 gene expression in the mouse brain after fear learning. We show here that contextual fear conditioning markedly increases MMP-9 transcription, followed by enhanced enzymatic levels in the three major brain structures implicated in fear learning, i.e. the amygdala, hippocampus, and prefrontal cortex. To reveal the role of AP-1 transcription factor in MMP-9 gene expression, we have used reporter gene constructs with specifically mutated AP-1 gene promoter sites. The constructs were introduced into the medial prefrontal cortex of neonatal mouse pups by electroporation, and the regulation of MMP-9 transcription was studied after contextual fear conditioning in the adult animals. Specifically, −42/-50- and −478/-486-bp AP-1 binding motifs of the mouse MMP-9 promoter sequence have been found to play a major role in MMP-9 gene activation. Furthermore, increases in MMP-9 gene promoter binding by the AP-1 transcription factor proteins c-Fos and c-Jun have been demonstrated in all three brain structures under investigation. Hence, our results suggest that AP-1 acts as a positive regulator of MMP-9 transcription in the brain following fear learning. PMID:23720741
Matrix metalloproteinase (MMP) 9 transcription in mouse brain induced by fear learning.
Ganguly, Krishnendu; Rejmak, Emilia; Mikosz, Marta; Nikolaev, Evgeni; Knapska, Ewelina; Kaczmarek, Leszek
2013-07-19
Memory formation requires learning-based molecular and structural changes in neurons, whereas matrix metalloproteinase (MMP) 9 is involved in the synaptic plasticity by cleaving extracellular matrix proteins and, thus, is associated with learning processes in the mammalian brain. Because the mechanisms of MMP-9 transcription in the brain are poorly understood, this study aimed to elucidate regulation of MMP-9 gene expression in the mouse brain after fear learning. We show here that contextual fear conditioning markedly increases MMP-9 transcription, followed by enhanced enzymatic levels in the three major brain structures implicated in fear learning, i.e. the amygdala, hippocampus, and prefrontal cortex. To reveal the role of AP-1 transcription factor in MMP-9 gene expression, we have used reporter gene constructs with specifically mutated AP-1 gene promoter sites. The constructs were introduced into the medial prefrontal cortex of neonatal mouse pups by electroporation, and the regulation of MMP-9 transcription was studied after contextual fear conditioning in the adult animals. Specifically, -42/-50- and -478/-486-bp AP-1 binding motifs of the mouse MMP-9 promoter sequence have been found to play a major role in MMP-9 gene activation. Furthermore, increases in MMP-9 gene promoter binding by the AP-1 transcription factor proteins c-Fos and c-Jun have been demonstrated in all three brain structures under investigation. Hence, our results suggest that AP-1 acts as a positive regulator of MMP-9 transcription in the brain following fear learning.
Bedon, Frank; Grima-Pettenati, Jacqueline; Mackay, John
2007-01-01
Background Several members of the R2R3-MYB family of transcription factors act as regulators of lignin and phenylpropanoid metabolism during wood formation in angiosperm and gymnosperm plants. The angiosperm Arabidopsis has over one hundred R2R3-MYBs genes; however, only a few members of this family have been discovered in gymnosperms. Results We isolated and characterised full-length cDNAs encoding R2R3-MYB genes from the gymnosperms white spruce, Picea glauca (13 sequences), and loblolly pine, Pinus taeda L. (five sequences). Sequence similarities and phylogenetic analyses placed the spruce and pine sequences in diverse subgroups of the large R2R3-MYB family, although several of the sequences clustered closely together. We searched the highly variable C-terminal region of diverse plant MYBs for conserved amino acid sequences and identified 20 motifs in the spruce MYBs, nine of which have not previously been reported and three of which are specific to conifers. The number and length of the introns in spruce MYB genes varied significantly, but their positions were well conserved relative to angiosperm MYB genes. Quantitative RTPCR of MYB genes transcript abundance in root and stem tissues revealed diverse expression patterns; three MYB genes were preferentially expressed in secondary xylem, whereas others were preferentially expressed in phloem or were ubiquitous. The MYB genes expressed in xylem, and three others, were up-regulated in the compression wood of leaning trees within 76 hours of induction. Conclusion Our survey of 18 conifer R2R3-MYB genes clearly showed a gene family structure similar to that of Arabidopsis. Three of the sequences are likely to play a role in lignin metabolism and/or wood formation in gymnosperm trees, including a close homolog of the loblolly pine PtMYB4, shown to regulate lignin biosynthesis in transgenic tobacco. PMID:17397551
Cleavage of rRNA ensures translational cessation in sperm at fertilization
Johnson, G.D.; Sendler, E.; Lalancette, C.; Hauser, R.; Diamond, M.P.; Krawetz, S.A.
2011-01-01
Intact ribosomal RNAs (rRNAs) comprise the majority of somatic transcripts, yet appear conspicuously absent in spermatozoa, perhaps reflecting cytoplasmic expulsion during spermatogenesis. To discern their fate, total RNA retained in mature spermatozoa from three fertile donors was characterized by Next Generation Sequencing. In all samples, >75% of total sequence reads aligned to rRNAs. The distribution of reads along the length of these transcripts exhibited a high degree of non-uniformity that was reiterated between donors. The coverage of sequencing reads was inversely correlated with guanine-cytosine (GC)-richness such that sequences greater than ∼70% GC were virtually absent in all sperm RNA samples. To confirm the loss of sequence, the relative abundance of specific regions of the 28S transcripts in sperm was established by 7-Deaza-2′-deoxy-guanosine-5′-triphosphate RT–PCR. The inability to amplify specific regions of the 28S sequence from sperm despite the abundant representation of this transcript in the sequencing libraries demonstrates that approximately three-quarters of RNA retained in the mature male gamete are products of rRNA fragmentation. Hence, cleavage (not expulsion of the RNA component of the translational machinery) is responsible for preventing spurious translation following spermiogenesis. These results highlight the potential importance of those transcripts, including many mRNAs, which evade fragmentation and remain intact when sperm are delivered at fertilization. Sequencing data are deposited in GEO as: GSE29160. PMID:21831882
Jaber, Tareq; Henderson, Gail; Li, Sumin; Perng, Guey-Chuen; Carpenter, Dale; Wechsler, Steven L; Jones, Clinton
2009-10-01
The herpes simplex virus type 1 (HSV-1) latency-associated transcript (LAT) is abundantly expressed in latently infected sensory neurons. In small animal models of infection, expression of the first 1.5 kb of LAT coding sequences is necessary and sufficient for wild-type reactivation from latency. The ability of LAT to inhibit apoptosis is important for reactivation from latency. Within the first 1.5 kb of LAT coding sequences and LAT promoter sequences, additional transcripts have been identified. For example, the anti-sense to LAT transcript (AL) is expressed in the opposite direction to LAT from the 5' end of LAT and LAT promoter sequences. In addition, the upstream of LAT (UOL) transcript is expressed in the LAT direction from sequences in the LAT promoter. Further examination of the first 1.5 kb of LAT coding sequences revealed two small ORFs that are anti-sense with respect to LAT (AL2 and AL3). A transcript spanning AL3 was detected in productively infected cells, mouse neuroblastoma cells stably expressing LAT and trigeminal ganglia (TG) of latently infected mice. Peptide-specific IgG directed against AL3 specifically recognized a protein migrating near 15 kDa in cells stably transfected with LAT, mouse neuroblastoma cells transfected with a plasmid containing the AL3 ORF and TG of latently infected mice. The inability to detect the AL3 protein during productive infection may have been because the 5' terminus of the AL3 transcript was downstream of the first in-frame methionine of the AL3 ORF during productive infection.
Qin, Jin-Hong; Zhang, Qing; Zhang, Zhi-Ming; Zhong, Yi; Yang, Yang; Hu, Bao-Yu; Zhao, Guo-Ping; Guo, Xiao-Kui
2008-06-01
DNA microarray analysis was used to compare the differential gene expression profiles between Leptospira interrogans serovar Lai type strain 56601 and its corresponding attenuated strain IPAV. A 22-kb genomic island covering a cluster of 34 genes (i.e., genes LA0186 to LA0219) was actively expressed in both strains but concomitantly upregulated in strain 56601 in contrast to that of IPAV. Reverse transcription-PCR assays proved that the gene cluster comprised five transcripts. Gene annotation of this cluster revealed characteristics of a putative prophage-like remnant with at least 8 of 34 sequences encoding prophage-like proteins, of which the LA0195 protein is probably a putative prophage CI-like regulator. The transcription initiation activities of putative promoter-regulatory sequences of transcripts I, II, and III, all proximal to the LA0195 gene, were further analyzed in the Escherichia coli promoter probe vector pKK232-8 by assaying the reporter chloramphenicol acetyltransferase (CAT) activities. The strong promoter activities of both transcripts I and II indicated by the E. coli CAT assay were well correlated with the in vitro sequence-specific binding of the recombinant LA0195 protein to the corresponding promoter probes detected by the electrophoresis mobility shift assay. On the other hand, the promoter activity of transcript III was very low in E. coli and failed to show active binding to the LA0195 protein in vitro. These results suggested that the LA0195 protein is likely involved in the transcription of transcripts I and II. However, the identical complete DNA sequences of this prophage remnant from these two strains strongly suggests that possible regulatory factors or signal transduction systems residing outside of this region within the genome may be responsible for the differential expression profiling in these two strains.
A transcription factor collective defines the HSN serotonergic neuron regulatory landscape
Artacho, Alejandro; Jimeno-Martín, Ángela; Chirivella, Laura; Weinberg, Peter
2018-01-01
Cell differentiation is controlled by individual transcription factors (TFs) that together activate a selection of enhancers in specific cell types. How these combinations of TFs identify and activate their target sequences remains poorly understood. Here, we identify the cis-regulatory transcriptional code that controls the differentiation of serotonergic HSN neurons in Caenorhabditis elegans. Activation of the HSN transcriptome is directly orchestrated by a collective of six TFs. Binding site clusters for this TF collective form a regulatory signature that is sufficient for de novo identification of HSN neuron functional enhancers. Among C. elegans neurons, the HSN transcriptome most closely resembles that of mouse serotonergic neurons. Mouse orthologs of the HSN TF collective also regulate serotonergic differentiation and can functionally substitute for their worm counterparts which suggests deep homology. Our results identify rules governing the regulatory landscape of a critically important neuronal type in two species separated by over 700 million years. PMID:29553368
Uckun, Fatih M.; Ma, Hong; Zhang, Jian; Ozer, Zahide; Dovat, Sinisa; Mao, Cheney; Ishkhanian, Rita; Goodman, Patricia; Qazi, Sanjive
2012-01-01
Ikaros is a zinc finger-containing DNA-binding protein that plays a pivotal role in immune homeostasis through transcriptional regulation of the earliest stages of lymphocyte ontogeny and differentiation. Functional deficiency of Ikaros has been implicated in the pathogenesis of acute lymphoblastic leukemia, the most common form of childhood cancer. Therefore, a stringent regulation of Ikaros activity is considered of paramount importance, but the operative molecular mechanisms responsible for its regulation remain largely unknown. Here we provide multifaceted genetic and biochemical evidence for a previously unknown function of spleen tyrosine kinase (SYK) as a partner and posttranslational regulator of Ikaros. We demonstrate that SYK phoshorylates Ikaros at unique C-terminal serine phosphorylation sites S358 and S361, thereby augmenting its nuclear localization and sequence-specific DNA binding activity. Mechanistically, we establish that SYK-induced Ikaros activation is essential for its nuclear localization and optimal transcription factor function. PMID:23071339
Essential and Unexpected Role of YY1 to Promote Mesodermal Cardiac Differentiation
Gregoire, Serge; Karra, Ravi; Passer, Derek; Deutsch, Marcus-Andre; Krane, Markus; Feistritzer, Rebecca; Sturzu, Anthony; Domian, Ibrahim; Saga, Yumiko; Wu, Sean M.
2013-01-01
Rational Cardiogenesis is regulated by a complex interplay between transcription factors. However, little is known about how these interactions regulate the transition from mesodermal precursors to cardiac progenitor cells (CPCs). Objective To identify novel regulators of mesodermal cardiac lineage commitment. Methods and Results We performed a bioinformatic-based transcription factor binding site analysis on upstream promoter regions of genes that are enriched in embryonic stem cell (ESC)-derived CPCs. From 32 candidate transcription factors screened, we found that YY1, a repressor of sarcomeric gene expression, is present in CPCs in vivo. Interestingly, we uncovered the ability of YY1 to transcriptionally activate Nkx2.5, a key marker of early cardiogenic commitment. YY1 regulates Nkx2.5 expression via a 2.1 kb cardiac-specific enhancer as demonstrated by in vitro luciferase-based assays and in vivo chromatin immunoprecipitation (ChIP) and genome-wide sequencing analysis. Furthermore, the ability of YY1 to activate Nkx2.5 expression depends on its cooperative interaction with Gata4 at a nearby chromatin. Cardiac mesoderm-specific loss-of-function of YY1 resulted in early embryonic lethality. This was corroborated in vitro by ESC-based assays where we show that the overexpression of YY1 enhanced the cardiogenic differentiation of ESCs into CPCs. Conclusion These results demonstrate an essential and unexpected role for YY1 to promote cardiogenesis as a transcriptional activator of Nkx2.5 and other CPC-enriched genes. PMID:23307821
Novel splice mutation in microthalmia-associated transcription factor in Waardenburg Syndrome.
Brenner, Laura; Burke, Kelly; Leduc, Charles A; Guha, Saurav; Guo, Jiancheng; Chung, Wendy K
2011-01-01
Waardenburg Syndrome (WS) is a syndromic form of hearing loss associated with mutations in six different genes. We identified a large family with WS that had previously undergone clinical testing, with no reported pathogenic mutation. Using linkage analysis, a region on 3p14.1 with an LOD score of 6.6 was identified. Microthalmia-Associated Transcription Factor, a gene known to cause WS, is located within this region of linkage. Sequencing of Microthalmia-Associated Transcription Factor demonstrated a c.1212 G>A synonymous variant that segregated with the WS in the family and was predicted to cause a novel splicing site that was confirmed with expression analysis of the mRNA. This case illustrates the need to computationally analyze novel synonymous sequence variants for possible effects on splicing to maximize the clinical sensitivity of sequence-based genetic testing.
Fauzi, Hamid; Agyeman, Akwasi; Hines, Jennifer V.
2008-01-01
Many bacteria utilize riboswitch transcription regulation to monitor and appropriately respond to cellular levels of important metabolites or effector molecules. The T box transcription antitermination riboswitch responds to cognate uncharged tRNA by specifically stabilizing an antiterminator element in the 5′-untranslated mRNA leader region and precluding formation of a thermodynamically more stable terminator element. Stabilization occurs when the tRNA acceptor end base pairs with the first four nucleotides in the seven nucleotide bulge of the highly conserved antiterminator element. The significance of the conservation of the antiterminator bulge nucleotides that do not base pair with the tRNA is unknown, but they are required for optimal function. In vitro selection was used to determine if the isolated antiterminator bulge context alone dictates the mode in which the tRNA acceptor end binds the bulge nucleotides. No sequence conservation beyond complementarity was observed and the location was not constrained to the first four bases of the bulge. The results indicate that formation of a structure that recognizes the tRNA acceptor end in isolation is not the determinant driving force for the high phylogenetic sequence conservation observed within the antiterminator bulge. Additional factors or T box leader features more likely influenced the phylogenetic sequence conservation. PMID:19152843
DOE Office of Scientific and Technical Information (OSTI.GOV)
Buchman, A.R.; Kimmerly, W.J.; Rine, J.
1988-01-01
Two DNA-binding factors from Saccharomyces cerevisiae have been characterized, GRFI (general regulatory factor I) and ABFI (ARS-binding factor I), that recognize specific sequences within diverse genetic elements. GRFI bound to sequences at the negative regulatory elements (silencers) of the silent mating type loci HML E and HMR E and to the upstream activating sequence (UAS) required for transcription of the MAT ..cap alpha.. genes. A putative conserved UAS located at genes involved in translation (RPG box) was also recognized by GRFI. In addition, GRFI bound with high affinity to sequences within the (C/sub 1-3/A)-repeat region at yeast telomeres. Binding sitesmore » for GRFI with the highest affinity appeared to be of the form 5'-(A/G)(A/C)ACCCAN NCA(T/C)(T/C)-3', where N is any nucleotide. ABFI-binding sites were located next to autonomously replicating sequences (ARSs) at controlling elements of the silent mating type loci HMR E, HMR I, and HML I and were associated with ARS1, ARS2, and the 2..mu..m plasmid ARS. Two tandem ABFI binding sites were found between the HIS3 and DED1 genes, several kilobase pairs from any ARS, indicating that ABFI-binding sites are not restricted to ARSs. The sequences recognized by AFBI showed partial dyad-symmetry and appeared to be variations of the consensus 5'-TATCATTNNNNACGA-3'. GRFI and ABFI were both abundant DNA-binding factors and did not appear to be encoded by the SIR genes, whose product are required for repression of the silent mating type loci. Together, these results indicate that both GRFI and ABFI play multiple roles within the cell.« less
de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas
2014-06-01
The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Hong, Yoonki; Kim, Woo Jin; Bang, Chi Young; Lee, Jae Cheol; Oh, Yeon-Mok
2016-04-01
Lung cancer is the most common cause of cancer related death. Alterations in gene sequence, structure, and expression have an important role in the pathogenesis of lung cancer. Fusion genes and alternative splicing of cancer-related genes have the potential to be oncogenic. In the current study, we performed RNA-sequencing (RNA-seq) to investigate potential fusion genes and alternative splicing in non-small cell lung cancer. RNA was isolated from lung tissues obtained from 86 subjects with lung cancer. The RNA samples from lung cancer and normal tissues were processed with RNA-seq using the HiSeq 2000 system. Fusion genes were evaluated using Defuse and ChimeraScan. Candidate fusion transcripts were validated by Sanger sequencing. Alternative splicing was analyzed using multivariate analysis of transcript sequencing and validated using quantitative real time polymerase chain reaction. RNA-seq data identified oncogenic fusion genes EML4-ALK and SLC34A2-ROS1 in three of 86 normal-cancer paired samples. Nine distinct fusion transcripts were selected using DeFuse and ChimeraScan; of which, four fusion transcripts were validated by Sanger sequencing. In 33 squamous cell carcinoma, 29 tumor specific skipped exon events and six mutually exclusive exon events were identified. ITGB4 and PYCR1 were top genes that showed significant tumor specific splice variants. In conclusion, RNA-seq data identified novel potential fusion transcripts and splice variants. Further evaluation of their functional significance in the pathogenesis of lung cancer is required.
Pu, Jiarui; Mei, Hong; Zhao, Jun; Huang, Kai; Zeng, Fuqing; Tong, Qiangsong
2012-01-01
Heparanase (HPA), an endo-h-D-glucuronidase that cleaves the heparan sulfate chain of heparan sulfate proteoglycans, is overexpressed in majority of human cancers. Recent evidence suggests that small interfering RNA (siRNA) induces transcriptional gene silencing (TGS) in human cells. In this study, transfection of siRNA against −9/+10 bp (siH3), but not −174/−155 bp (siH1) or −134/−115 bp (siH2) region relative to transcription start site (TSS) locating at 101 bp upstream of the translation start site, resulted in TGS of heparanase in human prostate cancer, bladder cancer, and gastric cancer cells in a sequence-specific manner. Methylation-specific PCR and bisulfite sequencing revealed no DNA methylation of CpG islands within heparanase promoter in siH3-transfected cells. The TGS of heparanase did not involve changes of epigenetic markers histone H3 lysine 9 dimethylation (H3K9me2), histone H3 lysine 27 trimethylation (H3K27me3) or active chromatin marker acetylated histone H3 (AcH3). The regulation of alternative splicing was not involved in siH3-mediated TGS. Instead, siH3 interfered with transcription initiation via decreasing the binding of both RNA polymerase II and transcription factor II B (TFIIB), but not the binding of transcription factors Sp1 or early growth response 1, on the heparanase promoter. Moreover, Argonaute 1 and Argonaute 2 facilitated the decreased binding of RNA polymerase II and TFIIB on heparanase promoter, and were necessary in siH3-induced TGS of heparanase. Stable transfection of the short hairpin RNA construct targeting heparanase TSS (−9/+10 bp) into cancer cells, resulted in decreased proliferation, invasion, metastasis and angiogenesis of cancer cells in vitro and in athymic mice models. These results suggest that small RNAs targeting TSS can induce TGS of heparanase via interference with transcription initiation, and significantly suppress the tumor growth, invasion, metastasis and angiogenesis of cancer cells. PMID:22363633
Peters, Linda M.; Belyantseva, Inna A.; Lagziel, Ayala; Battey, James F.; Friedman, Thomas B.; Morell, Robert J.
2007-01-01
Specialization in cell function and morphology is influenced by the differential expression of mRNAs, many of which are expressed at low abundance and restricted to certain cell types. Detecting such transcripts in cDNA libraries may require sequencing millions of clones. Massively parallel signature sequencing (MPSS) is well-suited for identifying transcripts that are expressed in discrete cell types and in low abundance. We have made MPSS libraries from microdissections of three inner ear tissues. By comparing these MPSS libraries to those of 87 other tissues included in the Mouse Reference Transcriptome (MRT) online resource, we have identified genes that are highly enriched in, or specific to, the inner ear. We show by RT-PCR and in situ hybridization that signatures unique to the inner ear libraries identify transcripts with highly specific cell-type localizations. These transcripts serve to illustrate the utility of a resource that is available to the research community. Utilization of these resources will increase the number of known transcription units and expand our knowledge of the tissue-specific regulation of the transcriptome. PMID:17049805
Raghu, G; Tevosian, S; Anant, S; Subramanian, K N; George, D L; Mirkin, S M
1994-01-01
The mouse c-Ki-ras protooncogene promoter contains an unusual DNA element consisting of a 27 bp-long homopurine-homopyrimidine mirror repeat (H-motif) adjacent to a d(C-G)5 repeat. We have previously shown that in vitro these repeats may adopt H and Z conformations, respectively, causing nuclease and chemical hypersensitivity. Here we have studied the functional role of these DNA stretches using fine deletion analysis of the promoter and a transient transcription assay in vivo. We found that while the H-motif is responsible for approximately half of the promoter activity in both mouse and human cell lines, the Z-forming sequence exhibits little, if any, such activity. Mutational changes introduced within the homopurine-homopyrimidine stretch showed that its sequence integrity, rather than its H-forming potential, is responsible for its effect on transcription. Electrophoretic mobility shift assays revealed that the putative H-motif tightly binds several nuclear proteins, one of which is likely to be transcription factor Sp1, as determined by competition experiments. Southwestern hybridization studies detected two major proteins specifically binding to the H-motif: a 97 kD protein which presumably corresponds to Sp1 and another protein of 60 kD in human and 64 kD in mouse cells. We conclude that the homopurine-homopyrimidine stretch is required for full transcriptional activity of the c-Ki-ras promoter and at least two distinct factors, Sp1 and an unidentified protein, potentially contribute to the positive effect on transcription. Images PMID:8078760
Beauchef, Gallic; Bigot, Nicolas; Kypriotou, Magdalini; Renard, Emmanuelle; Porée, Benoît; Widom, Russell; Dompmartin-Blanchere, Anne; Oddos, Thierry; Maquart, François-Xavier; Demoor, Magali; Boumediene, Karim; Galera, Philippe
2012-01-01
Transcriptional mechanisms regulating type I collagen genes expression in physiopathological situations are not completely known. In this study, we have investigated the role of nuclear factor-κB (NF-κB) transcription factor on type I collagen expression in adult normal human (ANF) and scleroderma (SF) fibroblasts. We demonstrated that NF-κB, a master transcription factor playing a major role in immune response/apoptosis, down-regulates COL1A1 expression by a transcriptional control involving the −112/−61 bp sequence. This 51-bp region mediates the action of two zinc fingers, Sp1 (specific protein-1) and Sp3, acting as trans-activators of type I collagen expression in ANF and SF. Knockdown of each one of these trans factors by siRNA confirmed the trans-activating effect of Sp1/Sp3 and the p65 subunit of NF-κB trans-inhibiting effect on COL1A1 expression. Despite no existing κB consensus sequence in the COL1A1 promoter, we found that Sp1/Sp3/c-Krox and NF-κB bind and/or are recruited on the proximal promoter in chromatin immunoprecipitation (ChIP) assays. Attempts to elucidate whether interactions between Sp1/Sp3/c-Krox and p65 are necessary to mediate the NF-κB inhibitory effect on COL1A1 in ANF and SF were carried out; in this regard, immunoprecipitation assays revealed that they interact, and this was validated by re-ChIP. Finally, the knockdown of Sp1/Sp3/c-Krox prevents the p65 inhibitory effect on COL1A1 transcription in ANF, whereas only the siRNAs targeting Sp3 and c-Krox provoked the same effect in SF, suggesting that particular interactions are characteristic of the scleroderma phenotype. In conclusion, our findings highlight a new mechanism for COL1A1 transcriptional regulation by NF-κB, and these data could allow the development of new antifibrotic strategies. PMID:22139845
Hahn, Steven; Young, Elton T.
2011-01-01
Here we review recent advances in understanding the regulation of mRNA synthesis in Saccharomyces cerevisiae. Many fundamental gene regulatory mechanisms have been conserved in all eukaryotes, and budding yeast has been at the forefront in the discovery and dissection of these conserved mechanisms. Topics covered include upstream activation sequence and promoter structure, transcription factor classification, and examples of regulated transcription factor activity. We also examine advances in understanding the RNA polymerase II transcription machinery, conserved coactivator complexes, transcription activation domains, and the cooperation of these factors in gene regulatory mechanisms. PMID:22084422
Analysis of functional redundancies within the Arabidopsis TCP transcription factor family.
Danisman, Selahattin; van Dijk, Aalt D J; Bimbo, Andrea; van der Wal, Froukje; Hennig, Lars; de Folter, Stefan; Angenent, Gerco C; Immink, Richard G H
2013-12-01
Analyses of the functions of TEOSINTE-LIKE1, CYCLOIDEA, and PROLIFERATING CELL FACTOR1 (TCP) transcription factors have been hampered by functional redundancy between its individual members. In general, putative functionally redundant genes are predicted based on sequence similarity and confirmed by genetic analysis. In the TCP family, however, identification is impeded by relatively low overall sequence similarity. In a search for functionally redundant TCP pairs that control Arabidopsis leaf development, this work performed an integrative bioinformatics analysis, combining protein sequence similarities, gene expression data, and results of pair-wise protein-protein interaction studies for the 24 members of the Arabidopsis TCP transcription factor family. For this, the work completed any lacking gene expression and protein-protein interaction data experimentally and then performed a comprehensive prediction of potential functional redundant TCP pairs. Subsequently, redundant functions could be confirmed for selected predicted TCP pairs by genetic and molecular analyses. It is demonstrated that the previously uncharacterized class I TCP19 gene plays a role in the control of leaf senescence in a redundant fashion with TCP20. Altogether, this work shows the power of combining classical genetic and molecular approaches with bioinformatics predictions to unravel functional redundancies in the TCP transcription factor family.
Analysis of functional redundancies within the Arabidopsis TCP transcription factor family
Danisman, Selahattin; de Folter, Stefan; Immink, Richard G. H.
2013-01-01
Analyses of the functions of TEOSINTE-LIKE1, CYCLOIDEA, and PROLIFERATING CELL FACTOR1 (TCP) transcription factors have been hampered by functional redundancy between its individual members. In general, putative functionally redundant genes are predicted based on sequence similarity and confirmed by genetic analysis. In the TCP family, however, identification is impeded by relatively low overall sequence similarity. In a search for functionally redundant TCP pairs that control Arabidopsis leaf development, this work performed an integrative bioinformatics analysis, combining protein sequence similarities, gene expression data, and results of pair-wise protein–protein interaction studies for the 24 members of the Arabidopsis TCP transcription factor family. For this, the work completed any lacking gene expression and protein–protein interaction data experimentally and then performed a comprehensive prediction of potential functional redundant TCP pairs. Subsequently, redundant functions could be confirmed for selected predicted TCP pairs by genetic and molecular analyses. It is demonstrated that the previously uncharacterized class I TCP19 gene plays a role in the control of leaf senescence in a redundant fashion with TCP20. Altogether, this work shows the power of combining classical genetic and molecular approaches with bioinformatics predictions to unravel functional redundancies in the TCP transcription factor family. PMID:24129704
Fedrigo, Olivier; Babbitt, Courtney C.; Wortham, Matthew; Tewari, Alok K.; London, Darin; Song, Lingyun; Lee, Bum-Kyu; Iyer, Vishwanath R.; Parker, Stephen C. J.; Margulies, Elliott H.; Wray, Gregory A.; Furey, Terrence S.; Crawford, Gregory E.
2012-01-01
Understanding the molecular basis for phenotypic differences between humans and other primates remains an outstanding challenge. Mutations in non-coding regulatory DNA that alter gene expression have been hypothesized as a key driver of these phenotypic differences. This has been supported by differential gene expression analyses in general, but not by the identification of specific regulatory elements responsible for changes in transcription and phenotype. To identify the genetic source of regulatory differences, we mapped DNaseI hypersensitive (DHS) sites, which mark all types of active gene regulatory elements, genome-wide in the same cell type isolated from human, chimpanzee, and macaque. Most DHS sites were conserved among all three species, as expected based on their central role in regulating transcription. However, we found evidence that several hundred DHS sites were gained or lost on the lineages leading to modern human and chimpanzee. Species-specific DHS site gains are enriched near differentially expressed genes, are positively correlated with increased transcription, show evidence of branch-specific positive selection, and overlap with active chromatin marks. Species-specific sequence differences in transcription factor motifs found within these DHS sites are linked with species-specific changes in chromatin accessibility. Together, these indicate that the regulatory elements identified here are genetic contributors to transcriptional and phenotypic differences among primate species. PMID:22761590
2017-01-01
Recent advances in next-generation sequencing approaches have revolutionized our understanding of transcriptional expression in diverse systems. However, measurements of transcription do not necessarily reflect gene translation, the process of ultimate importance in understanding cellular function. To circumvent this limitation, biochemical tagging of ribosome subunits to isolate ribosome-associated mRNA has been developed. However, this approach, called TRAP, lacks quantitative resolution compared to a superior technology, ribosome profiling. Here, we report the development of an optimized ribosome profiling approach in Drosophila. We first demonstrate successful ribosome profiling from a specific tissue, larval muscle, with enhanced resolution compared to conventional TRAP approaches. We next validate the ability of this technology to define genome-wide translational regulation. This technology is leveraged to test the relative contributions of transcriptional and translational mechanisms in the postsynaptic muscle that orchestrate the retrograde control of presynaptic function at the neuromuscular junction. Surprisingly, we find no evidence that significant changes in the transcription or translation of specific genes are necessary to enable retrograde homeostatic signaling, implying that post-translational mechanisms ultimately gate instructive retrograde communication. Finally, we show that a global increase in translation induces adaptive responses in both transcription and translation of protein chaperones and degradation factors to promote cellular proteostasis. Together, this development and validation of tissue-specific ribosome profiling enables sensitive and specific analysis of translation in Drosophila. PMID:29194454
NASA Astrophysics Data System (ADS)
Li, Shengjie; Bai, Junjie; Wang, Lin
2008-08-01
Myostatin or GDF-8, a member of the transforming growth factor-β (TGF-β) superfamily, has been demonstrated to be a negative regulator of skeletal muscle mass in mammals. In the present study, we obtained a 5.64 kb sequence of myostatin encoding gene and its promoter from largemouth bass ( Micropterus salmoides). The myostatin encoding gene consisted of three exons (488 bp, 371 bp and 1779 bp, respectively) and two introns (390 bp and 855 bp, respectively). The intron-exon boundaries were conservative in comparison with those of mammalian myostatin encoding genes, whereas the size of introns was smaller than that of mammals. Sequence analysis of 1.569 kb of the largemouth bass myostatin gene promoter region revealed that it contained two TATA boxes, one CAAT box and nine putative E-boxes. Putative muscle growth response elements for myocyte enhancer factor 2 (MEF2), serum response factor (SRF), activator protein 1 (AP1), etc., and muscle-specific Mt binding site (MTBF) were also detected. Some of the transcription factor binding sites were conserved among five teleost species. This information will be useful for studying the transcriptional regulation of myostatin in fish.
2011-01-01
Background Transcription factors (TFs) play a central role in regulating gene expression by interacting with cis-regulatory DNA elements associated with their target genes. Recent surveys have examined the DNA binding specificities of most Saccharomyces cerevisiae TFs, but a comprehensive evaluation of their data has been lacking. Results We analyzed in vitro and in vivo TF-DNA binding data reported in previous large-scale studies to generate a comprehensive, curated resource of DNA binding specificity data for all characterized S. cerevisiae TFs. Our collection comprises DNA binding site motifs and comprehensive in vitro DNA binding specificity data for all possible 8-bp sequences. Investigation of the DNA binding specificities within the basic leucine zipper (bZIP) and VHT1 regulator (VHR) TF families revealed unexpected plasticity in TF-DNA recognition: intriguingly, the VHR TFs, newly characterized by protein binding microarrays in this study, recognize bZIP-like DNA motifs, while the bZIP TF Hac1 recognizes a motif highly similar to the canonical E-box motif of basic helix-loop-helix (bHLH) TFs. We identified several TFs with distinct primary and secondary motifs, which might be associated with different regulatory functions. Finally, integrated analysis of in vivo TF binding data with protein binding microarray data lends further support for indirect DNA binding in vivo by sequence-specific TFs. Conclusions The comprehensive data in this curated collection allow for more accurate analyses of regulatory TF-DNA interactions, in-depth structural studies of TF-DNA specificity determinants, and future experimental investigations of the TFs' predicted target genes and regulatory roles. PMID:22189060
Chen, Huan; Je, Jihyun; Song, Chieun; Hwang, Jung Eun; Lim, Chae Oh
2012-09-01
The dehydration-responsive element-binding factor 2C (DREB2C) is a member of the CBF/DREB subfamily of proteins, which contains a single APETALA2/Ethylene responsive element-binding factor (AP2/ERF) domain. To identify the expression pattern of the DREB2C gene, which contains multiple transcription cis-regulatory elements in its promoter, an approximately 1.4 kb upstream DREB2C sequence was fused to the β-glucuronidase reporter gene (GUS) and the recombinant p1244 construct was transformed into Arabidopsis thaliana (L.) Heynh. The promoter of the gene directed prominent GUS activity in the vasculature in diverse young dividing tissues. Upon applying heat stress (HS), GUS staining was also enhanced in the vasculature of the growing tissues. Analysis of a series of 5'-deletions of the DREB2C promoter revealed that a proximal upstream sequence sufficient for the tissue-specific spatial and temporal induction of GUS expression by HS is localized in the promoter region between -204 and -34 bps relative to the transcriptional start site. Furthermore, electrophoretic mobility shift assay (EMSA) demonstrated that nuclear protein binding activities specific to a -120 to -32 bp promoter fragment increased after HS. These results indicate that the TATA-proximal region and some latent trans-acting factors may cooperate in HS-induced activation of the Arabidopsis DREB2C promoter. © 2012 Institute of Botany, Chinese Academy of Sciences.
James, Timothy Y.; Srivilai, Prayook; Kües, Ursula; Vilgalys, Rytas
2006-01-01
Mating incompatibility in mushroom fungi is controlled by the mating-type loci. In tetrapolar species, two unlinked mating-type loci exist (A and B), whereas in bipolar species there is only one locus. The A and B mating-type loci encode homeodomain transcription factors and pheromones and pheromone receptors, respectively. Most mushroom species have a tetrapolar mating system, but numerous transitions to bipolar mating systems have occurred. Here we determined the genes controlling mating type in the bipolar mushroom Coprinellus disseminatus. Through positional cloning and degenerate PCR, we sequenced both the transcription factor and pheromone receptor mating-type gene homologs from C. disseminatus. Only the transcription factor genes segregate with mating type, discounting the hypothesis of genetic linkage between the A and B mating-type loci as the causal origin of bipolar mating behavior. The mating-type locus of C. disseminatus is similar to the A mating-type locus of the model species Coprinopsis cinerea and encodes two tightly linked pairs of homeodomain transcription factor genes. When transformed into C. cinerea, the C. disseminatus A and B homologs elicited sexual reactions like native mating-type genes. Although mating type in C. disseminatus is controlled by only the transcription factor genes, cellular functions appear to be conserved for both groups of genes. PMID:16461425
Bent, Zachary W.; Poorey, Kunal; LaBauve, Annette E.; ...
2016-12-21
When analyzing pathogen transcriptomes during the infection of host cells, the signal-to-background (pathogen-to-host) ratio of nucleic acids (NA) in infected samples is very small. Despite the advancements in next-generation sequencing, the minute amount of pathogen NA makes standard RNA-seq library preps inadequate for effective gene-level analysis of the pathogen in cases with low bacterial loads. In order to provide a more complete picture of the pathogen transcriptome during an infection, we developed a novel pathogen enrichment technique, which can enrich for transcripts from any cultivable bacteria or virus, using common, readily available laboratory equipment and reagents. To evenly enrich formore » pathogen transcripts, we generate biotinylated pathogen-targeted capture probes in an enzymatic process using the entire genome of the pathogen as a template. The capture probes are hybridized to a strand-specific cDNA library generated from an RNA sample. The biotinylated probes are captured on a monomeric avidin resin in a miniature spin column, and enriched pathogen-specific cDNA is eluted following a series of washes. To test this method, we performed an in vitro time-course infection using Klebsiella pneumoniae to infect murine macrophage cells. K. pneumoniae transcript enrichment efficiency was evaluated using RNA-seq. Bacterial transcripts were enriched up to ~400-fold, and allowed the recovery of transcripts from ~2000–3600 genes not observed in untreated control samples. These additional transcripts revealed interesting aspects of K. pneumoniae biology including the expression of putative virulence factors and the expression of several genes responsible for antibiotic resistance even in the absence of drugs.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bent, Zachary W.; Poorey, Kunal; LaBauve, Annette E.
When analyzing pathogen transcriptomes during the infection of host cells, the signal-to-background (pathogen-to-host) ratio of nucleic acids (NA) in infected samples is very small. Despite the advancements in next-generation sequencing, the minute amount of pathogen NA makes standard RNA-seq library preps inadequate for effective gene-level analysis of the pathogen in cases with low bacterial loads. In order to provide a more complete picture of the pathogen transcriptome during an infection, we developed a novel pathogen enrichment technique, which can enrich for transcripts from any cultivable bacteria or virus, using common, readily available laboratory equipment and reagents. To evenly enrich formore » pathogen transcripts, we generate biotinylated pathogen-targeted capture probes in an enzymatic process using the entire genome of the pathogen as a template. The capture probes are hybridized to a strand-specific cDNA library generated from an RNA sample. The biotinylated probes are captured on a monomeric avidin resin in a miniature spin column, and enriched pathogen-specific cDNA is eluted following a series of washes. To test this method, we performed an in vitro time-course infection using Klebsiella pneumoniae to infect murine macrophage cells. K. pneumoniae transcript enrichment efficiency was evaluated using RNA-seq. Bacterial transcripts were enriched up to ~400-fold, and allowed the recovery of transcripts from ~2000–3600 genes not observed in untreated control samples. These additional transcripts revealed interesting aspects of K. pneumoniae biology including the expression of putative virulence factors and the expression of several genes responsible for antibiotic resistance even in the absence of drugs.« less
Kujur, Alice; Bajaj, Deepak; Saxena, Maneesha S.; Tripathi, Shailesh; Upadhyaya, Hari D.; Gowda, C.L.L.; Singh, Sube; Jain, Mukesh; Tyagi, Akhilesh K.; Parida, Swarup K.
2013-01-01
We developed 1108 transcription factor gene-derived microsatellite (TFGMS) and 161 transcription factor functional domain-associated microsatellite (TFFDMS) markers from 707 TFs of chickpea. The robust amplification efficiency (96.5%) and high intra-specific polymorphic potential (34%) detected by markers suggest their immense utilities in efficient large-scale genotyping applications, including construction of both physical and functional transcript maps and understanding population structure. Candidate gene-based association analysis revealed strong genetic association of TFFDMS markers with three major seed and pod traits. Further, TFGMS markers in the 5′ untranslated regions of TF genes showing differential expression during seed development had higher trait association potential. The significance of TFFDMS markers was demonstrated by correlating their allelic variation with amino acid sequence expansion/contraction in the functional domain and alteration of secondary protein structure encoded by genes. The seed weight-associated markers were validated through traditional bi-parental genetic mapping. The determination of gene-specific linkage disequilibrium (LD) patterns in desi and kabuli based on single nucleotide polymorphism-microsatellite marker haplotypes revealed extended LD decay, enhanced LD resolution and trait association potential of genes. The evolutionary history of a strong seed-size/weight-associated TF based on natural variation and haplotype sharing among desi, kabuli and wild unravelled useful information having implication for seed-size trait evolution during chickpea domestication. PMID:23633531
Huntley, Stuart; Baggott, Daniel M.; Hamilton, Aaron T.; Tran-Gyamfi, Mary; Yang, Shan; Kim, Joomyeong; Gordon, Laurie; Branscomb, Elbert; Stubbs, Lisa
2006-01-01
Krüppel-type zinc finger (ZNF) motifs are prevalent components of transcription factor proteins in all eukaryotes. KRAB-ZNF proteins, in which a potent repressor domain is attached to a tandem array of DNA-binding zinc-finger motifs, are specific to tetrapod vertebrates and represent the largest class of ZNF proteins in mammals. To define the full repertoire of human KRAB-ZNF proteins, we searched the genome sequence for key motifs and then constructed and manually curated gene models incorporating those sequences. The resulting gene catalog contains 423 KRAB-ZNF protein-coding loci, yielding alternative transcripts that altogether predict at least 742 structurally distinct proteins. Active rounds of segmental duplication, involving single genes or larger regions and including both tandem and distributed duplication events, have driven the expansion of this mammalian gene family. Comparisons between the human genes and ZNF loci mined from the draft mouse, dog, and chimpanzee genomes not only identified 103 KRAB-ZNF genes that are conserved in mammals but also highlighted a substantial level of lineage-specific change; at least 136 KRAB-ZNF coding genes are primate specific, including many recent duplicates. KRAB-ZNF genes are widely expressed and clustered genes are typically not coregulated, indicating that paralogs have evolved to fill roles in many different biological processes. To facilitate further study, we have developed a Web-based public resource with access to gene models, sequences, and other data, including visualization tools to provide genomic context and interaction with other public data sets. PMID:16606702
Protein Interaction Profile Sequencing (PIP-seq).
Foley, Shawn W; Gregory, Brian D
2016-10-10
Every eukaryotic RNA transcript undergoes extensive post-transcriptional processing from the moment of transcription up through degradation. This regulation is performed by a distinct cohort of RNA-binding proteins which recognize their target transcript by both its primary sequence and secondary structure. Here, we describe protein interaction profile sequencing (PIP-seq), a technique that uses ribonuclease-based footprinting followed by high-throughput sequencing to globally assess both protein-bound RNA sequences and RNA secondary structure. PIP-seq utilizes single- and double-stranded RNA-specific nucleases in the absence of proteins to infer RNA secondary structure. These libraries are also compared to samples that undergo nuclease digestion in the presence of proteins in order to find enriched protein-bound sequences. Combined, these four libraries provide a comprehensive, transcriptome-wide view of RNA secondary structure and RNA protein interaction sites from a single experimental technique. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.
Puranik, Swati; Kumar, Karunesh; Srivastava, Prem S; Prasad, Manoj
2011-10-01
The NAC (NAM/ATAF1,2/CUC2) proteins are among the largest family of plant transcription factors. Its members have been associated with diverse plant processes and intricately regulate the expression of several genes. Inspite of this immense progress, knowledge of their DNA-binding properties are still limited. In our recent publication,1 we reported isolation of a membrane-associated NAC domain protein from Setaria italica (SiNAC). Transactivation analysis revealed that it was a functionally active transcription factor as it could stimulate expression of reporter genes in vivo. Truncations of the transmembrane region of the protein lead to its nuclear localization. Here we describe expression and purification of SiNAC DNA-binding domain. We further report identification of a novel DNA-binding site, [C/G][A/T][T/A][G/C]TC[C/G][A/T][C/G][G/C] for SiNAC by electrophoretic mobility shift assay. The SiNAC-GST protein could bind to the NAC recognition sequence in vitro as well as to sequences where some bases had been reshuffled. The results presented here contribute to our understanding of the DNA-binding specificity of SiNAC protein.
Puranik, Swati; Kumar, Karunesh; Srivastava, Prem S
2011-01-01
The NAC (NAM/ATAF1,2/CUC2) proteins are among the largest family of plant transcription factors. Its members have been associated with diverse plant processes and intricately regulate the expression of several genes. Inspite of this immense progress, knowledge of their DNA-binding properties are still limited. In our recent publication,1 we reported isolation of a membrane-associated NAC domain protein from Setaria italica (SiNAC). Transactivation analysis revealed that it was a functionally active transcription factor as it could stimulate expression of reporter genes in vivo. Truncation of the transmembrane region of the protein lead to its nuclear localization. Here we describe expression and purification of SiNAC DNA-binding domain. We further report identification of a novel DNA-binding site, [C/G][A/T] [T/A][G/C]TC[C/G][A/T][C/G][G/C] for SiNAC by electrophoretic mobility shift assay. The SiNAC-GST protein could bind to the NAC recognition sequence in vitro as well as to sequences where some bases had been reshuffled. The results presented here contribute to our understanding of the DNA-binding specificity of SiNAC protein. PMID:21918373
Development of MTL-CEBPA: Small Activating RNA Drug for Hepatocellular Carcinoma.
Setten, Ryan L; Lightfoot, Helen L; Habib, Nagy A; Rossi, John J
2018-06-10
Oligonucleotide drug development has revolutionised the drug discovery field allowing the notoriously "undruggable" genome to potentially become "druggable". Within this field, 'small' or 'short' activating RNAs (saRNA) are a more recently discovered category of short double stranded RNA with clinical potential. SaRNAs promote endogenous transcription from target loci, a phenomenon widely observed in mammals known as RNA activation (RNAa). The ability to target a particular gene is dependent on the sequence of the saRNA. Hence, the potential clinical application of saRNA is to increase target gene expression in a sequence specific manner. SaRNA based oligonucleotide therapeutics present great promise in expanding the "druggable" genome with particular areas of interest including transcription factor activation and haploinsufficency. Review and Conclusion: In this mini-review, we describe the pre-clinical development of the first saRNA drug to enter the clinic. This saRNA, referred to as MTL-CEBPA, induces transcription of the transcription factor CCAAT/enhancer-binding protein alpha (CEBPα), a tumour suppressor and critical regulator of hepatocyte function. MTL-CEBPA is presently in Phase I clinical trials for hepatocellular carcinoma (HCC). The clinical development of MTL-CEBPA will demonstrate "proof of concept", showing that saRNAs can provide the basis for drugs which enhance targeted gene expression and consequently improve disease outcome in patients. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
A Comprehensive Analysis of Transcript-Supported De Novo Genes in Saccharomyces sensu stricto Yeasts
Lu, Tzu-Chiao; Leu, Jun-Yi; Lin, Wen-Chang
2017-01-01
Abstract Novel genes arising from random DNA sequences (de novo genes) have been suggested to be widespread in the genomes of different organisms. However, our knowledge about the origin and evolution of de novo genes is still limited. To systematically understand the general features of de novo genes, we established a robust pipeline to analyze >20,000 transcript-supported coding sequences (CDSs) from the budding yeast Saccharomyces cerevisiae. Our analysis pipeline combined phylogeny, synteny, and sequence alignment information to identify possible orthologs across 20 Saccharomycetaceae yeasts and discovered 4,340 S. cerevisiae-specific de novo genes and 8,871 S. sensu stricto-specific de novo genes. We further combine information on CDS positions and transcript structures to show that >65% of de novo genes arose from transcript isoforms of ancient genes, especially in the upstream and internal regions of ancient genes. Fourteen identified de novo genes with high transcript levels were chosen to verify their protein expressions. Ten of them, including eight transcript isoform-associated CDSs, showed translation signals and five proteins exhibited specific cytosolic localizations. Our results suggest that de novo genes frequently arise in the S. sensu stricto complex and have the potential to be quickly integrated into ancient cellular network. PMID:28981695
Papantonis, Argyris; Sourmeli, Sissy; Lecanidou, Rena
2008-05-09
From the different cis-elements clustered on silkmoth chorion gene promoters, C/EBP binding sites predominate. Their sequence composition and dispersal vary amongst promoters of diverse developmental specificity. Occupancy of these sites by BmC/EBP was examined through Southwestern and ChIP assays modified to suit ovarian follicular cells. For the genes studied, binding of BmC/EBP coincided with the respective stages of transcriptional activation. However, the factor was reloaded on promoter sequences long after individual gene repression. Furthermore, suppression of BmC/EBP transcription in developing follicles resulted in de-regulation of chorion gene expression. A biphasic function of BmC/EBP, according to which it may act as both an activator and a repressor during silkmoth choriogenesis, is considered under the light of the presented data.
Munde, Manoj; Poon, Gregory M. K.; Wilson, W. David
2013-01-01
Members of the ETS family of transcription factors regulate a functionally diverse array of genes. All ETS proteins share a structurally-conserved but sequence-divergent DNA-binding domain, known as the ETS domain. Although the structure and thermodynamics of the ETS-DNA complexes are well known, little is known about the kinetics of sequence recognition, a facet that offers potential insight into its molecular mechanism. We have characterized DNA binding by the ETS domain of PU.1 by biosensor-surface plasmon resonance (SPR). SPR analysis revealed a striking kinetic profile for DNA binding by the PU.1 ETS domain. At low salt concentrations, it binds high-affinity cognate DNA with a very slow association rate constant (≤105 M−1 s−1), compensated by a correspondingly small dissociation rate constant. The kinetics are strongly salt-dependent but mutually balance to produce a relatively weak dependence in the equilibrium constant. This profile contrasts sharply with reported data for other ETS domains (e.g., Ets-1, TEL) for which high-affinity binding is driven by rapid association (>107 M−1 s−1). We interpret this difference in terms of the hydration properties of ETS-DNA binding and propose that at least two mechanisms of sequence recognition are employed by this family of DNA-binding domain. Additionally, we use SPR to demonstrate the potential for pharmacological inhibition of sequence-specific ETS-DNA binding, using the minor groove-binding distamycin as a model compound. Our work establishes SPR as a valuable technique for extending our understanding of the molecular mechanisms of ETS-DNA interactions as well as developing potential small-molecule agents for biotechnological and therapeutic purposes. PMID:23416556
Idrovo Espín, Fabio Marcelo; Peraza-Echeverria, Santy; Fuentes, Gabriela; Santamaría, Jorge M
2012-05-01
The TGA transcription factors belong to the subfamily of bZIP group D that play a major role in disease resistance and development. Most of the TGA identified in Arabidopsis interact with the master regulator of SAR, NPR1 that controls the expression of PR genes. As a first approach to determine the possible involvement of these transcription factors in papaya defense, we characterized Arabidopsis TGA orthologs from the genome of Carica papaya cv. SunUp. Six orthologs CpTGA1 to CpTGA6, were identified. The predicted CpTGA proteins were highly similar to AtTGA sequences and probably share the same DNA binding properties and transcriptional regulation features. The protein sequences alignment evidenced the presence of conserved domains, characteristic of this group of transcription factors. The phylogeny showed that CpTGA evolved into three different subclades associated with defense and floral development. This is the first report of basal expression patterns assessed by RT-PCR, from the whole subfamily of CpTGA members in different tissues from papaya cv. Maradol mature plants. Overall, CpTGA1, CpTGA3 CpTGA6 and CpTGA4 showed a basal expression in all tissues tested; CpTGA2 expressed strongly in all tissues except in petioles while CpTGA5 expressed only in petals and to a lower extent in petioles. Although more detailed studies in anthers and other floral structures are required, we suggest that CpTGA5 might be tissue-specific, and it might be involved in papaya floral development. On the other hand, we report here for the first time, the expression of the whole family of CpTGA in response to salicylic acid (SA). The expression of CpTGA3, CpTGA4 and CpTGA6 increased in response to SA, what would suggest its involvement in the SAR response in papaya. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Soler, Marçal; Camargo, Eduardo Leal Oliveira; Carocha, Victor; Cassan-Wang, Hua; San Clemente, Hélène; Savelli, Bruno; Hefer, Charles A; Paiva, Jorge A Pinto; Myburg, Alexander A; Grima-Pettenati, Jacqueline
2015-06-01
The R2R3-MYB family, one of the largest transcription factor families in higher plants, controls a wide variety of plant-specific processes including, notably, phenylpropanoid metabolism and secondary cell wall formation. We performed a genome-wide analysis of this superfamily in Eucalyptus, one of the most planted hardwood trees world-wide. A total of 141 predicted R2R3-MYB sequences identified in the Eucalyptus grandis genome sequence were subjected to comparative phylogenetic analyses with Arabidopsis thaliana, Oryza sativa, Populus trichocarpa and Vitis vinifera. We analysed features such as gene structure, conserved motifs and genome location. Transcript abundance patterns were assessed by RNAseq and validated by high-throughput quantitative PCR. We found some R2R3-MYB subgroups with expanded membership in E. grandis, V. vinifera and P. trichocarpa, and others preferentially found in woody species, suggesting diversification of specific functions in woody plants. By contrast, subgroups containing key genes regulating lignin biosynthesis and secondary cell wall formation are more conserved across all of the species analysed. In Eucalyptus, R2R3-MYB tandem gene duplications seem to disproportionately affect woody-preferential and woody-expanded subgroups. Interestingly, some of the genes belonging to woody-preferential subgroups show higher expression in the cambial region, suggesting a putative role in the regulation of secondary growth. © 2014 The Authors New Phytologist © 2014 New Phytologist Trust.
Kelemen, Zsolt; Sebastian, Alvaro; Xu, Wenjia; Grain, Damaris; Salsac, Fabien; Avon, Alexandra; Berger, Nathalie; Tran, Joseph; Dubreucq, Bertrand; Lurin, Claire; Lepiniec, Loïc; Contreras-Moreira, Bruno; Dubos, Christian
2015-01-01
The control of growth and development of all living organisms is a complex and dynamic process that requires the harmonious expression of numerous genes. Gene expression is mainly controlled by the activity of sequence-specific DNA binding proteins called transcription factors (TFs). Amongst the various classes of eukaryotic TFs, the MYB superfamily is one of the largest and most diverse, and it has considerably expanded in the plant kingdom. R2R3-MYBs have been extensively studied over the last 15 years. However, DNA-binding specificity has been characterized for only a small subset of these proteins. Therefore, one of the remaining challenges is the exhaustive characterization of the DNA-binding specificity of all R2R3-MYB proteins. In this study, we have developed a library of Arabidopsis thaliana R2R3-MYB open reading frames, whose DNA-binding activities were assayed in vivo (yeast one-hybrid experiments) with a pool of selected cis-regulatory elements. Altogether 1904 interactions were assayed leading to the discovery of specific patterns of interactions between the various R2R3-MYB subgroups and their DNA target sequences and to the identification of key features that govern these interactions. The present work provides a comprehensive in vivo analysis of R2R3-MYB binding activities that should help in predicting new DNA motifs and identifying new putative target genes for each member of this very large family of TFs. In a broader perspective, the generated data will help to better understand how TF interact with their target DNA sequences. PMID:26484765
Chao, Tianle; Wang, Guizhi; Wang, Jianmin; Liu, Zhaohua; Ji, Zhibin; Hou, Lei; Zhang, Chunlan
2016-01-01
High-throughput mRNA sequencing enables the discovery of new transcripts and additional parts of incompletely annotated transcripts. Compared with the human and cow genomes, the reference annotation level of the sheep genome is still low. An investigation of new transcripts in sheep skeletal muscle will improve our understanding of muscle development. Therefore, applying high-throughput sequencing, two cDNA libraries from the biceps brachii of small-tailed Han sheep and Dorper sheep were constructed, and whole-transcriptome analysis was performed to determine the unknown transcript catalogue of this tissue. In this study, 40,129 transcripts were finally mapped to the sheep genome. Among them, 3,467 transcripts were determined to be unannotated in the current reference sheep genome and were defined as new transcripts. Based on protein-coding capacity prediction and comparative analysis of sequence similarity, 246 transcripts were classified as portions of unannotated genes or incompletely annotated genes. Another 1,520 transcripts were predicted with high confidence to be long non-coding RNAs. Our analysis also revealed 334 new transcripts that displayed specific expression in ruminants and uncovered a number of new transcripts without intergenus homology but with specific expression in sheep skeletal muscle. The results confirmed a complex transcript pattern of coding and non-coding RNA in sheep skeletal muscle. This study provided important information concerning the sheep genome and transcriptome annotation, which could provide a basis for further study.
2012-01-01
Background The potential contribution of upstream sequence variation to the unique features of orthologous genes is just beginning to be unraveled. A core subset of stress-associated bZIP transcription factors from rice (Oryza sativa) formed ten clusters of orthologous groups (COG) with genes from the monocot sorghum (Sorghum bicolor) and dicot Arabidopsis (Arabidopsis thaliana). The total cis-regulatory information content of each stress-associated COG was examined by phylogenetic footprinting to reveal ortholog-specific, lineage-specific and species-specific conservation patterns. Results The most apparent pattern observed was the occurrence of spatially conserved ‘core modules’ among the COGs but not among paralogs. These core modules are comprised of various combinations of two to four putative transcription factor binding site (TFBS) classes associated with either developmental or stress-related functions. Outside the core modules are specific stress (ABA, oxidative, abiotic, biotic) or organ-associated signals, which may be functioning as ‘regulatory fine-tuners’ and further define lineage-specific and species-specific cis-regulatory signatures. Orthologous monocot and dicot promoters have distinct TFBS classes involved in disease and oxidative-regulated expression, while the orthologous rice and sorghum promoters have distinct combinations of root-specific signals, a pattern that is not particularly conserved in Arabidopsis. Conclusions Patterns of cis-regulatory conservation imply that each ortholog has distinct signatures, further suggesting that they are potentially unique in a regulatory context despite the presumed conservation of broad biological function during speciation. Based on the observed patterns of conservation, we postulate that core modules are likely primary determinants of basal developmental programming, which may be integrated with and further elaborated by additional intrinsic or extrinsic signals in conjunction with lineage-specific or species-specific regulatory fine-tuners. This synergy may be critical for finer-scale spatio-temporal regulation, hence unique expression profiles of homologous transcription factors from different species with distinct zones of ecological adaptation such as rice, sorghum and Arabidopsis. The patterns revealed from these comparisons set the stage for further empirical validation by functional genomics. PMID:22992304
Functional and mechanistic diversity of distal transcription enhancers
Bulger, Michael; Groudine, Mark
2013-01-01
Biological differences among metazoans, and between cell types in a given organism, arise in large part due to differences in gene expression patterns. The sequencing of multiple metazoan genomes, coupled with recent advances in genome-wide analysis of histone modifications and transcription factor binding, has revealed that among regulatory DNA sequences, gene-distal enhancers appear to exhibit the greatest diversity and cell-type specificity. Moreover, such elements are emerging as important targets for mutations that can give rise to disease and to genetic variability that underlies evolutionary change. Studies of long-range interactions between distal genomic sequences in the nucleus indicate that enhancers are often important determinants of nuclear organization, contributing to a general model for enhancer function that involves direct enhancer-promoter contact. In a number of systems, however, mechanisms for enhancer function are emerging that do not fit solely within such a model, suggesting that enhancers as a class of DNA regulatory element may be functionally and mechanistically diverse. PMID:21295696
Natural Antisense Transcripts: Molecular Mechanisms and Implications in Breast Cancers
Latgé, Guillaume; Poulet, Christophe; Bours, Vincent; Jerusalem, Guy
2018-01-01
Natural antisense transcripts are RNA sequences that can be transcribed from both DNA strands at the same locus but in the opposite direction from the gene transcript. Because strand-specific high-throughput sequencing of the antisense transcriptome has only been available for less than a decade, many natural antisense transcripts were first described as long non-coding RNAs. Although the precise biological roles of natural antisense transcripts are not known yet, an increasing number of studies report their implication in gene expression regulation. Their expression levels are altered in many physiological and pathological conditions, including breast cancers. Among the potential clinical utilities of the natural antisense transcripts, the non-coding|coding transcript pairs are of high interest for treatment. Indeed, these pairs can be targeted by antisense oligonucleotides to specifically tune the expression of the coding-gene. Here, we describe the current knowledge about natural antisense transcripts, their varying molecular mechanisms as gene expression regulators, and their potential as prognostic or predictive biomarkers in breast cancers. PMID:29301303
Natural Antisense Transcripts: Molecular Mechanisms and Implications in Breast Cancers.
Latgé, Guillaume; Poulet, Christophe; Bours, Vincent; Josse, Claire; Jerusalem, Guy
2018-01-02
Natural antisense transcripts are RNA sequences that can be transcribed from both DNA strands at the same locus but in the opposite direction from the gene transcript. Because strand-specific high-throughput sequencing of the antisense transcriptome has only been available for less than a decade, many natural antisense transcripts were first described as long non-coding RNAs. Although the precise biological roles of natural antisense transcripts are not known yet, an increasing number of studies report their implication in gene expression regulation. Their expression levels are altered in many physiological and pathological conditions, including breast cancers. Among the potential clinical utilities of the natural antisense transcripts, the non-coding|coding transcript pairs are of high interest for treatment. Indeed, these pairs can be targeted by antisense oligonucleotides to specifically tune the expression of the coding-gene. Here, we describe the current knowledge about natural antisense transcripts, their varying molecular mechanisms as gene expression regulators, and their potential as prognostic or predictive biomarkers in breast cancers.
Bonham, Andrew J.; Wenta, Nikola; Osslund, Leah M.; Prussin, Aaron J.; Vinkemeier, Uwe; Reich, Norbert O.
2013-01-01
The DNA-binding specificity and affinity of the dimeric human transcription factor (TF) STAT1, were assessed by total internal reflectance fluorescence protein-binding microarrays (TIRF-PBM) to evaluate the effects of protein phosphorylation, higher-order polymerization and small-molecule inhibition. Active, phosphorylated STAT1 showed binding preferences consistent with prior characterization, whereas unphosphorylated STAT1 showed a weak-binding preference for one-half of the GAS consensus site, consistent with recent models of STAT1 structure and function in response to phosphorylation. This altered-binding preference was further tested by use of the inhibitor LLL3, which we show to disrupt STAT1 binding in a sequence-dependent fashion. To determine if this sequence-dependence is specific to STAT1 and not a general feature of human TF biology, the TF Myc/Max was analysed and tested with the inhibitor Mycro3. Myc/Max inhibition by Mycro3 is sequence independent, suggesting that the sequence-dependent inhibition of STAT1 may be specific to this system and a useful target for future inhibitor design. PMID:23180800
Kehrmann, Jan; Tatura, Roman; Zeschnigk, Michael; Probst-Kepper, Michael; Geffers, Robert; Steinmann, Joerg; Buer, Jan
2014-07-01
The epigenetic regulation of transcription factor genes is critical for T-cell lineage specification. A specific methylation pattern within a conserved region of the lineage specifying transcription factor gene FOXP3, the Treg-specific demethylated region (TSDR), is restricted to regulatory T (Treg) cells and is required for stable expression of FOXP3 and suppressive function. We analysed the impact of hypomethylating agents 5-aza-2'-deoxycytidine and epigallocatechin-3-gallate on human CD4(+) CD25(-) T cells for generating demethylation within FOXP3-TSDR and inducing functional Treg cells. Gene expression, including lineage-specifying transcription factors of the major T-cell lineages and their leading cytokines, functional properties and global transcriptome changes were analysed. The FOXP3-TSDR methylation pattern was determined by using deep amplicon bisulphite sequencing. 5-aza-2'-deoxycytidine induced FOXP3-TSDR hypomethylation and expression of the Treg-cell-specific genes FOXP3 and LRRC32. Proliferation of 5-aza-2'-deoxycytidine-treated cells was reduced, but the cells did not show suppressive function. Hypomethylation was not restricted to FOXP3-TSDR and expression of master transcription factors and leading cytokines of T helper type 1 and type 17 cells were induced. Epigallocatechin-3-gallate induced global DNA hypomethylation to a lesser extent than 5-aza-2'-deoxycitidine, but no relevant hypomethylation within FOXP3-TSDR or expression of Treg-cell-specific genes. Neither of the DNA methyltransferase inhibitors induced fully functional human Treg cells. 5-aza-2'-deoxycitidine-treated cells resembled Treg cells, but they did not suppress proliferation of responder cells, which is an essential capability to be used for Treg cell transfer therapy. Using a recently developed targeted demethylation technology might be a more promising approach for the generation of functional Treg cells. © 2014 John Wiley & Sons Ltd.
Finding functional features in Saccharomyces genomes by phylogenetic footprinting.
Cliften, Paul; Sudarsanam, Priya; Desikan, Ashwin; Fulton, Lucinda; Fulton, Bob; Majors, John; Waterston, Robert; Cohen, Barak A; Johnston, Mark
2003-07-04
The sifting and winnowing of DNA sequence that occur during evolution cause nonfunctional sequences to diverge, leaving phylogenetic footprints of functional sequence elements in comparisons of genome sequences. We searched for such footprints among the genome sequences of six Saccharomyces species and identified potentially functional sequences. Comparison of these sequences allowed us to revise the catalog of yeast genes and identify sequence motifs that may be targets of transcriptional regulatory proteins. Some of these conserved sequence motifs reside upstream of genes with similar functional annotations or similar expression patterns or those bound by the same transcription factor and are thus good candidates for functional regulatory sequences.
de-Carvalho, Jorge; Rodrigues, Rogério M M; Tomé, Brigitte; Henriques, Sílvia F; Mira, Nuno P; Sá-Correia, Isabel; Ferreira, Guilherme N M
2014-04-21
A novel quartz crystal microbalance (QCM) analytical method is developed based on the transmission line model (TLM) algorithm to analyze the binding of transcription factors (TFs) to immobilized DNA oligoduplexes. The method is used to characterize the mechanical properties of biological films through the estimation of the film dynamic shear moduli, G and G, and the film thickness. Using the Saccharomyces cerevisiae transcription factor Haa1 (Haa1DBD) as a biological model two sensors were prepared by immobilizing DNA oligoduplexes, one containing the Haa1 recognition element (HRE(wt)) and another with a random sequence (HRE(neg)) used as a negative control. The immobilization of DNA oligoduplexes was followed in real time and we show that DNA strands initially adsorb with low or non-tilting, laying flat close to the surface, which then lift-off the surface leading to final film tilting angles of 62.9° and 46.7° for HRE(wt) and HRE(neg), respectively. Furthermore we show that the binding of Haa1DBD to HRE(wt) leads to a more ordered and compact film, and forces a 31.7° bending of the immobilized HRE(wt) oligoduplex. This work demonstrates the suitability of the QCM to monitor the specific binding of TFs to immobilized DNA sequences and provides an analytical methodology to study protein-DNA biophysics and kinetics.
Maier, Holly; Ostraat, Rachel; Parenti, Sarah; Fitzsimmons, Daniel; Abraham, Lawrence J.; Garvie, Colin W.; Hagman, James
2003-01-01
Pax-5, a member of the paired domain family of transcription factors, is a key regulator of B lymphocyte-specific transcription and differentiation. A major target of Pax-5-mediated activation is the mb-1 gene, which encodes the essential transmembrane signaling protein Ig-α. Pax-5 recruits three members of the Ets family of transcription factors: Ets-1, Fli-1 and GABPα (with GABPβ1), to assemble ternary complexes on the mb-1 promoter in vitro. Using the Pax-5:Ets-1:DNA crystal structure as a guide, we defined amino acid requirements for transcriptional activation of endogenous mb-1 genes using a novel cell-based assay. Mutations in the β-hairpin/β-turn of the DNA-binding domain of Pax-5 demonstrated its importance for DNA sequence recognition and activation of mb-1 transcription. Mutations of amino acids contacting Ets-1 in the crystal structure reduced or blocked mb-1 promoter activation. One of these mutations, Q22A, resulted in greatly reduced mb-1 gene transcript levels, concurrent with the loss of its ability to recruit Fli-1 to bind the promoter in vitro. In contrast, the mutation had no effect on recruitment of the related Ets protein GABPα (with GABPβ1). These data further define requirements for Pax-5 function in vivo and reveal the complexity of interactions required for cooperative partnerships between transcription factors. PMID:14500810
Wu, Jia Qian; Du, Jiang; Rozowsky, Joel; Zhang, Zhengdong; Urban, Alexander E; Euskirchen, Ghia; Weissman, Sherman; Gerstein, Mark; Snyder, Michael
2008-01-03
Recent studies of the mammalian transcriptome have revealed a large number of additional transcribed regions and extraordinary complexity in transcript diversity. However, there is still much uncertainty regarding precisely what portion of the genome is transcribed, the exact structures of these novel transcripts, and the levels of the transcripts produced. We have interrogated the transcribed loci in 420 selected ENCyclopedia Of DNA Elements (ENCODE) regions using rapid amplification of cDNA ends (RACE) sequencing. We analyzed annotated known gene regions, but primarily we focused on novel transcriptionally active regions (TARs), which were previously identified by high-density oligonucleotide tiling arrays and on random regions that were not believed to be transcribed. We found RACE sequencing to be very sensitive and were able to detect low levels of transcripts in specific cell types that were not detectable by microarrays. We also observed many instances of sense-antisense transcripts; further analysis suggests that many of the antisense transcripts (but not all) may be artifacts generated from the reverse transcription reaction. Our results show that the majority of the novel TARs analyzed (60%) are connected to other novel TARs or known exons. Of previously unannotated random regions, 17% were shown to produce overlapping transcripts. Furthermore, it is estimated that 9% of the novel transcripts encode proteins. We conclude that RACE sequencing is an efficient, sensitive, and highly accurate method for characterization of the transcriptome of specific cell/tissue types. Using this method, it appears that much of the genome is represented in polyA+ RNA. Moreover, a fraction of the novel RNAs can encode protein and are likely to be functional.
Zhao, Yunjun; Sun, Jiayan; Xu, Peng; Zhang, Rui; Li, Laigeng
2014-02-01
Alternative splicing is an important mechanism involved in regulating the development of multicellular organisms. Although many genes in plants undergo alternative splicing, little is understood of its significance in regulating plant growth and development. In this study, alternative splicing of black cottonwood (Populus trichocarpa) wood-associated NAC domain transcription factor (PtrWNDs), PtrWND1B, is shown to occur exclusively in secondary xylem fiber cells. PtrWND1B is expressed with a normal short-transcript PtrWND1B-s as well as its alternative long-transcript PtrWND1B-l. The intron 2 structure of the PtrWND1B gene was identified as a critical sequence that causes PtrWND1B alternative splicing. Suppression of PtrWND1B expression specifically inhibited fiber cell wall thickening. The two PtrWND1B isoforms play antagonistic roles in regulating cell wall thickening during fiber cell differentiation in Populus spp. PtrWND1B-s overexpression enhanced fiber cell wall thickening, while overexpression of PtrWND1B-l repressed fiber cell wall thickening. Alternative splicing may enable more specific regulation of processes such as fiber cell wall thickening during wood formation.
Zhao, Yunjun; Sun, Jiayan; Xu, Peng; Zhang, Rui; Li, Laigeng
2014-01-01
Alternative splicing is an important mechanism involved in regulating the development of multicellular organisms. Although many genes in plants undergo alternative splicing, little is understood of its significance in regulating plant growth and development. In this study, alternative splicing of black cottonwood (Populus trichocarpa) wood-associated NAC domain transcription factor (PtrWNDs), PtrWND1B, is shown to occur exclusively in secondary xylem fiber cells. PtrWND1B is expressed with a normal short-transcript PtrWND1B-s as well as its alternative long-transcript PtrWND1B-l. The intron 2 structure of the PtrWND1B gene was identified as a critical sequence that causes PtrWND1B alternative splicing. Suppression of PtrWND1B expression specifically inhibited fiber cell wall thickening. The two PtrWND1B isoforms play antagonistic roles in regulating cell wall thickening during fiber cell differentiation in Populus spp. PtrWND1B-s overexpression enhanced fiber cell wall thickening, while overexpression of PtrWND1B-l repressed fiber cell wall thickening. Alternative splicing may enable more specific regulation of processes such as fiber cell wall thickening during wood formation. PMID:24394777
Sequence-specific DNA binding by MYC/MAX to low-affinity non-E-box motifs.
Allevato, Michael; Bolotin, Eugene; Grossman, Mark; Mane-Padros, Daniel; Sladek, Frances M; Martinez, Ernest
2017-01-01
The MYC oncoprotein regulates transcription of a large fraction of the genome as an obligatory heterodimer with the transcription factor MAX. The MYC:MAX heterodimer and MAX:MAX homodimer (hereafter MYC/MAX) bind Enhancer box (E-box) DNA elements (CANNTG) and have the greatest affinity for the canonical MYC E-box (CME) CACGTG. However, MYC:MAX also recognizes E-box variants and was reported to bind DNA in a "non-specific" fashion in vitro and in vivo. Here, in order to identify potential additional non-canonical binding sites for MYC/MAX, we employed high throughput in vitro protein-binding microarrays, along with electrophoretic mobility-shift assays and bioinformatic analyses of MYC-bound genomic loci in vivo. We identified all hexameric motifs preferentially bound by MYC/MAX in vitro, which include the low-affinity non-E-box sequence AACGTT, and found that the vast majority (87%) of MYC-bound genomic sites in a human B cell line contain at least one of the top 21 motifs bound by MYC:MAX in vitro. We further show that high MYC/MAX concentrations are needed for specific binding to the low-affinity sequence AACGTT in vitro and that elevated MYC levels in vivo more markedly increase the occupancy of AACGTT sites relative to CME sites, especially at distal intergenic and intragenic loci. Hence, MYC binds diverse DNA motifs with a broad range of affinities in a sequence-specific and dose-dependent manner, suggesting that MYC overexpression has more selective effects on the tumor transcriptome than previously thought.
Archaeal RNA polymerase arrests transcription at DNA lesions.
Gehring, Alexandra M; Santangelo, Thomas J
2017-01-01
Transcription elongation is not uniform and transcription is often hindered by protein-bound factors or DNA lesions that limit translocation and impair catalysis. Despite the high degree of sequence and structural homology of the multi-subunit RNA polymerases (RNAP), substantial differences in response to DNA lesions have been reported. Archaea encode only a single RNAP with striking structural conservation with eukaryotic RNAP II (Pol II). Here, we demonstrate that the archaeal RNAP from Thermococcus kodakarensis is sensitive to a variety of DNA lesions that pause and arrest RNAP at or adjacent to the site of DNA damage. DNA damage only halts elongation when present in the template strand, and the damage often results in RNAP arresting such that the lesion would be encapsulated with the transcription elongation complex. The strand-specific halt to archaeal transcription elongation on modified templates is supportive of RNAP recognizing DNA damage and potentially initiating DNA repair through a process akin to the well-described transcription-coupled DNA repair (TCR) pathways in Bacteria and Eukarya.
Inferring genome-wide interplay landscape between DNA methylation and transcriptional regulation.
Tang, Binhua; Wang, Xin
2015-01-01
DNA methylation and transcriptional regulation play important roles in cancer cell development and differentiation processes. Based on the currently available cell line profiling information from the ENCODE Consortium, we propose a Bayesian inference model to infer and construct genome-wide interaction landscape between DNA methylation and transcriptional regulation, which sheds light on the underlying complex functional mechanisms important within the human cancer and disease context. For the first time, we select all the currently available cell lines (>=20) and transcription factors (>=80) profiling information from the ENCODE Consortium portal. Through the integration of those genome-wide profiling sources, our genome-wide analysis detects multiple functional loci of interest, and indicates that DNA methylation is cell- and region-specific, due to the interplay mechanisms with transcription regulatory activities. We validate our analysis results with the corresponding RNA-sequencing technique for those detected genomic loci. Our results provide novel and meaningful insights for the interplay mechanisms of transcriptional regulation and gene expression for the human cancer and disease studies.
Cis-acting elements in the promoter region of the human aldolase C gene.
Buono, P; de Conciliis, L; Olivetta, E; Izzo, P; Salvatore, F
1993-08-16
We investigated the cis-acting sequences involved in the expression of the human aldolase C gene by transient transfections into human neuroblastoma cells (SKNBE). We demonstrate that 420 bp of the 5'-flanking DNA direct at high efficiency the transcription of the CAT reporter gene. A deletion between -420 bp and -164 bp causes a 60% decrease of CAT activity. Gel shift and DNase I footprinting analyses revealed four protected elements: A, B, C and D. Competition analyses indicate that Sp1 or factors sharing a similar sequence specificity bind to elements A and B, but not to elements C and D. Sequence analysis shows a half palindromic ERE motif (GGTCA), in elements B and D. Region D binds a transactivating factor which appears also essential to stabilize the initiation complex.
Gilad, Yoav; Pritchard, Jonathan K.; Stephens, Matthew
2015-01-01
Understanding global gene regulation depends critically on accurate annotation of regulatory elements that are functional in a given cell type. CENTIPEDE, a powerful, probabilistic framework for identifying transcription factor binding sites from tissue-specific DNase I cleavage patterns and genomic sequence content, leverages the hypersensitivity of factor-bound chromatin and the information in the DNase I spatial cleavage profile characteristic of each DNA binding protein to accurately infer functional factor binding sites. However, the model for the spatial profile in this framework fails to account for the substantial variation in the DNase I cleavage profiles across different binding sites. Neither does it account for variation in the profiles at the same binding site across multiple replicate DNase I experiments, which are increasingly available. In this work, we introduce new methods, based on multi-scale models for inhomogeneous Poisson processes, to account for such variation in DNase I cleavage patterns both within and across binding sites. These models account for the spatial structure in the heterogeneity in DNase I cleavage patterns for each factor. Using DNase-seq measurements assayed in a lymphoblastoid cell line, we demonstrate the improved performance of this model for several transcription factors by comparing against the Chip-seq peaks for those factors. Finally, we explore the effects of DNase I sequence bias on inference of factor binding using a simple extension to our framework that allows for a more flexible background model. The proposed model can also be easily applied to paired-end ATAC-seq and DNase-seq data. msCentipede, a Python implementation of our algorithm, is available at http://rajanil.github.io/msCentipede. PMID:26406244
Raj, Anil; Shim, Heejung; Gilad, Yoav; Pritchard, Jonathan K; Stephens, Matthew
2015-01-01
Understanding global gene regulation depends critically on accurate annotation of regulatory elements that are functional in a given cell type. CENTIPEDE, a powerful, probabilistic framework for identifying transcription factor binding sites from tissue-specific DNase I cleavage patterns and genomic sequence content, leverages the hypersensitivity of factor-bound chromatin and the information in the DNase I spatial cleavage profile characteristic of each DNA binding protein to accurately infer functional factor binding sites. However, the model for the spatial profile in this framework fails to account for the substantial variation in the DNase I cleavage profiles across different binding sites. Neither does it account for variation in the profiles at the same binding site across multiple replicate DNase I experiments, which are increasingly available. In this work, we introduce new methods, based on multi-scale models for inhomogeneous Poisson processes, to account for such variation in DNase I cleavage patterns both within and across binding sites. These models account for the spatial structure in the heterogeneity in DNase I cleavage patterns for each factor. Using DNase-seq measurements assayed in a lymphoblastoid cell line, we demonstrate the improved performance of this model for several transcription factors by comparing against the Chip-seq peaks for those factors. Finally, we explore the effects of DNase I sequence bias on inference of factor binding using a simple extension to our framework that allows for a more flexible background model. The proposed model can also be easily applied to paired-end ATAC-seq and DNase-seq data. msCentipede, a Python implementation of our algorithm, is available at http://rajanil.github.io/msCentipede.
De novo Assembly of Leaf Transcriptome in the Medicinal Plant Andrographis paniculata
Cherukupalli, Neeraja; Divate, Mayur; Mittapelli, Suresh R.; Khareedu, Venkateswara R.; Vudem, Dashavantha R.
2016-01-01
Andrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information of this plant is made available in the public data base, this study mainly deals with the sequencing of RNA from A. paniculata leaf using Illumina HiSeq™ 2000 platform followed by the de novo transcriptome assembly. A total of 189.22 million high quality paired reads were generated and 1,70,724 transcripts were predicted in the primary assembly. Secondary assembly generated a transcriptome size of ~88 Mb with 83,800 clustered transcripts. Based on the similarity searches against plant non-redundant protein database, gene ontology, and eukaryotic orthologous groups, 49,363 transcripts were annotated constituting upto 58.91% of the identified unigenes. Annotation of transcripts—using kyoto encyclopedia of genes and genomes database—revealed 5606 transcripts plausibly involved in 140 pathways including biosynthesis of terpenoids and other secondary metabolites. Transcription factor analysis showed 6767 unique transcripts belonging to 97 different transcription factor families. A total number of 124 CYP450 transcripts belonging to seven divergent clans have been identified. Transcriptome revealed 146 different transcripts coding for enzymes involved in the biosynthesis of terpenoids of which 35 contained terpene synthase motifs. This study also revealed 32,341 simple sequence repeats (SSRs) in 23,168 transcripts. Assembled sequences of transcriptome of A. paniculata generated in this study are made available, for the first time, in the TSA database, which provides useful information for functional and comparative genomic analysis besides identification of key enzymes involved in the various pathways of secondary metabolism. PMID:27582746
Townley, Ian K.; Karchner, Sibel I.; Skripnikova, Elena; Wiese, Thomas E.; Hahn, Mark E.
2017-01-01
The hypoxia-inducible factor (HIF) family of transcription factors plays central roles in the development, physiology, pathology, and environmental adaptation of animals. Because many aquatic habitats are characterized by episodes of low dissolved oxygen, fish represent ideal models to study the roles of HIF in the response to aquatic hypoxia. The estuarine fish Fundulus heteroclitus is found in habitats prone to hypoxia. It responds to low oxygen via behavioral, physiological, and molecular changes, and one member of the HIF family, HIF2α, has been previously described. Herein, cDNA sequencing, phylogenetic analyses, and genomic approaches were used to determine other members of the HIFα family from F. heteroclitus and their relationships to HIFα subunits from other vertebrates. In vitro and cellular approaches demonstrated that full-length forms of HIF1α, HIF2α, and HIF3α independently formed complexes with the β-subunit, aryl hydrocarbon receptor nuclear translocator, to bind to hypoxia response elements and activate reporter gene expression. Quantitative PCR showed that HIFα mRNA abundance varied among organs of normoxic fish in an isoform-specific fashion. Analysis of the F. heteroclitus genome revealed a locus encoding a second HIF2α—HIF2αb—a predicted protein lacking oxygen sensing and transactivation domains. Finally, sequence analyses demonstrated polymorphism in the coding sequence of each F. heteroclitus HIFα subunit, suggesting that genetic variation in these transcription factors may play a role in the variation in hypoxia responses among individuals or populations. PMID:28039194
Townley, Ian K; Karchner, Sibel I; Skripnikova, Elena; Wiese, Thomas E; Hahn, Mark E; Rees, Bernard B
2017-03-01
The hypoxia-inducible factor (HIF) family of transcription factors plays central roles in the development, physiology, pathology, and environmental adaptation of animals. Because many aquatic habitats are characterized by episodes of low dissolved oxygen, fish represent ideal models to study the roles of HIF in the response to aquatic hypoxia. The estuarine fish Fundulus heteroclitus is found in habitats prone to hypoxia. It responds to low oxygen via behavioral, physiological, and molecular changes, and one member of the HIF family, HIF2α, has been previously described. Herein, cDNA sequencing, phylogenetic analyses, and genomic approaches were used to determine other members of the HIFα family from F. heteroclitus and their relationships to HIFα subunits from other vertebrates. In vitro and cellular approaches demonstrated that full-length forms of HIF1α, HIF2α, and HIF3α independently formed complexes with the β-subunit, aryl hydrocarbon receptor nuclear translocator, to bind to hypoxia response elements and activate reporter gene expression. Quantitative PCR showed that HIFα mRNA abundance varied among organs of normoxic fish in an isoform-specific fashion. Analysis of the F. heteroclitus genome revealed a locus encoding a second HIF2α-HIF2αb-a predicted protein lacking oxygen sensing and transactivation domains. Finally, sequence analyses demonstrated polymorphism in the coding sequence of each F. heteroclitus HIFα subunit, suggesting that genetic variation in these transcription factors may play a role in the variation in hypoxia responses among individuals or populations. Copyright © 2017 the American Physiological Society.
Precision control of recombinant gene transcription for CHO cell synthetic biology.
Brown, Adam J; James, David C
2016-01-01
The next generation of mammalian cell factories for biopharmaceutical production will be genetically engineered to possess both generic and product-specific manufacturing capabilities that may not exist naturally. Introduction of entirely new combinations of synthetic functions (e.g. novel metabolic or stress-response pathways), and retro-engineering of existing functional cell modules will drive disruptive change in cellular manufacturing performance. However, before we can apply the core concepts underpinning synthetic biology (design, build, test) to CHO cell engineering we must first develop practical and robust enabling technologies. Fundamentally, we will require the ability to precisely control the relative stoichiometry of numerous functional components we simultaneously introduce into the host cell factory. In this review we discuss how this can be achieved by design of engineered promoters that enable concerted control of recombinant gene transcription. We describe the specific mechanisms of transcriptional regulation that affect promoter function during bioproduction processes, and detail the highly-specific promoter design criteria that are required in the context of CHO cell engineering. The relative applicability of diverse promoter development strategies are discussed, including re-engineering of natural sequences, design of synthetic transcription factor-based systems, and construction of synthetic promoters. This review highlights the potential of promoter engineering to achieve precision transcriptional control for CHO cell synthetic biology. Copyright © 2015. Published by Elsevier Inc.
Macaca specific exon creation event generates a novel ZKSCAN5 transcript.
Kim, Young-Hyun; Choe, Se-Hee; Song, Bong-Seok; Park, Sang-Je; Kim, Myung-Jin; Park, Young-Ho; Yoon, Seung-Bin; Lee, Youngjeon; Jin, Yeung Bae; Sim, Bo-Woong; Kim, Ji-Su; Jeong, Kang-Jin; Kim, Sun-Uk; Lee, Sang-Rae; Park, Young-Il; Huh, Jae-Won; Chang, Kyu-Tae
2016-02-15
ZKSCAN5 (also known as ZFP95) is a zinc-finger protein belonging to the Krűppel family. ZKSCAN5 contains a SCAN box and a KRAB A domain and is proposed to play a distinct role during spermatogenesis. In humans, alternatively spliced ZKSCAN5 transcripts with different 5'-untranslated regions (UTRs) have been identified. However, investigation of our Macaca UniGene Database revealed novel alternative ZKSCAN5 transcripts that arose due to an exon creation event. Therefore, in this study, we identified the full-length sequences of ZKSCAN5 and its alternative transcripts in Macaca spp. Additionally, we investigated different nonhuman primate sequences to elucidate the molecular mechanism underlying the exon creation event. We analyzed the evolutionary features of the ZKSCAN5 transcripts by reverse transcription polymerase chain reaction (RT-PCR) and genomic PCR, and by sequencing various nonhuman primate DNA and RNA samples. The exon-created transcript was only detected in the Macaca lineage (crab-eating monkey and rhesus monkey). Full-length sequence analysis by rapid amplification of cDNA ends (RACE) identified ten full-length transcripts and four functional isoforms of ZKSCAN5. Protein sequence analyses revealed the presence of two groups of isoforms that arose because of differences in start-codon usage. Together, our results demonstrate that there has been specific selection for a discrete set of ZKSCAN5 variants in the Macaca lineage. Furthermore, study of this locus (and perhaps others) in Macaca spp. might facilitate our understanding of the evolutionary pressures that have shaped the mechanism of exon creation in primates. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Deep sampling of the Palomero maize transcriptome by a high throughput strategy of pyrosequencing.
Vega-Arreguín, Julio C; Ibarra-Laclette, Enrique; Jiménez-Moraila, Beatriz; Martínez, Octavio; Vielle-Calzada, Jean Philippe; Herrera-Estrella, Luis; Herrera-Estrella, Alfredo
2009-07-06
In-depth sequencing analysis has not been able to determine the overall complexity of transcriptional activity of a plant organ or tissue sample. In some cases, deep parallel sequencing of Expressed Sequence Tags (ESTs), although not yet optimized for the sequencing of cDNAs, has represented an efficient procedure for validating gene prediction and estimating overall gene coverage. This approach could be very valuable for complex plant genomes. In addition, little emphasis has been given to efforts aiming at an estimation of the overall transcriptional universe found in a multicellular organism at a specific developmental stage. To explore, in depth, the transcriptional diversity in an ancient maize landrace, we developed a protocol to optimize the sequencing of cDNAs and performed 4 consecutive GS20-454 pyrosequencing runs of a cDNA library obtained from 2 week-old Palomero Toluqueño maize plants. The protocol reported here allowed obtaining over 90% of informative sequences. These GS20-454 runs generated over 1.5 Million reads, representing the largest amount of sequences reported from a single plant cDNA library. A collection of 367,391 quality-filtered reads (30.09 Mb) from a single run was sufficient to identify transcripts corresponding to 34% of public maize ESTs databases; total sequences generated after 4 filtered runs increased this coverage to 50%. Comparisons of all 1.5 Million reads to the Maize Assembled Genomic Islands (MAGIs) provided evidence for the transcriptional activity of 11% of MAGIs. We estimate that 5.67% (86,069 sequences) do not align with public ESTs or annotated genes, potentially representing new maize transcripts. Following the assembly of 74.4% of the reads in 65,493 contigs, real-time PCR of selected genes confirmed a predicted correlation between the abundance of GS20-454 sequences and corresponding levels of gene expression. A protocol was developed that significantly increases the number, length and quality of cDNA reads using massive 454 parallel sequencing. We show that recurrent 454 pyrosequencing of a single cDNA sample is necessary to attain a thorough representation of the transcriptional universe present in maize, that can also be used to estimate transcript abundance of specific genes. This data suggests that the molecular and functional diversity contained in the vast native landraces remains to be explored, and that large-scale transcriptional sequencing of a presumed ancestor of the modern maize varieties represents a valuable approach to characterize the functional diversity of maize for future agricultural and evolutionary studies.
Tadeu, Ana Mafalda Baptista; Lin, Samantha; Hou, Lin; Chung, Lisa; Zhong, Mei; Zhao, Hongyu; Horsley, Valerie
2015-01-01
In recent years, several studies have shed light into the processes that regulate epidermal specification and homeostasis. We previously showed that a broad-spectrum γ–secretase inhibitor DAPT promoted early keratinocyte specification in human embryonic stem cells triggered to undergo ectoderm specification. Here, we show that DAPT accelerates human embryonic stem cell differentiation and induces expression of the ectoderm protein AP2. Furthermore, we utilize RNA sequencing to identify several candidate regulators of ectoderm specification including those involved in epithelial and epidermal development in human embryonic stem cells. Genes associated with transcriptional regulation and growth factor activity are significantly enriched upon DAPT treatment during specification of human embryonic stem cells to the ectoderm lineage. The human ectoderm cell signature identified in this study contains several genes expressed in ectodermal and epithelial tissues. Importantly, these genes are also associated with skin disorders and ectodermal defects, providing a platform for understanding the biology of human epidermal keratinocyte development under diseased and homeostatic conditions. PMID:25849374
2011-01-01
Background The endometrium is a dynamic tissue whose changes are driven by the ovarian steroidal hormones. Its main function is to provide an adequate substrate for embryo implantation. Using microarray technology, several reports have provided the gene expression patterns of human endometrial tissue during the window of implantation. However it is required that biological connections be made across these genomic datasets to take full advantage of them. The objective of this work was to perform a research synthesis of available gene expression profiles related to acquisition of endometrial receptivity for embryo implantation, in order to gain insights into its molecular basis and regulation. Methods Gene expression datasets were intersected to determine a consensus endometrial receptivity transcript list (CERTL). For this cluster of genes we determined their functional annotations using available web-based databases. In addition, promoter sequences were analyzed to identify putative transcription factor binding sites using bioinformatics tools and determined over-represented features. Results We found 40 up- and 21 down-regulated transcripts in the CERTL. Those more consistently increased were C4BPA, SPP1, APOD, CD55, CFD, CLDN4, DKK1, ID4, IL15 and MAP3K5 whereas the more consistently decreased were OLFM1, CCNB1, CRABP2, EDN3, FGFR1, MSX1 and MSX2. Functional annotation of CERTL showed it was enriched with transcripts related to the immune response, complement activation and cell cycle regulation. Promoter sequence analysis of genes revealed that DNA binding sites for E47, E2F1 and SREBP1 transcription factors were the most consistently over-represented and in both up- and down-regulated genes during the window of implantation. Conclusions Our research synthesis allowed organizing and mining high throughput data to explore endometrial receptivity and focus future research efforts on specific genes and pathways. The discovery of possible new transcription factors orchestrating the CERTL opens new alternatives for understanding gene expression regulation in uterine function. PMID:21272326
Bhardwaj, Ankur R; Joshi, Gopal; Kukreja, Bharti; Malik, Vidhi; Arora, Priyanka; Pandey, Ritu; Shukla, Rohit N; Bankar, Kiran G; Katiyar-Agarwal, Surekha; Goel, Shailendra; Jagannath, Arun; Kumar, Amar; Agarwal, Manu
2015-01-21
Brassica juncea var. Varuna is an economically important oilseed crop of family Brassicaceae which is vulnerable to abiotic stresses at specific stages in its life cycle. Till date no attempts have been made to elucidate genome-wide changes in its transcriptome against high temperature or drought stress. To gain global insights into genes, transcription factors and kinases regulated by these stresses and to explore information on coding transcripts that are associated with traits of agronomic importance, we utilized a combinatorial approach of next generation sequencing and de-novo assembly to discover B. juncea transcriptome associated with high temperature and drought stresses. We constructed and sequenced three transcriptome libraries namely Brassica control (BC), Brassica high temperature stress (BHS) and Brassica drought stress (BDS). More than 180 million purity filtered reads were generated which were processed through quality parameters and high quality reads were assembled de-novo using SOAPdenovo assembler. A total of 77750 unique transcripts were identified out of which 69,245 (89%) were annotated with high confidence. We established a subset of 19110 transcripts, which were differentially regulated by either high temperature and/or drought stress. Furthermore, 886 and 2834 transcripts that code for transcription factors and kinases, respectively, were also identified. Many of these were responsive to high temperature, drought or both stresses. Maximum number of up-regulated transcription factors in high temperature and drought stress belonged to heat shock factors (HSFs) and dehydration responsive element-binding (DREB) families, respectively. We also identified 239 metabolic pathways, which were perturbed during high temperature and drought treatments. Analysis of gene ontologies associated with differentially regulated genes forecasted their involvement in diverse biological processes. Our study provides first comprehensive discovery of B. juncea transcriptome under high temperature and drought stress conditions. Transcriptome resource generated in this study will enhance our understanding on the molecular mechanisms involved in defining the response of B. juncea against two important abiotic stresses. Furthermore this information would benefit designing of efficient crop improvement strategies for tolerance against conditions of high temperature regimes and water scarcity.
Thermodynamics-based models of transcriptional regulation with gene sequence.
Wang, Shuqiang; Shen, Yanyan; Hu, Jinxing
2015-12-01
Quantitative models of gene regulatory activity have the potential to improve our mechanistic understanding of transcriptional regulation. However, the few models available today have been based on simplistic assumptions about the sequences being modeled or heuristic approximations of the underlying regulatory mechanisms. In this work, we have developed a thermodynamics-based model to predict gene expression driven by any DNA sequence. The proposed model relies on a continuous time, differential equation description of transcriptional dynamics. The sequence features of the promoter are exploited to derive the binding affinity which is derived based on statistical molecular thermodynamics. Experimental results show that the proposed model can effectively identify the activity levels of transcription factors and the regulatory parameters. Comparing with the previous models, the proposed model can reveal more biological sense.
Itzhaki, H; Maxson, J M; Woodson, W R
1994-09-13
The increased production of ethylene during carnation petal senescence regulates the transcription of the GST1 gene encoding a subunit of glutathione-S-transferase. We have investigated the molecular basis for this ethylene-responsive transcription by examining the cis elements and trans-acting factors involved in the expression of the GST1 gene. Transient expression assays following delivery of GST1 5' flanking DNA fused to a beta-glucuronidase receptor gene were used to functionally define sequences responsible for ethylene-responsive expression. Deletion analysis of the 5' flanking sequences of GST1 identified a single positive regulatory element of 197 bp between -667 and -470 necessary for ethylene-responsive expression. The sequences within this ethylene-responsive region were further localized to 126 bp between -596 and -470. The ethylene-responsive element (ERE) within this region conferred ethylene-regulated expression upon a minimal cauliflower mosaic virus-35S TATA-box promoter in an orientation-independent manner. Gel electrophoresis mobility-shift assays and DNase I footprinting were used to identify proteins that bind to sequences within the ERE. Nuclear proteins from carnation petals were shown to specifically interact with the 126-bp ERE and the presence and binding of these proteins were independent of ethylene or petal senescence. DNase I footprinting defined DNA sequences between -510 and -488 within the ERE specifically protected by bound protein. An 8-bp sequence (ATTTCAAA) within the protected region shares significant homology with promoter sequences required for ethylene responsiveness from the tomato fruit-ripening E4 gene.
Kwon, Andrew T.; Arenillas, David J.; Hunt, Rebecca Worsley; Wasserman, Wyeth W.
2012-01-01
oPOSSUM-3 is a web-accessible software system for identification of over-represented transcription factor binding sites (TFBS) and TFBS families in either DNA sequences of co-expressed genes or sequences generated from high-throughput methods, such as ChIP-Seq. Validation of the system with known sets of co-regulated genes and published ChIP-Seq data demonstrates the capacity for oPOSSUM-3 to identify mediating transcription factors (TF) for co-regulated genes or co-recovered sequences. oPOSSUM-3 is available at http://opossum.cisreg.ca. PMID:22973536
Kwon, Andrew T; Arenillas, David J; Worsley Hunt, Rebecca; Wasserman, Wyeth W
2012-09-01
oPOSSUM-3 is a web-accessible software system for identification of over-represented transcription factor binding sites (TFBS) and TFBS families in either DNA sequences of co-expressed genes or sequences generated from high-throughput methods, such as ChIP-Seq. Validation of the system with known sets of co-regulated genes and published ChIP-Seq data demonstrates the capacity for oPOSSUM-3 to identify mediating transcription factors (TF) for co-regulated genes or co-recovered sequences. oPOSSUM-3 is available at http://opossum.cisreg.ca.
Goudot, Christel; Etchebest, Catherine
2011-01-01
AP-1 proteins are transcription factors (TFs) that belong to the basic leucine zipper family, one of the largest families of TFs in eukaryotic cells. Despite high homology between their DNA binding domains, these proteins are able to recognize diverse DNA motifs. In yeasts, these motifs are referred as YRE (Yap Response Element) and are either seven (YRE-Overlap) or eight (YRE-Adjacent) base pair long. It has been proposed that the AP-1 DNA binding motif preference relies on a single change in the amino acid sequence of the yeast AP-1 TFs (an arginine in the YRE-O binding factors being replaced by a lysine in the YRE-A binding Yaps). We developed a computational approach to infer condition-specific transcriptional modules associated to the orthologous AP-1 protein Yap1p, Cgap1p and Cap1p, in three yeast species: the model yeast Saccharomyces cerevisiae and two pathogenic species Candida glabrata and Candida albicans. Exploitation of these modules in terms of predictions of the protein/DNA regulatory interactions changed our vision of AP-1 protein evolution. Cis-regulatory motif analyses revealed the presence of a conserved adenine in 5′ position of the canonical YRE sites. While Yap1p, Cgap1p and Cap1p shared a remarkably low number of target genes, an impressive conservation was observed in the YRE sequences identified by Yap1p and Cap1p. In Candida glabrata, we found that Cgap1p, unlike Yap1p and Cap1p, recognizes YRE-O and YRE-A motifs. These findings were supported by structural data available for the transcription factor Pap1p (Schizosaccharomyces pombe). Thus, whereas arginine and lysine substitutions in Cgap1p and Yap1p proteins were reported as responsible for a specific YRE-O or YRE-A preference, our analyses rather suggest that the ancestral yeast AP-1 protein could recognize both YRE-O and YRE-A motifs and that the arginine/lysine exchange is not the only determinant of the specialization of modern Yaps for one motif or another. PMID:21695268
Evolution of transcriptional enhancers and animal diversity
Rubinstein, Marcelo; de Souza, Flávio S. J.
2013-01-01
Deciphering the genetic bases that drive animal diversity is one of the major challenges of modern biology. Although four decades ago it was proposed that animal evolution was mainly driven by changes in cis-regulatory DNA elements controlling gene expression rather than in protein-coding sequences, only now are powerful bioinformatics and experimental approaches available to accelerate studies into how the evolution of transcriptional enhancers contributes to novel forms and functions. In the introduction to this Theme Issue, we start by defining the general properties of transcriptional enhancers, such as modularity and the coexistence of tight sequence conservation with transcription factor-binding site shuffling as different mechanisms that maintain the enhancer grammar over evolutionary time. We discuss past and current methods used to identify cell-type-specific enhancers and provide examples of how enhancers originate de novo, change and are lost in particular lineages. We then focus in the central part of this Theme Issue on analysing examples of how the molecular evolution of enhancers may change form and function. Throughout this introduction, we present the main findings of the articles, reviews and perspectives contributed to this Theme Issue that together illustrate some of the great advances and current frontiers in the field. PMID:24218630
Nadjar-Boger, Elisabeth; Maccatrozzo, Lisa; Radaelli, Giuseppe; Funkenstein, Bruria
2013-02-01
Myostatin (MSTN) is a member of the transforming growth factor-ß superfamily, known as a negative regulator of skeletal muscle development and growth in mammals. In contrast to mammals, fish possess at least two paralogs of MSTN: MSTN-1 and MSTN-2. Here we describe the cloning and sequence analysis of spliced and precursor (unspliced) transcripts as well as the 5' flanking region of MSTN-2 from the marine fish Umbrina cirrosa (ucMSTN-2). In silico analysis revealed numerous putative cis regulatory elements including several E-boxes known as binding sites to myogenic transcription factors. Transient transfection experiments using non-muscle and muscle cell lines showed high transcriptional activity in muscle cells and in differentiated neural cells, in accordance with our previous findings in MSTN-2 promoter from Sparus aurata. Comparative informatics analysis of MSTN-2 from several fish species revealed high conservation of the predicted amino acid sequence as well as the gene structure (exon length) although intron length varied between species. The proximal promoter of MSTN-2 gene was found to be conserved among Perciforms. In conclusion, this study reinforces our conclusion that MSTN-2 promoter is a very strong promoter, especially in muscle cells. In addition, we show that the MSTN-2 gene structure is highly conserved among fishes as is the predicted amino acid sequence of the peptide. Copyright © 2012 Elsevier Inc. All rights reserved.
Mitsuda, Nobutaka; Hisabori, Toru; Takeyasu, Kunio; Sato, Masa H
2004-07-01
A 38-bp pollen-specific cis-acting region of the AVP1 gene is involved in the expression of the Arabidopsis thaliana V-PPase during pollen development. Here, we report the isolation and structural characterization of AtVOZ1 and AtVOZ2, novel transcription factors that bind to the 38-bp cis-acting region of A. thaliana V-PPase gene, AVP1. AtVOZ1 and AtVOZ2 show 53% amino acid sequence similarity. Homologs of AtVOZ1 and AtVOZ2 are found in various vascular plants as well as a moss, Physcomitrella patens. Promoter-beta-glucuronidase reporter analysis shows that AtVOZ1 is specifically expressed in the phloem tissue and AtVOZ2 is strongly expressed in the root. In vivo transient effector-reporter analysis in A. thaliana suspension-cultured cells demonstrates that AtVOZ1 and AtVOZ2 function as transcriptional activators in the Arabidopsis cell. Two conserved regions termed Domain-A and Domain-B were identified from an alignment of AtVOZ proteins and their homologs of O. sativa and P. patens. AtVOZ2 binds as a dimer to the specific palindromic sequence, GCGTNx7ACGC, with Domain-B, which is comprised of a functional novel zinc coordinating motif and a conserved basic region. Domain-B is shown to function as both the DNA-binding and the dimerization domains of AtVOZ2. From highly the conservative nature among all identified VOZ proteins, we conclude that Domain-B is responsible for the DNA binding and dimerization of all VOZ-family proteins and designate it as the VOZ-domain.
Modes of Interaction of KMT2 Histone H3 Lysine 4 Methyltransferase/COMPASS Complexes with Chromatin
Bochyńska, Agnieszka; Lüscher-Firzlaff, Juliane
2018-01-01
Regulation of gene expression is achieved by sequence-specific transcriptional regulators, which convey the information that is contained in the sequence of DNA into RNA polymerase activity. This is achieved by the recruitment of transcriptional co-factors. One of the consequences of co-factor recruitment is the control of specific properties of nucleosomes, the basic units of chromatin, and their protein components, the core histones. The main principles are to regulate the position and the characteristics of nucleosomes. The latter includes modulating the composition of core histones and their variants that are integrated into nucleosomes, and the post-translational modification of these histones referred to as histone marks. One of these marks is the methylation of lysine 4 of the core histone H3 (H3K4). While mono-methylation of H3K4 (H3K4me1) is located preferentially at active enhancers, tri-methylation (H3K4me3) is a mark found at open and potentially active promoters. Thus, H3K4 methylation is typically associated with gene transcription. The class 2 lysine methyltransferases (KMTs) are the main enzymes that methylate H3K4. KMT2 enzymes function in complexes that contain a necessary core complex composed of WDR5, RBBP5, ASH2L, and DPY30, the so-called WRAD complex. Here we discuss recent findings that try to elucidate the important question of how KMT2 complexes are recruited to specific sites on chromatin. This is embedded into short overviews of the biological functions of KMT2 complexes and the consequences of H3K4 methylation. PMID:29498679
Boj, Sylvia F.; Servitja, Joan Marc; Martin, David; Rios, Martin; Talianidis, Iannis; Guigo, Roderic; Ferrer, Jorge
2009-01-01
OBJECTIVE The evolutionary conservation of transcriptional mechanisms has been widely exploited to understand human biology and disease. Recent findings, however, unexpectedly showed that the transcriptional regulators hepatocyte nuclear factor (HNF)-1α and -4α rarely bind to the same genes in mice and humans, leading to the proposal that tissue-specific transcriptional regulation has undergone extensive divergence in the two species. Such observations have major implications for the use of mouse models to understand HNF-1α– and HNF-4α–deficient diabetes. However, the significance of studies that assess binding without considering regulatory function is poorly understood. RESEARCH DESIGN AND METHODS We compared previously reported mouse and human HNF-1α and HNF-4α binding studies with independent binding experiments. We also integrated binding studies with mouse and human loss-of-function gene expression datasets. RESULTS First, we confirmed the existence of species-specific HNF-1α and -4α binding, yet observed incomplete detection of binding in the different datasets, causing an underestimation of binding conservation. Second, only a minor fraction of HNF-1α– and HNF-4α–bound genes were downregulated in the absence of these regulators. This subset of functional targets did not show evidence for evolutionary divergence of binding or binding sequence motifs. Finally, we observed differences between conserved and species-specific binding properties. For example, conserved binding was more frequently located near transcriptional start sites and was more likely to involve multiple binding events in the same gene. CONCLUSIONS Despite evolutionary changes in binding, essential direct transcriptional functions of HNF-1α and -4α are largely conserved between mice and humans. PMID:19188435
Guo, Yong; Qiu, Li-Juan
2013-01-01
The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.
Structure-Function Relationships in Human Testis-determining Factor SRY
Racca, Joseph D.; Chen, Yen-Shan; Maloy, James D.; Wickramasinghe, Nalinda; Phillips, Nelson B.; Weiss, Michael A.
2014-01-01
Human testis determination is initiated by SRY, a Y-encoded architectural transcription factor. Mutations in SRY cause 46 XY gonadal dysgenesis with female somatic phenotype (Swyer syndrome) and confer a high risk of malignancy (gonadoblastoma). Such mutations cluster in the SRY high mobility group (HMG) box, a conserved motif of specific DNA binding and bending. To explore structure-function relationships, we constructed all possible substitutions at a site of clinical mutation (W70L). Our studies thus focused on a core aromatic residue (position 15 of the consensus HMG box) that is invariant among SRY-related HMG box transcription factors (the SOX family) and conserved as aromatic (Phe or Tyr) among other sequence-specific boxes. In a yeast one-hybrid system sensitive to specific SRY-DNA binding, the variant domains exhibited reduced (Phe and Tyr) or absent activity (the remaining 17 substitutions). Representative nonpolar variants with partial or absent activity (Tyr, Phe, Leu, and Ala in order of decreasing side-chain volume) were chosen for study in vitro and in mammalian cell culture. The clinical mutation (Leu) was found to markedly impair multiple biochemical and cellular activities as respectively probed through the following: (i) in vitro assays of specific DNA binding and protein stability, and (ii) cell culture-based assays of proteosomal degradation, nuclear import, enhancer DNA occupancy, and SRY-dependent transcriptional activation. Surprisingly, however, DNA bending is robust to this or the related Ala substitution that profoundly impairs box stability. Together, our findings demonstrate that the folding, trafficking, and gene-regulatory function of SRY requires an invariant aromatic “buttress” beneath its specific DNA-bending surface. PMID:25258310
DOE Office of Scientific and Technical Information (OSTI.GOV)
Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.
2004-08-06
The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayedmore » embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Measuring conservation of sequence features closely linked to function--such as binding-site clustering--makes better use of comparative sequence data than commonly used methods that examine only sequence identity.« less
Watanabe, Keijirou; Hida, Mariko; Sasaki, Takako; Yano, Hiroyuki; Kawano, Kenji; Yoshioka, Hidekatsu; Matsuo, Noritaka
2016-02-01
Type XI collagen is a cartilage-specific extracellular matrix, and is important for collagen fibril formation and skeletal morphogenesis. We have previously reported that NF-Y regulated the proximal promoter activity of the mouse collagen α1(XI) gene (Col11a1) in chondrocytes (Hida et. al. In Vitro Cell. Dev. Biol. Anim. 2014). However, the mechanism of the Col11a1 gene regulation in chondrocytes has not been fully elucidated. In this study, we further characterized the proximal promoter activity of the mouse Col11a1 gene in chondrocytes. Cell transfection experiments with deletion and mutation constructs indicated that the downstream region of the NF-Y binding site (-116 to +1) is also necessary to regulate the proximal promoter activity of the mouse Col11a1 gene. This minimal promoter region has no TATA box and GC-rich sequence; we therefore examined whether the GC-rich sequence (-96 to -67) is necessary for the transcription regulation of the Col11a1 gene. Luciferase assays using a series of mutation constructs exhibited that the GC-rich sequence is a critical element of Col11a1 promoter activity in chondrocytes. Moreover, in silico analysis of this region suggested that one of the most effective candidates was transcription factor Sp1. Consistent with the prediction, overexpression of Sp1 significantly increased the promoter activity. Furthermore, knockdown of Sp1 expression by siRNA transfection suppressed the proximal promoter activity and the expression of endogenous transcript of the mouse Col11a1 gene. Taken together, these results indicate that the transcription factor Sp1 upregulates the proximal promoter activity of the mouse Col11a1 gene in chondrocytes.
Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts.
Sanford, Jeremy R; Wang, Xin; Mort, Matthew; Vanduyn, Natalia; Cooper, David N; Mooney, Sean D; Edenberg, Howard J; Liu, Yunlong
2009-03-01
Metazoan genes are encrypted with at least two superimposed codes: the genetic code to specify the primary structure of proteins and the splicing code to expand their proteomic output via alternative splicing. Here, we define the specificity of a central regulator of pre-mRNA splicing, the conserved, essential splicing factor SFRS1. Cross-linking immunoprecipitation and high-throughput sequencing (CLIP-seq) identified 23,632 binding sites for SFRS1 in the transcriptome of cultured human embryonic kidney cells. SFRS1 was found to engage many different classes of functionally distinct transcripts including mRNA, miRNA, snoRNAs, ncRNAs, and conserved intergenic transcripts of unknown function. The majority of these diverse transcripts share a purine-rich consensus motif corresponding to the canonical SFRS1 binding site. The consensus site was not only enriched in exons cross-linked to SFRS1 in vivo, but was also enriched in close proximity to splice sites. mRNAs encoding RNA processing factors were significantly overrepresented, suggesting that SFRS1 may broadly influence the post-transcriptional control of gene expression in vivo. Finally, a search for the SFRS1 consensus motif within the Human Gene Mutation Database identified 181 mutations in 82 different genes that disrupt predicted SFRS1 binding sites. This comprehensive analysis substantially expands the known roles of human SR proteins in the regulation of a diverse array of RNA transcripts.
Core Promoter Functions in the Regulation of Gene Expression of Drosophila Dorsal Target Genes*
Zehavi, Yonathan; Kuznetsov, Olga; Ovadia-Shochat, Avital; Juven-Gershon, Tamar
2014-01-01
Developmental processes are highly dependent on transcriptional regulation by RNA polymerase II. The RNA polymerase II core promoter is the ultimate target of a multitude of transcription factors that control transcription initiation. Core promoters consist of core promoter motifs, e.g. the initiator, TATA box, and the downstream core promoter element (DPE), which confer specific properties to the core promoter. Here, we explored the importance of core promoter functions in the dorsal-ventral developmental gene regulatory network. This network includes multiple genes that are activated by different nuclear concentrations of Dorsal, an NFκB homolog transcription factor, along the dorsal-ventral axis. We show that over two-thirds of Dorsal target genes contain DPE sequence motifs, which is significantly higher than the proportion of DPE-containing promoters in Drosophila genes. We demonstrate that multiple Dorsal target genes are evolutionarily conserved and functionally dependent on the DPE. Furthermore, we have analyzed the activation of key Dorsal target genes by Dorsal, as well as by another Rel family transcription factor, Relish, and the dependence of their activation on the DPE motif. Using hybrid enhancer-promoter constructs in Drosophila cells and embryo extracts, we have demonstrated that the core promoter composition is an important determinant of transcriptional activity of Dorsal target genes. Taken together, our results provide evidence for the importance of core promoter composition in the regulation of Dorsal target genes. PMID:24634215
Song, Wei; Guo, Jun-Tao
2015-01-01
Transcription factors regulate gene expression through binding to specific DNA sequences. How transcription factors achieve high binding specificity is still not well understood. In this paper, we investigated the role of protein flexibility in protein-DNA-binding specificity by comparative molecular dynamics (MD) simulations. Protein flexibility has been considered as a key factor in molecular recognition, which is intrinsically a dynamic process involving fine structural fitting between binding components. In this study, we performed comparative MD simulations on wild-type and F10V mutant P22 Arc repressor in both free and complex conformations. The F10V mutant has lower DNA-binding specificity though both the bound and unbound main-chain structures between the wild-type and F10V mutant Arc are highly similar. We found that the DNA-binding motif of wild-type Arc is structurally more flexible than the F10V mutant in the unbound state, especially for the six DNA base-contacting residues in each dimer. We demonstrated that the flexible side chains of wild-type Arc lead to a higher DNA-binding specificity through forming more hydrogen bonds with DNA bases upon binding. Our simulations also showed a possible conformational selection mechanism for Arc-DNA binding. These results indicate the important roles of protein flexibility and dynamic properties in protein-DNA-binding specificity.
Aggarwal, Pooja; Das Gupta, Mainak; Joseph, Agnel Praveen; Chatterjee, Nirmalya; Srinivasan, N.; Nath, Utpal
2010-01-01
The TCP transcription factors control multiple developmental traits in diverse plant species. Members of this family share an ∼60-residue-long TCP domain that binds to DNA. The TCP domain is predicted to form a basic helix-loop-helix (bHLH) structure but shares little sequence similarity with canonical bHLH domain. This classifies the TCP domain as a novel class of DNA binding domain specific to the plant kingdom. Little is known about how the TCP domain interacts with its target DNA. We report biochemical characterization and DNA binding properties of a TCP member in Arabidopsis thaliana, TCP4. We have shown that the 58-residue domain of TCP4 is essential and sufficient for binding to DNA and possesses DNA binding parameters comparable to canonical bHLH proteins. Using a yeast-based random mutagenesis screen and site-directed mutants, we identified the residues important for DNA binding and dimer formation. Mutants defective in binding and dimerization failed to rescue the phenotype of an Arabidopsis line lacking the endogenous TCP4 activity. By combining structure prediction, functional characterization of the mutants, and molecular modeling, we suggest a possible DNA binding mechanism for this class of transcription factors. PMID:20363772
The nuclear matrix protein NMP-1 is the transcription factor YY1.
Guo, B; Odgren, P R; van Wijnen, A J; Last, T J; Nickerson, J; Penman, S; Lian, J B; Stein, J L; Stein, G S
1995-01-01
NMP-1 was initially identified as a nuclear matrix-associated DNA-binding factor that exhibits sequence-specific recognition for the site IV regulatory element of a histone H4 gene. This distal promoter domain is a nuclear matrix interaction site. In the present study, we show that NMP-1 is the multifunctional transcription factor YY1. Gel-shift and Western blot analyses demonstrate that NMP-1 is immunoreactive with YY1 antibody. Furthermore, purified YY1 protein specifically recognizes site IV and reconstitutes the NMP-1 complex. Western blot and gel-shift analyses indicate that YY1 is present within the nuclear matrix. In situ immunofluorescence studies show that a significant fraction of YY1 is localized in the nuclear matrix, principally but not exclusively associated with residual nucleoli. Our results confirm that NMP-1/YY1 is a ubiquitous protein that is present in both human cells and in rat osteosarcoma ROS 17/2.8 cells. The finding that NMP-1 is identical to YY1 suggests that this transcriptional regulator may mediate gene-matrix interactions. Our results are consistent with the concept that the nuclear matrix may functionally compartmentalize the eukaryotic nucleus to support regulation of gene expression. Images Fig. 2 Fig. 3 Fig. 4 Fig. 5 Fig. 6 PMID:7479833
Chae, Minho; Danko, Charles G; Kraus, W Lee
2015-07-16
Global run-on coupled with deep sequencing (GRO-seq) provides extensive information on the location and function of coding and non-coding transcripts, including primary microRNAs (miRNAs), long non-coding RNAs (lncRNAs), and enhancer RNAs (eRNAs), as well as yet undiscovered classes of transcripts. However, few computational tools tailored toward this new type of sequencing data are available, limiting the applicability of GRO-seq data for identifying novel transcription units. Here, we present groHMM, a computational tool in R, which defines the boundaries of transcription units de novo using a two state hidden-Markov model (HMM). A systematic comparison of the performance between groHMM and two existing peak-calling methods tuned to identify broad regions (SICER and HOMER) favorably supports our approach on existing GRO-seq data from MCF-7 breast cancer cells. To demonstrate the broader utility of our approach, we have used groHMM to annotate a diverse array of transcription units (i.e., primary transcripts) from four GRO-seq data sets derived from cells representing a variety of different human tissue types, including non-transformed cells (cardiomyocytes and lung fibroblasts) and transformed cells (LNCaP and MCF-7 cancer cells), as well as non-mammalian cells (from flies and worms). As an example of the utility of groHMM and its application to questions about the transcriptome, we show how groHMM can be used to analyze cell type-specific enhancers as defined by newly annotated enhancer transcripts. Our results show that groHMM can reveal new insights into cell type-specific transcription by identifying novel transcription units, and serve as a complete and useful tool for evaluating functional genomic elements in cells.
Preissl, Sebastian; Fang, Rongxin; Huang, Hui; Zhao, Yuan; Raviram, Ramya; Gorkin, David U; Zhang, Yanxiao; Sos, Brandon C; Afzal, Veena; Dickel, Diane E; Kuan, Samantha; Visel, Axel; Pennacchio, Len A; Zhang, Kun; Ren, Bing
2018-03-01
Analysis of chromatin accessibility can reveal transcriptional regulatory sequences, but heterogeneity of primary tissues poses a significant challenge in mapping the precise chromatin landscape in specific cell types. Here we report single-nucleus ATAC-seq, a combinatorial barcoding-assisted single-cell assay for transposase-accessible chromatin that is optimized for use on flash-frozen primary tissue samples. We apply this technique to the mouse forebrain through eight developmental stages. Through analysis of more than 15,000 nuclei, we identify 20 distinct cell populations corresponding to major neuronal and non-neuronal cell types. We further define cell-type-specific transcriptional regulatory sequences, infer potential master transcriptional regulators and delineate developmental changes in forebrain cellular composition. Our results provide insight into the molecular and cellular dynamics that underlie forebrain development in the mouse and establish technical and analytical frameworks that are broadly applicable to other heterogeneous tissues.
Ishii, N; Yamamoto, M; Lahm, H W; Iizumi, S; Yoshihara, F; Nakayama, H; Arisawa, M; Aoki, Y
1997-02-01
Electromobility shift assays with a DNA probe containing the Saccharomyces cerevisiae ENO1 RPG box identified a specific DNA-binding protein in total protein extracts of Candida albicans. The protein, named Rbf1p (RPG-box-binding protein 1), bound to other S. cerevisiae RPG boxes, although the nucleotide recognition profile was not completely the same as that of S. cerevisiae Rap 1p (repressor-activator protein 1), an RPG-box-binding protein. The repetitive sequence of the C. albicans chromosomal telomere also competed with RPG-box binding to Rbf1p. For further analysis, we purified Rbf1p 57,600-fold from C. albicans total protein extracts, raised mAbs against the purified protein and immunologically cloned the gene, whose ORF specified a protein of 527 aa. The bacterially expressed protein showed RPG-box-binding activity with the same profile as that of the purified one. The Rbf1p, containing two glutamine-rich regions that are found in many transcription factors, showed transcriptional activation capability in S. cerevisiae and was predominantly observed in nuclei. These results suggest that Rbf1p is a transcription factor with telomere-binding activity in C. albicans.
Shah, Syed Tariq; Pang, Chaoyou; Fan, Shuli; Song, Meizhen; Arain, Saima; Yu, Shuxun
2013-12-01
NAC (NAM, ATAF, and CUC) is a plant-specific transcription factor family with diverse roles in plant development and stress regulation. In this report, stress-responsive NAC genes (GhNAC8-GhNAC17) isolated from cotton (Gossypium hirsutum L.) were characterised in the context of leaf senescence and stress tolerance. The characterisation of NAC genes during leaf senescence has not yet been reported for cotton. Based on the sequence characterisation, these GhNACs could be classified into three groups belonging to three known NAC sub-families. Their predicted amino acid sequences exhibited similarities to NAC genes from other plant species. Senescent leaves were the sites of maximum expression for all GhNAC genes except GhNAC10 and GhNAC13, which showed maximum expression in fibres, collected from 25 days post anthesis (DPA) plants. The ten GhNAC genes displayed differential expression patterns and levels during natural and induced leaf senescence. Quantitative RT-PCR and promoter analyses suggest that these genes are induced by ABA, ethylene, drought, salinity, cold, heat, and other hormonal treatments. These results support a role for cotton GhNAC genes in transcriptional regulation of leaf senescence, stress tolerance and other developmental stages of cotton. © 2013.
Diversity, expansion, and evolutionary novelty of plant DNA-binding transcription factor families.
Lehti-Shiu, Melissa D; Panchy, Nicholas; Wang, Peipei; Uygun, Sahra; Shiu, Shin-Han
2017-01-01
Plant transcription factors (TFs) that interact with specific sequences via DNA-binding domains are crucial for regulating transcriptional initiation and are fundamental to plant development and environmental response. In addition, expansion of TF families has allowed functional divergence of duplicate copies, which has contributed to novel, and in some cases adaptive, traits in plants. Thus, TFs are central to the generation of the diverse plant species that we see today. Major plant agronomic traits, including those relevant to domestication, have also frequently arisen through changes in TF coding sequence or expression patterns. Here our goal is to provide an overview of plant TF evolution by first comparing the diversity of DNA-binding domains and the sizes of these domain families in plants and other eukaryotes. Because TFs are among the most highly expanded gene families in plants, the birth and death process of TFs as well as the mechanisms contributing to their retention are discussed. We also provide recent examples of how TFs have contributed to novel traits that are important in plant evolution and in agriculture.This article is part of a Special Issue entitled: Plant Gene Regulatory Mechanisms and Networks, edited by Dr. Erich Grotewold and Dr. Nathan Springer. Copyright © 2016 Elsevier B.V. All rights reserved.
Soo Shin, Jane Hae
2017-01-01
Abstract Guanine-rich (G-rich) homopurine–homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA–DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA–DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription ‘bursting’) and may also have practical implications for the design of expression vectors. PMID:28498974
Belotserkovskii, Boris P; Soo Shin, Jane Hae; Hanawalt, Philip C
2017-06-20
Guanine-rich (G-rich) homopurine-homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA-DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA-DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription 'bursting') and may also have practical implications for the design of expression vectors. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Agarkar, Vinod B.; Babayeva, Nigar D.; Rizzino, Angie
2010-10-08
Ets proteins are transcription factors that activate or repress the expression of genes that are involved in various biological processes, including cellular proliferation, differentiation, development, transformation and apoptosis. Like other Ets-family members, Elf3 functions as a sequence-specific DNA-binding transcriptional factor. A mouse Elf3 C-terminal fragment (amino-acid residues 269-371) containing the DNA-binding domain has been crystallized in complex with mouse type II TGF-{beta} receptor promoter (TR-II) DNA. The crystals belonged to space group P2{sub 1}2{sub 1}2{sub 1}, with unit-cell parameters a = 42.66, b = 52, c = 99.78 {angstrom}, and diffracted to a resolution of 2.2 {angstrom}.
Defining Differential Genetic Signatures in CXCR4- and the CCR5-Utilizing HIV-1 Co-Linear Sequences
Aiamkitsumrit, Benjamas; Dampier, Will; Martin-Garcia, Julio; Nonnemacher, Michael R.; Pirrone, Vanessa; Ivanova, Tatyana; Zhong, Wen; Kilareski, Evelyn; Aldigun, Hazeez; Frantz, Brian; Rimbey, Matthew; Wojno, Adam; Passic, Shendra; Williams, Jean W.; Shah, Sonia; Blakey, Brandon; Parikh, Nirzari; Jacobson, Jeffrey M.; Moldover, Brian; Wigdahl, Brian
2014-01-01
The adaptation of human immunodeficiency virus type-1 (HIV-1) to an array of physiologic niches is advantaged by the plasticity of the viral genome, encoded proteins, and promoter. CXCR4-utilizing (X4) viruses preferentially, but not universally, infect CD4+ T cells, generating high levels of virus within activated HIV-1-infected T cells that can be detected in regional lymph nodes and peripheral blood. By comparison, the CCR5-utilizing (R5) viruses have a greater preference for cells of the monocyte-macrophage lineage; however, while R5 viruses also display a propensity to enter and replicate in T cells, they infect a smaller percentage of CD4+ T cells in comparison to X4 viruses. Additionally, R5 viruses have been associated with viral transmission and CNS disease and are also more prevalent during HIV-1 disease. Specific adaptive changes associated with X4 and R5 viruses were identified in co-linear viral sequences beyond the Env-V3. The in silico position-specific scoring matrix (PSSM) algorithm was used to define distinct groups of X4 and R5 sequences based solely on sequences in Env-V3. Bioinformatic tools were used to identify genetic signatures involving specific protein domains or long terminal repeat (LTR) transcription factor sites within co-linear viral protein R (Vpr), trans-activator of transcription (Tat), or LTR sequences that were preferentially associated with X4 or R5 Env-V3 sequences. A number of differential amino acid and nucleotide changes were identified across the co-linear Vpr, Tat, and LTR sequences, suggesting the presence of specific genetic signatures that preferentially associate with X4 or R5 viruses. Investigation of the genetic relatedness between X4 and R5 viruses utilizing phylogenetic analyses of complete sequences could not be used to definitively and uniquely identify groups of R5 or X4 sequences; in contrast, differences in the genetic diversities between X4 and R5 were readily identified within these co-linear sequences in HIV-1-infected patients. PMID:25265194
A single alteration 20 nt 5′ to an editing target inhibits chloroplast RNA editing in vivo
Reed, Martha L.; Peeters, Nemo M.; Hanson, Maureen R.
2001-01-01
Transcripts of typical dicot plant plastid genes undergo C→U RNA editing at approximately 30 locations, but there is no consensus sequence surrounding the C targets of editing. The cis-acting elements required for editing of the C located at tobacco rpoB editing site II were investigated by introducing translatable chimeric minigenes containing sequence –20 to +6 surrounding the C target of editing. When the –20 to +6 sequence specified by the homologous region present in the black pine chloroplast genome was incorporated, virtually no editing of the transcripts occurred in transgenic tobacco plastids. Nucleotides that differ between the black pine and tobacco sequence were tested for their role in C→U editing by designing chimeric genes containing one or more of these divergent nucleotides. Surprisingly, the divergent nucleotide that had the strongest negative effect on editing of the minigene transcript was located –20 nt 5′ to the C target of editing. Expression of transgene transcripts carrying the 27 nt sequence did not affect the editing extent of the endogenous rpoB transcripts, even though the chimeric transcripts were much more abundant than those of the endogenous gene. In plants carrying a 93 nt rpoB editing site sequence, transgene transcripts accumulated to a level three times greater than transgene transcripts in the plants carrying the 27 nt rpoB editing sites and resulted in editing of the endogenous transcripts from 100 to 50%. Both a lower affinity of the 27 nt site for a trans-acting factor and lower abundance of the transcript could explain why expression of minigene transcripts containing the 27 nt sequence did not affect endogenous editing. PMID:11266552
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hudson, William H.; Pickard, Mark R.; de Vera, Ian Mitchelle S.
2014-12-23
The majority of the eukaryotic genome is transcribed, generating a significant number of long intergenic noncoding RNAs (lincRNAs). Although lincRNAs represent the most poorly understood product of transcription, recent work has shown lincRNAs fulfill important cellular functions. In addition to low sequence conservation, poor understanding of structural mechanisms driving lincRNA biology hinders systematic prediction of their function. Here we report the molecular requirements for the recognition of steroid receptors (SRs) by the lincRNA growth arrest-specific 5 (Gas5), which regulates steroid-mediated transcriptional regulation, growth arrest and apoptosis. We identify the functional Gas5-SR interface and generate point mutations that ablate the SR-Gas5more » lincRNA interaction, altering Gas5-driven apoptosis in cancer cell lines. Further, we find that the Gas5 SR-recognition sequence is conserved among haplorhines, with its evolutionary origin as a splice acceptor site. This study demonstrates that lincRNAs can recognize protein targets in a conserved, sequence-specific manner in order to affect critical cell functions.« less
Epigenetic regulatory mechanisms in vertebrate eye development and disease
Cvekl, A; Mitton, KP
2014-01-01
Eukaryotic DNA is organized as a nucleoprotein polymer termed chromatin with nucleosomes serving as its repetitive architectural units. Cellular differentiation is a dynamic process driven by activation and repression of specific sets of genes, partitioning the genome into transcriptionally active and inactive chromatin domains. Chromatin architecture at individual genes/loci may remain stable through cell divisions, from a single mother cell to its progeny during mitosis, and represents an example of epigenetic phenomena. Epigenetics refers to heritable changes caused by mechanisms distinct from the primary DNA sequence. Recent studies have shown a number of links between chromatin structure, gene expression, extracellular signaling, and cellular differentiation during eye development. This review summarizes recent advances in this field, and the relationship between sequence-specific DNA-binding transcription factors and their roles in recruitment of chromatin remodeling enzymes. In addition, lens and retinal differentiation is accompanied by specific changes in the nucleolar organization, expression of non-coding RNAs, and DNA methylation. Epigenetic regulatory mechanisms in ocular tissues represent exciting areas of research that have opened new avenues for understanding normal eye development, inherited eye diseases and eye diseases related to aging and the environment. PMID:20179734
Archaeal TFEα/β is a hybrid of TFIIE and the RNA polymerase III subcomplex hRPC62/39
Blombach, Fabian; Salvadori, Enrico; Fouqueau, Thomas; Yan, Jun; Reimann, Julia; Sheppard, Carol; Smollett, Katherine L; Albers, Sonja V; Kay, Christopher WM; Thalassinos, Konstantinos; Werner, Finn
2015-01-01
Transcription initiation of archaeal RNA polymerase (RNAP) and eukaryotic RNAPII is assisted by conserved basal transcription factors. The eukaryotic transcription factor TFIIE consists of α and β subunits. Here we have identified and characterised the function of the TFIIEβ homologue in archaea that on the primary sequence level is related to the RNAPIII subunit hRPC39. Both archaeal TFEβ and hRPC39 harbour a cubane 4Fe-4S cluster, which is crucial for heterodimerization of TFEα/β and its engagement with the RNAP clamp. TFEα/β stabilises the preinitiation complex, enhances DNA melting, and stimulates abortive and productive transcription. These activities are strictly dependent on the β subunit and the promoter sequence. Our results suggest that archaeal TFEα/β is likely to represent the evolutionary ancestor of TFIIE-like factors in extant eukaryotes. DOI: http://dx.doi.org/10.7554/eLife.08378.001 PMID:26067235
Wang, Zhixiong; Cheng, Yulan; Abraham, John M; Yan, Rong; Liu, Xi; Chen, Wei; Ibrahim, Sariat; Schroth, Gary P; Ke, Xiquan; He, Yulong; Meltzer, Stephen J
2017-10-15
Studies of chromosomal rearrangements and fusion transcripts have elucidated mechanisms of tumorigenesis and led to targeted cancer therapies. This study was aimed at identifying novel fusion transcripts in esophageal adenocarcinoma (EAC). To identify new fusion transcripts associated with EAC, targeted RNA sequencing and polymerase chain reaction (PCR) verification were performed in 40 EACs and matched nonmalignant specimens from the same patients. Genomic PCR and Sanger sequencing were performed to find the breakpoint of fusion genes. Five novel in-frame fusion transcripts were identified and verified in 40 EACs and in a validation cohort of 15 additional EACs (55 patients in all): fibroblast growth factor receptor 2 (FGFR2)-GRB2-associated binding protein 2 (GAB2) in 2 of 55 or 3.6%, Niemann-Pick C1 (NPC1)-maternal embryonic leucine zipper kinase (MELK) in 2 of 55 or 3.6%, ubiquitin-specific peptidase 54 (USP54)-calcium/calmodulin dependent protein kinase II γ (CAMK2G) in 2 of 55 or 3.6%, megakaryoblastic leukemia (translocation) 1 (MKL1)-fibulin 1 (FBLN1) in 1 of 55 or 1.8%, and CCR4-NOT transcription complex subunit 2 (CNOT2)-chromosome 12 open reading frame 49 (C12orf49) in 1 of 55 or 1.8%. A genomic analysis indicated that NPC1-MELK arose from a complex interchromosomal translocation event involving chromosomes 18, 3, and 9 with 3 rearrangement points, and this was consistent with chromoplexy. These data indicate that fusion transcripts occur at a stable frequency in EAC. Furthermore, our results indicate that chromoplexy is an underlying mechanism that generates fusion transcripts in EAC. These and other fusion transcripts merit further study as diagnostic markers and potential therapeutic targets in EAC. Cancer 2017;123:3916-24. © 2017 American Cancer Society. © 2017 American Cancer Society.
Strand-Specific RNA-Seq Analyses of Fruiting Body Development in Coprinopsis cinerea
Muraguchi, Hajime; Umezawa, Kiwamu; Niikura, Mai; ...
2015-10-28
We report that the basidiomycete fungus Coprinopsis cinerea is an important model system for multicellular development. Fruiting bodies of C. cinerea are typical mushrooms, which can be produced synchronously on defined media in the laboratory. To investigate the transcriptome in detail during fruiting body development, high-throughput sequencing (RNA-seq) was performed using cDNA libraries strand-specifically constructed from 13 points (stages/tissues) with two biological replicates. The reads were aligned to 14,245 predicted transcripts, and counted for forward and reverse transcripts. Differentially expressed genes (DEGs) between two adjacent points and between vegetative mycelium and each point were detected by Tag Count Comparison (TCC).more » To validate RNA-seq data, expression levels of selected genes were compared using RPKM values in RNA-seq data and qRT-PCR data, and DEGs detected in microarray data were examined in MA plots of RNA-seq data by TCC. We discuss events deduced from GO analysis of DEGs. In addition, we uncovered both transcription factor candidates and antisense transcripts that are likely to be involved in developmental regulation for fruiting.« less
Strand-Specific RNA-Seq Analyses of Fruiting Body Development in Coprinopsis cinerea
DOE Office of Scientific and Technical Information (OSTI.GOV)
Muraguchi, Hajime; Umezawa, Kiwamu; Niikura, Mai
We report that the basidiomycete fungus Coprinopsis cinerea is an important model system for multicellular development. Fruiting bodies of C. cinerea are typical mushrooms, which can be produced synchronously on defined media in the laboratory. To investigate the transcriptome in detail during fruiting body development, high-throughput sequencing (RNA-seq) was performed using cDNA libraries strand-specifically constructed from 13 points (stages/tissues) with two biological replicates. The reads were aligned to 14,245 predicted transcripts, and counted for forward and reverse transcripts. Differentially expressed genes (DEGs) between two adjacent points and between vegetative mycelium and each point were detected by Tag Count Comparison (TCC).more » To validate RNA-seq data, expression levels of selected genes were compared using RPKM values in RNA-seq data and qRT-PCR data, and DEGs detected in microarray data were examined in MA plots of RNA-seq data by TCC. We discuss events deduced from GO analysis of DEGs. In addition, we uncovered both transcription factor candidates and antisense transcripts that are likely to be involved in developmental regulation for fruiting.« less
Zhang, Ruowen; Wu, Jiahui; Ferrandon, Sylvain; Glowacki, Katie J; Houghton, Janet A
2016-12-06
The GLI genes are transcription factors and in cancers are oncogenes, aberrantly and constitutively activated. GANT61, a specific GLI inhibitor, has induced extensive cytotoxicity in human models of colon cancer. The FOXM1 promoter was determined to be a transcriptional target of GLI1. In HT29 cells, inhibition of GLI1 binding at the GLI consensus sequence by GANT61 led to inhibited binding of Pol II, the pause-release factors DSIF, NELF and p-TEFb. The formation of R-loops (RNA:DNA hybrids, ssDNA), were reduced by GANT61 at the FOXM1 promoter. Pretreatment of HT29 cells with α-amanitin reduced GANT61-induced γH2AX foci. Co-localization of GLI1 and BrdU foci, inhibited by GANT61, indicated GLI1 and DNA replication to be linked. By co-immunoprecipitation and confocal microscopy, GLI1 co-localized with the DNA licensing factors ORC4, CDT1, and MCM2. Significant co-localization of GLI1 and ORC4 was inhibited by GANT61, and enrichment of ORC4 occurred at the GLI binding site in the FOXM1 promoter. CDT1 was found to be a transcription target of GLI1. Overexpression of CDT1 in HT29 and SW480 cells reduced GANT61-induced cell death, gH2AX foci, and cleavage of caspase-3. Data demonstrate involvement of transcription and of DNA replication licensing factors by non-transcriptional and transcriptional mechanisms in the GLI-dependent mechanism of action of GANT61.
Kodadek, T
1995-05-01
Transcription factors generally have only modest specificity for their target sites, yet must find them in a sea of non-specific DNA. Some transcription factors are expressed at very high levels, to ensure that, despite losses to non-specific binding, the promoter is still occupied (the carpet-bombing strategy). Others increase their binding specificity by collaborating with other factors in a variety of ways.
A genomewide survey of basic helix–loop–helix factors in Drosophila
Moore, Adrian W.; Barbel, Sandra; Jan, Lily Yeh; Jan, Yuh Nung
2000-01-01
The basic helix–loop–helix (bHLH) transcription factors play important roles in the specification of tissue type during the development of animals. We have used the information contained in the recently published genomic sequence of Drosophila melanogaster to identify 12 additional bHLH proteins. By sequence analysis we have assigned these proteins to families defined by Atonal, Hairy-Enhancer of Split, Hand, p48, Mesp, MYC/USF, and the bHLH-Per, Arnt, Sim (PAS) domain. In addition, one single protein represents a unique family of bHLH proteins. mRNA in situ analysis demonstrates that the genes encoding these proteins are expressed in several tissue types but are particularly concentrated in the developing nervous system and mesoderm. PMID:10973473
Bochkareva, Aleksandra; Zenkin, Nikolay
2013-01-01
The mechanisms of abortive synthesis and promoter escape during initiation of transcription are poorly understood. Here, we show that, after initiation of RNA synthesis, non-specific interaction of σ70 region 1.2, present in all σ70 family factors, with the non-template strand around position −4 relative to the transcription start site facilitates unwinding of the DNA duplex downstream of the transcription start site. This leads to stabilization of short RNA products and allows their extension, i.e. promoter escape. We show that this activity of σ70 region 1.2 is assisted by the β-lobe domain, but does not involve the β′-rudder or the β′-switch-2, earlier proposed to participate in promoter escape. DNA sequence independence of this function of σ70 region 1.2 suggests that it may be conserved in all σ70 family factors. Our results indicate that the abortive nature of initial synthesis is caused, at least in part, by failure to open the downstream DNA by the β-lobe and σ region 1.2. PMID:23430153
Yuan, Meng; Ke, Yinggen; Huang, Renyan; Ma, Ling; Yang, Zeyu; Chu, Zhaohui; Xiao, Jinghua; Li, Xianghua; Wang, Shiping
2016-07-29
Transcription activator-like effectors (TALEs) are sequence-specific DNA binding proteins found in a range of plant pathogenic bacteria, where they play important roles in host-pathogen interactions. However, it has been unclear how TALEs, after they have been injected into the host cells, activate transcription of host genes required for infection success. Here, we show that the basal transcription factor IIA gamma subunit TFIIAγ5 from rice is a key component for infection by the TALE-carrying bacterium Xanthomonas oryzae pv. oryzae, the causal agent for bacterial blight. Direct interaction of several TALEs with TFIIAγ5 is required for activation of disease susceptibility genes. Conversely, reduced expression of the TFIIAγ5 host gene limits the induction of susceptibility genes and thus decreases bacterial blight symptoms. Suppression or mutation of TFIIAγ5 can also reduce bacterial streak, another devastating disease of rice caused by TALE-carrying X. oryzae pv. oryzicola. These results have important implications for formulating a widely applicable strategy with which to improve resistance of plants to TALE-carrying pathogens.
Dash, P K; Tian, L M; Moore, A N
1998-07-07
Axonal injury increases intracellular Ca2+ and cAMP and has been shown to induce gene expression, which is thought to be a key event for regeneration. Increases in intracellular Ca2+ and/or cAMP can alter gene expression via activation of a family of transcription factors that bind to and modulate the expression of CRE (Ca2+/cAMP response element) sequence-containing genes. We have used Aplysia motor neurons to examine the role of CRE-binding proteins in axonal regeneration after injury. We report that axonal injury increases the binding of proteins to a CRE sequence-containing probe. In addition, Western blot analysis revealed that the level of ApCREB2, a CRE sequence-binding repressor, was enhanced as a result of axonal injury. The sequestration of CRE-binding proteins by microinjection of CRE sequence-containing plasmids enhanced axon collateral formation (both number and length) as compared with control plasmid injections. These findings show that Ca2+/cAMP-mediated gene expression via CRE-binding transcription factors participates in the regeneration of motor neuron axons.
A novel host factor for integration of mycobacteriophage L5
Pedulla, Marisa L.; Lee, Mong Hong; Lever, Dawn C.; Hatfull, Graham F.
1996-01-01
Bacterial integration host factors (IHFs) play central roles in the cellular processes of recombination, DNA replication, transcription, and bacterial pathogenesis. We describe here a novel mycobacterial IHF (mIHF) of Mycobacterium smegmatis and Mycobacterium tuberculosis that stimulates integration of mycobacteriophage L5. mIHF is the product of a single gene and is unrelated at the sequence level to other integration host factors. By itself, mIHF does not bind preferentially to attP DNA, although it significantly alters the pattern of integrase (Int) binding, promoting the formation of specific integrase–mIHF–attP intasome complexes. PMID:8986825
Yadav, Kamlesh Kumar; Rajasekharan, Ram
2016-11-01
PHM8 is a very important enzyme in nonpolar lipid metabolism because of its role in triacylglycerol (TAG) biosynthesis under phosphate stress conditions. It is positively regulated by the PHO4 transcription factor under low phosphate conditions; however, its regulation has not been explored under normal physiological conditions. General control nonderepressible (GCN4), a basic leucine-zipper transcription factor activates the transcription of amino acids, purine biosynthesis genes and many stress response genes under various stress conditions. In this study, we demonstrate that the level of TAG is regulated by the transcription factor GCN4. GCN4 directly binds to its consensus recognition sequence (TGACTC) in the PHM8 promoter and controls its expression. The analysis of cells expressing the P PHM8 -lacZ reporter gene showed that mutations (TGACTC-GGGCCC) in the GCN4-binding sequence caused a significant increase in β-galactosidase activity. Mutation in the GCN4 binding sequence causes an increase in PHM8 expression, lysophosphatidic acid phosphatase activity and TAG level. PHM8, in conjunction with DGA1, a mono- and diacylglycerol transferase, controls the level of TAG. These results revealed that GCN4 negatively regulates PHM8 and that deletion of GCN4 causes de-repression of PHM8, which is responsible for the increased TAG content in gcn4∆ cells.
Atypical archaeal tRNA pyrrolysine transcript behaves towards EF-Tu as a typical elongator tRNA
Théobald-Dietrich, Anne; Frugier, Magali; Giegé, Richard; Rudinger-Thirion, Joëlle
2004-01-01
The newly discovered tRNAPyl is involved in specific incorporation of pyrrolysine in the active site of methylamine methyltransferases in the archaeon Methanosarcina barkeri. In solution probing experiments, a transcript derived from tRNAPyl displays a secondary fold slightly different from the canonical cloverleaf and interestingly similar to that of bovine mitochondrial tRNASer(uga). Aminoacylation of tRNAPyl transcript by a typical class II synthetase, LysRS from yeast, was possible when its amber anticodon CUA was mutated into a lysine UUU anticodon. Hydrolysis protection assays show that lysylated tRNAPyl can be recognized by bacterial elongation factor. This indicates that no antideterminant sequence is present in the body of the tRNAPyl transcript to prevent it from interacting with EF-Tu, in contrast with the otherwise functionally similar tRNASec that mediates selenocysteine incorporation. PMID:14872064
Syndromes associated with Homo sapiens pol II regulatory genes.
Bina, M; Demmon, S; Pares-Matos, E I
2000-01-01
The molecular basis of human characteristics is an intriguing but an unresolved problem. Human characteristics cover a broad spectrum, from the obvious to the abstract. Obvious characteristics may include morphological features such as height, shape, and facial form. Abstract characteristics may be hidden in processes that are controlled by hormones and the human brain. In this review we examine exaggerated characteristics presented as syndromes. Specifically, we focus on human genes that encode transcription factors to examine morphological, immunological, and hormonal anomalies that result from deletion, insertion, or mutation of genes that regulate transcription by RNA polymerase II (the Pol II genes). A close analysis of abnormal phenotypes can give clues into how sequence variations in regulatory genes and changes in transcriptional control may give rise to characteristics defined as complex traits.
Proliferating cell nuclear antigen (Pcna) as a direct downstream target gene of Hoxc8
DOE Office of Scientific and Technical Information (OSTI.GOV)
Min, Hyehyun; Lee, Ji-Yeon; Bok, Jinwoong
2010-02-19
Hoxc8 is a member of Hox family transcription factors that play crucial roles in spatiotemporal body patterning during embryogenesis. Hox proteins contain a conserved 61 amino acid homeodomain, which is responsible for recognition and binding of the proteins onto Hox-specific DNA binding motifs and regulates expression of their target genes. Previously, using proteome analysis, we identified Proliferating cell nuclear antigen (Pcna) as one of the putative target genes of Hoxc8. Here, we asked whether Hoxc8 regulates Pcna expression by directly binding to the regulatory sequence of Pcna. In mouse embryos at embryonic day 11.5, the expression pattern of Pcna wasmore » similar to that of Hoxc8 along the anteroposterior body axis. Moreover, Pcna transcript levels as well as cell proliferation rate were increased by overexpression of Hoxc8 in C3H10T1/2 mouse embryonic fibroblast cells. Characterization of 2.3 kb genomic sequence upstream of Pcna coding region revealed that the upstream sequence contains several Hox core binding sequences and one Hox-Pbx binding sequence. Direct binding of Hoxc8 proteins to the Pcna regulatory sequence was verified by chromatin immunoprecipitation assay. Taken together, our data suggest that Pcna is a direct downstream target of Hoxc8.« less
Suzuki, Harukazu; Forrest, Alistair R R; van Nimwegen, Erik; Daub, Carsten O; Balwierz, Piotr J; Irvine, Katharine M; Lassmann, Timo; Ravasi, Timothy; Hasegawa, Yuki; de Hoon, Michiel J L; Katayama, Shintaro; Schroder, Kate; Carninci, Piero; Tomaru, Yasuhiro; Kanamori-Katayama, Mutsumi; Kubosaki, Atsutaka; Akalin, Altuna; Ando, Yoshinari; Arner, Erik; Asada, Maki; Asahara, Hiroshi; Bailey, Timothy; Bajic, Vladimir B; Bauer, Denis; Beckhouse, Anthony G; Bertin, Nicolas; Björkegren, Johan; Brombacher, Frank; Bulger, Erika; Chalk, Alistair M; Chiba, Joe; Cloonan, Nicole; Dawe, Adam; Dostie, Josee; Engström, Pär G; Essack, Magbubah; Faulkner, Geoffrey J; Fink, J Lynn; Fredman, David; Fujimori, Ko; Furuno, Masaaki; Gojobori, Takashi; Gough, Julian; Grimmond, Sean M; Gustafsson, Mika; Hashimoto, Megumi; Hashimoto, Takehiro; Hatakeyama, Mariko; Heinzel, Susanne; Hide, Winston; Hofmann, Oliver; Hörnquist, Michael; Huminiecki, Lukasz; Ikeo, Kazuho; Imamoto, Naoko; Inoue, Satoshi; Inoue, Yusuke; Ishihara, Ryoko; Iwayanagi, Takao; Jacobsen, Anders; Kaur, Mandeep; Kawaji, Hideya; Kerr, Markus C; Kimura, Ryuichiro; Kimura, Syuhei; Kimura, Yasumasa; Kitano, Hiroaki; Koga, Hisashi; Kojima, Toshio; Kondo, Shinji; Konno, Takeshi; Krogh, Anders; Kruger, Adele; Kumar, Ajit; Lenhard, Boris; Lennartsson, Andreas; Lindow, Morten; Lizio, Marina; Macpherson, Cameron; Maeda, Norihiro; Maher, Christopher A; Maqungo, Monique; Mar, Jessica; Matigian, Nicholas A; Matsuda, Hideo; Mattick, John S; Meier, Stuart; Miyamoto, Sei; Miyamoto-Sato, Etsuko; Nakabayashi, Kazuhiko; Nakachi, Yutaka; Nakano, Mika; Nygaard, Sanne; Okayama, Toshitsugu; Okazaki, Yasushi; Okuda-Yabukami, Haruka; Orlando, Valerio; Otomo, Jun; Pachkov, Mikhail; Petrovsky, Nikolai; Plessy, Charles; Quackenbush, John; Radovanovic, Aleksandar; Rehli, Michael; Saito, Rintaro; Sandelin, Albin; Schmeier, Sebastian; Schönbach, Christian; Schwartz, Ariel S; Semple, Colin A; Sera, Miho; Severin, Jessica; Shirahige, Katsuhiko; Simons, Cas; St Laurent, George; Suzuki, Masanori; Suzuki, Takahiro; Sweet, Matthew J; Taft, Ryan J; Takeda, Shizu; Takenaka, Yoichi; Tan, Kai; Taylor, Martin S; Teasdale, Rohan D; Tegnér, Jesper; Teichmann, Sarah; Valen, Eivind; Wahlestedt, Claes; Waki, Kazunori; Waterhouse, Andrew; Wells, Christine A; Winther, Ole; Wu, Linda; Yamaguchi, Kazumi; Yanagawa, Hiroshi; Yasuda, Jun; Zavolan, Mihaela; Hume, David A; Arakawa, Takahiro; Fukuda, Shiro; Imamura, Kengo; Kai, Chikatoshi; Kaiho, Ai; Kawashima, Tsugumi; Kawazu, Chika; Kitazume, Yayoi; Kojima, Miki; Miura, Hisashi; Murakami, Kayoko; Murata, Mitsuyoshi; Ninomiya, Noriko; Nishiyori, Hiromi; Noma, Shohei; Ogawa, Chihiro; Sano, Takuma; Simon, Christophe; Tagami, Michihira; Takahashi, Yukari; Kawai, Jun; Hayashizaki, Yoshihide
2009-05-01
Using deep sequencing (deepCAGE), the FANTOM4 study measured the genome-wide dynamics of transcription-start-site usage in the human monocytic cell line THP-1 throughout a time course of growth arrest and differentiation. Modeling the expression dynamics in terms of predicted cis-regulatory sites, we identified the key transcription regulators, their time-dependent activities and target genes. Systematic siRNA knockdown of 52 transcription factors confirmed the roles of individual factors in the regulatory network. Our results indicate that cellular states are constrained by complex networks involving both positive and negative regulatory interactions among substantial numbers of transcription factors and that no single transcription factor is both necessary and sufficient to drive the differentiation process.
MorTAL Kombat: the story of defense against TAL effectors through loss-of-susceptibility
Hutin, Mathilde; Pérez-Quintero, Alvaro L.; Lopez, Camilo; Szurek, Boris
2015-01-01
Many plant-pathogenic xanthomonads rely on Transcription Activator-Like (TAL) effectors to colonize their host. This particular family of type III effectors functions as specific plant transcription factors via a programmable DNA-binding domain. Upon binding to the promoters of plant disease susceptibility genes in a sequence-specific manner, the expression of these host genes is induced. However, plants have evolved specific strategies to counter the action of TAL effectors and confer resistance. One mechanism is to avoid the binding of TAL effectors by mutations of their DNA binding sites, resulting in resistance by loss-of-susceptibility. This article reviews our current knowledge of the susceptibility hubs targeted by Xanthomonas TAL effectors, possible evolutionary scenarios for plants to combat the pathogen with loss-of-function alleles, and how this knowledge can be used overall to develop new pathogen-informed breeding strategies and improve crop resistance. PMID:26236326
Donald, R G; Schindler, U; Batschauer, A; Cashmore, A R
1990-01-01
G box and I box sequences of the Arabidopsis thaliana ribulose-bisphosphate-1,5-carboxylase small subunit (RBCS) promoter are required for expression mediated by the Arabidopsis rbcS-1A promoter in transgenic tobacco plants and are bound in vitro by factors from plant nuclear extracts termed GBF and GA-1, respectively. We show here that a -390 to -60 rbcS-1A promoter fragment containing the G box and two I boxes activates transcription from a truncated iso-1-cytochrome c (CYC1) gene promoter in Saccharomyces cerevisiae. Mutagenesis of either the rbcS-1A G box or both I box sequences eliminated the expression mediated by this fragment. When polymerized, I box oligonucleotides were also capable of enhancing expression from the truncated CYC1 promoter. Single-copy G box sequences from the Arabidopsis rbcS-1A, Arabidopsis Adh and tomato rbcS-3A promoters were more potent activators and were used in mobility shift assays to identify a DNA binding activity in yeast functionally similar to GBF. In methylation interference experiments, the binding specificity of the yeast protein was indistinguishable from that obtained with plant nuclear extracts. Images Fig. 3. Fig. 4. Fig. 5. Fig. 6. PMID:2161333
footprintDB: a database of transcription factors with annotated cis elements and binding interfaces.
Sebastian, Alvaro; Contreras-Moreira, Bruno
2014-01-15
Traditional and high-throughput techniques for determining transcription factor (TF) binding specificities are generating large volumes of data of uneven quality, which are scattered across individual databases. FootprintDB integrates some of the most comprehensive freely available libraries of curated DNA binding sites and systematically annotates the binding interfaces of the corresponding TFs. The first release contains 2422 unique TF sequences, 10 112 DNA binding sites and 3662 DNA motifs. A survey of the included data sources, organisms and TF families was performed together with proprietary database TRANSFAC, finding that footprintDB has a similar coverage of multicellular organisms, while also containing bacterial regulatory data. A search engine has been designed that drives the prediction of DNA motifs for input TFs, or conversely of TF sequences that might recognize input regulatory sequences, by comparison with database entries. Such predictions can also be extended to a single proteome chosen by the user, and results are ranked in terms of interface similarity. Benchmark experiments with bacterial, plant and human data were performed to measure the predictive power of footprintDB searches, which were able to correctly recover 10, 55 and 90% of the tested sequences, respectively. Correctly predicted TFs had a higher interface similarity than the average, confirming its diagnostic value. Web site implemented in PHP,Perl, MySQL and Apache. Freely available from http://floresta.eead.csic.es/footprintdb.
Upregulation of Endogenous HMOX1 Expression by a Computer-Designed Artificial Transcription Factor
Guo, Hongfeng; Tian, Yi; Lu, Hai; Wei, Yong; Ying, Dajun
2010-01-01
Heme oxygenase-1 (HO-1) is well known as a cytoprotective factor. Research has revealed that it is a promising therapeutic target for cardiovascular diseases. In the current study, an HMOX1 (HO-1 gene) enhancer-specific artificial zinc-finger protein (AZP) was designed using bioinformatical methods. Then, an artificial transcription factor (ATF) was constructed based on the AZP. In the ATF, the p65 functional domain was used as the effector domain (ED), and a nuclear localization sequence (NLS) was also included. We next analyzed the affinity of the ATF to the HMOX1 enhancer and the effect of the ATF on endogenous HMOX1 expression. The results suggest that the ATF could effectively upregulate endogenous HMOX1 expression in ECV304 cells. With further research, the ATF could be developed as a potential drug for cardiovascular diseases. PMID:20706680
Tissue-selective restriction of RNA editing of CaV1.3 by splicing factor SRSF9.
Huang, Hua; Kapeli, Katannya; Jin, Wenhao; Wong, Yuk Peng; Arumugam, Thiruma Valavan; Koh, Joanne Huifen; Srimasorn, Sumitra; Mallilankaraman, Karthik; Chua, John Jia En; Yeo, Gene W; Soong, Tuck Wah
2018-05-04
Adenosine DeAminases acting on RNA (ADAR) catalyzes adenosine-to-inosine (A-to-I) conversion within RNA duplex structures. While A-to-I editing is often dynamically regulated in a spatial-temporal manner, the mechanisms underlying its tissue-selective restriction remain elusive. We have previously reported that transcripts of voltage-gated calcium channel CaV1.3 are subject to brain-selective A-to-I RNA editing by ADAR2. Here, we show that editing of CaV1.3 mRNA is dependent on a 40 bp RNA duplex formed between exon 41 and an evolutionarily conserved editing site complementary sequence (ECS) located within the preceding intron. Heterologous expression of a mouse minigene that contained the ECS, intermediate intronic sequence and exon 41 with ADAR2 yielded robust editing. Interestingly, editing of CaV1.3 was potently inhibited by serine/arginine-rich splicing factor 9 (SRSF9). Mechanistically, the inhibitory effect of SRSF9 required direct RNA interaction. Selective down-regulation of SRSF9 in neurons provides a basis for the neuron-specific editing of CaV1.3 transcripts.
Falcone, Emmanuela; Grandoni, Luca; Garibaldi, Francesca; Manni, Isabella; Filligoi, Giancarlo; Piaggio, Giulia; Gurtner, Aymone
2016-01-01
miRNAs are potent regulators of gene expression and modulate multiple cellular processes in physiology and pathology. Deregulation of miRNAs expression has been found in various cancer types, thus, miRNAs may be potential targets for cancer therapy. However, the mechanisms through which miRNAs are regulated in cancer remain unclear. Therefore, the identification of transcriptional factor-miRNA crosstalk is one of the most update aspects of the study of miRNAs regulation. In the present study we describe the development of a fast and user-friendly software, named infinity, able to find the presence of DNA matrices, such as binding sequences for transcriptional factors, on ~65kb (kilobase) of 939 human miRNA genomic sequences, simultaneously. Of note, the power of this software has been validated in vivo by performing chromatin immunoprecipitation assays on a subset of new in silico identified target sequences (CCAAT) for the transcription factor NF-Y on colon cancer deregulated miRNA loci. Moreover, for the first time, we have demonstrated that NF-Y, through its CCAAT binding activity, regulates the expression of miRNA-181a, -181b, -21, -17, -130b, -301b in colon cancer cells. The infinity software that we have developed is a powerful tool to underscore new TF/miRNA regulatory networks. Infinity was implemented in pure Java using Eclipse framework, and runs on Linux and MS Windows machine, with MySQL database. The software is freely available on the web at https://github.com/bio-devel/infinity. The website is implemented in JavaScript, PHP and HTML with all major browsers supported.
Cloning and functional analysis of the promoter region of the human Disc large gene.
Cavatorta, Ana Laura; Giri, Adriana A; Banks, Lawrence; Gardiol, Daniela
2008-11-15
A number of studies have demonstrated the involvement of human Disc large (DLG1) in the control of both cell polarity and maintenance of tissue architecture. However, the mechanisms controlling DLG1 transcription are not fully understood. This is relevant since DLG1 is lost in many tumours during the later stages of malignant progression. Therefore, we performed the cloning and functional analysis of a genomic 5' flanking region of the DLG1 open reading frame with promoter activity. We analyzed the activity of a series of 5' deletion constructs of the DLG1 promoter and determined the minimal essential sequences that are required for promoter activity as well as cis-elements that regulate transcription. We found, within the DLG1 promoter sequences, consensus-binding sites for the Snail family of transcription factors that repress the expression of epithelial markers and are up-regulated in a variety of tumours. Snail transcription factors repress the transcriptional activity of the DLG1 promoter and, ectopically expressed Snail proteins bind to the native DLG1 promoter. These data suggest a role for Snail transcription factors in the control of DLG1 expression and provide a basis for understanding the transcriptional regulation of DLG1.
Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome.
Dresch, Jacqueline M; Zellers, Rowan G; Bork, Daniel K; Drewell, Robert A
2016-01-01
A long-standing objective in modern biology is to characterize the molecular components that drive the development of an organism. At the heart of eukaryotic development lies gene regulation. On the molecular level, much of the research in this field has focused on the binding of transcription factors (TFs) to regulatory regions in the genome known as cis-regulatory modules (CRMs). However, relatively little is known about the sequence-specific binding preferences of many TFs, especially with respect to the possible interdependencies between the nucleotides that make up binding sites. A particular limitation of many existing algorithms that aim to predict binding site sequences is that they do not allow for dependencies between nonadjacent nucleotides. In this study, we use a recently developed computational algorithm, MARZ, to compare binding site sequences using 32 distinct models in a systematic and unbiased approach to explore nucleotide dependencies within binding sites for 15 distinct TFs known to be critical to Drosophila development. Our results indicate that many of these proteins have varying levels of nucleotide interdependencies within their DNA recognition sequences, and that, in some cases, models that account for these dependencies greatly outperform traditional models that are used to predict binding sites. We also directly compare the ability of different models to identify the known KRUPPEL TF binding sites in CRMs and demonstrate that a more complex model that accounts for nucleotide interdependencies performs better when compared with simple models. This ability to identify TFs with critical nucleotide interdependencies in their binding sites will lead to a deeper understanding of how these molecular characteristics contribute to the architecture of CRMs and the precise regulation of transcription during organismal development.
Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome
Dresch, Jacqueline M.; Zellers, Rowan G.; Bork, Daniel K.; Drewell, Robert A.
2016-01-01
A long-standing objective in modern biology is to characterize the molecular components that drive the development of an organism. At the heart of eukaryotic development lies gene regulation. On the molecular level, much of the research in this field has focused on the binding of transcription factors (TFs) to regulatory regions in the genome known as cis-regulatory modules (CRMs). However, relatively little is known about the sequence-specific binding preferences of many TFs, especially with respect to the possible interdependencies between the nucleotides that make up binding sites. A particular limitation of many existing algorithms that aim to predict binding site sequences is that they do not allow for dependencies between nonadjacent nucleotides. In this study, we use a recently developed computational algorithm, MARZ, to compare binding site sequences using 32 distinct models in a systematic and unbiased approach to explore nucleotide dependencies within binding sites for 15 distinct TFs known to be critical to Drosophila development. Our results indicate that many of these proteins have varying levels of nucleotide interdependencies within their DNA recognition sequences, and that, in some cases, models that account for these dependencies greatly outperform traditional models that are used to predict binding sites. We also directly compare the ability of different models to identify the known KRUPPEL TF binding sites in CRMs and demonstrate that a more complex model that accounts for nucleotide interdependencies performs better when compared with simple models. This ability to identify TFs with critical nucleotide interdependencies in their binding sites will lead to a deeper understanding of how these molecular characteristics contribute to the architecture of CRMs and the precise regulation of transcription during organismal development. PMID:27330274
Identification of Regulatory Elements That Control PPARγ Expression in Adipocyte Progenitors
Chou, Wen-Ling; Galmozzi, Andrea; Partida, David; Kwan, Kevin; Yeung, Hui; Su, Andrew I.; Saez, Enrique
2013-01-01
Adipose tissue renewal and obesity-driven expansion of fat cell number are dependent on proliferation and differentiation of adipose progenitors that reside in the vasculature that develops in coordination with adipose depots. The transcriptional events that regulate commitment of progenitors to the adipose lineage are poorly understood. Because expression of the nuclear receptor PPARγ defines the adipose lineage, isolation of elements that control PPARγ expression in adipose precursors may lead to discovery of transcriptional regulators of early adipocyte determination. Here, we describe the identification and validation in transgenic mice of 5 highly conserved non-coding sequences from the PPARγ locus that can drive expression of a reporter gene in a manner that recapitulates the tissue-specific pattern of PPARγ expression. Surprisingly, these 5 elements appear to control PPARγ expression in adipocyte precursors that are associated with the vasculature of adipose depots, but not in mature adipocytes. Characterization of these five PPARγ regulatory sequences may enable isolation of the transcription factors that bind these cis elements and provide insight into the molecular regulation of adipose tissue expansion in normal and pathological states. PMID:24009687
CTCF-Mediated Human 3D Genome Architecture Reveals Chromatin Topology for Transcription.
Tang, Zhonghui; Luo, Oscar Junhong; Li, Xingwang; Zheng, Meizhen; Zhu, Jacqueline Jufen; Szalaj, Przemyslaw; Trzaskoma, Pawel; Magalska, Adriana; Wlodarczyk, Jakub; Ruszczycki, Blazej; Michalski, Paul; Piecuch, Emaly; Wang, Ping; Wang, Danjuan; Tian, Simon Zhongyuan; Penrad-Mobayed, May; Sachs, Laurent M; Ruan, Xiaoan; Wei, Chia-Lin; Liu, Edison T; Wilczynski, Grzegorz M; Plewczynski, Dariusz; Li, Guoliang; Ruan, Yijun
2015-12-17
Spatial genome organization and its effect on transcription remains a fundamental question. We applied an advanced chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) strategy to comprehensively map higher-order chromosome folding and specific chromatin interactions mediated by CCCTC-binding factor (CTCF) and RNA polymerase II (RNAPII) with haplotype specificity and nucleotide resolution in different human cell lineages. We find that CTCF/cohesin-mediated interaction anchors serve as structural foci for spatial organization of constitutive genes concordant with CTCF-motif orientation, whereas RNAPII interacts within these structures by selectively drawing cell-type-specific genes toward CTCF foci for coordinated transcription. Furthermore, we show that haplotype variants and allelic interactions have differential effects on chromosome configuration, influencing gene expression, and may provide mechanistic insights into functions associated with disease susceptibility. 3D genome simulation suggests a model of chromatin folding around chromosomal axes, where CTCF is involved in defining the interface between condensed and open compartments for structural regulation. Our 3D genome strategy thus provides unique insights in the topological mechanism of human variations and diseases. Copyright © 2015 Elsevier Inc. All rights reserved.
oPOSSUM: integrated tools for analysis of regulatory motif over-representation
Ho Sui, Shannan J.; Fulton, Debra L.; Arenillas, David J.; Kwon, Andrew T.; Wasserman, Wyeth W.
2007-01-01
The identification of over-represented transcription factor binding sites from sets of co-expressed genes provides insights into the mechanisms of regulation for diverse biological contexts. oPOSSUM, an internet-based system for such studies of regulation, has been improved and expanded in this new release. New features include a worm-specific version for investigating binding sites conserved between Caenorhabditis elegans and C. briggsae, as well as a yeast-specific version for the analysis of co-expressed sets of Saccharomyces cerevisiae genes. The human and mouse applications feature improvements in ortholog mapping, sequence alignments and the delineation of multiple alternative promoters. oPOSSUM2, introduced for the analysis of over-represented combinations of motifs in human and mouse genes, has been integrated with the original oPOSSUM system. Analysis using user-defined background gene sets is now supported. The transcription factor binding site models have been updated to include new profiles from the JASPAR database. oPOSSUM is available at http://www.cisreg.ca/oPOSSUM/ PMID:17576675
Kim, Kyuhyung; Kim, Rinho; Sengupta, Piali
2010-01-01
The differentiated features of postmitotic neurons are dictated by the expression of specific transcription factors. The mechanisms by which the precise spatiotemporal expression patterns of these factors are regulated are poorly understood. In C. elegans, the ceh-36 Otx homeobox gene is expressed in the AWC sensory neurons throughout postembryonic development, and regulates terminal differentiation of this neuronal subtype. Here, we show that the HMX/NKX homeodomain protein MLS-2 regulates ceh-36 expression specifically in the AWC neurons. Consequently, the AWC neurons fail to express neuron type-specific characteristics in mls-2 mutants. mls-2 is expressed transiently in postmitotic AWC neurons, and directly initiates ceh-36 expression. CEH-36 subsequently interacts with a distinct site in its cis-regulatory sequences to maintain its own expression, and also directly regulates the expression of AWC-specific terminal differentiation genes. We also show that MLS-2 acts in additional neuron types to regulate their development and differentiation. Our analysis describes a transcription factor cascade that defines the unique postmitotic characteristics of a sensory neuron subtype, and provides insights into the spatiotemporal regulatory mechanisms that generate functional diversity in the sensory nervous system. PMID:20150279
Verzi, Michael P.; Shin, Hyunjin; San Roman, Adrianna K.
2013-01-01
Tissue-specific gene expression requires modulation of nucleosomes, allowing transcription factors to occupy cis elements that are accessible only in selected tissues. Master transcription factors control cell-specific genes and define cellular identities, but it is unclear if they possess special abilities to regulate cell-specific chromatin and if such abilities might underlie lineage determination and maintenance. One prevailing view is that several transcription factors enable chromatin access in combination. The homeodomain protein CDX2 specifies the embryonic intestinal epithelium, through unknown mechanisms, and partners with transcription factors such as HNF4A in the adult intestine. We examined enhancer chromatin and gene expression following Cdx2 or Hnf4a excision in mouse intestines. HNF4A loss did not affect CDX2 binding or chromatin, whereas CDX2 depletion modified chromatin significantly at CDX2-bound enhancers, disrupted HNF4A occupancy, and abrogated expression of neighboring genes. Thus, CDX2 maintains transcription-permissive chromatin, illustrating a powerful and dominant effect on enhancer configuration in an adult tissue. Similar, hierarchical control of cell-specific chromatin states is probably a general property of master transcription factors. PMID:23129810
Therapeutic potential of Mediator complex subunits in metabolic diseases.
Ranjan, Amol; Ansari, Suraiya A
2018-01-01
The multisubunit Mediator is an evolutionary conserved transcriptional coregulatory complex in eukaryotes. It is needed for the transcriptional regulation of gene expression in general as well as in a gene specific manner. Mediator complex subunits interact with different transcription factors as well as components of RNA Pol II transcription initiation complex and in doing so act as a bridge between gene specific transcription factors and general Pol II transcription machinery. Specific interaction of various Mediator subunits with nuclear receptors (NRs) and other transcription factors involved in metabolism has been reported in different studies. Evidences indicate that ligand-activated NRs recruit Mediator complex for RNA Pol II-dependent gene transcription. These NRs have been explored as therapeutic targets in different metabolic diseases; however, they show side-effects as targets due to their overlapping involvement in different signaling pathways. Here we discuss the interaction of various Mediator subunits with transcription factors involved in metabolism and whether specific interaction of these transcription factors with Mediator subunits could be potentially utilized as therapeutic strategy in a variety of metabolic diseases. Copyright © 2017 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.
SOX2 as a New Regulator of HPV16 Transcription.
Martínez-Ramírez, Imelda; Del-Castillo-Falconi, Víctor; Mitre-Aguilar, Irma B; Amador-Molina, Alfredo; Carrillo-García, Adela; Langley, Elizabeth; Zentella-Dehesa, Alejandro; Soto-Reyes, Ernesto; García-Carrancá, Alejandro; Herrera, Luis A; Lizano, Marcela
2017-07-05
Persistent infections with high-risk human papillomavirus (HPV) constitute the main risk factor for cervical cancer development. HPV16 is the most frequent type associated to squamous cell carcinomas (SCC), followed by HPV18. The long control region (LCR) in the HPV genome contains the replication origin and sequences recognized by cellular transcription factors (TFs) controlling viral transcription. Altered expression of E6 and E7 viral oncogenes, modulated by the LCR, causes modifications in cellular pathways such as proliferation, leading to malignant transformation. The aim of this study was to identify specific TFs that could contribute to the modulation of high-risk HPV transcriptional activity, related to the cellular histological origin. We identified sex determining region Y (SRY)-box 2 (SOX2) response elements present in HPV16-LCR. SOX2 binding to the LCR was demonstrated by in vivo and in vitro assays. The overexpression of this TF repressed HPV16-LCR transcriptional activity, as shown through reporter plasmid assays and by the down-regulation of endogenous HPV oncogenes. Site-directed mutagenesis revealed that three putative SOX2 binding sites are involved in the repression of the LCR activity. We propose that SOX2 acts as a transcriptional repressor of HPV16-LCR, decreasing the expression of E6 and E7 oncogenes in a SCC context.
TEFM is a potent stimulator of mitochondrial transcription elongation in vitro
Posse, Viktor; Shahzad, Saba; Falkenberg, Maria; Hällberg, B. Martin; Gustafsson, Claes M.
2015-01-01
A single-subunit RNA polymerase, POLRMT, transcribes the mitochondrial genome in human cells. Recently, a factor termed as the mitochondrial transcription elongation factor, TEFM, was shown to stimulate transcription elongation in vivo, but its effect in vitro was relatively modest. In the current work, we have isolated active TEFM in recombinant form and used a reconstituted in vitro transcription system to characterize its activities. We show that TEFM strongly promotes POLRMT processivity as it dramatically stimulates the formation of longer transcripts. TEFM also abolishes premature transcription termination at conserved sequence block II, an event that has been linked to primer formation during initiation of mtDNA synthesis. We show that POLRMT pauses at a wide range of sites in a given DNA sequence. In the absence of TEFM, this leads to termination; however, the presence of TEFM abolishes this effect and aids POLRMT in continuation of transcription. Further, we show that TEFM substantially increases the POLRMT affinity to an elongation-like DNA:RNA template. In combination with previously published in vivo observations, our data establish TEFM as an essential component of the mitochondrial transcription machinery. PMID:25690892
A transcription factor collective defines the HSN serotonergic neuron regulatory landscape.
Lloret-Fernández, Carla; Maicas, Miren; Mora-Martínez, Carlos; Artacho, Alejandro; Jimeno-Martín, Ángela; Chirivella, Laura; Weinberg, Peter; Flames, Nuria
2018-03-22
Cell differentiation is controlled by individual transcription factors (TFs) that together activate a selection of enhancers in specific cell types. How these combinations of TFs identify and activate their target sequences remains poorly understood. Here, we identify the cis -regulatory transcriptional code that controls the differentiation of serotonergic HSN neurons in Caenorhabditis elegans . Activation of the HSN transcriptome is directly orchestrated by a collective of six TFs. Binding site clusters for this TF collective form a regulatory signature that is sufficient for de novo identification of HSN neuron functional enhancers. Among C. elegans neurons, the HSN transcriptome most closely resembles that of mouse serotonergic neurons. Mouse orthologs of the HSN TF collective also regulate serotonergic differentiation and can functionally substitute for their worm counterparts which suggests deep homology. Our results identify rules governing the regulatory landscape of a critically important neuronal type in two species separated by over 700 million years. © 2018, Lloret-Fernández et al.
A Recommendation for Naming Transcription Factor Proteins in the Grasses
USDA-ARS?s Scientific Manuscript database
Transcription factors are central for the exquisite temporal and spatial expression patterns of many genes. These proteins are characterized by their ability to be tethered to particular regulatory sequences in the genes that they control. While many other proteins participate in the regulation of g...
Grau, Jan; Reschke, Maik; Erkes, Annett; Streubel, Jana; Morgan, Richard D.; Wilson, Geoffrey G.; Koebnik, Ralf; Boch, Jens
2016-01-01
Transcription activator-like effectors (TALEs) are virulence factors, produced by the bacterial plant-pathogen Xanthomonas, that function as gene activators inside plant cells. Although the contribution of individual TALEs to infectivity has been shown, the specific roles of most TALEs, and the overall TALE diversity in Xanthomonas spp. is not known. TALEs possess a highly repetitive DNA-binding domain, which is notoriously difficult to sequence. Here, we describe an improved method for characterizing TALE genes by the use of PacBio sequencing. We present ‘AnnoTALE’, a suite of applications for the analysis and annotation of TALE genes from Xanthomonas genomes, and for grouping similar TALEs into classes. Based on these classes, we propose a unified nomenclature for Xanthomonas TALEs that reveals similarities pointing to related functionalities. This new classification enables us to compare related TALEs and to identify base substitutions responsible for the evolution of TALE specificities. PMID:26876161
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kvon, Evgeny Z.; Kamneva, Olga K.; Melo, Uirá S.
The evolution of body shape is thought to be tightly coupled to changes in regulatory sequences, but specific molecular events associated with major morphological transitions in vertebrates have remained elusive. In this paper, we identified snake-specific sequence changes within an otherwise highly conserved long-range limb enhancer of Sonic hedgehog (Shh). Transgenic mouse reporter assays revealed that the in vivo activity pattern of the enhancer is conserved across a wide range of vertebrates, including fish, but not in snakes. Genomic substitution of the mouse enhancer with its human or fish ortholog results in normal limb development. In contrast, replacement with snake orthologsmore » caused severe limb reduction. Synthetic restoration of a single transcription factor binding site lost in the snake lineage reinstated full in vivo function to the snake enhancer. Our results demonstrate changes in a regulatory sequence associated with a major body plan transition and highlight the role of enhancers in morphological evolution.« less
DNA sequence+shape kernel enables alignment-free modeling of transcription factor binding.
Ma, Wenxiu; Yang, Lin; Rohs, Remo; Noble, William Stafford
2017-10-01
Transcription factors (TFs) bind to specific DNA sequence motifs. Several lines of evidence suggest that TF-DNA binding is mediated in part by properties of the local DNA shape: the width of the minor groove, the relative orientations of adjacent base pairs, etc. Several methods have been developed to jointly account for DNA sequence and shape properties in predicting TF binding affinity. However, a limitation of these methods is that they typically require a training set of aligned TF binding sites. We describe a sequence + shape kernel that leverages DNA sequence and shape information to better understand protein-DNA binding preference and affinity. This kernel extends an existing class of k-mer based sequence kernels, based on the recently described di-mismatch kernel. Using three in vitro benchmark datasets, derived from universal protein binding microarrays (uPBMs), genomic context PBMs (gcPBMs) and SELEX-seq data, we demonstrate that incorporating DNA shape information improves our ability to predict protein-DNA binding affinity. In particular, we observe that (i) the k-spectrum + shape model performs better than the classical k-spectrum kernel, particularly for small k values; (ii) the di-mismatch kernel performs better than the k-mer kernel, for larger k; and (iii) the di-mismatch + shape kernel performs better than the di-mismatch kernel for intermediate k values. The software is available at https://bitbucket.org/wenxiu/sequence-shape.git. rohs@usc.edu or william-noble@uw.edu. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
Widespread evidence of cooperative DNA binding by transcription factors in Drosophila development
Kazemian, Majid; Pham, Hannah; Wolfe, Scot A.; Brodsky, Michael H.; Sinha, Saurabh
2013-01-01
Regulation of eukaryotic gene transcription is often combinatorial in nature, with multiple transcription factors (TFs) regulating common target genes, often through direct or indirect mutual interactions. Many individual examples of cooperative binding by directly interacting TFs have been identified, but it remains unclear how pervasive this mechanism is during animal development. Cooperative TF binding should be manifest in genomic sequences as biased arrangements of TF-binding sites. Here, we explore the extent and diversity of such arrangements related to gene regulation during Drosophila embryogenesis. We used the DNA-binding specificities of 322 TFs along with chromatin accessibility information to identify enriched spacing and orientation patterns of TF-binding site pairs. We developed a new statistical approach for this task, specifically designed to accurately assess inter-site spacing biases while accounting for the phenomenon of homotypic site clustering commonly observed in developmental regulatory regions. We observed a large number of short-range distance preferences between TF-binding site pairs, including examples where the preference depends on the relative orientation of the binding sites. To test whether these binding site patterns reflect physical interactions between the corresponding TFs, we analyzed 27 TF pairs whose binding sites exhibited short distance preferences. In vitro protein–protein binding experiments revealed that >65% of these TF pairs can directly interact with each other. For five pairs, we further demonstrate that they bind cooperatively to DNA if both sites are present with the preferred spacing. This study demonstrates how DNA-binding motifs can be used to produce a comprehensive map of sequence signatures for different mechanisms of combinatorial TF action. PMID:23847101
Widespread evidence of cooperative DNA binding by transcription factors in Drosophila development.
Kazemian, Majid; Pham, Hannah; Wolfe, Scot A; Brodsky, Michael H; Sinha, Saurabh
2013-09-01
Regulation of eukaryotic gene transcription is often combinatorial in nature, with multiple transcription factors (TFs) regulating common target genes, often through direct or indirect mutual interactions. Many individual examples of cooperative binding by directly interacting TFs have been identified, but it remains unclear how pervasive this mechanism is during animal development. Cooperative TF binding should be manifest in genomic sequences as biased arrangements of TF-binding sites. Here, we explore the extent and diversity of such arrangements related to gene regulation during Drosophila embryogenesis. We used the DNA-binding specificities of 322 TFs along with chromatin accessibility information to identify enriched spacing and orientation patterns of TF-binding site pairs. We developed a new statistical approach for this task, specifically designed to accurately assess inter-site spacing biases while accounting for the phenomenon of homotypic site clustering commonly observed in developmental regulatory regions. We observed a large number of short-range distance preferences between TF-binding site pairs, including examples where the preference depends on the relative orientation of the binding sites. To test whether these binding site patterns reflect physical interactions between the corresponding TFs, we analyzed 27 TF pairs whose binding sites exhibited short distance preferences. In vitro protein-protein binding experiments revealed that >65% of these TF pairs can directly interact with each other. For five pairs, we further demonstrate that they bind cooperatively to DNA if both sites are present with the preferred spacing. This study demonstrates how DNA-binding motifs can be used to produce a comprehensive map of sequence signatures for different mechanisms of combinatorial TF action.
Eren, M; Painter, C A; Gleaves, L A; Schoenhard, J A; Atkinson, J B; Brown, N J; Vaughan, D E
2003-11-01
Numerous studies have described regulatory factors and sequences that control transcriptional responses in vitro. However, there is a paucity of information on the qualitative and quantitative regulation of heterologous promoters using transgenic strategies. In order to investigate the physiological regulation of human plasminogen activator inhibitor type-1 (hPAI-1) expression in vivo compared to murine PAI-1 (mPAI-1) and to test the physiological relevance of regulatory mechanisms described in vitro, we generated transgenic mice expressing enhanced green fluorescent protein (EGFP) driven by the proximal -2.9 kb of the hPAI-1 promoter. Transgenic animals were treated with Ang II, TGF-beta1 and lipopolysaccharide (LPS) to compare the relative activation of the human and murine PAI-1 promoters. Ang II increased EGFP expression most effectively in brain, kidney and spleen, while mPAI-1 expression was quantitatively enhanced most prominently in heart and spleen. TGF-beta1 failed to induce activation of the hPAI-1 promoter but potently stimulated mPAI-1 in kidney and spleen. LPS administration triggered robust expression of mPAI-1 in liver, kidney, pancreas, spleen and lung, while EGFP was induced only modestly in heart and kidney. These results indicate that the transcriptional response of the endogenous mPAI-1 promoter varies widely in terms of location and magnitude of response to specific stimuli. Moreover, the physiological regulation of PAI-1 expression likely involves a complex interaction of transcription factors and DNA sequences that are not adequately replicated by in vitro functional studies focused on the proximal -2.9 kb promoter.
Beltran, A S; Graves, L M; Blancafort, P
2014-01-01
Basal-like breast tumors are aggressive cancers associated with high proliferation and metastasis. Chemotherapy is currently the only treatment option; however, resistance often occurs resulting in recurrence and patient death. Some extremely aggressive cancers are also associated with hypoxia, inflammation and high leukocyte infiltration. Herein, we discovered that the neural-specific transcription factor, Engrailed 1 (EN1), is exclusively overexpressed in these tumors. Short hairpin RNA (shRNA)-mediated knockdown of EN1 triggered potent and selective cell death. In contrast, ectopic overexpression of EN1 in normal cells activated survival pathways and conferred resistance to chemotherapeutic agents. Exogenous expression of EN1 cDNA reprogrammed the breast epithelial cells toward a long-lived, neural-like phenotype displaying dopaminergic markers. Gene expression microarrays demonstrated that the EN1 cDNA altered transcription of a high number of inflammatory molecules, notably chemokines and chemokine receptors, which could mediate prosurvival pathways. To block EN1 function, we engineered synthetic interference peptides (iPeps) comprising the EN1-specific sequences that mediate essential protein-protein interactions necessary for EN1 function and an N-terminal cell-penetrating peptide/nuclear localization sequence. These EN1-iPeps rapidly mediated a strong apoptotic response in tumor cells overexpressing EN1, with no toxicity to normal or non EN1-expressing cells. Delivery of EN1-iPeps into basal-like cancer cells significantly decreased the fifty percent inhibitory concentrations (IC50) of chemotherapeutic drugs routinely used to treat breast cancer. Lastly, matrix-assisted laser desorption/ionization-time of flight mass spectrometry and immunoprecipitation assays demonstrated that EN1-iPeps captured targets involved in transcriptional and post-transcriptional regulation. Importantly, the EN1-iPeps bound the glutamyl-prolyl tRNA synthetase (EPRS) target, which has been associated with the transcript-specific translational control of inflammatory proteins and activation of amino-acid stress pathways. This work unveils EN1 as an activator of intrinsic inflammatory pathways associated with prosurvival in basal-like breast cancer. We further build upon these results and describe the engineering of iPeps targeting EN1 (EN1-iPeps) as a novel and selective therapeutic strategy to combat these lethal forms of breast cancer. PMID:24141779
Chirgadze, Y N; Boshkova, E A; Polozov, R V; Sivozhelezov, V S; Dzyabchenko, A V; Kuzminsky, M B; Stepanenko, V A; Ivanov, V V
2018-01-07
The mouse factor Zif268, known also as early growth response protein EGR-1, is a classical representative for the Cys2His2 transcription factor family. It is required for binding the RNA polymerase with operator dsDNA to initialize the transcription process. We have shown that only in this family of total six Zn-finger protein families the Zn complex plays a significant role in the protein-DNA binding. Electrostatic feature of this complex in the binding of factor Zif268 from Mus musculus with operator DNA has been considered. The factor consists of three similar Zn-finger units which bind with triplets of coding DNA. Essential contacts of the factor with the DNA phosphates are formed by three conservative His residues, one in each finger. We describe here the results of calculations of the electrostatic potentials for the Zn-Cys2His2 complex, Zn-finger unit 1, and the whole transcription factor. The potential of Zif268 has a positive area on the factor surface, and it corresponds exactly to the binding sites of each of Zn-finger units. The main part of these areas is determined by conservative His residues, which form contacts with the DNA phosphate groups. Our result shows that the electrostatic positive potential of this histidine residue is enhanced due to the Zn complex. The other contacts of the Zn-finger with DNA are related to nucleotide bases, and they are responsible for the sequence-specific binding with DNA. This result may be extended to all other members of the Cys2His2 transcription factor family.
Buitrago-Flórez, Francisco Javier; Restrepo, Silvia; Riaño-Pachón, Diego Mauricio
2014-01-01
The biological diversity among Stramenopiles is striking; they range from large multicellular seaweeds to tiny unicellular species, they embrace many ecologically important autothrophic (e.g., diatoms, brown algae), and heterotrophic (e.g., oomycetes) groups. Transcription factors (TFs) and other transcription regulators (TRs) regulate spatial and temporal gene expression. A plethora of transcriptional regulatory proteins have been identified and classified into families on the basis of sequence similarity. The purpose of this work is to identify the TF and TR complement in diverse species belonging to Stramenopiles in order to understand how these regulators may contribute to their observed diversity. We identified and classified 63 TF and TR families in 11 species of Stramenopiles. In some species we found gene families with high relative importance. Taking into account the 63 TF and TR families identified, 28 TF and TR families were established to be positively correlated with specific traits like number of predicted proteins, number of flagella and number of cell types during the life cycle. Additionally, we found gains and losses in TF and TR families specific to some species and clades, as well as, two families with high abundance specific to the autotrophic species and three families with high abundance specific to the heterotropic species. For the first time, there is a systematic search of TF and TR families in Stramenopiles. The attempts to uncover relationships between these families and the complexity of this group may be of great impact, considering that there are several important pathogens of plants and animals, as well as, important species involved in carbon cycling. Specific TF and TR families identified in this work appear to be correlated with particular traits in the Stramenopiles group and may be correlated with the high complexity and diversity in Stramenopiles. PMID:25375671
Buitrago-Flórez, Francisco Javier; Restrepo, Silvia; Riaño-Pachón, Diego Mauricio
2014-01-01
The biological diversity among Stramenopiles is striking; they range from large multicellular seaweeds to tiny unicellular species, they embrace many ecologically important autothrophic (e.g., diatoms, brown algae), and heterotrophic (e.g., oomycetes) groups. Transcription factors (TFs) and other transcription regulators (TRs) regulate spatial and temporal gene expression. A plethora of transcriptional regulatory proteins have been identified and classified into families on the basis of sequence similarity. The purpose of this work is to identify the TF and TR complement in diverse species belonging to Stramenopiles in order to understand how these regulators may contribute to their observed diversity. We identified and classified 63 TF and TR families in 11 species of Stramenopiles. In some species we found gene families with high relative importance. Taking into account the 63 TF and TR families identified, 28 TF and TR families were established to be positively correlated with specific traits like number of predicted proteins, number of flagella and number of cell types during the life cycle. Additionally, we found gains and losses in TF and TR families specific to some species and clades, as well as, two families with high abundance specific to the autotrophic species and three families with high abundance specific to the heterotropic species. For the first time, there is a systematic search of TF and TR families in Stramenopiles. The attempts to uncover relationships between these families and the complexity of this group may be of great impact, considering that there are several important pathogens of plants and animals, as well as, important species involved in carbon cycling. Specific TF and TR families identified in this work appear to be correlated with particular traits in the Stramenopiles group and may be correlated with the high complexity and diversity in Stramenopiles.
Microbial metatranscriptomics in a permanent marine oxygen minimum zone.
Stewart, Frank J; Ulloa, Osvaldo; DeLong, Edward F
2012-01-01
Simultaneous characterization of taxonomic composition, metabolic gene content and gene expression in marine oxygen minimum zones (OMZs) has potential to broaden perspectives on the microbial and biogeochemical dynamics in these environments. Here, we present a metatranscriptomic survey of microbial community metabolism in the Eastern Tropical South Pacific OMZ off northern Chile. Community RNA was sampled in late austral autumn from four depths (50, 85, 110, 200 m) extending across the oxycline and into the upper OMZ. Shotgun pyrosequencing of cDNA yielded 180,000 to 550,000 transcript sequences per depth. Based on functional gene representation, transcriptome samples clustered apart from corresponding metagenome samples from the same depth, highlighting the discrepancies between metabolic potential and actual transcription. BLAST-based characterizations of non-ribosomal RNA sequences revealed a dominance of genes involved with both oxidative (nitrification) and reductive (anammox, denitrification) components of the marine nitrogen cycle. Using annotations of protein-coding genes as proxies for taxonomic affiliation, we observed depth-specific changes in gene expression by key functional taxonomic groups. Notably, transcripts most closely matching the genome of the ammonia-oxidizing archaeon Nitrosopumilus maritimus dominated the transcriptome in the upper three depths, representing one in five protein-coding transcripts at 85 m. In contrast, transcripts matching the anammox bacterium Kuenenia stuttgartiensis dominated at the core of the OMZ (200 m; 1 in 12 protein-coding transcripts). The distribution of N. maritimus-like transcripts paralleled that of transcripts matching ammonia monooxygenase genes, which, despite being represented by both bacterial and archaeal sequences in the community DNA, were dominated (> 99%) by archaeal sequences in the RNA, suggesting a substantial role for archaeal nitrification in the upper OMZ. These data, as well as those describing other key OMZ metabolic processes (e.g. sulfur oxidation), highlight gene-specific expression patterns in the context of the entire community transcriptome, as well as identify key functional groups for taxon-specific genomic profiling. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.
Kinetic Competition between Elongation Rate and Binding of NELF Controls Promoter Proximal Pausing
Li, Jian; Liu, Yingyun; Rhee, Ho Sung; Ghosh, Saikat Kumar B.; Bai, Lu; Pugh, B. Franklin; Gilmour, David S.
2013-01-01
Summary Pausing of RNA polymerase II (Pol II) 20-60 bp downstream of transcription start sites is a major checkpoint during transcription in animal cells. Mechanisms that control pausing are largely unknown. We developed permanganate-ChIP-seq to evaluate the state of Pol II at promoters throughout the Drosophila genome, and a biochemical system that reconstitutes promoter-proximal pausing to define pausing mechanisms. Stable open complexes of Pol II are largely absent from the transcription start sites of most mRNA genes, but are present at snRNA genes and the highly transcribed heat shock genes following their induction. The location of the pause is influenced by the timing between when NELF loads onto Pol II and how fast Pol II escapes the promoter region. Our biochemical analysis reveals that the sequence-specific transcription factor, GAF, orchestrates efficient pausing by recruiting NELF to promoters before transcription initiation and by assisting in loading NELF onto Pol II after initiation. PMID:23746353
Cortijo, Sandra; Charoensawan, Varodom; Roudier, François; Wigge, Philip A
2018-01-01
Chromatin immunoprecipitation combined with next-generation sequencing (ChIP-seq) is a powerful technique to investigate in vivo transcription factor (TF) binding to DNA, as well as chromatin marks. Here we provide a detailed protocol for all the key steps to perform ChIP-seq in Arabidopsis thaliana roots, also working on other A. thaliana tissues and in most non-ligneous plants. We detail all steps from material collection, fixation, chromatin preparation, immunoprecipitation, library preparation, and finally computational analysis based on a combination of publicly available tools.
Wang, Boya; Guo, Xiaohua; Wang, Chen; Ma, Jieyu; Niu, Fangfang; Zhang, Hanfeng; Yang, Bo; Liang, Wanwan; Han, Feng; Jiang, Yuan-Qing
2015-03-01
NAC transcription factors are plant-specific and play important roles in plant development processes, response to biotic and abiotic cues and hormone signaling. However, to date, little is known about the NAC genes in canola (or oilseed rape, Brassica napus L.). In this study, a total of 60 NAC genes were identified from canola through a systematical analysis and mining of expressed sequence tags. Among these, the cDNA sequences of 41 NAC genes were successfully cloned. The translated protein sequences of canola NAC genes with the NAC genes from representative species were phylogenetically clustered into three major groups and multiple subgroups. The transcriptional activities of these BnaNAC proteins were assayed in yeast. In addition, by quantitative real-time RT-PCR, we further observed that some of these BnaNACs were regulated by different hormone stimuli or abiotic stresses. Interestingly, we successfully identified two novel BnaNACs, BnaNAC19 and BnaNAC82, which could elicit hypersensitive response-like cell death when expressed in Nicotiana benthamiana leaves, which was mediated by accumulation of reactive oxygen species. Overall, our work has laid a solid foundation for further characterization of this important NAC gene family in canola.
The Clostridium sporulation programs: diversity and preservation of endospore differentiation.
Al-Hinai, Mohab A; Jones, Shawn W; Papoutsakis, Eleftherios T
2015-03-01
Bacillus and Clostridium organisms initiate the sporulation process when unfavorable conditions are detected. The sporulation process is a carefully orchestrated cascade of events at both the transcriptional and posttranslational levels involving a multitude of sigma factors, transcription factors, proteases, and phosphatases. Like Bacillus genomes, sequenced Clostridium genomes contain genes for all major sporulation-specific transcription and sigma factors (spo0A, sigH, sigF, sigE, sigG, and sigK) that orchestrate the sporulation program. However, recent studies have shown that there are substantial differences in the sporulation programs between the two genera as well as among different Clostridium species. First, in the absence of a Bacillus-like phosphorelay system, activation of Spo0A in Clostridium organisms is carried out by a number of orphan histidine kinases. Second, downstream of Spo0A, the transcriptional and posttranslational regulation of the canonical set of four sporulation-specific sigma factors (σ(F), σ(E), σ(G), and σ(K)) display different patterns, not only compared to Bacillus but also among Clostridium organisms. Finally, recent studies demonstrated that σ(K), the last sigma factor to be activated according to the Bacillus subtilis model, is involved in the very early stages of sporulation in Clostridium acetobutylicum, C. perfringens, and C. botulinum as well as in the very late stages of spore maturation in C. acetobutylicum. Despite profound differences in initiation, propagation, and orchestration of expression of spore morphogenetic components, these findings demonstrate not only the robustness of the endospore sporulation program but also the plasticity of the program to generate different complex phenotypes, some apparently regulated at the epigenetic level. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
The Clostridium Sporulation Programs: Diversity and Preservation of Endospore Differentiation
Al-Hinai, Mohab A.; Jones, Shawn W.
2015-01-01
SUMMARY Bacillus and Clostridium organisms initiate the sporulation process when unfavorable conditions are detected. The sporulation process is a carefully orchestrated cascade of events at both the transcriptional and posttranslational levels involving a multitude of sigma factors, transcription factors, proteases, and phosphatases. Like Bacillus genomes, sequenced Clostridium genomes contain genes for all major sporulation-specific transcription and sigma factors (spo0A, sigH, sigF, sigE, sigG, and sigK) that orchestrate the sporulation program. However, recent studies have shown that there are substantial differences in the sporulation programs between the two genera as well as among different Clostridium species. First, in the absence of a Bacillus-like phosphorelay system, activation of Spo0A in Clostridium organisms is carried out by a number of orphan histidine kinases. Second, downstream of Spo0A, the transcriptional and posttranslational regulation of the canonical set of four sporulation-specific sigma factors (σF, σE, σG, and σK) display different patterns, not only compared to Bacillus but also among Clostridium organisms. Finally, recent studies demonstrated that σK, the last sigma factor to be activated according to the Bacillus subtilis model, is involved in the very early stages of sporulation in Clostridium acetobutylicum, C. perfringens, and C. botulinum as well as in the very late stages of spore maturation in C. acetobutylicum. Despite profound differences in initiation, propagation, and orchestration of expression of spore morphogenetic components, these findings demonstrate not only the robustness of the endospore sporulation program but also the plasticity of the program to generate different complex phenotypes, some apparently regulated at the epigenetic level. PMID:25631287
The Clostridium Sporulation Programs: Diversity and Preservation of Endospore Differentiation
Al-Hinai, Mohab A.; Jones, Shawn W.; Papoutsakis, Eleftherios T.
2015-01-28
Bacillus and Clostridium organisms initiate the sporulation process when unfavorable conditions are detected. The sporulation process is a carefully orchestrated cascade of events at both the transcriptional and posttranslational levels involving a multitude of sigma factors, transcription factors, proteases, and phosphatases. Like Bacillus genomes, sequenced Clostridium genomes contain genes for all major sporulation-specific transcription and sigma factors (spo0A, sigH, sigF, sigE, sigG, and sigK) that orchestrate the sporulation program. However, recent studies have shown that there are substantial differences in the sporulation programs between the two genera as well as among different Clostridium species. First, in the absence of amore » Bacillus-like phosphorelay system, activation of Spo0A in Clostridium organisms is carried out by a number of orphan histidine kinases. Second, downstream of Spo0A, the transcriptional and posttranslational regulation of the canonical set of four sporulation-specific sigma factors (σF, σE, σG, and σK) display different patterns, not only compared to Bacillus but also among Clostridium organisms. Finally, recent studies demonstrated that σK, the last sigma factor to be activated according to the Bacillus subtilis model, is involved in the very early stages of sporulation in Clostridium acetobutylicum, C. perfringens, and C. botulinum as well as in the very late stages of spore maturation in C. acetobutylicum. Despite profound differences in initiation, propagation, and orchestration of expression of spore morphogenetic components, these findings demonstrate not only the robustness of the endospore sporulation program but also the plasticity of the program to generate different complex phenotypes, some apparently regulated at the epigenetic level.« less