Inferring genome-wide interplay landscape between DNA methylation and transcriptional regulation.
Tang, Binhua; Wang, Xin
2015-01-01
DNA methylation and transcriptional regulation play important roles in cancer cell development and differentiation processes. Based on the currently available cell line profiling information from the ENCODE Consortium, we propose a Bayesian inference model to infer and construct a genome-wide interaction landscape between DNA methylation and transcriptional regulation, which sheds light on the underlying complex functional mechanisms that are important in the context of human cancer and disease. For the first time, we select the profiling information for all currently available cell lines (>=20) and transcription factors (>=80) from the ENCODE Consortium portal. Through the integration of those genome-wide profiling sources, our genome-wide analysis detects multiple functional loci of interest, and indicates that DNA methylation is cell- and region-specific, due to its interplay with transcription regulatory activities. We validate our analysis results for the detected genomic loci using the corresponding RNA-sequencing technique. Our results provide novel and meaningful insights into the interplay mechanisms of transcriptional regulation and gene expression for human cancer and disease studies.
Huang, Lulin; Cheng, Tingcai; Xu, Pingzhen; Fang, Ting; Xia, Qingyou
2012-01-01
Transcription factors are present in all living organisms, and play vital roles in a wide range of biological processes. Studies of transcription factors will help reveal the complex regulation mechanism of organisms. So far, hundreds of domains have been identified that show transcription factor activity. Here, 281 reported transcription factor domains were used as seeds to search the transcription factors in genomes of Bombyx mori L. (Lepidoptera: Bombycidae) and four other model insects. Overall, 666 transcription factors including 36 basal factors and 630 other factors were identified in B. mori genome, which accounted for 4.56% of its genome. The silkworm transcription factors' expression profiles were investigated in relation to multiple tissues, developmental stages, sexual dimorphism, and responses to oral infection by pathogens and direct bacterial injection. These all provided rich clues for revealing the transcriptional regulation mechanism of silkworm organ differentiation, growth and development, sexual dimorphism, and response to pathogen infection. PMID:22943524
Transcription facilitated genome-wide recruitment of topoisomerase I and DNA gyrase.
Ahmed, Wareed; Sala, Claudia; Hegde, Shubhada R; Jha, Rajiv Kumar; Cole, Stewart T; Nagaraja, Valakunja
2017-05-01
Movement of the transcription machinery along a template alters DNA topology resulting in the accumulation of supercoils in DNA. The positive supercoils generated ahead of transcribing RNA polymerase (RNAP) and the negative supercoils accumulating behind impose severe topological constraints impeding transcription process. Previous studies have implied the role of topoisomerases in the removal of torsional stress and the maintenance of template topology but the in vivo interaction of functionally distinct topoisomerases with heterogeneous chromosomal territories is not deciphered. Moreover, how the transcription-induced supercoils influence the genome-wide recruitment of DNA topoisomerases remains to be explored in bacteria. Using ChIP-Seq, we show the genome-wide occupancy profile of both topoisomerase I and DNA gyrase in conjunction with RNAP in Mycobacterium tuberculosis taking advantage of minimal topoisomerase representation in the organism. The study unveils the first in vivo genome-wide interaction of both the topoisomerases with the genomic regions and establishes that transcription-induced supercoils govern their recruitment at genomic sites. Distribution profiles revealed co-localization of RNAP and the two topoisomerases on the active transcriptional units (TUs). At a given locus, topoisomerase I and DNA gyrase were localized behind and ahead of RNAP, respectively, correlating with the twin-supercoiled domains generated. The recruitment of topoisomerases was higher at the genomic loci with higher transcriptional activity and/or at regions under high torsional stress compared to silent genomic loci. Importantly, the occupancy of DNA gyrase, sole type II topoisomerase in Mtb, near the Ter domain of the Mtb chromosome validates its function as a decatenase.
Transcription facilitated genome-wide recruitment of topoisomerase I and DNA gyrase
Ahmed, Wareed; Sala, Claudia; Hegde, Shubhada R.; Jha, Rajiv Kumar
2017-01-01
Movement of the transcription machinery along a template alters DNA topology resulting in the accumulation of supercoils in DNA. The positive supercoils generated ahead of transcribing RNA polymerase (RNAP) and the negative supercoils accumulating behind impose severe topological constraints impeding transcription process. Previous studies have implied the role of topoisomerases in the removal of torsional stress and the maintenance of template topology but the in vivo interaction of functionally distinct topoisomerases with heterogeneous chromosomal territories is not deciphered. Moreover, how the transcription-induced supercoils influence the genome-wide recruitment of DNA topoisomerases remains to be explored in bacteria. Using ChIP-Seq, we show the genome-wide occupancy profile of both topoisomerase I and DNA gyrase in conjunction with RNAP in Mycobacterium tuberculosis taking advantage of minimal topoisomerase representation in the organism. The study unveils the first in vivo genome-wide interaction of both the topoisomerases with the genomic regions and establishes that transcription-induced supercoils govern their recruitment at genomic sites. Distribution profiles revealed co-localization of RNAP and the two topoisomerases on the active transcriptional units (TUs). At a given locus, topoisomerase I and DNA gyrase were localized behind and ahead of RNAP, respectively, correlating with the twin-supercoiled domains generated. The recruitment of topoisomerases was higher at the genomic loci with higher transcriptional activity and/or at regions under high torsional stress compared to silent genomic loci. Importantly, the occupancy of DNA gyrase, sole type II topoisomerase in Mtb, near the Ter domain of the Mtb chromosome validates its function as a decatenase. PMID:28463980
Transcription regulation by distal enhancers
Stadhouders, Ralph; van den Heuvel, Anita; Kolovos, Petros; Jorna, Ruud; Leslie, Kris; Grosveld, Frank; Soler, Eric
2012-01-01
Genome-wide chromatin profiling efforts have shown that enhancers are often located at large distances from gene promoters within the noncoding genome. Whereas enhancers can stimulate transcription initiation by communicating with promoters via chromatin looping mechanisms, we propose that enhancers may also stimulate transcription elongation by physical interactions with intronic elements. We review here recent findings derived from the study of the hematopoietic system. PMID:22771987
2014-01-01
Background: Cis-regulatory modules (CRMs), the DNA sequences required for regulating gene expression, play a central role in biological research on transcriptional regulation in metazoan species. At present, the systematic understanding of CRMs still relies mainly on computational methods because experimental methods are time-consuming and small in scale. However, the accuracy and reliability of different CRM prediction tools are still unclear. Without comparative cross-analysis of the results and combined consideration of additional experimental information, there is no easy way to assess the confidence of the predicted CRMs. This limits the genome-wide understanding of CRMs. Description: It is known that transcription factor binding and epigenetic profiles tend to determine the functions of CRMs in gene transcriptional regulation. Thus, integrating genome-wide epigenetic profiles with systematically predicted CRMs can greatly help researchers evaluate the prediction confidence and decipher the possible transcriptional regulatory functions of these potential CRMs. However, these data are still fragmentary in the literature. Here we performed computational genome-wide screening for potential CRMs using different prediction tools and constructed a pioneer database, cisMEP (cis-regulatory module epigenetic profile database), to integrate these computationally identified CRMs with genomic epigenetic profile data. cisMEP collects literature-curated TFBS location data and nine genres of epigenetic data for assessing the confidence of these potential CRMs and deciphering possible CRM functionality. Conclusions: cisMEP aims to provide a user-friendly interface for researchers to assess the confidence of different potential CRMs and to understand the functions of CRMs through experimentally identified epigenetic profiles. The deposited potential CRMs and experimental epigenetic profiles for confidence assessment provide experimentally testable hypotheses for the molecular mechanisms of metazoan gene regulation. We believe that the information deposited in cisMEP will greatly facilitate the comparative usage of different CRM prediction tools and will help biologists to study the modular regulatory mechanisms between different TFs and their target genes. PMID:25521507
Transcription regulation by distal enhancers: who's in the loop?
Stadhouders, Ralph; van den Heuvel, Anita; Kolovos, Petros; Jorna, Ruud; Leslie, Kris; Grosveld, Frank; Soler, Eric
2012-01-01
Genome-wide chromatin profiling efforts have shown that enhancers are often located at large distances from gene promoters within the noncoding genome. Whereas enhancers can stimulate transcription initiation by communicating with promoters via chromatin looping mechanisms, we propose that enhancers may also stimulate transcription elongation by physical interactions with intronic elements. We review here recent findings derived from the study of the hematopoietic system.
NASA Technical Reports Server (NTRS)
Stolc, Viktor; Samanta, Manoj Pratim; Tongprasit, Waraporn; Marshall, Wallace F.
2005-01-01
The important role that cilia and flagella play in human disease creates an urgent need to identify genes involved in ciliary assembly and function. The strong and specific induction of flagellar-coding genes during flagellar regeneration in Chlamydomonas reinhardtii suggests that transcriptional profiling of such cells would reveal new flagella-related genes. We have conducted a genome-wide analysis of RNA transcript levels during flagellar regeneration in Chlamydomonas by using DNA oligonucleotide microarrays produced by a maskless photolithography method, with unique probe sequences for all exons of the 19,803 predicted genes. This analysis represents a previously uncharacterized whole-genome transcriptional activity profiling study in this important model organism. Analysis of strongly induced genes reveals a large set of known flagellar components and also identifies a number of important disease-related proteins as being involved with cilia and flagella, including the zebrafish polycystic kidney genes Qilin, Reptin, and Pontin, as well as the testis-expressed tubby-like protein TULP2.
Decoherence in yeast cell populations and its implications for genome-wide expression noise.
Briones, M R S; Bosco, F
2009-01-20
Gene expression "noise" is commonly defined as the stochastic variation of gene expression levels in different cells of the same population under identical growth conditions. Here, we tested whether this "noise" is amplified with time, as a consequence of decoherence in global gene expression profiles (genome-wide microarrays) of synchronized cells. The stochastic component of transcription causes fluctuations that tend to be amplified as time progresses, leading to a decay of correlations of expression profiles, in perfect analogy with elementary relaxation processes. Measuring decoherence, defined here as a decay in the auto-correlation function of yeast genome-wide expression profiles, we found a slowdown in the decay of correlations, opposite to what would be expected if, as in mixing systems, correlations decay exponentially as the equilibrium state is reached. Our results indicate that the populational variation in gene expression (noise) is a consequence of temporal decoherence, in which the slow decay of correlations is a signature of strong interdependence of the transcription dynamics of different genes.
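The decay of correlations described in this abstract can be illustrated with a short sketch. It assumes, purely for illustration, that decoherence is tracked as the Pearson correlation between the expression profile at the first time point and the profile at each later time point; the abstract does not give the exact estimator, and the function names and toy numbers below are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length expression vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def correlation_decay(profiles):
    """Correlate the first profile with the profile at every time point.

    profiles[t] is the genome-wide expression vector measured at time t.
    This estimator is an assumption made for illustration only.
    """
    reference = profiles[0]
    return [pearson(reference, profile) for profile in profiles]


# Toy data (hypothetical): four genes measured at three time points.
toy_profiles = [
    [1.0, 2.0, 3.0, 4.0],
    [1.0, 2.0, 4.0, 3.0],
    [4.0, 3.0, 2.0, 1.0],
]
decay = correlation_decay(toy_profiles)
```

Plotting such a curve against time for synchronized cells would show whether correlations fall off exponentially or, as the abstract reports, more slowly.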
A transcriptional profile of the decidua in preeclampsia
LØSET, Mari; MUNDAL, Siv B.; JOHNSON, Matthew P.; FENSTAD, Mona H.; FREED, Katherine A.; LIAN, Ingrid A.; EIDE, Irina P.; BJØRGE, Line; BLANGERO, John; MOSES, Eric K.; AUSTGULEN, Rigmor
2010-01-01
OBJECTIVE: To obtain insight into possible mechanisms underlying preeclampsia using genome-wide transcriptional profiling in decidua basalis. STUDY DESIGN: Genome-wide transcriptional profiling was performed on decidua basalis tissue from preeclamptic (n = 37) and normal pregnancies (n = 58). Differentially expressed genes were identified and merged into canonical pathways and networks. RESULTS: Of the 26,504 expressed transcripts detected, 455 were differentially expressed (P < 0.05, FDR P < 0.1). Both novel (ARL5B, SLITRK4) and previously reported preeclampsia-associated genes (PLA2G7, HMOX1) were identified. Pathway analysis revealed that ‘tryptophan metabolism’, ‘endoplasmic reticulum stress’, ‘linoleic acid metabolism’, ‘notch signaling’, ‘fatty acid metabolism’, ‘arachidonic acid metabolism’ and ‘NRF2-mediated oxidative stress response’ were overrepresented canonical pathways. CONCLUSION: In the present study, single genes, canonical pathways and gene-gene networks that are likely to play an important role in the pathogenesis of preeclampsia have been identified. Future functional studies are needed to achieve a greater understanding of the mechanisms involved. PMID:20934677
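The abstract reports a nominal P threshold together with an FDR threshold but does not say which FDR procedure was applied. As a hedged illustration only, the sketch below uses the Benjamini-Hochberg step-up adjustment, a common choice; the function name and the toy p-values are hypothetical and are not data from the study.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order.

    Assumption: BH is used here only as a common FDR procedure; the
    source abstract does not name the method it applied.
    """
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Toy p-values (hypothetical) for four transcripts.
toy_p = [0.01, 0.04, 0.03, 0.5]
toy_adjusted = benjamini_hochberg(toy_p)
# Transcripts passing both thresholds quoted in the abstract.
selected = [i for i, (p, q) in enumerate(zip(toy_p, toy_adjusted))
            if p < 0.05 and q < 0.1]
```

Applying both filters, as in the last line, mirrors the two thresholds quoted in the abstract: a transcript is kept only if its raw P and its adjusted value are both below their cut-offs.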
Lohmann, Ingrid
2012-01-01
In multi-cellular organisms, spatiotemporal activity of cis-regulatory DNA elements depends on their occupancy by different transcription factors (TFs). In recent years, genome-wide ChIP-on-Chip, ChIP-Seq and DamID assays have been extensively used to unravel the combinatorial interaction of TFs with cis-regulatory modules (CRMs) in the genome. Even though genome-wide binding profiles are increasingly becoming available for different TFs, single TF binding profiles are in most cases not sufficient for dissecting complex regulatory networks. Thus, potent computational tools detecting statistically significant and biologically relevant TF-motif co-occurrences in genome-wide datasets are essential for analyzing context-dependent transcriptional regulation. We have developed COPS (Co-Occurrence Pattern Search), a new bioinformatics tool based on a combination of association rules and Markov chain models, which detects co-occurring TF binding sites (BSs) on genomic regions of interest. COPS scans DNA sequences for frequent motif patterns using a Frequent-Pattern tree based data mining approach, which allows efficient performance of the software with respect to both data structure and implementation speed, in particular when mining large datasets. Since transcriptional gene regulation very often relies on the formation of regulatory protein complexes mediated by closely adjoining TF binding sites on CRMs, COPS additionally detects preferred short distance between co-occurring TF motifs. The performance of our software with respect to biological significance was evaluated using three published datasets containing genomic regions that are independently bound by several TFs involved in a defined biological process. In sum, COPS is a fast, efficient and user-friendly tool mining statistically and biologically significant TFBS co-occurrences and therefore allows the identification of TFs that combinatorially regulate gene expression. PMID:23272209
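The idea of mining co-occurring binding sites can be sketched in a few lines. This is a deliberately naive pair-counting illustration, not the COPS implementation: it omits the Frequent-Pattern tree, the Markov chain background model and the distance analysis mentioned in the abstract. The motif labels, the function name and the support threshold are hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_motif_pairs(regions, min_support):
    """Return motif pairs found together in at least min_support of regions.

    regions is a list of motif-name collections, one per genomic region.
    Support is the fraction of regions that contain both motifs.
    """
    counts = Counter()
    for motifs in regions:
        # Count each unordered pair at most once per region.
        for pair in combinations(sorted(set(motifs)), 2):
            counts[pair] += 1
    total = len(regions)
    return {pair: n / total for pair, n in counts.items()
            if n / total >= min_support}


# Toy input (hypothetical motif labels A, B, C across four regions).
toy_regions = [["A", "B", "C"], ["A", "B"], ["B", "C"], ["A"]]
pairs = frequent_motif_pairs(toy_regions, min_support=0.5)
```

A real tool would additionally test each pair against a background model before calling the co-occurrence statistically significant, which is the role the abstract assigns to the Markov chain component.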
Chacon, Diego; Beck, Dominik; Perera, Dilmi; Wong, Jason W H; Pimanda, John E
2014-01-01
The BloodChIP database (http://www.med.unsw.edu.au/CRCWeb.nsf/page/BloodChIP) supports exploration and visualization of combinatorial transcription factor (TF) binding at a particular locus in human CD34-positive and other normal and leukaemic cells or retrieval of target gene sets for user-defined combinations of TFs across one or more cell types. Increasing numbers of genome-wide TF binding profiles are being added to public repositories, and this trend is likely to continue. For the power of these data sets to be fully harnessed by experimental scientists, there is a need for these data to be placed in context and easily accessible for downstream applications. To this end, we have built a user-friendly database that has at its core the genome-wide binding profiles of seven key haematopoietic TFs in human stem/progenitor cells. These binding profiles are compared with binding profiles in normal differentiated and leukaemic cells. We have integrated these TF binding profiles with chromatin marks and expression data in normal and leukaemic cell fractions. All queries can be exported into external sites to construct TF-gene and protein-protein networks and to evaluate the association of genes with cellular processes and tissue expression.
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-01-01
Objective: This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Methods: Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Results: Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification (P=0.009) or deletion (P=0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis indicated that cell cycle pathways were significantly (P=1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Conclusion: Chromosomal CNVs may contribute to their transcript expression in cervical cancer. PMID:29312578
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma.
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-12-12
This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification (P=0.009) or deletion (P=0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis indicated that cell cycle pathways were significantly (P=1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Chromosomal CNVs may contribute to their transcript expression in cervical cancer.
Regulatory variation: an emerging vantage point for cancer biology.
Li, Luolan; Lorzadeh, Alireza; Hirst, Martin
2014-01-01
Transcriptional regulation involves complex and interdependent interactions of noncoding and coding regions of the genome with proteins that interact and modify them. Genetic variation/mutation in coding and noncoding regions of the genome can drive aberrant transcription and disease. In spite of accounting for nearly 98% of the genome comparatively little is known about the contribution of noncoding DNA elements to disease. Genome-wide association studies of complex human diseases including cancer have revealed enrichment for variants in the noncoding genome. A striking finding of recent cancer genome re-sequencing efforts has been the previously underappreciated frequency of mutations in epigenetic modifiers across a wide range of cancer types. Taken together these results point to the importance of dysregulation in transcriptional regulatory control in genesis of cancer. Powered by recent technological advancements in functional genomic profiling, exploration of normal and transformed regulatory networks will provide novel insight into the initiation and progression of cancer and open new windows to future prognostic and diagnostic tools. © 2013 Wiley Periodicals, Inc.
Genome wide interactions of wild-type and activator bypass forms of σ54.
Schaefer, Jorrit; Engl, Christoph; Zhang, Nan; Lawton, Edward; Buck, Martin
2015-09-03
Enhancer-dependent transcription involving the promoter specificity factor σ54 is widely distributed amongst bacteria and commonly associated with cell envelope function. For transcription initiation, σ54-RNA polymerase yields open promoter complexes through its remodelling by cognate AAA+ ATPase activators. Since activators can be bypassed in vitro, bypass transcription in vivo could be a source of emergent gene expression along evolutionary pathways yielding new control networks and transcription patterns. At a single test promoter in vivo bypass transcription was not observed. We now use genome-wide transcription profiling, genome-wide mutagenesis and gene over-expression strategies in Escherichia coli, to (i) scope the range of bypass transcription in vivo and (ii) identify genes which might alter bypass transcription in vivo. We find little evidence for pervasive bypass transcription in vivo with only a small subset of σ54 promoters functioning without activators. Results also suggest no one gene limits bypass transcription in vivo, arguing bypass transcription is strongly kept in check. Promoter sequences subject to repression by σ54 were evident, indicating loss of rpoN (encoding σ54) rather than creating rpoN bypass alleles would be one evolutionary route for new gene expression patterns. Finally, cold-shock promoters showed unusual σ54-dependence in vivo not readily correlated with conventional σ54 binding-sites. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genome wide interactions of wild-type and activator bypass forms of σ54
Schaefer, Jorrit; Engl, Christoph; Zhang, Nan; Lawton, Edward; Buck, Martin
2015-01-01
Enhancer-dependent transcription involving the promoter specificity factor σ54 is widely distributed amongst bacteria and commonly associated with cell envelope function. For transcription initiation, σ54-RNA polymerase yields open promoter complexes through its remodelling by cognate AAA+ ATPase activators. Since activators can be bypassed in vitro, bypass transcription in vivo could be a source of emergent gene expression along evolutionary pathways yielding new control networks and transcription patterns. At a single test promoter in vivo bypass transcription was not observed. We now use genome-wide transcription profiling, genome-wide mutagenesis and gene over-expression strategies in Escherichia coli, to (i) scope the range of bypass transcription in vivo and (ii) identify genes which might alter bypass transcription in vivo. We find little evidence for pervasive bypass transcription in vivo with only a small subset of σ54 promoters functioning without activators. Results also suggest no one gene limits bypass transcription in vivo, arguing bypass transcription is strongly kept in check. Promoter sequences subject to repression by σ54 were evident, indicating loss of rpoN (encoding σ54) rather than creating rpoN bypass alleles would be one evolutionary route for new gene expression patterns. Finally, cold-shock promoters showed unusual σ54-dependence in vivo not readily correlated with conventional σ54 binding-sites. PMID:26082500
Guo, D; Li, H L; Tang, X; Peng, S Q
2014-12-18
In plants, homeodomain proteins play a critical role in regulating various aspects of growth and development. KNOX proteins are members of the homeodomain protein family. KNOX transcription factors have been reported from Arabidopsis, rice, and other higher plants. The recent publication of the draft genome sequence of cassava (Manihot esculenta Crantz) has allowed a genome-wide search for M. esculenta KNOX (MeKNOX) transcription factors and the comparison of these positively identified proteins with their homologs in model plants. In the present study, we identified 12 MeKNOX genes in the cassava genome and grouped them into two distinct subfamilies based on their domain composition and phylogenetic analysis. Furthermore, semi-quantitative reverse transcription polymerase chain reaction analysis was performed to elucidate the expression profiles of these genes in different tissues and during various stages of root development. The analysis of MeKNOX expression profiles indicated that the 12 MeKNOX genes display differential expression either in their transcript abundance or in their expression patterns.
Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data
Liu, Zhi-Ping
2015-01-01
Transcriptional regulation plays vital roles in many fundamental biological processes. Reverse engineering of genome-wide regulatory networks from high-throughput transcriptomic data provides a promising way to characterize the global scenario of regulatory relationships between regulators and their targets. In this review, we summarize and categorize the main frameworks and methods currently available for inferring transcriptional regulatory networks from microarray gene expression profiling data. We give an overview of each strategy and introduce representative methods for it. Their assumptions, advantages, shortcomings, and possible improvements and extensions are also discussed. PMID:25937810
2017-01-01
Recent advances in next-generation sequencing approaches have revolutionized our understanding of transcriptional expression in diverse systems. However, measurements of transcription do not necessarily reflect gene translation, the process of ultimate importance in understanding cellular function. To circumvent this limitation, biochemical tagging of ribosome subunits to isolate ribosome-associated mRNA has been developed. However, this approach, called TRAP, lacks quantitative resolution compared to a superior technology, ribosome profiling. Here, we report the development of an optimized ribosome profiling approach in Drosophila. We first demonstrate successful ribosome profiling from a specific tissue, larval muscle, with enhanced resolution compared to conventional TRAP approaches. We next validate the ability of this technology to define genome-wide translational regulation. This technology is leveraged to test the relative contributions of transcriptional and translational mechanisms in the postsynaptic muscle that orchestrate the retrograde control of presynaptic function at the neuromuscular junction. Surprisingly, we find no evidence that significant changes in the transcription or translation of specific genes are necessary to enable retrograde homeostatic signaling, implying that post-translational mechanisms ultimately gate instructive retrograde communication. Finally, we show that a global increase in translation induces adaptive responses in both transcription and translation of protein chaperones and degradation factors to promote cellular proteostasis. Together, this development and validation of tissue-specific ribosome profiling enables sensitive and specific analysis of translation in Drosophila. PMID:29194454
High-resolution mapping of transcription factor binding sites on native chromatin
Kasinathan, Sivakanthan; Orsi, Guillermo A.; Zentner, Gabriel E.; Ahmad, Kami; Henikoff, Steven
2014-01-01
Sequence-specific DNA-binding proteins including transcription factors (TFs) are key determinants of gene regulation and chromatin architecture. Formaldehyde cross-linking and sonication followed by Chromatin ImmunoPrecipitation (X-ChIP) is widely used for profiling of TF binding, but is limited by low resolution and poor specificity and sensitivity. We present a simple protocol that starts with micrococcal nuclease-digested uncross-linked chromatin and is followed by affinity purification of TFs and paired-end sequencing. The resulting ORGANIC (Occupied Regions of Genomes from Affinity-purified Naturally Isolated Chromatin) profiles of Saccharomyces cerevisiae Abf1 and Reb1 provide highly accurate base-pair resolution maps that are not biased toward accessible chromatin, and do not require input normalization. We also demonstrate the high specificity of our method when applied to larger genomes by profiling Drosophila melanogaster GAGA Factor and Pipsqueak. Our results suggest that ORGANIC profiling is a widely applicable high-resolution method for sensitive and specific profiling of direct protein-DNA interactions. PMID:24336359
Genome-wide transcriptional profiling of human glioblastoma cells in response to ITE treatment.
Kang, Bo; Zhou, Yanwen; Zheng, Min; Wang, Ying-Jie
2015-09-01
The ligand-activated transcription factor aryl hydrocarbon receptor (AhR) has recently been shown to play a key role in embryogenesis and tumorigenesis (Feng et al. [1], Safe et al. [2]), and 2-(1'H-indole-3'-carbonyl)-thiazole-4-carboxylic acid methyl ester (ITE) (Song et al. [3]) is an endogenous AhR ligand that possesses anti-tumor activity. In order to gain insights into how ITE acts via the AhR in embryogenesis and tumorigenesis, we analyzed the genome-wide transcriptional profiles of the following three groups of cells: the human glioblastoma U87 parental cells, U87 tumor sphere cells treated with vehicle (DMSO) and U87 tumor sphere cells treated with ITE. Here, we provide the details of the sample gathering strategy and show the quality controls and the analyses associated with our gene array data deposited into the Gene Expression Omnibus (GEO) under the accession code GSE67986.
Chatterjee, Sumantra; Sivakamasundari, V; Yap, Sook Peng; Kraus, Petra; Kumar, Vibhor; Xing, Xing; Lim, Siew Lan; Sng, Joel; Prabhakar, Shyam; Lufkin, Thomas
2014-12-05
Vertebrate organogenesis is a highly complex process involving sequential cascades of transcription factor activation or repression. Interestingly a single developmental control gene can occasionally be essential for the morphogenesis and differentiation of tissues and organs arising from vastly disparate embryological lineages. Here we elucidated the role of the mammalian homeobox gene Bapx1 during the embryogenesis of five distinct organs at E12.5 - vertebral column, spleen, gut, forelimb and hindlimb - using expression profiling of sorted wildtype and mutant cells combined with genome wide binding site analysis. Furthermore we analyzed the development of the vertebral column at the molecular level by combining transcriptional profiling and genome wide binding data for Bapx1 with similarly generated data sets for Sox9 to assemble a detailed gene regulatory network revealing genes previously not reported to be controlled by either of these two transcription factors. The gene regulatory network appears to control cell fate decisions and morphogenesis in the vertebral column along with the prevention of premature chondrocyte differentiation thus providing a detailed molecular view of vertebral column development.
Mikhaylova, Lyudmila; Zhang, Yiming; Kobzik, Lester; Fedulov, Alexey V
2013-01-01
We investigated the link between epigenome-wide methylation aberrations at birth and genomic transcriptional changes upon allergen sensitization that occur in the neonatal dendritic cells (DC) due to maternal asthma. We previously demonstrated that neonates of asthmatic mothers are born with a functional skew in splenic DCs that can be seen even in allergen-naïve pups and can convey allergy responses to normal recipients. However, minimal-to-no transcriptional or phenotypic changes were found to explain this alteration. Here we provide in-depth analysis of genome-wide DNA methylation profiles and RNA transcriptional (microarray) profiles before and after allergen sensitization. We identified differentially methylated and differentially expressed loci and performed manually-curated matching of methylation status of the key regulatory sequences (promoters and CpG islands) to expression of their respective transcripts before and after sensitization. We found that while allergen-naive DCs from asthma-at-risk neonates have minimal transcriptional change compared to controls, the methylation changes are extensive. The substantial transcriptional change only becomes evident upon allergen sensitization, when it occurs in multiple genes with the pre-existing epigenetic alterations. We demonstrate that maternal asthma leads to both hyper- and hypomethylation in neonatal DCs, and that both types of events at various loci significantly overlap with transcriptional responses to allergen. Pathway analysis indicates that approximately 1/2 of differentially expressed and differentially methylated genes directly interact in known networks involved in allergy and asthma processes. We conclude that congenital epigenetic changes in DCs are strongly linked to altered transcriptional responses to allergen and to early-life asthma origin. The findings are consistent with the emerging paradigm that asthma is a disease with underlying epigenetic changes.
Genome-wide transcriptional profiling by microarrays provides a powerful platform for gene expression-based biomarker discovery. After their wide acceptance in human disease diagnosis, prognosis, and drug discovery, these gene signatures are increasingly being adopted for environ...
Gilardi, Federica; Liechti, Robin; Martin, Olivier; Harshman, Keith; Delorenzi, Mauro; Desvergne, Béatrice; Herr, Winship; Deplancke, Bart; Schibler, Ueli; Rougemont, Jacques; Guex, Nicolas; Hernandez, Nouria; Naef, Felix
2012-01-01
Interactions of cell-autonomous circadian oscillators with diurnal cycles govern the temporal compartmentalization of cell physiology in mammals. To understand the transcriptional and epigenetic basis of diurnal rhythms in mouse liver genome-wide, we generated temporal DNA occupancy profiles by RNA polymerase II (Pol II) as well as profiles of the histone modifications H3K4me3 and H3K36me3. We used these data to quantify the relationships of phases and amplitudes between different marks. We found that rhythmic Pol II recruitment at promoters rather than rhythmic transition from paused to productive elongation underlies diurnal gene transcription, a conclusion further supported by modeling. Moreover, Pol II occupancy preceded mRNA accumulation by 3 hours, consistent with mRNA half-lives. Both methylation marks showed that the epigenetic landscape is highly dynamic and globally remodeled during the 24-hour cycle. While promoters of transcribed genes had tri-methylated H3K4 even at their trough activity times, tri-methylation levels reached their peak, on average, 1 hour after Pol II. Meanwhile, rhythms in tri-methylation of H3K36 lagged transcription by 3 hours. Finally, modeling profiles of Pol II occupancy and mRNA accumulation identified three classes of genes: one showing rhythmicity both in transcriptional and mRNA accumulation, a second class with rhythmic transcription but flat mRNA levels, and a third with constant transcription but rhythmic mRNAs. The latter class emphasizes widespread temporally gated posttranscriptional regulation in the mouse liver. PMID:23209382
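To illustrate why a 3-hour lag between Pol II occupancy and mRNA accumulation can be consistent with mRNA half-lives, here is a minimal sketch. It is not the authors' model: it assumes simple first-order kinetics dm/dt = s(t) - k*m with a sinusoidal synthesis rate s(t) of period 24 h, in which case the mRNA rhythm lags synthesis by atan(omega/k)/omega. The function names are hypothetical.

```python
import math

PERIOD_H = 24.0  # assumed diurnal period, in hours


def phase_lag_hours(half_life_h: float, period_h: float = PERIOD_H) -> float:
    """Lag (hours) of mRNA behind a sinusoidal synthesis rate.

    Assumes dm/dt = s(t) - k*m with s(t) = 1 + cos(omega*t); the steady-state
    solution oscillates with phase delay phi where tan(phi) = omega / k.
    """
    omega = 2.0 * math.pi / period_h      # angular frequency of the rhythm
    k = math.log(2.0) / half_life_h       # first-order degradation rate
    return math.atan(omega / k) / omega


def half_life_for_lag(lag_h: float, period_h: float = PERIOD_H) -> float:
    """Inverse of phase_lag_hours; valid for 0 < lag_h < period_h / 4."""
    if not 0.0 < lag_h < period_h / 4.0:
        raise ValueError("lag must lie strictly between 0 and a quarter period")
    omega = 2.0 * math.pi / period_h
    k = omega / math.tan(omega * lag_h)   # solve tan(omega*lag) = omega / k
    return math.log(2.0) / k


if __name__ == "__main__":
    print(f"half-life implied by a 3 h lag: {half_life_for_lag(3.0):.2f} h")
```

Under these assumptions a 3-hour lag corresponds to a half-life of about 2.65 hours, and the lag can never exceed a quarter of the period (6 hours), however stable the transcript.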
2012-01-01
Background: The biphasic life cycle with pelagic larva and benthic adult stages is widely observed in the animal kingdom, including the Porifera (sponges), which are the earliest branching metazoans. The demosponge, Amphimedon queenslandica, undergoes metamorphosis from a free-swimming larva into a sessile adult that bears no morphological resemblance to other animals. While the genome of A. queenslandica contains an extensive repertoire of genes very similar to that of complex bilaterians, it is as yet unclear how this is drawn upon to coordinate changing morphological features and ecological demands throughout the sponge life cycle. Results: To identify genome-wide events that accompany the pelagobenthic transition in A. queenslandica, we compared global gene expression profiles at four key developmental stages by sequencing the poly(A) transcriptome using SOLiD technology. Large-scale changes in transcription were observed as sponge larvae settled on the benthos and began metamorphosis. Although previous systematics suggest that the only clear homology between Porifera and other animals is in the embryonic and larval stages, we observed extensive use of genes involved in metazoan-associated cellular processes throughout the sponge life cycle. Sponge-specific transcripts are not over-represented in the morphologically distinct adult; rather, many genes that encode typical metazoan features, such as cell adhesion and immunity, are upregulated. Our analysis further revealed gene families with candidate roles in competence, settlement, and metamorphosis in the sponge, including transcription factors, G-protein coupled receptors and other signaling molecules. Conclusions: This first genome-wide study of the developmental transcriptome in an early branching metazoan highlights major transcriptional events that accompany the pelagobenthic transition and points to a network of regulatory mechanisms that coordinate changes in morphology with shifting environmental demands. Metazoan developmental and structural gene orthologs are well-integrated into the expression profiles at every stage of sponge development, including the adult. The utilization of genes involved in metazoan-associated processes throughout sponge development emphasizes the potential of the genome of the last common ancestor of animals to generate phenotypic complexity. PMID:22646746
Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project.
Gerstein, Mark B; Lu, Zhi John; Van Nostrand, Eric L; Cheng, Chao; Arshinoff, Bradley I; Liu, Tao; Yip, Kevin Y; Robilotto, Rebecca; Rechtsteiner, Andreas; Ikegami, Kohta; Alves, Pedro; Chateigner, Aurelien; Perry, Marc; Morris, Mitzi; Auerbach, Raymond K; Feng, Xin; Leng, Jing; Vielle, Anne; Niu, Wei; Rhrissorrakrai, Kahn; Agarwal, Ashish; Alexander, Roger P; Barber, Galt; Brdlik, Cathleen M; Brennan, Jennifer; Brouillet, Jeremy Jean; Carr, Adrian; Cheung, Ming-Sin; Clawson, Hiram; Contrino, Sergio; Dannenberg, Luke O; Dernburg, Abby F; Desai, Arshad; Dick, Lindsay; Dosé, Andréa C; Du, Jiang; Egelhofer, Thea; Ercan, Sevinc; Euskirchen, Ghia; Ewing, Brent; Feingold, Elise A; Gassmann, Reto; Good, Peter J; Green, Phil; Gullier, Francois; Gutwein, Michelle; Guyer, Mark S; Habegger, Lukas; Han, Ting; Henikoff, Jorja G; Henz, Stefan R; Hinrichs, Angie; Holster, Heather; Hyman, Tony; Iniguez, A Leo; Janette, Judith; Jensen, Morten; Kato, Masaomi; Kent, W James; Kephart, Ellen; Khivansara, Vishal; Khurana, Ekta; Kim, John K; Kolasinska-Zwierz, Paulina; Lai, Eric C; Latorre, Isabel; Leahey, Amber; Lewis, Suzanna; Lloyd, Paul; Lochovsky, Lucas; Lowdon, Rebecca F; Lubling, Yaniv; Lyne, Rachel; MacCoss, Michael; Mackowiak, Sebastian D; Mangone, Marco; McKay, Sheldon; Mecenas, Desirea; Merrihew, Gennifer; Miller, David M; Muroyama, Andrew; Murray, John I; Ooi, Siew-Loon; Pham, Hoang; Phippen, Taryn; Preston, Elicia A; Rajewsky, Nikolaus; Rätsch, Gunnar; Rosenbaum, Heidi; Rozowsky, Joel; Rutherford, Kim; Ruzanov, Peter; Sarov, Mihail; Sasidharan, Rajkumar; Sboner, Andrea; Scheid, Paul; Segal, Eran; Shin, Hyunjin; Shou, Chong; Slack, Frank J; Slightam, Cindie; Smith, Richard; Spencer, William C; Stinson, E O; Taing, Scott; Takasaki, Teruaki; Vafeados, Dionne; Voronina, Ksenia; Wang, Guilin; Washington, Nicole L; Whittle, Christina M; Wu, Beijing; Yan, Koon-Kiu; Zeller, Georg; Zha, Zheng; Zhong, Mei; Zhou, Xingliang; Ahringer, Julie; Strome, Susan; Gunsalus, Kristin C; Micklem, Gos; Liu, X Shirley; Reinke, Valerie; Kim, Stuart K; Hillier, LaDeana W; Henikoff, Steven; Piano, Fabio; Snyder, Michael; Stein, Lincoln; Lieb, Jason D; Waterston, Robert H
2010-12-24
We systematically generated large-scale data sets to improve genome annotation for the nematode Caenorhabditis elegans, a key model organism. These data sets include transcriptome profiling across a developmental time course, genome-wide identification of transcription factor-binding sites, and maps of chromatin organization. From this, we created more complete and accurate gene models, including alternative splice forms and candidate noncoding RNAs. We constructed hierarchical networks of transcription factor-binding and microRNA interactions and discovered chromosomal locations bound by an unusually large number of transcription factors. Different patterns of chromatin composition and histone modification were revealed between chromosome arms and centers, with similarly prominent differences between autosomes and the X chromosome. Integrating data types, we built statistical models relating chromatin, transcription factor binding, and gene expression. Overall, our analyses ascribed putative functions to most of the conserved genome.
Identification and expression profiles of the WRKY transcription factor family in Ricinus communis.
Li, Hui-Liang; Zhang, Liang-Bo; Guo, Dong; Li, Chang-Zhu; Peng, Shi-Qing
2012-07-25
In plants, WRKY proteins constitute a large family of transcription factors. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. A large number of WRKY transcription factors have been reported from Arabidopsis, rice, and other higher plants. The recent publication of the draft genome sequence of castor bean (Ricinus communis) has allowed a genome-wide search for R. communis WRKY (RcWRKY) transcription factors and the comparison of these positively identified proteins with their homologs in model plants. A total of 47 WRKY genes were identified in the castor bean genome. According to the structural features of the WRKY domain, the RcWRKY are classified into seven main phylogenetic groups. Furthermore, putative orthologs of RcWRKY proteins in Arabidopsis and rice could now be assigned. An analysis of expression profiles of RcWRKY genes indicates that 47 WRKY genes display differential expressions either in their transcript abundance or expression patterns under normal growth conditions. Copyright © 2012 Elsevier B.V. All rights reserved.
Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets
Macosko, Evan Z.; Basu, Anindita; Satija, Rahul; Nemesh, James; Shekhar, Karthik; Goldman, Melissa; Tirosh, Itay; Bialas, Allison R.; Kamitaki, Nolan; Martersteck, Emily M.; Trombetta, John J.; Weitz, David A.; Sanes, Joshua R.; Shalek, Alex K.; Regev, Aviv; McCarroll, Steven A.
2015-01-01
Summary: Cells, the basic units of biological structure and function, vary broadly in type and state. Single-cell genomics can characterize cell identity and function, but limitations of ease and scale have prevented its broad application. Here we describe Drop-Seq, a strategy for quickly profiling thousands of individual cells by separating them into nanoliter-sized aqueous droplets, associating a different barcode with each cell’s RNAs, and sequencing them all together. Drop-Seq analyzes mRNA transcripts from thousands of individual cells simultaneously while remembering transcripts’ cell of origin. We analyzed transcriptomes from 44,808 mouse retinal cells and identified 39 transcriptionally distinct cell populations, creating a molecular atlas of gene expression for known retinal cell classes and novel candidate cell subtypes. Drop-Seq will accelerate biological discovery by enabling routine transcriptional profiling at single-cell resolution. PMID:26000488
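The idea of sequencing all cells together while remembering each transcript's cell of origin can be sketched as a simple grouping step. This is a toy illustration under stated assumptions, not the Drop-Seq software: reads are assumed to have already been reduced to (cell barcode, gene) pairs, and the function name, barcodes and gene labels are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, Tuple


def counts_per_cell(reads: Iterable[Tuple[str, str]]) -> Dict[str, Dict[str, int]]:
    """Group pooled reads by cell barcode and count reads per gene in each cell.

    Each read is a (cell_barcode, gene) pair; the barcode is what lets a pooled
    sequencing run be split back into per-cell expression profiles.
    """
    per_cell = defaultdict(Counter)
    for cell_barcode, gene in reads:
        per_cell[cell_barcode][gene] += 1
    # Convert to plain dicts so the result is easy to inspect and compare.
    return {cell: dict(genes) for cell, genes in per_cell.items()}


if __name__ == "__main__":
    # Hypothetical pooled reads from two droplets (barcodes ACGT and TTAG).
    pooled = [
        ("ACGT", "geneA"),
        ("TTAG", "geneB"),
        ("ACGT", "geneA"),
        ("ACGT", "geneB"),
    ]
    print(counts_per_cell(pooled))
```

A real pipeline would also need to handle barcode sequencing errors and amplification duplicates, which this sketch deliberately ignores.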
Gao, Qingqing; Xia, Le; Liu, Juanhua; Wang, Xiaobo; Gao, Song; Liu, Xiufan
2016-11-01
Avian pathogenic Escherichia coli (APEC) cause typical extraintestinal infections in poultry, including acute fatal septicemia, subacute pericarditis, and airsacculitis. These bacteria most often infect chickens, turkeys, ducks, and other avian species, and therefore pose a significant economic burden on the poultry industry worldwide. Few studies have analyzed the genome-wide transcriptional profile of APEC during infection in vivo. In this study, we examined the genome-wide transcriptional response of APEC O2 strain E058 in an in vivo chicken infection model to better understand the factors necessary for APEC colonization, growth, and survival in vivo. An Affymetrix multigenome DNA microarray, which contains most of the genomic open reading frames of E. coli K-12 strain MG1655, uropathogenic E. coli strain CFT073, and E. coli O157:H7 strain EDL 933, was used to profile the gene expression in APEC E058. We identified the in vivo transcriptional response of APEC E058 bacteria collected directly from the blood of infected chickens. Significant differences in expression levels were detected between the in vivo expression profile and the in vitro expression profile in LB medium. The genes highly expressed during infection were involved in metabolism, iron acquisition or transport, virulence, response to stress, and biological regulation. The reliability of the microarray data was confirmed by performing quantitative real-time PCR on 12 representative genes. Moreover, several significantly upregulated genes, including yjiY, sodA, phoB and spy, were selected to study their role in APEC pathogenesis. The data will help to better understand the mechanisms of APEC pathogenesis. Copyright © 2016 Elsevier Ltd. All rights reserved.
Genome-wide transcriptional profiling of human glioblastoma cells in response to ITE treatment
Kang, Bo; Zhou, Yanwen; Zheng, Min; Wang, Ying-Jie
2015-01-01
The ligand-activated transcription factor aryl hydrocarbon receptor (AhR) was recently shown to play a key role in embryogenesis and tumorigenesis (Feng et al. [1], Safe et al. [2]), and 2-(1′H-indole-3′-carbonyl)-thiazole-4-carboxylic acid methyl ester (ITE) (Song et al. [3]) is an endogenous AhR ligand that possesses anti-tumor activity. To gain insight into how ITE acts via the AhR in embryogenesis and tumorigenesis, we analyzed the genome-wide transcriptional profiles of three groups of cells: human glioblastoma U87 parental cells, U87 tumor sphere cells treated with vehicle (DMSO), and U87 tumor sphere cells treated with ITE. Here, we provide the details of the sample gathering strategy and show the quality controls and the analyses associated with our gene array data, deposited in the Gene Expression Omnibus (GEO) under accession code GSE67986. PMID:26484269
Saponaro, Marco; Kantidakis, Theodoros; Mitter, Richard; Kelly, Gavin P.; Heron, Mark; Williams, Hannah; Söding, Johannes; Stewart, Aengus; Svejstrup, Jesper Q.
2014-01-01
Summary: RECQL5 is the sole member of the RECQ family of helicases associated with RNA polymerase II (RNAPII). We now show that RECQL5 is a general elongation factor that is important for preserving genome stability during transcription. Depletion or overexpression of RECQL5 results in corresponding shifts in the genome-wide RNAPII density profile. Elongation is particularly affected, with RECQL5 depletion causing a striking increase in the average rate, concurrent with increased stalling, pausing, arrest, and/or backtracking (transcription stress). RECQL5 therefore controls the movement of RNAPII across genes. Loss of RECQL5 also results in the loss or gain of genomic regions, with the breakpoints of lost regions located in genes and common fragile sites. The chromosomal breakpoints overlap with areas of elevated transcription stress, suggesting that RECQL5 suppresses such stress and its detrimental effects, and thereby prevents genome instability in the transcribed region of genes. PMID:24836610
Rodrigues, Raquel; Grosso, Ana Rita; Moita, Luís
2013-01-01
The immune system relies on the plasticity of its components to produce appropriate responses to frequent environmental challenges. Dendritic cells (DCs) are critical initiators of innate immunity and orchestrate the later and more specific adaptive immunity. The generation of diversity in transcriptional programs is central for effective immune responses. Alternative splicing is widely considered a key generator of transcriptional and proteomic complexity, but its role has been rarely addressed systematically in immune cells. Here we used splicing-sensitive arrays to assess genome-wide gene- and exon-level expression profiles in human DCs in response to a bacterial challenge. We find widespread alternative splicing events and splicing factor transcriptional signatures induced by an E. coli challenge to human DCs. Alternative splicing acts in concert with transcriptional modulation, but these two mechanisms of gene regulation affect primarily distinct functional gene groups. Alternative splicing is likely to have an important role in DC immunobiology because it affects genes known to be involved in DC development, endocytosis, antigen presentation and cell cycle arrest.
BLISS is a versatile and quantitative method for genome-wide profiling of DNA double-strand breaks.
Yan, Winston X; Mirzazadeh, Reza; Garnerone, Silvano; Scott, David; Schneider, Martin W; Kallas, Tomasz; Custodio, Joaquin; Wernersson, Erik; Li, Yinqing; Gao, Linyi; Federova, Yana; Zetsche, Bernd; Zhang, Feng; Bienko, Magda; Crosetto, Nicola
2017-05-12
Precisely measuring the location and frequency of DNA double-strand breaks (DSBs) along the genome is instrumental to understanding genomic fragility, but current methods are limited in versatility, sensitivity or practicality. Here we present Breaks Labeling In Situ and Sequencing (BLISS), featuring the following: (1) direct labelling of DSBs in fixed cells or tissue sections on a solid surface; (2) low-input requirement by linear amplification of tagged DSBs by in vitro transcription; (3) quantification of DSBs through unique molecular identifiers; and (4) easy scalability and multiplexing. We apply BLISS to profile endogenous and exogenous DSBs in low-input samples of cancer cells, embryonic stem cells and liver tissue. We demonstrate the sensitivity of BLISS by assessing the genome-wide off-target activity of two CRISPR-associated RNA-guided endonucleases, Cas9 and Cpf1, observing that Cpf1 has higher specificity than Cas9. Our results establish BLISS as a versatile, sensitive and efficient method for genome-wide DSB mapping in many applications.
Dong, Chen; Hu, Huigang; Xie, Jianghui
2016-12-01
DNA-binding with one finger (Dof) domain proteins are a multigene family of plant-specific transcription factors involved in numerous aspects of plant growth and development. In this study, we report a genome-wide search for Musa acuminata Dof (MaDof) genes and their expression profiles at different developmental stages and in response to various abiotic stresses. In addition, a complete overview of the Dof gene family in bananas is presented, including the gene structures, chromosomal locations, cis-regulatory elements, conserved protein domains, and phylogenetic inferences. Based on the genome-wide analysis, we identified 74 full-length protein-coding MaDof genes unevenly distributed on 11 chromosomes. Phylogenetic analysis with Dof members from diverse plant species showed that MaDof genes can be classified into four subgroups (StDof I, II, III, and IV). The detailed genomic information of the MaDof gene homologs in the present study provides opportunities for functional analyses to unravel the exact role of the genes in plant growth and development.
Model-based redesign of global transcription regulation
Carrera, Javier; Rodrigo, Guillermo; Jaramillo, Alfonso
2009-01-01
Synthetic biology aims at the design or redesign of biological systems. In particular, one possible goal could be the rewiring of the transcription regulation network by exchanging the endogenous promoters. To achieve this objective, we have adapted current methods to the inference of a model based on ordinary differential equations that is able to predict the network response after a major change in its topology. Our procedure utilizes microarray data for training. We have experimentally validated our inferred global regulatory model in Escherichia coli by predicting transcriptomic profiles under new perturbations. We have also tested our methodology in silico by providing accurate predictions of the underlying networks from expression data generated with artificial genomes. In addition, we have shown the predictive power of our methodology by obtaining the gene profile in experimental redesigns of the E. coli genome, in which the transcriptional network was rewired by knocking out master regulators or by upregulating transcription factors controlled by different promoters. Our approach is compatible with most network inference methods, allowing computational exploration of future genome-wide redesign experiments in synthetic biology. PMID:19188257
GST-PRIME: an algorithm for genome-wide primer design.
Leister, Dario; Varotto, Claudio
2007-01-01
The profiling of mRNA expression based on DNA arrays has become a powerful tool to study genome-wide transcription of genes in a number of organisms. GST-PRIME is a software package created to facilitate large-scale primer design for the amplification of probes to be immobilized on arrays for transcriptome analyses, even though it can be also applied in low-throughput approaches. GST-PRIME allows highly efficient, direct amplification of gene-sequence tags (GSTs) from genomic DNA (gDNA), starting from annotated genome or transcript sequences. GST-PRIME provides a customer-friendly platform for automatic primer design, and despite the relative simplicity of the algorithm, experimental tests in the model plant species Arabidopsis thaliana confirmed the reliability of the software. This chapter describes the algorithm used for primer design, its input and output files, and the installation of the standalone package and its use.
Global effects of the CSR-1 RNA interference pathway on the transcriptional landscape.
Cecere, Germano; Hoersch, Sebastian; O'Keeffe, Sean; Sachidanandam, Ravi; Grishok, Alla
2014-04-01
Argonaute proteins and their small RNA cofactors short interfering RNAs are known to inhibit gene expression at the transcriptional and post-transcriptional levels. In Caenorhabditis elegans, the Argonaute CSR-1 binds thousands of endogenous siRNAs (endo-siRNAs) that are antisense to germline transcripts. However, its role in gene expression regulation remains controversial. Here we used genome-wide profiling of nascent RNA transcripts and found that the CSR-1 RNA interference pathway promoted sense-oriented RNA polymerase II transcription. Moreover, a loss of CSR-1 function resulted in global increase in antisense transcription and ectopic transcription of silent chromatin domains, which led to reduced chromatin incorporation of centromere-specific histone H3. On the basis of these findings, we propose that the CSR-1 pathway helps maintain the directionality of active transcription, thereby propagating the distinction between transcriptionally active and silent genomic regions.
An integrated workflow for analysis of ChIP-chip data.
Weigelt, Karin; Moehle, Christoph; Stempfl, Thomas; Weber, Bernhard; Langmann, Thomas
2008-08-01
Although ChIP-chip is a powerful tool for genome-wide discovery of transcription factor target genes, the steps involving raw data analysis, identification of promoters, and correlation with binding sites are still laborious processes. Therefore, we report an integrated workflow for the analysis of promoter tiling arrays with the Genomatix ChipInspector system. We compare this tool with open-source software packages to identify PU.1 regulated genes in mouse macrophages. Our results suggest that ChipInspector data analysis, comparative genomics for binding site prediction, and pathway/network modeling significantly facilitate and enhance whole-genome promoter profiling to reveal in vivo sites of transcription factor-DNA interactions.
Genome-wide profiling of PRC1 and PRC2 Polycomb chromatin binding in Drosophila melanogaster.
Tolhuis, Bas; de Wit, Elzo; Muijrers, Inhua; Teunissen, Hans; Talhout, Wendy; van Steensel, Bas; van Lohuizen, Maarten
2006-06-01
Polycomb group (PcG) proteins maintain transcriptional repression of developmentally important genes and have been implicated in cell proliferation and stem cell self-renewal. We used a genome-wide approach to map binding patterns of PcG proteins (Pc, esc and Sce) in Drosophila melanogaster Kc cells. We found that Pc associates with large genomic regions of up to approximately 150 kb in size, hereafter referred to as 'Pc domains'. Sce and esc accompany Pc in most of these domains. PcG-bound chromatin is trimethylated at histone H3 Lys27 and is generally transcriptionally silent. Furthermore, PcG proteins preferentially bind to developmental genes. Many of these encode transcriptional regulators and key components of signal transduction pathways, including Wingless, Hedgehog, Notch and Delta. We also identify several new putative functions of PcG proteins, such as in steroid hormone biosynthesis. These results highlight the extensive involvement of PcG proteins in the coordination of development through the formation of large repressive chromatin domains.
2014-01-01
Background: Genome-wide microarrays have been useful for predicting chemical-genetic interactions at the gene level. However, interpreting genome-wide microarray results can be overwhelming due to the vast output of gene expression data combined with the off-target transcriptional responses often induced by a drug treatment. This study demonstrates how experimental and computational methods can interact with each other to arrive at more accurate predictions of drug-induced perturbations. We present a two-stage strategy that links microarray experimental testing and network training conditions to predict gene perturbations for a drug with a known mechanism of action in a well-studied organism. Results: S. cerevisiae cells were treated with the antifungal fluconazole, and expression profiling was conducted under different biological conditions using Affymetrix genome-wide microarrays. Transcripts were filtered with a formal network-based method, sparse simultaneous equation models and Lasso regression (SSEM-Lasso), under different network training conditions. Gene expression results were evaluated using both gene set and single gene target analyses, and the drug’s transcriptional effects were narrowed first by pathway and then by individual genes. Variables included: (i) Testing conditions – exposure time and concentration and (ii) Network training conditions – training compendium modifications. Two analyses of SSEM-Lasso output – gene set and single gene – were conducted to gain a better understanding of how SSEM-Lasso predicts perturbation targets. Conclusions: This study demonstrates that genome-wide microarrays can be optimized using a two-stage strategy for a more in-depth understanding of how a cell manifests biological reactions to a drug treatment at the transcription level. Additionally, a more detailed understanding of how the statistical model, SSEM-Lasso, propagates perturbations through a network of gene regulatory interactions is achieved. PMID:24444313
Yu, Hong; Soler, Marçal; Mila, Isabelle; San Clemente, Hélène; Savelli, Bruno; Dunand, Christophe; Paiva, Jorge A. P.; Myburg, Alexander A.; Bouzayen, Mondher; Grima-Pettenati, Jacqueline; Cassan-Wang, Hua
2014-01-01
Auxin is a central hormone involved in a wide range of developmental processes including the specification of vascular stem cells. Auxin Response Factors (ARF) are important actors of the auxin signalling pathway, regulating the transcription of auxin-responsive genes through direct binding to their promoters. The recent availability of the Eucalyptus grandis genome sequence allowed us to examine the characteristics and evolutionary history of this gene family in a woody plant of high economic importance. With 17 members, the E. grandis ARF gene family is slightly contracted, as compared to those of most angiosperms studied hitherto, lacking traces of duplication events. In silico analysis of alternative transcripts and gene truncation suggested that these two mechanisms were preeminent in shaping the functional diversity of the ARF family in Eucalyptus. Comparative phylogenetic analyses with genomes of other taxonomic lineages revealed the presence of a new ARF clade found preferentially in woody and/or perennial plants. High-throughput expression profiling among different organs and tissues and in response to environmental cues highlighted genes expressed in vascular cambium and/or developing xylem, responding dynamically to various environmental stimuli. Finally, this study allowed identification of three ARF candidates potentially involved in the auxin-regulated transcriptional program underlying wood formation. PMID:25269088
Jing, Zhaobin; Liu, Zhande
2018-04-01
As one of the largest transcription factor families in plants, WRKY transcription factors play important roles in various biotic and abiotic stress responses. To date, WRKY genes in kiwifruit (Actinidia spp.) remain poorly understood. In our study, a total of 97 AcWRKY genes were identified in the kiwifruit genome. We present an overview of these AcWRKY genes, including their phylogenetic relationships, exon-intron structures, synteny and expression profiles. The 97 AcWRKY genes were divided into three groups based on the conserved WRKY domain. Synteny analysis indicated that segmental duplication events contributed to the expansion of the kiwifruit AcWRKY family. In addition, the synteny analysis between kiwifruit and Arabidopsis suggested that some of the AcWRKY genes were derived from common ancestors before the divergence of these two species. Conserved motifs outside the AcWRKY domain may reflect their functional conservation. Genome-wide segmental and tandem duplications were found, which may contribute to the expansion of AcWRKY genes. Furthermore, the analysis of selected AcWRKY genes showed a variety of expression patterns in five different organs as well as during biotic and abiotic stresses. The genome-wide identification and characterization of kiwifruit WRKY transcription factors provides insight into the evolutionary history and is a useful resource for further functional analyses of kiwifruit.
Principles of regulatory information conservation between mouse and human.
Cheng, Yong; Ma, Zhihai; Kim, Bong-Hyun; Wu, Weisheng; Cayting, Philip; Boyle, Alan P; Sundaram, Vasavi; Xing, Xiaoyun; Dogan, Nergiz; Li, Jingjing; Euskirchen, Ghia; Lin, Shin; Lin, Yiing; Visel, Axel; Kawli, Trupti; Yang, Xinqiong; Patacsil, Dorrelyn; Keller, Cheryl A; Giardine, Belinda; Kundaje, Anshul; Wang, Ting; Pennacchio, Len A; Weng, Zhiping; Hardison, Ross C; Snyder, Michael P
2014-11-20
To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human-mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.
Yao, Bing; Lin, Li; Street, R Craig; Zalewski, Zachary A; Galloway, Jocelyn N; Wu, Hao; Nelson, David L; Jin, Peng
2014-02-15
Fragile X-associated tremor/ataxia syndrome (FXTAS) is a late-onset neurodegenerative disorder in which patients carry premutation alleles of 55-200 CGG repeats in the FMR1 gene. To date, whether alterations in epigenetic regulation modulate FXTAS has gone unexplored. 5-Hydroxymethylcytosine (5hmC) converted from 5-methylcytosine (5mC) by the ten-eleven translocation (TET) family of proteins has been found recently to play key roles in neuronal functions. Here, we undertook genome-wide profiling of cerebellar 5hmC in a FXTAS mouse model (rCGG mice) and found that rCGG mice at 16 weeks showed overall reduced 5hmC levels genome-wide compared with age-matched wild-type littermates. However, we also observed gain-of-5hmC regions in repetitive elements, as well as in cerebellum-specific enhancers, but not in general enhancers. Genomic annotation and motif prediction of wild-type- and rCGG-specific differential 5-hydroxymethylated regions (DhMRs) revealed their high correlation with genes and transcription factors that are important in neuronal developmental and functional pathways. DhMR-associated genes partially overlapped with genes that were differentially associated with ribosomes in CGG mice identified by bacTRAP ribosomal profiling. Taken together, our data strongly indicate a functional role for 5hmC-mediated epigenetic modulation in the etiology of FXTAS, possibly through the regulation of transcription.
Genome-wide bisulfite sensitivity profiling of yeast suggests bisulfite inhibits transcription.
Segovia, Romulo; Mathew, Veena; Tam, Annie S; Stirling, Peter C
2017-09-01
Bisulfite, in the form of sodium bisulfite or metabisulfite, is used commercially as a food preservative. Bisulfite is used in the laboratory as a single-stranded DNA mutagen in epigenomic analyses of DNA methylation. Recently it has also been used on whole yeast cells to induce mutations in exposed single-stranded regions in vivo. To understand the effects of bisulfite on live cells, we conducted a genome-wide screen for bisulfite-sensitive mutants in yeast. Screening the deletion mutant array and collections of essential gene mutants, we define a genetic network of bisulfite-sensitive mutants. Validation of screen hits revealed hyper-sensitivity of transcription and RNA processing mutants, rather than DNA repair pathways, and follow-up analyses support a role in perturbation of RNA transactions. We propose a model in which bisulfite-modified nucleotides may interfere with transcription or RNA metabolism when used in vivo. Copyright © 2017 Elsevier B.V. All rights reserved.
Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C; Zhang, Baohong
2014-10-16
Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence.
Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C.; Zhang, Baohong
2014-01-01
Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence. PMID:25322260
Global effects of the CSR-1 RNA interference pathway on transcriptional landscape
Cecere, Germano; Hoersch, Sebastian; O’Keeffe, Sean; Sachidanandam, Ravi; Grishok, Alla
2014-01-01
Argonaute proteins and their small RNA co-factors short interfering RNAs (siRNAs) are known to inhibit gene expression at the transcriptional and post-transcriptional levels. In Caenorhabditis elegans, the Argonaute CSR-1 binds thousands of endogenous siRNAs (endo-siRNAs) antisense to germline transcripts and associates with chromatin in a siRNA-dependent manner. However, its role in gene expression regulation remains controversial. Here, we used a genome-wide profiling of nascent RNA transcripts to demonstrate that the CSR-1 RNAi pathway promotes sense-oriented Pol II transcription. Moreover, a loss of CSR-1 function resulted in global increase in antisense transcription and ectopic transcription of silent chromatin domains, which led to reduced chromatin incorporation of centromere-specific histone H3. Based on these findings, we propose that the CSR-1 pathway has a role in maintaining the directionality of active transcription thereby propagating the distinction between transcriptionally active and silent genomic regions. PMID:24681887
Alexandrov, Boian S; Fukuyo, Yayoi; Lange, Martin; Horikoshi, Nobuo; Gelev, Vladimir; Rasmussen, Kim Ø; Bishop, Alan R; Usheva, Anny
2012-11-01
The genome-wide mapping of the major gene expression regulators, the transcription factors (TFs) and their DNA binding sites, is of great importance for describing cellular behavior and phenotypic diversity. Presently, the methods for prediction of genomic TF binding produce a large number of false positives, most likely due to insufficient description of the physicochemical mechanisms of protein-DNA binding. Growing evidence suggests that, in the cell, the double-stranded DNA (dsDNA) is subject to local transient strand separations (breathing) that contribute to genomic functions. By using site-specific chromatin immunoprecipitations, gel shifts, BIOBASE data, and our model that accurately describes the melting behavior and breathing dynamics of dsDNA, we report a specific DNA breathing profile found at YY1 binding sites in cells. We find that the genomic flanking sequence variations and SNPs may exert long-range effects on DNA dynamics and predetermine YY1 binding. The ubiquitous TF YY1 has a fundamental role in essential biological processes by activating, initiating or repressing transcription depending upon the sequence context it binds. We anticipate that consensus binding sequences together with the related DNA dynamics profile may significantly improve the accuracy of predicting genomic TF binding sites and TF binding-related functional SNPs.
Miller, Gregory E; Chen, Edith; Shalowitz, Madeleine U; Story, Rachel E; Leigh, Adam K K; Ham, Paula; Arevalo, Jesusa M G; Cole, Steve W
2018-06-01
There are marked socioeconomic disparities in pediatric asthma control, but the molecular origins of these disparities are not well understood. To fill this gap, we performed genome-wide expression profiling of monocytes and T-helper cells from pediatric asthma patients of lower and higher socioeconomic status (SES). Ninety-nine children with asthma participated in a cross-sectional assessment. Of these, 87% were atopic, and most had disease of mild (54%) or moderate (29%) severity. Children were from lower-SES (n = 49; household income <$50 000) or higher-SES (n = 50; household income >$140 000) families. Peripheral blood monocytes and T-helper cells were isolated for genome-wide expression profiling of mRNA. Lower-SES children had worse asthma quality of life relative to higher-SES children, by both their own and their parents' reports. Although the groups had similar disease severity and potential confounds were controlled, their transcriptional profiles differed notably. The monocytes of lower-SES children showed transcriptional indications of up-regulated anti-microbial and pro-inflammatory activity. The T-helper cells of lower-SES children also had comparatively reduced expression of genes encoding γ-interferon and tumor necrosis factor-α, cytokines that orchestrate Type 1 responses. They also showed up-regulated activity of transcription factors that polarize cells towards Type 2 responses and promote Th17 cell maturation. Collectively, these patterns implicate pro-inflammatory monocytes and Type 2 cytokine activity as mechanisms contributing to worse asthma control among lower-SES children. © 2018 Wiley Periodicals, Inc.
Sajuthi, Satria P; Sharma, Neeraj K; Comeau, Mary E; Chou, Jeff W; Bowden, Donald W; Freedman, Barry I; Langefeld, Carl D; Parks, John S; Das, Swapan K
2017-10-20
Dyslipidemia is a major contributor to the increased cardiovascular disease and mortality associated with obesity and type 2 diabetes. We hypothesized that variation in expression of adipose tissue transcripts is associated with serum lipid concentrations in African Americans (AAs), and common genetic variants regulate expression levels of these transcripts. Fasting serum lipid levels, genome-wide transcript expression profiles of subcutaneous adipose tissue, and genome-wide SNP genotypes were analyzed in a cohort of non-diabetic AAs (N=250). Serum triglyceride (TRIG) and high density lipoprotein-cholesterol (HDL-C) levels were associated (FDR<0.01) with expression level of 1021 and 1875 adipose tissue transcripts, respectively, but none associated with total cholesterol or LDL-C levels. Serum HDL-C-associated transcripts were enriched for salient biological pathways, including branched-chain amino acid degradation, and oxidative phosphorylation. Genes in immuno-inflammatory pathways were activated among individuals with higher serum TRIG levels. We identified significant cis-regulatory SNPs (cis-eSNPs) for 449 serum lipid-associated transcripts in adipose tissue. The cis-eSNPs of 12 genes were nominally associated (p<0.001) with serum lipid level in genome wide association studies in Global Lipids Genetics Consortium (GLGC) cohorts. Allelic effect direction of cis-eSNPs on expression of MARCH2, BEST1 and TMEM258 matched with effect direction of these SNP alleles on serum TRIG or HDL-C levels in GLGC cohorts. These data suggest that expressions of serum lipid-associated transcripts in adipose tissue are dependent on common cis-eSNPs in African Americans. Thus, genetically-mediated transcriptional regulation in adipose tissue may play a role in reducing HDL-C and increasing TRIG in serum. Copyright © 2017 Elsevier B.V. All rights reserved.
Baev, Vesselin; Milev, Ivan; Naydenov, Mladen; Vachev, Tihomir; Apostolova, Elena; Mehterov, Nikolay; Gozmanva, Mariyana; Minkov, Georgi; Sablok, Gaurav; Yahubyan, Galina
2014-11-01
Small RNA profiling and assessing its dependence on changing environmental factors have expanded our understanding of the transcriptional and post-transcriptional regulation of plant stress responses. Insufficient data have been documented earlier to depict the profiling of small RNA classes in temperature-associated stress which has a wide implication for climate change biology. In the present study, we report a comparative assessment of the genome-wide profiling of small RNAs in Arabidopsis thaliana using two conditional responses, induced by high- and low-temperature. Genome-wide profiling of small RNAs revealed an abundance of 21 nt small RNAs at low temperature, while high temperature showed an abundance of 21 nt and 24 nt small RNAs. The two temperature treatments altered the expression of a specific subset of mature miRNAs and displayed differential expression of a number of miRNA isoforms (isomiRs). Comparative analysis demonstrated that a large number of protein-coding genes can give rise to differentially expressed small RNAs following temperature shifts. Low temperature caused accumulation of small RNAs, corresponding to the sense strand of a number of cold-responsive genes. In contrast, high temperature stimulated the production of small RNAs of both polarities from genes encoding functionally diverse proteins. Copyright © 2014 Elsevier Masson SAS. All rights reserved.
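The size-class comparison described above (21 nt versus 24 nt small RNAs) comes down to a read-length distribution. The following is a minimal sketch of that tally, not the authors' pipeline: the function name, the 18 to 30 nt window and the example reads are all assumptions made for illustration.

```python
from collections import Counter


def length_distribution(reads, min_len=18, max_len=30):
    """Return {length: fraction of reads} for reads within [min_len, max_len].

    The 18-30 nt window is an assumed small-RNA size range, not a value
    taken from the abstract.
    """
    counts = Counter(len(r) for r in reads if min_len <= len(r) <= max_len)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {n: counts[n] / total for n in sorted(counts)}


# Hypothetical reads, for illustration only (not data from the study).
reads = ["A" * 21, "C" * 21, "G" * 24, "T" * 10]  # the 10-nt read is filtered out
for length, fraction in length_distribution(reads).items():
    print(f"{length} nt: {fraction:.2f}")
```

Running the same tally per library (low versus high temperature) and comparing the 21 nt and 24 nt fractions gives the kind of contrast the abstract reports.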
Vivek-Ananth, R P; Samal, Areejit
2016-09-01
A major goal of systems biology is to build predictive computational models of cellular metabolism. Availability of complete genome sequences and wealth of legacy biochemical information has led to the reconstruction of genome-scale metabolic networks in the last 15 years for several organisms across the three domains of life. Due to paucity of information on kinetic parameters associated with metabolic reactions, the constraint-based modelling approach, flux balance analysis (FBA), has proved to be a vital alternative to investigate the capabilities of reconstructed metabolic networks. In parallel, advent of high-throughput technologies has led to the generation of massive amounts of omics data on transcriptional regulation comprising mRNA transcript levels and genome-wide binding profile of transcriptional regulators. A frontier area in metabolic systems biology has been the development of methods to integrate the available transcriptional regulatory information into constraint-based models of reconstructed metabolic networks in order to increase the predictive capabilities of computational models and understand the regulation of cellular metabolism. Here, we review the existing methods to integrate transcriptional regulatory information into constraint-based models of metabolic networks. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
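For orientation, the standard FBA problem mentioned above can be written as a linear program. The notation is an assumption of this note rather than the review's: S is the stoichiometric matrix, v the vector of reaction fluxes, c the objective weights (for example, selecting a biomass reaction), and v^min, v^max the flux bounds.

```latex
\begin{aligned}
\max_{v}\quad & c^{\mathsf{T}} v \\
\text{subject to}\quad & S\,v = 0, \\
& v^{\min}_j \le v_j \le v^{\max}_j \quad \text{for every reaction } j .
\end{aligned}
```

One simple way to bring in regulatory information, given here only as an illustration of the general idea, is to set v^min_j = v^max_j = 0 for any reaction j whose enzyme-coding genes are inferred to be switched off under the condition of interest, and then solve the same linear program again.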
A genome-wide SNP scan accelerates trait-regulatory genomic loci identification in chickpea
Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C.L.L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
We identified 44844 high-quality SNPs by sequencing 92 diverse chickpea accessions belonging to a seed and pod trait-specific association panel using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays. A GWAS (genome-wide association study) in an association panel of 211, including the 92 sequenced accessions, identified 22 major genomic loci showing significant association (explaining 23–47% phenotypic variation) with pod and seed number/plant and 100-seed weight. Eighteen trait-regulatory major genomic loci underlying 13 robust QTLs were validated and mapped on an intra-specific genetic linkage map by QTL mapping. A combinatorial approach of GWAS, QTL mapping and gene haplotype-specific LD mapping and transcript profiling uncovered one superior haplotype and favourable natural allelic variants in the upstream regulatory region of a CesA-type cellulose synthase (Ca_Kabuli_CesA3) gene regulating high pod and seed number/plant (explaining 47% phenotypic variation) in chickpea. The up-regulation of this superior gene haplotype correlated with increased transcript expression of Ca_Kabuli_CesA3 gene in the pollen and pod of high pod/seed number accession, resulting in higher cellulose accumulation for normal pollen and pollen tube growth. A rapid combinatorial genome-wide SNP genotyping-based approach has potential to dissect complex quantitative agronomic traits and delineate trait-regulatory genomic loci (candidate genes) for genetic enhancement in crop plants, including chickpea. PMID:26058368
Yuan, Xiao-Long; Gao, Ning; Xing, Yan; Zhang, Hai-Bin; Zhang, Ai-Ling; Liu, Jing; He, Jin-Long; Xu, Yuan; Lin, Wen-Mian; Chen, Zan-Mou; Zhang, Hao; Zhang, Zhe; Li, Jia-Qi
2016-02-25
Substantial evidence has shown that DNA methylation regulates the initiation of ovarian and sexual maturation. Here, we investigated the genome-wide profile of DNA methylation in porcine ovaries at single-base resolution using reduced representation bisulfite sequencing. The biological variation was minimal among the three ovarian replicates. We found hypermethylation frequently occurred in regions with low gene abundance, while hypomethylation in regions with high gene abundance. The DNA methylation around transcriptional start sites was negatively correlated with their own CpG content. Additionally, the methylation level in the bodies of genes was higher than that in their 5' and 3' flanking regions. The DNA methylation pattern of the low CpG content promoter genes differed obviously from that of the high CpG content promoter genes. The DNA methylation level of the porcine ovary was higher than that of the porcine intestine. Analyses of the genome-wide DNA methylation in porcine ovaries would advance the knowledge and understanding of the porcine ovarian methylome.
Genome wide transcriptional profile analysis of Vitis amurensis in response to cold stress
USDA-ARS's Scientific Manuscript database
Grape is one of the most important fruit crops worldwide and is cultivated on all of the continents except Antarctica. However, low temperatures can limit the geographical locations and productivity of grapes. Vitis amurensis is a wild grapevine species with remarkable cold-tolerance, exceeding th...
Herbicides are structurally diverse chemicals that inhibit plant-specific targets; however, their off-target and potentially differentiating side-effects are less well defined. In this study, genome-wide expression profiling based on Affymetrix ATH1 arrays was used to identify dis...
In this study, genome-wide expression profiling based on Affymetrix ATH1 arrays was used to identify discriminating responses of Arabidopsis thaliana to five herbicides, which contain active ingredients targeting two different branches of amino acid biosynthesis. One herbicide co...
Pellacani, Davide; Bilenky, Misha; Kannan, Nagarajan; Heravi-Moussavi, Alireza; Knapp, David J H F; Gakkhar, Sitanshu; Moksa, Michelle; Carles, Annaick; Moore, Richard; Mungall, Andrew J; Marra, Marco A; Jones, Steven J M; Aparicio, Samuel; Hirst, Martin; Eaves, Connie J
2016-11-15
The normal adult human mammary gland is a continuous bilayered epithelial system. Bipotent and myoepithelial progenitors are prominent and unique components of the outer (basal) layer. The inner (luminal) layer includes both luminal-restricted progenitors and a phenotypically separable fraction that lacks progenitor activity. We now report an epigenomic comparison of these three subsets with one another, with their associated stromal cells, and with three immortalized, non-tumorigenic human mammary cell lines. Each genome-wide analysis contains profiles for six histone marks, methylated DNA, and RNA transcripts. Analysis of these datasets shows that each cell type has unique features, primarily within genomic regulatory regions, and that the cell lines group together. Analyses of the promoter and enhancer profiles place the luminal progenitors in between the basal cells and the non-progenitor luminal subset. Integrative analysis reveals networks of subset-specific transcription factors. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
Pothi, Radhika; Hesketh, Andrew; Möller-Levet, Carla; Hodgson, David A; Laing, Emma E; Stewart, Graham R; Smith, Colin P
2018-01-01
Stress-induced adaptations require multiple levels of regulation in all organisms to repair cellular damage. In the present study we evaluated the genome-wide transcriptional and translational changes following heat stress exposure in the soil-dwelling model actinomycete bacterium, Streptomyces coelicolor. The combined analysis revealed an unprecedented level of translational control of gene expression, deduced through polysome profiling, in addition to transcriptional changes. Our data show little correlation between the transcriptome and ‘translatome’; while an obvious downward trend in genome wide transcription was observed, polysome associated transcripts following heat-shock showed an opposite upward trend. A handful of key protein players, including the major molecular chaperones and proteases were highly induced at both the transcriptional and translational level following heat-shock, a phenomenon known as ‘potentiation’. Many other transcripts encoding cold-shock proteins, ABC-transporter systems, multiple transcription factors were more highly polysome-associated following heat stress; interestingly, these protein families were not induced at the transcriptional level and therefore were not previously identified as part of the stress response. Thus, stress coping mechanisms at the level of gene expression in this bacterium go well beyond the induction of a relatively small number of molecular chaperones and proteases in order to ensure cellular survival at non-physiological temperatures. PMID:29746664
Bucca, Giselda; Pothi, Radhika; Hesketh, Andrew; Möller-Levet, Carla; Hodgson, David A; Laing, Emma E; Stewart, Graham R; Smith, Colin P
2018-05-09
Stress-induced adaptations require multiple levels of regulation in all organisms to repair cellular damage. In the present study we evaluated the genome-wide transcriptional and translational changes following heat stress exposure in the soil-dwelling model actinomycete bacterium, Streptomyces coelicolor. The combined analysis revealed an unprecedented level of translational control of gene expression, deduced through polysome profiling, in addition to transcriptional changes. Our data show little correlation between the transcriptome and 'translatome'; while an obvious downward trend in genome wide transcription was observed, polysome associated transcripts following heat-shock showed an opposite upward trend. A handful of key protein players, including the major molecular chaperones and proteases were highly induced at both the transcriptional and translational level following heat-shock, a phenomenon known as 'potentiation'. Many other transcripts encoding cold-shock proteins, ABC-transporter systems, multiple transcription factors were more highly polysome-associated following heat stress; interestingly, these protein families were not induced at the transcriptional level and therefore were not previously identified as part of the stress response. Thus, stress coping mechanisms at the level of gene expression in this bacterium go well beyond the induction of a relatively small number of molecular chaperones and proteases in order to ensure cellular survival at non-physiological temperatures.
Principles of regulatory information conservation between mouse and human
Cheng, Yong; Ma, Zhihai; Kim, Bong-Hyun; ...
2014-11-19
To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human–mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Lastly, single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.
Genome-Wide Expression Profiling of Complex Regional Pain Syndrome
Jin, Eun-Heui; Zhang, Enji; Ko, Youngkwon; Sim, Woo Seog; Moon, Dong Eon; Yoon, Keon Jung; Hong, Jang Hee; Lee, Won Hyung
2013-01-01
Complex regional pain syndrome (CRPS) is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, genome-wide expression profiling in the whole blood of CRPS patients has not yet been reported. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II) and 5 controls (cut-off value: 1.5-fold change and p<0.05). Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1), matrix metalloproteinase 9 (MMP9), alanine aminopeptidase N (ANPEP), l-histidine decarboxylase (HDC), granulocyte colony-stimulating factor 3 receptor (G-CSF3R), and signal transducer and activator of transcription 3 (STAT3) genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). We focused on the MMP9 gene, which by qRT-PCR showed a statistically significant difference in expression in CRPS patients compared to controls, with the highest relative fold change (4.0±1.23 times and p = 1.4×10⁻⁴). The up-regulation of the MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of differential gene expression in CRPS, may help in the understanding of the pathophysiology of CRPS pain progression. PMID:24244504
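The selection rule quoted above (1.5-fold change and p<0.05) can be sketched as a simple filter. This is an illustrative reading of the cut-off, not the authors' analysis code: the function name, the input layout and the symmetric treatment of up- and down-regulation are assumptions, and the gene values are invented.

```python
from math import log2


def differentially_expressed(genes, min_fold=1.5, max_p=0.05):
    """Return names of genes passing both the fold-change and p-value cut-offs.

    genes: iterable of (name, mean_patient, mean_control, p_value).
    Fold change is applied symmetrically, so a 1.5-fold decrease also passes
    (an assumption; the abstract does not state the direction convention).
    """
    threshold = log2(min_fold)
    hits = []
    for name, patient, control, p in genes:
        if patient <= 0 or control <= 0:
            continue  # the ratio is undefined for non-positive intensities
        if abs(log2(patient / control)) >= threshold and p < max_p:
            hits.append(name)
    return hits


# Hypothetical values, for illustration only (not data from the study).
example = [
    ("GENE_A", 400.0, 100.0, 0.001),  # 4-fold up, small p-value
    ("GENE_B", 120.0, 100.0, 0.010),  # 1.2-fold: fails the fold cut-off
    ("GENE_C", 50.0, 100.0, 0.030),   # 2-fold down, small p-value
    ("GENE_D", 300.0, 100.0, 0.200),  # 3-fold up, fails the p-value cut-off
]
print(differentially_expressed(example))  # ['GENE_A', 'GENE_C']
```

With only 4 patients and 5 controls, a filter of this kind yields candidates rather than confirmed hits, which is consistent with the qRT-PCR confirmation step in the larger cohort.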
Brasa, Sarah; Teo, Soon-Siong; Roloff, Tim-Christoph; Morawiec, Laurent; Zamurovic, Natasa; Vicart, Axel; Funhoff, Enrico; Couttet, Philippe; Schübeler, Dirk; Grenet, Olivier; Marlowe, Jennifer; Moggs, Jonathan; Terranova, Rémi
2011-01-01
Evidence suggests that epigenetic perturbations are involved in the adverse effects associated with some drugs and toxicants, including certain classes of non-genotoxic carcinogens. Such epigenetic changes (altered DNA methylation and covalent histone modifications) may take place at the earliest stages of carcinogenesis and their identification holds great promise for biomedical research. Here, we evaluate the sensitivity and specificity of genome-wide epigenomic and transcriptomic profiling in phenobarbital (PB)-treated B6C3F1 mice, a well-characterized rodent model of non-genotoxic liver carcinogenesis. Methylated DNA Immunoprecipitation (MeDIP)-coupled microarray profiling of 17,967 promoter regions and 4,566 intergenic CpG islands was combined with genome-wide mRNA expression profiling to identify liver tissue-specific PB-mediated DNA methylation and transcriptional alterations. Only a limited number of significant anti-correlations were observed between PB-induced transcriptional and promoter-based DNA methylation perturbations. However, the constitutive androstane receptor (CAR) target gene Cyp2b10 was found to be concomitantly hypomethylated and transcriptionally activated in a liver tissue-specific manner following PB treatment. Furthermore, analysis of active and repressive histone modifications using chromatin immunoprecipitation revealed a strong PB-mediated epigenetic switch at the Cyp2b10 promoter. Our data reveal that PB-induced transcriptional perturbations are not generally associated with broad changes in the DNA methylation status at proximal promoters and suggest that the drug-inducible CAR pathway regulates an epigenetic switch from repressive to active chromatin at the target gene Cyp2b10. This study demonstrates the utility of integrated epigenomic and transcriptomic profiling for elucidating early mechanisms and biomarkers of non-genotoxic carcinogenesis. PMID:21455306
Lempiäinen, Harri; Müller, Arne; Brasa, Sarah; Teo, Soon-Siong; Roloff, Tim-Christoph; Morawiec, Laurent; Zamurovic, Natasa; Vicart, Axel; Funhoff, Enrico; Couttet, Philippe; Schübeler, Dirk; Grenet, Olivier; Marlowe, Jennifer; Moggs, Jonathan; Terranova, Rémi
2011-03-24
Evidence suggests that epigenetic perturbations are involved in the adverse effects associated with some drugs and toxicants, including certain classes of non-genotoxic carcinogens. Such epigenetic changes (altered DNA methylation and covalent histone modifications) may take place at the earliest stages of carcinogenesis and their identification holds great promise for biomedical research. Here, we evaluate the sensitivity and specificity of genome-wide epigenomic and transcriptomic profiling in phenobarbital (PB)-treated B6C3F1 mice, a well-characterized rodent model of non-genotoxic liver carcinogenesis. Methylated DNA Immunoprecipitation (MeDIP)-coupled microarray profiling of 17,967 promoter regions and 4,566 intergenic CpG islands was combined with genome-wide mRNA expression profiling to identify liver tissue-specific PB-mediated DNA methylation and transcriptional alterations. Only a limited number of significant anti-correlations were observed between PB-induced transcriptional and promoter-based DNA methylation perturbations. However, the constitutive androstane receptor (CAR) target gene Cyp2b10 was found to be concomitantly hypomethylated and transcriptionally activated in a liver tissue-specific manner following PB treatment. Furthermore, analysis of active and repressive histone modifications using chromatin immunoprecipitation revealed a strong PB-mediated epigenetic switch at the Cyp2b10 promoter. Our data reveal that PB-induced transcriptional perturbations are not generally associated with broad changes in the DNA methylation status at proximal promoters and suggest that the drug-inducible CAR pathway regulates an epigenetic switch from repressive to active chromatin at the target gene Cyp2b10. This study demonstrates the utility of integrated epigenomic and transcriptomic profiling for elucidating early mechanisms and biomarkers of non-genotoxic carcinogenesis.
Genome-wide transcription start site profiling in biofilm-grown Burkholderia cenocepacia J2315.
Sass, Andrea M; Van Acker, Heleen; Förstner, Konrad U; Van Nieuwerburgh, Filip; Deforce, Dieter; Vogel, Jörg; Coenye, Tom
2015-10-13
Burkholderia cenocepacia is a soil-dwelling Gram-negative Betaproteobacterium with an important role as an opportunistic pathogen in humans. Infections with B. cenocepacia are very difficult to treat due to their high intrinsic resistance to most antibiotics. Biofilm formation further adds to their antibiotic resistance. B. cenocepacia harbours a large, multi-replicon genome with a high GC-content; the reference genome of strain J2315 includes 7374 annotated genes. This study aims to annotate transcription start sites and identify novel transcripts on a whole genome scale. RNA extracted from B. cenocepacia J2315 biofilms was analysed by differential RNA-sequencing and the resulting dataset compared to data derived from conventional, global RNA-sequencing. Transcription start sites were annotated and further analysed according to their position relative to annotated genes. A total of 4010 transcription start sites were mapped over the whole B. cenocepacia genome and the primary transcription start sites of 2089 genes expressed in B. cenocepacia biofilms were defined. For 64 genes a start codon alternative to the annotated one was proposed. Substantial antisense transcription for 105 genes and two novel protein coding sequences were identified. The distribution of internal transcription start sites can be used to identify genomic islands in B. cenocepacia. A potassium pump strongly induced only under biofilm conditions was found and 15 non-coding small RNAs highly expressed in biofilms were discovered. Mapping transcription start sites across the B. cenocepacia genome added relevant information to the J2315 annotation. Genes and novel regulatory RNAs putatively involved in B. cenocepacia biofilm formation were identified. These findings will help in understanding regulation of B. cenocepacia biofilm formation.
Decomposing genomic variance using information from GWA, GWE and eQTL analysis.
Ehsani, A; Janss, L; Pomp, D; Sørensen, P
2016-04-01
A commonly used procedure in genome-wide association (GWA), genome-wide expression (GWE) and expression quantitative trait locus (eQTL) analyses is based on a bottom-up experimental approach that attempts to individually associate molecular variants with complex traits. Top-down modeling of the entire set of genomic data and partitioning of the overall variance into subcomponents may provide further insight into the genetic basis of complex traits. To test this approach, we performed a whole-genome variance components analysis and partitioned the genomic variance using information from GWA, GWE and eQTL analyses of growth-related traits in a mouse F2 population. We characterized the mouse trait genetic architecture by ordering single nucleotide polymorphisms (SNPs) based on their P-values and studying the areas under the curve (AUCs). The observed traits were found to have a genomic variance profile that differed significantly from that expected of a trait under an infinitesimal model. This situation was particularly true for both body weight and body fat, for which the AUCs were much higher compared with that of glucose. In addition, SNPs with a high degree of trait-specific regulatory potential (SNPs associated with subset of transcripts that significantly associated with a specific trait) explained a larger proportion of the genomic variance than did SNPs with high overall regulatory potential (SNPs associated with transcripts using traditional eQTL analysis). We introduced AUC measures of genomic variance profiles that can be used to quantify relative importance of SNPs as well as degree of deviation of a trait's inheritance from an infinitesimal model. The shape of the curve aids global understanding of traits: The steeper the left-hand side of the curve, the fewer the number of SNPs controlling most of the phenotypic variance. © 2015 Stichting International Foundation for Animal Genetics.
Single-cell transcriptional analysis of taste sensory neuron pair in Caenorhabditis elegans.
Takayama, Jun; Faumont, Serge; Kunitomo, Hirofumi; Lockery, Shawn R; Iino, Yuichi
2010-01-01
The nervous system is composed of a wide variety of neurons. A description of the transcriptional profiles of each neuron would yield enormous information about the molecular mechanisms that define morphological or functional characteristics. Here we show that RNA isolation from single neurons is feasible by using an optimized mRNA tagging method. This method extracts transcripts in the target cells by co-immunoprecipitation of the complexes of RNA and epitope-tagged poly(A) binding protein expressed specifically in the cells. With this method and genome-wide microarray, we compared the transcriptional profiles of two functionally different neurons in the main C. elegans gustatory neuron class ASE. Eight of the 13 known subtype-specific genes were successfully detected. Additionally, we identified nine novel genes including a receptor guanylyl cyclase, secreted proteins, a TRPC channel and uncharacterized genes conserved among nematodes, suggesting the two neurons differ more substantially than previously thought. The expression of these novel genes was controlled by the previously known regulatory network for subtype differentiation. We also describe unique motif organization within individual gene groups classified by the expression patterns in ASE. Our study paves the way to the complete catalog of the expression profiles of individual C. elegans neurons.
Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B., Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj
2013-01-01
The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants. PMID:23691254
An anatomically comprehensive atlas of the adult human brain transcriptome
Guillozet-Bongaarts, Angela L.; Shen, Elaine H.; Ng, Lydia; Miller, Jeremy A.; van de Lagemaat, Louie N.; Smith, Kimberly A.; Ebbert, Amanda; Riley, Zackery L.; Abajian, Chris; Beckmann, Christian F.; Bernard, Amy; Bertagnolli, Darren; Boe, Andrew F.; Cartagena, Preston M.; Chakravarty, M. Mallar; Chapin, Mike; Chong, Jimmy; Dalley, Rachel A.; David Daly, Barry; Dang, Chinh; Datta, Suvro; Dee, Nick; Dolbeare, Tim A.; Faber, Vance; Feng, David; Fowler, David R.; Goldy, Jeff; Gregor, Benjamin W.; Haradon, Zeb; Haynor, David R.; Hohmann, John G.; Horvath, Steve; Howard, Robert E.; Jeromin, Andreas; Jochim, Jayson M.; Kinnunen, Marty; Lau, Christopher; Lazarz, Evan T.; Lee, Changkyu; Lemon, Tracy A.; Li, Ling; Li, Yang; Morris, John A.; Overly, Caroline C.; Parker, Patrick D.; Parry, Sheana E.; Reding, Melissa; Royall, Joshua J.; Schulkin, Jay; Sequeira, Pedro Adolfo; Slaughterbeck, Clifford R.; Smith, Simon C.; Sodt, Andy J.; Sunkin, Susan M.; Swanson, Beryl E.; Vawter, Marquis P.; Williams, Derric; Wohnoutka, Paul; Zielke, H. Ronald; Geschwind, Daniel H.; Hof, Patrick R.; Smith, Stephen M.; Koch, Christof; Grant, Seth G. N.; Jones, Allan R.
2014-01-01
Neuroanatomically precise, genome-wide maps of transcript distributions are critical resources to complement genomic sequence data and to correlate functional and genetic brain architecture. Here we describe the generation and analysis of a transcriptional atlas of the adult human brain, comprising extensive histological analysis and comprehensive microarray profiling of ~900 neuroanatomically precise subdivisions in two individuals. Transcriptional regulation varies enormously by anatomical location, with different regions and their constituent cell types displaying robust molecular signatures that are highly conserved between individuals. Analysis of differential gene expression and gene co-expression relationships demonstrates that brain-wide variation strongly reflects the distributions of major cell classes such as neurons, oligodendrocytes, astrocytes and microglia. Local neighbourhood relationships between fine anatomical subdivisions are associated with discrete neuronal subtypes and genes involved with synaptic transmission. The neocortex displays a relatively homogeneous transcriptional pattern, but with distinct features associated selectively with primary sensorimotor cortices and with enriched frontal lobe expression. Notably, the spatial topography of the neocortex is strongly reflected in its molecular topography: the closer two cortical regions, the more similar their transcriptomes. This freely accessible online data resource forms a high-resolution transcriptional baseline for neurogenetic studies of normal and abnormal human brain function. PMID:22996553
Moqtaderi, Zarmik; Wang, Jie; Raha, Debasish; White, Robert J; Snyder, Michael; Weng, Zhiping; Struhl, Kevin
2010-05-01
Genome-wide occupancy profiles of five components of the RNA polymerase III (Pol III) machinery in human cells identified the expected tRNA and noncoding RNA targets and revealed many additional Pol III-associated loci, mostly near short interspersed elements (SINEs). Several genes are targets of an alternative transcription factor IIIB (TFIIIB) containing Brf2 instead of Brf1 and have extremely low levels of TFIIIC. Strikingly, expressed Pol III genes, unlike nonexpressed Pol III genes, are situated in regions with a pattern of histone modifications associated with functional Pol II promoters. TFIIIC alone associates with numerous ETC loci, via the B box or a novel motif. ETCs are often near CTCF binding sites, suggesting a potential role in chromosome organization. Our results suggest that human Pol III complexes associate preferentially with regions near functional Pol II promoters and that TFIIIC-mediated recruitment of TFIIIB is regulated in a locus-specific manner.
Sunkel, Benjamin; Wu, Dayong; Chen, Zhong; Wang, Chiou-Miin; Liu, Xiangtao; Ye, Zhenqing; Horning, Aaron M; Liu, Joseph; Mahalingam, Devalingam; Lopez-Nicora, Horacio; Lin, Chun-Lin; Goodfellow, Paul J; Clinton, Steven K; Jin, Victor X; Chen, Chun-Liang; Huang, Tim H-M; Wang, Qianben
2016-05-19
Identifying prostate cancer-driving transcription factors (TFs) in addition to the androgen receptor promises to improve our ability to effectively diagnose and treat this disease. We employed an integrative genomics analysis of master TFs CREB1 and FoxA1 in androgen-dependent prostate cancer (ADPC) and castration-resistant prostate cancer (CRPC) cell lines, primary prostate cancer tissues and circulating tumor cells (CTCs) to investigate their role in defining prostate cancer gene expression profiles. Combining genome-wide binding site and gene expression profiles we define CREB1 as a critical driver of pro-survival, cell cycle and metabolic transcription programs. We show that CREB1 and FoxA1 co-localize and mutually influence each other's binding to define disease-driving transcription profiles associated with advanced prostate cancer. Gene expression analysis in human prostate cancer samples found that CREB1/FoxA1 target gene panels predict prostate cancer recurrence. Finally, we showed that this signaling pathway is sensitive to compounds that inhibit the transcription co-regulatory factor MED1. These findings not only reveal a novel, global transcriptional co-regulatory function of CREB1 and FoxA1, but also suggest CREB1/FoxA1 signaling is a targetable driver of prostate cancer progression and serves as a biomarker of poor clinical outcomes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
The rubber tree genome shows expansion of gene family associated with rubber biosynthesis.
Lau, Nyok-Sean; Makita, Yuko; Kawashima, Mika; Taylor, Todd D; Kondo, Shinji; Othman, Ahmad Sofiman; Shu-Chien, Alexander Chong; Matsui, Minami
2016-06-24
Hevea brasiliensis Muell. Arg, a member of the family Euphorbiaceae, is the sole natural resource exploited for commercial production of high-quality natural rubber. The properties of natural rubber latex are almost irreplaceable by synthetic counterparts for many industrial applications. A paucity of knowledge on the molecular mechanisms of rubber biosynthesis in high yield traits still persists. Here we report the comprehensive genome-wide analysis of the widely planted H. brasiliensis clone, RRIM 600. The genome was assembled based on ~155-fold combined coverage with Illumina and PacBio sequence data and has a total length of 1.55 Gb with 72.5% comprising repetitive DNA sequences. A total of 84,440 high-confidence protein-coding genes were predicted. Comparative genomic analysis revealed strong synteny between H. brasiliensis and other Euphorbiaceae genomes. Our data suggest that H. brasiliensis's capacity to produce high levels of latex can be attributed to the expansion of rubber biosynthesis-related genes in its genome and the high expression of these genes in latex. Using cap analysis gene expression data, we illustrate the tissue-specific transcription profiles of rubber biosynthesis-related genes, revealing alternative means of transcriptional regulation. Our study adds to the understanding of H. brasiliensis biology and provides valuable genomic resources for future agronomic-related improvement of the rubber tree.
Pandey, Ashutosh; Alok, Anshu; Lakhwani, Deepika; Singh, Jagdeep; Asif, Mehar H.; Trivedi, Prabodh K.
2016-01-01
Flavonoid biosynthesis is largely regulated at the transcriptional level due to the modulated expression of genes related to the phenylpropanoid pathway in plants. Although accumulation of different flavonoids has been reported in banana, a staple fruit crop, no detailed information is available on regulation of the biosynthesis in this important plant. We carried out genome-wide analysis of banana (Musa acuminata, AAA genome) and identified 28 genes belonging to 9 gene families associated with flavonoid biosynthesis. Expression analysis suggested spatial and temporal regulation of the identified genes in different tissues of banana. Analysis revealed enhanced expression of genes related to flavonol and proanthocyanidin (PA) biosynthesis in peel and pulp at the early developmental stages of fruit. Genes involved in anthocyanin biosynthesis were highly expressed during banana fruit ripening. In general, higher accumulation of metabolites was observed in the peel as compared to pulp tissue. A correlation between expression of genes and metabolite content was observed at the early stage of fruit development. Furthermore, this study also suggests regulation of flavonoid biosynthesis, at transcriptional level, under light and dark exposures as well as methyl jasmonate (MJ) treatment in banana. PMID:27539368
Resolving Heart Regeneration by Replacement Histone Profiling.
Goldman, Joseph Aaron; Kuzu, Guray; Lee, Nutishia; Karasik, Jaclyn; Gemberling, Matthew; Foglia, Matthew J; Karra, Ravi; Dickson, Amy L; Sun, Fei; Tolstorukov, Michael Y; Poss, Kenneth D
2017-02-27
Chromatin regulation is a principal mechanism governing animal development, yet it is unclear to what extent structural changes in chromatin underlie tissue regeneration. Non-mammalian vertebrates such as zebrafish activate cardiomyocyte (CM) division after tissue damage to regenerate lost heart muscle. Here, we generated transgenic zebrafish expressing a biotinylatable H3.3 histone variant in CMs and derived cell-type-specific profiles of histone replacement. We identified an emerging program of putative enhancers that revise H3.3 occupancy during regeneration, overlaid upon a genome-wide reduction of H3.3 from promoters. In transgenic reporter lines, H3.3-enriched elements directed gene expression in subpopulations of CMs. Other elements increased H3.3 enrichment and displayed enhancer activity in settings of injury- and/or Neuregulin1-elicited CM proliferation. Dozens of consensus sequence motifs containing predicted transcription factor binding sites were enriched in genomic regions with regeneration-responsive H3.3 occupancy. Thus, cell-type-specific regulatory programs of tissue regeneration can be revealed by genome-wide H3.3 profiling. Copyright © 2017 Elsevier Inc. All rights reserved.
Bajaj, Deepak; Das, Shouvik; Upadhyaya, Hari D.; Ranjan, Rajeev; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. Laxmipathi; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
The study identified 9045 high-quality SNPs employing both genome-wide GBS- and candidate gene-based SNP genotyping assays in 172 chickpea accessions, comprising 93 cultivated (desi and kabuli) and 79 wild accessions. The GWAS in a structured population of 93 sequenced accessions detected 15 major genomic loci exhibiting significant association with seed coat color. Five seed color-associated major genomic loci underlying robust QTLs mapped on a high-density intra-specific genetic linkage map were validated by QTL mapping. The integration of association and QTL mapping with gene haplotype-specific LD mapping and transcript profiling identified novel allelic variants (non-synonymous SNPs) and haplotypes in a MATE secondary transporter gene regulating light/yellow brown and beige seed coat color differentiation in chickpea. The down-regulation and decreased transcript expression of the beige seed coat color-associated MATE gene haplotype was correlated with reduced proanthocyanidin accumulation in the mature seed coats of beige (BE) compared with light/yellow brown (LB/YB) seed colored desi and kabuli accessions. This seed color-regulating MATE gene revealed strong purifying selection pressure primarily in LB/YB seed colored desi and wild Cicer reticulatum accessions compared with the BE seed colored kabuli accessions. The functionally relevant molecular tags identified have potential to decipher the complex transcriptional regulatory gene function of seed coat coloration and for understanding the selective sweep-based seed color trait evolutionary pattern in cultivated and wild accessions during chickpea domestication. The genome-wide integrated approach employed will expedite marker-assisted genetic enhancement for developing cultivars with desirable seed coat color types in chickpea. PMID:26635822
DNA methylation profiles of donor nuclei cells and tissues of cloned bovine fetuses.
Kremenskoy, Maksym; Kremenska, Yuliya; Suzuki, Masako; Imai, Kei; Takahashi, Seiya; Hashizume, Kazuyoshi; Yagi, Shintaro; Shiota, Kunio
2006-04-01
Methylation of DNA in CpG islands plays an important role during fetal development and differentiation because CpG islands are preferentially located in upstream regions of mammalian genomic DNA, including the transcription start site of housekeeping genes and are also associated with tissue-specific genes. Somatic nuclear transfer (NT) technology has been used to generate live clones in numerous mammalian species, but only a low percentage of nuclear transferred animals develop to term. Abnormal epigenetic changes in the CpG islands of donor nuclei after nuclear transfer could contribute to a high rate of abortion during early gestation and increase perinatal death. These changes have yet to be explored. Thus, we investigated the genome-wide DNA methylation profiles of CpG islands in nuclei donor cells and NT animals. Using Restriction Landmark Genomic Scanning (RLGS), we showed, for the first time, the epigenetic profile formation of tissues from NT bovine fetuses produced from cumulus cells. From approximately 2600 unmethylated NotI sites visualized on the RLGS profile, at least 35 NotI sites showed different methylation statuses. Moreover, we proved that fetal and placental tissues from artificially inseminated and cloned cattle have tissue-specific differences in the genome-wide methylation profiles of the CpG islands. We also found that possible abnormalities occurred in the fetal brain and placental tissues of cloned animals.
Tiengwe, Calvin; Marcello, Lucio; Farr, Helen; Dickens, Nicholas; Kelly, Steven; Swiderski, Michal; Vaughan, Diane; Gull, Keith; Barry, J. David; Bell, Stephen D.; McCulloch, Richard
2012-01-01
Identification of replication initiation sites, termed origins, is a crucial step in understanding genome transmission in any organism. Transcription of the Trypanosoma brucei genome is highly unusual, with each chromosome comprising a few discrete transcription units. To understand how DNA replication occurs in the context of such organization, we have performed genome-wide mapping of the binding sites of the replication initiator ORC1/CDC6 and have identified replication origins, revealing that both localize to the boundaries of the transcription units. A remarkably small number of active origins is seen, whose spacing is greater than in any other eukaryote. We show that replication and transcription in T. brucei have a profound functional overlap, as reducing ORC1/CDC6 levels leads to genome-wide increases in mRNA levels arising from the boundaries of the transcription units. In addition, ORC1/CDC6 loss causes derepression of silent Variant Surface Glycoprotein genes, which are critical for host immune evasion. PMID:22840408
Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.
Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G
2000-12-15
The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.
GWIPS-viz: development of a ribo-seq genome browser
Michel, Audrey M.; Fox, Gearoid; M. Kiran, Anmol; De Bo, Christof; O’Connor, Patrick B. F.; Heaphy, Stephen M.; Mullan, James P. A.; Donohue, Claire A.; Higgins, Desmond G.; Baranov, Pavel V.
2014-01-01
We describe the development of GWIPS-viz (http://gwips.ucc.ie), an online genome browser for viewing ribosome profiling data. Ribosome profiling (ribo-seq) is a recently developed technique that provides genome-wide information on protein synthesis (GWIPS) in vivo. It is based on the deep sequencing of ribosome-protected messenger RNA (mRNA) fragments, which allows the ribosome density along all mRNA transcripts present in the cell to be quantified. Since its inception, ribo-seq has been carried out in a number of eukaryotic and prokaryotic organisms. Owing to the increasing interest in ribo-seq, there is a pertinent demand for a dedicated ribo-seq genome browser. GWIPS-viz is based on The University of California Santa Cruz (UCSC) Genome Browser. Ribo-seq tracks, coupled with mRNA-seq tracks, are currently available for several genomes: human, mouse, zebrafish, nematode, yeast, bacteria (Escherichia coli K12, Bacillus subtilis), human cytomegalovirus and bacteriophage lambda. Our objective is to continue incorporating published ribo-seq data sets so that the wider community can readily view ribosome profiling information from multiple studies without the need to carry out computational processing. PMID:24185699
Bloom, Chloe I; Graham, Christine M; Berry, Matthew P R; Rozakeas, Fotini; Redford, Paul S; Wang, Yuanyuan; Xu, Zhaohui; Wilkinson, Katalin A; Wilkinson, Robert J; Kendrick, Yvonne; Devouassoux, Gilles; Ferry, Tristan; Miyara, Makoto; Bouvry, Diane; Valeyre, Dominique; Gorochov, Guy; Blankenship, Derek; Saadatian, Mitra; Vanhems, Phillip; Beynon, Huw; Vancheeswaran, Rama; Wickremasinghe, Melissa; Chaussabel, Damien; Banchereau, Jacques; Pascual, Virginia; Ho, Ling-Pei; Lipman, Marc; O'Garra, Anne
2013-01-01
New approaches to define factors underlying the immunopathogenesis of pulmonary diseases including sarcoidosis and tuberculosis are needed to develop new treatments and biomarkers. Comparing the blood transcriptional response of tuberculosis to other similar pulmonary diseases will advance knowledge of disease pathways and help distinguish diseases with similar clinical presentations. We aimed to determine the factors underlying the immunopathogenesis of the granulomatous diseases, sarcoidosis and tuberculosis, by comparing the blood transcriptional responses in these and other pulmonary diseases. We compared whole blood genome-wide transcriptional profiles in pulmonary sarcoidosis and pulmonary tuberculosis with those in community-acquired pneumonia, primary lung cancer and healthy controls, before and after treatment, and in purified leucocyte populations. An interferon-inducible neutrophil-driven blood transcriptional signature was present in both sarcoidosis and tuberculosis, with a higher abundance and expression in tuberculosis. Heterogeneity of the sarcoidosis signature correlated significantly with disease activity. Transcriptional profiles in pneumonia and lung cancer revealed an over-abundance of inflammatory transcripts. After successful treatment the transcriptional activity in tuberculosis and pneumonia patients was significantly reduced. However, the glucocorticoid-responsive sarcoidosis patients showed a significant increase in transcriptional activity. A total of 144 blood transcripts were able to distinguish tuberculosis from other lung diseases and controls. Tuberculosis and sarcoidosis revealed similar blood transcriptional profiles, dominated by interferon-inducible transcripts, while pneumonia and lung cancer showed distinct signatures, dominated by inflammatory genes. There were also significant differences between tuberculosis and sarcoidosis in the degree of their transcriptional activity, the heterogeneity of their profiles and their transcriptional response to treatment.
Almstrup, Kristian; Hoei-Hansen, Christina E; Wirkner, Ute; Blake, Jonathon; Schwager, Christian; Ansorge, Wilhelm; Nielsen, John E; Skakkebaek, Niels E; Rajpert-De Meyts, Ewa; Leffers, Henrik
2004-07-15
Carcinoma in situ (CIS) is the common precursor of histologically heterogeneous testicular germ cell tumors (TGCTs), which in recent decades have markedly increased and now are the most common malignancy of young men. Using genome-wide gene expression profiling, we identified >200 genes highly expressed in testicular CIS, including many never reported in testicular neoplasms. Expression was further verified by semiquantitative reverse transcription-PCR and in situ hybridization. Among the highest expressed genes were NANOG and POU5F1, and reverse transcription-PCR revealed possible changes in their stoichiometry on progression into embryonic carcinoma. We compared the CIS expression profile with patterns reported in embryonic stem cells (ESCs), which revealed a substantial overlap that may be as high as 50%. We also demonstrated an over-representation of expressed genes in regions of 17q and 12, reported as unstable in cultured ESCs. The close similarity between CIS and ESCs explains the pluripotency of CIS. Moreover, the findings are consistent with an early prenatal origin of TGCTs and thus suggest that etiologic factors operating in utero are of primary importance for the incidence trends of TGCTs. Finally, some of the highly expressed genes identified in this study are promising candidates for new diagnostic markers for CIS and/or TGCTs.
Kim, Ji Eun; Lee, Min Hee; Cho, Eun Ju; Kim, Ji Hong; Chung, Byung Yeoup; Kim, Jin-Hong
2013-12-01
Ionizing radiation causes various epigenetic changes, as well as a variety of DNA lesions such as strand breaks, cross-links, oxidative damages, etc., in genomes. However, radiation-induced epigenetic changes have rarely been substantiated in plant genomes. The current study investigates whether DNA methylation of Arabidopsis thaliana genome is altered by gamma rays. We found that genomic DNA methylation decreased in wild-type plants with increasing doses of gamma rays (5, 50 and 200 Gy). Irradiation with 200 Gy significantly increased the expression of transcriptionally inactive centromeric 180-bp (CEN) and transcriptionally silent information (TSI) repeats. This increase suggested that there was a substantial release of transcriptional gene silencing by gamma rays, probably by induction of DNA hypomethylation. High expression of the DNA demethylase ROS1 and low expression of the DNA methyltransferase CMT3 supported this hypothesis. Moreover, Southern blot analysis following digestion of genomic DNA with methylation-sensitive enzymes revealed that the DNA hypomethylation occurred preferentially at CHG or CHH sites rather than CG sites, depending on the radiation dose. Unlike CEN and TSI repeats, the number of Ta3, AtSN1 and FWA repeats decreased in transcription but increased in non-CG methylation. In addition, the cmt3-11 mutant showed neither DNA hypomethylation nor transcriptional activation of silenced repeats upon gamma irradiation. Furthermore, profiles of genome-wide transcriptomes in response to gamma rays differed between the wild-type and cmt3-11 mutant. These results suggest that gamma irradiation induced DNA hypomethylation preferentially at non-CG sites of transcriptionally inactive repeats in a locus-specific manner, which depends on CMT3 activity.
Pervasive, Genome-Wide Transcription in the Organelle Genomes of Diverse Plastid-Bearing Protists.
Sanitá Lima, Matheus; Smith, David Roy
2017-11-06
Organelle genomes are among the most sequenced kinds of chromosome. This is largely because they are small and widely used in molecular studies, but also because next-generation sequencing technologies made sequencing easier, faster, and cheaper. However, studies of organelle RNA have not kept pace with those of DNA, despite huge amounts of freely available eukaryotic RNA-sequencing (RNA-seq) data. Little is known about organelle transcription in nonmodel species, and most of the available eukaryotic RNA-seq data have not been mined for organelle transcripts. Here, we use publicly available RNA-seq experiments to investigate organelle transcription in 30 diverse plastid-bearing protists with varying organelle genomic architectures. Mapping RNA-seq data to organelle genomes revealed pervasive, genome-wide transcription, regardless of the taxonomic grouping, gene organization, or noncoding content. For every species analyzed, transcripts covered ≥85% of the mitochondrial and/or plastid genomes (all of which were ≤105 kb), indicating that most of the organelle DNA, both coding and noncoding, is transcriptionally active. These results follow earlier studies of model species showing that organellar transcription is coupled and ubiquitous across the genome, requiring significant downstream processing of polycistronic transcripts. Our findings suggest that noncoding organelle DNA can be transcriptionally active, raising questions about the underlying function of these transcripts and underscoring the utility of publicly available RNA-seq data for recovering complete genome sequences. If pervasive transcription is also found in bigger organelle genomes (>105 kb) and across a broader range of eukaryotes, this could indicate that noncoding organelle RNAs are regulating fundamental processes within eukaryotic cells. Copyright © 2017 Sanitá Lima and Smith.
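The "transcripts covered ≥85% of the genome" figure in the abstract above is a breadth-of-coverage statistic. As a minimal sketch of how such a value might be computed from per-base read depths (the function name, the depth threshold and the toy depths are illustrative assumptions, not the authors' pipeline or data):

```python
def breadth_of_coverage(depths, min_depth=1):
    """Return the fraction of positions with read depth >= min_depth.

    `depths` is a sequence of per-base read depths along one genome.
    The default threshold of 1 read is an assumption for illustration.
    """
    if not depths:
        raise ValueError("depths must contain at least one position")
    covered = sum(1 for d in depths if d >= min_depth)
    return covered / len(depths)


# Toy per-base depths for a hypothetical 10-bp genome (not real data).
toy_depths = [0, 3, 5, 2, 0, 1, 4, 4, 7, 1]
print(f"{breadth_of_coverage(toy_depths):.0%} of positions covered")
```

In practice the depth vector would come from mapping RNA-seq reads to the organelle genome; raising `min_depth` gives a stricter definition of "transcribed".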
Parallel factor ChIP provides essential internal control for quantitative differential ChIP-seq.
Guertin, Michael J; Cullen, Amy E; Markowetz, Florian; Holding, Andrew N
2018-04-17
A key challenge in quantitative ChIP combined with high-throughput sequencing (ChIP-seq) is the normalization of data in the presence of genome-wide changes in occupancy. Analysis-based normalization methods were developed for transcriptomic data and these are dependent on the underlying assumption that total transcription does not change between conditions. For genome-wide changes in transcription factor (TF) binding, these assumptions do not hold true. The challenges in normalization are confounded by experimental variability during sample preparation, processing and recovery. We present a novel normalization strategy utilizing an internal standard of unchanged peaks for reference. Our method can be readily applied to monitor genome-wide changes by ChIP-seq that are otherwise lost or misrepresented through analytical normalization. We compare our approach to normalization by total read depth and two alternative methods that utilize external experimental controls to study TF binding. We successfully resolve the key challenges in quantitative ChIP-seq analysis and demonstrate its application by monitoring the loss of Estrogen Receptor-alpha (ER) binding upon fulvestrant treatment, ER binding in response to estradiol, ER mediated change in H4K12 acetylation and profiling ER binding in patient-derived xenografts. This is supported by an adaptable pipeline to normalize and quantify differential TF binding genome-wide and generate metrics for differential binding at individual sites.
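The idea of normalizing against an internal standard of unchanged peaks can be illustrated with a small sketch. It assumes that each sample is simply rescaled so that total reads in the control peaks match across samples; the function names, the scaling rule and the toy counts are assumptions for illustration and are not taken from the authors' pipeline.

```python
def control_scale_factors(control_counts):
    """Per-sample factors that equalise total reads in the control peaks.

    `control_counts` maps sample name -> read counts in peaks assumed
    to be unchanged between conditions (the internal standard).
    """
    totals = {sample: sum(counts) for sample, counts in control_counts.items()}
    if any(total <= 0 for total in totals.values()):
        raise ValueError("every sample needs reads in the control peaks")
    reference = sum(totals.values()) / len(totals)  # mean control signal
    return {sample: reference / total for sample, total in totals.items()}


def normalise(target_counts, factors):
    """Scale each sample's target-peak counts by its control-derived factor."""
    return {
        sample: [count * factors[sample] for count in counts]
        for sample, counts in target_counts.items()
    }


# Toy counts (not real data): the treated library recovered half as many
# control reads, so its target signal is scaled up before comparison.
control = {"untreated": [100, 200], "treated": [50, 100]}
target = {"untreated": [400], "treated": [100]}
factors = control_scale_factors(control)
print(normalise(target, factors))
```

With these toy numbers the raw treated/untreated ratio at the target peak is 0.25, while the control-scaled ratio is 0.5, which shows how a global difference in recovery can exaggerate an apparent loss of binding.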
Establishment and functions of DNA methylation in the germline
Stewart, Kathleen R; Veselovska, Lenka; Kelsey, Gavin
2016-01-01
Epigenetic modifications established during gametogenesis regulate transcription and other nuclear processes in gametes, but also have influences in the zygote, embryo and postnatal life. This is best understood for DNA methylation which, established at discrete regions of the oocyte and sperm genomes, governs genomic imprinting. In this review, we describe how imprinting has informed our understanding of de novo DNA methylation mechanisms, highlight how recent genome-wide profiling studies have provided unprecedented insights into establishment of the sperm and oocyte methylomes and consider the fate and function of gametic methylation and other epigenetic modifications after fertilization. PMID:27659720
Jin, Hyun Mi; Jeong, Hye Im; Kim, Kyung Hyun; Hahn, Yoonsoo; Madsen, Eugene L; Jeon, Che Ok
2016-02-18
A genome-wide transcriptional analysis of Alteromonas naphthalenivorans SN2 was performed to investigate its ecophysiological behavior in contaminated tidal flats and seawater. The experimental design mimicked these habitats, with either naphthalene or pyruvate added as the carbon source: tidal flat-naphthalene (TF-N), tidal flat-pyruvate (TF-P), seawater-naphthalene (SW-N), and seawater-pyruvate (SW-P). The transcriptional profiles clustered by habitat (TF-N/TF-P and SW-N/SW-P), rather than carbon source, suggesting that the former may exert a greater influence on genome-wide expression in strain SN2 than the latter. Metabolic mapping of cDNA reads from strain SN2 based on KEGG pathways showed that metabolic and regulatory genes associated with energy metabolism, translation, and cell motility were highly expressed in all four test conditions, probably highlighting the copiotrophic properties of strain SN2 as an opportunistic marine r-strategist. Differential gene expression analysis revealed that strain SN2 displayed specific cellular responses to environmental variables (tidal flat, seawater, naphthalene, and pyruvate) and exhibited certain ecological fitness traits: its notable PAH degradation capability in seasonally cold tidal flats might be reflected in elevated expression of stress response and chaperone proteins, while fast growth in nitrogen-deficient and aerobic seawater probably correlated with high expression of glutamine synthetase, enzymes utilizing nitrite/nitrate, and those involved in the removal of reactive oxygen species.
Jin, Hyun Mi; Jeong, Hye Im; Kim, Kyung Hyun; Hahn, Yoonsoo; Madsen, Eugene L.; Jeon, Che Ok
2016-01-01
A genome-wide transcriptional analysis of Alteromonas naphthalenivorans SN2 was performed to investigate its ecophysiological behavior in contaminated tidal flats and seawater. The experimental design mimicked these habitats, with either naphthalene or pyruvate added as the carbon source: tidal flat-naphthalene (TF-N), tidal flat-pyruvate (TF-P), seawater-naphthalene (SW-N), and seawater-pyruvate (SW-P). The transcriptional profiles clustered by habitat (TF-N/TF-P and SW-N/SW-P), rather than carbon source, suggesting that the former may exert a greater influence on genome-wide expression in strain SN2 than the latter. Metabolic mapping of cDNA reads from strain SN2 based on KEGG pathways showed that metabolic and regulatory genes associated with energy metabolism, translation, and cell motility were highly expressed in all four test conditions, probably highlighting the copiotrophic properties of strain SN2 as an opportunistic marine r-strategist. Differential gene expression analysis revealed that strain SN2 displayed specific cellular responses to environmental variables (tidal flat, seawater, naphthalene, and pyruvate) and exhibited certain ecological fitness traits: its notable PAH degradation capability in seasonally cold tidal flats might be reflected in elevated expression of stress response and chaperone proteins, while fast growth in nitrogen-deficient and aerobic seawater probably correlated with high expression of glutamine synthetase, enzymes utilizing nitrite/nitrate, and those involved in the removal of reactive oxygen species. PMID:26887987
Whole-genome expression analysis of mammalian-wide interspersed repeat elements in human cell lines.
Carnevali, Davide; Conti, Anastasia; Pellegrini, Matteo; Dieci, Giorgio
2017-02-01
With more than 500,000 copies, mammalian-wide interspersed repeats (MIRs), a sub-group of SINEs, represent ∼2.5% of the human genome and one of the most numerous families of potential targets for the RNA polymerase (Pol) III transcription machinery. Since MIR elements ceased to amplify ∼130 myr ago, previous studies primarily focused on their genomic impact, while the issue of their expression has not been extensively addressed. We applied a dedicated bioinformatic pipeline to ENCODE RNA-Seq datasets of seven human cell lines and, for the first time, we were able to define the Pol III-driven MIR transcriptome at single-locus resolution. While the majority of Pol III-transcribed MIR elements are cell-specific, we discovered a small set of ubiquitously transcribed MIRs mapping within Pol II-transcribed genes in antisense orientation that could influence the expression of the overlapping gene. We also identified novel Pol III-transcribed ncRNAs, deriving from transcription of annotated MIR fragments flanked by unique MIR-unrelated sequences, and confirmed the role of Pol III-specific internal promoter elements in MIR transcription. Besides demonstrating widespread transcription at these retrotranspositionally inactive elements in human cells, the ability to profile MIR expression at single-locus resolution will facilitate their study in different cell types and states including pathological alterations. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Strategies for the acquisition of transcriptional and epigenetic information in single cells.
Li, Guang; Dzilic, Elda; Flores, Nick; Shieh, Alice; Wu, Sean M
2017-03-01
As the basic unit of living organisms, each single cell has unique molecular signatures and functions. Our ability to uncover the transcriptional and epigenetic signature of single cells has been hampered by the lack of tools to explore this area of research. The advent of microfluidic single cell technology along with single cell genome-wide DNA amplification methods has greatly improved our understanding of the expression variation in single cells. Transcriptional expression profiling by multiplex qPCR or genome-wide RNA sequencing has enabled us to examine gene expression in single cells in different tissues. With the new tools, new cellular heterogeneity, novel marker genes, unique subpopulations, and the spatial locations of single cells can be identified successfully. Epigenetic modifications for each single cell can also be obtained via similar methods. Based on single cell genome sequencing, single cell epigenetic information including histone modifications, DNA methylation, and chromatin accessibility has been explored and has provided valuable insights regarding gene regulation and disease prognosis. In this article, we review the development of strategies to obtain single cell transcriptional and epigenetic data. Furthermore, we discuss ways in which single cell studies may help to provide greater understanding of the mechanisms of basic cardiovascular biology that will eventually lead to improvement in our ability to diagnose disease and develop new therapies.
Pócsi, István; Miskei, Márton; Karányi, Zsolt; Emri, Tamás; Ayoubi, Patricia; Pusztahelyi, Tünde; Balla, György; Prade, Rolf A
2005-01-01
Background In addition to their cytotoxic nature, reactive oxygen species (ROS) are also signal molecules in diverse cellular processes in eukaryotic organisms. Linking genome-wide transcriptional changes to cellular physiology in oxidative stress-exposed Aspergillus nidulans cultures provides the opportunity to estimate the sizes of peroxide (O₂²⁻), superoxide (O₂•⁻) and glutathione/glutathione disulphide (GSH/GSSG) redox imbalance responses. Results Genome-wide transcriptional changes triggered by diamide, H₂O₂ and menadione in A. nidulans vegetative tissues were recorded using DNA microarrays containing 3533 unique PCR-amplified probes. Evaluation of LOESS-normalized data indicated that 2499 gene probes were affected by at least one stress-inducing agent. The stresses induced by diamide and H₂O₂ were pulse-like, with recovery after 1 h exposure time, while no recovery was observed with menadione. The distribution of stress-responsive gene probes among major physiological functional categories was approximately the same for each agent. The gene group sizes solely responsive to changes in intracellular O₂²⁻, O₂•⁻ concentrations or to GSH/GSSG redox imbalance were estimated at 7.7, 32.6 and 13.0 %, respectively. Gene groups responsive to diamide, H₂O₂ and menadione treatments and gene groups influenced by GSH/GSSG, O₂²⁻ and O₂•⁻ were only partly overlapping with distinct enrichment profiles within functional categories. Changes in the GSH/GSSG redox state influenced expression of genes coding for PBS2 like MAPK kinase homologue, PSK2 kinase homologue, AtfA transcription factor, and many elements of ubiquitin tagging, cell division cycle regulators, translation machinery proteins, defense and stress proteins, transport proteins as well as many enzymes of the primary and secondary metabolisms.
Meanwhile, a separate set of genes encoding transport proteins, CpcA and JlbA amino acid starvation-responsive transcription factors, and some elements of sexual development and sporulation was ROS responsive. Conclusion The existence of separate O₂²⁻, O₂•⁻ and GSH/GSSG responsive gene groups in a eukaryotic genome has been demonstrated. Oxidant-triggered, genome-wide transcriptional changes should be analyzed considering changes in oxidative stress-responsive physiological conditions and not correlating them directly to the chemistry and concentrations of the oxidative stress-inducing agent. PMID:16368011
Genome-wide transcriptome and expression profile analysis of Phalaenopsis during explant browning.
Xu, Chuanjun; Zeng, Biyu; Huang, Junmei; Huang, Wen; Liu, Yumei
2015-01-01
Explant browning presents a major problem for in vitro culture, and can lead to the death of the explant and failure of regeneration. Considerable work has examined the physiological mechanisms underlying Phalaenopsis leaf explant browning, but the molecular mechanisms of browning remain elusive. In this study, we used whole genome RNA sequencing to examine Phalaenopsis leaf explant browning at genome-wide level. We first used Illumina high-throughput technology to sequence the transcriptome of Phalaenopsis and then performed de novo transcriptome assembly. We assembled 79,434,350 clean reads into 31,708 isogenes and generated 26,565 annotated unigenes. We assigned Gene Ontology (GO) terms, Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations, and potential Pfam domains to each transcript. Using the transcriptome data as a reference, we next analyzed the differential gene expression of explants cultured for 0, 3, and 6 d, respectively. We then identified differentially expressed genes (DEGs) before and after Phalaenopsis explant browning. We also performed GO, KEGG functional enrichment and Pfam analysis of all DEGs. Finally, we selected 11 genes for quantitative real-time PCR (qPCR) analysis to confirm the expression profile analysis. Here, we report the first comprehensive analysis of transcriptome and expression profiles during Phalaenopsis explant browning. Our results suggest that Phalaenopsis explant browning may be due in part to gene expression changes that affect the secondary metabolism, such as: phenylpropanoid pathway and flavonoid biosynthesis. Genes involved in photosynthesis and ATPase activity have been found to be changed at transcription level; these changes may perturb energy metabolism and thus lead to the decay of plant cells and tissues. This study provides comprehensive gene expression data for Phalaenopsis browning. Our data constitute an important resource for further functional studies to prevent explant browning.
Multi-targeted priming for genome-wide gene expression assays.
Adomas, Aleksandra B; Lopez-Giraldez, Francesc; Clark, Travis A; Wang, Zheng; Townsend, Jeffrey P
2010-08-17
Complementary approaches to assaying global gene expression are needed to assess gene expression in regions that are poorly assayed by current methodologies. A key component of nearly all gene expression assays is the reverse transcription of transcribed sequences that has traditionally been performed by priming the poly-A tails on many of the transcribed genes in eukaryotes with oligo-dT, or by priming RNA indiscriminately with random hexamers. We designed an algorithm to find common sequence motifs that were present within most protein-coding genes of Saccharomyces cerevisiae and of Neurospora crassa, but that were not present within their ribosomal RNA or transfer RNA genes. We then experimentally tested whether degenerately priming these motifs with multi-targeted primers improved the accuracy and completeness of transcriptomic assays. We discovered two multi-targeted primers that would prime a preponderance of genes in the genomes of Saccharomyces cerevisiae and Neurospora crassa while avoiding priming ribosomal RNA or transfer RNA. Examining the response of Saccharomyces cerevisiae to nitrogen deficiency and profiling Neurospora crassa early sexual development, we demonstrated that using multi-targeted primers in reverse transcription led to superior performance of microarray profiling and next-generation RNA tag sequencing. Priming with multi-targeted primers in addition to oligo-dT resulted in higher sensitivity, a larger number of well-measured genes and greater power to detect differences in gene expression. Our results provide the most complete and detailed expression profiles of the yeast nitrogen starvation response and N. crassa early sexual development to date. Furthermore, our multi-targeting priming methodology for genome-wide gene expression assays provides selective targeting of multiple sequences and counter-selection against undesirable sequences, facilitating a more complete and precise assay of the transcribed sequences within the genome.
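The motif search described in the abstract above (motifs present within most protein-coding genes but absent from rRNA and tRNA genes) can be sketched as a k-mer filter. This is a simplified illustration under stated assumptions: fixed-length exact k-mers, no degeneracy, and no strand handling. The function names, the parameters `k` and `min_fraction`, and the toy sequences are hypothetical and do not reproduce the authors' algorithm.

```python
from collections import Counter


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def candidate_motifs(coding_seqs, excluded_seqs, k=6, min_fraction=0.5):
    """k-mers found in >= min_fraction of coding sequences and in no excluded one.

    `excluded_seqs` stands in for rRNA/tRNA genes that priming should avoid.
    """
    excluded = set()
    for seq in excluded_seqs:
        excluded |= kmers(seq, k)

    # Count, for each k-mer, how many coding sequences contain it at least once.
    gene_counts = Counter()
    for seq in coding_seqs:
        gene_counts.update(kmers(seq, k))

    n = len(coding_seqs)
    return sorted(
        motif for motif, count in gene_counts.items()
        if count / n >= min_fraction and motif not in excluded
    )


# Toy sequences (not real genes), using 3-mers to keep the example readable.
coding = ["ACGTAC", "TACGTT", "GGACGT"]
unwanted = ["CGTA"]
print(candidate_motifs(coding, unwanted, k=3, min_fraction=1.0))
```

Lowering `min_fraction` widens the candidate set; a real design would additionally need to allow degenerate positions and score how well each candidate primes, as the abstract indicates the authors tested experimentally.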
Genome-Wide Transcriptome and Expression Profile Analysis of Phalaenopsis during Explant Browning
Xu, Chuanjun; Zeng, Biyu; Huang, Junmei; Huang, Wen; Liu, Yumei
2015-01-01
Background Explant browning presents a major problem for in vitro culture, and can lead to the death of the explant and failure of regeneration. Considerable work has examined the physiological mechanisms underlying Phalaenopsis leaf explant browning, but the molecular mechanisms of browning remain elusive. In this study, we used whole genome RNA sequencing to examine Phalaenopsis leaf explant browning at genome-wide level. Methodology/Principal Findings We first used Illumina high-throughput technology to sequence the transcriptome of Phalaenopsis and then performed de novo transcriptome assembly. We assembled 79,434,350 clean reads into 31,708 isogenes and generated 26,565 annotated unigenes. We assigned Gene Ontology (GO) terms, Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations, and potential Pfam domains to each transcript. Using the transcriptome data as a reference, we next analyzed the differential gene expression of explants cultured for 0, 3, and 6 d, respectively. We then identified differentially expressed genes (DEGs) before and after Phalaenopsis explant browning. We also performed GO, KEGG functional enrichment and Pfam analysis of all DEGs. Finally, we selected 11 genes for quantitative real-time PCR (qPCR) analysis to confirm the expression profile analysis. Conclusions/Significance Here, we report the first comprehensive analysis of transcriptome and expression profiles during Phalaenopsis explant browning. Our results suggest that Phalaenopsis explant browning may be due in part to gene expression changes that affect the secondary metabolism, such as: phenylpropanoid pathway and flavonoid biosynthesis. Genes involved in photosynthesis and ATPase activity have been found to be changed at transcription level; these changes may perturb energy metabolism and thus lead to the decay of plant cells and tissues. This study provides comprehensive gene expression data for Phalaenopsis browning. 
Our data constitute an important resource for further functional studies to prevent explant browning. PMID:25874455
Lijun Liu; Trevor Ramsay; Matthew S. Zinkgraf; David Sundell; Nathaniel Robert Street; Vladimir Filkov; Andrew Groover
2015-01-01
Identifying transcription factor target genes is essential for modeling the transcriptional networks underlying developmental processes. Here we report a chromatin immunoprecipitation sequencing (ChIP-seq) resource consisting of genome-wide binding regions and associated putative target genes for four Populus homeodomain transcription factors...
Kuuluvainen, Emilia; Hakala, Heini; Havula, Essi; Sahal Estimé, Michelle; Rämet, Mika; Hietakangas, Ville; Mäkelä, Tomi P
2014-06-06
The Cdk8 (cyclin-dependent kinase 8) module of Mediator integrates regulatory cues from transcription factors to RNA polymerase II. It consists of four subunits where Med12 and Med13 link Cdk8 and cyclin C (CycC) to core Mediator. Here we have investigated the contributions of the Cdk8 module subunits to transcriptional regulation using RNA interference in Drosophila cells. Genome-wide expression profiling demonstrated separation of Cdk8-CycC and Med12-Med13 profiles. However, transcriptional regulation by Cdk8-CycC was dependent on Med12-Med13. This observation also revealed that Cdk8-CycC and Med12-Med13 often have opposite transcriptional effects. Interestingly, Med12 and Med13 profiles overlapped significantly with that of the GATA factor Serpent. Accordingly, mutational analyses indicated that GATA sites are required for Med12-Med13 regulation of Serpent-dependent genes. Med12 and Med13 were also found to be required for Serpent-activated innate immunity genes in defense to bacterial infection. The results reveal a novel role for the Cdk8 module in Serpent-dependent transcription and innate immunity. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.
Genome-wide expression analyses of the stationary phase model of ageing in yeast.
Wanichthanarak, Kwanjeera; Wongtosrad, Nutvadee; Petranovic, Dina
2015-07-01
Ageing processes involved in replicative lifespan (RLS) and chronological lifespan (CLS) have been found to be conserved among many organisms, including in unicellular Eukarya such as yeast Saccharomyces cerevisiae. Here we performed an integrated analysis of genome-wide expression profiles of yeast at different time points, during growth and starvation. The aim of the study was to identify transcriptional changes in those conditions by using several different computational analyses in order to propose transcription factors, biological networks and metabolic pathways that seem to be relevant during the process of chronological ageing in yeast. Specifically, we performed differential gene expression analysis, gene-set enrichment analysis and network-based analysis, and we identified pathways affected in the stationary phase and specific transcription factors driving transcriptional adaptations. The results indicate signal propagation from G protein-coupled receptors through signaling pathway components and other stress and nutrient-induced transcription factors resulting in adaptation of yeast cells to the lack of nutrients by activating metabolism associated with aerobic metabolism of carbon sources such as ethanol, glycerol and fatty acids. In addition, we found STE12, XBP1 and TOS8 as highly connected nodes in the subnetworks of ageing yeast. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
2010-01-01
Background The identification of non-coding transcripts in human, mouse, and Escherichia coli has revealed their widespread occurrence and functional importance in both eukaryotic and prokaryotic life. In prokaryotes, studies have shown that non-coding transcripts participate in a broad range of cellular functions like gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Streptococcus pneumoniae (pneumococcus), an obligate human respiratory pathogen responsible for significant worldwide morbidity and mortality. Tiling microarrays enable genome wide mRNA profiling as well as identification of novel transcripts at a high-resolution. Results Here, we describe a high-resolution transcription map of the S. pneumoniae clinical isolate TIGR4 using genomic tiling arrays. Our results indicate that approximately 66% of the genome is expressed under our experimental conditions. We identified a total of 50 non-coding small RNAs (sRNAs) from the intergenic regions, of which 36 had no predicted function. Half of the identified sRNA sequences were found to be unique to S. pneumoniae genome. We identified eight overrepresented sequence motifs among sRNA sequences that correspond to sRNAs in different functional categories. Tiling arrays also identified approximately 202 operon structures in the genome. Conclusions In summary, the pneumococcal operon structures and novel sRNAs identified in this study enhance our understanding of the complexity and extent of the pneumococcal 'expressed' genome. Furthermore, the results of this study open up new avenues of research for understanding the complex RNA regulatory network governing S. pneumoniae physiology and virulence. PMID:20525227
USDA-ARS's Scientific Manuscript database
Background: The objective of this study was to acquire a broader, more comprehensive picture of the transcriptional changes in the L. Thoracis muscle (LT) and subcutaneous fat (SF) of lambs supplemented with vitamin E. Furthermore, we aimed to identify novel genes involved in the metabolism of vitam...
The rubber tree genome shows expansion of gene family associated with rubber biosynthesis
Lau, Nyok-Sean; Makita, Yuko; Kawashima, Mika; Taylor, Todd D.; Kondo, Shinji; Othman, Ahmad Sofiman; Shu-Chien, Alexander Chong; Matsui, Minami
2016-01-01
Hevea brasiliensis Muell. Arg, a member of the family Euphorbiaceae, is the sole natural resource exploited for commercial production of high-quality natural rubber. The properties of natural rubber latex are almost irreplaceable by synthetic counterparts for many industrial applications. A paucity of knowledge on the molecular mechanisms of rubber biosynthesis in high yield traits still persists. Here we report the comprehensive genome-wide analysis of the widely planted H. brasiliensis clone, RRIM 600. The genome was assembled based on ~155-fold combined coverage with Illumina and PacBio sequence data and has a total length of 1.55 Gb with 72.5% comprising repetitive DNA sequences. A total of 84,440 high-confidence protein-coding genes were predicted. Comparative genomic analysis revealed strong synteny between H. brasiliensis and other Euphorbiaceae genomes. Our data suggest that H. brasiliensis’s capacity to produce high levels of latex can be attributed to the expansion of rubber biosynthesis-related genes in its genome and the high expression of these genes in latex. Using cap analysis gene expression data, we illustrate the tissue-specific transcription profiles of rubber biosynthesis-related genes, revealing alternative means of transcriptional regulation. Our study adds to the understanding of H. brasiliensis biology and provides valuable genomic resources for future agronomic-related improvement of the rubber tree. PMID:27339202
Li, Guoqing; Shao, Jinhui; Liu, Cong; Lu, Jun; Zhao, Xiaodong
2017-01-01
Alternative polyadenylation (APA) plays an important role in regulation of gene expression and is involved in many biological processes. As eukaryotic cells receive a variety of external signals, genes produce diverse transcriptional isoforms and exhibit different translation efficiency. The traditional Chinese medicine (TCM) Jinfukang (JFK) has been effectively used for lung cancer treatment. In this study, we investigated whether JFK exerts its antitumor effect by modulating APA patterns in lung cancer cells. We performed a genome-wide APA site profiling analysis in JFK-treated A549 lung cancer cells with the 3T-seq approach that we reported previously. Compared with untreated A549 cells, in JFK-treated A549 cells we observed APA-mediated 3′ UTR alterations in 310 genes, including 77 genes with shortened 3′ UTRs. In particular, we identified TMEM123, a gene involved in oncotic cell death, which produced transcripts with shortened 3′ UTR and thus was upregulated upon JFK treatment. Taken together, our studies suggest that APA might be one of the antitumor mechanisms of JFK and provide a new insight for the understanding of TCM against cancer. PMID:29234412
Kou, Yao; Li, Guoqing; Shao, Jinhui; Liu, Cong; Wu, Jun; Lu, Jun; Zhao, Xiaodong; Tian, Jing
2017-01-01
Alternative polyadenylation (APA) plays an important role in regulation of gene expression and is involved in many biological processes. As eukaryotic cells receive a variety of external signals, genes produce diverse transcriptional isoforms and exhibit different translation efficiency. The traditional Chinese medicine (TCM) Jinfukang (JFK) has been effectively used for lung cancer treatment. In this study, we investigated whether JFK exerts its antitumor effect by modulating APA patterns in lung cancer cells. We performed a genome-wide APA site profiling analysis in JFK-treated A549 lung cancer cells with the 3T-seq approach that we reported previously. Compared with untreated A549 cells, in JFK-treated A549 cells we observed APA-mediated 3' UTR alterations in 310 genes, including 77 genes with shortened 3' UTRs. In particular, we identified TMEM123, a gene involved in oncotic cell death, which produced transcripts with shortened 3' UTR and thus was upregulated upon JFK treatment. Taken together, our studies suggest that APA might be one of the antitumor mechanisms of JFK and provide a new insight for the understanding of TCM against cancer.
Vitamin D receptor signaling and its therapeutic implications: Genome-wide and structural view.
Carlberg, Carsten; Molnár, Ferdinand
2015-05-01
Vitamin D3 is one of the few natural compounds that has, via its metabolite 1α,25-dihydroxyvitamin D3 (1,25(OH)2D3) and the transcription factor vitamin D receptor (VDR), a direct effect on gene regulation. For efficiently applying the therapeutic and disease-preventing potential of 1,25(OH)2D3 and its synthetic analogs, the key steps in vitamin D signaling need to be understood. These are the different types of molecular interactions with the VDR, such as (i) the complex formation of VDR with genomic DNA, (ii) the interaction of VDR with its partner transcription factors, (iii) the binding of 1,25(OH)2D3 or its synthetic analogs within the ligand-binding pocket of the VDR, and (iv) the resulting conformational change on the surface of the VDR leading to a change of the protein-protein interaction profile of the receptor with other proteins. This review will present the latest genome-wide insight into vitamin D signaling, and will discuss its therapeutic implications.
Genome-Wide Profiling of DNA Double-Strand Breaks by the BLESS and BLISS Methods.
Mirzazadeh, Reza; Kallas, Tomasz; Bienko, Magda; Crosetto, Nicola
2018-01-01
DNA double-strand breaks (DSBs) are major DNA lesions that are constantly formed during physiological processes such as DNA replication, transcription, and recombination, or as a result of exogenous agents such as ionizing radiation, radiomimetic drugs, and genome editing nucleases. Unrepaired DSBs threaten genomic stability by leading to the formation of potentially oncogenic rearrangements such as translocations. In the past few years, several methods based on next-generation sequencing (NGS) have been developed to study the genome-wide distribution of DSBs or their conversion to translocation events. We developed Breaks Labeling, Enrichment on Streptavidin, and Sequencing (BLESS), which was the first method for direct labeling of DSBs in situ followed by their genome-wide mapping at nucleotide resolution (Crosetto et al., Nat Methods 10:361-365, 2013). Recently, we have further expanded the quantitative nature, applicability, and scalability of BLESS by developing Breaks Labeling In Situ and Sequencing (BLISS) (Yan et al., Nat Commun 8:15058, 2017). Here, we first present an overview of existing methods for genome-wide localization of DSBs, and then focus on the BLESS and BLISS methods, discussing different assay design options depending on the sample type and application.
USDA-ARS's Scientific Manuscript database
Transcription initiation, essential to gene expression regulation, involves recruitment of basal transcription factors to the core promoter elements (CPEs). The distribution of currently known CPEs across plant genomes is largely unknown. This is the first large scale genome-wide report on the compu...
Tomlinson, Gillian S.; Booth, Helen; Petit, Sarah J.; Potton, Elspeth; Towers, Greg J.; Miller, Robert F.; Chain, Benjamin M.; Noursadeghi, Mahdad
2012-01-01
Alveolar macrophages (AM) are thought to have a key role in the immunopathogenesis of respiratory diseases. We sought to test the hypothesis that human AM exhibit an anti-inflammatory bias by making genome-wide comparisons with monocyte derived macrophages (MDM). Adherent AM obtained by bronchoalveolar lavage of patients under investigation for haemoptysis, but found to have no respiratory pathology, were compared to MDM from healthy volunteers by whole genome transcriptional profiling before and after innate immune stimulation. We found that freshly isolated AM exhibited a marked pro-inflammatory transcriptional signature. High levels of basal pro-inflammatory gene expression gave the impression of attenuated responses to lipopolysaccharide (LPS) and the RNA analogue, poly IC, but in rested cells pro-inflammatory gene expression declined and transcriptional responsiveness to these stimuli was restored. In comparison to MDM, both freshly isolated and rested AM showed upregulation of MHC class II molecules. In most experimental paradigms ex vivo adherent AM are used immediately after isolation. Therefore, the confounding effects of their pro-inflammatory profile at baseline need careful consideration. Moreover, despite the prevailing view that AM have an anti-inflammatory bias, our data clearly show that they can adopt a striking pro-inflammatory phenotype, and may have greater capacity for presentation of exogenous antigens than MDM. PMID:22768282
Bloom, Chloe I.; Graham, Christine M.; Berry, Matthew P. R.; Rozakeas, Fotini; Redford, Paul S.; Wang, Yuanyuan; Xu, Zhaohui; Wilkinson, Katalin A.; Wilkinson, Robert J.; Kendrick, Yvonne; Devouassoux, Gilles; Ferry, Tristan; Miyara, Makoto; Bouvry, Diane; Dominique, Valeyre; Gorochov, Guy; Blankenship, Derek; Saadatian, Mitra; Vanhems, Phillip; Beynon, Huw; Vancheeswaran, Rama; Wickremasinghe, Melissa; Chaussabel, Damien; Banchereau, Jacques; Pascual, Virginia; Ho, Ling-pei; Lipman, Marc; O’Garra, Anne
2013-01-01
Rationale New approaches to define factors underlying the immunopathogenesis of pulmonary diseases including sarcoidosis and tuberculosis are needed to develop new treatments and biomarkers. Comparing the blood transcriptional response of tuberculosis to other similar pulmonary diseases will advance knowledge of disease pathways and help distinguish diseases with similar clinical presentations. Objectives To determine the factors underlying the immunopathogenesis of the granulomatous diseases, sarcoidosis and tuberculosis, by comparing the blood transcriptional responses in these and other pulmonary diseases. Methods We compared whole blood genome-wide transcriptional profiles in pulmonary sarcoidosis and pulmonary tuberculosis with those in community-acquired pneumonia, primary lung cancer and healthy controls, before and after treatment, and in purified leucocyte populations. Measurements and Main Results An interferon-inducible neutrophil-driven blood transcriptional signature was present in both sarcoidosis and tuberculosis, with a higher abundance and expression in tuberculosis. Heterogeneity of the sarcoidosis signature correlated significantly with disease activity. Transcriptional profiles in pneumonia and lung cancer revealed an over-abundance of inflammatory transcripts. After successful treatment the transcriptional activity in tuberculosis and pneumonia patients was significantly reduced. However, the glucocorticoid-responsive sarcoidosis patients showed a significant increase in transcriptional activity. A set of 144 blood transcripts was able to distinguish tuberculosis from other lung diseases and controls. Conclusions Tuberculosis and sarcoidosis revealed similar blood transcriptional profiles, dominated by interferon-inducible transcripts, while pneumonia and lung cancer showed distinct signatures, dominated by inflammatory genes.
There were also significant differences between tuberculosis and sarcoidosis in the degree of their transcriptional activity, the heterogeneity of their profiles and their transcriptional response to treatment. PMID:23940611
Singh, Anil Kumar; Sharma, Vishal; Pal, Awadhesh Kumar; Acharya, Vishal; Ahuja, Paramvir Singh
2013-08-01
NAC [no apical meristem (NAM), Arabidopsis thaliana transcription activation factor (ATAF1/2) and cup-shaped cotyledon (CUC2)] proteins belong to one of the largest plant-specific transcription factor (TF) families and play important roles in plant development processes, response to biotic and abiotic cues and hormone signalling. Our genome-wide analysis identified 110 StNAC genes in potato encoding 136 proteins, including 14 membrane-bound TFs. The physical map positions of StNAC genes on 12 potato chromosomes were non-random, and 40 genes were found to be distributed in 16 clusters. The StNAC proteins were phylogenetically clustered into 12 subgroups. Phylogenetic analysis of StNACs along with their Arabidopsis and rice counterparts divided these proteins into 18 subgroups. Our comparative analysis has also identified 36 putative TNAC proteins, which appear to be restricted to the Solanaceae family. In silico expression analysis, using Illumina RNA-seq transcriptome data, revealed tissue-specific and biotic stress-, abiotic stress- and hormone-responsive expression profiles of StNAC genes. Several StNAC genes, including StNAC072 and StNAC101, which are orthologs of the known stress-responsive Arabidopsis RESPONSIVE TO DEHYDRATION 26 (RD26), were identified as highly responsive to abiotic stress. Quantitative real-time polymerase chain reaction analysis largely corroborated the expression profile of StNAC genes as revealed by the RNA-seq data. Taken together, this analysis points to putative functions of several StNAC TFs and provides a blueprint for their functional characterization and utilization in potato improvement.
2013-01-01
Background The brown planthopper (Nilaparvata lugens) is one of the most serious rice plant pests in Asia. N. lugens causes extensive rice damage by sucking rice phloem sap, which results in stunted plant growth and the transmission of plant viruses. Despite the importance of this insect pest, little is known about the immunological mechanisms occurring in this hemimetabolous insect species. Results In this study, we performed a genome- and transcriptome-wide analysis aiming at the immune-related genes. The transcriptome datasets include the N. lugens intestine, the developmental stage, wing formation, and sex-specific expression information that provided useful gene expression sequence data for the genome-wide analysis. As a result, we identified a large number of genes encoding N. lugens pattern recognition proteins, modulation proteins in the prophenoloxidase (proPO) activating cascade, immune effectors, and the signal transduction molecules involved in the immune pathways, including the Toll, Immune deficiency (Imd) and Janus kinase signal transducers and activators of transcription (JAK-STAT) pathways. The genome scale analysis revealed detailed information of the gene structure, distribution and transcription orientations in scaffolds. A comparison of the genome-available hemimetabolous and metabolous insect species indicate the differences in the immune-related gene constitution. We investigated the gene expression profiles with regards to how they responded to bacterial infections and tissue, as well as development and sex expression specificity. Conclusions The genome- and transcriptome-wide analysis of immune-related genes including pattern recognition and modulation molecules, immune effectors, and the signal transduction molecules involved in the immune pathways is an important step in determining the overall architecture and functional network of the immune components in N. lugens. 
Our findings provide the comprehensive gene sequence resource and expression profiles of the immune-related genes of N. lugens, which could facilitate the understanding of the innate immune mechanisms in the hemimetabolous insect species. These data give insight into clarifying the potential functional roles of the immune-related genes involved in the biological processes of development, reproduction, and virus transmission in N. lugens. PMID:23497397
Chai, Wenbo; Si, Weina; Ji, Wei; Qin, Qianqian; Zhao, Manli; Jiang, Haiyang
2018-01-01
HD-Zip proteins represent the major transcription factors in higher plants, playing essential roles in plant development and stress responses. Foxtail millet is a crop used to investigate the systems biology of millet and biofuel grasses, yet the HD-Zip gene family has not been studied in this species. To investigate the expression profile of the HD-Zip gene family in foxtail millet, we conducted a comprehensive genome-wide expression analysis. We found 47 protein-encoding genes in foxtail millet using BLAST search tools; the putative proteins were classified into four subfamilies, namely, subfamilies I, II, III, and IV. Gene structure and motif analysis indicate that the genes within each subfamily were conserved. Promoter analysis showed that HD-Zip genes were involved in abiotic stress. Duplication analysis revealed that 8 (~17%) hdz genes were tandemly duplicated and 28 (58%) were segmentally duplicated; purifying duplication plays important roles in gene expansion. Microsynteny analysis revealed the closest relationships between foxtail millet and sorghum and between foxtail millet and rice. Expression profiling under the abiotic stresses of drought and high salinity and under ABA treatment revealed that some genes regulated responses to drought and salinity stresses via an ABA-dependent process, especially sihdz29 and sihdz45. Our study provides new insight into evolutionary and functional analyses of HD-Zip genes involved in environmental stress responses in foxtail millet.
Haralambieva, Iana H.; Oberg, Ann L.; Ovsyannikova, Inna G.; Kennedy, Richard B.; Grill, Diane E.; Middha, Sumit; Bot, Brian M.; Wang, Vivian W.; Smith, David I.; Jacobson, Robert M.; Poland, Gregory A.
2013-01-01
Immune responses to current rubella vaccines demonstrate significant inter-individual variability. We performed mRNA-Seq profiling on PBMCs from high and low antibody responders to rubella vaccination to delineate transcriptional differences upon viral stimulation. Generalized linear models were used to assess the per gene fold change (FC) for stimulated versus unstimulated samples or the interaction between outcome and stimulation. Model results were evaluated by both FC and p-value. Pathway analysis and self-contained gene set tests were performed for assessment of gene group effects. Of 17,566 detected genes, we identified 1,080 highly significant differentially expressed genes upon viral stimulation (p<1.00E−15, FDR<1.00E−14), including various immune function and inflammation-related genes, genes involved in cell signaling, cell regulation and transcription, and genes with unknown function. Analysis by immune outcome and stimulation status identified 27 genes (p≤0.0006 and FDR≤0.30) that responded differently to viral stimulation in high vs. low antibody responders, including major histocompatibility complex (MHC) class I genes (HLA-A, HLA-B and B2M with p = 0.0001, p = 0.0005 and p = 0.0002, respectively), and two genes related to innate immunity and inflammation (EMR3 and MEFV with p = 1.46E−08 and p = 0.0004, respectively). Pathway and gene set analysis also revealed transcriptional differences in antigen presentation and innate/inflammatory gene sets and pathways between high and low responders. Using mRNA-Seq genome-wide transcriptional profiling, we identified antigen presentation and innate/inflammatory genes that may assist in explaining rubella vaccine-induced immune response variations. Such information may provide new scientific insights into vaccine-induced immunity useful in rational vaccine development and immune response monitoring. PMID:23658707
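The abstract above filters genes by both p-value and false discovery rate (FDR). As a minimal sketch of how an FDR adjustment can be computed, the Python below applies the Benjamini-Hochberg step-up procedure to a list of p-values. This is an assumption for illustration only: the abstract does not state which FDR method the authors used, and the function name `benjamini_hochberg` and the toy p-values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values, in the input order.

    Assumption: BH is used here purely as an illustration; the source
    abstract does not name its FDR procedure.
    """
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical per-gene p-values (not taken from the study).
toy_pvalues = [0.01, 0.04, 0.03, 0.005]
toy_fdr = benjamini_hochberg(toy_pvalues)
```

A gene would then be called significant when its adjusted value falls below the chosen FDR cutoff, alongside whatever fold-change criterion is applied.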
Khan, Meraj A; Sengupta, Jayasree; Mittal, Suneeta; Ghosh, Debabrata
2012-09-24
To gain insight into the pathophysiology of endometriosis, genome-wide expression analyses of eutopic and ectopic endometrium have been reported previously; however, the effects of stages of severity and phases of the menstrual cycle on expression profiles have not been examined. The effect of genetic heterogeneity and fertility history on transcriptional activity was also not considered. In the present study, a genome-wide expression analysis of autologous, paired eutopic and ectopic endometrial samples obtained from fertile women (n=18) suffering from moderate (stage 3; n=8) or severe (stage 4; n=10) ovarian endometriosis during proliferative (n=13) and secretory (n=5) phases of the menstrual cycle was performed. Individual pure RNA samples were subjected to Agilent's Whole Human Genome 44K microarray experiments. Microarray data were validated (P<0.01) by estimating transcript copy numbers by performing real time RT-PCR of seven (7) arbitrarily selected genes in all samples. The data obtained were subjected to differential expression (DE) and differential co-expression (DC) analyses followed by networks and enrichment analysis, and gene set enrichment analysis (GSEA). The reproducibility of prediction based on GSEA implementation of DC results was assessed by examining the relative expressions of twenty eight (28) selected genes in RNA samples obtained from a fresh pool of eutopic and ectopic samples from confirmed ovarian endometriosis patients with stages 3 and 4 (n=4/each) during proliferative and secretory (n=4/each) phases. A higher clustering effect of pairing (cluster distance, cd=0.1) in samples from the same individuals on expression arrays among eutopic and ectopic samples was observed compared with that of clinical stages of severity (cd=0.5) and phases of the menstrual cycle (cd=0.6).
Post hoc analysis revealed anomalies in the expression profiles of several genes associated with immunological, neurocrine and endocrine functions and gynecological cancers, although with no overt oncogenic potential in endometriotic tissue. Dysregulation of three major transcription factors (CLOCK, ESR1, and MYC) appeared to be a significant causative factor in the pathogenesis of ovarian endometriosis. A novel cohort of twenty-eight (28) genes representing potential markers for ovarian endometriosis in fertile women was discovered. Dysfunctional expression of immuno-neuro-endocrine behaviour in endometrium appeared critical to endometriosis. Although no overt oncogenic potential was evident, several genes associated with gynecological cancers were observed to be high in the expression profiles of endometriotic tissue.
Sex-specific hippocampal 5-hydroxymethylcytosine is disrupted in response to acute stress
Papale, Ligia A.; Li, Sisi; Madrid, Andy; Zhang, Qi; Chen, Li; Chopra, Pankaj; Jin, Peng; Keleş, Sündüz; Alisch, Reid S.
2016-01-01
Environmental stress is among the most important contributors to increased susceptibility to develop psychiatric disorders. While it is well known that acute environmental stress alters gene expression, the molecular mechanisms underlying these changes remain largely unknown. 5-hydroxymethylcytosine (5hmC) is a novel environmentally sensitive epigenetic modification that is highly enriched in neurons and is associated with active neuronal transcription. Recently, we reported a genome-wide disruption of hippocampal 5hmC in male mice following acute stress that was correlated to altered transcript levels of genes in known stress related pathways. Since sex-specific endocrine mechanisms respond to environmental stimulus by altering the neuronal epigenome, we examined the genome-wide profile of hippocampal 5hmC in female mice following exposure to acute stress and identified 363 differentially hydroxymethylated regions (DhMRs) linked to known (e.g., Nr3c1 and Ntrk2) and potentially novel genes associated with stress response and psychiatric disorders. Integration of hippocampal expression data from the same female mice found stress-related hydroxymethylation correlated to altered transcript levels. Finally, characterization of stress-induced sex-specific 5hmC profiles in the hippocampus revealed 778 sex-specific acute stress-induced DhMRs some of which were correlated to altered transcript levels that produce sex-specific isoforms in response to stress. Together, the alterations in 5hmC presented here provide a possible molecular mechanism for the adaptive sex-specific response to stress that may augment the design of novel therapeutic agents that will have optimal effectiveness in each sex. PMID:27576189
A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing.
Chen, Shi-Yi; Deng, Feilong; Jia, Xianbo; Li, Cao; Lai, Song-Jia
2017-08-09
It is widely acknowledged that transcriptional diversity largely contributes to biological regulation in eukaryotes. Since the advent of second-generation sequencing technologies, a large number of RNA sequencing studies have considerably improved our understanding of transcriptome complexity. However, obtaining full-length transcripts remains a major challenge because of the difficulties of short-read-based assembly. In the present study we employ PacBio single-molecule long-read sequencing technology for whole-transcriptome profiling in rabbit (Oryctolagus cuniculus). In total, we obtain 36,186 high-confidence transcripts from 14,474 genic loci, among which more than 23% of genic loci and 66% of isoforms have not yet been annotated within the current reference genome. Furthermore, about 17% of transcripts are computationally revealed to be non-coding RNAs. In addition, 24,797 alternative splicing (AS) and 11,184 alternative polyadenylation (APA) events are detected within this de novo constructed transcriptome. The results provide a comprehensive set of reference transcripts and hence contribute to the improved annotation of the rabbit genome.
Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Lu, Kun; Xu, Xinfu; Wang, Rui; Li, Jiana; Qu, Cunmin
2017-10-24
The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B. napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B. napus and its parental lines and for molecular breeding studies of bZIP genes in B. napus.
Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Xu, Xinfu; Wang, Rui; Li, Jiana
2017-01-01
The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B. napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B. napus and its parental lines and for molecular breeding studies of bZIP genes in B. napus. PMID:29064393
High-Throughput Sequencing Reveals Principles of Adeno-Associated Virus Serotype 2 Integration
Janovitz, Tyler; Klein, Isaac A.; Oliveira, Thiago; Mukherjee, Piali; Nussenzweig, Michel C.; Sadelain, Michel
2013-01-01
Viral integrations are important in human biology, yet genome-wide integration profiles have not been determined for many viruses. Adeno-associated virus (AAV) infects most of the human population and is a prevalent gene therapy vector. AAV integrates into the human genome with preference for a single locus, termed AAVS1. However, the genome-wide integration of AAV has not been defined, and the principles underlying this recombination remain unclear. Using a novel high-throughput approach, integrant capture sequencing, nearly 12 million AAV junctions were recovered from a human cell line, providing five orders of magnitude more data than were previously available. Forty-five percent of integrations occurred near AAVS1, and several thousand novel integration hotspots were identified computationally. Most of these occurred in genes, with dozens of hotspots targeting known oncogenes. Viral replication protein binding sites (RBS) and transcriptional activity were major factors favoring integration. In a first for eukaryotic viruses, the data reveal a unique asymmetric integration profile with distinctive directional orientation of viral genomes. These studies provide a new understanding of AAV integration biology through the use of unbiased high-throughput data acquisition and bioinformatics. PMID:23720718
Yan, Bin; Yang, Xinping; Lee, Tin-Lap; Friedman, Jay; Tang, Jun; Van Waes, Carter; Chen, Zhong
2007-01-01
Background Differentially expressed gene profiles have previously been observed among pathologically defined cancers by microarray technologies, including head and neck squamous cell carcinomas (HNSCCs). However, the molecular expression signatures and transcriptional regulatory controls that underlie the heterogeneity in HNSCCs are not well defined. Results Genome-wide cDNA microarray profiling of ten HNSCC cell lines revealed novel gene expression signatures that distinguished cancer cell subsets associated with p53 status. Three major clusters of over-expressed genes (A to C) were defined through hierarchical clustering, Gene Ontology, and statistical modeling. The promoters of genes in these clusters exhibited different patterns and prevalence of transcription factor binding sites for p53, nuclear factor-κB (NF-κB), activator protein (AP)-1, signal transducer and activator of transcription (STAT)3 and early growth response (EGR)1, as compared with the frequency in vertebrate promoters. Cluster A genes involved in chromatin structure and function exhibited enrichment for p53 and decreased AP-1 binding sites, whereas clusters B and C, containing cytokine and antiapoptotic genes, exhibited a significant increase in prevalence of NF-κB binding sites. An increase in STAT3 and EGR1 binding sites was distributed among the over-expressed clusters. Novel regulatory modules containing p53 or NF-κB concomitant with other transcription factor binding motifs were identified, and experimental data supported the predicted transcriptional regulation and binding activity. Conclusion The transcription factors p53, NF-κB, and AP-1 may be important determinants of the heterogeneous pattern of gene expression, whereas STAT3 and EGR1 may broadly enhance gene expression in HNSCCs. 
Defining these novel gene signatures and regulatory mechanisms will be important for establishing new molecular classifications and subtyping, which in turn will promote development of targeted therapeutics for HNSCC. PMID:17498291
Genome-wide profiling of DNA-binding proteins using barcode-based multiplex Solexa sequencing.
Raghav, Sunil Kumar; Deplancke, Bart
2012-01-01
Chromatin immunoprecipitation (ChIP) is a commonly used technique to detect the in vivo binding of proteins to DNA. ChIP is now routinely paired to microarray analysis (ChIP-chip) or next-generation sequencing (ChIP-Seq) to profile the DNA occupancy of proteins of interest on a genome-wide level. Because ChIP-chip introduces several biases, most notably due to the use of a fixed number of probes, ChIP-Seq has quickly become the method of choice as, depending on the sequencing depth, it is more sensitive, quantitative, and provides a greater binding site location resolution. With the ever increasing number of reads that can be generated per sequencing run, it has now become possible to analyze several samples simultaneously while maintaining sufficient sequence coverage, thus significantly reducing the cost per ChIP-Seq experiment. In this chapter, we provide a step-by-step guide on how to perform multiplexed ChIP-Seq analyses. As a proof-of-concept, we focus on the genome-wide profiling of RNA Polymerase II as measuring its DNA occupancy at different stages of any biological process can provide insights into the gene regulatory mechanisms involved. However, the protocol can also be used to perform multiplexed ChIP-Seq analyses of other DNA-binding proteins such as chromatin modifiers and transcription factors.
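Barcode-based multiplexing, as described above, relies on sorting pooled reads back into their samples before any per-sample analysis. The Python below is a minimal sketch of that demultiplexing step under stated assumptions: barcodes sit at the start of each read, all have the same length, and must match exactly. The function name `demultiplex`, the barcode sequences, and the sample names are hypothetical and are not taken from the chapter's protocol.

```python
from collections import defaultdict


def demultiplex(reads, barcode_to_sample, barcode_length):
    """Group reads by their leading barcode and trim the barcode off.

    Assumptions (not from the source): fixed-length 5' barcodes and
    exact matching, with no tolerance for sequencing errors.
    """
    by_sample = defaultdict(list)
    for read in reads:
        barcode = read[:barcode_length]
        insert = read[barcode_length:]
        sample = barcode_to_sample.get(barcode, "unassigned")
        # Unassigned reads are kept whole so they can be inspected later.
        by_sample[sample].append(insert if sample != "unassigned" else read)
    return dict(by_sample)


# Hypothetical barcodes and reads, for illustration only.
toy_barcodes = {"ACGT": "sample_1", "TGCA": "sample_2"}
toy_reads = ["ACGTAAAA", "TGCACCCC", "GGGGTTTT", "ACGTGGCC"]
toy_groups = demultiplex(toy_reads, toy_barcodes, barcode_length=4)
```

Real pipelines usually allow one mismatch per barcode and work on FASTQ records rather than bare strings; exact matching is used here only to keep the sketch short.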
Li, Caiqin; Wang, Yan; Ying, Peiyuan; Ma, Wuqiang; Li, Jianguo
2015-01-01
The high level of physiological fruitlet abscission in litchi (Litchi chinensis Sonn.) causes severe yield loss. Cell separation occurs at the fruit abscission zone (FAZ) and can be triggered by ethylene. However, the molecular events occurring in the FAZ remain poorly understood. Here, we performed a genome-wide digital transcript abundance (DTA) analysis of putative fruit abscission-related genes regulated by ethephon in litchi. More than 81 million high quality reads from seven ethephon treated and untreated control libraries were obtained by high-throughput sequencing. Through DTA profile analysis in combination with Gene Ontology and KEGG pathway enrichment analyses, a total of 2730 statistically significant candidate genes were identified as involved in the ethephon-promoted litchi fruitlet abscission. Of these, there were 1867 early-responsive genes whose expression was up- or down-regulated from 0 to 1 d after treatment. The most affected genes included those related to ethylene biosynthesis and signaling, auxin transport and signaling, transcription factors (TFs), protein ubiquitination, ROS response, calcium signal transduction, and cell wall modification. These genes could be clustered into four groups and 13 subgroups according to their similar expression patterns. qRT-PCR of 41 selected candidate genes confirmed the accuracy of our DTA data. Ethephon treatment significantly increased fruit abscission and ethylene production of fruitlets. Possible molecular events controlling the ethephon-promoted litchi fruitlet abscission are proposed. The increased ethylene evolution in fruitlets would suppress the synthesis and polar transport of auxin and trigger abscission signaling. To the best of our knowledge, this is the first study to monitor the gene expression profile in the FAZ-enriched pedicel during ethephon-induced litchi fruit abscission at the genome-wide level.
This study will contribute to a better understanding of the molecular regulatory mechanism of fruit abscission in litchi. PMID:26217356
Xin, Haiping; Zhu, Wei; Wang, Lina; Xiang, Yue; Fang, Linchuan; Li, Jitao; Sun, Xiaoming; Wang, Nian; Londo, Jason P.; Li, Shaohua
2013-01-01
Grape is one of the most important fruit crops worldwide. The suitable geographical locations and productivity of grapes are largely limited by temperature. Vitis amurensis is a wild grapevine species with remarkable cold-tolerance, exceeding that of Vitis vinifera, the dominant cultivated species of grapevine. However, the molecular mechanisms that contribute to the enhanced freezing tolerance of V. amurensis remain unknown. Here we used deep sequencing data from restriction endonuclease-generated cDNA fragments to evaluate genome-wide changes in the transcriptome of V. amurensis under cold treatment. Vitis vinifera cv. Muscat of Hamburg was used as a control to help investigate the distinctive features of V. amurensis in responding to cold stress. Approximately 9 million tags were sequenced from non-cold treatment (NCT) and cold treatment (CT) cDNA libraries in each species of grapevine sampled from shoot apices. Alignment of tags to the V. vinifera cv. Pinot noir (PN40024) annotated genome identified over 15,000 transcripts in each library in V. amurensis and more than 16,000 in Muscat of Hamburg. Comparative analysis between NCT and CT libraries indicates that V. amurensis has fewer differentially expressed genes (DEGs, 1314 transcripts) than Muscat of Hamburg (2307 transcripts) when exposed to cold stress. Common DEGs (408 transcripts) suggest that some genes play fundamental roles during cold stress in grapes. The most robust DEGs (more than 20-fold change) also demonstrated significant differences between the two grapevines, indicating that cold stress may trigger species-specific pathways in V. amurensis. Functional categories of DEGs indicated that the proportion of up-regulated transcripts related to metabolism, transport, signal transduction and transcription was higher in V. amurensis. Several highly expressed transcripts that were found to accumulate uniquely in V. amurensis are discussed in detail.
This subset of unique candidate transcripts may contribute to the excellent cold-hardiness of V. amurensis. PMID:23516547
Global Identification and Characterization of Transcriptionally Active Regions in the Rice Genome
Stolc, Viktor; Deng, Wei; He, Hang; Korbel, Jan; Chen, Xuewei; Tongprasit, Waraporn; Ronald, Pamela; Chen, Runsheng; Gerstein, Mark; Wang Deng, Xing
2007-01-01
Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs) not encoded by annotated exons in the rice (Oryza sativa) subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83%) japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome. PMID:17372628
Best practices for mapping replication origins in eukaryotic chromosomes.
Besnard, Emilie; Desprat, Romain; Ryan, Michael; Kahli, Malik; Aladjem, Mirit I; Lemaitre, Jean-Marc
2014-09-02
Understanding the regulatory principles ensuring complete DNA replication in each cell division is critical for deciphering the mechanisms that maintain genomic stability. Recent advances in genome sequencing technology facilitated complete mapping of DNA replication sites and helped move the field from observing replication patterns at a handful of single loci to analyzing replication patterns genome-wide. These advances address issues, such as the relationship between replication initiation events, transcription, and chromatin modifications, and identify potential replication origin consensus sequences. This unit summarizes the technological and fundamental aspects of replication profiling and briefly discusses novel insights emerging from mining large datasets, published in the last 3 years, and also describes DNA replication dynamics on a whole-genome scale. Copyright © 2014 John Wiley & Sons, Inc.
Pindyurin, Alexey V
2017-01-01
A thorough study of the genome-wide binding patterns of chromatin proteins is essential for understanding the regulatory mechanisms of genomic processes in eukaryotic nuclei, including DNA replication, transcription, and repair. The DNA adenine methyltransferase identification (DamID) method is a powerful tool to identify genomic binding sites of chromatin proteins. This method does not require fixation of cells and the use of specific antibodies, and has been used to generate genome-wide binding maps of more than a hundred different proteins in Drosophila tissue culture cells. Recent versions of inducible DamID allow performing cell type-specific profiling of chromatin proteins even in small samples of Drosophila tissues that contain heterogeneous cell types. Importantly, with these methods sorting of cells of interest or their nuclei is not necessary as genomic DNA isolated from the whole tissue can be used as an input. Here, I describe in detail an FLP-inducible DamID method, namely generation of suitable transgenic flies, activation of the Dam transgenes by the FLP recombinase, isolation of DNA from small amounts of dissected tissues, and subsequent identification of the DNA binding sites of the chromatin proteins.
Lee, Soohyun; Seo, Chae Hwa; Alver, Burak Han; Lee, Sanghyuk; Park, Peter J
2015-09-03
RNA-seq has been widely used for genome-wide expression profiling. RNA-seq data typically consists of tens of millions of short sequenced reads from different transcripts. However, due to sequence similarity among genes and among isoforms, the source of a given read is often ambiguous. Existing approaches for estimating expression levels from RNA-seq reads tend to compromise between accuracy and computational cost. We introduce a new approach for quantifying transcript abundance from RNA-seq data. EMSAR (Estimation by Mappability-based Segmentation And Reclustering) groups reads according to the set of transcripts to which they are mapped and finds maximum likelihood estimates using a joint Poisson model for each optimal set of segments of transcripts. The method uses nearly all mapped reads, including those mapped to multiple genes. With an efficient transcriptome indexing based on modified suffix arrays, EMSAR minimizes the use of CPU time and memory while achieving accuracy comparable to the best existing methods. EMSAR is a method for quantifying transcripts from RNA-seq data with high accuracy and low computational cost. EMSAR is available at https://github.com/parklab/emsar.
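The read-grouping step described in the abstract above can be illustrated with a minimal Python sketch. This is not EMSAR's implementation: the function name, the input format (a dictionary from read ID to the transcripts the read maps to), and the toy data are all hypothetical, and the sketch covers only the grouping of reads by their transcript set, not the segmentation or the joint Poisson maximum likelihood estimation.

```python
from collections import Counter


def group_reads_by_transcript_set(alignments):
    """Count reads per distinct set of transcripts they map to.

    alignments: dict mapping read id -> iterable of transcript ids
    (hypothetical input format, assumed for illustration only).
    Reads with no mapped transcript are skipped.
    """
    counts = Counter()
    for transcripts in alignments.values():
        key = frozenset(transcripts)  # order of hits does not matter
        if key:
            counts[key] += 1
    return counts


# Toy data: r2 and r3 are ambiguous between tA and tB; r5 is unmapped.
toy_alignments = {
    "r1": ["tA"],
    "r2": ["tA", "tB"],
    "r3": ["tB", "tA"],
    "r4": ["tC"],
    "r5": [],
}
groups = group_reads_by_transcript_set(toy_alignments)
for transcript_set, n_reads in groups.items():
    print(sorted(transcript_set), n_reads)
```

Keeping multi-mapped reads in their own groups, rather than discarding them, is what lets a downstream model use nearly all mapped reads, as the abstract notes.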
Care, Matthew A.; Cocco, Mario; Laye, Jon P.; Barnes, Nicholas; Huang, Yuanxue; Wang, Ming; Barrans, Sharon; Du, Ming; Jack, Andrew; Westhead, David R.; Doody, Gina M.; Tooze, Reuben M.
2014-01-01
Interferon regulatory factor 4 (IRF4) is central to the transcriptional network of activated B-cell-like diffuse large B-cell lymphoma (ABC-DLBCL), an aggressive lymphoma subgroup defined by gene expression profiling. Since cofactor association modifies transcriptional regulatory input by IRF4, we assessed genome occupancy by IRF4 and endogenous cofactors in ABC-DLBCL cell lines. IRF4 partners with SPIB, PU.1 and BATF genome-wide, but SPIB provides the dominant IRF4 partner in this context. Upon SPIB knockdown IRF4 occupancy is depleted and neither PU.1 nor BATF acutely compensates. Integration with ENCODE data from lymphoblastoid cell line GM12878, demonstrates that IRF4 adopts either SPIB- or BATF-centric genome-wide distributions in related states of post-germinal centre B-cell transformation. In primary DLBCL high-SPIB and low-BATF or the reciprocal low-SPIB and high-BATF mRNA expression links to differential gene expression profiles across nine data sets, identifying distinct associations with SPIB occupancy, signatures of B-cell differentiation stage and potential pathogenetic mechanisms. In a population-based patient cohort, SPIBhigh/BATFlow-ABC-DLBCL is enriched for mutation of MYD88, and SPIBhigh/BATFlow-ABC-DLBCL with MYD88-L265P mutation identifies a small subgroup of patients among this otherwise aggressive disease subgroup with distinct favourable outcome. We conclude that differential expression of IRF4 cofactors SPIB and BATF identifies biologically and clinically significant heterogeneity among ABC-DLBCL. PMID:24875472
Weissgerber, Thomas; Dobler, Nadine; Polen, Tino; Latus, Jeanette; Stockdreher, Yvonne
2013-01-01
The purple sulfur bacterium Allochromatium vinosum DSM 180T is one of the best-studied sulfur-oxidizing anoxygenic phototrophic bacteria, and it has been developed into a model organism for laboratory-based studies of oxidative sulfur metabolism. Here, we took advantage of the organism's high metabolic versatility and performed whole-genome transcriptional profiling to investigate the response of A. vinosum cells upon exposure to sulfide, thiosulfate, elemental sulfur, or sulfite compared to photoorganoheterotrophic growth on malate. Differential expression of 1,178 genes was observed, corresponding to 30% of the A. vinosum genome. Relative transcription of 551 genes increased significantly during growth on one of the different sulfur sources, while the relative transcript abundance of 627 genes decreased. A significant number of genes that revealed strongly enhanced relative transcription levels have documented sulfur metabolism-related functions. Among these are the dsr genes, including dsrAB for dissimilatory sulfite reductase, and the sgp genes for the proteins of the sulfur globule envelope, thus confirming former results. In addition, we identified new genes encoding proteins with appropriate subcellular localization and properties to participate in oxidative dissimilatory sulfur metabolism. Those four genes for hypothetical proteins that exhibited the strongest increases of mRNA levels on sulfide and elemental sulfur, respectively, were chosen for inactivation and phenotypic analyses of the respective mutant strains. This approach verified the importance of the encoded proteins for sulfur globule formation during the oxidation of sulfide and thiosulfate and thereby also documented the suitability of comparative transcriptomics for the identification of new sulfur-related genes in anoxygenic phototrophic sulfur bacteria. PMID:23873913
Dey-Rao, R; Sinha, A A
2015-03-01
Lupus erythematosus (LE) exhibits a variety of clinical manifestations with potentially wide-ranging multi-organ damage to joints, tendons, kidney, lung, heart, blood vessels, central nervous system and skin [1,2]. Systemic changes are likely to trigger organ-specific manifestations of the disease. Here, we provide the data examined to address the gap in knowledge regarding causes and mechanisms contributing to the autoimmune attack on skin in chronic cutaneous lupus erythematosus (CCLE). The raw gene expression data files (CEL files) are provided with this article [3].
Kuttippurathu, Lakshmi; Patra, Biswanath; Cook, Daniel; Hoek, Jan B.
2017-01-01
Chronic ethanol intake impairs liver regeneration through a system-wide alteration in the regulatory networks driving the response to injury. Our study focused on the initial phase of response to 2/3rd partial hepatectomy (PHx) to investigate how adaptation to chronic ethanol intake affects the genome-wide binding profiles of the transcription factors C/EBP-β and C/EBP-α. These factors participate in complementary and often opposing functions for maintaining cellular differentiation, regulating metabolism, and governing cell growth during liver regeneration. We analyzed ChIP-seq data with a comparative pattern count (COMPACT) analysis, which exhaustively enumerates temporal patterns of discretized binding profiles to identify dominant as well as subtle patterns that may not be apparent from conventional clustering analyses. We found that adaptation to chronic ethanol intake significantly alters the genome-wide binding profile of C/EBP-β and C/EBP-α before and following PHx. A subset of these ethanol-induced changes includes C/EBP-β binding to promoters of genes involved in the profibrogenic transforming growth factor-β pathway, and both C/EBP-β and C/EBP-α binding to promoters of genes involved in the cell cycle, apoptosis, homeostasis, and metabolic processes. The shift in C/EBP binding loci, coupled with an ethanol-induced increase in C/EBP-β binding at 6 h post-resection, indicates that ethanol adaptation may change both the amount and nature of C/EBP binding post-resection. Taken together, our results suggest that chronic ethanol consumption leads to a spatially and temporally reorganized activity at many genomic loci, resulting in a shift in the dynamic balance and coordination of cellular processes underlying regenerative response. PMID:27815535
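The idea of exhaustively enumerating temporal patterns of discretized binding profiles, mentioned in the abstract above, can be sketched in Python as follows. This is an illustrative sketch only and not the authors' COMPACT code: the three-level discretization (-1, 0, 1), the threshold of 1.0, the function names, and the toy profiles are all assumptions made for the example.

```python
from itertools import product


def discretize(profile, threshold=1.0):
    """Map each time-point value to 1 (up), -1 (down) or 0 (unchanged).

    The threshold and the three-level scheme are assumed, not taken
    from the source.
    """
    return tuple(
        1 if v >= threshold else -1 if v <= -threshold else 0
        for v in profile
    )


def pattern_counts(profiles, n_timepoints, threshold=1.0):
    """Count regions per temporal pattern, including empty patterns."""
    # Enumerate every possible pattern up front so that patterns with
    # zero regions are still reported.
    counts = {p: 0 for p in product((-1, 0, 1), repeat=n_timepoints)}
    for profile in profiles.values():
        counts[discretize(profile, threshold)] += 1
    return counts


# Toy binding changes (e.g. log ratios versus baseline) at two time points.
toy_profiles = {
    "peak1": [1.5, 0.2],
    "peak2": [1.2, -0.1],
    "peak3": [-2.0, 0.0],
}
counts = pattern_counts(toy_profiles, n_timepoints=2)
for pattern, n_regions in counts.items():
    if n_regions:
        print(pattern, n_regions)
```

Running the same count separately for each condition (for example, control and ethanol-adapted) and comparing the two tables pattern by pattern would be one way to look for shifts that clustering might miss.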
2013-01-01
Background MADS-domain transcription factors play important roles during plant development. The Arabidopsis MADS-box gene SHORT VEGETATIVE PHASE (SVP) is a key regulator of two developmental phases. It functions as a repressor of the floral transition during the vegetative phase and later it contributes to the specification of floral meristems. How these distinct activities are conferred by a single transcription factor is unclear, but interactions with other MADS domain proteins which specify binding to different genomic regions is likely one mechanism. Results To compare the genome-wide DNA binding profile of SVP during vegetative and reproductive development we performed ChIP-seq analyses. These ChIP-seq data were combined with tiling array expression analysis, induction experiments and qRT-PCR to identify biologically relevant binding sites. In addition, we compared genome-wide target genes of SVP with those published for the MADS domain transcription factors FLC and AP1, which interact with SVP during the vegetative and reproductive phases, respectively. Conclusions Our analyses resulted in the identification of pathways that are regulated by SVP including those controlling meristem development during vegetative growth and flower development whereas floral transition pathways and hormonal signaling were regulated predominantly during the vegetative phase. Thus, SVP regulates many developmental pathways, some of which are common to both of its developmental roles whereas others are specific to only one of them. PMID:23759218
Antisense transcription is pervasive but rarely conserved in enteric bacteria.
Raghavan, Rahul; Sloan, Daniel B; Ochman, Howard
2012-01-01
Noncoding RNAs, including antisense RNAs (asRNAs) that originate from the complementary strand of protein-coding genes, are involved in the regulation of gene expression in all domains of life. Recent application of deep-sequencing technologies has revealed that the transcription of asRNAs occurs genome-wide in bacteria. Although the role of the vast majority of asRNAs remains unknown, it is often assumed that their presence implies important regulatory functions, similar to those of other noncoding RNAs. Alternatively, many antisense transcripts may be produced by chance transcription events from promoter-like sequences that result from the degenerate nature of bacterial transcription factor binding sites. To investigate the biological relevance of antisense transcripts, we compared genome-wide patterns of asRNA expression in closely related enteric bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, by performing strand-specific transcriptome sequencing. Although antisense transcripts are abundant in both species, less than 3% of asRNAs are expressed at high levels in both species, and only about 14% appear to be conserved among species. And unlike the promoters of protein-coding genes, asRNA promoters show no evidence of sequence conservation between, or even within, species. Our findings suggest that many or even most bacterial asRNAs are nonadaptive by-products of the cell's transcription machinery. IMPORTANCE Application of high-throughput methods has revealed the expression throughout bacterial genomes of transcripts encoded on the strand complementary to protein-coding genes. Because transcription is costly, it is usually assumed that these transcripts, termed antisense RNAs (asRNAs), serve some function; however, the role of most asRNAs is unclear, raising questions about their relevance in cellular processes. 
Because natural selection conserves functional elements, comparisons between related species provide a method for assessing functionality genome-wide. Applying such an approach, we assayed all transcripts in two closely related bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, and demonstrate that, although the levels of genome-wide antisense transcription are similarly high in both bacteria, only a small fraction of asRNAs are shared across species. Moreover, the promoters associated with asRNAs show no evidence of sequence conservation between, or even within, species. These findings indicate that despite the genome-wide transcription of asRNAs, many of these transcripts are likely nonfunctional.
Dynamic regulation of VEGF-inducible genes by an ERK/ERG/p300 transcriptional network.
Fish, Jason E; Cantu Gutierrez, Manuel; Dang, Lan T; Khyzha, Nadiya; Chen, Zhiqi; Veitch, Shawn; Cheng, Henry S; Khor, Melvin; Antounians, Lina; Njock, Makon-Sébastien; Boudreau, Emilie; Herman, Alexander M; Rhyner, Alexander M; Ruiz, Oscar E; Eisenhoffer, George T; Medina-Rivera, Alejandra; Wilson, Michael D; Wythe, Joshua D
2017-07-01
The transcriptional pathways activated downstream of vascular endothelial growth factor (VEGF) signaling during angiogenesis remain incompletely characterized. By assessing the signals responsible for induction of the Notch ligand delta-like 4 (DLL4) in endothelial cells, we find that activation of the MAPK/ERK pathway mirrors the rapid and dynamic induction of DLL4 transcription and that this pathway is required for DLL4 expression. Furthermore, VEGF/ERK signaling induces phosphorylation and activation of the ETS transcription factor ERG, a prerequisite for DLL4 induction. Transcription of DLL4 coincides with dynamic ERG-dependent recruitment of the transcriptional co-activator p300. Genome-wide gene expression profiling identified a network of VEGF-responsive and ERG-dependent genes, and ERG chromatin immunoprecipitation (ChIP)-seq revealed the presence of conserved ERG-bound putative enhancer elements near these target genes. Functional experiments performed in vitro and in vivo confirm that this network of genes requires ERK, ERG and p300 activity. Finally, genome-editing and transgenic approaches demonstrate that a highly conserved ERG-bound enhancer located upstream of HLX (which encodes a transcription factor implicated in sprouting angiogenesis) is required for its VEGF-mediated induction. Collectively, these findings elucidate a novel transcriptional pathway contributing to VEGF-dependent angiogenesis. © 2017. Published by The Company of Biologists Ltd.
Kravatsky, Yuri V; Chechetkin, Vladimir R; Tchurikov, Nikolai A; Kravatskaya, Galina I
2015-02-01
The broad class of tasks in genetics and epigenetics can be reduced to the study of various features that are distributed over the genome (genome tracks). The rapid and efficient processing of the huge amount of data stored in the genome-scale databases cannot be achieved without the software packages based on the analytical criteria. However, strong inhomogeneity of genome tracks hampers the development of relevant statistics. We developed the criteria for the assessment of genome track inhomogeneity and correlations between two genome tracks. We also developed a software package, Genome Track Analyzer, based on this theory. The theory and software were tested on simulated data and were applied to the study of correlations between CpG islands and transcription start sites in the Homo sapiens genome, between profiles of protein-binding sites in chromosomes of Drosophila melanogaster, and between DNA double-strand breaks and histone marks in the H. sapiens genome. Significant correlations between transcription start sites on the forward and the reverse strands were observed in genomes of D. melanogaster, Caenorhabditis elegans, Mus musculus, H. sapiens, and Danio rerio. The observed correlations may be related to the regulation of gene expression in eukaryotes. Genome Track Analyzer is freely available at http://ancorr.eimb.ru/. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Hematopoietic transcriptional mechanisms: from locus-specific to genome-wide vantage points.
DeVilbiss, Andrew W; Sanalkumar, Rajendran; Johnson, Kirby D; Keles, Sunduz; Bresnick, Emery H
2014-08-01
Hematopoiesis is an exquisitely regulated process in which stem cells in the developing embryo and the adult generate progenitor cells that give rise to all blood lineages. Master regulatory transcription factors control hematopoiesis by integrating signals from the microenvironment and dynamically establishing and maintaining genetic networks. One of the most rudimentary aspects of cell type-specific transcription factor function, how they occupy a highly restricted cohort of cis-elements in chromatin, remains poorly understood. Transformative technologic advances involving the coupling of next-generation DNA sequencing technology with the chromatin immunoprecipitation assay (ChIP-seq) have enabled genome-wide mapping of factor occupancy patterns. However, formidable problems remain; notably, ChIP-seq analysis yields hundreds to thousands of chromatin sites occupied by a given transcription factor, and only a fraction of the sites appear to be endowed with critical, non-redundant function. It has become en vogue to map transcription factor occupancy patterns genome-wide, while using powerful statistical tools to establish correlations to inform biology and mechanisms. With the advent of revolutionary genome editing technologies, one can now reach beyond correlations to conduct definitive hypothesis testing. This review focuses on key discoveries that have emerged during the path from single loci to genome-wide analyses, specifically in the context of hematopoietic transcriptional mechanisms. Copyright © 2014 ISEH - International Society for Experimental Hematology. Published by Elsevier Inc. All rights reserved.
Nascent-Seq reveals novel features of mouse circadian transcriptional regulation
Menet, Jerome S; Rodriguez, Joseph; Abruzzi, Katharine C; Rosbash, Michael
2012-01-01
A substantial fraction of the metazoan transcriptome undergoes circadian oscillations in many cells and tissues. Based on the transcription feedback loops important for circadian timekeeping, it is commonly assumed that this mRNA cycling reflects widespread transcriptional regulation. To address this issue, we directly measured the circadian dynamics of mouse liver transcription using Nascent-Seq (genome-wide sequencing of nascent RNA). Although many genes are rhythmically transcribed, many rhythmic mRNAs manifest poor transcriptional rhythms, indicating a prominent contribution of post-transcriptional regulation to circadian mRNA expression. This analysis of rhythmic transcription also showed that the rhythmic DNA binding profile of the transcription factors CLOCK and BMAL1 does not determine the transcriptional phase of most target genes. This likely reflects gene-specific collaborations of CLK:BMAL1 with other transcription factors. These insights from Nascent-Seq indicate that it should have broad applicability to many other gene expression regulatory issues. DOI: http://dx.doi.org/10.7554/eLife.00011.001 PMID:23150795
2014-01-01
Background Imprinted genes have been extensively documented in eutherian mammals and found to exhibit significant interspecific variation in the suites of genes that are imprinted and in their regulation between tissues and developmental stages. Much less is known about imprinted loci in metatherian (marsupial) mammals, wherein studies have been limited to a small number of genes previously known to be imprinted in eutherians. We describe the first ab initio search for imprinted marsupial genes, in fibroblasts from the opossum, Monodelphis domestica, based on a genome-wide ChIP-seq strategy to identify promoters that are simultaneously marked by mutually exclusive, transcriptionally opposing histone modifications. Results We identified a novel imprinted gene (Meis1) and two additional monoallelically expressed genes, one of which (Cstb) showed allele-specific, but non-imprinted expression. Imprinted vs. allele-specific expression could not be resolved for the third monoallelically expressed gene (Rpl17). Transcriptionally opposing histone modifications H3K4me3, H3K9Ac, and H3K9me3 were found at the promoters of all three genes, but differential DNA methylation was not detected at CpG islands at any of these promoters. Conclusions In generating the first genome-wide histone modification profiles for a marsupial, we identified the first gene that is imprinted in a marsupial but not in eutherian mammals. This outcome demonstrates the practicality of an ab initio discovery strategy and implicates histone modification, but not differential DNA methylation, as a conserved mechanism for marking imprinted genes in all therian mammals. Our findings suggest that marsupials use multiple epigenetic mechanisms for imprinting and support the concept that lineage-specific selective forces can produce sets of imprinted genes that differ between metatherian and eutherian lines. PMID:24484454
Sex-specific hippocampal 5-hydroxymethylcytosine is disrupted in response to acute stress.
Papale, Ligia A; Li, Sisi; Madrid, Andy; Zhang, Qi; Chen, Li; Chopra, Pankaj; Jin, Peng; Keleş, Sündüz; Alisch, Reid S
2016-12-01
Environmental stress is among the most important contributors to increased susceptibility to develop psychiatric disorders. While it is well known that acute environmental stress alters gene expression, the molecular mechanisms underlying these changes remain largely unknown. 5-hydroxymethylcytosine (5hmC) is a novel environmentally sensitive epigenetic modification that is highly enriched in neurons and is associated with active neuronal transcription. Recently, we reported a genome-wide disruption of hippocampal 5hmC in male mice following acute stress that was correlated to altered transcript levels of genes in known stress related pathways. Since sex-specific endocrine mechanisms respond to environmental stimulus by altering the neuronal epigenome, we examined the genome-wide profile of hippocampal 5hmC in female mice following exposure to acute stress and identified 363 differentially hydroxymethylated regions (DhMRs) linked to known (e.g., Nr3c1 and Ntrk2) and potentially novel genes associated with stress response and psychiatric disorders. Integration of hippocampal expression data from the same female mice found stress-related hydroxymethylation correlated to altered transcript levels. Finally, characterization of stress-induced sex-specific 5hmC profiles in the hippocampus revealed 778 sex-specific acute stress-induced DhMRs some of which were correlated to altered transcript levels that produce sex-specific isoforms in response to stress. Together, the alterations in 5hmC presented here provide a possible molecular mechanism for the adaptive sex-specific response to stress that may augment the design of novel therapeutic agents that will have optimal effectiveness in each sex. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
New Era of Studying RNA Secondary Structure and Its Influence on Gene Regulation in Plants.
Yang, Xiaofei; Yang, Minglei; Deng, Hongjing; Ding, Yiliang
2018-01-01
The dynamic structure of RNA plays a central role in post-transcriptional regulation of gene expression such as RNA maturation, degradation, and translation. With the rise of next-generation sequencing, the study of RNA structure has been transformed from in vitro low-throughput RNA structure probing methods to in vivo high-throughput RNA structure profiling. The development of these methods enables incremental studies on the function of RNA structure to be performed, revealing new insights into novel regulatory mechanisms of RNA structure in plants. Genome-wide scale RNA structure profiling allows us to investigate general RNA structural features over tens of thousands of mRNAs and to compare RNA structuromes between plant species. Here, we provide a comprehensive and up-to-date overview of: (i) RNA structure probing methods; (ii) the biological functions of RNA structure; (iii) genome-wide RNA structural features corresponding to their regulatory mechanisms; and (iv) RNA structurome evolution in plants.
Charfeddine, Mariam; Saïdi, Mohamed Najib; Charfeddine, Safa; Hammami, Asma; Gargouri Bouzid, Radhia
2015-04-01
The ERF transcription factors belong to the AP2/ERF superfamily, one of the largest transcription factor families in plants. They play important roles in plant development processes, as well as in the response to biotic, abiotic, and hormone signaling. In the present study, 155 putative ERF transcription factor genes were identified from the potato (Solanum tuberosum) genome database, and compared with those from Arabidopsis thaliana. The StERF proteins are divided into ten phylogenetic groups. Expression analyses of five StERFs were carried out by semi-quantitative RT-PCR and compared with published RNA-seq data. These latter analyses were used to distinguish tissue-specific, biotic, and abiotic stress genes as well as hormone-responsive StERF genes. The results are of interest to better understand the role of the AP2/ERF genes in response to diverse types of stress in potatoes. A comprehensive analysis of the physiological functions and biological roles of the ERF family genes in S. tuberosum is required to understand crop stress tolerance mechanisms.
Wei, Yingying; Wu, George; Ji, Hongkai
2013-05-01
Mapping genome-wide binding sites of all transcription factors (TFs) in all biological contexts is a critical step toward understanding gene regulation. The state-of-the-art technologies for mapping transcription factor binding sites (TFBSs) couple chromatin immunoprecipitation (ChIP) with high-throughput sequencing (ChIP-seq) or tiling array hybridization (ChIP-chip). These technologies have limitations: they are low-throughput with respect to surveying many TFs. Recent advances in genome-wide chromatin profiling, including development of technologies such as DNase-seq, FAIRE-seq and ChIP-seq for histone modifications, make it possible to predict in vivo TFBSs by analyzing chromatin features at computationally determined DNA motif sites. This promising new approach may allow researchers to monitor the genome-wide binding sites of many TFs simultaneously. In this article, we discuss various experimental design and data analysis issues that arise when applying this approach. Through a systematic analysis of the data from the Encyclopedia Of DNA Elements (ENCODE) project, we compare the predictive power of individual and combinations of chromatin marks using supervised and unsupervised learning methods, and evaluate the value of integrating information from public ChIP and gene expression data. We also highlight the challenges and opportunities for developing novel analytical methods, such as resolving the one-motif-multiple-TF ambiguity and distinguishing functional and non-functional TF binding targets from the predicted binding sites. The online version of this article (doi:10.1007/s12561-012-9066-5) contains supplementary material, which is available to authorized users.
Transcriptional profiling of Medicago truncatula meristematic root cells
Holmes, Peta; Goffard, Nicolas; Weiller, Georg F; Rolfe, Barry G; Imin, Nijat
2008-01-01
Background The root apical meristem of the crop and model legume Medicago truncatula is a significantly different stem cell system from that of the widely studied model plant species Arabidopsis thaliana. In this study we used the Affymetrix Medicago GeneChip® to compare the transcriptomes of meristem and non-meristematic root to identify root meristem specific candidate genes. Results Using mRNA from root meristem and non-meristem we were able to identify 324 and 363 transcripts differentially expressed from the two regions. With bioinformatics tools developed to functionally annotate the Medicago genome array we could identify significant changes in metabolism, signalling and the differential expression of 55 transcription factors in meristematic and non-meristematic roots. Conclusion This is the first comprehensive analysis of M. truncatula root meristem cells using this genome array. These data will facilitate the mapping of regulatory and metabolic networks involved in the open root meristem of M. truncatula and provide candidates for functional analysis. PMID:18302802
Paul, Sujay
2017-06-01
MicroRNAs (miRNAs) are endogenous, short (~21-nucleotide), non-coding RNA molecules that play pivotal roles in plant growth, development, and stress response signaling. In this study, using the recently published draft genome sequence of the high-altitude plant maca (Lepidium meyenii Walp) and applying genome-wide computational approaches, a total of 62 potentially conserved miRNAs belonging to 28 families were identified, and four of them (lme-miR160a, lme-miR164c, lme-miR166a, and lme-miR319a) were further validated by RT-PCR. Using the psRNATarget tool, a total of 99 potential miRNA target transcripts were also identified in maca. Targets include a number of transcription factors such as Squamosa promoter-binding, NAC, MYB, auxin response factor, APETALA, WRKY, and F-box protein. To the best of my knowledge, this is the first genome-based miRNA profiling of a high-altitude plant.
Ulianov, Sergey V; Galitsyna, Aleksandra A; Flyamer, Ilya M; Golov, Arkadiy K; Khrameeva, Ekaterina E; Imakaev, Maxim V; Abdennur, Nezar A; Gelfand, Mikhail S; Gavrilov, Alexey A; Razin, Sergey V
2017-07-11
In homeotherms, the alpha-globin gene clusters are located within permanently open genome regions enriched in housekeeping genes. Terminal erythroid differentiation results in dramatic upregulation of alpha-globin genes making their expression comparable to the rRNA transcriptional output. Little is known about the influence of the erythroid-specific alpha-globin gene transcription outburst on adjacent, widely expressed genes and large-scale chromatin organization. Here, we have analyzed the total transcription output, the overall chromatin contact profile, and CTCF binding within the 2.7 Mb segment of chicken chromosome 14 harboring the alpha-globin gene cluster in cultured lymphoid cells and cultured erythroid cells before and after induction of terminal erythroid differentiation. We found that, similarly to mammalian genomes, the chicken genome is organized in TADs and compartments. Full activation of the alpha-globin gene transcription in differentiated erythroid cells is correlated with upregulation of several adjacent housekeeping genes and the emergence of abundant intergenic transcription. An extended chromosome region encompassing the alpha-globin cluster becomes significantly decompacted in differentiated erythroid cells, and depleted in CTCF binding and CTCF-anchored chromatin loops, while the sub-TAD harboring the alpha-globin gene cluster and the upstream major regulatory element (MRE) becomes highly enriched with chromatin interactions as compared to lymphoid and proliferating erythroid cells. The alpha-globin gene domain and the neighboring loci reside within the A-like chromatin compartment in both lymphoid and erythroid cells and become further segregated from the upstream gene desert upon terminal erythroid differentiation.
Our findings demonstrate that the effects of tissue-specific transcription activation are not restricted to the host genomic locus but affect the overall chromatin structure and transcriptional output of the encompassing topologically associating domain.
Genome-wide analysis of WRKY gene family in Cucumis sativus
2011-01-01
Background WRKY proteins are a large family of transcriptional regulators in higher plants. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. Results We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stress (cold, drought or salinity). The expression profiles of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Conclusions Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes. PMID:21955985
Genome-wide analysis of WRKY gene family in Cucumis sativus.
Ling, Jian; Jiang, Weijie; Zhang, Ying; Yu, Hongjun; Mao, Zhenchuan; Gu, Xingfang; Huang, Sanwen; Xie, Bingyan
2011-09-28
WRKY proteins are a large family of transcriptional regulators in higher plants. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stress (cold, drought or salinity). The expression profiles of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes.
Liu, Chaoyang; Xie, Tao; Chen, Chenjie; Luan, Aiping; Long, Jianmei; Li, Chuhao; Ding, Yaqi; He, Yehua
2017-07-01
The MYB proteins comprise one of the largest families of plant transcription factors, which are involved in various plant physiological and biochemical processes. Pineapple (Ananas comosus) is one of the three most important tropical fruits worldwide. The completion of pineapple genome sequencing provides a great opportunity to investigate the organization and evolutionary traits of pineapple MYB genes at the genome-wide level. In the present study, a total of 94 pineapple R2R3-MYB genes were identified and further phylogenetically classified into 26 subfamilies, as supported by the conserved gene structures and motif composition. Collinearity analysis indicated that segmental duplication events played a crucial role in the expansion of the pineapple MYB gene family. Further comparative phylogenetic analysis suggested that there have been functional divergences of the MYB gene family during plant evolution. RNA-seq data from different tissues and developmental stages revealed distinct temporal and spatial expression profiles of the AcMYB genes. Further quantitative expression analysis showed the specific expression patterns of the selected putative stress-related AcMYB genes in response to distinct abiotic stress and hormonal treatments. The comprehensive expression analysis of the pineapple MYB genes, especially the tissue-preferential and stress-responsive genes, could provide valuable clues for further functional characterization. In this work, we systematically identified AcMYB genes by analyzing the pineapple genome sequence using a set of bioinformatics approaches. Our findings provide a global insight into the organization, phylogeny and expression patterns of the pineapple R2R3-MYB genes, and hence contribute to a greater understanding of their biological roles in pineapple.
Russell, Scott D; Gou, Xiaoping; Wong, Chui E; Wang, Xinkun; Yuan, Tong; Wei, Xiaoping; Bhalla, Prem L; Singh, Mohan B
2012-08-01
Genomic assay of sperm cell RNA provides insight into functional control, modes of regulation, and contributions of male gametes to double fertilization. Sperm cells of rice (Oryza sativa) were isolated from field-grown, disease-free plants and RNA was processed for use with the full-genome Affymetrix microarray. Comparison with Gene Expression Omnibus (GEO) reference arrays confirmed expressionally distinct gene profiles. A total of 10,732 distinct gene sequences were detected in sperm cells, of which 1668 were not expressed in pollen or seedlings. Pathways enriched in male germ cells included ubiquitin-mediated pathways, pathways involved in chromatin modeling including histones, histone modification and nonhistone epigenetic modification, and pathways related to RNAi and gene silencing. Genome-wide expression patterns in angiosperm sperm cells indicate common and divergent themes in the male germline that appear to be largely self-regulating through highly up-regulated chromatin modification pathways. A core of highly conserved genes appear common to all sperm cells, but evidence is still emerging that another class of genes have diverged in expression between monocots and dicots since their divergence. Sperm cell transcripts present at fusion may be transmitted through plasmogamy during double fertilization to effect immediate post-fertilization expression of early embryo and (or) endosperm development. © 2012 The Authors. New Phytologist © 2012 New Phytologist Trust.
The histone H3 variant H3.3 regulates gene body DNA methylation in Arabidopsis thaliana.
Wollmann, Heike; Stroud, Hume; Yelagandula, Ramesh; Tarutani, Yoshiaki; Jiang, Danhua; Jing, Li; Jamge, Bhagyshree; Takeuchi, Hidenori; Holec, Sarah; Nie, Xin; Kakutani, Tetsuji; Jacobsen, Steven E; Berger, Frédéric
2017-05-18
Gene bodies of vertebrates and flowering plants are occupied by the histone variant H3.3 and DNA methylation. The origin and significance of these profiles remain largely unknown. DNA methylation and H3.3 enrichment profiles over gene bodies are correlated and both have a similar dependence on gene transcription levels. This suggests a mechanistic link between H3.3 and gene body methylation. We engineered an H3.3 knockdown in Arabidopsis thaliana and observed transcription reduction that predominantly affects genes responsive to environmental cues. When H3.3 levels are reduced, gene bodies show a loss of DNA methylation correlated with transcription levels. To study the origin of changes in DNA methylation profiles when H3.3 levels are reduced, we examined genome-wide distributions of several histone H3 marks, H2A.Z, and linker histone H1. We report that in the absence of H3.3, H1 distribution increases in gene bodies in a transcription-dependent manner. We propose that H3.3 prevents recruitment of H1, inhibiting H1's promotion of chromatin folding that restricts access to DNA methyltransferases responsible for gene body methylation. Thus, gene body methylation is likely shaped by H3.3 dynamics in conjunction with transcriptional activity.
Nash, Claire; Boufaied, Nadia; Mills, Ian G; Franco, Omar E; Hayward, Simon W; Thomson, Axel A
2017-05-05
The androgen receptor (AR) is a transcription factor, and key regulator of prostate development and cancer, which has discrete functions in stromal versus epithelial cells. AR expressed in mesenchyme is necessary and sufficient for prostate development while loss of stromal AR is predictive of prostate cancer progression. Many studies have characterized genome-wide binding of AR in prostate tumour cells but none have used primary mesenchyme or stroma. We applied ChIPseq to identify genomic AR binding sites in primary human fetal prostate fibroblasts and patient-derived cancer-associated fibroblasts, as well as the WPMY1 cell line overexpressing AR. We identified AR binding sites that were specific to fetal prostate fibroblasts (7534), cancer fibroblasts (629), WPMY1-AR (2561) as well as those common among all (783). Primary fibroblasts had a distinct AR binding profile versus prostate cancer cell lines and tissue, and showed a localisation to gene promoter binding sites 1 kb upstream of the transcriptional start site, as well as non-classical AR binding sequence motifs. We used RNAseq to define transcribed genes associated with AR binding sites and derived cistromes for embryonic and cancer fibroblasts as well as a cistrome common to both. These were compared to several in vivo ChIPseq and transcript expression datasets, which identified subsets of AR targets that were expressed in vivo and regulated by androgens. This analysis enabled us to deconvolute stromal AR targets active in stroma within tumour samples. Taken together, our data suggest that the AR shows significantly different genomic binding site locations in primary prostate fibroblasts compared to those observed in tumour cells. Validation of our AR binding site data with transcript expression in vitro and in vivo suggests that the AR target genes we have identified in primary fibroblasts may contribute to clinically significant and biologically important AR-regulated changes in prostate tissue.
Copyright © 2017. Published by Elsevier B.V.
Angelastro, James M.; Klimaschewski, Lars; Tang, Song; Vitolo, Ottavio V.; Weissman, Tamily A.; Donlin, Laura T.; Shelanski, Michael L.; Greene, Lloyd A.
2000-01-01
Neurotrophic factors such as nerve growth factor (NGF) promote a wide variety of responses in neurons, including differentiation, survival, plasticity, and repair. Such actions often require changes in gene expression. To identify the regulated genes and thereby to more fully understand the NGF mechanism, we carried out serial analysis of gene expression (SAGE) profiling of transcripts derived from rat PC12 cells before and after NGF-promoted neuronal differentiation. Multiple criteria supported the reliability of the profile. Approximately 157,000 SAGE tags were analyzed, representing at least 21,000 unique transcripts. Of these, nearly 800 were regulated by 6-fold or more in response to NGF. Approximately 150 of the regulated transcripts have been matched to named genes, the majority of which were not previously known to be NGF-responsive. Functional categorization of the regulated genes provides insight into the complex, integrated mechanism by which NGF promotes its multiple actions. It is anticipated that as genomic sequence information accrues the data derived here will continue to provide information about neurotrophic factor mechanisms. PMID:10984536
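The 6-fold criterion mentioned in the abstract above can be illustrated with a minimal sketch. The tag names, counts, and the pseudocount below are hypothetical and are not taken from the study; a real SAGE comparison would also need to normalize for library size and assess statistical significance, which this sketch omits.

```python
def fold_change(before, after, pseudocount=1.0):
    """Ratio of tag abundance after vs. before treatment.

    The pseudocount (an assumption of this sketch) avoids division by zero
    for tags that are absent from one library.
    """
    return (after + pseudocount) / (before + pseudocount)


def regulated_tags(counts, threshold=6.0):
    """Return tags whose abundance changes at least `threshold`-fold, up or down."""
    hits = {}
    for tag, (before, after) in counts.items():
        fc = fold_change(before, after)
        if fc >= threshold or fc <= 1.0 / threshold:
            hits[tag] = fc
    return hits


# Hypothetical (before NGF, after NGF) tag counts, for illustration only.
example_counts = {
    "TAG_A": (2, 40),   # strongly induced
    "TAG_B": (30, 3),   # strongly repressed
    "TAG_C": (10, 12),  # essentially unchanged
}

if __name__ == "__main__":
    for tag, fc in regulated_tags(example_counts).items():
        print(f"{tag}: {fc:.2f}-fold")
```

Applying the threshold symmetrically (at least 6-fold up or at most 1/6 down) is one reading of "regulated by 6-fold or more"; the abstract does not specify how down-regulation was scored.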
Nayduch, Dana; Lee, Matthew B; Saski, Christopher A
2014-01-01
Unlike other important vectors such as mosquitoes and sandflies, genetic and genomic tools for Culicoides biting midges are lacking, despite the fact that they vector a large number of arboviruses and other pathogens impacting humans and domestic animals world-wide. In North America, female Culicoides sonorensis midges are important vectors of bluetongue virus (BTV) and epizootic hemorrhagic disease virus (EHDV), orbiviruses that cause significant disease in livestock and wildlife. Libraries of tissue-specific transcripts expressed in response to feeding and oral orbivirus challenge in C. sonorensis have previously been reported, but extensive genome-wide expression profiling in the midge has not. Here, we successfully used deep sequencing technologies to construct the first adult female C. sonorensis reference transcriptome, and utilized genome-wide expression profiling to elucidate the genetic response to blood and sucrose feeding over time. The adult female midge unigene consists of 19,041 genes, of which less than 7% are differentially expressed during the course of a sucrose meal, while up to 52% of the genes respond significantly in blood-fed midges, indicating hematophagy induces complex physiological processes. Many genes that were differentially expressed during blood feeding were associated with digestion (e.g., proteases, lipases), hematophagy (e.g., salivary proteins), and vitellogenesis, revealing many major metabolic and biological factors underlying these critical processes. Additionally, key genes in the vitellogenesis pathway were identified, which provides the first glimpse into the molecular basis of anautogeny for C. sonorensis. This is the first extensive transcriptome for this genus, which will serve as a framework for future expression and RNAi studies and provide a rich dataset contributing to the ultimate goal of informing a reference genome assembly and annotation.
Moreover, this study will serve as a foundation for subsequent studies of genome-wide expression analyses during early orbivirus infection and dissecting the molecular mechanisms behind vector competence in midges.
Accurate Prediction of Inducible Transcription Factor Binding Intensities In Vivo
Siepel, Adam; Lis, John T.
2012-01-01
DNA sequence and local chromatin landscape act jointly to determine transcription factor (TF) binding intensity profiles. To disentangle these influences, we developed an experimental approach, called protein/DNA binding followed by high-throughput sequencing (PB–seq), that allows the binding energy landscape to be characterized genome-wide in the absence of chromatin. We applied our methods to the Drosophila Heat Shock Factor (HSF), which inducibly binds a target DNA sequence element (HSE) following heat shock stress. PB–seq involves incubating sheared naked genomic DNA with recombinant HSF, partitioning the HSF–bound and HSF–free DNA, and then detecting HSF–bound DNA by high-throughput sequencing. We compared PB–seq binding profiles with ones observed in vivo by ChIP–seq and developed statistical models to predict the observed departures from idealized binding patterns based on covariates describing the local chromatin environment. We found that DNase I hypersensitivity and tetra-acetylation of H4 were the most influential covariates in predicting changes in HSF binding affinity. We also investigated the extent to which DNA accessibility, as measured by digital DNase I footprinting data, could be predicted from MNase–seq data and the ChIP–chip profiles for many histone modifications and TFs, and found GAGA element associated factor (GAF), tetra-acetylation of H4, and H4K16 acetylation to be the most predictive covariates. Lastly, we generated an unbiased model of HSF binding sequences, which revealed distinct biophysical properties of the HSF/HSE interaction and a previously unrecognized substructure within the HSE. These findings provide new insights into the interplay between the genomic sequence and the chromatin landscape in determining transcription factor binding intensity. PMID:22479205
Lu, Zefu; Yu, Hong; Xiong, Guosheng; Wang, Jing; Jiao, Yongqing; Liu, Guifu; Jing, Yanhui; Meng, Xiangbing; Hu, Xingming; Qian, Qian; Fu, Xiangdong; Wang, Yonghong; Li, Jiayang
2013-01-01
IDEAL PLANT ARCHITECTURE1 (IPA1) is critical in regulating rice (Oryza sativa) plant architecture and substantially enhances grain yield. To elucidate its molecular basis, we first confirmed IPA1 as a functional transcription activator and then identified 1067 and 2185 genes associated with IPA1 binding sites in shoot apices and young panicles, respectively, through chromatin immunoprecipitation sequencing assays. The SQUAMOSA PROMOTER BINDING PROTEIN-box direct binding core motif GTAC was highly enriched in IPA1 binding peaks; interestingly, a previously uncharacterized indirect binding motif TGGGCC/T was found to be significantly enriched through the interaction of IPA1 with proliferating cell nuclear antigen PROMOTER BINDING FACTOR1 or PROMOTER BINDING FACTOR2. Genome-wide expression profiling by RNA sequencing revealed IPA1 roles in diverse pathways. Moreover, our results demonstrated that IPA1 could directly bind to the promoter of rice TEOSINTE BRANCHED1, a negative regulator of tiller bud outgrowth, to suppress rice tillering, and directly and positively regulate DENSE AND ERECT PANICLE1, an important gene regulating panicle architecture, to influence plant height and panicle length. The elucidation of target genes of IPA1 genome-wide will contribute to understanding the molecular mechanisms underlying plant architecture and to facilitating the breeding of elite varieties with ideal plant architecture. PMID:24170127
Measuring and Reducing Off-Target Activities of Programmable Nucleases Including CRISPR-Cas9
Koo, Taeyoung; Lee, Jungjoon; Kim, Jin-Soo
2015-01-01
Programmable nucleases, which include zinc-finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and RNA-guided engineered nucleases (RGENs) repurposed from the type II clustered, regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated protein 9 (Cas9) system are now widely used for genome editing in higher eukaryotic cells and whole organisms, revolutionising almost every discipline in biological research, medicine, and biotechnology. All of these nucleases, however, induce off-target mutations at sites homologous in sequence with on-target sites, limiting their utility in many applications including gene or cell therapy. In this review, we compare methods for detecting nuclease off-target mutations. We also review methods for profiling genome-wide off-target effects and discuss how to reduce or avoid off-target mutations. PMID:25985872
Regulating RNA polymerase pausing and transcription elongation in embryonic stem cells
Min, Irene M.; Waterfall, Joshua J.; Core, Leighton J.; Munroe, Robert J.; Schimenti, John; Lis, John T.
2011-01-01
Transitions between pluripotent stem cells and differentiated cells are executed by key transcription regulators. Comparative measurements of RNA polymerase distribution over the genome's primary transcription units in different cell states can identify the genes and steps in the transcription cycle that are regulated during such transitions. To identify the complete transcriptional profiles of RNA polymerases with high sensitivity and resolution, as well as the critical regulated steps upon which regulatory factors act, we used genome-wide nuclear run-on (GRO-seq) to map the density and orientation of transcriptionally engaged RNA polymerases in mouse embryonic stem cells (ESCs) and mouse embryonic fibroblasts (MEFs). In both cell types, progression of a promoter-proximal, paused RNA polymerase II (Pol II) into productive elongation is a rate-limiting step in transcription of ∼40% of mRNA-encoding genes. Importantly, quantitative comparisons between cell types reveal that transcription is controlled frequently at paused Pol II's entry into elongation. Furthermore, “bivalent” ESC genes (exhibiting both active and repressive histone modifications) bound by Polycomb group complexes PRC1 (Polycomb-repressive complex 1) and PRC2 show dramatically reduced levels of paused Pol II at promoters relative to an average gene. In contrast, bivalent promoters bound by only PRC2 allow Pol II pausing, but it is confined to extremely 5′ proximal regions. Altogether, these findings identify rate-limiting targets for transcription regulation during cell differentiation. PMID:21460038
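One common way to quantify promoter-proximal pausing from run-on data such as GRO-seq is a pausing index: the read density near the promoter divided by the read density over the gene body. The sketch below illustrates that ratio only. It is not claimed to be the authors' exact procedure, and the window sizes and read counts are hypothetical.

```python
def read_density(read_count, length_bp):
    """Reads per base pair over a genomic window."""
    if length_bp <= 0:
        raise ValueError("window length must be positive")
    return read_count / length_bp


def pausing_index(promoter_reads, promoter_len, body_reads, body_len):
    """Promoter-proximal read density divided by gene-body read density.

    Values well above 1 suggest polymerase accumulates near the promoter.
    A gene body with zero reads yields infinity if the promoter has reads,
    and 0.0 if neither window has any signal.
    """
    body = read_density(body_reads, body_len)
    promoter = read_density(promoter_reads, promoter_len)
    if body == 0:
        return float("inf") if promoter > 0 else 0.0
    return promoter / body


if __name__ == "__main__":
    # Hypothetical gene: 200 reads in a 250 bp promoter-proximal window,
    # 500 reads across a 10 kb gene body.
    print(pausing_index(200, 250, 500, 10_000))
```

In practice the significance of the enrichment is usually tested as well (for example against a uniform distribution of reads along the gene), since a ratio alone is noisy for weakly transcribed genes.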
The root transcriptome for North American ginseng assembled and profiled across seasonal development
2013-01-01
Background Ginseng, including North American ginseng (Panax quinquefolius L.), is one of the most widely used medicinal plants. Its success is thought to be due to a diverse collection of ginsenosides that serve as its major bioactive compounds. However, few genomic resources exist and the details concerning its various biosynthetic pathways remain poorly understood. As the root is the primary tissue harvested commercially for ginsenosides, next generation sequencing was applied to the characterization and assembly of the root transcriptome throughout seasonal development. Transcripts showing homology to ginsenoside biosynthesis enzymes were profiled in greater detail. Results RNA extracts from root samples from seven developmental stages of North American ginseng were subjected to 454 sequencing, filtered for quality and used in the de novo assembly of a collective root reference transcriptome consisting of 41,623 transcripts. Annotation efforts using a number of public databases resulted in detailed annotation information for 34,801 (84%) transcripts. In addition, 3,955 genes were assigned to metabolic pathways using the Kyoto Encyclopedia of Genes and Genomes. Among our results, we found all of the known enzymes involved in ginsenoside backbone biosynthesis and used co-expression analysis to identify a number of candidate sequences involved in the later stages of the ginsenoside biosynthesis pathway. Transcript profiles suggest ginsenoside biosynthesis occurs at distinct stages of development. Conclusions The assembly generated provides a comprehensive annotated reference for future transcriptomic study of North American ginseng. A collection of putative ginsenoside biosynthesis genes were identified and candidate genes predicted from the lesser understood downstream stages of biosynthesis.
Transcript expression profiles across seasonal development suggest a primary dammarane-type ginsenoside biosynthesis occurs just prior to plant senescence, with secondary ginsenoside production occurring throughout development. Data from the study provide a valuable resource for conducting future ginsenoside biosynthesis research in this important medicinal plant. PMID:23957709
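The annotation rate reported in the abstract above can be checked with a short calculation. The two counts are taken from the abstract; the variable and function names are illustrative only.

```python
def percent(part, whole):
    """Share of `part` in `whole`, as a percentage."""
    return 100.0 * part / whole


total_transcripts = 41_623  # assembled root reference transcriptome
annotated = 34_801          # transcripts with detailed annotation

annotated_pct = percent(annotated, total_transcripts)
unannotated = total_transcripts - annotated

if __name__ == "__main__":
    print(f"annotated: {annotated_pct:.1f}% ({unannotated} transcripts without annotation)")
```

The ratio comes to roughly 83.6%, which rounds to the 84% stated in the abstract.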
Bashiri, Asher; Heo, Hye J.; Ben-Avraham, Danny; Mazor, Moshe; Budagov, Temuri; Einstein, Francine H.; Atzmon, Gil
2014-01-01
Maternal obesity is a significant risk factor for development of both maternal and fetal metabolic complications. Increase in visceral fat and insulin resistance is a metabolic hallmark of pregnancy, yet little is known about how obesity alters adipose cellular function and how this may contribute to pregnancy morbidities. We sought to identify alterations in genome-wide transcription expression in both visceral (omental) and abdominal subcutaneous fat deposits in pregnancy complicated by obesity. Visceral and abdominal subcutaneous fat deposits were collected from normal weight and obese pregnant women (n=4/group) at the time of scheduled uncomplicated cesarean section. A genome-wide expression array (Affymetrix Human Exon 1.0 st platform), validated by quantitative real-time PCR, was utilized to establish the gene transcript expression profile in both visceral and abdominal subcutaneous fat in normal weight and obese pregnant women. Global alteration in gene expression was identified in pregnancy complicated by obesity. These regions of variation led to the identification of indolethylamine N-methyltransferase (INMT), tissue factor pathway inhibitor-2 (TFPI-2), and ephrin type-B receptor 6 (EPHB6), not previously associated with fat metabolism during pregnancy. In addition, subcutaneous fat of obese pregnant women demonstrated increased coding protein transcripts associated with apoptosis compared to lean counterparts. Global alteration of gene expression in adipose tissue may contribute to adverse pregnancy outcomes associated with obesity. PMID:24696292
Pujolar, J M; Milan, M; Marino, I A M; Capoccioni, F; Ciccotti, E; Belpaire, C; Covaci, A; Malarvannan, G; Patarnello, T; Bargelloni, L; Zane, L; Maes, G E
2013-05-15
The European eel is an example of a critically endangered fish species strongly affected by human stressors throughout its life cycle, in which pollution is considered to be one of the factors responsible for the decline of the stock. The objective of our study was to better understand the transcriptional response of European eels chronically exposed to pollutants in their natural environment. A total of 42 pre-migrating (silver) female eels from lowly, highly and extremely polluted environments in Belgium and, for comparative purposes, a lowly polluted habitat in Italy were measured for polychlorinated biphenyls (PCBs), organochlorine pesticides (OCPs) and brominated flame retardants (BFRs). Multipollutant level of bioaccumulation was linked to their genome-wide gene transcription using an eel-specific array of 14,913 annotated cDNAs. Shared responses to pollutant exposure were observed when comparing the highly polluted site in Belgium with the relatively clean sites in Belgium and Italy. First, an altered pattern of transcription was observed for genes associated with detoxification, with a novel European eel CYP3A gene and glutathione S-transferase transcriptionally up-regulated. Second, an altered pattern of transcription was observed for genes associated with the oxidative phosphorylation pathway, with the following genes involved in the generation of ATP being transcriptionally down-regulated in individuals from the highly polluted site: NADH dehydrogenase, succinate dehydrogenase, ubiquinol-cytochrome c reductase, cytochrome c oxidase and ATP synthase. Although we did not measure metabolism directly, the fact that the transcription levels of many genes encoding enzymes involved in the mitochondrial respiratory chain and oxidative phosphorylation were down-regulated in the highly polluted site suggests that pollutants may have a significant effect on energy metabolism in these fish. Copyright © 2013 Elsevier B.V. All rights reserved.
Hao, Ming; Li, Aili; Shi, Tongwei; Luo, Jiangtao; Zhang, Lianquan; Zhang, Xuechuan; Ning, Shunzong; Yuan, Zhongwei; Zeng, Deying; Kong, Xingchen; Li, Xiaolong; Zheng, Hongkun; Lan, Xiujin; Zhang, Huaigang; Zheng, Youliang; Mao, Long; Liu, Dengcai
2017-02-10
The formation of an allopolyploid is a two-step process, comprising an initial wide hybridization event, which is later followed by a whole genome doubling. Both processes can affect the transcription of homoeologues. Here, RNA-Seq was used to obtain the genome-wide leaf transcriptome of two independent Triticum turgidum × Aegilops tauschii allotriploids (F1), along with their spontaneous allohexaploids (S1) and their parental lines. The resulting sequence data were then used to characterize variation in homoeologue transcript abundance. The hybridization event strongly down-regulated D-subgenome homoeologues, but this effect was in many cases reversed by whole genome doubling. The suppression of D-subgenome homoeologue transcription resulted in a marked frequency of parental transcription level dominance, especially with respect to genes encoding proteins involved in photosynthesis. Singletons (genes where no homoeologues were present) were frequently transcribed in both the allotriploid and allohexaploid plants. The implication is that whole genome doubling helps to overcome the phenotypic weakness of the allotriploid, restoring a more favourable gene dosage in genes experiencing transcription level dominance in hexaploid wheat.
Li, LiQi; Jothi, Raja; Cui, Kairong; Lee, Jan Y; Cohen, Tsadok; Gorivodsky, Marat; Tzchori, Itai; Zhao, Yangu; Hayes, Sandra M; Bresnick, Emery H; Zhao, Keji; Westphal, Heiner; Love, Paul E
2013-01-01
The nuclear adaptor Ldb1 functions as a core component of multiprotein transcription complexes that regulate differentiation in diverse cell types. In the hematopoietic lineage, Ldb1 forms a complex with the non–DNA-binding adaptor Lmo2 and the transcription factors E2A, Scl and GATA-1 (or GATA-2). Here we demonstrate a critical and continuous requirement for Ldb1 in the maintenance of both fetal and adult mouse hematopoietic stem cells (HSCs). Deletion of Ldb1 in hematopoietic progenitors resulted in the downregulation of many transcripts required for HSC maintenance. Genome-wide profiling by chromatin immunoprecipitation followed by sequencing (ChIP-Seq) identified Ldb1 complex–binding sites at highly conserved regions in the promoters of genes involved in HSC maintenance. Our results identify a central role for Ldb1 in regulating the transcriptional program responsible for the maintenance of HSCs. PMID:21186366
Li, LiQi; Jothi, Raja; Cui, Kairong; Lee, Jan Y; Cohen, Tsadok; Gorivodsky, Marat; Tzchori, Itai; Zhao, Yangu; Hayes, Sandra M; Bresnick, Emery H; Zhao, Keji; Westphal, Heiner; Love, Paul E
2011-02-01
The nuclear adaptor Ldb1 functions as a core component of multiprotein transcription complexes that regulate differentiation in diverse cell types. In the hematopoietic lineage, Ldb1 forms a complex with the non-DNA-binding adaptor Lmo2 and the transcription factors E2A, Scl and GATA-1 (or GATA-2). Here we demonstrate a critical and continuous requirement for Ldb1 in the maintenance of both fetal and adult mouse hematopoietic stem cells (HSCs). Deletion of Ldb1 in hematopoietic progenitors resulted in the downregulation of many transcripts required for HSC maintenance. Genome-wide profiling by chromatin immunoprecipitation followed by sequencing (ChIP-Seq) identified Ldb1 complex-binding sites at highly conserved regions in the promoters of genes involved in HSC maintenance. Our results identify a central role for Ldb1 in regulating the transcriptional program responsible for the maintenance of HSCs.
Ultraconserved regions encoding ncRNAs are altered in human leukemias and carcinomas.
Calin, George A; Liu, Chang-gong; Ferracin, Manuela; Hyslop, Terry; Spizzo, Riccardo; Sevignani, Cinzia; Fabbri, Muller; Cimmino, Amelia; Lee, Eun Joo; Wojcik, Sylwia E; Shimizu, Masayoshi; Tili, Esmerina; Rossi, Simona; Taccioli, Cristian; Pichiorri, Flavia; Liu, Xiuping; Zupo, Simona; Herlea, Vlad; Gramantieri, Laura; Lanza, Giovanni; Alder, Hansjuerg; Rassenti, Laura; Volinia, Stefano; Schmittgen, Thomas D; Kipps, Thomas J; Negrini, Massimo; Croce, Carlo M
2007-09-01
Noncoding RNA (ncRNA) transcripts are thought to be involved in human tumorigenesis. We report that a large fraction of genomic ultraconserved regions (UCRs) encode a particular set of ncRNAs whose expression is altered in human cancers. Genome-wide profiling revealed that UCRs have distinct signatures in human leukemias and carcinomas. UCRs are frequently located at fragile sites and genomic regions involved in cancers. We identified certain UCRs whose expression may be regulated by microRNAs abnormally expressed in human chronic lymphocytic leukemia, and we proved that the inhibition of an overexpressed UCR induces apoptosis in colon cancer cells. Our findings argue that ncRNAs and interaction between noncoding genes are involved in tumorigenesis to a greater extent than previously thought.
Hassler, Melanie R; Pulverer, Walter; Lakshminarasimhan, Ranjani; Redl, Elisa; Hacker, Julia; Garland, Gavin D; Merkel, Olaf; Schiefer, Ana-Iris; Simonitsch-Klupp, Ingrid; Kenner, Lukas; Weisenberger, Daniel J; Weinhaeusel, Andreas; Turner, Suzanne D; Egger, Gerda
2016-10-04
Aberrant DNA methylation patterns in malignant cells allow insight into tumor evolution and development and can be used for disease classification. Here, we describe the genome-wide DNA methylation signatures of NPM-ALK-positive (ALK+) and NPM-ALK-negative (ALK-) anaplastic large-cell lymphoma (ALCL). We find that ALK+ and ALK- ALCL share common DNA methylation changes for genes involved in T cell differentiation and immune response, including TCR and CTLA-4, without an ALK-specific impact on tumor DNA methylation in gene promoters. Furthermore, we uncover a close relationship between global ALCL DNA methylation patterns and those in distinct thymic developmental stages and observe tumor-specific DNA hypomethylation in regulatory regions that are enriched for conserved transcription factor binding motifs such as AP1. Our results indicate similarity between ALCL tumor cells and thymic T cell subsets and a direct relationship between ALCL oncogenic signaling and DNA methylation through transcription factor induction and occupancy. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
Lopez-Bigas, Nuria; Kisiel, Tomasz A.; DeWaal, Dannielle C.; Holmes, Katie B.; Volkert, Tom L.; Gupta, Sumeet; Love, Jennifer; Murray, Heather L.; Young, Richard A.; Benevolenskaya, Elizaveta V.
2010-01-01
SUMMARY Retinoblastoma protein (pRB) mediates cell-cycle withdrawal and differentiation by interacting with a variety of proteins. RB-Binding Protein 2 (RBP2) has been shown to be a key effector. We sought to determine transcriptional regulation by RBP2 genome-wide by using location analysis and gene expression profiling experiments. We describe that RBP2 shows high correlation with the presence of H3K4me3 and its target genes are separated into two functionally distinct classes: differentiation-independent and differentiation-dependent genes. The former class is enriched by genes that encode mitochondrial proteins, while the latter is represented by cell-cycle genes. We demonstrate the role of RBP2 in mitochondrial biogenesis, which involves regulation of H3K4me3-modified nucleosomes. Analysis of expression changes upon RBP2 depletion depicted genes with a signature of differentiation control, analogous to the changes seen upon reintroduction of pRB. We conclude that, during differentiation, RBP2 exerts inhibitory effects on multiple genes through direct interaction with their promoters. PMID:18722178
2012-01-01
Background Filamentous fungi are confronted with changes and limitations of their carbon source during growth in their natural habitats and during industrial applications. To survive life-threatening starvation conditions, carbon from endogenous resources becomes mobilized to fuel maintenance and self-propagation. Key to understanding the underlying cellular processes is the system-wide analysis of fungal starvation responses at temporal and spatial resolution. The knowledge deduced is important for the development of optimized industrial production processes. Results This study describes the physiological, morphological and genome-wide transcriptional changes caused by prolonged carbon starvation during submerged batch cultivation of the filamentous fungus Aspergillus niger. Bioreactor cultivation supported highly reproducible growth conditions and monitoring of physiological parameters. Changes in hyphal growth and morphology were analyzed at distinct cultivation phases using automated image analysis. The Affymetrix GeneChip platform was used to establish genome-wide transcriptional profiles for three selected time points during prolonged carbon starvation. Compared to the exponential growth transcriptome, about 50% (7,292) of all genes displayed differential gene expression during at least one of the starvation time points. Enrichment analysis of Gene Ontology, Pfam domain and KEGG pathway annotations uncovered autophagy and asexual reproduction as major global transcriptional trends. Induced transcription of genes encoding hydrolytic enzymes was accompanied by increased secretion of hydrolases including chitinases, glucanases, proteases and phospholipases as identified by mass spectrometry. Conclusions This study is the first system-wide analysis of the carbon starvation response in a filamentous fungus. Morphological, transcriptomic and secretomic analyses identified key events important for fungal survival and their chronology. 
The dataset obtained forms a comprehensive framework for further elucidation of the interrelation and interplay of the individual cellular events involved. PMID:22873931
Targeting Transcriptional Regulators of CD8+ T Cell Dysfunction to Boost Anti-Tumor Immunity
Waugh, Katherine A.; Leach, Sonia M.; Slansky, Jill E.
2015-01-01
Transcription is a dynamic process influenced by the cellular environment: healthy, transformed, and otherwise. Genome-wide mRNA expression profiles reflect the collective impact of pathways modulating cell function under different conditions. In this review we focus on the transcriptional pathways that control tumor infiltrating CD8+ T cell (TIL) function. Simultaneous restraint of overlapping inhibitory pathways may confer TIL resistance to multiple mechanisms of suppression traditionally referred to as exhaustion, tolerance, or anergy. Although decades of work have laid a solid foundation of altered transcriptional networks underlying various subsets of hypofunctional or “dysfunctional” CD8+ T cells, an understanding of the relevance in TIL has just begun. With recent technological advances, it is now feasible to further elucidate and utilize these pathways in immunotherapy platforms that seek to increase TIL function. PMID:26393659
Marshall, Owen J; Southall, Tony D; Cheetham, Seth W; Brand, Andrea H
2016-09-01
This protocol is an extension to: Nat. Protoc. 2, 1467-1478 (2007); doi:10.1038/nprot.2007.148; published online 7 June 2007. The ability to profile transcription and chromatin binding in a cell-type-specific manner is a powerful aid to understanding cell-fate specification and cellular function in multicellular organisms. We recently developed targeted DamID (TaDa) to enable genome-wide, cell-type-specific profiling of DNA- and chromatin-binding proteins in vivo without cell isolation. As a protocol extension, this article describes substantial modifications to an existing protocol, and it offers additional applications. TaDa builds upon DamID, a technique for detecting genome-wide DNA-binding profiles of proteins, by coupling it with the GAL4 system in Drosophila to enable both temporal and spatial resolution. TaDa ensures that Dam-fusion proteins are expressed at very low levels, thus avoiding toxicity and potential artifacts from overexpression. The modifications to the core DamID technique presented here also increase the speed of sample processing and throughput, and adapt the method to next-generation sequencing technology. TaDa is robust, reproducible and highly sensitive. Compared with other methods for cell-type-specific profiling, the technique requires no cell-sorting, cross-linking or antisera, and binding profiles can be generated from as few as 10,000 total induced cells. By profiling the genome-wide binding of RNA polymerase II (Pol II), TaDa can also identify transcribed genes in a cell-type-specific manner. Here we describe a detailed protocol for carrying out TaDa experiments and preparing the material for next-generation sequencing. Although we developed TaDa in Drosophila, it should be easily adapted to other organisms with an inducible expression system. Once transgenic animals are obtained, the entire experimental procedure, from collecting tissue samples to generating sequencing libraries, can be accomplished within 5 d.
Transcriptional architecture of the primate neocortex.
Bernard, Amy; Lubbers, Laura S; Tanis, Keith Q; Luo, Rui; Podtelezhnikov, Alexei A; Finney, Eva M; McWhorter, Mollie M E; Serikawa, Kyle; Lemon, Tracy; Morgan, Rebecca; Copeland, Catherine; Smith, Kimberly; Cullen, Vivian; Davis-Turak, Jeremy; Lee, Chang-Kyu; Sunkin, Susan M; Loboda, Andrey P; Levine, David M; Stone, David J; Hawrylycz, Michael J; Roberts, Christopher J; Jones, Allan R; Geschwind, Daniel H; Lein, Ed S
2012-03-22
Genome-wide transcriptional profiling was used to characterize the molecular underpinnings of neocortical organization in rhesus macaque, including cortical areal specialization and laminar cell-type diversity. Microarray analysis of individual cortical layers across sensorimotor and association cortices identified robust and specific molecular signatures for individual cortical layers and areas, prominently involving genes associated with specialized neuronal function. Overall, transcriptome-based relationships were related to spatial proximity, being strongest between neighboring cortical areas and between proximal layers. Primary visual cortex (V1) displayed the most distinctive gene expression compared to other cortical regions in rhesus and human, both in the specialized layer 4 as well as other layers. Laminar patterns were more similar between macaque and human compared to mouse, as was the unique V1 profile that was not observed in mouse. These data provide a unique resource detailing neocortical transcription patterns in a nonhuman primate with great similarity in gene expression to human. Copyright © 2012 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Adamopoulos, Panagiotis G.; Kontos, Christos K.; Scorilas, Andreas
Tissue kallikrein and kallikrein-related peptidases (KLKs) form the largest group of serine proteases in the human genome, sharing many structural and functional characteristics. Multiple alternative transcripts have been reported for most human KLK genes, while many of them are aberrantly expressed in various malignancies, thus possessing significant prognostic and/or diagnostic value. Alternative splicing of cancer-related genes is a common cellular mechanism accounting for cancer cell transcriptome complexity, as it affects cell cycle control, proliferation, apoptosis, invasion, and metastasis. In this study, we describe the identification and molecular cloning of eight novel transcripts of the human KLK10 gene using 3′ rapid amplification of cDNA ends (3′ RACE) and next-generation sequencing (NGS), as well as their expression analysis in a wide panel of cell lines, originating from several distinct cancerous and normal tissues. Bioinformatic analysis revealed that the novel KLK10 transcripts contain new alternative splicing events between already annotated exons as well as novel exons. In addition, investigation of their expression profile in a wide panel of cell lines was performed with nested RT-PCR using variant-specific pairs of primers. Since many KLK mRNA transcripts possess clinical value, these newly discovered alternatively spliced KLK10 transcripts appear as new potential biomarkers for diagnostic and/or prognostic purposes or as targets for therapeutic strategies. Highlights: • NGS was used to identify novel transcripts of the human KLK10 gene. • 8 novel KLK10 transcripts were identified. • A novel 3′UTR was detected and characterized. • The expression profiles of all 8 novel KLK10 transcripts were identified.
Algama, Manjula; Tasker, Edward; Williams, Caitlin; Parslow, Adam C; Bryson-Richardson, Robert J; Keith, Jonathan M
2017-03-27
Computational identification of non-coding RNAs (ncRNAs) is a challenging problem. We describe a genome-wide analysis using Bayesian segmentation to identify intronic elements highly conserved between three evolutionarily distant vertebrate species: human, mouse and zebrafish. We investigate the extent to which these elements include ncRNAs (or conserved domains of ncRNAs) and regulatory sequences. We identified 655 deeply conserved intronic sequences in a genome-wide analysis. We also performed a pathway-focussed analysis on genes involved in muscle development, detecting 27 intronic elements, of which 22 were not detected in the genome-wide analysis. At least 87% of the genome-wide and 70% of the pathway-focussed elements have existing annotations indicative of conserved RNA secondary structure. The expression of 26 of the pathway-focused elements was examined using RT-PCR, providing confirmation that they include expressed ncRNAs. Consistent with previous studies, these elements are significantly over-represented in the introns of transcription factors. This study demonstrates a novel, highly effective, Bayesian approach to identifying conserved non-coding sequences. Our results complement previous findings that these sequences are enriched in transcription factors. However, in contrast to previous studies which suggest the majority of conserved sequences are regulatory factor binding sites, the majority of conserved sequences identified using our approach contain evidence of conserved RNA secondary structures, and our laboratory results suggest most are expressed. Functional roles at DNA and RNA levels are not mutually exclusive, and many of our elements possess evidence of both. Moreover, ncRNAs play roles in transcriptional and post-transcriptional regulation, and this may contribute to the over-representation of these elements in introns of transcription factors. 
We attribute the higher sensitivity of the pathway-focussed analysis compared to the genome-wide analysis to improved alignment quality, suggesting that enhanced genomic alignments may reveal many more conserved intronic sequences.
Anderson, Letícia; Gomes, Monete Rajão; daSilva, Lucas Ferreira; Pereira, Adriana da Silva Andrade; Mourão, Marina M.; Romier, Christophe; Pierce, Raymond
2017-01-01
Background Schistosomiasis is a parasitic disease infecting hundreds of millions of people worldwide. Treatment depends on a single drug, praziquantel, which kills the Schistosoma spp. parasite only at the adult stage. HDAC inhibitors (HDACi) such as Trichostatin A (TSA) induce parasite mortality in vitro (schistosomula and adult worms), however the downstream effects of histone hyperacetylation on the parasite are not known. Methodology/Principal findings TSA treatment of adult worms in vitro increased histone acetylation at H3K9ac and H3K14ac, which are transcription activation marks, not affecting the unrelated transcription repression mark H3K27me3. We investigated the effect of TSA HDACi on schistosomula gene expression at three different time points, finding a marked genome-wide change in the transcriptome profile. Gene transcription activity was correlated with changes on the chromatin acetylation mark at gene promoter regions. Moreover, combining expression data with ChIP-Seq public data for schistosomula, we found that differentially expressed genes having the H3K4me3 mark at their promoter region in general showed transcription activation upon HDACi treatment, compared with those without the mark, which showed transcription down-regulation. Affected genes are enriched for DNA replication processes, most of them being up-regulated. Twenty out of 22 genes encoding proteins involved in reducing reactive oxygen species accumulation were down-regulated. Dozens of genes encoding proteins with histone reader motifs were changed, including SmEED from the PRC2 complex. We targeted SmEZH2 methyltransferase PRC2 component with a new EZH2 inhibitor (GSK343) and showed a synergistic effect with TSA, significantly increasing schistosomula mortality. Conclusions/Significance Genome-wide gene expression analyses have identified important pathways and cellular functions that were affected and may explain the schistosomicidal effect of TSA HDACi. 
The change in expression of dozens of histone reader genes involved in regulation of the epigenetic program in S. mansoni can be used as a starting point to look for possible novel schistosomicidal targets. PMID:28406899
Dalla Rosa, Ilaria; Zhang, Hongliang; Khiati, Salim; Wu, Xiaolin; Pommier, Yves
2017-12-08
Mitochondrial DNA (mtDNA) is essential for cell viability because it encodes subunits of the respiratory chain complexes. Mitochondrial topoisomerase IB (TOP1MT) facilitates mtDNA replication by removing DNA topological tensions produced during mtDNA transcription, but it appears to be dispensable. To test whether cells lacking TOP1MT have aberrant mtDNA transcription, we performed mitochondrial transcriptome profiling. To that end, we designed and implemented a customized tiling array, which enabled genome-wide, strand-specific, and simultaneous detection of all mitochondrial transcripts. Our technique revealed that Top1mt KO mouse cells process the mitochondrial transcripts normally but that protein-coding mitochondrial transcripts are elevated. Moreover, we found discrete long noncoding RNAs produced by H-strand transcription and encompassing the noncoding regulatory region of mtDNA in human and murine cells and tissues. Of note, these noncoding RNAs were strongly up-regulated in the absence of TOP1MT. In contrast, 7S DNA, produced by mtDNA replication, was reduced in the Top1mt KO cells. We propose that the long noncoding RNA species in the D-loop region are generated by the extension of H-strand transcripts beyond their canonical stop site and that TOP1MT acts as a topological barrier and regulator for mtDNA transcription and D-loop formation.
Salt-Responsive Transcriptome Profiling of Suaeda glauca via RNA Sequencing
Jin, Hangxia; Dong, Dekun; Yang, Qinghua; Zhu, Danhua
2016-01-01
Background Suaeda glauca, a succulent halophyte of the Chenopodiaceae family, is widely distributed in coastal areas of China. Suaeda glauca is highly resistant to salt and alkali stresses. In the present study, the salt-responsive transcriptome of Suaeda glauca was analyzed to identify genes involved in salt tolerance and study halophilic mechanisms in this halophyte. Results Illumina HiSeq 2500 was used to sequence cDNA libraries from salt-treated and control samples with three replicates per treatment. De novo assembly of the six transcriptomes identified 75,445 unigenes. A total of 23,901 (31.68%) unigenes were annotated. In a comparison of the transcriptomes from the three salt-treated and three salt-free samples, 231 differentially expressed genes (DEGs) were detected (including 130 up-regulated genes and 101 down-regulated genes), and 195 unigenes were functionally annotated. Based on the Gene Ontology (GO), Clusters of Orthologous Groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) classifications of the DEGs, more attention should be paid to transcripts associated with signal transduction, transporters, the cell wall and growth, defense metabolism and transcription factors involved in salt tolerance. Conclusions This report provides a genome-wide transcriptional analysis of a halophyte, Suaeda glauca, under salt stress. Further studies of the genetic basis of salt tolerance in halophytes are warranted. PMID:26930632
A genome-wide longitudinal transcriptome analysis of the aging model Podospora anserina.
Philipp, Oliver; Hamann, Andrea; Servos, Jörg; Werner, Alexandra; Koch, Ina; Osiewacz, Heinz D
2013-01-01
Aging of biological systems is controlled by various processes which have a potential impact on gene expression. Here we report a genome-wide transcriptome analysis of the fungal aging model Podospora anserina. Total RNA of three individuals of defined age was pooled and analyzed by SuperSAGE (serial analysis of gene expression). A bioinformatics analysis identified different molecular pathways to be affected during aging. While the abundance of transcripts linked to ribosomes and to the proteasome quality control system was found to decrease during aging, that of transcripts associated with autophagy increased, suggesting that autophagy may act as a compensatory quality control pathway. Transcript profiles associated with the energy metabolism including mitochondrial functions were identified to fluctuate during aging. Comparison of wild-type transcripts, which are continuously down-regulated during aging, with those down-regulated in the long-lived, copper-uptake mutant grisea, validated the relevance of age-related changes in cellular copper metabolism. Overall, we (i) present a unique age-related data set of a longitudinal study of the experimental aging model P. anserina which represents a reference resource for future investigations in a variety of organisms, (ii) suggest autophagy to be a key quality control pathway that becomes active once other pathways fail, and (iii) present testable predictions for subsequent experimental investigations.
Lacruz, Rodrigo S; Smith, Charles E; Bringas, Pablo; Chen, Yi-Bu; Smith, Susan M; Snead, Malcolm L; Kurtz, Ira; Hacia, Joseph G; Hubbard, Michael J; Paine, Michael L
2012-05-01
The gene repertoire regulating vertebrate biomineralization is poorly understood. Dental enamel, the most highly mineralized tissue in mammals, differs from other calcifying systems in that the formative cells (ameloblasts) lack remodeling activity and largely degrade and resorb the initial extracellular matrix. Enamel mineralization requires that ameloblasts undergo a profound functional switch from matrix-secreting to maturational (calcium transport, protein resorption) roles as mineralization progresses. During the maturation stage, extracellular pH decreases markedly, placing high demands on ameloblasts to regulate acidic environments present around the growing hydroxyapatite crystals. To identify the genetic events driving enamel mineralization, we conducted genome-wide transcript profiling of the developing enamel organ from rat incisors and highlight over 300 genes differentially expressed during maturation. Using multiple bioinformatics analyses, we identified groups of maturation-associated genes whose functions are linked to key mineralization processes including pH regulation, calcium handling, and matrix turnover. Subsequent qPCR and Western blot analyses revealed that a number of solute carrier (SLC) gene family members were up-regulated during maturation, including the novel protein Slc24a4 involved in calcium handling as well as other proteins of similar function (Stim1). By providing the first global overview of the cellular machinery required for enamel maturation, this study provides a strong foundation for improving basic understanding of biomineralization and its practical applications in healthcare. Copyright © 2011 Wiley Periodicals, Inc.
Liu, Shi-Huo; Li, Hong-Fei; Yang, Yang; Yang, Rui-Lin; Yang, Wen-Jia; Jiang, Hong-Bo; Dou, Wei; Smagghe, Guy; Wang, Jin-Jun
2018-05-01
Chitinases (Chts) and chitin deacetylases (CDAs) are important enzymes required for chitin metabolism in insects. In this study, 12 Cht-related genes (including seven Cht genes and five imaginal disc growth factor genes) and 6 CDA genes (encoding seven proteins) were identified in Bactrocera dorsalis using genome-wide searching and transcript profiling. Based on the conserved sequences and phylogenetic relationships, 12 Cht-related proteins were clustered into eight groups (group I-V and VII-IX). Further domain architecture analysis showed that all contained at least one chitinase catalytic domain, however, only four (BdCht5, BdCht7, BdCht8 and BdCht10) possessed chitin-binding domains. The subsequent phylogenetic analysis revealed that seven CDAs were clustered into five groups (group I-V), and all had one chitin deacetylase catalytic domain. However, only six exhibited chitin-binding domains. Finally, the development- and tissue-specific expression profiling showed that transcript levels of the 12 Cht-related genes and 6 CDA genes varied considerably among eggs, larvae, pupae and adults, as well as among different tissues of larvae and adults. Our findings illustrate the structural differences and expression patterns of Cht and CDA genes in B. dorsalis, and provide important information for the development of new pest control strategies based on these vital enzymes. Copyright © 2018. Published by Elsevier Inc.
Traverse, Charles C.
2017-01-01
ABSTRACT Advances in sequencing technologies have enabled direct quantification of genome-wide errors that occur during RNA transcription. These errors occur at rates that are orders of magnitude higher than rates during DNA replication, but due to technical difficulties such measurements have been limited to single-base substitutions and have not yet quantified the scope of transcription insertions and deletions. Previous reporter gene assay findings suggested that transcription indels are produced exclusively by elongation complex slippage at homopolymeric runs, so we enumerated indels across the protein-coding transcriptomes of Escherichia coli and Buchnera aphidicola, which differ widely in their genomic base compositions and incidence of repeat regions. As anticipated from prior assays, transcription insertions prevailed in homopolymeric runs of A and T; however, transcription deletions arose in much more complex sequences and were rarely associated with homopolymeric runs. By reconstructing the relocated positions of the elongation complex as inferred from the sequences inserted or deleted during transcription, we show that continuation of transcription after slippage hinges on the degree of nucleotide complementarity within the RNA:DNA hybrid at the new DNA template location. PMID:28851848
Shamimuzzaman, Md.
2018-01-01
To understand translational capacity on a genome-wide scale across three developmental stages of immature soybean seed cotyledons, ribosome profiling was performed in combination with RNA sequencing and cluster analysis. Transcripts representing 216 unique genes demonstrated a higher level of translational activity in at least one stage by exhibiting higher translational efficiencies (TEs) in which there were relatively more ribosome footprint sequence reads mapping to the transcript than were present in the control total RNA sample. The majority of these transcripts were more translationally active at the early stage of seed development and included 12 unique serine or cysteine proteases and 16 2S albumin and low molecular weight cysteine-rich proteins that may serve as substrates for turnover and mobilization early in seed development. It would appear that the serine proteases and 2S albumins play a vital role in the early stages. In contrast, our investigation of profiles of 19 genes encoding high abundance seed storage proteins, such as glycinins, beta-conglycinins, lectin, and Kunitz trypsin inhibitors, showed that they all had similar patterns in which the TE values started at low levels and increased approximately 2 to 6-fold during development. The highest levels of these seed protein transcripts were found at the mid-developmental stage, whereas the highest ribosome footprint levels of only up to 1.6 TE were found at the late developmental stage. These experimental findings suggest that the major seed storage protein coding genes are primarily regulated at the transcriptional level during normal soybean cotyledon development. Finally, our analyses also identified a total of 370 unique gene models that showed very low TE values including over 48 genes encoding ribosomal family proteins and 95 gene models that are related to energy and photosynthetic functions, many of which have homology to the chloroplast genome. 
Additionally, we showed that genes of the chloroplast were relatively translationally inactive during seed development. PMID:29570733
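The translational efficiency (TE) comparison described in the abstract above can be illustrated with a minimal sketch. It assumes TE is taken as a gene's share of ribosome footprint reads divided by its share of total RNA reads; the function name, the library-size normalization, and the toy counts are illustrative assumptions, not the authors' pipeline or data.

```python
def translational_efficiency(ribo_counts, rna_counts):
    """Per-gene TE = (fraction of footprint reads) / (fraction of RNA reads).

    Hypothetical helper: both inputs map gene name -> raw read count.
    """
    ribo_total = sum(ribo_counts.values())
    rna_total = sum(rna_counts.values())
    if ribo_total == 0 or rna_total == 0:
        raise ValueError("both libraries must contain at least one read")

    te = {}
    for gene, rna in rna_counts.items():
        if rna == 0:
            continue  # TE is undefined for genes with no RNA reads
        ribo = ribo_counts.get(gene, 0)
        te[gene] = (ribo / ribo_total) / (rna / rna_total)
    return te


# Toy counts, invented purely for illustration.
ribo = {"geneA": 300, "geneB": 100, "geneC": 600}
rna = {"geneA": 100, "geneB": 400, "geneC": 500}
te = translational_efficiency(ribo, rna)
for gene in sorted(te):
    print(gene, round(te[gene], 2))
```

Under this definition a TE above 1 means a transcript carries relatively more footprint reads than its RNA abundance alone would predict, and a TE below 1 means it is relatively translationally inactive.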
O'Brien, M.A.; Costin, B.N.; Miles, M.F.
2014-01-01
Postgenomic studies of the function of genes and their role in disease have now become an area of intense study since efforts to define the raw sequence material of the genome have largely been completed. The use of whole-genome approaches such as microarray expression profiling and, more recently, RNA-sequence analysis of transcript abundance has allowed an unprecedented look at the workings of the genome. However, the accurate derivation of such high-throughput data and their analysis in terms of biological function has been critical to truly leveraging the postgenomic revolution. This chapter will describe an approach that focuses on the use of gene networks to both organize and interpret genomic expression data. Such networks, derived from statistical analysis of large genomic datasets and the application of multiple bioinformatics data resources, potentially allow the identification of key control elements for networks associated with human disease, and thus may lead to derivation of novel therapeutic approaches. However, as discussed in this chapter, the leveraging of such networks cannot occur without a thorough understanding of the technical and statistical factors influencing the derivation of genomic expression data. Thus, while the catch phrase may be “it's the network … stupid,” the understanding of factors extending from RNA isolation to genomic profiling technique, multivariate statistics, and bioinformatics are all critical to defining fully useful gene networks for study of complex biology. PMID:23195313
Di, Li-Jun; Byun, Jung S; Wong, Madeline M; Wakano, Clay; Taylor, Tara; Bilke, Sven; Baek, Songjoon; Hunter, Kent; Yang, Howard; Lee, Maxwell; Zvosec, Cecilia; Khramtsova, Galina; Cheng, Fan; Perou, Charles M; Miller, C Ryan; Raab, Rachel; Olopade, Olufunmilayo I; Gardner, Kevin
2013-01-01
The C-terminal binding protein (CtBP) is a NADH-dependent transcriptional repressor that links carbohydrate metabolism to epigenetic regulation by recruiting diverse histone-modifying complexes to chromatin. Here global profiling of CtBP in breast cancer cells reveals that it drives epithelial-to-mesenchymal transition, stem cell pathways and genome instability. CtBP expression induces mesenchymal and stem cell-like features, whereas CtBP depletion or caloric restriction reverses gene repression and increases DNA repair. Multiple members of the CtBP-targeted gene network are selectively downregulated in aggressive breast cancer subtypes. Differential expression of CtBP-targeted genes predicts poor clinical outcome in breast cancer patients, and elevated levels of CtBP in patient tumours predict shorter median survival. Finally, both CtBP promoter targeting and gene repression can be reversed by small molecule inhibition. These findings define broad roles for CtBP in breast cancer biology and suggest novel chromatin-based strategies for pharmacologic and metabolic intervention in cancer.
Polstein, Lauren R.; Perez-Pinera, Pablo; Kocak, D. Dewran; Vockley, Christopher M.; Bledsoe, Peggy; Song, Lingyun; Safi, Alexias; Crawford, Gregory E.; Reddy, Timothy E.; Gersbach, Charles A.
2015-01-01
Genome engineering technologies based on the CRISPR/Cas9 and TALE systems are enabling new approaches in science and biotechnology. However, the specificity of these tools in complex genomes and the role of chromatin structure in determining DNA binding are not well understood. We analyzed the genome-wide effects of TALE- and CRISPR-based transcriptional activators in human cells using ChIP-seq to assess DNA-binding specificity and RNA-seq to measure the specificity of perturbing the transcriptome. Additionally, DNase-seq was used to assess genome-wide chromatin remodeling that occurs as a result of their action. Our results show that these transcription factors are highly specific in both DNA binding and gene regulation and are able to open targeted regions of closed chromatin independent of gene activation. Collectively, these results underscore the potential for these technologies to make precise changes to gene expression for gene and cell therapies or fundamental studies of gene function. PMID:26025803
Chromatin-associated RNA sequencing (ChAR-seq) maps genome-wide RNA-to-DNA contacts
Jukam, David; Teran, Nicole A; Risca, Viviana I; Smith, Owen K; Johnson, Whitney L; Skotheim, Jan M; Greenleaf, William James
2018-01-01
RNA is a critical component of chromatin in eukaryotes, both as a product of transcription, and as an essential constituent of ribonucleoprotein complexes that regulate both local and global chromatin states. Here, we present a proximity ligation and sequencing method called Chromatin-Associated RNA sequencing (ChAR-seq) that maps all RNA-to-DNA contacts across the genome. Using Drosophila cells, we show that ChAR-seq provides unbiased, de novo identification of targets of chromatin-bound RNAs including nascent transcripts, chromosome-specific dosage compensation ncRNAs, and genome-wide trans-associated RNAs involved in co-transcriptional RNA processing. PMID:29648534
Eldem, Vahap; Çelikkol Akçay, Ufuk; Ozhuner, Esma; Bakır, Yakup; Uranbey, Serkan; Unver, Turgay
2012-01-01
Peach (Prunus persica L.) is one of the most important fresh fruits worldwide. Since fruit growth largely depends on adequate water supply, drought stress is considered the most important abiotic stress limiting fleshy fruit production and quality in peach. Plant responses to drought stress are regulated at both the transcriptional and post-transcriptional levels. As post-transcriptional gene regulators, microRNAs (miRNAs) are small (19–25 nucleotides in length), endogenous, non-coding RNAs. Recent studies indicate that miRNAs are involved in plant responses to drought. Therefore, Illumina deep sequencing technology was used for genome-wide identification of miRNAs and their expression profile in response to drought in peach. In this study, four sRNA libraries were constructed from leaf control (LC), leaf stress (LS), root control (RC) and root stress (RS) samples. We identified a total of 531, 471, 535 and 487 known mature miRNAs in LC, LS, RC and RS libraries, respectively. The expression level of 262 (104 up-regulated, 158 down-regulated) of the 453 miRNAs changed significantly in leaf tissue, whereas 368 (221 up-regulated, 147 down-regulated) of the 465 miRNAs had expression levels that changed significantly in root tissue upon drought stress. Additionally, a total of 197, 221, 238 and 265 novel miRNA precursor candidates were identified from LC, LS, RC and RS libraries, respectively. Target transcripts (137 for LC, 133 for LS, 148 for RC and 153 for RS) generated significant Gene Ontology (GO) terms related to DNA binding and catalytic activities. Genome-wide miRNA expression analysis of peach by a deep sequencing approach helped to expand our understanding of miRNA function in response to drought stress in peach and Rosaceae. A set of differentially expressed miRNAs could pave the way for developing new strategies to alleviate the adverse effects of drought stress on plant growth and development. PMID:23227166
Rodríguez, Alejandra; Gonzalez, Luis; Ko, Arthur; Alvarez, Marcus; Miao, Zong; Bhagat, Yash; Nikkola, Elina; Cruz-Bautista, Ivette; Arellano-Campos, Olimpia; Muñoz-Hernández, Linda L; Ordóñez-Sánchez, Maria-Luisa; Rodriguez-Guillen, Rosario; Mohlke, Karen L; Laakso, Markku; Tusie-Luna, Teresa; Aguilar-Salinas, Carlos A; Pajukanta, Päivi
2016-07-01
We recently identified a locus on chromosome 18q11.2 for high serum triglycerides in Mexicans. We hypothesize that the lead genome-wide association study single-nucleotide polymorphism rs9949617, or its linkage disequilibrium proxies, regulates 1 of the 5 genes in the triglyceride-associated region. We performed a linkage disequilibrium analysis and found 9 additional variants in linkage disequilibrium (r²>0.7) with the lead single-nucleotide polymorphism. To select the variants for functional analyses, we annotated the 10 variants using DNase I hypersensitive sites, transcription factor and chromatin states and identified rs17259126 as the lead candidate variant for functional in vitro validation. Using a luciferase transcriptional reporter assay in liver HepG2 cells, we found that the G allele exhibits a significantly lower effect on transcription (P<0.05). The electrophoretic mobility shift and ChIP-qPCR (chromatin immunoprecipitation coupled with quantitative polymerase chain reaction) assays confirmed that the minor G allele of rs17259126 disrupts a hepatocyte nuclear factor 4 α-binding site. To find the regional candidate gene, we performed a local expression quantitative trait locus analysis and found that rs17259126 and its linkage disequilibrium proxies alter expression of the regional transmembrane protein 241 (TMEM241) gene in 795 adipose RNAs from the Metabolic Syndrome In Men (METSIM) cohort (P=6.11×10^-7 to 5.80×10^-4). These results were replicated in expression profiles of TMEM241 from the Multiple Tissue Human Expression Resource (MuTHER; n=856). The Mexican genome-wide association study signal for high serum triglycerides on chromosome 18q11.2 harbors a regulatory single-nucleotide polymorphism, rs17259126, which disrupts normal hepatocyte nuclear factor 4 α binding and decreases the expression of the regional TMEM241 gene. Our data suggest that decreased transcript levels of TMEM241 contribute to increased triglyceride levels in Mexicans.
© 2016 American Heart Association, Inc.
Integrated data analysis for genome-wide research.
Steinfath, Matthias; Repsilber, Dirk; Scholz, Matthias; Walther, Dirk; Selbig, Joachim
2007-01-01
Integrated data analysis is introduced as the intermediate level of a systems biology approach to analyse different 'omics' datasets, i.e., genome-wide measurements of transcripts, protein levels or protein-protein interactions, and metabolite levels aiming at generating a coherent understanding of biological function. In this chapter we focus on different methods of correlation analyses ranging from simple pairwise correlation to kernel canonical correlation which were recently applied in molecular biology. Several examples are presented to illustrate their application. The input data for this analysis frequently originate from different experimental platforms. Therefore, preprocessing steps such as data normalisation and missing value estimation are inherent to this approach. The corresponding procedures, potential pitfalls and biases, and available software solutions are reviewed. The multiplicity of observations obtained in omics-profiling experiments necessitates the application of multiple testing correction techniques.
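As a minimal sketch of the simplest method named in the abstract above, the following Python computes a pairwise Pearson correlation between two measurement profiles after z-score normalisation. The function names and the toy transcript and metabolite values are hypothetical illustrations, not data or code from the chapter, and the sketch assumes complete data (no missing values).

```python
import math


def zscore(values):
    """Normalise a profile to zero mean and unit (population) variance."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    if sd == 0:
        raise ValueError("profile is constant; z-score is undefined")
    return [(v - mean) / sd for v in values]


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length profiles."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have equal length of at least 2")
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical toy profiles measured across the same four samples.
transcript_level = [1.0, 2.0, 3.0, 4.0]
metabolite_level = [2.1, 3.9, 6.2, 7.8]

r = pearson(zscore(transcript_level), zscore(metabolite_level))
print(f"Pearson r = {r:.3f}")
```

Pearson correlation is unchanged by z-scoring, so the normalisation step here only illustrates the preprocessing the abstract mentions. When many such pairs are tested, the resulting P values would still need a multiple testing correction.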
Wijchers, Patrick J; Yandim, Cihangir; Panousopoulou, Eleni; Ahmad, Mushfika; Harker, Nicky; Saveliev, Alexander; Burgoyne, Paul S; Festenstein, Richard
2010-09-14
Differences between males and females are normally attributed to developmental and hormonal differences between the sexes. Here, we demonstrate differences between males and females in gene silencing using a heterochromatin-sensitive reporter gene. Using "sex-reversal" mouse models with varying sex chromosome complements, we found that this differential gene silencing was determined by X chromosome complement, rather than sex. Genome-wide transcription profiling showed that the expression of hundreds of autosomal genes was also sensitive to sex chromosome complement. These genome-wide analyses also uncovered a role for Sry in modulating autosomal gene expression in a sex chromosome complement-specific manner. The identification of this additional layer in the establishment of sexual dimorphisms has implications for understanding sexual dimorphisms in physiology and disease. Copyright © 2010 Elsevier Inc. All rights reserved.
TEGS-CN: A Statistical Method for Pathway Analysis of Genome-wide Copy Number Profile.
Huang, Yen-Tsung; Hsu, Thomas; Christiani, David C
2014-01-01
The effects of copy number alterations make up a significant part of the tumor genome profile, but pathway analyses of these alterations are still not well established. We proposed a novel method to analyze multiple copy numbers of genes within a pathway, termed Test for the Effect of a Gene Set with Copy Number data (TEGS-CN). TEGS-CN was adapted from TEGS, a method that we previously developed for gene expression data using a variance component score test. With additional development, we extend the method to analyze DNA copy number data, accounting for different sizes and thus various numbers of copy number probes in genes. The test statistic follows a mixture of χ² distributions that can be obtained using permutation with scaled χ² approximation. We conducted simulation studies to evaluate the size and the power of TEGS-CN and to compare its performance with TEGS. We analyzed genome-wide copy number data from 264 patients with non-small-cell lung cancer. With the Molecular Signatures Database (MSigDB) pathway database, the genome-wide copy number data can be classified into 1814 biological pathways or gene sets. We investigated associations of the copy number profile of the 1814 gene sets with pack-years of cigarette smoking. Our analysis revealed five pathways with significant P values after Bonferroni adjustment (<2.8 × 10^-5), including the PTEN pathway (7.8 × 10^-7), the gene set up-regulated under heat shock (3.6 × 10^-6), the gene sets involved in the immune profile for rejection of kidney transplantation (9.2 × 10^-6) and for transcriptional control of leukocytes (2.2 × 10^-5), and the ganglioside biosynthesis pathway (2.7 × 10^-5). In conclusion, we present a new method for pathway analyses of copy number data, and causal mechanisms of the five pathways require further study.
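The Bonferroni threshold quoted in the abstract above can be reproduced with a short Python check. This is a sketch under one stated assumption: a family-wise significance level of 0.05, which the abstract does not state explicitly but which is consistent with the reported cutoff for 1814 gene sets. The function name is illustrative.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Assumption: family-wise alpha of 0.05 (not stated in the abstract).
threshold = bonferroni_threshold(0.05, 1814)

# P values as reported in the abstract for the five pathways.
reported = {
    "PTEN pathway": 7.8e-7,
    "gene set up-regulated under heat shock": 3.6e-6,
    "immune profile for kidney transplant rejection": 9.2e-6,
    "transcriptional control of leukocytes": 2.2e-5,
    "ganglioside biosynthesis pathway": 2.7e-5,
}

significant = [name for name, p in reported.items() if p < threshold]
print(f"threshold = {threshold:.2e}; {len(significant)} of {len(reported)} pass")
```

Under this assumption the threshold is 0.05 / 1814, roughly 2.76 × 10^-5, which rounds to the reported 2.8 × 10^-5, and all five reported P values fall below it.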
A HaemAtlas: characterizing gene expression in differentiated human blood cells.
Watkins, Nicholas A; Gusnanto, Arief; de Bono, Bernard; De, Subhajyoti; Miranda-Saavedra, Diego; Hardie, Debbie L; Angenent, Will G J; Attwood, Antony P; Ellis, Peter D; Erber, Wendy; Foad, Nicola S; Garner, Stephen F; Isacke, Clare M; Jolley, Jennifer; Koch, Kerstin; Macaulay, Iain C; Morley, Sarah L; Rendon, Augusto; Rice, Kate M; Taylor, Niall; Thijssen-Timmer, Daphne C; Tijssen, Marloes R; van der Schoot, C Ellen; Wernisch, Lorenz; Winzer, Thilo; Dudbridge, Frank; Buckley, Christopher D; Langford, Cordelia F; Teichmann, Sarah; Göttgens, Berthold; Ouwehand, Willem H
2009-05-07
Hematopoiesis is a carefully controlled process that is regulated by complex networks of transcription factors that are, in part, controlled by signals resulting from ligand binding to cell-surface receptors. To further understand hematopoiesis, we have compared gene expression profiles of human erythroblasts, megakaryocytes, B cells, cytotoxic and helper T cells, natural killer cells, granulocytes, and monocytes using whole genome microarrays. A bioinformatics analysis of these data was performed focusing on transcription factors, immunoglobulin superfamily members, and lineage-specific transcripts. We observed that the numbers of lineage-specific genes varies by 2 orders of magnitude, ranging from 5 for cytotoxic T cells to 878 for granulocytes. In addition, we have identified novel coexpression patterns for key transcription factors involved in hematopoiesis (eg, GATA3-GFI1 and GATA2-KLF1). This study represents the most comprehensive analysis of gene expression in hematopoietic cells to date and has identified genes that play key roles in lineage commitment and cell function. The data, which are freely accessible, will be invaluable for future studies on hematopoiesis and the role of specific genes and will also aid the understanding of the recent genome-wide association studies. PMID:19228925
Tadra-Sfeir, Michelle Z; Faoro, Helisson; Camilios-Neto, Doumit; Brusamarello-Santos, Liziane; Balsanelli, Eduardo; Weiss, Vinicius; Baura, Valter A; Wassem, Roseli; Cruz, Leonardo M; De Oliveira Pedrosa, Fábio; Souza, Emanuel M; Monteiro, Rose A
2015-01-01
Herbaspirillum seropedicae is a diazotrophic bacterium which associates endophytically with economically important gramineae. Flavonoids such as naringenin have been shown to have an effect on the interaction between H. seropedicae and its host plants. We used a high-throughput sequencing-based method (RNA-Seq) to assess the influence of naringenin on the whole transcriptome profile of H. seropedicae. Three hundred and four genes were downregulated and seventy-seven were upregulated by naringenin. Data analysis revealed that genes related to bacterial flagella biosynthesis, chemotaxis and biosynthesis of peptidoglycan were repressed by naringenin. Moreover, genes involved in aromatic metabolism and multidrug transport efflux were activated.
Hou, Xiao-Jin; Li, Si-Bei; Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi
2014-01-01
MYB family genes are widely distributed in plants and comprise one of the largest transcription factor families, involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB) family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB). Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed region of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis for citrus. PMID:25375352
Marcon, Helena Sanches; Domingues, Douglas Silva; Silva, Juliana Costa; Borges, Rafael Junqueira; Matioli, Fábio Filippi; Fontes, Marcos Roberto de Mattos; Marino, Celso Luis
2015-08-14
In the Eucalyptus genus, studies on genome composition and transposable elements (TEs) are particularly scarce. Nearly half of the recently released Eucalyptus grandis genome is composed of retrotransposons, and these data provide an important opportunity to understand TE dynamics in the Eucalyptus genome and transcriptome. We characterized nine families of transcriptionally active LTR retrotransposons from the Copia and Gypsy superfamilies in the Eucalyptus grandis genome and we depicted genomic distribution and copy number in two Eucalyptus species. We also evaluated genomic polymorphism and transcriptional profile in three organs of five Eucalyptus species. We observed contrasting genomic and transcriptional behavior in the same family among different species. RLC_egMax_1 was the most prevalent family and RLC_egAngela_1 was the family with the lowest copy number. Most families of both superfamilies have insertions that occurred <3 million years ago, except one Copia family, RLC_egBianca_1. Theoretical protein models suggest different properties between Copia and Gypsy domains. IRAP and REMAP markers suggested genomic polymorphisms among Eucalyptus species. Using EST analysis and qRT-PCRs, we observed transcriptional activity in several tissues and in all evaluated species. In some families, osmotic stress increases transcript values. Our strategy was successful in isolating transcriptionally active retrotransposons in Eucalyptus, and each family has a particular genomic and transcriptional pattern. Overall, our results show that retrotransposon activity has differentially affected genome and transcriptome among Eucalyptus species.
Divergent transcription is associated with promoters of transcriptional regulators
2013-01-01
Background Divergent transcription is a wide-spread phenomenon in mammals. For instance, short bidirectional transcripts are a hallmark of active promoters, while longer transcripts can be detected antisense from active genes in conditions where the RNA degradation machinery is inhibited. Moreover, many described long non-coding RNAs (lncRNAs) are transcribed antisense from coding gene promoters. However, the general significance of divergent lncRNA/mRNA gene pair transcription is still poorly understood. Here, we used strand-specific RNA-seq with high sequencing depth to thoroughly identify antisense transcripts from coding gene promoters in primary mouse tissues. Results We found that a substantial fraction of coding-gene promoters sustain divergent transcription of long non-coding RNA (lncRNA)/mRNA gene pairs. Strikingly, upstream antisense transcription is significantly associated with genes related to transcriptional regulation and development. Their promoters share several characteristics with those of transcriptional developmental genes, including very large CpG islands, high degree of conservation and epigenetic regulation in ES cells. In-depth analysis revealed a unique GC skew profile at these promoter regions, while the associated coding genes were found to have large first exons, two genomic features that might enforce bidirectional transcription. Finally, genes associated with antisense transcription harbor specific H3K79me2 epigenetic marking and RNA polymerase II enrichment profiles linked to an intensified rate of early transcriptional elongation. Conclusions We concluded that promoters of a class of transcription regulators are characterized by a specialized transcriptional control mechanism, which is directly coupled to relaxed bidirectional transcription. PMID:24365181
Jung, SeungWoo; Bohan, Amy
2018-02-01
OBJECTIVE To characterize expression profiles of circulating microRNAs via genome-wide sequencing for dogs with congestive heart failure (CHF) secondary to myxomatous mitral valve degeneration (MMVD). ANIMALS 9 healthy client-owned dogs and 8 age-matched client-owned dogs with CHF secondary to MMVD. PROCEDURES Blood samples were collected before administering cardiac medications for the management of CHF. Isolated microRNAs from plasma were classified into microRNA libraries and subjected to next-generation sequencing (NGS) for genome-wide sequencing analysis and quantification of circulating microRNAs. Quantitative reverse transcription PCR (qRT-PCR) assays were used to validate expression profiles of differentially expressed circulating microRNAs identified from NGS analysis of dogs with CHF. RESULTS 326 microRNAs were identified with NGS analysis. Hierarchical analysis revealed distinct expression patterns of circulating microRNAs between healthy dogs and dogs with CHF. Results of qRT-PCR assays confirmed upregulation of 4 microRNAs (miR-133, miR-1, miR-let-7e, and miR-125) and downregulation of 4 selected microRNAs (miR-30c, miR-128, miR-142, and miR-423). Results of qRT-PCR assays were highly correlated with NGS data and supported the specificity of circulating microRNA expression profiles in dogs with CHF secondary to MMVD. CONCLUSIONS AND CLINICAL RELEVANCE These results suggested that circulating microRNA expression patterns were unique and could serve as molecular biomarkers of CHF in dogs with MMVD.
Guo, Yong; Qiu, Li-Juan
2013-01-01
The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.
Schadt, Eric E; Edwards, Stephen W; GuhaThakurta, Debraj; Holder, Dan; Ying, Lisa; Svetnik, Vladimir; Leonardson, Amy; Hart, Kyle W; Russell, Archie; Li, Guoya; Cavet, Guy; Castle, John; McDonagh, Paul; Kan, Zhengyan; Chen, Ronghua; Kasarskis, Andrew; Margarint, Mihai; Caceres, Ramon M; Johnson, Jason M; Armour, Christopher D; Garrett-Engele, Philip W; Tsinoremas, Nicholas F; Shoemaker, Daniel D
2004-01-01
Background Computational and microarray-based experimental approaches were used to generate a comprehensive transcript index for the human genome. Oligonucleotide probes designed from approximately 50,000 known and predicted transcript sequences from the human genome were used to survey transcription from a diverse set of 60 tissues and cell lines using ink-jet microarrays. Further, expression activity over at least six conditions was more generally assessed using genomic tiling arrays consisting of probes tiled through a repeat-masked version of the genomic sequence making up chromosomes 20 and 22. Results The combination of microarray data with extensive genome annotations resulted in a set of 28,456 experimentally supported transcripts. This set of high-confidence transcripts represents the first experimentally driven annotation of the human genome. In addition, the results from genomic tiling suggest that a large amount of transcription exists outside of annotated regions of the genome and serves as an example of how this activity could be measured on a genome-wide scale. Conclusions These data represent one of the most comprehensive assessments of transcriptional activity in the human genome and provide an atlas of human gene expression over a unique set of gene predictions. Before the annotation of the human genome is considered complete, however, the previously unannotated transcriptional activity throughout the genome must be fully characterized. PMID:15461792
Discovering Hematopoietic Mechanisms Through Genome-Wide Analysis of GATA Factor Chromatin Occupancy
Fujiwara, Tohru; O'Geen, Henriette; Keles, Sunduz; Blahnik, Kimberly; Linnemann, Amelia K.; Kang, Yoon-A; Choi, Kyunghee; Farnham, Peggy J.; Bresnick, Emery H.
2009-01-01
SUMMARY GATA factors interact with simple DNA motifs (WGATAR) to regulate critical processes, including hematopoiesis, but very few WGATAR motifs are occupied in genomes. Given the rudimentary knowledge of mechanisms underlying this restriction, and how GATA factors establish genetic networks, we used ChIP-seq to define GATA-1 and GATA-2 occupancy genome-wide in erythroid cells. Coupled with genetic complementation analysis and transcriptional profiling, these studies revealed a rich collection of targets containing a characteristic binding motif of greater complexity than WGATAR. GATA factors occupied loci encoding multiple components of the Scl/TAL1 complex, a master regulator of hematopoiesis and leukemogenic target. Mechanistic analyses provided evidence for cross-regulatory and autoregulatory interactions among components of this complex, including GATA-2 induction of the hematopoietic corepressor ETO-2 and an ETO-2 negative autoregulatory loop. These results establish fundamental principles underlying GATA factor mechanisms in chromatin and illustrate a complex network of considerable importance for the control of hematopoiesis. PMID:19941826
2014-01-01
Background Basic leucine zipper (bZIP) transcription factor gene family is one of the largest and most diverse families in plants. Current studies have shown that the bZIP proteins regulate numerous growth and developmental processes and biotic and abiotic stress responses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant bZIP family members remains very limited. Results We identified 55 bZIP transcription factor-encoding genes in the grapevine (Vitis vinifera) genome, and divided them into 10 groups according to the phylogenetic relationship with those in Arabidopsis. The chromosome distribution and the collinearity analyses suggest that expansion of the grapevine bZIP (VvbZIP) transcription factor family was greatly contributed by the segment/chromosomal duplications, which may be associated with the grapevine genome fusion events. Nine intron/exon structural patterns within the bZIP domain and the additional conserved motifs were identified among all VvbZIP proteins, and showed a high group-specificity. The predicted specificities on DNA-binding domains indicated that some highly conserved amino acid residues exist across each major group in the tree of land plant life. The expression patterns of VvbZIP genes across the grapevine gene expression atlas, based on microarray technology, suggest that VvbZIP genes are involved in grapevine organ development, especially seed development. Expression analysis based on qRT-PCR indicated that VvbZIP genes are extensively involved in drought- and heat-responses, with possibly different mechanisms. Conclusions The genome-wide identification, chromosome organization, gene structures, evolutionary and expression analyses of grapevine bZIP genes provide an overall insight of this gene family and their potential involvement in growth, development and stress responses. This will facilitate further research on the bZIP gene family regarding their evolutionary history and biological functions. PMID:24725365
Lin, Hailan; Xia, Xiaofeng; Yu, Liying; Vasseur, Liette; Gurr, Geoff M; Yao, Fengluan; Yang, Guang; You, Minsheng
2015-12-10
Serine proteases (SPs) are crucial proteolytic enzymes responsible for digestion and other processes including signal transduction and immune responses in insects. Serine protease homologs (SPHs) lack catalytic activity but are involved in innate immunity. This study presents a genome-wide investigation of SPs and SPHs in the diamondback moth, Plutella xylostella (L.), a globally-distributed destructive pest of cruciferous crops. A total of 120 putative SPs and 101 putative SPHs were identified in the P. xylostella genome by bioinformatics analysis. Based on the features of trypsin, 38 SPs were putatively designated as trypsin genes. The distribution, transcription orientation, exon-intron structure and sequence alignments suggested that the majority of trypsin genes evolved from tandem duplications. Among the 221 SP/SPH genes, ten SP and three SPH genes with one or more clip domains were predicted and designated as PxCLIPs. Phylogenetic analysis of CLIPs in P. xylostella, two other Lepidoptera species (Bombyx mori and Manduca sexta), and two more distantly related insects (Drosophila melanogaster and Apis mellifera) showed that seven of the 13 PxCLIPs were clustered with homologs of the Lepidoptera rather than other species. Expression profiling of the P. xylostella SP and SPH genes in different developmental stages and tissues showed diverse expression patterns, suggesting high functional diversity with roles in digestion and development. This is the first genome-wide investigation on the SP and SPH genes in P. xylostella. The characterized features and profiled expression patterns of the P. xylostella SPs and SPHs suggest their involvement in digestion, development and immunity of this species. Our findings provide a foundation for further research on the functions of this gene family in P. xylostella, and a better understanding of its capacity to rapidly adapt to a wide range of environmental variables including host plants and insecticides.
Transposon identification using profile HMMs
2010-01-01
Background Transposons are "jumping genes" that account for large quantities of repetitive content in genomes. They are known to affect transcriptional regulation in several different ways, and are implicated in many human diseases. Transposons are related to microRNAs and viruses, and many genes, pseudogenes, and gene promoters are derived from transposons or have origins in transposon-induced duplication. Modeling transposon-derived genomic content is difficult because they are poorly conserved. Profile hidden Markov models (profile HMMs), widely used for protein sequence family modeling, are rarely used for modeling DNA sequence families. The algorithm commonly used to estimate the parameters of profile HMMs, Baum-Welch, is prone to prematurely converge to local optima. The DNA domain is especially problematic for the Baum-Welch algorithm, since it has only four letters as opposed to the twenty residues of the amino acid alphabet. Results We demonstrate with a simulation study and with an application to modeling the MIR family of transposons that two recently introduced methods, Conditional Baum-Welch and Dynamic Model Surgery, achieve better estimates of the parameters of profile HMMs across a range of conditions. Conclusions We argue that these new algorithms expand the range of potential applications of profile HMMs to many important DNA sequence family modeling problems, including that of searching for and modeling the virus-like transposons that are found in all known genomes. PMID:20158867
Yan, Qian; Liu, Hou-Sheng; Yao, Dan; Li, Xin; Chen, Han; Dou, Yang; Wang, Yi; Pei, Yan; Xiao, Yue-Hua
2015-01-01
Basic/helix-loop-helix (bHLH) proteins comprise one of the largest transcription factor families and play important roles in diverse cellular and molecular processes. Comprehensive analyses of the composition and evolution of the bHLH family in cotton are essential to elucidate their functions and the molecular basis of cotton development. By searching for bHLH homologs in the sequenced diploid cotton genomes (Gossypium raimondii and G. arboreum), a set of cotton bHLH reference genes containing 289 paralogs was identified and named GobHLH001-289. Based on their phylogenetic relationships, these cotton bHLH proteins were clustered into 27 subfamilies. Compared to those in Arabidopsis and cacao, cotton bHLH proteins generally increased in number, but unevenly across subfamilies. To further uncover evolutionary changes of bHLH genes during tetraploidization of cotton, all genes of the S5a and S5b subfamilies in upland cotton and its diploid progenitors were cloned and compared, and their transcript profiles were determined in upland cotton. A total of 10 genes of the S5a and S5b subfamilies (doubled from A- and D-genome progenitors) were maintained in tetraploid cottons. The major sequence changes in upland cotton included a 15-bp in-frame deletion in GhbHLH130D and a long terminal repeat retrotransposon inserted in GhbHLH062A, which eliminated GhbHLH062A expression in various tissues. The S5a and S5b bHLH genes of the A and D genomes (except GobHLH062) showed similar transcription patterns in various tissues including roots, stems, leaves, petals, ovules, and fibers, while the A- and D-genome genes of GobHLH110 and GobHLH130 displayed clearly different transcript profiles during fiber development. In total, this study represents a genome-wide analysis of the cotton bHLH family and reveals significant changes in sequence and expression of these genes in tetraploid cottons, paving the way for further functional analyses of bHLH genes in the cotton genus. PMID:25992947
Temporal Expression Profiling Identifies Pathways Mediating Effect of Causal Variant on Phenotype
Gupta, Saumya; Radhakrishnan, Aparna; Raharja-Liu, Pandu; Lin, Gen; Steinmetz, Lars M.; Gagneur, Julien; Sinha, Himanshu
2015-01-01
Even with identification of multiple causal genetic variants for common human diseases, understanding the molecular processes mediating the causal variants’ effect on the disease remains a challenge. This understanding is crucial for the development of therapeutic strategies to prevent and treat disease. While static profiling of gene expression is primarily used to get insights into the biological bases of diseases, it makes differentiating the causative from the correlative effects difficult, as the dynamics of the underlying biological processes are not monitored. Using yeast as a model, we studied genome-wide gene expression dynamics in the presence of a causal variant as the sole genetic determinant, and performed allele-specific functional validation to delineate the causal effects of the genetic variant on the phenotype. Here, we characterized the precise genetic effects of a functional MKT1 allelic variant in sporulation efficiency variation. A mathematical model describing meiotic landmark events and conditional activation of MKT1 expression during sporulation specified an early meiotic role of this variant. By analyzing the early meiotic genome-wide transcriptional response, we demonstrate an MKT1-dependent role of novel modulators, namely, RTG1/3, regulators of mitochondrial retrograde signaling, and DAL82, regulator of nitrogen starvation, in additively effecting sporulation efficiency. In the presence of functional MKT1 allele, better respiration during early sporulation was observed, which was dependent on the mitochondrial retrograde regulator, RTG3. Furthermore, our approach showed that MKT1 contributes to sporulation independent of Puf3, an RNA-binding protein that steady-state transcription profiling studies have suggested to mediate MKT1-pleiotropic effects during mitotic growth. These results uncover interesting regulatory links between meiosis and mitochondrial retrograde signaling. 
In this study, we highlight the advantage of analyzing allele-specific transcriptional dynamics of mediating genes. Applications in higher eukaryotes can be valuable for inferring causal molecular pathways underlying complex dynamic processes, such as development, physiology and disease progression. PMID:26039065
Moqtaderi, Zarmik; Wang, Jie; Raha, Debasish; White, Robert J.; Snyder, Michael; Weng, Zhiping; Struhl, Kevin
2012-01-01
Genome-wide occupancy profiles of five components of the RNA Polymerase III (Pol III) machinery in human cells identified the expected tRNA and non-coding RNA targets and revealed many additional Pol III-associated loci, mostly near SINEs. Several genes are targets of an alternative TFIIIB containing Brf2 instead of Brf1 and have extremely low levels of TFIIIC. Strikingly, expressed Pol III genes, unlike non-expressed Pol III genes, are situated in regions with a pattern of histone modifications associated with functional Pol II promoters. TFIIIC alone associates with numerous ETC loci, via the B box or a novel motif. ETCs are often near CTCF binding sites, suggesting a potential role in chromosome organization. Our results suggest that human Pol III complexes associate preferentially with regions near functional Pol II promoters and that TFIIIC-mediated recruitment of TFIIIB is regulated in a locus-specific manner. PMID:20418883
Stojnic, Robert; Fu, Audrey Qiuyan; Adryan, Boris
2012-01-01
Inferring the combinatorial regulatory code of transcription factors (TFs) from genome-wide TF binding profiles is challenging. A major reason is that TF binding profiles significantly overlap and are therefore highly correlated. Clustered occurrence of multiple TFs at genomic sites may arise from chromatin accessibility and local cooperation between TFs, or binding sites may simply appear clustered if the profiles are generated from diverse cell populations. Overlaps in TF binding profiles may also result from measurements taken at closely related time intervals. It is thus of great interest to distinguish TFs that directly regulate gene expression from those that are indirectly associated with gene expression. Graphical models, in particular Bayesian networks, provide a powerful mathematical framework to infer different types of dependencies. However, existing methods do not perform well when the features (here: TF binding profiles) are highly correlated, when their association with the biological outcome is weak, and when the sample size is small. Here, we develop a novel computational method, the Neighbourhood Consistent PC (NCPC) algorithms, which deal with these scenarios much more effectively than existing methods do. We further present a novel graphical representation, the Direct Dependence Graph (DDGraph), to better display the complex interactions among variables. NCPC and DDGraph can also be applied to other problems involving highly correlated biological features. Both methods are implemented in the R package ddgraph, available as part of Bioconductor (http://bioconductor.org/packages/2.11/bioc/html/ddgraph.html). Applied to real data, our method identified TFs that specify different classes of cis-regulatory modules (CRMs) in Drosophila mesoderm differentiation. 
Our analysis also found depletion of the early transcription factor Twist binding at the CRMs regulating expression in visceral and somatic muscle cells at later stages, which suggests a CRM-specific repression mechanism that so far has not been characterised for this class of mesodermal CRMs. PMID:23144600
Esquerré, Thomas; Bouvier, Marie; Turlan, Catherine; Carpousis, Agamemnon J; Girbal, Laurence; Cocaign-Bousquet, Muriel
2016-04-26
Bacterial adaptation requires large-scale regulation of gene expression. We have performed a genome-wide analysis of the Csr system, which regulates many important cellular functions. The Csr system is involved in post-transcriptional regulation, but a role in transcriptional regulation has also been suggested. Two proteins, an RNA-binding protein CsrA and an atypical signaling protein CsrD, participate in the Csr system. Genome-wide transcript stabilities and levels were compared in wildtype E. coli (MG1655) and isogenic mutant strains deficient in CsrA or CsrD activity demonstrating for the first time that CsrA and CsrD are global negative and positive regulators of transcription, respectively. The role of CsrA in transcription regulation may be indirect due to the 4.6-fold increase in csrD mRNA concentration in the CsrA deficient strain. Transcriptional action of CsrA and CsrD on a few genes was validated by transcriptional fusions. In addition to an effect on transcription, CsrA stabilizes thousands of mRNAs. This is the first demonstration that CsrA is a global positive regulator of mRNA stability. For one hundred genes, we predict that direct control of mRNA stability by CsrA might contribute to metabolic adaptation by regulating expression of genes involved in carbon metabolism and transport independently of transcriptional regulation.
Transcriptome profile analysis of floral sex determination in cucumber.
Wu, Tao; Qin, Zhiwei; Zhou, Xiuyan; Feng, Zhuo; Du, Yalin
2010-07-15
Cucumber has been widely studied as a model for floral sex determination. In this investigation, we performed genome-wide transcriptional profiling of apical tissue of a gynoecious mutant (Csg-G) and the monoecious wild-type (Csg-M) of cucumber in an attempt to isolate genes involved in sex determination, using the Solexa technology. The profiling analysis revealed numerous changes in gene expression attributable to the mutation, which resulted in the down-regulation of 600 genes and the up-regulation of 143 genes. The Solexa data were confirmed by reverse transcription polymerase chain reaction (RT-PCR) and real-time quantitative RT-PCR (qRT-PCR). Gene ontology (GO) analysis revealed that the differentially expressed genes were mainly involved in biogenesis, transport and organization of cellular component, macromolecular and cellular biosynthesis, localization, establishment of localization, translation and other processes. Furthermore, the expression of some of these genes depended upon the tissue and the developmental stage of the flowers of gynoecious mutant. The results of this study suggest two important concepts, which govern sex determination in cucumber. First, the differential expression of genes involved in plant hormone signaling pathways, such as ACS, Asr1, CsIAA2, CS-AUX1 and TLP, indicate that phytohormones and their crosstalk might play a critical role in the sex determination. Second, the regulation of some transcription factors, including EREBP-9, may also be involved in this developmental process. Copyright (c) 2010 Elsevier GmbH. All rights reserved.
Howell, Kate Joanne; Kraiczy, Judith; Nayak, Komal M; Gasparetto, Marco; Ross, Alexander; Lee, Claire; Mak, Tim N; Koo, Bon-Kyoung; Kumar, Nitin; Lawley, Trevor; Sinha, Anupam; Rosenstiel, Philip; Heuschkel, Robert; Stegle, Oliver; Zilbauer, Matthias
2018-02-01
We analyzed DNA methylation patterns and transcriptomes of primary intestinal epithelial cells (IEC) of children newly diagnosed with inflammatory bowel diseases (IBD) to learn more about pathogenesis. We obtained mucosal biopsies (N = 236) collected from terminal ileum and ascending and sigmoid colons of children (median age 13 years) newly diagnosed with IBD (43 with Crohn's disease [CD], 23 with ulcerative colitis [UC]), and 30 children without IBD (controls). Patients were recruited and managed at a hospital in the United Kingdom from 2013 through 2016. We also obtained biopsies collected at later stages from a subset of patients. IECs were purified and analyzed for genome-wide DNA methylation patterns and gene expression profiles. Adjacent microbiota were isolated from biopsies and analyzed by 16S gene sequencing. We generated intestinal organoid cultures from a subset of samples and genome-wide DNA methylation analysis was performed. We found gut segment-specific differences in DNA methylation and transcription profiles of IECs from children with IBD vs controls; some were independent of mucosal inflammation. Changes in gut microbiota between IBD and control groups were not as large and were difficult to assess because of large amounts of intra-individual variation. Only IECs from patients with CD had changes in DNA methylation and transcription patterns in terminal ileum epithelium, compared with controls. Colon epithelium from patients with CD and from patients with ulcerative colitis had distinct changes in DNA methylation and transcription patterns, compared with controls. In IECs from patients with IBD, changes in DNA methylation, compared with controls, were stable over time and were partially retained in ex-vivo organoid cultures. Statistical analyses of epithelial cell profiles allowed us to distinguish children with CD or UC from controls; profiles correlated with disease outcome parameters, such as the requirement for treatment with biologic agents. 
We identified specific changes in DNA methylation and transcriptome patterns in IECs from pediatric patients with IBD compared with controls. These data indicate that IECs undergo changes during IBD development and could be involved in pathogenesis. Further analyses of primary IECs from patients with IBD could improve our understanding of the large variations in disease progression and outcomes. Copyright © 2018 AGA Institute. Published by Elsevier Inc. All rights reserved.
Basnet, Ram Kumar; Moreno-Pachon, Natalia; Lin, Ke; Bucher, Johan; Visser, Richard G F; Maliepaard, Chris; Bonnema, Guusje
2013-12-01
Brassica seeds are important as basic units of plant growth and sources of vegetable oil. Seed development is regulated by many dynamic metabolic processes controlled by complex networks of spatially and temporally expressed genes. We conducted a global microarray gene co-expression analysis by measuring transcript abundance of developing seeds from two diverse B. rapa morphotypes: a pak choi (leafy-type) and a yellow sarson (oil-type), and two of their doubled haploid (DH) progenies, (1) to study the timing of metabolic processes in developing seeds, (2) to explore the major transcriptional differences in developing seeds of the two morphotypes, and (3) to identify the optimum stage for a genetical genomics study in B. rapa seed. Seed developmental stages were similar in developing seeds of pak choi and yellow sarson of B. rapa; however, the colour of embryo and seed coat differed between these two morphotypes. In this study, most transcriptional changes occurred between 25 and 35 DAP, which shows that the timing of seed developmental processes in B. rapa is at later developmental stages than in the related species B. napus. Using a Weighted Gene Co-expression Network Analysis (WGCNA), we identified 47 "gene modules", of which 27 showed a significant association with temporal and/or genotypic variation. An additional hierarchical cluster analysis identified broad spectra of gene expression patterns during seed development. The predominant variation in gene expression was according to developmental stages rather than morphotype differences. Since lipids are the major storage compounds of Brassica seeds, we investigated in more detail the regulation of lipid metabolism. Four co-regulated gene clusters were identified with 17 putative cis-regulatory elements predicted in their 1000 bp upstream region, either specific or common to different lipid metabolic pathways. This is the first study of genome-wide profiling of transcript abundance during seed development in B. rapa. The identification of key physiological events, major expression patterns, and putative cis-regulatory elements provides useful information to construct gene regulatory networks in B. rapa developing seeds and provides a starting point for a genetical genomics study of seed quality traits.
Polstein, Lauren R; Perez-Pinera, Pablo; Kocak, D Dewran; Vockley, Christopher M; Bledsoe, Peggy; Song, Lingyun; Safi, Alexias; Crawford, Gregory E; Reddy, Timothy E; Gersbach, Charles A
2015-08-01
Genome engineering technologies based on the CRISPR/Cas9 and TALE systems are enabling new approaches in science and biotechnology. However, the specificity of these tools in complex genomes and the role of chromatin structure in determining DNA binding are not well understood. We analyzed the genome-wide effects of TALE- and CRISPR-based transcriptional activators in human cells using ChIP-seq to assess DNA-binding specificity and RNA-seq to measure the specificity of perturbing the transcriptome. Additionally, DNase-seq was used to assess genome-wide chromatin remodeling that occurs as a result of their action. Our results show that these transcription factors are highly specific in both DNA binding and gene regulation and are able to open targeted regions of closed chromatin independent of gene activation. Collectively, these results underscore the potential for these technologies to make precise changes to gene expression for gene and cell therapies or fundamental studies of gene function. © 2015 Polstein et al.; Published by Cold Spring Harbor Laboratory Press.
Hilson, Pierre; Allemeersch, Joke; Altmann, Thomas; Aubourg, Sébastien; Avon, Alexandra; Beynon, Jim; Bhalerao, Rishikesh P.; Bitton, Frédérique; Caboche, Michel; Cannoot, Bernard; Chardakov, Vasil; Cognet-Holliger, Cécile; Colot, Vincent; Crowe, Mark; Darimont, Caroline; Durinck, Steffen; Eickhoff, Holger; de Longevialle, Andéol Falcon; Farmer, Edward E.; Grant, Murray; Kuiper, Martin T.R.; Lehrach, Hans; Léon, Céline; Leyva, Antonio; Lundeberg, Joakim; Lurin, Claire; Moreau, Yves; Nietfeld, Wilfried; Paz-Ares, Javier; Reymond, Philippe; Rouzé, Pierre; Sandberg, Goran; Segura, Maria Dolores; Serizet, Carine; Tabrett, Alexandra; Taconnat, Ludivine; Thareau, Vincent; Van Hummelen, Paul; Vercruysse, Steven; Vuylsteke, Marnik; Weingartner, Magdalena; Weisbeek, Peter J.; Wirta, Valtteri; Wittink, Floyd R.A.; Zabeau, Marc; Small, Ian
2004-01-01
Microarray transcript profiling and RNA interference are two new technologies crucial for large-scale gene function studies in multicellular eukaryotes. Both rely on sequence-specific hybridization between complementary nucleic acid strands, inciting us to create a collection of gene-specific sequence tags (GSTs) representing at least 21,500 Arabidopsis genes and which are compatible with both approaches. The GSTs were carefully selected to ensure that each of them shared no significant similarity with any other region in the Arabidopsis genome. They were synthesized by PCR amplification from genomic DNA. Spotted microarrays fabricated from the GSTs show good dynamic range, specificity, and sensitivity in transcript profiling experiments. The GSTs have also been transferred to bacterial plasmid vectors via recombinational cloning protocols. These cloned GSTs constitute the ideal starting point for a variety of functional approaches, including reverse genetics. We have subcloned GSTs on a large scale into vectors designed for gene silencing in plant cells. We show that in planta expression of GST hairpin RNA results in the expected phenotypes in silenced Arabidopsis lines. These versatile GST resources provide novel and powerful tools for functional genomics. PMID:15489341
Trichostatin A effects on gene expression in the protozoan parasite Entamoeba histolytica
Ehrenkaufer, Gretchen M; Eichinger, Daniel J; Singh, Upinder
2007-01-01
Background Histone modification regulates chromatin structure and influences gene expression associated with diverse biological functions including cellular differentiation, cancer, maintenance of genome architecture, and pathogen virulence. In Entamoeba, a deep-branching eukaryote, short chain fatty acids (SCFA) affect histone acetylation and parasite development. Additionally, a number of active histone modifying enzymes have been identified in the parasite genome. However, the overall extent of gene regulation tied to histone acetylation is not known. Results In order to identify the genome-wide effects of histone acetylation in regulating E. histolytica gene expression, we used whole-genome expression profiling of parasites treated with SCFA and Trichostatin A (TSA). Despite significant changes in histone acetylation patterns, exposure of parasites to SCFA resulted in minimal transcriptional changes (11 out of 9,435 genes transcriptionally regulated). In contrast, exposure to TSA, a more specific inhibitor of histone deacetylases, significantly affected transcription of 163 genes (122 genes upregulated and 41 genes downregulated). Genes modulated by TSA were not regulated by treatment with 5-Azacytidine, an inhibitor of DNA-methyltransferase, indicating that in E. histolytica the crosstalk between DNA methylation and histone modification is not substantial. However, the set of genes regulated by TSA overlapped substantially with genes regulated during parasite development: 73/122 genes upregulated by TSA exposure were upregulated in E. histolytica cysts (p-value = 6 × 10^-53) and 15/41 genes downregulated by TSA exposure were downregulated in E. histolytica cysts (p-value = 3 × 10^-7). Conclusion This work represents the first genome-wide analysis of histone acetylation and its effects on gene expression in E. histolytica. 
The data indicate that SCFAs, despite their ability to influence histone acetylation, have minimal effects on gene transcription in cultured parasites. In contrast, the effect of TSA on E. histolytica gene expression is more substantial and includes genes involved in the encystation pathway. These observations will allow further dissection of the effects of histone acetylation and the genetic pathways regulating stage conversion in this pathogenic parasite. PMID:17612405
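The overlap significance reported above (for example 73 of 122 TSA-upregulated genes also upregulated in cysts, out of 9,435 genes) is the kind of figure a one-sided hypergeometric test produces. The abstract does not say which test the authors used, so the following is only a minimal sketch under that assumption. The function name `overlap_pvalue` and the cyst-upregulated set size `HYPOTHETICAL_CYST_UP` are illustrative and do not come from the source.

```python
from math import comb

def overlap_pvalue(total, set_a, set_b, overlap):
    """One-sided hypergeometric tail: P(X >= overlap).

    total   -- number of genes on the array
    set_a   -- size of the first gene set (e.g. TSA-upregulated genes)
    set_b   -- size of the second gene set (e.g. cyst-upregulated genes)
    overlap -- number of genes observed in both sets
    """
    upper = min(set_a, set_b)
    # Sum the probability of every overlap at least as large as observed.
    tail = sum(
        comb(set_a, k) * comb(total - set_a, set_b - k)
        for k in range(overlap, upper + 1)
    )
    return tail / comb(total, set_b)

# Assumption: the abstract does not report how many genes are upregulated
# in cysts, so this value is a placeholder, not data from the study.
HYPOTHETICAL_CYST_UP = 600
p = overlap_pvalue(9435, 122, HYPOTHETICAL_CYST_UP, 73)
```

Because the second set size is a placeholder, `p` will not reproduce the published p-value; the sketch only shows how the four counts enter the calculation.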
Hsu, Yi-Hsiang; Zillikens, M Carola; Wilson, Scott G; Farber, Charles R; Demissie, Serkalem; Soranzo, Nicole; Bianchi, Estelle N; Grundberg, Elin; Liang, Liming; Richards, J Brent; Estrada, Karol; Zhou, Yanhua; van Nas, Atila; Moffatt, Miriam F; Zhai, Guangju; Hofman, Albert; van Meurs, Joyce B; Pols, Huibert A P; Price, Roger I; Nilsson, Olle; Pastinen, Tomi; Cupples, L Adrienne; Lusis, Aldons J; Schadt, Eric E; Ferrari, Serge; Uitterlinden, André G; Rivadeneira, Fernando; Spector, Timothy D; Karasik, David; Kiel, Douglas P
2010-06-10
Osteoporosis is a complex disorder and commonly leads to fractures in elderly persons. Genome-wide association studies (GWAS) have become an unbiased approach to identify variations in the genome that potentially affect health. However, the genetic variants identified so far only explain a small proportion of the heritability for complex traits. Due to the modest genetic effect size and inadequate power, true association signals may not be revealed based on a stringent genome-wide significance threshold. Here, we take advantage of SNP and transcript arrays and integrate GWAS and expression signature profiling relevant to the skeletal system in cellular and animal models to prioritize the discovery of novel candidate genes for osteoporosis-related traits, including bone mineral density (BMD) at the lumbar spine (LS) and femoral neck (FN), as well as geometric indices of the hip (femoral neck-shaft angle, NSA; femoral neck length, NL; and narrow-neck width, NW). A two-stage meta-analysis of GWAS from 7,633 Caucasian women and 3,657 men revealed three novel loci associated with osteoporosis-related traits, including chromosome 1p13.2 (RAP1A, p = 3.6 × 10^-8), 2q11.2 (TBC1D8), and 18q11.2 (OSBPL1A), and confirmed a previously reported region near the TNFRSF11B/OPG gene. We also prioritized 16 suggestive genome-wide significant candidate genes based on their potential involvement in skeletal metabolism. Among them, 3 candidate genes were associated with BMD in women. Notably, 2 out of these 3 genes (GPR177, p = 2.6 × 10^-13; SOX6, p = 6.4 × 10^-10) associated with BMD in women have been successfully replicated in a large-scale meta-analysis of BMD, but none of the non-prioritized candidates (associated with BMD) did. Our results support the concept of our prioritization strategy. 
In the absence of direct biological support for identified genes, we highlighted the efficiency of subsequent functional characterization using publicly available expression profiling relevant to the skeletal system in cellular or whole animal models to prioritize candidate genes for further functional validation.
Sass, Andrea; Kiekens, Sanne; Coenye, Tom
2017-11-15
Small RNAs play a regulatory role in many central metabolic processes of bacteria, as well as in developmental processes such as biofilm formation. Small RNAs of Burkholderia cenocepacia, an opportunistic pathogenic beta-proteobacterium, are to date not well characterised. To address that, we performed genome-wide transcriptome structure analysis of biofilm grown B. cenocepacia J2315. 41 unannotated short transcripts were identified in intergenic regions of the B. cenocepacia genome. 15 of these short transcripts, highly abundant in biofilms, widely conserved in Burkholderia sp. and without known function, were selected for in-depth analysis. Expression profiling showed that most of these sRNAs are more abundant in biofilms than in planktonic cultures. Many are also highly abundant in cells grown in minimal media, suggesting they are involved in adaptation to nutrient limitation and growth arrest. Their computationally predicted targets include a high proportion of genes involved in carbon metabolism. Expression and target genes of one sRNA suggest a potential role in regulating iron homoeostasis. The strategy used for this study to detect sRNAs expressed in B. cenocepacia biofilms has successfully identified sRNAs with a regulatory function.
Schmidt, Martin; Van Bel, Michiel; Woloszynska, Magdalena; Slabbinck, Bram; Martens, Cindy; De Block, Marc; Coppens, Frederik; Van Lijsebettens, Mieke
2017-07-06
Cytosine methylation in plant genomes is important for the regulation of gene transcription and transposon activity. Genome-wide methylomes are studied upon mutation of the DNA methyltransferases, adaptation to environmental stresses or during development. However, from basic biology to breeding programs, there is a need to monitor multiple samples to determine transgenerational methylation inheritance or differential cytosine methylation. Methylome data obtained by sodium hydrogen sulfite (bisulfite)-conversion and next-generation sequencing (NGS) provide genome-wide information on cytosine methylation. However, a profiling method that detects cytosine methylation state dispersed over the genome would allow high-throughput analysis of multiple plant samples with distinct epigenetic signatures. We use specific restriction endonucleases to enrich for cytosine coverage in a bisulfite and NGS-based profiling method, which was compared to whole-genome bisulfite sequencing of the same plant material. We established an effective methylome profiling method in plants, termed plant-reduced representation bisulfite sequencing (plant-RRBS), using optimized double restriction endonuclease digestion, fragment end repair, adapter ligation, followed by bisulfite conversion, PCR amplification and NGS. We report a performant laboratory protocol and a straightforward bioinformatics data analysis pipeline for plant-RRBS, applicable for any reference-sequenced plant species. As a proof of concept, methylome profiling was performed using an Oryza sativa ssp. indica pure breeding line and a derived epigenetically altered line (epiline). Plant-RRBS detects methylation levels at tens of millions of cytosine positions deduced from bisulfite conversion in multiple samples. To evaluate the method, the coverage of cytosine positions, the intra-line similarity and the differential cytosine methylation levels between the pure breeding line and the epiline were determined. 
Plant-RRBS reproducibly covers commonly up to one fourth of the cytosine positions in the rice genome when using MspI-DpnII within a group of five biological replicates of a line. The method predominantly detects cytosine methylation in putative promoter regions and not-annotated regions in rice. Plant-RRBS offers high-throughput and broad, genome-dispersed methylation detection by effective read number generation obtained from reproducibly covered genome fractions using optimized endonuclease combinations, facilitating comparative analyses of multi-sample studies for cytosine methylation and transgenerational stability in experimental material and plant breeding populations.
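The double restriction digestion step of plant-RRBS can be illustrated with an in silico digest. The sketch below assumes the standard cut sites of the two enzymes named in the abstract (MspI cuts C^CGG, DpnII cuts ^GATC) and an uppercase input sequence. The function names and the optional size-selection bounds are hypothetical and are not taken from the published protocol or its pipeline.

```python
import re

# Recognition site and cut offset within the site for each enzyme.
# MspI cuts C^CGG (offset 1); DpnII cuts ^GATC (offset 0).
# Both sites are palindromic, so scanning one strand is sufficient.
ENZYMES = {"MspI": ("CCGG", 1), "DpnII": ("GATC", 0)}

def cut_positions(seq, enzymes=ENZYMES):
    """Return sorted 0-based cut coordinates for all enzymes in `enzymes`."""
    cuts = set()
    for site, offset in enzymes.values():
        # A lookahead finds overlapping occurrences of the site as well.
        for match in re.finditer(f"(?={site})", seq):
            cuts.add(match.start() + offset)
    return sorted(cuts)

def digest(seq, min_len=0, max_len=None):
    """Return (start, end) fragments, optionally filtered by length.

    min_len and max_len stand in for a size-selection window; the
    values to use are an assumption left to the caller.
    """
    inner = [c for c in cut_positions(seq) if 0 < c < len(seq)]
    bounds = [0] + inner + [len(seq)]
    fragments = list(zip(bounds, bounds[1:]))
    return [
        (start, end)
        for start, end in fragments
        if end - start >= min_len and (max_len is None or end - start <= max_len)
    ]

fragments = digest("AAACCGGTTTGATCAAA")
```

Running the same functions over a reference genome would indicate which cytosine positions fall inside size-selected fragments, which is the quantity the abstract reports as coverage.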
Transcriptome profiling of Zymomonas mobilis under furfural stress.
He, Ming-xiong; Wu, Bo; Shui, Zong-xia; Hu, Qi-chun; Wang, Wen-guo; Tan, Fu-rong; Tang, Xiao-yu; Zhu, Qi-li; Pan, Ke; Li, Qing; Su, Xiao-hong
2012-07-01
Furfural from lignocellulosic hydrolysates is the prevalent inhibitor to microorganisms during cellulosic ethanol production, but the molecular mechanisms of tolerance to this inhibitor in Zymomonas mobilis are still unclear. In this study, genome-wide transcriptional responses to furfural were investigated in Z. mobilis using microarray analysis. We found that 433 genes were differentially expressed in response to furfural. Furfural up- or down-regulated genes related to cell wall/membrane biogenesis, metabolism, and transcription. However, furfural has a subtle negative effect on Entner-Doudoroff pathway mRNAs. Our results revealed that furfural had effects on multiple aspects of cellular metabolism at the transcriptional level and that membrane might play important roles in response to furfural. This research has provided insights into the molecular response to furfural in Z. mobilis, and it will be helpful to construct more furfural-resistant strains for cellulosic ethanol production.
Kuang, Zheng; Ji, Zhicheng
2018-01-01
Biological processes are usually associated with genome-wide remodeling of transcription driven by transcription factors (TFs). Identifying key TFs and their spatiotemporal binding patterns is indispensable to understanding how dynamic processes are programmed. However, most methods are designed to predict TF binding sites only. We present a computational method, dynamic motif occupancy analysis (DynaMO), to infer important TFs and their spatiotemporal binding activities in dynamic biological processes using chromatin profiling data from multiple biological conditions such as time-course histone modification ChIP-seq data. In the first step, DynaMO predicts TF binding sites with a random forests approach. Next and uniquely, DynaMO infers dynamic TF binding activities at predicted binding sites using their local chromatin profiles from multiple biological conditions. Another distinguishing feature of DynaMO is that it identifies key TFs in a dynamic process using a clustering and enrichment analysis of dynamic TF binding patterns. Application of DynaMO to the yeast ultradian cycle, mouse circadian clock and human neural differentiation demonstrates its accuracy and versatility. We anticipate DynaMO will be generally useful for elucidating transcriptional programs in dynamic processes. PMID:29325176
Microplate-based platform for combined chromatin and DNA methylation immunoprecipitation assays
2011-01-01
Background The processes that compose expression of a given gene are far more complex than previously thought, presenting unprecedented conceptual and mechanistic challenges that require development of new tools. Chromatin structure, which is regulated by DNA methylation and histone modification, is at the center of gene regulation. Immunoprecipitations of chromatin (ChIP) and methylated DNA (MeDIP) represent a major achievement in this area that allow researchers to probe chromatin modifications as well as specific protein-DNA interactions in vivo and to estimate the density of proteins at specific sites genome-wide. Although a critical component of chromatin structure, DNA methylation has often been studied independently of other chromatin events and transcription. Results To allow simultaneous measurements of DNA methylation with other genomic processes, we developed and validated a simple and easy-to-use high throughput microplate-based platform for analysis of DNA methylation. Compared to the traditional beads-based MeDIP, the microplate MeDIP was more sensitive and had lower non-specific binding. We integrated the MeDIP method with a microplate ChIP assay, yielding the Matrix ChIP-MeDIP platform, which allows measurements of both DNA methylation and histone marks at the same time. We illustrated several applications of this platform to relate DNA methylation with chromatin and transcription events at selected genes in cultured cells, human cancer and in a model of diabetic kidney disease. Conclusion The high throughput capacity of Matrix ChIP-MeDIP to profile tens and potentially hundreds of different genomic events at the same time as DNA methylation represents a powerful platform to explore complex genomic mechanisms at selected genes in cultured cells and in whole tissues. 
In this regard, Matrix ChIP-MeDIP should be useful to complement genome-wide studies where the rich chromatin and transcription database resources provide fruitful foundation to pursue mechanistic, functional and diagnostic information at genes of interest in health and disease. PMID:22098709
[Transcription activator-like effectors (TALEs)-based genome engineering].
Zhao, Mei-Wei; Duan, Cheng-Li; Liu, Jiang
2013-10-01
Systematic reverse-engineering of functional genome architecture requires precise modifications of gene sequences and transcription levels. The development and application of transcription activator-like effectors (TALEs) has created a wealth of genome engineering possibilities. TALEs are a class of naturally occurring DNA-binding proteins found in the plant pathogen Xanthomonas species. The DNA-binding domain of each TALE typically consists of tandem 34-amino acid repeat modules rearranged according to a simple cipher to target new DNA sequences. Customized TALEs can be used for a wide variety of genome engineering applications, including transcriptional modulation and genome editing. Such "genome engineering" has now been established in human cells and a number of model organisms, thus opening the door to better understanding gene function in model organisms, improving traits in crop plants and treating human genetic disorders.
Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui; ...
2014-10-02
Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptional regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.
Baumbach, Jan; Brinkrolf, Karina; Czaja, Lisa F; Rahmann, Sven; Tauch, Andreas
2006-02-14
The application of DNA microarray technology in post-genomic analysis of bacterial genome sequences has allowed the generation of huge amounts of data related to regulatory networks. This data along with literature-derived knowledge on regulation of gene expression has opened the way for genome-wide reconstruction of transcriptional regulatory networks. These large-scale reconstructions can be converted into in silico models of bacterial cells that allow a systematic analysis of network behavior in response to changing environmental conditions. CoryneRegNet was designed to facilitate the genome-wide reconstruction of transcriptional regulatory networks of corynebacteria relevant in biotechnology and human medicine. During the import and integration process of data derived from experimental studies or literature knowledge CoryneRegNet generates links to genome annotations, to identified transcription factors and to the corresponding cis-regulatory elements. CoryneRegNet is based on a multi-layered, hierarchical and modular concept of transcriptional regulation and was implemented by using the relational database management system MySQL and an ontology-based data structure. Reconstructed regulatory networks can be visualized by using the yFiles JAVA graph library. As an application example of CoryneRegNet, we have reconstructed the global transcriptional regulation of a cellular module involved in SOS and stress response of corynebacteria. CoryneRegNet is an ontology-based data warehouse that allows a pertinent data management of regulatory interactions along with the genome-scale reconstruction of transcriptional regulatory networks. These models can further be combined with metabolic networks to build integrated models of cellular function including both metabolism and its transcriptional regulation.
Wilhelmsson, Per K I; Mühlich, Cornelia; Ullrich, Kristian K
2017-01-01
Plant genomes encode many lineage-specific, unique transcription factors. Expansion of such gene families has been previously found to coincide with the evolution of morphological complexity, although comparative analyses have been hampered by severe sampling bias. Here, we make use of the recently increased availability of plant genomes. We have updated and expanded previous rule sets for domain-based classification of transcription associated proteins (TAPs), comprising transcription factors and transcriptional regulators. The genome-wide annotation of these protein families has been analyzed and made available via the novel TAPscan web interface. We find that many TAP families previously thought to be specific for land plants actually evolved in streptophyte (charophyte) algae; 26 out of 36 TAP family gains are inferred to have occurred in the common ancestor of the Streptophyta (uniting the land plants—Embryophyta—with their closest algal relatives). In contrast, expansions of TAP families were found to occur throughout streptophyte evolution. 17 out of 76 expansion events were found to be common to all land plants and thus probably evolved concomitant with the water-to-land-transition. PMID:29216360
Terrados, Gloria; Finkernagel, Florian; Stielow, Bastian; Sadic, Dennis; Neubert, Juliane; Herdt, Olga; Krause, Michael; Scharfe, Maren; Jarek, Michael; Suske, Guntram
2012-01-01
The transcription factor Sp2 is essential for early mouse development and for proliferation of mouse embryonic fibroblasts in culture. Yet its mechanisms of action and its target genes are largely unknown. In this study, we have combined RNA interference, in vitro DNA binding, chromatin immunoprecipitation sequencing and global gene-expression profiling to investigate the role of Sp2 for cellular functions, to define target sites and to identify genes regulated by Sp2. We show that Sp2 is important for cellular proliferation, that it binds to GC-boxes, and that it occupies proximal promoters of genes essential for vital cellular processes including gene expression, replication, metabolism and signalling. Moreover, we identified key target genes and cellular pathways that are directly regulated by Sp2. Most significantly, Sp2 binds and activates numerous sequence-specific transcription factor and co-activator genes, and represses the whole battery of cholesterol synthesis genes. Our results establish Sp2 as a sequence-specific regulator of vitally important genes. PMID:22684502
Zhang, Yu; Zhu, Chenyang; Sun, Bangyao; Lv, Jiawei; Liu, Zhonghua; Liu, Shengwang; Li, Hai
2017-01-01
p53 dysfunction is frequently observed in lung cancer. Although restoring the tumour suppressor function of p53 has recently been approved as a putative strategy for combating cancers, the lack of understanding of the molecular mechanism underlying p53-mediated lung cancer suppression has limited the application of p53-based therapies in lung cancer. Using RNA sequencing, we determined the transcriptional profile of human non-small cell lung carcinoma A549 cells after treatment with two p53-activating chemical compounds, nutlin and RITA, which could induce A549 cell cycle arrest and apoptosis, respectively. Bioinformatics analysis of genome-wide gene expression data showed that distinct transcription profiles were induced by nutlin and RITA and 66 pathways were differentially regulated by these two compounds. However, only two of these pathways, 'Adherens junction' and 'Axon guidance', were found to be synthetic lethal with p53 re-activation, as determined via integrated analysis of genome-wide gene expression profile and short hairpin RNA (shRNA) screening. Further functional protein association analysis of significantly regulated genes associated with these two synthetic lethal pathways indicated that GSK3 played a key role in p53-mediated A549 cell apoptosis, and a subsequent gene function study revealed that GSK3 inhibition promoted p53-mediated A549 cell apoptosis in a p53 post-translational activity-dependent manner. Our findings provide new insights regarding the mechanism by which p53 mediates A549 apoptosis and may cast light on the development of more efficient p53-based strategies for treating lung cancer. © 201 The Author(s). Published by S. Karger AG, Basel.
A system-level model for the microbial regulatory genome.
Brooks, Aaron N; Reiss, David J; Allard, Antoine; Wu, Wei-Ju; Salvanha, Diego M; Plaisier, Christopher L; Chandrasekaran, Sriram; Pan, Min; Kaur, Amardeep; Baliga, Nitin S
2014-07-15
Microbes can tailor transcriptional responses to diverse environmental challenges despite having streamlined genomes and a limited number of regulators. Here, we present data-driven models that capture the dynamic interplay of the environment and genome-encoded regulatory programs of two types of prokaryotes: Escherichia coli (a bacterium) and Halobacterium salinarum (an archaeon). The models reveal how the genome-wide distributions of cis-acting gene regulatory elements and the conditional influences of transcription factors at each of those elements encode programs for eliciting a wide array of environment-specific responses. We demonstrate how these programs partition transcriptional regulation of genes within regulons and operons to re-organize gene-gene functional associations in each environment. The models capture fitness-relevant co-regulation by different transcriptional control mechanisms acting across the entire genome, to define a generalized, system-level organizing principle for prokaryotic gene regulatory networks that goes well beyond existing paradigms of gene regulation. An online resource (http://egrin2.systemsbiology.net) has been developed to facilitate multiscale exploration of conditional gene regulation in the two prokaryotes. © 2014 The Authors. Published under the terms of the CC BY 4.0 license.
QDMR: a quantitative method for identification of differentially methylated regions by entropy
Zhang, Yan; Liu, Hongbo; Lv, Jie; Xiao, Xue; Zhu, Jiang; Liu, Xiaojuan; Su, Jianzhong; Li, Xia; Wu, Qiong; Wang, Fang; Cui, Ying
2011-01-01
DNA methylation plays critical roles in transcriptional regulation and chromatin remodeling. Differentially methylated regions (DMRs) have important implications for development, aging and diseases. Therefore, genome-wide mapping of DMRs across various temporal and spatial methylomes is important in revealing the impact of epigenetic modifications on heritable phenotypic variation. We present a quantitative approach, quantitative differentially methylated regions (QDMRs), to quantify methylation difference and identify DMRs from genome-wide methylation profiles by adapting Shannon entropy. QDMR was applied to synthetic methylation patterns and methylation profiles detected by methylated DNA immunoprecipitation microarray (MeDIP-chip) in human tissues/cells. This approach can give a reasonable quantitative measure of methylation difference across multiple samples. Then DMR threshold was determined from methylation probability model. Using this threshold, QDMR identified 10 651 tissue DMRs which are related to the genes enriched for cell differentiation, including 4740 DMRs not identified by the method developed by Rakyan et al. QDMR can also measure the sample specificity of each DMR. Finally, the application to methylation profiles detected by reduced representation bisulphite sequencing (RRBS) in mouse showed the platform-free and species-free nature of QDMR. This approach provides an effective tool for the high-throughput identification of potential functional regions involved in epigenetic regulation. PMID:21306990
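The entropy idea can be made concrete with a deliberately simplified sketch. It is not the published QDMR formula: it assumes that a region's methylation levels are simply normalized into proportions across samples before Shannon entropy is taken, and the function names and the threshold value are hypothetical. Low entropy means that methylation is concentrated in a few samples (a candidate DMR), while high entropy means that it is spread evenly.

```python
import math


def methylation_entropy(levels):
    """Shannon entropy (bits) of one region's methylation across samples.

    levels: non-negative methylation levels, one per sample (non-empty).
    """
    total = sum(levels)
    if total == 0:
        # Unmethylated everywhere: no sample stands out, treat as uniform.
        return math.log2(len(levels))
    probs = [x / total for x in levels]
    return -sum(p * math.log2(p) for p in probs if p > 0)


def candidate_dmrs(regions, threshold):
    """Return names of regions whose entropy falls below the threshold."""
    return [name for name, levels in regions.items()
            if methylation_entropy(levels) < threshold]
```

With four samples the entropy ranges from 0 bits (methylated in a single sample) to 2 bits (equal in all samples), so any threshold has to be chosen relative to the number of samples; the abstract states that QDMR derives its threshold from a methylation probability model rather than fixing it by hand.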
Transcriptional Profiling Identifies Functional Interactions of TGFβ and PPARβ/δ Signaling
Kaddatz, Kerstin; Adhikary, Till; Finkernagel, Florian; Meissner, Wolfgang; Müller-Brüsselbach, Sabine; Müller, Rolf
2010-01-01
Peroxisome proliferator-activated receptors (PPARs) not only play a key role in regulating metabolic pathways but also modulate inflammatory processes, pointing to a functional interaction between PPAR and cytokine signaling pathways. In this study, we show by genome-wide transcriptional profiling that PPARβ/δ and transforming growth factor-β (TGFβ) pathways functionally interact in human myofibroblasts and that a subset of these genes is cooperatively activated by TGFβ and PPARβ/δ. Using the angiopoietin-like 4 (ANGPTL4) gene as a model, we demonstrate that two enhancer regions cooperate to mediate the observed synergistic response. A TGFβ-responsive enhancer located ∼8 kb upstream of the transcriptional start site is regulated by a mechanism involving SMAD3, ETS1, RUNX, and AP-1 transcription factors that interact with multiple contiguous binding sites. A second enhancer (PPAR-E) consisting of three juxtaposed PPAR response elements is located in the third intron ∼3.5 kb downstream of the transcriptional start site. The PPAR-E is strongly activated by all three PPAR subtypes, with a novel type of PPAR response element motif playing a central role. Although the PPAR-E is not regulated by TGFβ, it interacts with SMAD3, ETS1, RUNX2, and AP-1 in vivo, providing a possible mechanistic explanation for the observed synergism. PMID:20595396
Integrative Analysis of Sex-Specific microRNA Networks Following Stress in Mouse Nucleus Accumbens.
Pfau, Madeline L; Purushothaman, Immanuel; Feng, Jian; Golden, Sam A; Aleyasin, Hossein; Lorsch, Zachary S; Cates, Hannah M; Flanigan, Meghan E; Menard, Caroline; Heshmati, Mitra; Wang, Zichen; Ma'ayan, Avi; Shen, Li; Hodes, Georgia E; Russo, Scott J
2016-01-01
Adult women are twice as likely as men to suffer from affective and anxiety disorders, although the mechanisms underlying heightened female stress susceptibility are incompletely understood. Recent findings in mouse Nucleus Accumbens (NAc) suggest a role for DNA methylation-driven sex differences in genome-wide transcriptional profiles. However, the role of another epigenetic process-microRNA (miR) regulation-has yet to be explored. We exposed male and female mice to Subchronic Variable Stress (SCVS), a stress paradigm that produces depression-like behavior in female, but not male, mice, and performed next generation mRNA and miR sequencing on NAc tissue. We applied a combination of differential expression, miR-mRNA network and functional enrichment analyses to characterize the transcriptional and post-transcriptional landscape of sex differences in NAc stress response. We find that male and female mice exhibit largely non-overlapping miR and mRNA profiles following SCVS. The two sexes also show enrichment of different molecular pathways and functions. Collectively, our results suggest that males and females mount fundamentally different transcriptional and post-transcriptional responses to SCVS and engage sex-specific molecular processes following stress. These findings have implications for the pathophysiology and treatment of stress-related disorders in women.
Grünberg, Sebastian; Henikoff, Steven; Hahn, Steven; Zentner, Gabriel E
2016-11-15
Mediator is a conserved, essential transcriptional coactivator complex, but its in vivo functions have remained unclear due to conflicting data regarding its genome-wide binding pattern obtained by genome-wide ChIP. Here, we used ChEC-seq, a method orthogonal to ChIP, to generate a high-resolution map of Mediator binding to the yeast genome. We find that Mediator associates with upstream activating sequences (UASs) rather than the core promoter or gene body under all conditions tested. Mediator occupancy is surprisingly correlated with transcription levels at only a small fraction of genes. Using the same approach to map TFIID, we find that TFIID is associated with both TFIID- and SAGA-dependent genes and that TFIID and Mediator occupancy is cooperative. Our results clarify Mediator recruitment and binding to the genome, showing that Mediator binding to UASs is widespread, partially uncoupled from transcription, and mediated in part by TFIID. © 2016 The Authors.
Seo, Sang Woo; Kim, Donghyuk; Szubin, Richard; Palsson, Bernhard O
2015-08-25
Three transcription factors (TFs), OxyR, SoxR, and SoxS, play a critical role in transcriptional regulation of the defense system for oxidative stress in bacteria. However, their full genome-wide regulatory potential is unknown. Here, we perform a genome-scale reconstruction of the OxyR, SoxR, and SoxS regulons in Escherichia coli K-12 MG1655. Integrative data analysis reveals that a total of 68 genes in 51 transcription units (TUs) belong to these regulons. Among them, 48 genes showed more than 2-fold changes in expression level under single-TF-knockout conditions. This reconstruction expands the genome-wide roles of these factors to include direct activation of genes related to amino acid biosynthesis (methionine and aromatic amino acids), cell wall synthesis (lipid A biosynthesis and peptidoglycan growth), and divalent metal ion transport (Mn(2+), Zn(2+), and Mg(2+)). Investigating the co-regulation of these genes with other stress-response TFs reveals that they are independently regulated by stress-specific TFs. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Tadra-Sfeir, Michelle Z.; Faoro, Helisson; Camilios-Neto, Doumit; Brusamarello-Santos, Liziane; Balsanelli, Eduardo; Weiss, Vinicius; Baura, Valter A.; Wassem, Roseli; Cruz, Leonardo M.; De Oliveira Pedrosa, Fábio; Souza, Emanuel M.; Monteiro, Rose A.
2015-01-01
Herbaspirillum seropedicae is a diazotrophic bacterium which associates endophytically with economically important gramineae. Flavonoids such as naringenin have been shown to have an effect on the interaction between H. seropedicae and its host plants. We used a high-throughput sequencing based method (RNA-Seq) to assess the influence of naringenin on the whole transcriptome profile of H. seropedicae. Three hundred and four genes were downregulated and seventy-seven were upregulated by naringenin. Data analysis revealed that genes related to bacterial flagella biosynthesis, chemotaxis and biosynthesis of peptidoglycan were repressed by naringenin. Moreover, genes involved in aromatic metabolism and multidrug transport efflux were activated. PMID:26052319
Structure-seq2: sensitive and accurate genome-wide profiling of RNA structure in vivo
Ritchey, Laura E.; Su, Zhao; Tang, Yin; Tack, David C.
2017-01-01
RNA serves many functions in biology such as splicing, temperature sensing, and innate immunity. These functions are often determined by the structure of RNA. There is thus a pressing need to understand RNA structure and how it changes during diverse biological processes both in vivo and genome-wide. Here, we present Structure-seq2, which provides nucleotide-resolution RNA structural information in vivo and genome-wide. This optimized version of our original Structure-seq method increases sensitivity by at least 4-fold and improves data quality by minimizing formation of a deleterious by-product, reducing ligation bias, and improving read coverage. We also present a variation of Structure-seq2 in which a biotinylated nucleotide is incorporated during reverse transcription, which greatly facilitates the protocol by eliminating two PAGE purification steps. We benchmark Structure-seq2 on both mRNA and rRNA structure in rice (Oryza sativa). We demonstrate that Structure-seq2 can lead to new biological insights. Our Structure-seq2 datasets uncover hidden breaks in chloroplast rRNA and identify a previously unreported N1-methyladenosine (m1A) in a nuclear-encoded Oryza sativa rRNA. Overall, Structure-seq2 is a rapid, sensitive, and unbiased method to probe RNA in vivo and genome-wide that facilitates new insights into RNA biology. PMID:28637286
He, Yajun; Mao, Shaoshuai; Gao, Yulong; Zhu, Liying; Wu, Daoming; Cui, Yixin; Li, Jiana; Qian, Wei
2016-01-01
WRKY transcription factors play important roles in responses to environmental stress stimuli. Using a genome-wide domain analysis, we identified 287 WRKY genes with 343 WRKY domains in the sequenced genome of Brassica napus, 139 in the A sub-genome and 148 in the C sub-genome. These genes were classified into eight groups based on phylogenetic analysis. In the 343 WRKY domains, a total of 26 members showed divergence in the WRKY domain, and 21 belonged to group I. This finding suggested that WRKY genes in group I are more active and variable compared with genes in other groups. Using genome-wide identification and analysis of the WRKY gene family in Brassica napus, we observed genome duplication, chromosomal/segmental duplications and tandem duplication. All of these duplications contributed to the expansion of the WRKY gene family. The duplicate segments that were detected indicated that genome duplication events occurred in the two diploid progenitors B. rapa and B. oleracea before they combined to form B. napus. Analysis of the public microarray database and EST database for B. napus indicated that 74 WRKY genes were induced or preferentially expressed under stress conditions. According to the public QTL data, we identified 77 WRKY genes in 31 QTL regions related to tolerance of various stresses. We further evaluated the expression of 26 BnaWRKY genes under multiple stresses by qRT-PCR. Most of the genes were induced by low temperature, salinity and drought stress, indicating that the WRKYs play important roles in B. napus stress responses. Further, three BnaWRKY genes were strongly responsive to all three stresses simultaneously, which suggests that these three WRKY genes may have multi-functional roles in stress tolerance and can potentially be used in breeding new rapeseed cultivars. 
We also found six tandem repeat pairs exhibiting similar expression profiles under the various stress conditions, and three pairs were mapped in the stress-related QTL regions, indicating a role for tandem duplicate WRKYs in the adaptive responses to environmental stimuli during the evolution process. Our results provide a framework for future studies regarding the function of WRKY genes in response to stress in B. napus. PMID:27322342
DOE Office of Scientific and Technical Information (OSTI.GOV)
Neuhof, Torsten; Seibold, Michael; Thewes, Sascha
This is the first report on the antifungal effects of the new glycolipopeptide hassallidin A. Because hassallidin A and the established antifungal drug caspofungin share related structural moieties, we assumed parallels in their effects on cell viability. Therefore we compared hassallidin A with caspofungin by antifungal susceptibility testing and by analysing the genome-wide transcriptional profile of Candida albicans. Furthermore, we examined modifications in ultracellular structure due to hassallidin A treatment by electron microscopy. Hassallidin A was found to be fungicidal against all tested Candida species and Cryptococcus neoformans isolates. MICs ranged from 4 to 8 μg/ml, independently of the species. Electron microscopy revealed noticeable ultrastructural changes in C. albicans cells exposed to hassallidin A. Comparing the transcriptional profile of C. albicans cells treated with hassallidin A to that of cells exposed to caspofungin, only 20 genes were found to be similarly up- or down-regulated in both assays, while 227 genes were up- or down-regulated specifically by hassallidin A. Genes up-regulated in cells exposed to hassallidin A included metabolic and mitotic genes, while genes involved in DNA repair, vesicle docking, and membrane fusion were down-regulated. In summary, our data suggest that, although hassallidin A and caspofungin have similar structures, their effects on the susceptibility and transcriptional response of yeasts seem to be different.
An ensemble model of competitive multi-factor binding of the genome
Wasson, Todd; Hartemink, Alexander J.
2009-01-01
Hundreds of different factors adorn the eukaryotic genome, binding to it in large number. These DNA binding factors (DBFs) include nucleosomes, transcription factors (TFs), and other proteins and protein complexes, such as the origin recognition complex (ORC). DBFs compete with one another for binding along the genome, yet many current models of genome binding do not consider different types of DBFs together simultaneously. Additionally, binding is a stochastic process that results in a continuum of binding probabilities at any position along the genome, but many current models tend to consider positions as being either binding sites or not. Here, we present a model that allows a multitude of DBFs, each at different concentrations, to compete with one another for binding sites along the genome. The result is an “occupancy profile,” a probabilistic description of the DNA occupancy of each factor at each position. We implement our model efficiently as the software package COMPETE. We demonstrate genome-wide and at specific loci how modeling nucleosome binding alters TF binding, and vice versa, and illustrate how factor concentration influences binding occupancy. Binding cooperativity between nearby TFs arises implicitly via mutual competition with nucleosomes. Our method applies not only to TFs, but also recapitulates known occupancy profiles of a well-studied replication origin with and without ORC binding. Importantly, the sequence preferences our model takes as input are derived from in vitro experiments. This ensures that the calculated occupancy profiles are the result of the forces of competition represented explicitly in our model and the inherent sequence affinities of the constituent DBFs. PMID:19720867
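The idea of factors competing for the same DNA, with each factor's occupancy depending on its concentration, can be illustrated with a deliberately simplified sketch. The code below is not the COMPETE algorithm, which works along the whole genome; it assumes a single site with mutually exclusive binding, where each factor's statistical weight is its concentration times its affinity. The function name `site_occupancy` and all numeric values are hypothetical.

```python
def site_occupancy(concentrations, affinities):
    """Occupancy probabilities at one site under mutually exclusive binding.

    Assumption (not from the abstract): each factor's statistical weight is
    concentration * affinity, and the unbound state has weight 1.
    """
    weights = {name: concentrations[name] * affinities[name]
               for name in concentrations}
    partition = 1.0 + sum(weights.values())  # sum over all states, including unbound
    occupancy = {name: w / partition for name, w in weights.items()}
    occupancy["unbound"] = 1.0 / partition
    return occupancy


# Hypothetical inputs: arbitrary units, chosen only for illustration.
affinities = {"nucleosome": 3.0, "TF": 0.5}
low_tf = site_occupancy({"nucleosome": 1.0, "TF": 2.0}, affinities)
high_tf = site_occupancy({"nucleosome": 1.0, "TF": 8.0}, affinities)
```

In this toy setting, raising the TF concentration increases TF occupancy and lowers nucleosome occupancy at the same site, which is the qualitative behaviour the abstract describes for factor concentration and competition.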
Lalli, Matthew A; Jang, Jiwon; Park, Joo-Hye C; Wang, Yidi; Guzman, Elmer; Zhou, Hongjun; Audouard, Morgane; Bridges, Daniel; Tovar, Kenneth R; Papuc, Sorina M; Tutulan-Cunita, Andreea C; Huang, Yadong; Budisteanu, Magdalena; Arghir, Aurora; Kosik, Kenneth S
2016-04-01
Williams syndrome (WS) is a neurodevelopmental disorder caused by a genomic deletion of ∼28 genes that results in a cognitive and behavioral profile marked by overall intellectual impairment with relative strength in expressive language and hypersocial behavior. Advancements in protocols for neuron differentiation from induced pluripotent stem cells allowed us to elucidate the molecular circuitry underpinning the ontogeny of WS. In patient-derived stem cells and neurons, we determined the expression profile of the Williams-Beuren syndrome critical region-deleted genes and the genome-wide transcriptional consequences of the hemizygous genomic microdeletion at chromosome 7q11.23. Derived neurons displayed disease-relevant hallmarks and indicated novel aberrant pathways in WS neurons including over-activated Wnt signaling accompanying an incomplete neurogenic commitment. We show that haploinsufficiency of the ATP-dependent chromatin remodeler, BAZ1B, which is deleted in WS, significantly contributes to this differentiation defect. Chromatin-immunoprecipitation (ChIP-seq) revealed BAZ1B target gene functions are enriched for neurogenesis, neuron differentiation and disease-relevant phenotypes. BAZ1B haploinsufficiency caused widespread gene expression changes in neural progenitor cells, and together with BAZ1B ChIP-seq target genes, explained 42% of the transcriptional dysregulation in WS neurons. BAZ1B contributes to regulating the balance between neural precursor self-renewal and differentiation and the differentiation defect caused by BAZ1B haploinsufficiency can be rescued by mitigating over-active Wnt signaling in neural stem cells. Altogether, these results reveal a pivotal role for BAZ1B in neurodevelopment and implicate its haploinsufficiency as a likely contributor to the neurological phenotypes in WS. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Nepal, Chirag; Coolen, Marion; Hadzhiev, Yavor; Cussigh, Delphine; Mydel, Piotr; Steen, Vidar M.; Carninci, Piero; Andersen, Jesper B.; Bally-Cuif, Laure; Müller, Ferenc; Lenhard, Boris
2016-01-01
MicroRNAs (miRNAs) play a major role in the post-transcriptional regulation of target genes, especially in development and differentiation. Our understanding about the transcriptional regulation of miRNA genes is limited by inadequate annotation of primary miRNA (pri-miRNA) transcripts. Here, we used CAGE-seq and RNA-seq to provide genome-wide identification of the pri-miRNA core promoter repertoire and its dynamic usage during zebrafish embryogenesis. We assigned pri-miRNA promoters to 152 precursor-miRNAs (pre-miRNAs), the majority of which were supported by promoter associated post-translational histone modifications (H3K4me3, H2A.Z) and RNA polymerase II (RNAPII) occupancy. We validated seven miR-9 pri-miRNAs by in situ hybridization and showed similar expression patterns as mature miR-9. In addition, processing of an alternative intronic promoter of miR-9–5 was validated by 5′ RACE PCR. Developmental profiling revealed a subset of pri-miRNAs that are maternally inherited. Moreover, we show that promoter-associated H3K4me3, H2A.Z and RNAPII marks are not only present at pri-miRNA promoters but are also specifically enriched at pre-miRNAs, suggesting chromatin level regulation of pre-miRNAs. Furthermore, we demonstrated that CAGE-seq also detects 3′-end processing of pre-miRNAs on Drosha cleavage site that correlates with miRNA-offset RNAs (moRNAs) production and provides a new tool for detecting Drosha processing events and predicting pre-miRNA processing by a genome-wide assay. PMID:26673698
Cai, Yizhi; Agmon, Neta; Choi, Woo Jin; Ubide, Alba; Stracquadanio, Giovanni; Caravelli, Katrina; Hao, Haiping; Bader, Joel S.; Boeke, Jef D.
2015-01-01
Biocontainment may be required in a wide variety of situations such as work with pathogens, field release applications of engineered organisms, and protection of intellectual properties. Here, we describe the control of growth of the brewer’s yeast, Saccharomyces cerevisiae, using both transcriptional and recombinational “safeguard” control of essential gene function. Practical biocontainment strategies dependent on the presence of small molecules require them to be active at very low concentrations, rendering them inexpensive and difficult to detect. Histone genes were placed under an inducible promoter controlled by 30 nM estradiol. The stability of the engineered genes was separately regulated by the expression of a site-specific recombinase. The combined frequency of generating viable derivatives when both systems were active was below detection (<10^−10), consistent with their orthogonal nature and the individual escape frequencies of <10^−6. Evaluation of escaper mutants suggests strategies for reducing their emergence. Transcript profiling and growth tests suggest high fitness of safeguarded strains, an important characteristic for wide acceptance. PMID:25624482
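The consistency claim in the abstract above follows from simple arithmetic: if two safeguards fail independently, their escape frequencies multiply. The sketch below assumes strict independence and uses the upper bounds quoted in the abstract (10^−6 per system, detection limit 10^−10); the function name is hypothetical.

```python
def combined_escape_frequency(frequencies):
    """Multiply per-safeguard escape frequencies, assuming independent failure."""
    combined = 1.0
    for frequency in frequencies:
        combined *= frequency
    return combined


# Upper bounds taken from the abstract; independence is an assumption.
individual_upper_bounds = [1e-6, 1e-6]
detection_limit = 1e-10

combined_upper_bound = combined_escape_frequency(individual_upper_bounds)
below_detection = combined_upper_bound < detection_limit
```

Under these assumptions the combined bound is about 10^−12, two orders of magnitude below the detection limit, so observing no escapers is what independent safeguards would predict.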
Transcript copy number estimation using a mouse whole-genome oligonucleotide microarray
Carter, Mark G; Sharov, Alexei A; VanBuren, Vincent; Dudekula, Dawood B; Carmack, Condie E; Nelson, Charlie; Ko, Minoru SH
2005-01-01
The ability to quantitatively measure the expression of all genes in a given tissue or cell with a single assay is an exciting promise of gene-expression profiling technology. An in situ-synthesized 60-mer oligonucleotide microarray designed to detect transcripts from all mouse genes was validated, as well as a set of exogenous RNA controls derived from the yeast genome (made freely available without restriction), which allow quantitative estimation of absolute endogenous transcript abundance. PMID:15998450
Applications of nanotechnology, next generation sequencing and microarrays in biomedical research.
Elingaramil, Sauli; Li, Xiaolong; He, Nongyue
2013-07-01
Next-generation sequencing technologies, microarrays and advances in bionanotechnology have had an enormous impact on research within a short time frame. This impact appears certain to increase further as many biomedical institutions are now acquiring these prevailing new technologies. Beyond conventional sampling of genome content, wide-ranging applications are rapidly evolving for next-generation sequencing, microarrays and nanotechnology. To date, these technologies have been applied in a variety of contexts, including whole-genome sequencing, targeted resequencing and discovery of transcription factor binding sites, noncoding RNA expression profiling and molecular diagnostics. This paper thus discusses current applications of nanotechnology, next-generation sequencing technologies and microarrays in biomedical research and highlights the transforming potential these technologies offer.
Ribosome profiling-guided depletion of an mRNA increases cell growth rate and protein secretion
NASA Astrophysics Data System (ADS)
Kallehauge, Thomas Beuchert; Li, Shangzhong; Pedersen, Lasse Ebdrup; Ha, Tae Kwang; Ley, Daniel; Andersen, Mikael Rørdam; Kildegaard, Helene Faustrup; Lee, Gyun Min; Lewis, Nathan E.
2017-01-01
Recombinant protein production coopts the host cell machinery to provide high protein yields of industrial enzymes or biotherapeutics. However, since protein translation is energetically expensive and tightly controlled, it is unclear if highly expressed recombinant genes are translated as efficiently as host genes. Furthermore, it is unclear how the high expression impacts global translation. Here, we present the first genome-wide view of protein translation in an IgG-producing CHO cell line, measured with ribosome profiling. Through this we found that our recombinant mRNAs were translated as efficiently as the host cell transcriptome, and sequestered up to 15% of the total ribosome occupancy. During cell culture, changes in recombinant mRNA translation were consistent with changes in transcription, demonstrating that transcript levels influence specific productivity. Using this information, we identified the unnecessary resistance marker NeoR to be a highly transcribed and translated gene. Through siRNA knock-down of NeoR, we improved the production- and growth capacity of the host cell. Thus, ribosomal profiling provides valuable insights into translation in CHO cells and can guide efforts to enhance protein production.
Jensen, Philip J; Fazio, Gennaro; Altman, Naomi; Praul, Craig; McNellis, Timothy W
2014-04-04
Apple tree breeding is slow and difficult due to long generation times, self-incompatibility, and complex genetics. The identification of molecular markers linked to traits of interest is a way to expedite the breeding process. In the present study, we aimed to identify genes whose steady-state transcript abundance was associated with inheritance of specific traits segregating in an apple (Malus × domestica) rootstock F1 breeding population, including resistance to powdery mildew (Podosphaera leucotricha) disease and woolly apple aphid (Eriosoma lanigerum). Transcription profiling was performed for 48 individual F1 apple trees from a cross of two highly heterozygous parents, using RNA isolated from healthy, actively-growing shoot tips and a custom apple DNA oligonucleotide microarray representing 26,000 unique transcripts. Genome-wide expression profiles were not clear indicators of powdery mildew or woolly apple aphid resistance phenotype. However, standard differential gene expression analysis between phenotypic groups of trees revealed relatively small sets of genes with trait-associated expression levels. For example, thirty genes were identified that were differentially expressed between trees resistant and susceptible to powdery mildew. Interestingly, the genes encoding twenty-four of these transcripts were physically clustered on chromosome 12. Similarly, seven genes were identified that were differentially expressed between trees resistant and susceptible to woolly apple aphid, and the genes encoding five of these transcripts were also clustered, this time on chromosome 17. In each case, the gene clusters were in the vicinity of previously identified major quantitative trait loci for the corresponding trait. Similar results were obtained for a series of molecular traits. Several of the differentially expressed genes were used to develop DNA polymorphism markers linked to powdery mildew disease and woolly apple aphid resistance. 
Gene expression profiling and trait-associated transcript analysis using an apple F1 population readily identified genes physically linked to powdery mildew disease resistance and woolly apple aphid resistance loci. This result was especially useful in apple, where extreme levels of heterozygosity make the development of reliable DNA markers quite difficult. The results suggest that this approach could prove effective in crops with complicated genetics, or for which few genomic information resources are available.
Genome-wide dynamics of a bacterial response to antibiotics that target the cell envelope
2011-01-01
Background A decline in the discovery of new antibacterial drugs, coupled with a persistent rise in the occurrence of drug-resistant bacteria, has highlighted antibiotics as a diminishing resource. The future development of new drugs with novel antibacterial activities requires a detailed understanding of adaptive responses to existing compounds. This study uses Streptomyces coelicolor A3(2) as a model system to determine the genome-wide transcriptional response following exposure to three antibiotics (vancomycin, moenomycin A and bacitracin) that target distinct stages of cell wall biosynthesis. Results A generalised response to all three antibiotics was identified which involves activation of transcription of the cell envelope stress sigma factor σE, together with elements of the stringent response, and of the heat, osmotic and oxidative stress regulons. Attenuation of this system by deletion of genes encoding the osmotic stress sigma factor σB or the ppGpp synthetase RelA reduced resistance to both vancomycin and bacitracin. Many antibiotic-specific transcriptional changes were identified, representing cellular processes potentially important for tolerance to each antibiotic. Sensitivity studies using mutants constructed on the basis of the transcriptome profiling confirmed a role for several such genes in antibiotic resistance, validating the usefulness of the approach. Conclusions Antibiotic inhibition of bacterial cell wall biosynthesis induces both common and compound-specific transcriptional responses. Both can be exploited to increase antibiotic susceptibility. Regulatory networks known to govern responses to environmental and nutritional stresses are also at the core of the common antibiotic response, and likely help cells survive until any specific resistance mechanisms are fully functional. PMID:21569315
Genome-wide analysis of miRNA and mRNA transcriptomes during amelogenesis.
Yin, Kaifeng; Hacia, Joseph G; Zhong, Zhe; Paine, Michael L
2014-11-19
In the rodent incisor during amelogenesis, as ameloblast cells transition from secretory stage to maturation stage, their morphology and transcriptome profiles change dramatically. Prior whole genome transcriptome analysis has given a broad picture of the molecular activities dominating both stages of amelogenesis, but this type of analysis has not included miRNA transcript profiling. In this study, we set out to document which miRNAs and corresponding target genes change significantly as ameloblasts transition from secretory- to maturation-stage amelogenesis. Total RNA samples from both secretory- and maturation-stage rat enamel organs were subjected to genome-wide miRNA and mRNA transcript profiling. We identified 59 miRNAs that were differentially expressed at the maturation stage relative to the secretory stage of enamel development (False Discovery Rate (FDR)<0.05, fold change (FC)≥1.8). In parallel, transcriptome profiling experiments identified 1,729 mRNA transcripts that were differentially expressed in the maturation stage compared to the secretory stage (FDR<0.05, FC≥1.8). Based on bioinformatics analyses, 5.8% (629 total) of these differentially expressed genes (DEGs) were highlighted as being the potential targets of 59 miRNAs that were differentially expressed in the opposite direction, in the same tissue samples. Although the number of predicted target DEGs was not higher than baseline expectations generated by examination of stably expressed miRNAs, Gene Ontology (GO) analysis showed that these 629 DEGs were enriched for ion transport, pH regulation, calcium handling, endocytotic, and apoptotic activities. Seven differentially expressed miRNAs (miR-21, miR-31, miR-488, miR-153, miR-135b, miR-135a and miR-298) in secretory- and/or maturation-stage enamel organs were confirmed by in situ hybridization. 
Further, we used luciferase reporter assays to provide evidence that two of these differentially expressed miRNAs, miR-153 and miR-31, are potential regulators for their predicted target mRNAs, Lamp1 (miR-153) and Tfrc (miR-31). In conclusion, these data indicate that miRNAs exhibit a dynamic expression pattern during the transition from secretory-stage to maturation-stage tooth enamel formation. Although they represent only one of numerous mechanisms influencing gene activities, miRNAs specific to the maturation stage could be involved in regulating several key processes of enamel maturation by influencing mRNA stability and translation.
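The selection criteria quoted in the abstract above (FDR<0.05, FC≥1.8) amount to a two-condition filter. The sketch below is one plausible reading, not the authors' pipeline: it assumes the fold-change cutoff applies symmetrically to up- and down-regulation and is evaluated on log2 ratios. The transcript names and values are made up for illustration.

```python
import math


def is_differentially_expressed(log2_ratio, fdr, fdr_cutoff=0.05, fc_cutoff=1.8):
    """Apply FDR < cutoff and |fold change| >= cutoff.

    Assumption: the fold-change threshold is symmetric, so a transcript
    passes if it is up- or down-regulated by at least fc_cutoff.
    """
    return fdr < fdr_cutoff and abs(log2_ratio) >= math.log2(fc_cutoff)


# Hypothetical records: (name, log2 maturation/secretory ratio, FDR).
records = [
    ("transcript_a", 1.0, 0.01),   # 2-fold up, significant
    ("transcript_b", -1.0, 0.01),  # 2-fold down, significant
    ("transcript_c", 0.5, 0.01),   # about 1.4-fold, below the FC cutoff
    ("transcript_d", 1.0, 0.20),   # 2-fold up, fails the FDR cutoff
]

selected = [name for name, log2_ratio, fdr in records
            if is_differentially_expressed(log2_ratio, fdr)]
```

Working on log2 ratios keeps the up and down directions symmetric, which a raw ratio threshold would not.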
Genome-wide mapping of autonomous promoter activity in human cells
van Arensbergen, Joris; FitzPatrick, Vincent D.; de Haas, Marcel; Pagie, Ludo; Sluimer, Jasper; Bussemaker, Harmen J.; van Steensel, Bas
2017-01-01
Previous methods to systematically characterize sequence-intrinsic activity of promoters have been limited by relatively low throughput and the length of sequences that could be tested. Here we present Survey of Regulatory Elements (SuRE), a method to assay more than 10^8 DNA fragments, each 0.2–2 kb in size, for their ability to drive transcription autonomously. In SuRE, a plasmid library is constructed of random genomic fragments upstream of a 20 bp barcode and decoded by paired-end sequencing. This library is then transfected into cells and transcribed barcodes are quantified in the RNA by high throughput sequencing. When applied to the human genome, we achieved a 55-fold genome coverage, allowing us to map autonomous promoter activity genome-wide. By computational modeling we delineated subregions within promoters that are relevant for their activity. For instance, we show that antisense promoter transcription is generally dependent on the sense core promoter sequences, and that most enhancers and several families of repetitive elements act as autonomous transcription initiation sites. PMID:28024146
Baumbach, Jan; Brinkrolf, Karina; Czaja, Lisa F; Rahmann, Sven; Tauch, Andreas
2006-01-01
Background The application of DNA microarray technology in post-genomic analysis of bacterial genome sequences has allowed the generation of huge amounts of data related to regulatory networks. This data along with literature-derived knowledge on regulation of gene expression has opened the way for genome-wide reconstruction of transcriptional regulatory networks. These large-scale reconstructions can be converted into in silico models of bacterial cells that allow a systematic analysis of network behavior in response to changing environmental conditions. Description CoryneRegNet was designed to facilitate the genome-wide reconstruction of transcriptional regulatory networks of corynebacteria relevant in biotechnology and human medicine. During the import and integration process of data derived from experimental studies or literature knowledge CoryneRegNet generates links to genome annotations, to identified transcription factors and to the corresponding cis-regulatory elements. CoryneRegNet is based on a multi-layered, hierarchical and modular concept of transcriptional regulation and was implemented by using the relational database management system MySQL and an ontology-based data structure. Reconstructed regulatory networks can be visualized by using the yFiles JAVA graph library. As an application example of CoryneRegNet, we have reconstructed the global transcriptional regulation of a cellular module involved in SOS and stress response of corynebacteria. Conclusion CoryneRegNet is an ontology-based data warehouse that allows a pertinent data management of regulatory interactions along with the genome-scale reconstruction of transcriptional regulatory networks. These models can further be combined with metabolic networks to build integrated models of cellular function including both metabolism and its transcriptional regulation. PMID:16478536
Genetic, molecular and physiological basis of variation in Drosophila gut immunocompetence.
Bou Sleiman, Maroun S; Osman, Dani; Massouras, Andreas; Hoffmann, Ary A; Lemaitre, Bruno; Deplancke, Bart
2015-07-27
Gut immunocompetence involves immune, stress and regenerative processes. To investigate the determinants underlying inter-individual variation in gut immunocompetence, we perform enteric infection of 140 Drosophila lines with the entomopathogenic bacterium Pseudomonas entomophila and observe extensive variation in survival. Using genome-wide association analysis, we identify several novel immune modulators. Transcriptional profiling further shows that the intestinal molecular state differs between resistant and susceptible lines, already before infection, with one transcriptional module involving genes linked to reactive oxygen species (ROS) metabolism contributing to this difference. This genetic and molecular variation is physiologically manifested in lower ROS activity, lower susceptibility to ROS-inducing agent, faster pathogen clearance and higher stem cell activity in resistant versus susceptible lines. This study provides novel insights into the determinants underlying population-level variability in gut immunocompetence, revealing how relatively minor, but systematic genetic and transcriptional variation can mediate overt physiological differences that determine enteric infection susceptibility.
Transcriptional atlas of cardiogenesis maps congenital heart disease interactome.
Li, Xing; Martinez-Fernandez, Almudena; Hartjes, Katherine A; Kocher, Jean-Pierre A; Olson, Timothy M; Terzic, Andre; Nelson, Timothy J
2014-07-01
Mammalian heart development is built on highly conserved molecular mechanisms with polygenetic perturbations resulting in a spectrum of congenital heart diseases (CHD). However, knowledge of cardiogenic ontogeny that regulates proper cardiogenesis remains largely based on candidate-gene approaches. Mapping the dynamic transcriptional landscape of cardiogenesis from a genomic perspective is essential to integrate the knowledge of heart development into translational applications that accelerate disease discovery efforts toward mechanistic-based treatment strategies. Herein, we designed a time-course transcriptome analysis to investigate the genome-wide dynamic expression landscape of innate murine cardiogenesis ranging from embryonic stem cells to adult cardiac structures. This comprehensive analysis generated temporal and spatial expression profiles, revealed stage-specific gene functions, and mapped the dynamic transcriptome of cardiogenesis to curated pathways. Reconciling known genetic underpinnings of CHD, we deconstructed a disease-centric dynamic interactome encoded within this cardiogenic atlas to identify stage-specific developmental disturbances clustered on regulation of epithelial-to-mesenchymal transition (EMT), BMP signaling, NF-AT signaling, TGFb-dependent EMT, and Notch signaling. Collectively, this cardiogenic transcriptional landscape defines the time-dependent expression of cardiac ontogeny and prioritizes regulatory networks at the interface between health and disease. Copyright © 2014 the American Physiological Society.
Kimura, Shinzo; Ishidou, Emi; Kurita, Sakiko; Suzuki, Yoshiteru; Shibato, Junko; Rakwal, Randeep; Iwahashi, Hitoshi
2006-07-21
Ionizing radiation (IR) is the most enigmatic of the genotoxic stress inducers in our environment and has been present for eons. IR is generally considered harmful, and has been the subject of numerous studies, mostly looking at the DNA damaging effects in cells and the repair mechanisms therein. Moreover, few studies have focused on large-scale identification of cellular responses to IR, and to this end, we describe here an initial study on the transcriptional responses of the unicellular genome model, yeast (Saccharomyces cerevisiae strain S288C), by cDNA microarray. The effect of two different types of IR, X-rays and gamma (γ)-rays, was investigated by irradiating the yeast cells cultured in YPD medium with 50 Gy doses of X- and gamma-rays, followed by resuspension of the cells in YPD for time-course experiments. The samples were collected for microarray analysis at 20, 40, and 80 min after irradiation. Microarray analysis revealed a time-course transcriptional profile of changed gene expressions. Up-regulated genes belonged to the functional categories mainly related to cell cycle and DNA processing, cell rescue defense and virulence, protein and cell fate, and metabolism (X- and gamma-rays). Similarly, for X- and gamma-rays, the down-regulated genes belonged to mostly transcription and protein synthesis, cell cycle and DNA processing, control of cellular organization, cell fate, and C-compound and carbohydrate metabolism categories, respectively. This study provides for the first time a snapshot of the genome-wide mRNA expression profiles in X- and gamma-ray post-irradiated yeast cells and comparatively interprets/discusses the changed gene functional categories as effects of these two radiations vis-à-vis their energy levels.
Genome wide approaches to identify protein-DNA interactions.
Ma, Tao; Ye, Zhenqing; Wang, Liguo
2018-05-29
Transcription factors are DNA-binding proteins that play key roles in many fundamental biological processes. Unraveling their interactions with DNA is essential to identify their target genes and understand the regulatory network. Genome-wide identification of their binding sites became feasible thanks to recent progress in experimental and computational approaches. ChIP-chip, ChIP-seq, and ChIP-exo are three widely used techniques to demarcate genome-wide transcription factor binding sites. This review aims to provide an overview of these three techniques including their experimental procedures, computational approaches, and popular analytic tools. ChIP-chip, ChIP-seq, and ChIP-exo have been the major techniques to study genome-wide in vivo protein-DNA interaction. Due to the rapid development of next-generation sequencing technology, array-based ChIP-chip is deprecated and ChIP-seq has become the most widely used technique to identify transcription factor binding sites genome-wide. The newly developed ChIP-exo further improves the spatial resolution to a single nucleotide. Numerous tools have been developed to analyze ChIP-chip, ChIP-seq and ChIP-exo data. However, different programs may employ different mechanisms or underlying algorithms, so each inherently carries its own set of statistical assumptions and biases. Choosing the most appropriate analytic program for a given experiment therefore needs careful consideration. Moreover, most programs only have a command-line interface, so their installation and usage require basic computational expertise in Unix/Linux. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Genome-wide mapping of 5-hydroxymethylcytosine in embryonic stem cells.
Pastor, William A; Pape, Utz J; Huang, Yun; Henderson, Hope R; Lister, Ryan; Ko, Myunggon; McLoughlin, Erin M; Brudno, Yevgeny; Mahapatra, Sahasransu; Kapranov, Philipp; Tahiliani, Mamta; Daley, George Q; Liu, X Shirley; Ecker, Joseph R; Milos, Patrice M; Agarwal, Suneet; Rao, Anjana
2011-05-19
5-hydroxymethylcytosine (5hmC) is a modified base present at low levels in diverse cell types in mammals. 5hmC is generated by the TET family of Fe(II) and 2-oxoglutarate-dependent enzymes through oxidation of 5-methylcytosine (5mC). 5hmC and TET proteins have been implicated in stem cell biology and cancer, but information on the genome-wide distribution of 5hmC is limited. Here we describe two novel and specific approaches to profile the genomic localization of 5hmC. The first approach, termed GLIB (glucosylation, periodate oxidation, biotinylation) uses a combination of enzymatic and chemical steps to isolate DNA fragments containing as few as a single 5hmC. The second approach involves conversion of 5hmC to cytosine 5-methylenesulphonate (CMS) by treatment of genomic DNA with sodium bisulphite, followed by immunoprecipitation of CMS-containing DNA with a specific antiserum to CMS. High-throughput sequencing of 5hmC-containing DNA from mouse embryonic stem (ES) cells showed strong enrichment within exons and near transcriptional start sites. 5hmC was especially enriched at the start sites of genes whose promoters bear dual histone 3 lysine 27 trimethylation (H3K27me3) and histone 3 lysine 4 trimethylation (H3K4me3) marks. Our results indicate that 5hmC has a probable role in transcriptional regulation, and suggest a model in which 5hmC contributes to the 'poised' chromatin signature found at developmentally-regulated genes in ES cells.
Høgslund, Niels; Radutoiu, Simona; Krusell, Lene; Voroshilova, Vera; Hannah, Matthew A.; Goffard, Nicolas; Sanchez, Diego H.; Lippold, Felix; Ott, Thomas; Sato, Shusei; Tabata, Satoshi; Liboriussen, Poul; Lohmann, Gitte V.; Schauser, Leif; Weiller, Georg F.; Udvardi, Michael K.; Stougaard, Jens
2009-01-01
Genetic analyses of plant symbiotic mutants have led to the identification of key genes involved in Rhizobium-legume communication as well as in development and function of nitrogen fixing root nodules. However, the impact of these genes in coordinating the transcriptional programs of nodule development has only been studied in limited and isolated studies. Here, we present an integrated genome-wide analysis of transcriptome landscapes in Lotus japonicus wild-type and symbiotic mutant plants. Encompassing five different organs, five stages of the sequentially developed determinate Lotus root nodules, and eight mutants impaired at different stages of the symbiotic interaction, our data set integrates an unprecedented combination of organ- or tissue-specific profiles with mutant transcript profiles. In total, 38 different conditions sampled under the same well-defined growth regimes were included. This comprehensive analysis unravelled new and unexpected patterns of transcriptional regulation during symbiosis and organ development. Contrary to expectations, none of the previously characterized nodulins were among the 37 genes specifically expressed in nodules. Another surprise was the extensive transcriptional response in whole root compared to the susceptible root zone where the cellular response is most pronounced. A large number of transcripts predicted to encode transcriptional regulators, receptors and proteins involved in signal transduction, as well as many genes with unknown function, were found to be regulated during nodule organogenesis and rhizobial infection. Combining wild type and mutant profiles of these transcripts demonstrates the activation of a complex genetic program that delineates symbiotic nitrogen fixation. 
The complete data set was organized into an indexed expression directory that is accessible from a resource database, and here we present selected examples of biological questions that can be addressed with this comprehensive and powerful gene expression data set. PMID:19662091
Decoding the non-coding genome: elucidating genetic risk outside the coding genome.
Barr, C L; Misener, V L
2016-01-01
Current evidence emerging from genome-wide association studies indicates that the genetic underpinnings of complex traits are likely attributable to genetic variation that changes gene expression, rather than (or in combination with) variation that changes protein-coding sequences. This is particularly compelling with respect to psychiatric disorders, as genetic changes in regulatory regions may result in differential transcriptional responses to developmental cues and environmental/psychosocial stressors. Until recently, however, the link between transcriptional regulation and psychiatric genetic risk has been understudied. Multiple obstacles have contributed to the paucity of research in this area, including challenges in identifying the positions of remote (distal from the promoter) regulatory elements (e.g. enhancers) and their target genes and the underrepresentation of neural cell types and brain tissues in epigenome projects - the availability of high-quality brain tissues for epigenetic and transcriptome profiling, particularly for the adolescent and developing brain, has been limited. Further challenges have arisen in the prediction and testing of the functional impact of DNA variation with respect to multiple aspects of transcriptional control, including regulatory-element interaction (e.g. between enhancers and promoters), transcription factor binding and DNA methylation. Further, the brain has uncommon DNA-methylation marks with unique genomic distributions not found in other tissues - current evidence suggests the involvement of non-CG methylation and 5-hydroxymethylation in neurodevelopmental processes but much remains unknown. We review here knowledge gaps as well as both technological and resource obstacles that will need to be overcome in order to elucidate the involvement of brain-relevant gene-regulatory variants in genetic risk for psychiatric disorders. © 2015 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
Ma, Jin-Qi; Jian, Hong-Ju; Yang, Bo; Lu, Kun; Zhang, Ao-Xiang; Liu, Pu; Li, Jia-Na
2017-07-15
Growth-regulating factors (GRFs) are plant-specific transcription factors that help regulate plant growth and development. Genome-wide identification and evolutionary analyses of GRF gene families have been performed in Arabidopsis thaliana, Zea mays, Oryza sativa, and Brassica rapa, but a comprehensive analysis of the GRF gene family in oilseed rape (Brassica napus) has not yet been reported. In the current study, we identified 35 members of the BnGRF family in B. napus. We analyzed the chromosomal distribution, phylogenetic relationships (Bayesian Inference and Neighbor Joining method), gene structures, and motifs of the BnGRF family members, as well as the cis-acting regulatory elements in their promoters. We also analyzed the expression patterns of 15 randomly selected BnGRF genes in various tissues and in plant varieties with different harvest indices and gibberellic acid (GA) responses. The expression levels of BnGRFs under GA treatment suggested the presence of possible negative feedback regulation. The evolutionary patterns and expression profiles of BnGRFs uncovered in this study increase our understanding of the important roles played by these genes in oilseed rape. Copyright © 2017. Published by Elsevier B.V.
2015-01-01
The present study investigated the possibilities and limitations of implementing a genome-wide transcription-based approach that takes into account genetic and environmental variation to better understand the response of natural populations to stressors. When exposing two different Daphnia pulex genotypes (a cadmium-sensitive and a cadmium-tolerant one) to cadmium, the toxic cyanobacteria Microcystis aeruginosa, and their mixture, we found that observations at the transcriptomic level do not always explain observations at a higher level (growth, reproduction). For example, although cadmium elicited an adverse effect at the organismal level, almost no genes were differentially expressed after cadmium exposure. In addition, we identified oxidative stress and polyunsaturated fatty acid metabolism-related pathways, as well as trypsin and neurexin IV gene-families as candidates for the underlying causes of genotypic differences in tolerance to Microcystis. Furthermore, the whole-genome transcriptomic data of a stressor mixture allowed a better understanding of mixture responses by evaluating interactions between two stressors at the gene-expression level against the independent action baseline model. This approach has indicated that ubiquinone pathway and the MAPK serine-threonine protein kinase and collagens gene-families were enriched with genes showing an interactive effect in expression response to exposure to the mixture of the stressors, while transcription and translation-related pathways and gene-families were mostly related with genotypic differences in interactive responses to this mixture. Collectively, our results indicate that the methods we employed may improve further characterization of the possibilities and limitations of transcriptomics approaches in the adverse outcome pathway framework and in predictions of multistressor effects on natural populations. PMID:24552364
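The independent action baseline mentioned above can be illustrated with a minimal sketch. It uses the classic fractional-effect form of independent action; how the authors mapped gene-expression changes onto this baseline is not stated in the abstract, so the function names, the example effect sizes, and the tolerance are all assumptions made for illustration.

```python
def independent_action(effect_a, effect_b):
    """Expected combined fractional effect if two stressors act independently."""
    for effect in (effect_a, effect_b):
        if not 0.0 <= effect <= 1.0:
            raise ValueError("effects must be fractions between 0 and 1")
    # Probability of being affected by at least one of two independent stressors.
    return 1.0 - (1.0 - effect_a) * (1.0 - effect_b)


def classify_interaction(observed, effect_a, effect_b, tolerance=0.05):
    """Compare an observed mixture effect with the independent action baseline.

    The tolerance is a hypothetical cut-off, not a value from the study.
    """
    expected = independent_action(effect_a, effect_b)
    if observed > expected + tolerance:
        return "more than independent action"
    if observed < expected - tolerance:
        return "less than independent action"
    return "consistent with independent action"


# Hypothetical single-stressor effects of 20% and 50%.
print(independent_action(0.2, 0.5))
print(classify_interaction(0.9, 0.2, 0.5))
```

A mixture response is called interactive here only when it departs from the baseline by more than the chosen tolerance; a real analysis would replace that fixed margin with a statistical test.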
The Landscape of Somatic Chromosomal Copy Number Aberrations in GEM Models of Prostate Carcinoma
Bianchi-Frias, Daniella; Hernandez, Susana A.; Coleman, Roger; Wu, Hong; Nelson, Peter S.
2015-01-01
Human prostate cancer (PCa) is known to harbor recurrent genomic aberrations consisting of chromosomal losses, gains, rearrangements and mutations that involve oncogenes and tumor suppressors. Genetically engineered mouse (GEM) models have been constructed to assess the causal role of these putative oncogenic events and provide molecular insight into disease pathogenesis. While GEM models generally initiate neoplasia by manipulating a single gene, expression profiles of GEM tumors typically comprise hundreds of transcript alterations. It is unclear whether these transcriptional changes represent the pleiotropic effects of single oncogenes, and/or cooperating genomic or epigenomic events. Therefore, it was determined whether structural chromosomal alterations occur in GEM models of PCa and whether the changes are concordant with human carcinomas. Whole genome array-based comparative genomic hybridization (CGH) was used to identify somatic chromosomal copy number aberrations (SCNAs) in the widely used TRAMP, Hi-Myc, Pten-null and LADY GEM models. Interestingly, very few SCNAs were identified and the genomic architecture of Hi-Myc, Pten-null and LADY tumors was essentially identical to the germline. TRAMP neuroendocrine carcinomas contained SCNAs, which comprised three recurrent aberrations including a single copy loss of chromosome 19 (encoding Pten). In contrast, cell lines derived from the TRAMP, Hi-Myc, and Pten-null tumors were notable for numerous SCNAs that included copy gains of chromosome 15 (encoding Myc) and losses of chromosome 11 (encoding p53). PMID:25298407
Liu, Chaoyang; Wang, Xia; Xu, Yuantao; Deng, Xiuxin; Xu, Qiang
2014-10-01
MYB transcription factors represent one of the largest gene families in plant genomes. Sweet orange (Citrus sinensis) is one of the most important fruit crops worldwide, and its genome has recently been sequenced. This provides an opportunity to investigate the organization and evolutionary characteristics of sweet orange MYB genes from a whole-genome view. In the present study, we identified 100 R2R3-MYB genes in the sweet orange genome. A comprehensive analysis of this gene family was performed, including the phylogeny, gene structure, chromosomal localization and expression pattern analyses. The 100 genes were divided into 29 subfamilies based on the sequence similarity and phylogeny, and the classification was also well supported by the highly conserved exon/intron structures and motif composition. The phylogenomic comparison of the MYB gene family among sweet orange and related plant species (Arabidopsis, cacao and papaya) suggested the existence of functional divergence during evolution. Expression profiling indicated that sweet orange R2R3-MYB genes exhibited distinct temporal and spatial expression patterns. Our analysis suggested that the sweet orange MYB genes may play important roles in different plant biological processes, some of which may be potentially involved in citrus fruit quality. These results will be useful for future functional analysis of the MYB gene family in sweet orange.
Genome-wide characterization of the SiDof gene family in foxtail millet (Setaria italica).
Zhang, Li; Liu, Baoling; Zheng, Gewen; Zhang, Aiying; Li, Runzhi
2017-01-01
Dof (DNA binding with one finger) proteins, which constitute a class of transcription factors found exclusively in plants, are involved in numerous physiological and biochemical reactions affecting growth and development. A genome-wide analysis of SiDof genes was performed in this study. Thirty-five SiDof genes were identified and those genes were unevenly distributed across nine chromosomes in the Setaria italica genome. Protein lengths, molecular weights, and theoretical isoelectric points of SiDofs all vary greatly. Gene structure analysis demonstrated that most SiDof genes lack introns. Phylogenetic analysis of SiDof proteins and Dof proteins from Arabidopsis thaliana, rice, sorghum, and Setaria viridis revealed six major groups. Analysis of RNA-Seq data indicated that SiDof gene expression levels varied across roots, stems, leaves, and spike. In addition, expression profiling of SiDof genes in response to stress suggested that SiDof 7 and SiDof 15 are involved in drought stress signalling. Overall, this study could provide novel information on SiDofs for further investigation in foxtail millet. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
RegPrecise 3.0--a resource for genome-scale exploration of transcriptional regulation in bacteria.
Novichkov, Pavel S; Kazakov, Alexey E; Ravcheev, Dmitry A; Leyn, Semen A; Kovaleva, Galina Y; Sutormin, Roman A; Kazanov, Marat D; Riehl, William; Arkin, Adam P; Dubchak, Inna; Rodionov, Dmitry A
2013-11-01
Genome-scale prediction of gene regulation and reconstruction of transcriptional regulatory networks in prokaryotes is one of the critical tasks of modern genomics. Bacteria from different taxonomic groups, whose lifestyles and natural environments are substantially different, possess highly diverged transcriptional regulatory networks. The comparative genomics approaches are useful for in silico reconstruction of bacterial regulons and networks operated by both transcription factors (TFs) and RNA regulatory elements (riboswitches). RegPrecise (http://regprecise.lbl.gov) is a web resource for collection, visualization and analysis of transcriptional regulons reconstructed by comparative genomics. We significantly expanded a reference collection of manually curated regulons we introduced earlier. RegPrecise 3.0 provides access to inferred regulatory interactions organized by phylogenetic, structural and functional properties. Taxonomy-specific collections include 781 TF regulogs inferred in more than 160 genomes representing 14 taxonomic groups of Bacteria. TF-specific collections include regulogs for a selected subset of 40 TFs reconstructed across more than 30 taxonomic lineages. Novel collections of regulons operated by RNA regulatory elements (riboswitches) include nearly 400 regulogs inferred in 24 bacterial lineages. RegPrecise 3.0 provides four classifications of the reference regulons implemented as controlled vocabularies: 55 TF protein families; 43 RNA motif families; ~150 biological processes or metabolic pathways; and ~200 effectors or environmental signals. Genome-wide visualization of regulatory networks and metabolic pathways covered by the reference regulons is available for all studied genomes. A separate section of RegPrecise 3.0 contains draft regulatory networks in 640 genomes obtained by a conservative propagation of the reference regulons to closely related genomes.
RegPrecise 3.0 gives access to the transcriptional regulons reconstructed in bacterial genomes. Analytical capabilities include exploration of: regulon content, structure and function; TF binding site motifs; conservation and variations in genome-wide regulatory networks across all taxonomic groups of Bacteria. RegPrecise 3.0 was selected as a core resource on transcriptional regulation of the Department of Energy Systems Biology Knowledgebase, an emerging software and data environment designed to enable researchers to collaboratively generate, test and share new hypotheses about gene and protein functions, perform large-scale analyses, and model interactions in microbes, plants, and their communities.
Cox, Murray P; Dong, Ting; Shen, Genggeng; Dalvi, Yogesh; Scott, D Barry; Ganley, Austen R D
2014-03-01
Polyploidy, a state in which the chromosome complement has undergone an increase, is a major force in evolution. Understanding the consequences of polyploidy has received much attention, and allopolyploids, which result from the union of two different parental genomes, are of particular interest because they must overcome a suite of biological responses to this merger, known as "genome shock." A key question is what happens to gene expression of the two gene copies following allopolyploidization, but until recently the tools to answer this question on a genome-wide basis were lacking. Here we utilize high throughput transcriptome sequencing to produce the first genome-wide picture of gene expression response to allopolyploidy in fungi. A novel pipeline for assigning sequence reads to the gene copies was used to quantify their expression in a fungal allopolyploid. We find that the transcriptional response to allopolyploidy is predominantly conservative: both copies of most genes are retained; over half the genes inherit parental gene expression patterns; and parental differential expression is often lost in the allopolyploid. Strikingly, the patterns of gene expression change are highly concordant with the genome-wide expression results of a cotton allopolyploid. The very different nature of these two allopolyploids implies a conserved, eukaryote-wide transcriptional response to genome merger. We provide evidence that the transcriptional responses we observe are mostly driven by intrinsic differences between the regulatory systems in the parent species, and from this propose a mechanistic model in which the cross-kingdom conservation in transcriptional response reflects conservation of the mutational processes underlying eukaryotic gene regulatory evolution. 
This work provides a platform to develop a universal understanding of gene expression response to allopolyploidy and suggests that allopolyploids are an exceptional system to investigate gene regulatory changes that have evolved in the parental species prior to allopolyploidization.
Gao, Hui; Zhao, Chunyan
2018-01-01
Chromatin immunoprecipitation (ChIP) has become the most effective and widely used tool to study the interactions between specific proteins or modified forms of proteins and a genomic DNA region. Combined with genome-wide profiling technologies, such as microarray hybridization (ChIP-on-chip) or massively parallel sequencing (ChIP-seq), ChIP could provide a genome-wide mapping of in vivo protein-DNA interactions in various organisms. Here, we describe a protocol of ChIP-on-chip that uses tiling microarray to obtain a genome-wide profiling of ChIPed DNA.
CisMiner: Genome-Wide In-Silico Cis-Regulatory Module Prediction by Fuzzy Itemset Mining
Navarro, Carmen; Lopez, Francisco J.; Cano, Carlos; Garcia-Alcalde, Fernando; Blanco, Armando
2014-01-01
Eukaryotic gene control regions are known to be spread throughout non-coding DNA sequences which may appear distant from the gene promoter. Transcription factors are proteins that coordinately bind to these regions at transcription factor binding sites to regulate gene expression. Several tools can detect significant co-occurrences of closely located binding sites (cis-regulatory modules, CRMs). However, these tools present at least one of the following limitations: 1) a scope limited to promoter or conserved regions of the genome; 2) an inability to identify combinations involving more than two motifs; 3) a requirement for prior information about target motifs. In this work we present CisMiner, a novel methodology to detect putative CRMs by means of a fuzzy itemset mining approach able to operate at genome-wide scale. CisMiner performs a blind search for CRMs without prior information about target CRMs or any limit on the number of motifs. CisMiner tackles the combinatorial complexity of genome-wide cis-regulatory module extraction using a natural representation of motif combinations as itemsets and applying the Top-Down Fuzzy Frequent-Pattern Tree algorithm to identify significant itemsets. Fuzzy technology allows CisMiner to better handle the imprecision and noise inherent to regulatory processes. Results obtained for a set of well-known binding sites in the S. cerevisiae genome show that our method yields highly reliable predictions. Furthermore, CisMiner was also applied to putative in-silico predicted transcription factor binding sites to identify significant combinations in S. cerevisiae and D. melanogaster, proving that our approach can be further applied genome-wide to more complex genomes. CisMiner is freely accessible at: http://genome2.ugr.es/cisminer.
CisMiner can be queried for the results presented in this work and can also perform a customized cis-regulatory module prediction on a query set of transcription factor binding sites provided by the user. PMID:25268582
USDA-ARS's Scientific Manuscript database
Common bean (Phaseolus vulgaris) and soybean (Glycine max) both belong to the Phaseoleae tribe and share significant coding sequence homology. To evaluate the utility of the soybean GeneChip for transcript profiling of common bean, we hybridized cRNAs purified from nodule, leaf, and root of common b...
Traverse, Charles C; Ochman, Howard
2017-08-29
Advances in sequencing technologies have enabled direct quantification of genome-wide errors that occur during RNA transcription. These errors occur at rates that are orders of magnitude higher than rates during DNA replication, but due to technical difficulties such measurements have been limited to single-base substitutions and have not yet quantified the scope of transcription insertions and deletions. Previous reporter gene assay findings suggested that transcription indels are produced exclusively by elongation complex slippage at homopolymeric runs, so we enumerated indels across the protein-coding transcriptomes of Escherichia coli and Buchnera aphidicola, which differ widely in their genomic base compositions and incidence of repeat regions. As anticipated from prior assays, transcription insertions prevailed in homopolymeric runs of A and T; however, transcription deletions arose in much more complex sequences and were rarely associated with homopolymeric runs. By reconstructing the relocated positions of the elongation complex as inferred from the sequences inserted or deleted during transcription, we show that continuation of transcription after slippage hinges on the degree of nucleotide complementarity within the RNA:DNA hybrid at the new DNA template location. IMPORTANCE The high level of mistakes generated during transcription can result in the accumulation of malfunctioning and misfolded proteins, which can alter global gene regulation, and in the expenditure of energy to degrade these nonfunctional proteins. The transcriptome-wide occurrence of base substitutions has been elucidated in bacteria, but information on transcription insertions and deletions (errors that potentially have more dire effects on protein function) is limited to reporter gene constructs. Here, we capture the transcriptome-wide spectrum of insertions and deletions in Escherichia coli and Buchnera aphidicola and show that they occur at rates approaching those of base substitutions.
Knowledge of the full extent of sequences subject to transcription indels supports a new model of bacterial transcription slippage, one that relies on the number of complementary bases between the transcript and the DNA template to which it slipped. Copyright © 2017 Traverse and Ochman.
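Because the analysis above turns on whether indels fall within homopolymeric runs, a small sketch of how such runs can be located in a sequence may help. This is not the authors' pipeline: the function name and the minimum run length of 4 are assumptions chosen for illustration, since the abstract does not define what length counts as a run.

```python
from itertools import groupby


def homopolymer_runs(seq, min_len=4):
    """Return (start, base, length) for each run of one repeated base.

    Positions are 0-based; min_len is a hypothetical threshold.
    """
    runs = []
    position = 0
    for base, group in groupby(seq.upper()):
        length = sum(1 for _ in group)
        if length >= min_len:
            runs.append((position, base, length))
        position += length
    return runs


# Hypothetical sequence with one run of A and one run of T.
print(homopolymer_runs("GCAAAAATGTTTTC"))
```

An indel position could then be checked against these intervals to decide whether it lies inside a homopolymeric run or in more complex sequence.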
Celton, Jean-Marc; Gaillard, Sylvain; Bruneau, Maryline; Pelletier, Sandra; Aubourg, Sébastien; Martin-Magniette, Marie-Laure; Navarro, Lionel; Laurens, François; Renou, Jean-Pierre
2014-07-01
Characterizing the transcriptome of eukaryotic organisms is essential for studying gene regulation and its impact on phenotype. The realization that anti-sense (AS) and noncoding RNA transcription is pervasive in many genomes has emphasized our limited understanding of gene transcription and post-transcriptional regulation. Numerous mechanisms including convergent transcription, anti-correlated expression of sense and AS transcripts, and RNAi remain ill-defined. Here, we have combined microarray analysis and high-throughput sequencing of small RNAs (sRNAs) to unravel the complexity of transcriptional and potential post-transcriptional regulation in eight organs of apple (Malus × domestica). The percentage of AS transcript expression is higher than that identified in annual plants such as rice and Arabidopsis thaliana. Furthermore, we show that a majority of AS transcripts are transcribed beyond 3'UTR regions, and may cover a significant portion of the predicted sense transcripts. Finally we demonstrate at a genome-wide scale that anti-sense transcript expression is correlated with the presence of both short (21-23 nt) and long (> 30 nt) siRNAs, and that the sRNA coverage depth varies with the level of AS transcript expression. Our study provides a new insight on the functional role of anti-sense transcripts at the genome-wide level, and a new basis for the understanding of sRNA biogenesis in plants. © 2014 INRA. New Phytologist © 2014 New Phytologist Trust.
Hicks, Chindo; Kumar, Ranjit; Pannuti, Antonio; Miele, Lucio
2012-01-01
Variable response and resistance to tamoxifen treatment in breast cancer patients remains a major clinical problem. To determine whether genes and biological pathways containing SNPs associated with risk for breast cancer are dysregulated in response to tamoxifen treatment, we performed analysis combining information from 43 genome-wide association studies with gene expression data from 298 ER(+) breast cancer patients treated with tamoxifen and 125 ER(+) controls. We identified 95 genes which distinguished tamoxifen treated patients from controls. Additionally, we identified 54 genes which stratified tamoxifen treated patients into two distinct groups. We identified biological pathways containing SNPs associated with risk for breast cancer, which were dysregulated in response to tamoxifen treatment. Key pathways identified included the apoptosis, P53, NFkB, DNA repair and cell cycle pathways. Combining GWAS with transcription profiling provides a unified approach for associating GWAS findings with response to drug treatment and identification of potential drug targets.
Genome-wide identification and expression profiling of the SnRK2 gene family in Malus prunifolia.
Shao, Yun; Qin, Yuan; Zou, Yangjun; Ma, Fengwang
2014-11-15
Sucrose non-fermenting-1-related protein kinase 2 (SnRK2) constitutes a small plant-specific serine/threonine kinase family with essential roles in the abscisic acid (ABA) signal pathway and in responses to osmotic stress. Although a genome-wide analysis of this family has been conducted in some species, little is known about SnRK2 genes in apple (Malus domestica). We identified 14 putative sequences encoding 12 deduced SnRK2 proteins within the apple genome. Gene chromosomal location and synteny analysis of the apple SnRK2 genes indicated that tandem and segmental duplications have likely contributed to the expansion and evolution of these genes. All 12 full-length coding sequences were confirmed by cloning from Malus prunifolia. The gene structure and motif compositions of the apple SnRK2 genes were analyzed. Phylogenetic analysis showed that MpSnRK2s could be classified into four groups. Profiling of these genes presented differential patterns of expression in various tissues. Under stress conditions, transcript levels for some family members were up-regulated in the leaves in response to drought, salinity, or ABA treatments. This suggested their possible roles in plant response to abiotic stress. Our findings provide essential information about SnRK2 genes in apple and will contribute to further functional dissection of this gene family. Copyright © 2014 Elsevier B.V. All rights reserved.
Wang, Peipei; Li, Jing; Gao, Xiaoyang; Zhang, Di; Li, Anlin; Liu, Changning
2018-05-29
Physic nut (Jatropha curcas L.) is a species of flowering plant with great potential for biofuel production and as an emerging model organism for functional genomic analysis, particularly in the Euphorbiaceae family. DNA binding with one finger (Dof) transcription factors play critical roles in numerous biological processes in plants. Nevertheless, knowledge of the members and of the evolutionary and functional characteristics of the Dof gene family in physic nut remains insufficient. Therefore, we performed a genome-wide screening and characterization of the Dof gene family within the physic nut draft genome. In total, 24 JcDof genes (encoding 33 JcDof proteins) were identified. All the JcDof genes were divided into three major groups based on phylogenetic inference, which was further validated by the subsequent gene structure and motif analysis. Genome comparison revealed that segmental duplication may have played crucial roles in the expansion of the JcDof gene family, and gene expansion was mainly subjected to positive selection. The expression profile demonstrated the broad involvement of JcDof genes in response to various abiotic stresses, hormonal treatments and functional divergence. This study provides valuable information for better understanding the evolution of JcDof genes, and lays a foundation for future functional exploration of JcDof genes.
Genome-wide increase in histone H2A ubiquitylation in a mouse model of Huntington's disease.
McFarland, Karen N; Das, Sudeshna; Sun, Ting Ting; Leyfer, Dmitri; Kim, Mee-Ohk; Xia, Eva; Sangrey, Gavin R; Kuhn, Alexandre; Luthi-Carter, Ruth; Clark, Timothy W; Sadri-Vakili, Ghazaleh; Cha, Jang-Ho J
2013-01-01
Huntington's disease (HD) is a neurodegenerative disorder with selective vulnerability of striatal neurons and involves extensive transcriptional dysregulation early in the disease process. Previous work in cell and mouse models has shown that histone modifications are altered in HD. Specifically, monoubiquitylated histone H2A (uH2A) is present at the promoters of downregulated genes which led to the hypothesis that uH2A plays a role in transcriptional silencing in HD. To broaden our view of uH2A function in transcription in HD, we examined genome-wide binding sites of uH2A in 12-week old striatal tissue from R6/2 transgenic HD mouse model. We used chromatin immunoprecipitation followed by genomic promoter microarray hybridization (ChIP-chip) and then interrogated how these binding sites correlate with transcribed genes. Our analysis reveals that, while uH2A levels are globally increased at the genome in the transgenic (TG) striatum, uH2A localization at a gene did not strongly correlate with the absence of its transcript. Furthermore, analysis of differential ubiquitylation in wild-type (WT) and TG striata did not reveal the expected enrichment of uH2A at genes with decreased expression in the TG striatum. This first description of genome-wide localization of uH2A in an HD model reveals that monoubiquitylation of histone H2A may not function at the level of the individual gene but may rather influence transcription through global chromatin structure.
Systematic analysis of transcription start sites in avian development.
Lizio, Marina; Deviatiiarov, Ruslan; Nagai, Hiroki; Galan, Laura; Arner, Erik; Itoh, Masayoshi; Lassmann, Timo; Kasukawa, Takeya; Hasegawa, Akira; Ros, Marian A; Hayashizaki, Yoshihide; Carninci, Piero; Forrest, Alistair R R; Kawaji, Hideya; Gusev, Oleg; Sheng, Guojun
2017-09-01
Cap Analysis of Gene Expression (CAGE) in combination with single-molecule sequencing technology allows precision mapping of transcription start sites (TSSs) and genome-wide capture of promoter activities in differentiated and steady state cell populations. Much less is known about whether TSS profiling can characterize diverse and non-steady state cell populations, such as the approximately 400 transitory and heterogeneous cell types that arise during ontogeny of vertebrate animals. To gain such insight, we used the chick model and performed CAGE-based TSS analysis on embryonic samples covering the full 3-week developmental period. In total, 31,863 robust TSS peaks (>1 tag per million [TPM]) were mapped to the latest chicken genome assembly, of which 34% to 46% were active in any given developmental stage. ZENBU, a web-based, open-source platform, was used for interactive data exploration. TSSs of genes critical for lineage differentiation could be precisely mapped and their activities tracked throughout development, suggesting that non-steady state and heterogeneous cell populations are amenable to CAGE-based transcriptional analysis. Our study also uncovered a large set of extremely stable housekeeping TSSs and many novel stage-specific ones. We furthermore demonstrated that TSS mapping could expedite motif-based promoter analysis for regulatory modules associated with stage-specific and housekeeping genes. Finally, using Brachyury as an example, we provide evidence that precise TSS mapping in combination with Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)-on technology enables us, for the first time, to efficiently target endogenous avian genes for transcriptional activation. Taken together, our results represent the first report of genome-wide TSS mapping in birds and the first systematic developmental TSS analysis in any amniote species (birds and mammals). 
By facilitating promoter-based molecular analysis and genetic manipulation, our work also underscores the value of avian models in unravelling the complex regulatory mechanism of cell lineage specification during amniote development.
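The ">1 tag per million" criterion used above for robust TSS peaks amounts to a simple library-size normalization followed by a threshold. The sketch below shows only that arithmetic; the peak names and counts are hypothetical, and the study's full robustness criteria may involve more than this single cut-off.

```python
def tags_per_million(tag_counts):
    """Scale raw CAGE tag counts per peak to tags per million (TPM)."""
    total = sum(tag_counts.values())
    if total == 0:
        raise ValueError("library contains no tags")
    return {peak: count * 1_000_000 / total for peak, count in tag_counts.items()}


def robust_peaks(tag_counts, min_tpm=1.0):
    """Return the sorted names of peaks whose TPM exceeds min_tpm."""
    tpm = tags_per_million(tag_counts)
    return sorted(peak for peak, value in tpm.items() if value > min_tpm)


# Hypothetical library of 2,000,000 tags spread over three peaks.
counts = {"peak_a": 5, "peak_b": 1, "peak_c": 1_999_994}
print(robust_peaks(counts))
```

Normalizing to library size first makes the threshold comparable between developmental stages that were sequenced to different depths.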
Davey, Mark W; Graham, Neil S; Vanholme, Bartel; Swennen, Rony; May, Sean T; Keulemans, Johan
2009-01-01
Background 'Systems-wide' approaches such as microarray RNA-profiling are ideally suited to the study of the complex overlapping responses of plants to biotic and abiotic stresses. However, commercial microarrays are only available for a limited number of plant species and development costs are so substantial as to be prohibitive for most research groups. Here we evaluate the use of cross-hybridisation to Affymetrix oligonucleotide GeneChip® microarrays to profile the response of the banana (Musa spp.) leaf transcriptome to drought stress using a genomic DNA (gDNA)-based probe-selection strategy to improve the efficiency of detection of differentially expressed Musa transcripts. Results Following cross-hybridisation of Musa gDNA to the Rice GeneChip® Genome Array, ~33,700 gene-specific probe-sets had a sufficiently high degree of homology to be retained for transcriptomic analyses. In a proof-of-concept approach, pooled RNA representing a single biological replicate of control and drought stressed leaves of the Musa cultivar 'Cachaco' were hybridised to the Affymetrix Rice Genome Array. A total of 2,910 Musa gene homologues with a >2-fold difference in expression levels were subsequently identified. These drought-responsive transcripts included many functional classes associated with plant biotic and abiotic stress responses, as well as a range of regulatory genes known to be involved in coordinating abiotic stress responses. This latter group included members of the ERF, DREB, MYB, bZIP and bHLH transcription factor families. Fifty-two of these drought-sensitive Musa transcripts were homologous to genes underlying QTLs for drought and cold tolerance in rice, including in 2 instances QTLs associated with a single underlying gene. The list of drought-responsive transcripts also included genes identified in publicly-available comparative transcriptomics experiments. 
Conclusion Our results demonstrate that despite the general paucity of nucleotide sequence data in Musa and only distant phylogenetic relations to rice, gDNA probe-based cross-hybridisation to the Rice GeneChip® is a highly promising strategy to study complex biological responses and illustrates the potential of such strategies for gene discovery in non-model species. PMID:19758430
Genome-wide association between DNA methylation and alternative splicing in an invertebrate
2012-01-01
Background Gene bodies are the most evolutionarily conserved targets of DNA methylation in eukaryotes. However, the regulatory functions of gene body DNA methylation remain largely unknown. DNA methylation in insects appears to be primarily confined to exons. Two recent studies in Apis mellifera (honeybee) and Nasonia vitripennis (jewel wasp) analyzed transcription and DNA methylation data for one gene in each species to demonstrate that exon-specific DNA methylation may be associated with alternative splicing events. In this study we investigated the relationship between DNA methylation, alternative splicing, and cross-species gene conservation on a genome-wide scale using genome-wide transcription and DNA methylation data. Results We generated RNA deep sequencing data (RNA-seq) to measure genome-wide mRNA expression at the exon and gene levels. We produced a de novo transcriptome from these RNA-seq data and computationally predicted splice variants for the honeybee genome. We found that exons that are included in transcription are more highly methylated than exons that are skipped during transcription. Using Fisher's exact test, we detected enrichment for alternative splicing among methylated genes compared to unmethylated genes. A statistical analysis revealed that the presence of DNA methylation and the presence of alternative splicing are each associated with a longer gene length and a greater number of exons in genes. In concordance with this observation, a conservation analysis using BLAST revealed that each of these factors is also associated with higher cross-species gene conservation. Conclusions This study constitutes the first genome-wide analysis exhibiting a positive relationship between exon-level DNA methylation and mRNA expression in the honeybee. 
Our finding that methylated genes are enriched for alternative splicing suggests that, in invertebrates, exon-level DNA methylation may play a role in the construction of splice variants by positively influencing exon inclusion during transcription. The results from our cross-species homology analysis suggest that DNA methylation and alternative splicing are genetic mechanisms whose utilization could contribute to a longer gene length and a slower rate of gene evolution. PMID:22978521
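The enrichment test mentioned in this abstract can be illustrated with a minimal sketch. It assumes a 2x2 table of gene counts (methylated vs. unmethylated, alternatively spliced vs. not) and a one-sided test for enrichment; the function name and the counts are hypothetical and are not taken from the study.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test for enrichment in a 2x2 table.

    Hypothetical table layout:
                        alt. spliced   not alt. spliced
        methylated           a                b
        unmethylated         c                d

    Returns P(X >= a) under the hypergeometric null, i.e. the chance of
    seeing at least `a` alternatively spliced genes among the methylated
    genes if splicing and methylation were unrelated.
    """
    methylated = a + b          # row total
    spliced = a + c             # column total
    total = a + b + c + d
    upper = min(methylated, spliced)
    # Sum the hypergeometric probabilities of all tables at least as extreme.
    numerator = sum(
        comb(methylated, k) * comb(total - methylated, spliced - k)
        for k in range(a, upper + 1)
    )
    return numerator / comb(total, spliced)


# Illustrative counts only (not data from the abstract).
p_value = fisher_exact_greater(3, 1, 1, 3)
print(f"one-sided p = {p_value:.4f}")
```

A small p-value here would indicate that alternatively spliced genes are over-represented among methylated genes; with genome-scale counts an established statistics library would normally be used instead of this hand-written version.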
Kuang, Zheng; Ji, Zhicheng; Boeke, Jef D; Ji, Hongkai
2018-01-09
Biological processes are usually associated with genome-wide remodeling of transcription driven by transcription factors (TFs). Identifying key TFs and their spatiotemporal binding patterns is indispensable to understanding how dynamic processes are programmed. However, most methods are designed to predict TF binding sites only. We present a computational method, dynamic motif occupancy analysis (DynaMO), to infer important TFs and their spatiotemporal binding activities in dynamic biological processes using chromatin profiling data from multiple biological conditions, such as time-course histone modification ChIP-seq data. In the first step, DynaMO predicts TF binding sites with a random forests approach. Next and uniquely, DynaMO infers dynamic TF binding activities at predicted binding sites using their local chromatin profiles from multiple biological conditions. Another distinctive feature of DynaMO is the identification of key TFs in a dynamic process using a clustering and enrichment analysis of dynamic TF binding patterns. Application of DynaMO to the yeast ultradian cycle, mouse circadian clock and human neural differentiation demonstrates its accuracy and versatility. We anticipate DynaMO will be generally useful for elucidating transcriptional programs in dynamic processes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Qi, Yanxiang; Liu, Xiaomei; Pu, Jinji
2018-01-01
NAC transcription factors are involved in plant development and in the response to various stress stimuli. However, little information is available concerning the NAC family in the woodland strawberry. Herein, 37 NAC genes were identified from the woodland strawberry genome and were classified into 13 groups based on phylogenetic analysis. Further analyses of gene structure and conserved motifs showed that members of each subgroup are closely related. Quantitative real-time PCR evaluation of different tissues revealed distinct spatial expression profiles of the FvNAC genes. The expression of FvNAC genes was comprehensively profiled under abiotic stress (cold, heat, drought, salt), signal molecule treatments (H2O2, ABA, melatonin, rapamycin), and biotic stress (Colletotrichum gloeosporioides and Ralstonia solanacearum). Expression profiles derived from quantitative real-time PCR suggested that 5 FvNAC genes responded dramatically to the various abiotic and biotic stresses, indicating their contribution to abiotic and biotic stress resistance in woodland strawberry. Interestingly, FvNAC genes responded to a greater extent to cold treatment than to the other abiotic stresses, and H2O2 elicited a greater response than ABA, melatonin, and rapamycin. For biotic stresses, 3 FvNAC genes were up-regulated during infection with C. gloeosporioides, while 6 FvNAC genes were down-regulated during infection with R. solanacearum. In conclusion, this study identified candidate FvNAC genes to be used for the genetic improvement of abiotic and biotic stress tolerance in woodland strawberry. PMID:29897926
2012-01-01
Background The molecular mechanisms altered by the traditional mutation and screening approach during the improvement of antibiotic-producing microorganisms are still poorly understood, although this information is essential to design rational strategies for industrial strain improvement. In this study, we applied comparative genomics to identify all genetic changes occurring during the development of an erythromycin overproducer obtained using the traditional mutate-and-screen method. Results Compared with the parental Saccharopolyspora erythraea NRRL 2338, the genome of the overproducing strain presents 117 deletion, 78 insertion and 12 transposition sites, with 71 insertion/deletion sites mapping within coding sequences (CDSs) and generating frame-shift mutations. Single nucleotide variations are present in 144 CDSs. Overall, the genomic variations affect 227 proteins of the overproducing strain and a considerable number of mutations alter genes of key enzymes in the central carbon and nitrogen metabolism and in the biosynthesis of secondary metabolites, resulting in the redirection of common precursors toward erythromycin biosynthesis. Interestingly, several mutations inactivate genes coding for proteins that play fundamental roles in basic transcription and translation machineries including the transcription anti-termination factor NusB and the transcription elongation factor Efp. These mutations, along with those affecting genes coding for pleiotropic or pathway-specific regulators, affect the global expression profile, as demonstrated by a comparative analysis of the parental and overproducer expression profiles. Genomic data, finally, suggest that the mutate-and-screen process might have been accelerated by mutations in DNA repair genes. Conclusions This study helps to clarify the mechanisms underlying antibiotic overproduction, providing valuable information about new possible molecular targets for rational strain improvement. PMID:22401291
Genomic Signal Processing: Predicting Basic Molecular Biological Principles
NASA Astrophysics Data System (ADS)
Alter, Orly
2005-03-01
Advances in high-throughput technologies enable acquisition of different types of molecular biological data, monitoring the flow of biological information as DNA is transcribed to RNA, and RNA is translated to proteins, on a genomic scale. Future discovery in biology and medicine will come from the mathematical modeling of these data, which hold the key to fundamental understanding of life on the molecular level, as well as answers to questions regarding diagnosis, treatment and drug development. Recently we described data-driven models for genome-scale molecular biological data, which use singular value decomposition (SVD) and the comparative generalized SVD (GSVD). Now we describe an integrative data-driven model, which uses pseudoinverse projection (1). We also demonstrate the predictive power of these matrix algebra models (2). The integrative pseudoinverse projection model formulates any number of genome-scale molecular biological data sets in terms of one chosen set of data samples, or of profiles extracted mathematically from data samples, designated the "basis" set. The mathematical variables of this integrative model, the pseudoinverse correlation patterns that are uncovered in the data, represent independent processes and corresponding cellular states (such as observed genome-wide effects of known regulators or transcription factors, the biological components of the cellular machinery that generate the genomic signals, and measured samples in which these regulators or transcription factors are over- or underactive). Reconstruction of the data in the basis simulates experimental observation of only the cellular states manifest in the data that correspond to those of the basis. 
Classification of the data samples according to their reconstruction in the basis, rather than their overall measured profiles, maps the cellular states of the data onto those of the basis, and gives a global picture of the correlations and possibly also causal coordination of these two sets of states. Mapping genome-scale protein binding data using pseudoinverse projection onto patterns of RNA expression data that had been extracted by SVD and GSVD, a novel correlation between DNA replication initiation and RNA transcription during the cell cycle in yeast, that might be due to a previously unknown mechanism of regulation, is predicted. (1) Alter & Golub, Proc. Natl. Acad. Sci. USA 101, 16577 (2004). (2) Alter, Golub, Brown & Botstein, Miami Nat. Biotechnol. Winter Symp. 2004 (www.med.miami.edu/mnbws/alter-.pdf)
Swindell, William R; Johnston, Andrew; Carbajal, Steve; Han, Gangwen; Wohn, Christian; Lu, Jun; Xing, Xianying; Nair, Rajan P; Voorhees, John J; Elder, James T; Wang, Xiao-Jing; Sano, Shigetoshi; Prens, Errol P; DiGiovanni, John; Pittelkow, Mark R; Ward, Nicole L; Gudjonsson, Johann E
2011-04-04
Development of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features of the human disease, and standardized validation criteria for psoriasis mouse models have not been widely applied. In this study, whole-genome transcriptional profiling is used to compare gene expression patterns manifested by human psoriatic skin lesions with those that occur in five psoriasis mouse models (K5-Tie2, imiquimod, K14-AREG, K5-Stat3C and K5-TGFbeta1). While the cutaneous gene expression profiles associated with each mouse phenotype exhibited statistically significant similarity to the expression profile of psoriasis in humans, each model displayed distinctive sets of similarities and differences in comparison to human psoriasis. For all five models, correspondence to the human disease was strong with respect to genes involved in epidermal development and keratinization. Immune and inflammation-associated gene expression, in contrast, was more variable between models as compared to the human disease. These findings support the value of all five models as research tools, each with identifiable areas of convergence to and divergence from the human disease. Additionally, the approach used in this paper provides an objective and quantitative method for evaluation of proposed mouse models of psoriasis, which can be strategically applied in future studies to score strengths of mouse phenotypes relative to specific aspects of human psoriasis.
Liu, Yan; Guan, Xiaoyu; Liu, Shengnan; Yang, Meng; Ren, Junhui; Guo, Meng; Huang, Zhihui; Zhang, Yaowei
2018-03-14
Chinese cabbage (Brassica rapa L. ssp. pekinensis) is a widely cultivated and economically important vegetable crop with typical leaf curvature. The TCP (Teosinte branched1, Cycloidea, Proliferating cell factor) family proteins are plant-specific transcription factors (TFs) and play important roles in many plant biological processes, especially in the regulation of leaf curvature. In this study, 39 genes encoding TCP TFs are detected on the whole genome of B. rapa. Based on the phylogenetic analysis of TCPs between Arabidopsis thaliana and Brassica rapa, TCP genes of Chinese cabbage are named from BrTCP1a to BrTCP24b. Moreover, the chromosomal location; phylogenetic relationships among B. rapa, A. thaliana, and rice; gene structures and protein conserved sequence alignment; and conserved domains are analyzed. The expression profiles of BrTCPs are analyzed in different tissues. To understand the role of Chinese cabbage TCP members in regulating the curvature of leaves, the expression patterns of all BrTCP genes are detected at three development stages essential for leafy head formation. Our results provide information on the classification and details of BrTCPs and allow us to better understand the function of TCPs involved in leaf curvature of Chinese cabbage.
Liu, Yan; Guan, Xiaoyu; Liu, Shengnan; Yang, Meng; Ren, Junhui; Guo, Meng; Huang, Zhihui
2018-01-01
Chinese cabbage (Brassica rapa L. ssp. pekinensis) is a widely cultivated and economically important vegetable crop with typical leaf curvature. The TCP (Teosinte branched1, Cycloidea, Proliferating cell factor) family proteins are plant-specific transcription factors (TFs) and play important roles in many plant biological processes, especially in the regulation of leaf curvature. In this study, 39 genes encoding TCP TFs are detected on the whole genome of B. rapa. Based on the phylogenetic analysis of TCPs between Arabidopsis thaliana and Brassica rapa, TCP genes of Chinese cabbage are named from BrTCP1a to BrTCP24b. Moreover, the chromosomal location; phylogenetic relationships among B. rapa, A. thaliana, and rice; gene structures and protein conserved sequence alignment; and conserved domains are analyzed. The expression profiles of BrTCPs are analyzed in different tissues. To understand the role of Chinese cabbage TCP members in regulating the curvature of leaves, the expression patterns of all BrTCP genes are detected at three development stages essential for leafy head formation. Our results provide information on the classification and details of BrTCPs and allow us to better understand the function of TCPs involved in leaf curvature of Chinese cabbage. PMID:29538304
Muthamilarasan, Mehanathan; Khandelwal, Rohit; Yadav, Chandra Bhan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj
2014-01-01
MYB proteins represent one of the largest transcription factor families in plants, playing important roles in diverse developmental and stress-responsive processes. Considering its significance, several genome-wide analyses have been conducted in almost all land plants except foxtail millet. Foxtail millet (Setaria italica L.) is a model crop for investigating systems biology of millets and bioenergy grasses. Further, the crop is also known for its potential abiotic stress-tolerance. In this context, a comprehensive genome-wide survey was conducted and 209 MYB protein-encoding genes were identified in foxtail millet. All 209 S. italica MYB (SiMYB) genes were physically mapped onto nine chromosomes of foxtail millet. A gene duplication study showed that segmental and tandem duplications have occurred in the genome, resulting in expansion of this gene family. The protein domain investigation classified SiMYB proteins into three classes according to the number of MYB repeats present. The phylogenetic analysis categorized SiMYBs into ten groups (I-X). SiMYB-based comparative mapping revealed a maximum orthology between foxtail millet and sorghum, followed by maize, rice and Brachypodium. Heat map analysis showed tissue-specific expression patterns of predominant SiMYB genes. Expression profiling of candidate MYB genes against abiotic stresses and hormone treatments using qRT-PCR revealed specific and/or overlapping expression patterns of SiMYBs. Taken together, the present study provides a foundation for evolutionary and functional characterization of MYB TFs in foxtail millet to dissect their functions in response to environmental stimuli.
Transcriptome changes and cAMP oscillations in an archaeal cell cycle.
Baumann, Anke; Lange, Christian; Soppa, Jörg
2007-06-11
The cell cycle of all organisms includes mass increase by a factor of two, replication of the genetic material, segregation of the genome to different parts of the cell, and cell division into two daughter cells. It is tightly regulated and typically includes cell cycle-specific oscillations of the levels of transcripts, proteins, protein modifications, and signaling molecules. Until now cell cycle-specific transcriptome changes have been described for four eukaryotic species ranging from yeast to human, but only for two prokaryotic species. Similarly, oscillations of small signaling molecules have been identified in very few eukaryotic species, but not in any prokaryote. A synchronization procedure for the archaeon Halobacterium salinarum was optimized, so that nearly 100% of all cells divide in a time interval that is 1/4th of the generation time of exponentially growing cells. The method was used to characterize cell cycle-dependent transcriptome changes using a genome-wide DNA microarray. The transcript levels of 87 genes were found to be cell cycle-regulated, corresponding to 3% of all genes. They could be clustered into seven groups with different transcript level profiles. Cluster-specific sequence motifs were detected around the start of the genes that are predicted to be involved in cell cycle-specific transcriptional regulation. Notably, many cell cycle genes that have oscillating transcript levels in eukaryotes are not regulated on the transcriptional level in H. salinarum. Synchronized cultures were also used to identify putative small signaling molecules. H. salinarum was found to contain a basal cAMP concentration of 200 µM, considerably higher than that of yeast. The cAMP concentration is briefly induced directly prior to and after cell division, and thus cAMP probably is an important signal for cell cycle progression. The analysis of cell cycle-specific transcriptome changes of H. salinarum allowed the identification of a strategy of transcript level regulation that is different from all previously characterized species. The transcript levels of only 3% of all genes are regulated, a fraction that is considerably lower than has been reported for four eukaryotic species (6%-28%) and for the bacterium C. crescentus (19%). It was shown that cAMP is present in significant concentrations in an archaeon, and the phylogenetic profile of the adenylate cyclase indicates that this signaling molecule is widely distributed in archaea. The occurrence of cell cycle-dependent oscillations of the cAMP concentration in an archaeon and in several eukaryotic species indicates that cAMP level changes might be a phylogenetically old signal for cell cycle progression.
Comparative analysis reveals genomic features of stress-induced transcriptional readthrough
Vilborg, Anna; Sabath, Niv; Wiesel, Yuval; Nathans, Jenny; Levy-Adam, Flonia; Yario, Therese A.; Steitz, Joan A.; Shalgi, Reut
2017-01-01
Transcription is a highly regulated process, and stress-induced changes in gene transcription have been shown to play a major role in stress responses and adaptation. Genome-wide studies reveal prevalent transcription beyond known protein-coding gene loci, generating a variety of RNA classes, most of unknown function. One such class, termed downstream of gene-containing transcripts (DoGs), was reported to result from transcriptional readthrough upon osmotic stress in human cells. However, how widespread the readthrough phenomenon is, and what its causes and consequences are, remain elusive. Here we present a genome-wide mapping of transcriptional readthrough, using nuclear RNA-Seq, comparing heat shock, osmotic stress, and oxidative stress in NIH 3T3 mouse fibroblast cells. We observe massive induction of transcriptional readthrough, both in levels and length, under all stress conditions, with significant, yet not complete, overlap of readthrough-induced loci between different conditions. Importantly, our analyses suggest that stress-induced transcriptional readthrough is not a random failure process, but is rather differentially induced across different conditions. We explore potential regulators and find a role for HSF1 in the induction of a subset of heat shock-induced readthrough transcripts. Analysis of public datasets detected increases in polymerase II occupancy in DoG regions after heat shock, supporting our findings. Interestingly, DoGs tend to be produced in the vicinity of neighboring genes, leading to a marked increase in their antisense-generating potential. Finally, we examine genomic features of readthrough transcription and observe a unique chromatin signature typical of DoG-producing regions, suggesting that readthrough transcription is associated with the maintenance of an open chromatin state. PMID:28928151
Genome-wide mapping of infection-induced SINE RNAs reveals a role in selective mRNA export.
Karijolich, John; Zhao, Yang; Alla, Ravi; Glaunsinger, Britt
2017-06-02
Short interspersed nuclear elements (SINEs) are retrotransposons evolutionarily derived from endogenous RNA Polymerase III RNAs. Though SINE elements have undergone exaptation into gene regulatory elements, how transcribed SINE RNA impacts transcriptional and post-transcriptional regulation is largely unknown. This is partly due to a lack of information regarding which of the loci have transcriptional potential. Here, we present an approach (short interspersed nuclear element sequencing, SINE-seq), which selectively profiles RNA Polymerase III-derived SINE RNA, thereby identifying transcriptionally active SINE loci. Applying SINE-seq to monitor murine B2 SINE expression during a gammaherpesvirus infection revealed transcription from 28 270 SINE loci, with ∼50% of active SINE elements residing within annotated RNA Polymerase II loci. Furthermore, B2 RNA can form intermolecular RNA-RNA interactions with complementary mRNAs, leading to nuclear retention of the targeted mRNA via a mechanism involving p54nrb. These findings illuminate a pathway for the selective regulation of mRNA export during stress via retrotransposon activation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genome-wide mapping of infection-induced SINE RNAs reveals a role in selective mRNA export
Zhao, Yang; Alla, Ravi
2017-01-01
Short interspersed nuclear elements (SINEs) are retrotransposons evolutionarily derived from endogenous RNA Polymerase III RNAs. Though SINE elements have undergone exaptation into gene regulatory elements, how transcribed SINE RNA impacts transcriptional and post-transcriptional regulation is largely unknown. This is partly due to a lack of information regarding which of the loci have transcriptional potential. Here, we present an approach (short interspersed nuclear element sequencing, SINE-seq), which selectively profiles RNA Polymerase III-derived SINE RNA, thereby identifying transcriptionally active SINE loci. Applying SINE-seq to monitor murine B2 SINE expression during a gammaherpesvirus infection revealed transcription from 28 270 SINE loci, with ∼50% of active SINE elements residing within annotated RNA Polymerase II loci. Furthermore, B2 RNA can form intermolecular RNA–RNA interactions with complementary mRNAs, leading to nuclear retention of the targeted mRNA via a mechanism involving p54nrb. These findings illuminate a pathway for the selective regulation of mRNA export during stress via retrotransposon activation. PMID:28334904
Scheider, Jessica; Afonso-Grunz, Fabian; Jessl, Luzie; Hoffmeier, Klaus; Winter, Peter; Oehlmann, Jörg
2018-03-01
Morphological malformations induced by tributyltin (TBT) exposure during embryonic development have already been characterized in various taxonomic groups, but, nonetheless, the molecular processes underlying these changes remain obscure. The present study provides the first genome-wide screening for differentially expressed genes that are linked to morphological alterations of gonadal tissue from chicken embryos after exposure to TBT. We applied a single injection of TBT (between 0.5 and 30 pg as Sn/g egg) into incubated fertile eggs to simulate maternal transfer of the endocrine disruptive compound. Methyltestosterone (MT) served as a positive control (30 pg/g egg). After 19 days of incubation, structural features of the gonads as well as genome-wide gene expression profiles were assessed simultaneously. TBT induced significant morphological and histological malformations of gonadal tissue from female embryos that show a virilization of the ovaries. This phenotypical virilization was mirrored by altered expression profiles of sex-dependent genes. Among these are several transcription and growth factors (e.g. FGF12, CTCF, NFIB), whose altered expression might serve as a set of markers for early identification of endocrine active chemicals that affect embryonic development by transcriptome profiling without the need of elaborate histological analyses. Copyright © 2017 The Author(s). Published by Elsevier B.V. All rights reserved.
Xu, Jiawei; Bao, Xiao; Peng, Zhaofeng; Wang, Linlin; Du, Linqing; Niu, Wenbin; Sun, Yingpu
2016-05-10
Polycystic ovary syndrome (PCOS) affects approximately 7% of reproductive-age women. A growing body of evidence indicates that epigenetic mechanisms contribute to the development of PCOS. The role of DNA modification in ovarian granulosa cells during PCOS progression is still unknown. Global DNA methylation and hydroxymethylation were compared between granulosa cells from PCOS patients and controls. Genome-wide DNA methylation was profiled to investigate the putative function of DNA methylation. The expression of selected genes was compared between granulosa cells from PCOS patients and controls. Our results showed that global DNA methylation in the granulosa cells of PCOS patients was significantly higher than in controls. Global DNA hydroxymethylation was low and did not differ significantly between PCOS patients and controls. We identified 6936 differentially methylated CpG sites between the control and PCOS-obesity groups, 12245 between the control and PCOS-nonobesity groups, and 5202 between the PCOS-obesity and PCOS-nonobesity groups. Our results showed that DNA methylation, but not hydroxymethylation, was altered genome-wide in PCOS granulosa cells. The differentially methylated genes were enriched in developmental proteins, transcription factor activity, alternative splicing, sequence-specific DNA binding and embryonic morphogenesis. YWHAQ, NCF2, DHRS9 and SCNA were up-regulated in PCOS-obesity patients, with no significant difference between control and PCOS-nonobesity patients; this up-regulation may be driven by lower DNA methylation. Global and genome-wide alterations in DNA methylation may contribute to differential gene expression and the clinical pathology of PCOS.
Moskvin, Oleg V; Bolotin, Dmitry; Wang, Andrew; Ivanov, Pavel S; Gomelsky, Mark
2011-02-01
We present Rhodobase, a web-based meta-analytical tool for analysis of transcriptional regulation in a model anoxygenic photosynthetic bacterium, Rhodobacter sphaeroides. The gene association meta-analysis is based on the pooled data from 100 R. sphaeroides whole-genome DNA microarrays. Gene-centric regulatory networks were visualized using the StarNet approach (Jupiter, D.C., VanBuren, V., 2008. A visual data mining tool that facilitates reconstruction of transcription regulatory networks. PLoS ONE 3, e1717) with several modifications. We developed a means to identify and visualize operons and superoperons. We designed a framework for the cross-genome search for transcription factor binding sites that takes into account the high GC content and oligonucleotide usage profile characteristic of the R. sphaeroides genome. To facilitate reconstruction of directional relationships between co-regulated genes, we screened upstream sequences (-400 to +20 bp from start codons) of all genes for putative binding sites of bacterial transcription factors using a self-optimizing search method developed here. To test the performance of the meta-analysis tools and transcription factor site predictions, we reconstructed selected nodes of the R. sphaeroides transcription factor-centric regulatory matrix. The test revealed regulatory relationships that correlate well with the experimentally derived data. The database of transcriptional profile correlations, the network visualization engine and the optimized search engine for transcription factor binding site analysis are available at http://rhodobase.org. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Charlesworth, Jac C; Peralta, Juan M; Drigalenko, Eugene; Göring, Harald Hh; Almasy, Laura; Dyer, Thomas D; Blangero, John
2009-12-15
Gene identification using linkage, association, or genome-wide expression is often underpowered. We propose that formal combination of information from multiple gene-identification approaches may lead to the identification of novel loci that are missed when only one form of information is available. Firstly, we analyze the Genetic Analysis Workshop 16 Framingham Heart Study Problem 2 genome-wide association data for HDL-cholesterol using a "gene-centric" approach. Then we formally combine the association test results with genome-wide transcriptional profiling data for high-density lipoprotein cholesterol (HDL-C), from the San Antonio Family Heart Study, using a Z-transform test (Stouffer's method). We identified 39 genes by the joint test at a conservative 1% false-discovery rate, including 9 from the significant gene-based association test and 23 whose expression was significantly correlated with HDL-C. Seven genes identified as significant in the joint test were not independently identified by either the association or expression tests. This combined approach has increased power and leads to the direct nomination of novel candidate genes likely to be involved in the determination of HDL-C levels. Such information can then be used as justification for a more exhaustive search for functional sequence variation within the nominated genes. We anticipate that this type of analysis will improve our speed of identification of regulatory genes causally involved in disease risk.
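The Z-transform (Stouffer) combination named in this abstract can be sketched as follows. This is a minimal illustration, assuming independent one-sided p-values and optional weights; the function name, the equal default weights and the example p-values are hypothetical and do not reproduce the authors' analysis.

```python
from math import sqrt
from statistics import NormalDist


def stouffer_combine(p_values, weights=None):
    """Combine one-sided p-values with Stouffer's Z-transform method.

    Each p-value is converted to a standard normal score,
    z_i = Phi^{-1}(1 - p_i), and the scores are pooled as
    Z = sum(w_i * z_i) / sqrt(sum(w_i ** 2)).
    Returns (Z, combined one-sided p-value).
    Assumes every p-value lies strictly between 0 and 1.
    """
    norm = NormalDist()  # standard normal: mean 0, standard deviation 1
    if weights is None:
        weights = [1.0] * len(p_values)  # assumption: equal weighting
    z_scores = [norm.inv_cdf(1.0 - p) for p in p_values]
    pooled = sum(w * z for w, z in zip(weights, z_scores))
    z = pooled / sqrt(sum(w * w for w in weights))
    return z, 1.0 - norm.cdf(z)


# Illustrative values only: one gene's association p-value and its
# expression-correlation p-value.
z, p = stouffer_combine([0.05, 0.05])
print(f"Z = {z:.3f}, combined p = {p:.4f}")
```

Two moderately small p-values combine into a smaller joint p-value, which is how a joint test can nominate genes that neither the association test nor the expression test flags on its own.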
In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features.
Ding, Yiliang; Tang, Yin; Kwok, Chun Kit; Zhang, Yu; Bevilacqua, Philip C; Assmann, Sarah M
2014-01-30
RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.
USDA-ARS's Scientific Manuscript database
Transcription factors (TFs) are proteins that regulate the expression of target genes by binding to specific elements in their regulatory regions. Transcriptional regulators (TRs) also regulate the expression of target genes; however, they operate indirectly via interaction with the basal transcript...
Demissie, Serkalem; Soranzo, Nicole; Bianchi, Estelle N.; Grundberg, Elin; Liang, Liming; Richards, J. Brent; Estrada, Karol; Zhou, Yanhua; van Nas, Atila; Moffatt, Miriam F.; Zhai, Guangju; Hofman, Albert; van Meurs, Joyce B.; Pols, Huibert A. P.; Price, Roger I.; Nilsson, Olle; Pastinen, Tomi; Cupples, L. Adrienne; Lusis, Aldons J.; Schadt, Eric E.; Ferrari, Serge; Uitterlinden, André G.
2010-01-01
Osteoporosis is a complex disorder and commonly leads to fractures in elderly persons. Genome-wide association studies (GWAS) have become an unbiased approach to identify variations in the genome that potentially affect health. However, the genetic variants identified so far only explain a small proportion of the heritability for complex traits. Due to the modest genetic effect size and inadequate power, true association signals may not be revealed based on a stringent genome-wide significance threshold. Here, we take advantage of SNP and transcript arrays and integrate GWAS and expression signature profiling relevant to the skeletal system in cellular and animal models to prioritize the discovery of novel candidate genes for osteoporosis-related traits, including bone mineral density (BMD) at the lumbar spine (LS) and femoral neck (FN), as well as geometric indices of the hip (femoral neck-shaft angle, NSA; femoral neck length, NL; and narrow-neck width, NW). A two-stage meta-analysis of GWAS from 7,633 Caucasian women and 3,657 men, revealed three novel loci associated with osteoporosis-related traits, including chromosome 1p13.2 (RAP1A, p = 3.6×10−8), 2q11.2 (TBC1D8), and 18q11.2 (OSBPL1A), and confirmed a previously reported region near TNFRSF11B/OPG gene. We also prioritized 16 suggestive genome-wide significant candidate genes based on their potential involvement in skeletal metabolism. Among them, 3 candidate genes were associated with BMD in women. Notably, 2 out of these 3 genes (GPR177, p = 2.6×10−13; SOX6, p = 6.4×10−10) associated with BMD in women have been successfully replicated in a large-scale meta-analysis of BMD, but none of the non-prioritized candidates (associated with BMD) did. Our results support the concept of our prioritization strategy. 
In the absence of direct biological support for identified genes, we highlighted the efficiency of subsequent functional characterization using publicly available expression profiling relevant to the skeletal system in cellular or whole animal models to prioritize candidate genes for further functional validation. PMID:20548944
The Human High-Grade Glioma Interactome (HGi) contains a genome-wide complement of molecular interactions that are Glioblastoma Multiforme (GBM)-specific. HGi v3 contains the post-transcriptional layer of the HGi, which includes the miRNA-target (RNA-RNA) layer of the interactome.
Tai, Phillip W L; Wu, Hai; van Wijnen, André J; Stein, Gary S; Stein, Janet L; Lian, Jane B
2017-01-01
The ability to discover regulatory sequences that control bone-related genes during development has been greatly improved by massively parallel sequencing methodologies. To expand our understanding of cis-regulatory regions critical to the control of gene expression during osteoblastogenesis, we probed the presence of open chromatin states across the osteoblast genome using global DNase hypersensitivity (DHS) mapping. Our profiling of MC3T3 mouse pre-osteoblasts during differentiation has identified more than 224,000 unique DHS sites. Approximately 65% of these sites are dynamic during temporal stages of osteoblastogenesis, and a majority of them are located within non-promoter (intergenic and intronic) regions. Nearly half of all DHS sites (both constitutive and dynamic) overlap binding events of the bone-essential RUNX2 and/or the chromatin-related CTCF transcription factors. This finding reinforces the role of these regulatory proteins as essential components of the bone gene regulome. We observe a reduction in chromatin accessibility throughout the genome between pre-osteoblasts and early osteoblasts. Our analysis also defined a class of differentially expressed genes that harbor DHS peaks centered within 1 kb downstream of transcriptional end sites (TES). These DHSs at the 3'-flanks of genes exhibit dynamic changes during differentiation that may impact regulation of the osteoblast genome. Taken together, the distribution of DHS regions within non-promoter locations harboring osteoblast- and chromatin-related transcription factor binding motifs reflects novel cis-regulatory requirements to support temporal gene expression in differentiating osteoblasts.
The Innate Immune Database (IIDB)
Korb, Martin; Rust, Aistair G; Thorsson, Vesteinn; Battail, Christophe; Li, Bin; Hwang, Daehee; Kennedy, Kathleen A; Roach, Jared C; Rosenberger, Carrie M; Gilchrist, Mark; Zak, Daniel; Johnson, Carrie; Marzolf, Bruz; Aderem, Alan; Shmulevich, Ilya; Bolouri, Hamid
2008-01-01
Background As part of a National Institute of Allergy and Infectious Diseases funded collaborative project, we have performed over 150 microarray experiments measuring the response of C57/BL6 mouse bone marrow macrophages to toll-like receptor stimuli. These microarray expression profiles are available freely from our project web site. Here, we report the development of a database of computationally predicted transcription factor binding sites and related genomic features for a set of over 2000 murine immune genes of interest. Our database, which includes microarray co-expression clusters and a host of web-based query, analysis and visualization facilities, is available freely via the internet. It provides a broad resource to the research community, and a stepping stone towards the delineation of the network of transcriptional regulatory interactions underlying the integrated response of macrophages to pathogens. Description We constructed a database indexed on genes and annotations of the immediate surrounding genomic regions. To facilitate both gene-specific and systems biology oriented research, our database provides the means to analyze individual genes or an entire genomic locus. Although our focus to date has been on mammalian toll-like receptor signaling pathways, our database structure is not limited to this subject, and is intended to be broadly applicable to immunology. By focusing on selected immune-active genes, we were able to perform computationally intensive expression and sequence analyses that would currently be prohibitive if applied to the entire genome. Using six complementary computational algorithms and methodologies, we identified transcription factor binding sites based on the Position Weight Matrices available in TRANSFAC. For one example transcription factor (ATF3) for which experimental data are available, over 50% of our predicted binding sites coincide with genome-wide chromatin immunoprecipitation (ChIP-chip) results.
Our database can be interrogated via a web interface. Genomic annotations and binding site predictions can be automatically viewed with a customized version of the Argo genome browser. Conclusion We present the Innate Immune Database (IIDB) as a community resource for immunologists interested in gene regulatory systems underlying innate responses to pathogens. The database website can be freely accessed at . PMID:18321385
System-level perturbations of cell metabolism using CRISPR/Cas9
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jakočiūnas, Tadas; Jensen, Michael K.; Keasling, Jay D.
CRISPR/Cas9 (clustered regularly interspaced short palindromic repeats and the associated protein Cas9) techniques have made genome engineering and transcriptional reprogramming studies much more advanced and cost-effective. For metabolic engineering purposes, the CRISPR-based tools have been applied to single and multiplex pathway modifications and transcriptional regulations. The effectiveness of these tools allows researchers to implement genome-wide perturbations, test model-guided genome editing strategies, and perform transcriptional reprogramming perturbations in a more advanced manner than previously possible. In this mini-review we highlight recent studies adopting CRISPR/Cas9 for systems-level perturbations and model-guided metabolic engineering.
2012-01-01
Background The Azadirachta indica (neem) tree is a source of a wide number of natural products, including the potent biopesticide azadirachtin. In spite of its widespread applications in agriculture and medicine, the molecular aspects of the biosynthesis of neem terpenoids remain largely unexplored. The current report describes the draft genome and four transcriptomes of A. indica and attempts to contextualise the sequence information in terms of its molecular phylogeny, transcript expression and terpenoid biosynthesis pathways. A. indica is the first member of the family Meliaceae to be sequenced using next generation sequencing approach. Results The genome and transcriptomes of A. indica were sequenced using multiple sequencing platforms and libraries. The A. indica genome is AT-rich, bears few repetitive DNA elements and comprises about 20,000 genes. The molecular phylogenetic analyses grouped A. indica together with Citrus sinensis from the Rutaceae family validating its conventional taxonomic classification. Comparative transcript expression analysis showed either exclusive or enhanced expression of known genes involved in neem terpenoid biosynthesis pathways compared to other sequenced angiosperms. Genome and transcriptome analyses in A. indica led to the identification of repeat elements, nucleotide composition and expression profiles of genes in various organs. Conclusions This study on A. indica genome and transcriptomes will provide a model for characterization of metabolic pathways involved in synthesis of bioactive compounds, comparative evolutionary studies among various Meliaceae family members and help annotate their genomes. A better understanding of molecular pathways involved in the azadirachtin synthesis in A. indica will pave ways for bulk production of environment friendly biopesticides. PMID:22958331
ChIP-seq and ChIP-exo profiling of Pol II, H2A.Z, and H3K4me3 in human K562 cells.
Mchaourab, Zenab F; Perreault, Andrea A; Venters, Bryan J
2018-03-06
The human K562 chronic myeloid leukemia cell line has long served as an experimental paradigm for functional genomic studies. To systematically and functionally annotate the human genome, the ENCODE consortium generated hundreds of functional genomic data sets, such as chromatin immunoprecipitation coupled to sequencing (ChIP-seq). While ChIP-seq analyses have provided tremendous insights into gene regulation, spatiotemporal insights were limited by a resolution of several hundred base pairs. ChIP-exonuclease (ChIP-exo) is a refined version of ChIP-seq that overcomes this limitation by providing higher precision mapping of protein-DNA interactions. To study the interplay of transcription initiation and chromatin, we profiled the genome-wide locations for RNA polymerase II (Pol II), the histone variant H2A.Z, and the histone modification H3K4me3 using ChIP-seq and ChIP-exo. In this Data Descriptor, we present detailed information on parallel experimental design, data generation, quality control analysis, and data validation. We discuss how these data lay the foundation for future analysis to understand the relationship between the occupancy of Pol II and nucleosome positions at near base pair resolution.
Wang, Hongfeng; Wang, Hongwei; Liu, Rong; Xu, Yiteng; Lu, Zhichao; Zhou, Chuanen
2018-01-01
TCP proteins, the plant-specific transcription factors, are involved in the regulation of multiple aspects of plant development among different species, such as leaf development, branching, and flower symmetry. However, thus far, the roles of TCPs in legumes, especially in nodulation, are still not clear. In this study, a genome-wide analysis of TCP genes was carried out to discover their evolution and function in Medicago truncatula. In total, 21 MtTCPs were identified and classified into class I and class II, and the class II MtTCPs were further divided into two subclasses, CIN and CYC/TB1. The expression profiles of MtTCPs are dramatically different. The universal expression of class I MtTCPs was detected in all organs. However, the MtTCPs in the CIN subclass were highly expressed in leaf and most of the members in the CYC/TB1 subclass were highly expressed in flower. Such organ-specific expression patterns of MtTCPs suggest their different roles in plant development. In addition, most MtTCPs were down-regulated during nodule development, except for the putative MtmiR319 targets, MtTCP3, MtTCP4, and MtTCP10A. Overexpression of MtmiR319A significantly reduced the expression level of MtTCP3/4/10A/10B and resulted in a decreased nodule number, indicating the important roles of MtmiR319-targeted MtTCPs in nodulation. Taken together, this study systematically analyzes the MtTCP gene family at a genome-wide level and their possible functions in nodulation, which lays the basis for further exploration of the MtmiR319/MtTCPs module in association with nodule development in M. truncatula.
Barnes, Kayla G.; Irving, Helen; Chiumia, Martin; Mzilahowa, Themba; Coleman, Michael; Hemingway, Janet; Wondji, Charles S.
2017-01-01
Resistance to pyrethroids, the sole insecticide class recommended for treating bed nets, threatens the control of major malaria vectors, including Anopheles funestus. Effective management of resistance requires an understanding of the dynamics and mechanisms driving resistance. Here, using genome-wide transcription and genetic diversity analyses, we show that a shift in the molecular basis of pyrethroid resistance in southern African populations of this species is associated with a restricted gene flow. Across the most highly endemic and densely populated regions in Malawi, An. funestus is resistant to pyrethroids, carbamates, and organochlorides. Genome-wide microarray-based transcription analysis identified overexpression of cytochrome P450 genes as the main mechanism driving this resistance. The most up-regulated genes include cytochrome P450s (CYP) CYP6P9a, CYP6P9b and CYP6M7. However, a significant shift in the overexpression profile of these genes was detected across a south/north transect, with CYP6P9a and CYP6P9b more highly overexpressed in the southern resistance front and CYP6M7 predominant in the northern front. A genome-wide genetic structure analysis of southern African populations of An. funestus from Zambia, Malawi, and Mozambique revealed a restriction of gene flow between populations, in line with the geographical variation observed in the transcriptomic analysis. Genetic polymorphism analysis of the three key resistance genes, CYP6P9a, CYP6P9b, and CYP6M7, support barriers to gene flow that are shaping the underlying molecular basis of pyrethroid resistance across southern Africa. This barrier to gene flow is likely to impact the design and implementation of resistance management strategies in the region. PMID:28003461
Zhu, Jufen; Yu, Xinxu; Xie, Baogui; Gu, Xiaokui; Zhang, Zhenying; Li, Shaojie
2013-06-01
To gain insight into the regulatory mechanisms of oxidative stress responses in filamentous fungi, the genome-wide transcriptional response of Neurospora crassa to menadione was analysed by digital gene expression (DGE) profiling, which identified 779 upregulated genes and 576 downregulated genes. Knockout mutants affecting 130 highly-upregulated genes were tested for menadione sensitivity, which revealed that loss of the transcription factor siderophore regulation (SRE) (a transcriptional repressor for siderophore biosynthesis), catalase-3, cytochrome c peroxidase or superoxide dismutase 1 copper chaperone causes hypersensitivity to menadione. Deletion of sre dramatically increased transcription of the siderophore biosynthesis gene ono and the siderophore iron transporter gene sit during menadione stress, suggesting that SRE is required for repression of iron uptake under oxidative stress conditions. Contrary to its phenotype, the sre deletion mutant showed higher transcriptional levels of genes encoding reactive oxygen species (ROS) scavengers than wild type during menadione stress, which implies that the mutant suffers a higher level of oxidative stress than wild type. Uncontrolled iron uptake in the sre mutant might exacerbate cellular oxidative stress. This is the first report of a negative regulator of iron assimilation participating in the fungal oxidative stress response. In addition to SRE, eight other transcription factor genes were also menadione-responsive but their single gene knockout mutants showed wild-type menadione sensitivity. Two of them, named mit-2 (menadione induced transcription factor-2) and mit-4 (menadione induced transcription factor-4), were selected for double mutant analysis. The double mutant was hypersensitive to menadione. Similarly, the double mutation of mit-2 and sre also had additive effects on menadione sensitivity, suggesting multiple transcription factors mediate oxidative stress resistance in an additive manner.
Copyright © 2013 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Cheung, Gordon Y C; Villaruz, Amer E; Joo, Hwang-Soo; Duong, Anthony C; Yeh, Anthony J; Nguyen, Thuan H; Sturdevant, Daniel E; Queck, S Y; Otto, M
2014-07-01
Several methicillin resistance (SCCmec) clusters characteristic of hospital-associated methicillin-resistant Staphylococcus aureus (MRSA) strains harbor the psm-mec locus. In addition to encoding the cytolysin, phenol-soluble modulin (PSM)-mec, this locus has been attributed gene regulatory functions. Here we employed genome-wide transcriptional profiling to define the regulatory function of the psm-mec locus. The immune evasion factor protein A emerged as the primary conserved and strongly regulated target of psm-mec, an effect we show is mediated by the psm-mec RNA. Furthermore, the psm-mec locus exerted regulatory effects that were more moderate in extent. For example, expression of PSM-mec limited expression of mecA, thereby decreasing methicillin resistance. Our study shows that the psm-mec locus has a rare dual regulatory RNA and encoded cytolysin function. Furthermore, our findings reveal a specific mechanism underscoring the recently emerging concept that S. aureus strains balance pronounced virulence and high expression of antibiotic resistance. Published by Elsevier GmbH.
North, Matthew; Tandon, Vickram J.; Thomas, Reuben; Loguinov, Alex; Gerlovina, Inna; Hubbard, Alan E.; Zhang, Luoping; Smith, Martyn T.; Vulpe, Chris D.
2011-01-01
Benzene is a ubiquitous environmental contaminant and is widely used in industry. Exposure to benzene causes a number of serious health problems, including blood disorders and leukemia. Benzene undergoes complex metabolism in humans, making mechanistic determination of benzene toxicity difficult. We used a functional genomics approach to identify the genes that modulate the cellular toxicity of three of the phenolic metabolites of benzene, hydroquinone (HQ), catechol (CAT) and 1,2,4-benzenetriol (BT), in the model eukaryote Saccharomyces cerevisiae. Benzene metabolites generate oxidative and cytoskeletal stress, and tolerance requires correct regulation of iron homeostasis and the vacuolar ATPase. We have identified a conserved bZIP transcription factor, Yap3p, as important for a HQ-specific response pathway, as well as two genes that encode putative NAD(P)H:quinone oxidoreductases, PST2 and YCP4. Many of the yeast genes identified have human orthologs that may modulate human benzene toxicity in a similar manner and could play a role in benzene exposure-related disease. PMID:21912624
Neymotin, Benjamin; Ettorre, Victoria; Gresham, David
2016-01-01
Degradation of mRNA contributes to variation in transcript abundance. Studies of individual mRNAs have shown that both cis and trans factors affect mRNA degradation rates. However, the factors underlying transcriptome-wide variation in mRNA degradation rates are poorly understood. We investigated the contribution of different transcript properties to transcriptome-wide degradation rate variation in the budding yeast, Saccharomyces cerevisiae, using multiple regression analysis. We find that multiple transcript properties are significantly associated with variation in mRNA degradation rates, and that a model incorporating these properties explains ∼50% of the genome-wide variance. Predictors of mRNA degradation rates include transcript length, ribosome density, biased codon usage, and GC content of the third position in codons. To experimentally validate these factors, we studied individual transcripts expressed from identical promoters. We find that decreasing ribosome density by mutating the first translational start site of a transcript increases its degradation rate. Using coding sequence variants of green fluorescent protein (GFP) that differ only at synonymous sites, we show that increased GC content of the third position of codons results in decreased rates of mRNA degradation. Thus, in steady-state conditions, a large fraction of genome-wide variation in mRNA degradation rates is determined by inherent properties of transcripts, many of which are related to translation, rather than specific regulatory mechanisms. PMID:27633789
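The multiple regression step described in this abstract can be illustrated with a minimal sketch. It fits an ordinary least-squares model to synthetic data and reports the fraction of variance explained (R²). Everything here is an assumption made for illustration: the four predictor columns stand in for standardized transcript properties, the effect sizes and noise level are invented so that R² lands near one half, and the solver is a plain normal-equations fit rather than the authors' analysis.

```python
import random


def solve_linear(a, b):
    """Solve a.x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_ols(rows, y):
    """Least-squares fit with an intercept; returns (coefficients, R^2)."""
    design = [[1.0] + list(r) for r in rows]
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    coef = solve_linear(xtx, xty)
    fitted = [sum(c * v for c, v in zip(coef, d)) for d in design]
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return coef, 1.0 - ss_res / ss_tot


# Synthetic stand-in data (NOT from the study): four standardized
# transcript properties per transcript and a noisy degradation rate.
rng = random.Random(0)
n = 400
props = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(n)]
beta = [0.5, -0.4, -0.3, -0.2]  # hypothetical effect sizes
signal = [sum(b * v for b, v in zip(beta, p)) for p in props]
rate = [s + rng.gauss(0, 0.735) for s in signal]  # noise variance ~ signal variance

coef, r2 = fit_ols(props, rate)
print("R^2 =", round(r2, 2))
```

Because the noise variance is set roughly equal to the signal variance, the model explains about half of the variance by construction, which mirrors the scale of the reported result without reproducing it.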
StereoGene: rapid estimation of genome-wide correlation of continuous or interval feature data.
Stavrovskaya, Elena D; Niranjan, Tejasvi; Fertig, Elana J; Wheelan, Sarah J; Favorov, Alexander V; Mironov, Andrey A
2017-10-15
Genomic features with similar genome-wide distributions are generally hypothesized to be functionally related; for example, colocalization of histones and transcription start sites indicates chromatin regulation of transcription factor activity. Therefore, statistical algorithms to perform spatial, genome-wide correlation among genomic features are required. Here, we propose a method, StereoGene, that rapidly estimates genome-wide correlation among pairs of genomic features. These features may represent high-throughput data mapped to a reference genome or sets of genomic annotations in that reference genome. StereoGene enables correlation of continuous data directly, avoiding the data binarization and subsequent data loss. Correlations are computed among neighboring genomic positions using kernel correlation. Representing the correlation as a function of the genome position, StereoGene outputs the local correlation track as part of the analysis. StereoGene also accounts for confounders such as input DNA by partial correlation. We apply our method to numerous comparisons of ChIP-Seq datasets from the Human Epigenome Atlas and FANTOM CAGE to demonstrate its wide applicability. We observe the changes in the correlation between epigenomic features across developmental trajectories of several tissue types consistent with known biology and find a novel spatial correlation of CAGE clusters with donor splice sites and with poly(A) sites. These analyses provide examples for the broad applicability of StereoGene for regulatory genomics. The StereoGene C++ source code, program documentation, Galaxy integration scripts and examples are available from the project homepage http://stereogene.bioinf.fbb.msu.ru/. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
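To make the idea of correlating neighboring positions concrete, the sketch below smooths two continuous tracks with a Gaussian kernel and then takes the Pearson correlation of the smoothed tracks, also returning a per-position product as a simple local correlation track. This is a simplified stand-in and not StereoGene's implementation: the function names, the kernel width `sigma`, the `radius` cutoff, and the toy tracks are all assumptions chosen for illustration, and the tracks are assumed to be non-constant.

```python
import math


def gaussian_kernel(sigma, radius):
    """Normalized Gaussian weights for offsets -radius..radius."""
    weights = [math.exp(-0.5 * (d / sigma) ** 2) for d in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def smooth(track, kernel):
    """Convolve a track with the kernel, ignoring positions off the ends."""
    radius = len(kernel) // 2
    n = len(track)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = i + k - radius
            if 0 <= j < n:
                acc += w * track[j]
        out.append(acc)
    return out


def standardize(track):
    """Shift to zero mean and scale to unit variance (track must vary)."""
    n = len(track)
    mean = sum(track) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in track) / n)
    return [(v - mean) / sd for v in track]


def kernel_correlation(f, g, sigma=2.0, radius=6):
    """Return (global correlation, local track) for kernel-smoothed tracks."""
    kernel = gaussian_kernel(sigma, radius)
    fs = standardize(smooth(f, kernel))
    gs = standardize(smooth(g, kernel))
    local = [a * b for a, b in zip(fs, gs)]  # per-position contribution
    return sum(local) / len(local), local


# Toy tracks: the second feature sits two positions downstream of the first.
f = [0.0] * 50
g = [0.0] * 50
for pos in (10, 30):
    f[pos] = 1.0
    g[pos + 2] = 1.0

raw, _ = kernel_correlation(f, g, sigma=1.0, radius=0)  # no smoothing
smoothed, local_track = kernel_correlation(f, g)
print("position-wise:", round(raw, 3), "kernel-smoothed:", round(smoothed, 3))
```

With `radius=0` the kernel reduces to a single weight, so the first call is an ordinary position-by-position correlation; it misses the relationship because the peaks never coincide exactly, whereas the smoothed version credits features that are merely close to each other.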
Genome-wide expression profile of first trimester villous and extravillous human trophoblast cells
Apps, R.; Sharkey, A.; Gardner, L.; Male, V.; Trotter, M.; Miller, N.; North, R.; Founds, S.; Moffett, A.
2011-01-01
We have examined the transcriptional changes associated with differentiation from villous to extravillous trophoblast using a whole genome microarray. Villous trophoblast (VT) is in contact with maternal blood and mediates nutrient exchange whereas extravillous trophoblast (EVT) invades the decidua and remodels uterine arteries. Using highly purified first trimester trophoblast we identified over 3000 transcripts that are differentially expressed. Many of these transcripts represent novel functions and pathways that show co-ordinated up-regulation in VT or EVT. In addition we identify new players in established functions such as migration, immune modulation and cytokine or angiogenic factor secretion by EVT. The transition from VT to EVT is also characterised by alterations in transcription factors such as STAT4 and IRF9, which may co-ordinate these changes. Transcripts encoding several members of the immunoglobulin-superfamily, which are normally expressed on leukocytes, were highly transcribed in EVT but not expressed as protein, indicating specific control of translation in EVT. Interactions of trophoblast with decidual leukocytes are involved in regulating EVT invasion. We show that decidual T-cells, macrophages and NK cells express the inhibitory collagen receptor LAIR-1 and that EVT secrete LAIR-2, which can block this interaction. This represents a new mechanism by which EVT can modulate leukocyte function in the decidua. Since LAIR-2 is detectable in the urine of pregnant, but not non-pregnant women, trophoblast-derived LAIR-2 may also have systemic effects during pregnancy. PMID:21075446
Genome-wide characterization of Mediator recruitment, function, and regulation.
Grünberg, Sebastian; Zentner, Gabriel E
2017-05-27
Mediator is a conserved and essential coactivator complex broadly required for RNA polymerase II (RNAPII) transcription. Recent genome-wide studies of Mediator binding in budding yeast have revealed new insights into the functions of this critical complex and raised new questions about its role in the regulation of gene expression.
Genome-wide analysis of alternative splicing during human heart development
NASA Astrophysics Data System (ADS)
Wang, He; Chen, Yanmei; Li, Xinzhong; Chen, Guojun; Zhong, Lintao; Chen, Gangbing; Liao, Yulin; Liao, Wangjun; Bin, Jianping
2016-10-01
Alternative splicing (AS) drives determinative changes during mouse heart development. Recent high-throughput technological advancements have facilitated genome-wide AS analysis, but such an analysis of the transition from the human foetal heart to the adult stage has not been reported. Here, we present a high-resolution global analysis of AS transitions between human foetal and adult hearts. RNA-sequencing data showed extensive AS transitions occurred between human foetal and adult hearts, and AS events occurred more frequently in protein-coding genes than in long non-coding RNA (lncRNA). A significant difference of AS patterns was found between foetal and adult hearts. The predicted difference in AS events was further confirmed using quantitative reverse transcription-polymerase chain reaction analysis of human heart samples. Functional foetal-specific AS event analysis showed enrichment associated with cell proliferation-related pathways including cell cycle, whereas adult-specific AS events were associated with protein synthesis. Furthermore, 42.6% of foetal-specific AS events showed significant changes in gene expression levels between foetal and adult hearts. Genes exhibiting both foetal-specific AS and differential expression were highly enriched in cell cycle-associated functions. In conclusion, we provided a genome-wide profiling of AS transitions between foetal and adult hearts and proposed that AS transitions and differential gene expression may play determinative roles in human heart development.
Paterson, Kathryn J.; Sisignano, Marco; Schmid, Ramona; Rust, Werner; Hildebrandt, Tobias; Geisslinger, Gerd; Orengo, Christine; Bennett, David L.; McMahon, Stephen B.
2014-01-01
Ultraviolet-B (UVB)-induced inflammation produces a dose-dependent mechanical and thermal hyperalgesia in both humans and rats, most likely via inflammatory mediators acting at the site of injury. Previous work has shown that the gene expression of cytokines and chemokines is positively correlated between species and that these factors can contribute to UVB-induced pain. In order to investigate other potential pain mediators in this model we used RNA-seq to perform genome-wide transcriptional profiling in both human and rat skin at the peak of hyperalgesia. In addition we have also measured transcriptional changes in the L4 and L5 DRG of the rat model. Our data show that UVB irradiation produces a large number of transcriptional changes in the skin: 2186 and 3888 genes are significantly dysregulated in human and rat skin, respectively. The most highly up-regulated genes in human skin feature those encoding cytokines (IL6 and IL24), chemokines (CCL3, CCL20, CXCL1, CXCL2, CXCL3 and CXCL5), the prostanoid synthesising enzyme COX-2 and members of the keratin gene family. Overall there was a strong positive and significant correlation in gene expression between human and rat (R = 0.8022). In contrast to the skin, only 39 genes were significantly dysregulated in the rat L4 and L5 DRGs, the majority of which had small fold change values. Amongst the most up-regulated genes in DRG were REG3B, CCL2 and VGF. Overall, our data show that numerous genes were up-regulated in UVB irradiated skin at the peak of hyperalgesia in both humans and rats. Many of the top up-regulated genes were cytokines and chemokines, highlighting again their potential as pain mediators. However many other genes were also up-regulated and might play a role in UVB-induced hyperalgesia. In addition, the strong gene expression correlation between species re-emphasises the value of the UVB model as a translational tool to study inflammatory pain. PMID:24732968
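A cross-species correlation of the kind reported above (R = 0.8022) is typically a Pearson coefficient computed over genes measured in both species. The sketch below shows that calculation on a handful of invented log2 fold changes. The numeric values and the placeholder gene `GENE_X` are made up purely for illustration and are not taken from the study, and the abstract does not say exactly which expression measure or gene set the authors correlated.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical log2 fold changes (UVB vs. control), keyed by gene symbol.
# These numbers are illustrative only.
human_lfc = {"IL6": 5.1, "CCL3": 4.2, "CXCL1": 3.8, "PTGS2": 3.0, "GENE_X": -1.2}
rat_lfc = {"IL6": 4.4, "CCL3": 2.9, "CXCL1": 4.1, "PTGS2": 2.2, "GENE_X": -0.4, "VGF": 0.3}

# Correlate only genes measured in both species, in a fixed order.
shared = sorted(set(human_lfc) & set(rat_lfc))
r = pearson_r([human_lfc[g] for g in shared], [rat_lfc[g] for g in shared])
print(len(shared), "shared genes, r =", round(r, 3))
```

Restricting the comparison to shared gene symbols is itself a simplification: a real analysis would first map orthologs between human and rat.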
Vatansever, Recep; Koc, Ibrahim; Ozyigit, Ibrahim Ilker; Sen, Ugur; Uras, Mehmet Emin; Anjum, Naser A; Pereira, Eduarda; Filiz, Ertugrul
2016-12-01
Solanum tuberosum genome analysis revealed 12 StSULTR genes encoding 18 transcripts. Among genes annotated at group level (StSULTR I-IV), group III members formed the largest SULTRs-cluster and were potentially involved in biotic/abiotic stress responses via various regulatory factors, and stress and signaling proteins. Employing bioinformatics tools, this study performed genome-wide identification and expression analysis of SULTR (StSULTR) genes in potato (Solanum tuberosum L.). Very strict homology search and subsequent domain verification with Hidden Markov Model revealed 12 StSULTR genes encoding 18 transcripts. StSULTR genes were mapped on seven S. tuberosum chromosomes. Annotation of StSULTR genes was also done as StSULTR I-IV at group level based mainly on the phylogenetic distribution with Arabidopsis SULTRs. Several tandem and segmental duplications were identified between StSULTR genes. Among these duplications, Ka/Ks ratios indicated neutral nature of mutations that might not be causing any selection. Two segmental duplications and one tandem duplication were calculated to occur around 147.69, 180.80 and 191.00 million years ago (MYA), approximately corresponding to the time of monocot/dicot divergence. Two other segmental duplications were found to occur around 61.23 and 67.83 MYA, which is very close to the origination of monocotyledons. Most cis-regulatory elements in StSULTRs were found associated with major hormones (such as abscisic acid and methyl jasmonate), and defense and stress responsiveness. The cis-element distribution in duplicated gene pairs indicated the contribution of duplication events in conferring the neofunctionalization/s in StSULTR genes. Notably, RNAseq data analyses unveiled expression profiles of StSULTR genes under different stress conditions. In particular, expression profiles of StSULTR III members suggested their involvement in plant stress responses.
Additionally, gene co-expression networks of these group members included various regulatory factors, stress and signaling proteins, and housekeeping and some other proteins with unknown functions.
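The abstract above reports Ka/Ks ratios and duplication ages without giving the calculation. A minimal sketch of one common approach follows, assuming a molecular clock in which the age is T = Ks / (2λ). The function names, the substitution rate λ, and the neutrality tolerance are illustrative assumptions, not values taken from the study.

```python
# Hypothetical helpers illustrating a common way to date gene duplications
# and to read Ka/Ks ratios. The default rate is an assumed illustrative
# value; the abstract does not state which rate the authors used.

def duplication_age_mya(ks: float, rate_per_site_per_year: float = 6.5e-9) -> float:
    """Estimate duplication age in million years as T = Ks / (2 * rate)."""
    if ks < 0 or rate_per_site_per_year <= 0:
        raise ValueError("ks must be >= 0 and the rate must be > 0")
    return ks / (2.0 * rate_per_site_per_year) / 1e6


def selection_mode(ka: float, ks: float, tol: float = 0.1) -> str:
    """Classify a gene pair by its Ka/Ks ratio.

    Ratios near 1 suggest neutral evolution, below 1 purifying selection,
    above 1 positive selection. The tolerance `tol` is an assumption.
    """
    if ks <= 0:
        raise ValueError("ks must be > 0 to form the Ka/Ks ratio")
    ratio = ka / ks
    if abs(ratio - 1.0) <= tol:
        return "neutral"
    return "purifying" if ratio < 1.0 else "positive"


if __name__ == "__main__":
    # Made-up Ks and Ka values for a single duplicated gene pair.
    print(duplication_age_mya(1.3))
    print(selection_mode(0.48, 0.5))
```

With a different assumed rate the estimated ages scale inversely, so any date obtained this way is only as reliable as the clock calibration.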
Camps, Jordi; Nguyen, Quang Tri; Padilla-Nash, Hesed M.; Knutsen, Turid; McNeil, Nicole E.; Wangsa, Danny; Hummon, Amanda B.; Grade, Marian; Ried, Thomas; Difilippantonio, Michael J.
2016-01-01
To evaluate the mechanisms and consequences of chromosomal aberrations in colorectal cancer (CRC), we used a combination of spectral karyotyping, array comparative genomic hybridization (aCGH), and array-based global gene expression profiling on 31 primary carcinomas and 15 established cell lines. Importantly, aCGH showed that the genomic profiles of primary tumors are recapitulated in the cell lines. We revealed a preponderance of chromosome breakpoints at sites of copy number variants (CNVs) in the CRC cell lines, a novel mechanism of DNA breakage in cancer. The integration of gene expression and aCGH led to the identification of 157 genes localized within high-level copy number changes whose transcriptional deregulation was significantly affected across all of the samples, thereby suggesting that these genes play a functional role in CRC. Genomic amplification at 8q24 was the most recurrent event and led to the overexpression of MYC and FAM84B. Copy number dependent gene expression resulted in deregulation of known cancer genes such as APC, FGFR2, and ERBB2. The identification of only 36 genes whose localization near a breakpoint could account for their observed deregulated expression demonstrates that the major mechanism for transcriptional deregulation in CRC is genomic copy number changes resulting from chromosomal aberrations. PMID:19691111
Staying alive in adversity: transcriptome dynamics in the stress-resistant dauer larva.
Holt, Suzan J
2006-10-01
In response to food depletion and overcrowding, the soil nematode Caenorhabditis elegans can arrest development and form an alternate third larval stage called the dauer. Though nonfeeding, the dauer larva is long lived and stress resistant. Metabolic and transcription rates are lowered but the transcriptome of the dauer is complex. In this study, distribution analysis of transcript profiles generated by Serial Analysis of Gene Expression (SAGE) in dauer larvae and in mixed developmental stages is presented. An inverse relationship was observed between frequency and abundance/copy number of SAGE tag types (transcripts) in both profiles. In the dauer profile, a relatively greater proportion of highly abundant transcripts was counterbalanced by a smaller fraction of low to moderately abundant transcripts. Comparisons of abundant tag counts between the two profiles revealed relative enrichment in the dauer profile of transcripts with predicted or known involvement in ribosome biogenesis and protein synthesis, membrane transport, and immune responses. Translation-coupled mRNA decay is proposed as part of an immune-like stress response in the dauer larva. An influence of genomic region on transcript level may reflect the coordination of transcription and mRNA turnover.
Each cell counts: Hematopoiesis and immunity research in the era of single cell genomics.
Jaitin, Diego Adhemar; Keren-Shaul, Hadas; Elefant, Naama; Amit, Ido
2015-02-01
Hematopoiesis and immunity are mediated through complex interactions between multiple cell types and states. This complexity is currently addressed following a reductionist approach of characterizing cell types by a small number of cell surface molecular features and gross functions. While the introduction of global transcriptional profiling technologies enabled a more comprehensive view, heterogeneity within sampled populations remained unaddressed, obscuring the true picture of hematopoiesis and immune system function. A critical mass of technological advances in molecular biology and genomics has enabled genome-wide measurements of single cells - the fundamental unit of immunity. These new advances are expected to boost detection of less frequent cell types and fuzzy intermediate cell states, greatly expanding the resolution of currently available classifications. This new era of single-cell genomics in immunology research holds great promise for further understanding of the mechanisms and circuits regulating hematopoiesis and immunity in both health and disease. In the near future, the accuracy of single-cell genomics will ultimately enable precise diagnostics and treatment of multiple hematopoietic and immune related diseases.
Optimization of cDNA-AFLP experiments using genomic sequence data.
Kivioja, Teemu; Arvas, Mikko; Saloheimo, Markku; Penttilä, Merja; Ukkonen, Esko
2005-06-01
cDNA amplified fragment length polymorphism (cDNA-AFLP) is one of the few genome-wide level expression profiling methods capable of finding genes that have not yet been cloned or even predicted from sequence but have interesting expression patterns under the studied conditions. In cDNA-AFLP, a complex cDNA mixture is divided into small subsets using restriction enzymes and selective PCR. A large cDNA-AFLP experiment can require a substantial amount of resources, such as hundreds of PCR amplifications and gel electrophoresis runs, followed by manual cutting of a large number of bands from the gels. Our aim was to test whether this workload can be reduced by rational design of the experiment. We used the available genomic sequence information to optimize cDNA-AFLP experiments beforehand so that as many transcripts as possible could be profiled with a given amount of resources. Optimization of the selection of both restriction enzymes and selective primers for cDNA-AFLP experiments has not been performed previously. The in silico tests performed suggest that substantial amounts of resources can be saved by the optimization of cDNA-AFLP experiments.
Makeyev, Aleksandr V; Bayarsaihan, Dashzeveg
2013-05-01
Objectives: GTF2I and GTF2IRD1 genes located in Williams-Beuren syndrome (WBS) critical region encode TFII-I family transcription factors. The aim of this study was to map genomic sites bound by these proteins across promoter regions of developmental regulators associated with craniofacial development. Design: Chromatin was isolated from human neural crest progenitor cells and the DNA-binding profile was generated using the human RefSeq tiling promoter ChIP-chip arrays. Results: TFII-I transcription factors are recruited to the promoters of SEC23A, CFDP1, and NSD1 previously defined as TFII-I target genes. Moreover, our analysis revealed additional binding elements that contain E-boxes and initiator-like motifs. Conclusions: Genome-wide promoter binding studies revealed SEC23A, CFDP1, and NSD1 linked to craniofacial or dental development as direct TFII-I targets. Developmental regulation of these genes by TFII-I factors could contribute to the WBS-specific facial dysmorphism.
TEA: the epigenome platform for Arabidopsis methylome study.
Su, Sheng-Yao; Chen, Shu-Hwa; Lu, I-Hsuan; Chiang, Yih-Shien; Wang, Yu-Bin; Chen, Pao-Yang; Lin, Chung-Yen
2016-12-22
Bisulfite sequencing (BS-seq) has become a standard technology to profile genome-wide DNA methylation at single-base resolution. It allows researchers to conduct genome-wide cytosine methylation analyses on issues about genomic imprinting, transcriptional regulation, cellular development and differentiation. A single dataset from a BS-Seq experiment is resolved into many features according to the sequence contexts, making methylome data analysis and data visualization a complex task. We developed a streamlined platform, TEA, for analyzing and visualizing data from whole-genome BS-Seq (WGBS) experiments conducted in the model plant Arabidopsis thaliana. To capture the essence of the genome methylation level and to run efficiently online, we introduce a straightforward method for measuring genome methylation in each sequence context by gene. The method is scripted in Java to process BS-Seq mapping results. Through a simple data uploading process, the TEA server deploys a web-based platform for deep analysis by linking data to an updated Arabidopsis annotation database and toolkits. TEA is an intuitive and efficient online platform for analyzing the Arabidopsis genomic DNA methylation landscape. It provides several ways to help users exploit WGBS data. TEA is freely accessible for academic users at: http://tea.iis.sinica.edu.tw.
Comprehensive analysis of RNA-seq data reveals the complexity of the transcriptome in Brassica rapa.
Tong, Chaobo; Wang, Xiaowu; Yu, Jingyin; Wu, Jian; Li, Wanshun; Huang, Junyan; Dong, Caihua; Hua, Wei; Liu, Shengyi
2013-10-07
The species Brassica rapa (2n=20, AA) is an important vegetable and oilseed crop, and serves as an excellent model for genomic and evolutionary research in Brassica species. With the availability of the whole genome sequence of B. rapa, it is essential to further determine the activity of all functional elements of the B. rapa genome and explore the transcriptome on a genome-wide scale. Here, RNA-seq data were employed to provide a genome-wide transcriptional landscape and a characterization of the annotated and novel transcripts and alternative splicing events across tissues. RNA-seq reads were generated using the Illumina platform from six different tissues (root, stem, leaf, flower, silique and callus) of the B. rapa accession Chiifu-401-42, the same line used for whole genome sequencing. First, these data detected the widespread transcription of the B. rapa genome, leading to the identification of numerous novel transcripts and the definition of 5'/3' UTRs of known genes. Second, 78.8% of the total annotated genes were detected as expressed and 45.8% were constitutively expressed across all tissues. We further defined several groups of genes: housekeeping genes, tissue-specific expressed genes and co-expressed genes across tissues, which will serve as a valuable repository for future crop functional genomics research. Third, alternative splicing (AS) is estimated to occur in more than 29.4% of intron-containing B. rapa genes, and 65% of them were commonly detected in more than two tissues. Interestingly, genes with a high rate of AS were over-represented in GO categories relating to transcriptional regulation and signal transduction, suggesting that AS may play an important regulatory role in these genes. Further, we observed that intron retention (IR) is predominant among the AS events and seems to occur preferentially in genes with short introns. The high-resolution RNA-seq analysis provides a global transcriptional landscape as a complement to the B. rapa genome sequence, which will advance our understanding of the dynamics and complexity of the B. rapa transcriptome. The atlas of gene expression in different tissues will be useful for accelerating research on functional genomics and genome evolution in Brassica species.
Zhang, Qi; Zeng, Xin; Younkin, Sam; Kawli, Trupti; Snyder, Michael P; Keleş, Sündüz
2016-02-24
Chromatin immunoprecipitation followed by sequencing (ChIP-seq) experiments revolutionized genome-wide profiling of transcription factors and histone modifications. Although maturing sequencing technologies allow these experiments to be carried out with short (36-50 bps), long (75-100 bps), single-end, or paired-end reads, the impact of these read parameters on the downstream data analysis is not well understood. In this paper, we evaluate the effects of different read parameters on genome sequence alignment, coverage of different classes of genomic features, peak identification, and allele-specific binding detection. We generated 101 bps paired-end ChIP-seq data for many transcription factors from human GM12878 and MCF7 cell lines. Systematic evaluations using in silico variations of these data, as well as fully simulated data, revealed complex interplay between the sequencing parameters and analysis tools, and indicated clear advantages of paired-end designs in several aspects such as alignment accuracy, peak resolution, and most notably, allele-specific binding detection. Our work elucidates the effect of design on the downstream analysis and provides insights to investigators in deciding sequencing parameters in ChIP-seq experiments. We present the first systematic evaluation of the impact of ChIP-seq designs on allele-specific binding detection and highlight the power of paired-end designs in such studies.
Sharma, Ajeet K.; Ahmed, Nabeel; O'Brien, Edward P.
2018-02-01
Ribosome profiling experiments have found greater than 100-fold variation in ribosome density along mRNA transcripts, indicating that individual codon elongation rates can vary to a similar degree. This wide range of elongation times, coupled with differences in codon usage between transcripts, suggests that the average codon translation-rate per gene can vary widely. Yet, ribosome run-off experiments have found that the average codon translation rate for different groups of transcripts in mouse stem cells is constant at 5.6 AA/s. How these seemingly contradictory results can be reconciled is the focus of this study. Here, we combine knowledge of the molecular factors shown to influence translation speed with genomic information from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens to simulate the synthesis of cytosolic proteins in these organisms. The model recapitulates a near constant average translation rate, which we demonstrate arises because the molecular determinants of translation speed are distributed nearly randomly amongst most of the transcripts. Consequently, codon translation rates are also randomly distributed and fast-translating segments of a transcript are likely to be offset by equally probable slow-translating segments, resulting in similar average elongation rates for most transcripts. We also show that the codon usage bias does not significantly affect the near random distribution of codon translation rates because only about 10 % of the total transcripts in an organism have high codon usage bias while the rest have little to no bias. Analysis of Ribo-Seq data and an in vivo fluorescent assay supports these conclusions.
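The averaging argument in the abstract above can be illustrated with a small simulation. The sketch below is not the authors' model: it simply assumes that per-codon dwell times are drawn independently from a log-uniform distribution spanning a 100-fold range, and that every transcript has the same length. The function name and all parameter values are hypothetical.

```python
import random
import statistics


def simulate_mean_rates(n_transcripts: int = 500,
                        codons_per_transcript: int = 300,
                        seed: int = 0) -> list[float]:
    """Return the average elongation rate (codons/s) of each simulated transcript.

    Assumption: dwell times are independent and log-uniform between
    0.02 s and 2 s per codon, i.e. a 100-fold range.
    """
    rng = random.Random(seed)
    mean_rates = []
    for _ in range(n_transcripts):
        # 0.02 * 100**u with u in [0, 1) spans 0.02 s to 2 s.
        dwell_times = [0.02 * 100 ** rng.random()
                       for _ in range(codons_per_transcript)]
        mean_rates.append(codons_per_transcript / sum(dwell_times))
    return mean_rates


if __name__ == "__main__":
    rates = simulate_mean_rates()
    cv = statistics.stdev(rates) / statistics.mean(rates)
    # Individual codons differ up to 100-fold, yet the spread of
    # per-transcript averages is far smaller.
    print(f"coefficient of variation of mean rates: {cv:.3f}")
```

Because fast and slow codons are mixed at random, their effects largely cancel over a few hundred codons, which is the qualitative behaviour the abstract describes.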
Replication-associated mutational asymmetry in the human genome.
Chen, Chun-Long; Duquenne, Lauranne; Audit, Benjamin; Guilbaud, Guillaume; Rappailles, Aurélien; Baker, Antoine; Huvet, Maxime; d'Aubenton-Carafa, Yves; Hyrien, Olivier; Arneodo, Alain; Thermes, Claude
2011-08-01
During evolution, mutations occur at rates that can differ between the two DNA strands. In the human genome, nucleotide substitutions occur at different rates on the transcribed and non-transcribed strands that may result from transcription-coupled repair. These mutational asymmetries generate transcription-associated compositional skews. To date, the existence of such asymmetries associated with replication has not yet been established. Here, we compute the nucleotide substitution matrices around replication initiation zones identified as sharp peaks in replication timing profiles and associated with abrupt jumps in the compositional skew profile. We show that the substitution matrices computed in these regions fully explain the jumps in the compositional skew profile when crossing initiation zones. In intergenic regions, we observe mutational asymmetries measured as differences between complementary substitution rates; their sign changes when crossing initiation zones. These mutational asymmetries are unlikely to result from cryptic transcription but can be explained by a model based on replication errors and strand-biased repair. In transcribed regions, mutational asymmetries associated with replication superimpose on the previously described mutational asymmetries associated with transcription. We separate the substitution asymmetries associated with both mechanisms, which allows us to determine for the first time in eukaryotes, the mutational asymmetries associated with replication and to reevaluate those associated with transcription. Replication-associated mutational asymmetry may result from unequal rates of complementary base misincorporation by the DNA polymerases coupled with DNA mismatch repair (MMR) acting with different efficiencies on the leading and lagging strands. Replication, acting in germ line cells during long evolutionary times, contributed equally with transcription to produce the present abrupt jumps in the compositional skew. 
These results demonstrate that DNA replication is one of the major processes that shape human genome composition.
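The compositional skew profile mentioned in the abstract above can be computed along a sequence in windows. The sketch below assumes the commonly used definition S = (T - A)/(T + A) + (G - C)/(G + C); the abstract does not spell out the formula, so treat this definition, the function names, and the window parameters as assumptions.

```python
def compositional_skew(seq: str) -> float:
    """Total skew S = S_TA + S_GC of one sequence window (assumed definition)."""
    seq = seq.upper()
    a, c, g, t = (seq.count(base) for base in "ACGT")
    s_ta = (t - a) / (t + a) if (t + a) else 0.0
    s_gc = (g - c) / (g + c) if (g + c) else 0.0
    return s_ta + s_gc


def skew_profile(seq: str, window: int, step: int) -> list[float]:
    """Skew in successive windows; an abrupt sign change marks a jump."""
    return [compositional_skew(seq[i:i + window])
            for i in range(0, len(seq) - window + 1, step)]


if __name__ == "__main__":
    # Toy sequence: A/C-rich on the left, T/G-rich on the right, so the
    # skew jumps from negative to positive at the midpoint.
    toy = "AACC" * 5 + "TTGG" * 5
    print(skew_profile(toy, window=8, step=4))
```

On real data the windows would span kilobases and the profile would be compared with replication timing, which this toy example does not attempt.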
Regulation of Androgen Receptor-Mediated Transcription by RPB5 Binding Protein URI/RMP
Mita, Paolo; Savas, Jeffrey N.; Djouder, Nabil; Yates, John R.; Ha, Susan; Ruoff, Rachel; Schafler, Eric D.; Nwachukwu, Jerome C.; Tanese, Naoko; Cowan, Nicholas J.; Zavadil, Jiri; Garabedian, Michael J.; Logan, Susan K.
2011-01-01
Androgen receptor (AR)-mediated transcription is modulated by interaction with coregulatory proteins. We demonstrate that the unconventional prefoldin RPB5 interactor (URI) is a new regulator of AR transcription and is critical for antagonist (bicalutamide) action. URI is phosphorylated upon androgen treatment, suggesting communication between the URI and AR signaling pathways. Whereas depletion of URI enhances AR-mediated gene transcription, overexpression of URI suppresses AR transcriptional activation and anchorage-independent prostate cancer cell growth. Repression of AR-mediated transcription is achieved, in part, by URI binding and regulation of androgen receptor trapped clone 27 (Art-27), a previously characterized AR corepressor. Consistent with this idea, genome-wide expression profiling in prostate cancer cells upon depletion of URI or Art-27 reveals substantially overlapping patterns of gene expression. Further, depletion of URI increases the expression of the AR target gene NKX-3.1, decreases the recruitment of Art-27, and increases AR occupancy at the NKX-3.1 promoter. While Art-27 can bind AR directly, URI is bound to chromatin prior to hormone-dependent recruitment of AR, suggesting a role for URI in modulating AR recruitment to target genes. PMID:21730289
Pervasive Transcription of a Herpesvirus Genome Generates Functionally Important RNAs
Canny, Susan P.; Reese, Tiffany A.; Johnson, L. Steven; Zhang, Xin; Kambal, Amal; Duan, Erning; Liu, Catherine Y.; Virgin, Herbert W.
2014-01-01
ABSTRACT Pervasive transcription is observed in a wide range of organisms, including humans, mice, and viruses, but the functional significance of the resulting transcripts remains uncertain. Current genetic approaches are often limited by their emphasis on protein-coding open reading frames (ORFs). We previously identified extensive pervasive transcription from the murine gammaherpesvirus 68 (MHV68) genome outside known ORFs and antisense to known genes (termed expressed genomic regions [EGRs]). Similar antisense transcripts have been identified in many other herpesviruses, including Kaposi’s sarcoma-associated herpesvirus and human and murine cytomegalovirus. Despite their prevalence, whether these RNAs have any functional importance in the viral life cycle is unknown, and one interpretation is that these are merely “noise” generated by functionally unimportant transcriptional events. To determine whether pervasive transcription of a herpesvirus genome generates RNA molecules that are functionally important, we used a strand-specific functional approach to target transcripts from thirteen EGRs in MHV68. We found that targeting transcripts from six EGRs reduced viral protein expression, proving that pervasive transcription can generate functionally important RNAs. We characterized transcripts emanating from EGRs 26 and 27 in detail using several methods, including RNA sequencing, and identified several novel polyadenylated transcripts that were enriched in the nuclei of infected cells. These data provide the first evidence of the functional importance of regions of pervasive transcription emanating from MHV68 EGRs. Therefore, studies utilizing mutation of a herpesvirus genome must account for possible effects on RNAs generated by pervasive transcription. PMID:24618256
Expression Profile of Long Noncoding RNAs in Human Earlobe Keloids: A Microarray Analysis
Guo, Liang; Xu, Kai; Yan, Hongbo; Feng, Haifeng
2016-01-01
Background. Long noncoding RNAs (lncRNAs) play key roles in a wide range of biological processes and their deregulation results in human disease, including keloids. Earlobe keloid is a type of pathological skin scar, and the molecular pathogenesis of this disease remains largely unknown. Methods. In this study, microarray analysis was used to determine the expression profiles of lncRNAs and mRNAs between 3 pairs of earlobe keloid and normal specimens. Gene Ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were performed to identify the main functions of the differentially expressed genes and earlobe keloid-related pathways. Results. A total of 2068 lncRNAs and 1511 mRNAs were differentially expressed between earlobe keloid and normal tissues. Among them, 1290 lncRNAs and 1092 mRNAs were upregulated, and 778 lncRNAs and 419 mRNAs were downregulated. Pathway analysis revealed that 24 pathways were correlated to the upregulated transcripts, while 11 pathways were associated with the downregulated transcripts. Conclusion. We characterized the expression profiles of lncRNA and mRNA in earlobe keloids and suggest that lncRNAs may serve as diagnostic biomarkers for the therapy of earlobe keloid. PMID:28101509
Gessner, Denise K; Winkler, Anne; Koch, Christian; Dusel, Georg; Liebisch, Gerhard; Ringseis, Robert; Eder, Klaus
2017-03-23
It was recently reported that dairy cows fed a polyphenol-rich grape seed and grape marc meal extract (GSGME) during the transition period had an increased milk yield, but the underlying reasons remained unclear. As polyphenols exert a broad spectrum of metabolic effects, we hypothesized that feeding of GSGME influences metabolic pathways in the liver which could account for the positive effects of GSGME in dairy cows. In order to identify these pathways, we performed genome-wide transcript profiling in the liver and lipid profiling in plasma of dairy cows fed GSGME during the transition period at 1 week postpartum. Transcriptomic analysis of the liver revealed 207 differentially expressed transcripts, from which 156 were up- and 51 were down-regulated, between cows fed GSGME and control cows. Gene set enrichment analysis of the 155 up-regulated mRNAs showed that the most enriched gene ontology (GO) biological process terms were dealing with cell cycle regulation and the most enriched Kyoto Encyclopedia of Genes and Genomes pathways were p53 signaling and cell cycle. Functional analysis of the 43 down-regulated mRNAs revealed that a great part of these genes are involved in endoplasmic reticulum (ER) stress-induced unfolded protein response (UPR) and inflammatory processes. Accordingly, protein folding, response to unfolded protein, unfolded protein binding, chemokine activity and heat shock protein binding were identified as one of the most enriched GO biological process and molecular function terms assigned to the down-regulated genes. In line with the transcriptomics data the plasma concentrations of the acute phase proteins serum amyloid A (SAA) and haptoglobin were reduced in cows fed GSGME compared to control cows. Lipidomic analysis of plasma revealed no differences in the concentrations of individual species of major and minor lipid classes between cows fed GSGME and control cows. 
Analysis of the hepatic transcript profile of cows fed GSGME during the transition period indicates that, at 1 week postpartum, polyphenol-rich feed components are able to inhibit ER stress-induced UPR and inflammatory processes in the liver of dairy cows during early lactation. Both processes are considered to contribute to liver-associated diseases and to impair milk performance in dairy cows.
Downstream targets of HOXB4 in a cell line model of primitive hematopoietic progenitor cells.
Lee, Han M; Zhang, Hui; Schulz, Vincent; Tuck, David P; Forget, Bernard G
2010-08-05
Enforced expression of the homeobox transcription factor HOXB4 has been shown to enhance hematopoietic stem cell self-renewal and expansion ex vivo and in vivo. To investigate the downstream targets of HOXB4 in hematopoietic progenitor cells, HOXB4 was constitutively overexpressed in the primitive hematopoietic progenitor cell line EML. Two genome-wide analytical techniques were used: RNA expression profiling using microarrays and chromatin immunoprecipitation (ChIP)-chip. RNA expression profiling revealed that 465 gene transcripts were differentially expressed in KLS (c-Kit(+), Lin(-), Sca-1(+))-EML cells that overexpressed HOXB4 (KLS-EML-HOXB4) compared with control KLS-EML cells that were transduced with vector alone. In particular, erythroid-specific gene transcripts were observed to be highly down-regulated in KLS-EML-HOXB4 cells. ChIP-chip analysis revealed that the promoter regions of 1910 genes, such as CD34, Sox4, and B220, were occupied by HOXB4 in KLS-EML-HOXB4 cells. Side-by-side comparison of the ChIP-chip and RNA expression profiling datasets provided correlative information and identified Gp49a and Laptm4b as candidate "stemness-related" genes. Both genes were highly ranked in both dataset lists and have been previously shown to be preferentially expressed in hematopoietic stem cells and down-regulated in mature hematopoietic cells, thus making them attractive candidates for future functional studies in hematopoietic cells.
Tajaddod, Mansoureh; Tanzer, Andrea; Licht, Konstantin; Wolfinger, Michael T; Badelt, Stefan; Huber, Florian; Pusch, Oliver; Schopoff, Sandy; Janisiw, Michael; Hofacker, Ivo; Jantsch, Michael F
2016-10-25
Short interspersed elements (SINEs) represent the most abundant group of non-long-terminal repeat transposable elements in mammalian genomes. In primates, Alu elements are the most prominent and homogenous representatives of SINEs. Due to their frequent insertion within or close to coding regions, SINEs have been suggested to play a crucial role during genome evolution. Moreover, Alu elements within mRNAs have also been reported to control gene expression at different levels. Here, we undertake a genome-wide analysis of insertion patterns of human Alus within transcribed portions of the genome. Multiple, nearby insertions of SINEs within one transcript are more abundant in tandem orientation than in inverted orientation. Indeed, analysis of transcriptome-wide expression levels of 15 ENCODE cell lines suggests a cis-repressive effect of inverted Alu elements on gene expression. Using reporter assays, we show that the negative effect of inverted SINEs on gene expression is independent of known sensors of double-stranded RNAs. Instead, transcriptional elongation seems impaired, leading to reduced mRNA levels. Our study suggests that there is a bias against multiple SINE insertions that can promote intramolecular base pairing within a transcript. Moreover, at a genome-wide level, mRNAs harboring inverted SINEs are less expressed than mRNAs harboring single or tandemly arranged SINEs. Finally, we demonstrate a novel mechanism by which inverted SINEs can impact on gene expression by interfering with RNA polymerase II.
Cognitive-behavioral stress management reverses anxiety-related leukocyte transcriptional dynamics
Antoni, Michael H.; Lutgendorf, Susan K.; Blomberg, Bonnie; Carver, Charles S.; Lechner, Suzanne; Diaz, Alain; Stagl, Jamie; Arevalo, Jesusa M.G.; Cole, Steven W.
2011-01-01
Background Chronic threat and anxiety are associated with pro-inflammatory transcriptional profiles in circulating leukocytes, but the causal direction of that relationship has not been established. This study tested whether a Cognitive-Behavioral Stress Management (CBSM) intervention targeting negative affect and cognition might counteract anxiety-related transcriptional alterations in people confronting a major medical threat. Methods 199 women undergoing primary treatment of Stage 0–III breast cancer were randomized to a 10-week CBSM protocol or an active control condition. 79 provided peripheral blood leukocyte samples for genome-wide transcriptional profiling and bioinformatic analyses at baseline, 6-, and 12-month follow-ups. Results Baseline negative affect was associated with > 50% differential expression of 201 leukocyte transcripts, including up-regulated expression of pro-inflammatory and metastasis-related genes. CBSM altered leukocyte expression of 91 genes by > 50% at follow-up (Group × Time interaction), including down-regulation of pro-inflammatory and metastasis-related genes and up-regulation of Type I interferon response genes. Promoter-based bioinformatic analyses implicated decreased activity of NF-κB/Rel and GATA family transcription factors and increased activity of Interferon Response Factors and the Glucocorticoid Receptor (GR) as potential mediators of CBSM-induced transcriptional alterations. Conclusions In early stage breast cancer patients, a 10-week CBSM intervention can reverse anxiety-related up-regulation of pro-inflammatory gene expression in circulating leukocytes. These findings clarify the molecular signaling pathways by which behavioral interventions can influence physical health and alter peripheral inflammatory processes that may reciprocally affect brain affective and cognitive processes. PMID:22088795
DNMT1-interacting RNAs block gene specific DNA methylation
Di Ruscio, Annalisa; Ebralidze, Alexander K.; Benoukraf, Touati; Amabile, Giovanni; Goff, Loyal A.; Terragni, Joylon; Figueroa, Maria Eugenia; De Figureido Pontes, Lorena Lobo; Alberich-Jorda, Meritxell; Zhang, Pu; Wu, Mengchu; D’Alò, Francesco; Melnick, Ari; Leone, Giuseppe; Ebralidze, Konstantin K.; Pradhan, Sriharsa; Rinn, John L.; Tenen, Daniel G.
2013-01-01
Summary DNA methylation was described almost a century ago. However, the rules governing its establishment and maintenance remain elusive. Here, we present data demonstrating that active transcription regulates levels of genomic methylation. We identified a novel RNA arising from the CEBPA gene locus critical in regulating the local DNA methylation profile. This RNA binds to DNMT1 and prevents CEBPA gene locus methylation. Deep sequencing of transcripts associated with DNMT1 combined with genome-scale methylation and expression profiling extended the generality of this finding to numerous gene loci. Collectively, these results delineate the nature of DNMT1-RNA interactions and suggest strategies for gene selective demethylation of therapeutic targets in disease. PMID:24107992
Abdallah, Abdallah M.; Hill-Cawthorne, Grant A.; Otto, Thomas D.; Coll, Francesc; Guerra-Assunção, José Afonso; Gao, Ge; Naeem, Raeece; Ansari, Hifzur; Malas, Tareq B.; Adroub, Sabir A.; Verboom, Theo; Ummels, Roy; Zhang, Huoming; Panigrahi, Aswini Kumar; McNerney, Ruth; Brosch, Roland; Clark, Taane G.; Behr, Marcel A.; Bitter, Wilbert; Pain, Arnab
2015-01-01
Although Bacillus Calmette-Guérin (BCG) vaccines against tuberculosis have been available for more than 90 years, their effectiveness has been hindered by variable protective efficacy and a lack of lasting memory responses. One factor contributing to this variability may be the diversity of the BCG strains that are used around the world, in part from genomic changes accumulated during vaccine production and their resulting differences in gene expression. We have compared the genomes and transcriptomes of a global collection of fourteen of the most widely used BCG strains at single base-pair resolution. We have also used quantitative proteomics to identify key differences in expression of proteins across five representative BCG strains of the four tandem duplication (DU) groups. We provide a comprehensive map of single nucleotide polymorphisms (SNPs), copy number variation and insertions and deletions (indels) across fourteen BCG strains. Genome-wide SNP characterization allowed the construction of a new and robust phylogenic genealogy of BCG strains. Transcriptional and proteomic profiling revealed a metabolic remodeling in BCG strains that may be reflected by altered immunogenicity and possibly vaccine efficacy. Together, these integrated-omic data represent the most comprehensive catalogue of genetic variation across a global collection of BCG strains. PMID:26487098
Lim, Su Jun; Boyle, Patrick J.; Chinen, Madoka; Dale, Ryan K.; Lei, Elissa P.
2013-01-01
Chromatin insulators are functionally conserved DNA–protein complexes situated throughout the genome that organize independent transcriptional domains. Previous work implicated RNA as an important cofactor in chromatin insulator activity, although the precise mechanisms are not yet understood. Here we identify the exosome, the highly conserved major cellular 3′ to 5′ RNA degradation machinery, as a physical interactor of CP190-dependent chromatin insulator complexes in Drosophila. Genome-wide profiling of exosome by ChIP-seq in two different embryonic cell lines reveals extensive and specific overlap with the CP190, BEAF-32 and CTCF insulator proteins. Colocalization occurs mainly at promoters but also boundary elements such as Mcp, Fab-8, scs and scs′, which overlaps with a promoter. Surprisingly, exosome associates primarily with promoters but not gene bodies of active genes, arguing against simple cotranscriptional recruitment to RNA substrates. Similar to insulator proteins, exosome is also significantly enriched at divergently transcribed promoters. Directed ChIP of exosome in cell lines depleted of insulator proteins shows that CTCF is required specifically for exosome association at Mcp and Fab-8 but not other sites, suggesting that alternate mechanisms must also contribute to exosome chromatin recruitment. Taken together, our results reveal a novel positive relationship between exosome and chromatin insulators throughout the genome. PMID:23358822
Ayadi, M; Hanana, M; Kharrat, N; Merchaoui, H; Marzoug, R Ben; Lauvergeat, V; Rebaï, A; Mzid, R
2016-10-01
WRKY transcription factors belong to a large family of plant transcriptional regulators whose members have been reported to be involved in a wide range of biological roles including plant development, adaptation to environmental constraints and response to several diseases. However, little information is available about WRKYs in Citrus. The recent release of completely assembled genome sequences of Citrus sinensis and Citrus clementina and the availability of EST sequences from other citrus species allowed us to perform a genome survey for Citrus WRKY proteins. In the present study, we identified 100 WRKY members from C. sinensis (51), C. clementina (48) and Citrus unshiu (1), and analyzed their chromosomal distribution, gene structure, gene duplication, syntenic relations and phylogeny. A phylogenetic tree of 100 Citrus WRKY sequences with their orthologs from Arabidopsis distinguished seven groups. The CsWRKY genes were distributed across all ten sweet orange chromosomes. A comprehensive approach and an integrative analysis of Citrus WRKY gene expression revealed variable profiles of expression within tissues and stress conditions, indicating functional diversification. Thus, candidate Citrus WRKY genes have been proposed as potentially involved in fruit acidification, essential oil biosynthesis and abiotic/biotic stress tolerance. Our results provide essential prerequisites for further WRKY gene cloning and functional analysis with the aim of citrus crop improvement.
Muleke, Everlyne M’mbone; Jabir, Bashir Mohammed; Xie, Yang; Zhu, Xianwen; Cheng, Wanwan
2017-01-01
NAC (NAM, no apical meristem; ATAF, Arabidopsis transcription activation factor and CUC, cup-shaped cotyledon) proteins are among the largest transcription factor (TF) families and play fundamental roles in biological processes, including cell expansion and differentiation, and hormone signaling in response to biotic and abiotic stresses. In this study, 172 RsNACs comprising 17 membrane-bound members were identified from the whole radish genome. In total, 98 RsNAC genes were non-uniformly distributed across the nine radish chromosomes. In silico analysis revealed that expression patterns of several NAC genes were tissue-specific, such as preferential expression in roots and leaves. In addition, 21 representative NAC genes were selected to investigate their responses to heavy metals (HMs), salt, heat, drought and abscisic acid (ABA) stresses using real-time polymerase chain reaction (RT-qPCR). Differential expression among these genes was identified: RsNAC023 and RsNAC080 responded positively to all stresses except ABA, while RsNAC145 responded more actively to salt, heat and drought stresses compared with other genes. The results provide valuable information and robust candidate genes for future functional analysis aimed at improving abiotic stress tolerance in radish. PMID:29259849
Wei, Wei; Chai, Zhuangzhuang; Xie, Yinge; Gao, Kuan; Cui, Mengyuan; Jiang, Ying
2017-01-01
Mitogen-activated protein kinases (MAPKs) play essential roles in mediating biotic and abiotic stress responses in plants. However, the MAPK gene family in strawberry has not been systematically characterized. Here, we performed a genome-wide survey and identified 12 MAPK genes in the Fragaria vesca genome. Protein domain analysis indicated that all FvMAPKs have typical protein kinase domains. Sequence alignments and phylogenetic analysis classified the FvMAPK genes into four different groups. Conserved motif and exon-intron organization supported the evolutionary relationships inferred from the phylogenetic analysis. Analysis of the stress-related cis-regulatory element in the promoters and subcellular localization predictions of FvMAPKs were also performed. Gene transcript profile analysis showed that the majority of the FvMAPK genes were ubiquitously transcribed in strawberry leaves after Podosphaera aphanis inoculation and after treatment with cold, heat, drought, salt and the exogenous hormones abscisic acid, ethephon, methyl jasmonate, and salicylic acid. RT-qPCR showed that six selected FvMAPK genes comprehensively responded to various stimuli. Additionally, interaction networks revealed that the crucial signaling transduction controlled by FvMAPKs may be involved in the biotic and abiotic stress responses. Our results may provide useful information for future research on the function of the MAPK gene family and the genetic improvement of strawberry resistance to environmental stresses. PMID:28562633
Transcriptional profiling of rat skeletal muscle hypertrophy under restriction of blood flow.
Xu, Shouyu; Liu, Xueyun; Chen, Zhenhuang; Li, Gaoquan; Chen, Qin; Zhou, Guoqing; Ma, Ruijie; Yao, Xinmiao; Huang, Xiao
2016-12-15
Blood flow restriction (BFR) under low-intensity resistance training (LIRT) can produce effects upon muscles similar to those of high-intensity resistance training (HIRT) while overcoming many of the restrictions to HIRT that occur in a clinical setting. However, the potential molecular mechanisms of BFR-induced muscle hypertrophy remain largely unknown. Here, using a BFR rat model, we aim to better elucidate the mechanisms regulating muscle hypertrophy as induced by BFR and reveal possible clinical therapeutic targets for atrophy cases. We performed genome-wide screening with microarray analysis to identify unique differentially expressed genes during rat muscle hypertrophy. We then successfully separated the differentially expressed genes from BFR-treated soleus samples by comparing the Affymetrix rat Genome U34 2.0 array with the control. Using qRT-PCR and immunohistochemistry (IHC) we also analyzed other related differentially expressed genes. Results suggested that muscle hypertrophy induced by BFR is essentially regulated by the rate of protein turnover. Specifically, PI3K/AKT and MAPK pathways act as positive regulators in controlling protein synthesis, whereas ubiquitin-proteasome acts as a negative regulator. This represents the first general genome-wide investigation of the gene expression profile in the rat soleus after BFR treatment. This may aid our understanding of the molecular mechanisms regulating and controlling muscle hypertrophy and provide support to the BFR strategies aiming to prevent muscle atrophy in a clinical setting. Copyright © 2016 Elsevier B.V. All rights reserved.
Genome-wide characterization of Mediator recruitment, function, and regulation
2017-01-01
ABSTRACT Mediator is a conserved and essential coactivator complex broadly required for RNA polymerase II (RNAPII) transcription. Recent genome-wide studies of Mediator binding in budding yeast have revealed new insights into the functions of this critical complex and raised new questions about its role in the regulation of gene expression. PMID:28301289
2014-01-01
Background Apple tree breeding is slow and difficult due to long generation times, self-incompatibility, and complex genetics. The identification of molecular markers linked to traits of interest is a way to expedite the breeding process. In the present study, we aimed to identify genes whose steady-state transcript abundance was associated with inheritance of specific traits segregating in an apple (Malus × domestica) rootstock F1 breeding population, including resistance to powdery mildew (Podosphaera leucotricha) disease and woolly apple aphid (Eriosoma lanigerum). Results Transcription profiling was performed for 48 individual F1 apple trees from a cross of two highly heterozygous parents, using RNA isolated from healthy, actively-growing shoot tips and a custom apple DNA oligonucleotide microarray representing 26,000 unique transcripts. Genome-wide expression profiles were not clear indicators of powdery mildew or woolly apple aphid resistance phenotype. However, standard differential gene expression analysis between phenotypic groups of trees revealed relatively small sets of genes with trait-associated expression levels. For example, thirty genes were identified that were differentially expressed between trees resistant and susceptible to powdery mildew. Interestingly, the genes encoding twenty-four of these transcripts were physically clustered on chromosome 12. Similarly, seven genes were identified that were differentially expressed between trees resistant and susceptible to woolly apple aphid, and the genes encoding five of these transcripts were also clustered, this time on chromosome 17. In each case, the gene clusters were in the vicinity of previously identified major quantitative trait loci for the corresponding trait. Similar results were obtained for a series of molecular traits. Several of the differentially expressed genes were used to develop DNA polymorphism markers linked to powdery mildew disease and woolly apple aphid resistance. 
Conclusions Gene expression profiling and trait-associated transcript analysis using an apple F1 population readily identified genes physically linked to powdery mildew disease resistance and woolly apple aphid resistance loci. This result was especially useful in apple, where extreme levels of heterozygosity make the development of reliable DNA markers quite difficult. The results suggest that this approach could prove effective in crops with complicated genetics, or for which few genomic information resources are available. PMID:24708064
Publication Abstract: Philadelphia chromosome-like acute lymphoblastic leukemia (Ph-like ALL) is characterized by a gene-expression profile similar to that of BCR-ABL1-positive ALL, alterations of lymphoid transcription factor genes, and a poor outcome. The frequency and spectrum of genetic alterations in Ph-like ALL and its responsiveness to tyrosine kinase inhibition are undefined, especially in adolescents and adults. We performed genomic profiling of 1725 patients with precursor B-cell ALL and detailed genomic analysis of 154 patients with Ph-like ALL.
Kuang, Jian-Fei; Chen, Jian-Ye; Liu, Xun-Cheng; Han, Yan-Chao; Xiao, Yun-Yi; Shan, Wei; Tang, Yang; Wu, Ke-Qiang; He, Jun-Xian; Lu, Wang-Jin
2017-04-01
Fruit ripening is a complex, genetically programmed process involving the action of critical transcription factors (TFs). Despite the established significance of dehydration-responsive element binding (DREB) TFs in plant abiotic stress responses, the involvement of DREBs in fruit ripening is yet to be determined. Here, we identified four genes encoding ripening-regulated DREB TFs in banana (Musa acuminata), MaDREB1, MaDREB2, MaDREB3, and MaDREB4, and demonstrated that they play regulatory roles in fruit ripening. We showed that MaDREB1-MaDREB4 are nucleus-localized, induced by ethylene and encompass transcriptional activation activities. We performed a genome-wide chromatin immunoprecipitation and high-throughput sequencing (ChIP-Seq) experiment for MaDREB2 and identified 697 genomic regions as potential targets of MaDREB2. MaDREB2 binds to hundreds of loci with diverse functions and its binding sites are distributed in the promoter regions proximal to the transcriptional start site (TSS). Most of the MaDREB2-binding targets contain the conserved (A/G)CC(G/C)AC motif and MaDREB2 appears to directly regulate the expression of a number of genes involved in fruit ripening. In combination with transcriptome profiling (RNA sequencing) data, our results indicate that MaDREB2 may serve as both transcriptional activator and repressor during banana fruit ripening. In conclusion, our study suggests a hierarchical regulatory model of fruit ripening in banana and that the MaDREB TFs may act as transcriptional regulators in the regulatory network. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Borkowski, Olivier; Goelzer, Anne; Schaffer, Marc; Calabre, Magali; Mäder, Ulrike; Aymerich, Stéphane; Jules, Matthieu; Fromion, Vincent
2016-05-17
Complex regulatory programs control cell adaptation to environmental changes by setting condition-specific proteomes. In balanced growth, bacterial protein abundances depend on the dilution rate, transcript abundances and transcript-specific translation efficiencies. We revisited the current theory claiming the invariance of bacterial translation efficiency. By integrating genome-wide transcriptome datasets and datasets from a library of synthetic gfp-reporter fusions, we demonstrated that translation efficiencies in Bacillus subtilis decreased up to fourfold from slow to fast growth. The translation initiation regions elicited a growth rate-dependent, differential production of proteins without regulators, hence revealing a unique, hard-coded, growth rate-dependent mode of regulation. We combined model-based data analyses of transcript and protein abundances genome-wide and revealed that this global regulation is extensively used in B. subtilis. We eventually developed a knowledge-based, three-step translation initiation model, experimentally challenged the model predictions and proposed that a growth rate-dependent drop in free ribosome abundance accounted for the differential protein production. © 2016 The Authors. Published under the terms of the CC BY 4.0 license.
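The dependence of protein abundance on dilution rate, transcript abundance and translation efficiency stated above can be illustrated with a textbook steady-state balance. The sketch below is not the authors' three-step initiation model; it assumes that protein loss is dominated by growth dilution (no active degradation), so that production `k * m` equals dilution `mu * p`. The function and parameter names are hypothetical.

```python
def steady_state_protein(mrna, translation_efficiency, growth_rate):
    """Protein abundance p at balanced growth, from k * m = mu * p.

    Assumes dilution by growth is the only loss term (illustrative
    simplification, not the model used in the study).
    """
    if growth_rate <= 0:
        raise ValueError("growth_rate must be positive")
    return translation_efficiency * mrna / growth_rate


def apparent_translation_efficiency(protein, mrna, growth_rate):
    """Invert the same balance to estimate k from measured p, m and mu."""
    if mrna <= 0:
        raise ValueError("mrna must be positive")
    return protein * growth_rate / mrna


# Toy numbers (arbitrary units): estimate k at a slow and a fast growth rate.
k_slow = apparent_translation_efficiency(protein=40.0, mrna=10.0, growth_rate=0.5)
k_fast = apparent_translation_efficiency(protein=5.0, mrna=10.0, growth_rate=1.0)
```

Under this simplification, a translation efficiency that falls with growth rate shows up as `k_fast < k_slow` when the same transcript is compared across conditions.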
Heinig, Matthias; Adriaens, Michiel E; Schafer, Sebastian; van Deutekom, Hanneke W M; Lodder, Elisabeth M; Ware, James S; Schneider, Valentin; Felkin, Leanne E; Creemers, Esther E; Meder, Benjamin; Katus, Hugo A; Rühle, Frank; Stoll, Monika; Cambien, François; Villard, Eric; Charron, Philippe; Varro, Andras; Bishopric, Nanette H; George, Alfred L; Dos Remedios, Cristobal; Moreno-Moral, Aida; Pesce, Francesco; Bauerfeind, Anja; Rüschendorf, Franz; Rintisch, Carola; Petretto, Enrico; Barton, Paul J; Cook, Stuart A; Pinto, Yigal M; Bezzina, Connie R; Hubner, Norbert
2017-09-14
Genetic variation is an important determinant of RNA transcription and splicing, which in turn contributes to variation in human traits, including cardiovascular diseases. Here we report the first in-depth survey of heart transcriptome variation using RNA-sequencing in 97 patients with dilated cardiomyopathy and 108 non-diseased controls. We reveal extensive differences of gene expression and splicing between dilated cardiomyopathy patients and controls, affecting known as well as novel dilated cardiomyopathy genes. Moreover, we show a widespread effect of genetic variation on the regulation of transcription, isoform usage, and allele-specific expression. Systematic annotation of genome-wide association SNPs identifies 60 functional candidate genes for heart phenotypes, representing 20% of all published heart genome-wide association loci. Focusing on the dilated cardiomyopathy phenotype we found that eQTL variants are also enriched for dilated cardiomyopathy genome-wide association signals in two independent cohorts. RNA transcription, splicing, and allele-specific expression are each important determinants of the dilated cardiomyopathy phenotype and are controlled by genetic factors. Our results represent a powerful resource for the field of cardiovascular genetics.
Mfd translocase is necessary and sufficient for transcription-coupled repair in Escherichia coli.
Adebali, Ogun; Sancar, Aziz; Selby, Christopher P
2017-11-10
Nucleotide excision repair in Escherichia coli is stimulated by transcription, specifically in the transcribed strand. Previously, it was shown that this transcription-coupled repair (TCR) is mediated by the Mfd translocase. Recently, it was proposed that in fact the majority of TCR in E. coli is catalyzed by a second pathway ("backtracking-mediated TCR") that is dependent on the UvrD helicase and the guanosine pentaphosphate (ppGpp) alarmone/stringent response regulator. Recently, we reported that as measured by the excision repair-sequencing (XR-seq), UvrD plays no role in TCR genome-wide. Here, we tested the role of ppGpp and UvrD in TCR genome-wide and in the lacZ operon using the XR-seq method, which directly measures repair. We found that the mfd mutation abolishes TCR genome-wide and in the lacZ operon. In contrast, the relA− spoT− mutant deficient in ppGpp synthesis carries out normal TCR. We conclude that UvrD and ppGpp play no role in TCR in E. coli. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Swindell, William R.; Johnston, Andrew; Carbajal, Steve; Han, Gangwen; Wohn, Christian; Lu, Jun; Xing, Xianying; Nair, Rajan P.; Voorhees, John J.; Elder, James T.; Wang, Xiao-Jing; Sano, Shigetoshi; Prens, Errol P.; DiGiovanni, John; Pittelkow, Mark R.; Ward, Nicole L.; Gudjonsson, Johann E.
2011-01-01
Development of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features of the human disease, and standardized validation criteria for psoriasis mouse models have not been widely applied. In this study, whole-genome transcriptional profiling is used to compare gene expression patterns manifested by human psoriatic skin lesions with those that occur in five psoriasis mouse models (K5-Tie2, imiquimod, K14-AREG, K5-Stat3C and K5-TGFbeta1). While the cutaneous gene expression profiles associated with each mouse phenotype exhibited statistically significant similarity to the expression profile of psoriasis in humans, each model displayed distinctive sets of similarities and differences in comparison to human psoriasis. For all five models, correspondence to the human disease was strong with respect to genes involved in epidermal development and keratinization. Immune and inflammation-associated gene expression, in contrast, was more variable between models as compared to the human disease. These findings support the value of all five models as research tools, each with identifiable areas of convergence to and divergence from the human disease. Additionally, the approach used in this paper provides an objective and quantitative method for evaluation of proposed mouse models of psoriasis, which can be strategically applied in future studies to score strengths of mouse phenotypes relative to specific aspects of human psoriasis. PMID:21483750
Distal Limb Patterning Requires Modulation of cis-Regulatory Activities by HOX13
Sheth, Rushikesh; Barozzi, Iros; Langlais, David; ...
2016-12-13
The combinatorial expression of Hox genes along the body axes is a major determinant of cell fate and plays a pivotal role in generating the animal body plan. Loss of HOXA13 and HOXD13 transcription factors (HOX13) leads to digit agenesis in mice, but how HOX13 proteins regulate transcriptional outcomes and confer identity to the distal-most limb cells has remained elusive. Here, we report on the genome-wide profiling of HOXA13 and HOXD13 in vivo binding and changes of the transcriptome and chromatin state in the transition from the early to the late-distal limb developmental program, as well as in Hoxa13–/–; Hoxd13–/– limbs. Our results show that proper termination of the early limb transcriptional program and activation of the late-distal limb program are coordinated by the dual action of HOX13 on cis-regulatory modules.
Genome-wide Identification and Expression Analysis of the CDPK Gene Family in Grape, Vitis spp.
Zhang, Kai; Han, Yong-Tao; Zhao, Feng-Li; Hu, Yang; Gao, Yu-Rong; Ma, Yan-Fei; Zheng, Yi; Wang, Yue-Jin; Wen, Ying-Qiang
2015-06-30
Calcium-dependent protein kinases (CDPKs) play vital roles in plant growth and development, biotic and abiotic stress responses, and hormone signaling. Little is known about the CDPK gene family in grapevine. In this study, we performed a genome-wide analysis of the 12X grape genome (Vitis vinifera) and identified nineteen CDPK genes. Comparison of the structures of grape CDPK genes allowed us to examine their functional conservation and differentiation. Segmentally duplicated grape CDPK genes showed high structural conservation and contributed to gene family expansion. Additional comparisons between grape and Arabidopsis thaliana demonstrated that several grape CDPK genes occurred in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grapevine and Arabidopsis. Phylogenetic analysis divided the grape CDPK genes into four groups. Furthermore, we examined the expression of the corresponding nineteen homologous CDPK genes in the Chinese wild grape (Vitis pseudoreticulata) under various conditions, including biotic stress, abiotic stress, and hormone treatments. The expression profiles derived from reverse transcription and quantitative PCR suggested that a large number of VpCDPKs responded to various stimuli on the transcriptional level, indicating their versatile roles in the responses to biotic and abiotic stresses. Moreover, we examined the subcellular localization of VpCDPKs by transiently expressing six VpCDPK-GFP fusion proteins in Arabidopsis mesophyll protoplasts; this revealed high variability consistent with potential functional differences. Taken as a whole, our data provide significant insights into the evolution and function of grape CDPKs and a framework for future investigation of grape CDPK genes.
He, Shaolan; Zheng, Yongqiang; Yi, Shilai; Lv, Qiang; Deng, Lie
2014-01-01
The R2R3MYB proteins represent one of the largest families of transcription factors, which play important roles in plant growth and development. Although genome-wide analysis of this family has been conducted in many species, little is known about R2R3MYB genes in citrus. In this study, 101 R2R3MYB genes were identified in the citrus (Citrus sinensis and Citrus clementina) genomes, which is almost equal to the number in rice. Phylogenetic analysis revealed that they could be subdivided into 21 subgroups. The evolutionary relationships and the intron-exon organizations were also analyzed, revealing strong gene conservation but also the expansion of particular functional genes during plant evolution. Tissue-specific expression profiles showed that 95 citrus R2R3MYB genes were expressed in at least one tissue and the other 6 genes showed very low expression in all tissues tested, suggesting that citrus R2R3MYB genes play important roles in the development of all citrus organs. The transcript abundance level analysis during abiotic conditions (NaCl, abscisic acid, jasmonic acid, drought and low temperature) identified a group of R2R3MYB genes that responded to one or multiple treatments, which showed promise for improving citrus adaptation to stresses. Our results provide an essential foundation for the future selection of citrus R2R3MYB genes for cloning and functional dissection with the aim of uncovering their roles in citrus growth and development. PMID:25473954
Muthamilarasan, Mehanathan; Khandelwal, Rohit; Yadav, Chandra Bhan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj
2014-01-01
MYB proteins represent one of the largest transcription factor families in plants, playing important roles in diverse developmental and stress-responsive processes. Considering its significance, several genome-wide analyses have been conducted in almost all land plants except foxtail millet. Foxtail millet (Setaria italica L.) is a model crop for investigating systems biology of millets and bioenergy grasses. Further, the crop is also known for its potential abiotic stress-tolerance. In this context, a comprehensive genome-wide survey was conducted and 209 MYB protein-encoding genes were identified in foxtail millet. All 209 S. italica MYB (SiMYB) genes were physically mapped onto nine chromosomes of foxtail millet. Gene duplication study showed that segmental- and tandem-duplication have occurred in genome resulting in expansion of this gene family. The protein domain investigation classified SiMYB proteins into three classes according to number of MYB repeats present. The phylogenetic analysis categorized SiMYBs into ten groups (I - X). SiMYB-based comparative mapping revealed a maximum orthology between foxtail millet and sorghum, followed by maize, rice and Brachypodium. Heat map analysis showed tissue-specific expression pattern of predominant SiMYB genes. Expression profiling of candidate MYB genes against abiotic stresses and hormone treatments using qRT-PCR revealed specific and/or overlapping expression patterns of SiMYBs. Taken together, the present study provides a foundation for evolutionary and functional characterization of MYB TFs in foxtail millet to dissect their functions in response to environmental stimuli. PMID:25279462
Watanabe, Kazuhide; Biesinger, Jacob; Salmans, Michael L.; Roberts, Brian S.; Arthur, William T.; Cleary, Michele; Andersen, Bogi; Xie, Xiaohui; Dai, Xing
2014-01-01
Background Deregulation of canonical Wnt/CTNNB1 (beta-catenin) pathway is one of the earliest events in the pathogenesis of colon cancer. Mutations in APC or CTNNB1 are highly frequent in colon cancer and cause aberrant stabilization of CTNNB1, which activates the transcription of Wnt target genes by binding to chromatin via the TCF/LEF transcription factors. Here we report an integrative analysis of genome-wide chromatin occupancy of CTNNB1 by chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) and gene expression profiling by microarray analysis upon RNAi-mediated knockdown of CTNNB1 in colon cancer cells. Results We observed 3629 CTNNB1 binding peaks across the genome and a significant correlation between CTNNB1 binding and knockdown-induced gene expression change. Our integrative analysis led to the discovery of a direct Wnt target signature composed of 162 genes. Gene ontology analysis of this signature revealed a significant enrichment of Wnt pathway genes, suggesting multiple feedback regulations of the pathway. We provide evidence that this gene signature partially overlaps with the Lgr5+ intestinal stem cell signature, and is significantly enriched in normal intestinal stem cells as well as in clinical colorectal cancer samples. Interestingly, while the expression of the CTNNB1 target gene set does not correlate with survival, elevated expression of negative feedback regulators within the signature predicts better prognosis. Conclusion Our data provide a genome-wide view of chromatin occupancy and gene regulation of Wnt/CTNNB1 signaling in colon cancer cells. PMID:24651522
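The integrative step described above, combining binding peaks with knockdown-responsive genes to nominate direct targets, can be sketched as a simple intersection. This is an illustrative outline only, not the authors' pipeline: the 10 kb window, the gene names and the rule of assigning a peak to any gene whose transcription start site (TSS) lies within the window are all assumptions.

```python
from collections import namedtuple

# Hypothetical minimal records for a binding peak and an annotated gene.
Peak = namedtuple("Peak", "chrom start end")
Gene = namedtuple("Gene", "name chrom tss")


def genes_near_peaks(peaks, genes, window=10_000):
    """Return names of genes whose TSS lies within `window` bp of any peak."""
    hits = set()
    for gene in genes:
        for peak in peaks:
            same_chrom = peak.chrom == gene.chrom
            if same_chrom and peak.start - window <= gene.tss <= peak.end + window:
                hits.add(gene.name)
                break  # one nearby peak is enough
    return hits


def direct_target_candidates(peaks, genes, responsive_genes, window=10_000):
    """Genes that are both peak-proximal and responsive to the knockdown."""
    return genes_near_peaks(peaks, genes, window) & set(responsive_genes)


# Toy example with made-up coordinates and gene names.
peaks = [Peak("chr1", 1_000, 1_500)]
genes = [
    Gene("geneA", "chr1", 2_000),    # near the peak
    Gene("geneB", "chr1", 50_000),   # same chromosome, too far away
    Gene("geneC", "chr2", 1_200),    # different chromosome
]
candidates = direct_target_candidates(peaks, genes, {"geneA", "geneB"})
```

The nested loop is quadratic and only suitable for small inputs; a genome-scale analysis would normally use an interval index or a dedicated interval-arithmetic tool.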
Tao, Xuelian; Chen, Jianning; Jiang, Yanzhi; Wei, Yingying; Chen, Yan; Xu, Huaming; Zhu, Li; Tang, Guoqing; Li, Mingzhou; Jiang, Anan; Shuai, Surong; Bai, Lin; Liu, Haifeng; Ma, Jideng; Jin, Long; Wen, Anxiang; Wang, Qin; Zhu, Guangxiang; Xie, Meng; Wu, Jiayun; He, Tao; Huang, Chunyu; Gao, Xiang; Li, Xuewei
2017-04-28
N6-methyladenosine (m6A) is the most prevalent internal form of modification in messenger RNA in higher eukaryotes, and potential regulatory functions of reversible m6A methylation on mRNA have been revealed by mapping of m6A methylomes in several species. m6A modification in active gene regulation manifests itself as altered methylation profiles in a tissue-specific manner or in response to changing cellular or species living environment. However, to date, there have been no data on a porcine transcriptome-wide m6A map or its potential biological roles in adipose deposition and muscle growth. In this work, we used the methylated RNA immunoprecipitation with next-generation sequencing (MeRIP-Seq) technique to acquire the first porcine transcriptome-wide m6A map. Transcriptomes of muscle and adipose tissues from three different pig breeds, the wild boar, Landrace, and Rongchang pig, were used to generate these maps. Our findings show that there were 5,872 and 2,826 m6A peaks, respectively, in the porcine muscle and adipose tissue transcriptomes. Stop codons, 3'-untranslated regions, and coding regions were found to be mainly enriched for m6A peaks. Gene ontology analysis revealed that common m6A peaks in nuclear genes are associated with transcriptional factors, suggestive of a relationship between m6A mRNA methylation and nuclear genome transcription. Some genes showed tissue- and breed-differential methylation, and have novel biological functions. We also found a relationship between the m6A methylation extent and the transcript level, suggesting a regulatory role for m6A in gene expression. This comprehensive map provides a solid basis for the determination of potential functional roles for RNA m6A modification in adipose deposition and muscle growth.
Zhou, Hong; Wang, Xia; Yang, Tengteng; Zhang, Weixin; Chen, Guanjun
2016-01-01
Cytophaga hutchinsonii specializes in cellulose digestion by employing a collection of novel cell-associated proteins. Here, we identified a novel gene locus, CHU_1276, that is essential for C. hutchinsonii cellulose utilization. Disruption of CHU_1276 in C. hutchinsonii resulted in complete deficiency in cellulose degradation, as well as compromised assimilation of cellobiose or glucose at a low concentration. Further analysis showed that CHU_1276 was an outer membrane protein that could be induced by cellulose and low concentrations of glucose. Transcriptional profiling revealed that CHU_1276 exerted a profound effect on the genome-wide response to both glucose and Avicel and that the mutant lacking CHU_1276 displayed expression profiles very different from those of the wild-type strain under different culture conditions. Specifically, comparison of their transcriptional responses to cellulose led to the identification of a gene set potentially regulated by CHU_1276. These results suggest that CHU_1276 plays an essential role in cellulose utilization, probably by coordinating the extracellular hydrolysis of cellulose substrate with the intracellular uptake of the hydrolysis product in C. hutchinsonii. PMID:26773084
Transposable elements contribute to activation of maize genes in response to abiotic stress.
Makarevitch, Irina; Waters, Amanda J; West, Patrick T; Stitzer, Michelle; Hirsch, Candice N; Ross-Ibarra, Jeffrey; Springer, Nathan M
2015-01-01
Transposable elements (TEs) account for a large portion of the genome in many eukaryotic species. Despite their reputation as "junk" DNA or genomic parasites deleterious for the host, TEs have complex interactions with host genes and the potential to contribute to regulatory variation in gene expression. It has been hypothesized that TEs and genes they insert near may be transcriptionally activated in response to stress conditions. The maize genome, with many different types of TEs interspersed with genes, provides an ideal system to study the genome-wide influence of TEs on gene regulation. To analyze the magnitude of the TE effect on gene expression response to environmental changes, we profiled gene and TE transcript levels in maize seedlings exposed to a number of abiotic stresses. Many genes exhibit up- or down-regulation in response to these stress conditions. The analysis of TE families inserted within upstream regions of up-regulated genes revealed that between four and nine different TE families are associated with up-regulated gene expression in each of these stress conditions, affecting up to 20% of the genes up-regulated in response to abiotic stress, and as many as 33% of genes that are only expressed in response to stress. Expression of many of these same TE families also responds to the same stress conditions. The analysis of the stress-induced transcripts and proximity of the transposon to the gene suggests that these TEs may provide local enhancer activities that stimulate stress-responsive gene expression. Our data on allelic variation for insertions of several of these TEs show strong correlation between the presence of TE insertions and stress-responsive up-regulation of gene expression. Our findings suggest that TEs provide an important source of allelic regulatory variation in gene response to abiotic stress in maize.
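One way to picture the association reported above between a TE family and stress-induced up-regulation is an over-representation test. The sketch below is illustrative only: the abstract does not say which statistic the authors used, and the function name `hypergeom_enrichment_p` and all counts in the usage example are hypothetical placeholders, not values from the study. It assumes Python 3.8 or later for `math.comb`.

```python
from math import comb


def hypergeom_enrichment_p(total_genes, genes_with_te, upregulated, upregulated_with_te):
    """Upper-tail hypergeometric p-value, P(X >= upregulated_with_te).

    X is the number of up-regulated genes carrying an insertion of a given
    TE family if the up-regulated genes were drawn at random from all genes.
    """
    max_overlap = min(genes_with_te, upregulated)
    denominator = comb(total_genes, upregulated)
    # Sum the probability mass for every overlap at least as large as observed.
    tail = sum(
        comb(genes_with_te, k) * comb(total_genes - genes_with_te, upregulated - k)
        for k in range(upregulated_with_te, max_overlap + 1)
    )
    return tail / denominator


# Hypothetical counts, chosen only to exercise the function.
p_value = hypergeom_enrichment_p(
    total_genes=1000, genes_with_te=50, upregulated=100, upregulated_with_te=20
)
print(f"enrichment p-value: {p_value:.3g}")
```

A small p-value here would only indicate that the overlap is larger than expected by chance; it would not by itself show that the TE acts as an enhancer, which is why the abstract also relies on transcript structure and allelic variation.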
Strand-specific transcriptome profiling with directly labeled RNA on genomic tiling microarrays
2011-01-01
Background With lower manufacturing cost, high spot density, and flexible probe design, genomic tiling microarrays are ideal for comprehensive transcriptome studies. Typically, transcriptome profiling using microarrays involves reverse transcription, which converts RNA to cDNA. The cDNA is then labeled and hybridized to the probes on the arrays, thus the RNA signals are detected indirectly. Reverse transcription is known to generate artifactual cDNA, in particular the synthesis of second-strand cDNA, leading to false discovery of antisense RNA. To address this issue, we have developed an effective method using RNA that is directly labeled, thus by-passing the cDNA generation. This paper describes this method and its application to the mapping of transcriptome profiles. Results RNA extracted from laboratory cultures of Porphyromonas gingivalis was fluorescently labeled with an alkylation reagent and hybridized directly to probes on genomic tiling microarrays specifically designed for this periodontal pathogen. The generated transcriptome profile was strand-specific and produced signals close to background level in most antisense regions of the genome. In contrast, high levels of signal were detected in the antisense regions when the hybridization was done with cDNA. Five antisense areas were tested with independent strand-specific RT-PCR and none to negligible amplification was detected, indicating that the strong antisense cDNA signals were experimental artifacts. Conclusions An efficient method was developed for mapping transcriptome profiles specific to both coding strands of a bacterial genome. This method chemically labels and uses extracted RNA directly in microarray hybridization. The generated transcriptome profile was free of cDNA artifactual signals. 
In addition, this method requires fewer processing steps and is potentially more sensitive in detecting small amount of RNA compared to conventional end-labeling methods due to the incorporation of more fluorescent molecules per RNA fragment. PMID:21235785
Gene expression profile in mesenchymal stem cells derived from dental tissues and bone marrow
Kim, Su-Hwan; Kim, Young-Sung; Lee, Su-Yeon; Kim, Kyoung-Hwa; Lee, Yong-Moo; Kim, Won-Kyung
2011-01-01
Purpose The aim of this study is to compare the gene expression profile in mesenchymal stem cells derived from dental tissues and bone marrow for characterization of dental stem cells. Methods We applied GeneChip analysis to the expression levels of approximately 32,321 kinds of transcripts in 5 samples of bone-marrow-derived mesenchymal stem cells (BMSCs) (n=1), periodontal ligament stem cells (PDLSCs) (n=2), and dental pulp stem cells (DPSCs) (n=2). Each cell was sorted by a FACS Vantage Sorter using immunocytochemical staining of the early mesenchymal stem cell surface marker STRO-1 before the microarray analysis. Results We identified 379 up-regulated and 133 down-regulated transcripts in BMSCs, 68 up-regulated and 64 down-regulated transcripts in PDLSCs, and 218 up-regulated and 231 down-regulated transcripts in DPSCs. In addition, anatomical structure development and anatomical structure morphogenesis gene ontology (GO) terms were over-represented in all three different mesenchymal stem cells, and GO terms related to blood vessels and neurons were over-represented only in DPSCs. Conclusions This study demonstrated the genome-wide gene expression patterns of STRO-1+ mesenchymal stem cells derived from dental tissues and bone marrow. The differences among the expression profiles of BMSCs, PDLSCs, and DPSCs were shown, and 999 candidate genes were found to be definitely up- or down-regulated. In addition, GOstat analyses of regulated gene products provided over-represented GO classes. These data provide a first step for discovering molecules key to the characteristics of dental stem cells. PMID:21954424
Transcriptomic Profiling of Fruit Development in Black Raspberry Rubus coreanus
Hu, Yaodong
2018-01-01
The wild Rubus species R. coreanus, which is widely distributed in southwest China, shows great promise as a genetic resource for breeding. One of its outstanding properties is adaptation to high temperature and humidity. To facilitate its use in selection and breeding programs, we assembled de novo 179,738,287 R. coreanus reads (125 bp in length) generated by RNA sequencing from fruits at three representative developmental stages. We also used the recently released draft genome of R. occidentalis to perform reference-guided assembly. We inferred a final 95,845-transcript reference for R. coreanus. Of these genetic resources, 66,597 (69.5%) were annotated. Based on these results, we carried out a comprehensive analysis of differentially expressed genes. Flavonoid biosynthesis, phenylpropanoid biosynthesis, plant hormone signal transduction, and cutin, suberin, and wax biosynthesis pathways were significantly enriched throughout the ripening process. We identified 23 transcripts involved in the flavonoid biosynthesis pathway whose expression perfectly paralleled changes in the metabolites. Additionally, we identified 119 nucleotide-binding site leucine-rich repeat (NBS-LRR) protein-coding genes, involved in pathogen resistance, of which 74 were in the completely conserved domain. These results provide, for the first time, genome-wide genetic information for understanding developmental regulation of R. coreanus fruits. They have the potential for use in breeding through functional genetic approaches in the near future. PMID:29805970
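The headline figures in this abstract can be checked with a few lines of arithmetic. The snippet below is only a convenience sketch; the variable names are made up for illustration, and the inputs are the read count, read length, transcript count, and annotated count quoted above.

```python
# Figures quoted in the abstract.
read_count = 179_738_287        # RNA-seq reads used for de novo assembly
read_length_bp = 125            # read length in base pairs
total_transcripts = 95_845      # final transcript reference
annotated_transcripts = 66_597  # transcripts with an annotation

# Total sequenced bases and the annotated share of the reference.
total_bases = read_count * read_length_bp
annotated_pct = 100 * annotated_transcripts / total_transcripts

print(f"total bases sequenced: {total_bases:,} bp")
print(f"annotated transcripts: {annotated_pct:.1f}%")
```

The computed share agrees with the 69.5% stated in the abstract.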
Lu, Leina; Zhou, Liang; Chen, Eric Z.; Sun, Kun; Jiang, Peiyong; Wang, Lijun; Su, Xiaoxi; Sun, Hao; Wang, Huating
2012-01-01
microRNAs (miRNAs) are non-coding RNAs that regulate gene expression post-transcriptionally, and mounting evidence supports the prevalence and functional significance of their interplay with transcription factors (TFs). Here we describe the identification of a regulatory circuit between muscle miRNAs (miR-1, miR-133 and miR-206) and Yin Yang 1 (YY1), an epigenetic repressor of skeletal myogenesis in mouse. Genome-wide identification of potential down-stream targets of YY1 by combining computational prediction with expression profiling data reveals a large number of putative miRNA targets of YY1 during skeletal myoblasts differentiation into myotubes with muscle miRs ranking on top of the list. The subsequent experimental results demonstrate that YY1 indeed represses muscle miRs expression in myoblasts and the repression is mediated through multiple enhancers and recruitment of Polycomb complex to several YY1 binding sites. YY1 regulating miR-1 is functionally important for both C2C12 myogenic differentiation and injury-induced muscle regeneration. Furthermore, we demonstrate that miR-1 in turn targets YY1, thus forming a negative feedback loop. Together, these results identify a novel regulatory circuit required for skeletal myogenesis and reinforce the idea that regulatory circuitries involving miRNAs and TFs are prevalent mechanisms. PMID:22319554
USDA-ARS's Scientific Manuscript database
Colony development, which includes hyphal extension, branching, anastomosis and asexual sporulation are fundamental aspects of the lifecycle of filamentous fungi; genetic mechanisms underlying these phenomena are poorly understood. We conducted transcriptional profiling during colony development of...
Zhang, Dingxiao; Park, Daechan; Zhong, Yi; Lu, Yue; Rycaj, Kiera; Gong, Shuai; Chen, Xin; Liu, Xin; Chao, Hsueh-Ping; Whitney, Pamela; Calhoun-Davis, Tammy; Takata, Yoko; Shen, Jianjun; Iyer, Vishwanath R.; Tang, Dean G.
2016-01-01
The prostate gland mainly contains basal and luminal cells constructed as a pseudostratified epithelium. Annotation of prostate epithelial transcriptomes provides a foundation for discoveries that can impact disease understanding and treatment. Here we describe a genome-wide transcriptome analysis of human benign prostatic basal and luminal epithelial populations using deep RNA sequencing. Through molecular and biological characterizations, we show that the differential gene-expression profiles account for their distinct functional properties. Strikingly, basal cells preferentially express gene categories associated with stem cells, neurogenesis and ribosomal RNA (rRNA) biogenesis. Consistent with this profile, basal cells functionally exhibit intrinsic stem-like and neurogenic properties with enhanced rRNA transcription activity. Of clinical relevance, the basal cell gene-expression profile is enriched in advanced, anaplastic, castration-resistant and metastatic prostate cancers. Therefore, we link the cell-type-specific gene signatures to aggressive subtypes of prostate cancer and identify gene signatures associated with adverse clinical features. PMID:26924072
Identification of functional elements and regulatory circuits by Drosophila modENCODE
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roy, Sushmita; Ernst, Jason; Kharchenko, Peter V.
2010-12-22
To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- and tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation. Several years after the complete genetic sequencing of many species, it is still unclear how to translate genomic information into a functional map of cellular and developmental programs. The Encyclopedia of DNA Elements (ENCODE) (1) and model organism ENCODE (modENCODE) (2) projects use diverse genomic assays to comprehensively annotate the Homo sapiens (human), Drosophila melanogaster (fruit fly), and Caenorhabditis elegans (worm) genomes, through systematic generation and computational integration of functional genomic data sets. Previous genomic studies in flies have made seminal contributions to our understanding of basic biological mechanisms and genome functions, facilitated by genetic, experimental, computational, and manual annotation of the euchromatic and heterochromatic genome (3), small genome size, short life cycle, and a deep knowledge of development, gene function, and chromosome biology.
The functions of approximately 40% of the protein and nonprotein-coding genes [FlyBase 5.12 (4)] have been determined from cDNA collections (5, 6), manual curation of gene models (7), gene mutations and comprehensive genome-wide RNA interference screens (8-10), and comparative genomic analyses (11, 12). The Drosophila modENCODE project has generated more than 700 data sets that profile transcripts, histone modifications and physical nucleosome properties, general and specific transcription factors (TFs), and replication programs in cell lines, isolated tissues, and whole organisms across several developmental stages (Fig. 1). Here, we computationally integrate these data sets and report (i) improved and additional genome annotations, including full-length protein-coding genes and peptides as short as 21 amino acids; (ii) noncoding transcripts, including 132 candidate structural RNAs and 1608 nonstructural transcripts; (iii) additional Argonaute (Ago)-associated small RNA genes and pathways, including new microRNAs (miRNAs) encoded within protein-coding exons and endogenous small interfering RNAs (siRNAs) from 3' untranslated regions; (iv) chromatin 'states' defined by combinatorial patterns of 18 chromatin marks that are associated with distinct functions and properties; (v) regions of high TF occupancy and replication activity with likely epigenetic regulation; (vi) mixed TF and miRNA regulatory networks with hierarchical structure and enriched feed-forward loops; (vii) coexpression- and co-regulation-based functional annotations for nearly 3000 genes; (viii) stage- and tissue-specific regulators; and (ix) predictive models of gene expression levels and regulator function.
Damienikan, Aliaksandr U.
2016-01-01
The majority of bacterial genome annotations are currently automated and based on a ‘gene by gene’ approach. Regulatory signals and operon structures are rarely taken into account which often results in incomplete and even incorrect gene function assignments. Here we present SigmoID, a cross-platform (OS X, Linux and Windows) open-source application aiming at simplifying the identification of transcription regulatory sites (promoters, transcription factor binding sites and terminators) in bacterial genomes and providing assistance in correcting annotations in accordance with regulatory information. SigmoID combines a user-friendly graphical interface to well known command line tools with a genome browser for visualising regulatory elements in genomic context. Integrated access to online databases with regulatory information (RegPrecise and RegulonDB) and web-based search engines speeds up genome analysis and simplifies correction of genome annotation. We demonstrate some features of SigmoID by constructing a series of regulatory protein binding site profiles for two groups of bacteria: Soft Rot Enterobacteriaceae (Pectobacterium and Dickeya spp.) and Pseudomonas spp. Furthermore, we inferred over 900 transcription factor binding sites and alternative sigma factor promoters in the annotated genome of Pectobacterium atrosepticum. These regulatory signals control putative transcription units covering about 40% of the P. atrosepticum chromosome. Reviewing the annotation in cases where it didn’t fit with regulatory information allowed us to correct product and gene names for over 300 loci. PMID:27257541
Chymkowitch, Pierre; Nguéa P, Aurélie; Aanes, Håvard; Koehler, Christian J.; Thiede, Bernd; Lorenz, Susanne; Meza-Zepeda, Leonardo A.; Klungland, Arne; Enserink, Jorrit M.
2015-01-01
Transcription factors are abundant Sumo targets, yet the global distribution of Sumo along the chromatin and its physiological relevance in transcription are poorly understood. Using Saccharomyces cerevisiae, we determined the genome-wide localization of Sumo along the chromatin. We discovered that Sumo-enriched genes are almost exclusively involved in translation, such as tRNA genes and ribosomal protein genes (RPGs). Genome-wide expression analysis showed that Sumo positively regulates their transcription. We also discovered that the Sumo consensus motif at RPG promoters is identical to the DNA binding motif of the transcription factor Rap1. We demonstrate that Rap1 is a molecular target of Sumo and that sumoylation of Rap1 is important for cell viability. Furthermore, Rap1 sumoylation promotes recruitment of the basal transcription machinery, and sumoylation of Rap1 cooperates with the target of rapamycin kinase complex 1 (TORC1) pathway to promote RPG transcription. Strikingly, our data reveal that sumoylation of Rap1 functions in a homeostatic feedback loop that sustains RPG transcription during translational stress. Taken together, Sumo regulates the cellular translational capacity by promoting transcription of tRNA genes and RPGs. PMID:25800674
RNA-Seq for gene identification and transcript profiling of three Stevia rebaudiana genotypes.
Chen, Junwen; Hou, Kai; Qin, Peng; Liu, Hongchang; Yi, Bin; Yang, Wenting; Wu, Wei
2014-07-07
Stevia (Stevia rebaudiana) is an important medicinal plant that yields diterpenoid steviol glycosides (SGs). SGs are currently used in the preparation of medicines, food products and nutraceuticals because of their sweetening property (zero calories and about 300 times sweeter than sugar). Recently, some progress has been made in understanding the biosynthesis of SGs in Stevia, but little is known about the molecular mechanisms underlying this process. Additionally, the genomics of Stevia, a non-model species, remains uncharacterized. The recent advent of RNA-Seq, a next generation sequencing technology, provides an opportunity to expand the identification of Stevia genes through in-depth transcript profiling. We present a comprehensive landscape of the transcriptome profiles of three genotypes of Stevia with divergent SG compositions characterized using RNA-Seq. A total of 191,590,282 high-quality reads were generated and then assembled into 171,837 transcripts with an average sequence length of 969 base pairs. A total of 80,160 unigenes were annotated, and 14,211 of the unique sequences were assigned to specific metabolic pathways by the Kyoto Encyclopedia of Genes and Genomes. Gene sequences of all enzymes known to be involved in SG synthesis were examined. A total of 143 UDP-glucosyltransferase (UGT) unigenes were identified, some of which might be involved in SG biosynthesis. The expression patterns of eight of these genes were further confirmed by RT-qPCR. RNA-Seq analysis identified candidate genes encoding enzymes responsible for the biosynthesis of SGs in Stevia, a non-model plant without a reference genome. The transcriptome data from this study yielded new insights into the process of SG accumulation in Stevia. Our results demonstrate that RNA-Seq can be successfully used for gene identification and transcript profiling in a non-model species.
Selective nuclear export of specific classes of mRNA from mammalian nuclei is promoted by GANP
Wickramasinghe, Vihandha O.; Andrews, Robert; Ellis, Peter; Langford, Cordelia; Gurdon, John B.; Stewart, Murray; Venkitaraman, Ashok R.; Laskey, Ronald A.
2014-01-01
The nuclear phase of the gene expression pathway culminates in the export of mature messenger RNAs (mRNAs) to the cytoplasm through nuclear pore complexes. GANP (germinal-centre associated nuclear protein) promotes the transfer of mRNAs bound to the transport factor NXF1 to nuclear pore complexes. Here, we demonstrate that GANP, a subunit of the TRanscription-EXport-2 (TREX-2) mRNA export complex, promotes selective nuclear export of a specific subset of mRNAs whose transport depends on NXF1. Genome-wide gene expression profiling showed that half of the transcripts whose nuclear export was impaired following NXF1 depletion also showed reduced export when GANP was depleted. GANP-dependent transcripts were highly expressed, yet short-lived, and were highly enriched in those encoding central components of the gene expression machinery such as RNA synthesis and processing factors. After injection into Xenopus oocyte nuclei, representative GANP-dependent transcripts showed faster nuclear export kinetics than representative transcripts that were not influenced by GANP depletion. We propose that GANP promotes the nuclear export of specific classes of mRNAs that may facilitate rapid changes in gene expression. PMID:24510098
Identification and characterization of Hoxa9 binding sites in hematopoietic cells
Huang, Yongsheng; Sitwala, Kajal; Bronstein, Joel; Sanders, Daniel; Dandekar, Monisha; Collins, Cailin; Robertson, Gordon; MacDonald, James; Cezard, Timothee; Bilenky, Misha; Thiessen, Nina; Zhao, Yongjun; Zeng, Thomas; Hirst, Martin; Hero, Alfred; Jones, Steven
2012-01-01
The clustered homeobox proteins play crucial roles in development, hematopoiesis, and leukemia, yet the targets they regulate and their mechanisms of action are poorly understood. Here, we identified the binding sites for Hoxa9 and the Hox cofactor Meis1 on a genome-wide level and profiled their associated epigenetic modifications and transcriptional targets. Hoxa9 and the Hox cofactor Meis1 cobind at hundreds of highly evolutionarily conserved sites, most of which are distant from transcription start sites. These sites show high levels of histone H3K4 monomethylation and CBP/P300 binding characteristic of enhancers. Furthermore, a subset of these sites shows enhancer activity in transient transfection assays. Many Hoxa9 and Meis1 binding sites are also bound by PU.1 and other lineage-restricted transcription factors previously implicated in establishment of myeloid enhancers. Conditional Hoxa9 activation is associated with CBP/P300 recruitment, histone acetylation, and transcriptional activation of a network of proto-oncogenes, including Erg, Flt3, Lmo2, Myb, and Sox4. Collectively, this work suggests that Hoxa9 regulates transcription by interacting with enhancers of genes important for hematopoiesis and leukemia. PMID:22072553
Simultaneous live imaging of the transcription and nuclear position of specific genes
Ochiai, Hiroshi; Sugawara, Takeshi; Yamamoto, Takashi
2015-01-01
The relationship between genome organization and gene expression has recently been established. However, the relationships between spatial organization, dynamics, and transcriptional regulation of the genome remain unknown. In this study, we developed a live-imaging method for simultaneous measurements of the transcriptional activity and nuclear position of endogenous genes, which we termed the ‘Real-time Observation of Localization and EXpression (ROLEX)’ system. We demonstrated that ROLEX is highly specific and does not affect the expression level of the target gene. ROLEX enabled detection of sub-genome-wide mobility changes that depended on the state of Nanog transactivation in embryonic stem cells. We believe that the ROLEX system will become a powerful tool for exploring the relationship between transcription and nuclear dynamics in living cells. PMID:26092696
Structural properties of prokaryotic promoter regions correlate with functional features.
Meysman, Pieter; Collado-Vides, Julio; Morett, Enrique; Viola, Roberto; Engelen, Kristof; Laukens, Kris
2014-01-01
The structural properties of the DNA molecule are known to play a critical role in transcription. In this paper, the structural profiles of promoter regions were studied within the context of their diversity and their function for eleven prokaryotic species: Escherichia coli, Klebsiella pneumoniae, Salmonella Typhimurium, Pseudomonas aeruginosa, Geobacter sulfurreducens, Helicobacter pylori, Chlamydophila pneumoniae, Synechocystis sp., Synechococcus elongatus, Bacillus anthracis, and the archaeon Sulfolobus solfataricus. The main anchor points for these promoter regions were transcription start sites identified through high-throughput experiments or collected within large curated databases. Prokaryotic promoter regions were found to be less stable and less flexible than the genomic mean across all studied species. However, direct comparison between species revealed differences in their structural profiles that cannot solely be explained by the difference in genomic GC content. In addition, comparison with functional data revealed that there are patterns in the promoter structural profiles that can be linked to specific functional loci, such as sigma factor regulation or transcription factor binding. Interestingly, a novel structural element clearly visible near the transcription start site was found in genes associated with essential cellular functions and growth in several species. Our analyses reveal the great diversity in promoter structural profiles both between and within prokaryotic species. We observed relationships between structural diversity and functional features that are interesting prospects for further research into as yet uncharacterized functional loci defined by DNA structural properties.
Wei, Li; Xu, Jian
2018-06-01
Epigenetic factors such as histone modifications play integral roles in plant development and stress response, yet their implications in algae remain poorly understood. In the industrial oleaginous microalgae Nannochloropsis spp., the lack of an efficient methodology for chromatin immunoprecipitation (ChIP), which determines the specific genomic location of various histone modifications, has hindered probing the epigenetic basis of their photosynthetic carbon conversion and storage as oil. Here, a detailed ChIP protocol was developed for Nannochloropsis oceanica, which represents a reliable approach for the analysis of histone modifications, chromatin state, and transcription factor-binding sites at the epigenetic level. Using ChIP-qPCR, genes related to photosynthetic carbon fixation in this microalga were systematically assessed. Furthermore, a ChIP-Seq protocol was established and optimized, which generated a genome-wide profile of histone modification events, using histone mark H3K9Ac as an example. These results are the first step for appreciation of the chromatin landscape in industrial oleaginous microalgae and for epigenetics-based microalgal feedstock development. © 2018 Phycological Society of America.
Singh, Amarjeet; Kanwar, Poonam; Pandey, Amita; Tyagi, Akhilesh K.; Sopory, Sudhir K.; Kapoor, Sanjay; Pandey, Girdhar K.
2013-01-01
Background Phospholipase C (PLC) is one of the major lipid hydrolysing enzymes, implicated in lipid mediated signaling. PLCs have been found to play a significant role in abiotic stress triggered signaling and developmental processes in various plant species. Genome wide identification and expression analysis have been carried out for this gene family in Arabidopsis, yet not much has been accomplished in crop plant rice. Methodology/Principal Findings An exhaustive in-silico exploration of rice genome using various online databases and tools resulted in the identification of nine PLC encoding genes. Based on sequence, motif and phylogenetic analysis rice PLC gene family could be divided into phosphatidylinositol-specific PLCs (PI-PLCs) and phosphatidylcholine- PLCs (PC-PLC or NPC) classes with four and five members, respectively. A comparative analysis revealed that PLCs are conserved in Arabidopsis (dicots) and rice (monocot) at gene structure and protein level but they might have evolved through a separate evolutionary path. Transcript profiling using gene chip microarray and quantitative RT-PCR showed that most of the PLC members expressed significantly and differentially under abiotic stresses (salt, cold and drought) and during various developmental stages with condition/stage specific and overlapping expression. This finding suggested an important role of different rice PLC members in abiotic stress triggered signaling and plant development, which was also supported by the presence of relevant cis-regulatory elements in their promoters. Sub-cellular localization of few selected PLC members in Nicotiana benthamiana and onion epidermal cells has provided a clue about their site of action and functional behaviour. Conclusion/Significance The genome wide identification, structural and expression analysis and knowledge of sub-cellular localization of PLC gene family envisage the functional characterization of these genes in crop plants in near future. PMID:23638098
Singh, Amarjeet; Kanwar, Poonam; Pandey, Amita; Tyagi, Akhilesh K; Sopory, Sudhir K; Kapoor, Sanjay; Pandey, Girdhar K
2013-01-01
Phospholipase C (PLC) is one of the major lipid hydrolysing enzymes, implicated in lipid mediated signaling. PLCs have been found to play a significant role in abiotic stress triggered signaling and developmental processes in various plant species. Genome wide identification and expression analysis have been carried out for this gene family in Arabidopsis, yet not much has been accomplished in the crop plant rice. An exhaustive in-silico exploration of the rice genome using various online databases and tools resulted in the identification of nine PLC encoding genes. Based on sequence, motif and phylogenetic analysis, the rice PLC gene family could be divided into phosphatidylinositol-specific PLCs (PI-PLCs) and phosphatidylcholine-PLCs (PC-PLC or NPC) classes with four and five members, respectively. A comparative analysis revealed that PLCs are conserved in Arabidopsis (dicots) and rice (monocot) at gene structure and protein level but they might have evolved through a separate evolutionary path. Transcript profiling using gene chip microarray and quantitative RT-PCR showed that most of the PLC members expressed significantly and differentially under abiotic stresses (salt, cold and drought) and during various developmental stages with condition/stage specific and overlapping expression. This finding suggested an important role of different rice PLC members in abiotic stress triggered signaling and plant development, which was also supported by the presence of relevant cis-regulatory elements in their promoters. Sub-cellular localization of a few selected PLC members in Nicotiana benthamiana and onion epidermal cells has provided a clue about their site of action and functional behaviour. The genome wide identification, structural and expression analysis and knowledge of sub-cellular localization of PLC gene family envisage the functional characterization of these genes in crop plants in the near future.
Lv, Xiaolong; Lan, Shanrong; Guy, Kateta Malangisha; Yang, Jinghua; Zhang, Mingfang; Hu, Zhongyuan
2016-01-01
Watermelon (Citrullus lanatus) is a xerophyte that has relatively higher tolerance to drought and salt stresses, as well as greater sensitivity to cold stress, compared with most model plants. These characteristics make it a potential model crop for research on salt, drought or cold tolerance. In this study, a genome-wide comprehensive analysis of the ClNAC transcription factor (TF) family was carried out for the first time, to investigate their transcriptional profiles and potential functions in response to these abiotic stresses. The expression profiling analysis reveals that several NAC TFs are highly responsive to abiotic stresses and development; for instance, subfamily IV NACs may play roles in maintaining water status under drought or salt conditions, as well as in water and metabolite conduction and translocation toward fruit. In contrast, rapid and negative responses of most of the ClNACs to low-temperature adversity may be related to the sensitivity to cold stress. Crosstalk among these abiotic stresses and hormone (abscisic acid and jasmonic acid) pathways was also discussed based on the expression of ClNAC genes. Our results will provide useful insights for the functional mining of the NAC family in watermelon, as well as into the mechanisms underlying abiotic tolerance in other cash crops. PMID:27491393
Lv, Xiaolong; Lan, Shanrong; Guy, Kateta Malangisha; Yang, Jinghua; Zhang, Mingfang; Hu, Zhongyuan
2016-08-05
Watermelon (Citrullus lanatus) is a xerophyte that has relatively higher tolerance to drought and salt stresses, as well as greater sensitivity to cold stress, compared with most model plants. These characteristics make it a potential model crop for research on salt, drought or cold tolerance. In this study, a genome-wide comprehensive analysis of the ClNAC transcription factor (TF) family was carried out for the first time, to investigate their transcriptional profiles and potential functions in response to these abiotic stresses. The expression profiling analysis reveals that several NAC TFs are highly responsive to abiotic stresses and development; for instance, subfamily IV NACs may play roles in maintaining water status under drought or salt conditions, as well as in water and metabolite conduction and translocation toward fruit. In contrast, rapid and negative responses of most of the ClNACs to low-temperature adversity may be related to the sensitivity to cold stress. Crosstalk among these abiotic stresses and hormone (abscisic acid and jasmonic acid) pathways was also discussed based on the expression of ClNAC genes. Our results will provide useful insights for the functional mining of the NAC family in watermelon, as well as into the mechanisms underlying abiotic tolerance in other cash crops.
Zeng, Changying; Ding, Zehong; Zhou, Fang; Zhou, Yufei; Yang, Ruiju; Yang, Zi; Wang, Wenquan; Peng, Ming
2017-12-12
Background: Cassava, an important tropical crop, has remarkable drought tolerance, but is very sensitive to cold. The growth, development, and root productivity of cassava are all adversely affected under cold and drought. Methods: To profile the transcriptional response to cold and drought stresses, cassava seedlings were respectively subjected to 0, 6, 24, and 48 h of cold stress and 0, 4, 6, and 10 days of drought stress. Their folded leaves, fully extended leaves, and roots were respectively investigated using RNA-seq. Results: Many genes specifically and commonly responsive to cold and drought were revealed: genes related to basic cellular metabolism, tetrapyrrole synthesis, and brassinosteroid metabolism exclusively responded to cold; genes related to abiotic stress and ethylene metabolism exclusively responded to drought; and genes related to cell wall, photosynthesis, and carbohydrate metabolism, DNA synthesis/chromatin structure, abscisic acid and salicylic acid metabolism, and calcium signaling commonly responded to both cold and drought. Discussion: Combined with cold- and/or drought-responsive transcription factors, the regulatory networks responding to cold and drought in cassava were constructed. All these findings will improve our understanding of the specific and common responses to cold and drought in cassava, and shed light on genetic improvement of cold and drought tolerance in cassava.
Aguilar, Carlos A.; Shcherbina, Anna; Ricke, Darrell O.; Pop, Ramona; Carrigan, Christopher T.; Gifford, Casey A.; Urso, Maria L.; Kottke, Melissa A.; Meissner, Alexander
2015-01-01
Traumatic lower-limb musculoskeletal injuries are pervasive amongst athletes and the military, and typically an individual returns to activity prior to fully healing, increasing a predisposition for additional injuries and chronic pain. Monitoring healing progression after a musculoskeletal injury typically involves different types of imaging but these approaches suffer from several disadvantages. Isolating and profiling transcripts from the injured site would abrogate these shortcomings and provide enumerative insights into the regenerative potential of an individual’s muscle after injury. In this study, a traumatic injury was administered to a mouse model and healing progression was examined from 3 hours to 1 month using high-throughput RNA-Sequencing (RNA-Seq). Comprehensive dissection of the genome-wide datasets revealed the injured site to be a dynamic, heterogeneous environment composed of multiple cell types and thousands of genes undergoing significant expression changes in highly regulated networks. Four independent approaches were used to determine the set of genes, isoforms, and genetic pathways most characteristic of different time points post-injury and two novel approaches were developed to classify injured tissues at different time points. These results highlight the possibility to quantitatively track healing progression in situ via transcript profiling using high-throughput sequencing. PMID:26381351
Li, Zhiqian; Zhang, Chen; Guo, Yurui; Niu, Weili; Wang, Yuejin; Xu, Yan
2017-09-21
The HD-Zip family has a diversity of functions during plant development. In this study, we identify 33 HD-Zip transcription factors in grape and detect their expression in ovules and somatic embryos, as well as in various vegetative organs. A genome-wide survey for HD-Zip transcription factors in Vitis was conducted based on the 12X grape genome (V. vinifera L.). A total of 33 members were identified and classified into four subfamilies (I-IV) based on phylogenetic analysis with Arabidopsis, rice and maize. VvHDZs in the same subfamily have similar protein motifs and intron/exon structures. An evaluation of duplication events suggests several HD-Zip genes arose before the divergence of the grape and Arabidopsis lineages. The 33 members of HD-Zip were differentially expressed in ovules of the stenospermic grape, Thompson Seedless, and of the seeded grape, Pinot noir. Most showed higher expression during ovule abortion in Thompson Seedless. In addition, transcripts of the HD-Zip family were also detected in somatic embryogenesis of Thompson Seedless and in different vegetative organs of Thompson Seedless at varying levels. Additionally, VvHDZ28 is located in the nucleus and has transcriptional activity consistent with the typical features of the HD-Zip family. Our results provide a foundation for future grape HD-Zip gene function research. The identification and expression profiles of the HD-Zip transcription factors in grape reveal their diverse roles during ovule abortion and organ development. Our results lay a foundation for functional analysis of grape HDZ genes.
Grade, Marian; Hörmann, Patrick; Becker, Sandra; Hummon, Amanda B.; Wangsa, Danny; Varma, Sudhir; Simon, Richard; Liersch, Torsten; Becker, Heinz; Difilippantonio, Michael J.; Ghadimi, B. Michael; Ried, Thomas
2016-01-01
To characterize patterns of global transcriptional deregulation in primary colon carcinomas, we did gene expression profiling of 73 tumors [Unio Internationale Contra Cancrum stage II (n = 33) and stage III (n = 40)] using oligonucleotide microarrays. For 30 of the tumors, expression profiles were compared with those from matched normal mucosa samples. We identified a set of 1,950 genes with highly significant deregulation between tumors and mucosa samples (P < 1e–7). A significant proportion of these genes mapped to chromosome 20 (P = 0.01). Seventeen genes had a >5-fold average expression difference between normal colon mucosa and carcinomas, including up-regulation of MYC and of HMGA1, a putative oncogene. Furthermore, we identified 68 genes that were significantly differentially expressed between lymph node–negative and lymph node–positive tumors (P < 0.001), the functional annotation of which revealed a preponderance of genes that play a role in cellular immune response and surveillance. The microarray-derived gene expression levels of 20 deregulated genes were validated using quantitative real-time reverse transcription-PCR in >40 tumor and normal mucosa samples with good concordance between the techniques. Finally, we established a relationship between specific genomic imbalances, which were mapped for 32 of the analyzed colon tumors by comparative genomic hybridization, and alterations of global transcriptional activity. Previously, we had conducted a similar analysis of primary rectal carcinomas. The systematic comparison of colon and rectal carcinomas revealed a significant overlap of genomic imbalances and transcriptional deregulation, including activation of the Wnt/β-catenin signaling cascade, suggesting similar pathogenic pathways. PMID:17210682
Grade, Marian; Hörmann, Patrick; Becker, Sandra; Hummon, Amanda B; Wangsa, Danny; Varma, Sudhir; Simon, Richard; Liersch, Torsten; Becker, Heinz; Difilippantonio, Michael J; Ghadimi, B Michael; Ried, Thomas
2007-01-01
To characterize patterns of global transcriptional deregulation in primary colon carcinomas, we did gene expression profiling of 73 tumors [Unio Internationale Contra Cancrum stage II (n = 33) and stage III (n = 40)] using oligonucleotide microarrays. For 30 of the tumors, expression profiles were compared with those from matched normal mucosa samples. We identified a set of 1,950 genes with highly significant deregulation between tumors and mucosa samples (P < 1e-7). A significant proportion of these genes mapped to chromosome 20 (P = 0.01). Seventeen genes had a >5-fold average expression difference between normal colon mucosa and carcinomas, including up-regulation of MYC and of HMGA1, a putative oncogene. Furthermore, we identified 68 genes that were significantly differentially expressed between lymph node-negative and lymph node-positive tumors (P < 0.001), the functional annotation of which revealed a preponderance of genes that play a role in cellular immune response and surveillance. The microarray-derived gene expression levels of 20 deregulated genes were validated using quantitative real-time reverse transcription-PCR in >40 tumor and normal mucosa samples with good concordance between the techniques. Finally, we established a relationship between specific genomic imbalances, which were mapped for 32 of the analyzed colon tumors by comparative genomic hybridization, and alterations of global transcriptional activity. Previously, we had conducted a similar analysis of primary rectal carcinomas. The systematic comparison of colon and rectal carcinomas revealed a significant overlap of genomic imbalances and transcriptional deregulation, including activation of the Wnt/beta-catenin signaling cascade, suggesting similar pathogenic pathways.
Ayyappan, Vasudevan; Kalavacharla, Venu; Thimmapuram, Jyothi; Bhide, Ketaki P; Sripathi, Venkateswara R; Smolinski, Tomasz G; Manoharan, Muthusamy; Thurston, Yaqoob; Todd, Antonette; Kingham, Bruce
2015-01-01
Histone modifications such as methylation and acetylation play a significant role in controlling gene expression in unstressed and stressed plants. Genome-wide analysis of such stress-responsive modifications and genes in non-model crops is limited. We report the genome-wide profiling of histone methylation (H3K9me2) and acetylation (H4K12ac) in common bean (Phaseolus vulgaris L.) under rust (Uromyces appendiculatus) stress using two high-throughput approaches, chromatin immunoprecipitation sequencing (ChIP-Seq) and RNA sequencing (RNA-Seq). ChIP-Seq analysis revealed 1,235 and 556 histone methylation and acetylation responsive genes from common bean leaves treated with the rust pathogen at 0, 12 and 84 hour-after-inoculation (hai), while RNA-Seq analysis identified 145 and 1,763 genes differentially expressed between mock-inoculated and inoculated plants. The combined ChIP-Seq and RNA-Seq analyses identified some key defense responsive genes (calmodulin, cytochrome p450, chitinase, DNA Pol II, and LRR) and transcription factors (WRKY, bZIP, MYB, HSFB3, GRAS, NAC, and NMRA) in the bean-rust interaction. Differential methylation and acetylation affected a large proportion of stress-responsive genes including resistance (R) proteins, detoxifying enzymes, and genes involved in ion flux and cell death. The genes identified were functionally classified using Gene Ontology (GO) and EuKaryotic Orthologous Groups (KOGs). The Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis identified a putative pathway with ten key genes involved in plant-pathogen interactions. This first report of an integrated analysis of histone modifications and gene expression involved in the bean-rust interaction provides a comprehensive resource for other epigenomic regulation studies in non-model species under stress.
Thimmapuram, Jyothi; Bhide, Ketaki P.; Sripathi, Venkateswara R.; Smolinski, Tomasz G.; Manoharan, Muthusamy; Thurston, Yaqoob; Todd, Antonette; Kingham, Bruce
2015-01-01
Histone modifications such as methylation and acetylation play a significant role in controlling gene expression in unstressed and stressed plants. Genome-wide analysis of such stress-responsive modifications and genes in non-model crops is limited. We report the genome-wide profiling of histone methylation (H3K9me2) and acetylation (H4K12ac) in common bean (Phaseolus vulgaris L.) under rust (Uromyces appendiculatus) stress using two high-throughput approaches, chromatin immunoprecipitation sequencing (ChIP-Seq) and RNA sequencing (RNA-Seq). ChIP-Seq analysis revealed 1,235 and 556 histone methylation and acetylation responsive genes from common bean leaves treated with the rust pathogen at 0, 12 and 84 hour-after-inoculation (hai), while RNA-Seq analysis identified 145 and 1,763 genes differentially expressed between mock-inoculated and inoculated plants. The combined ChIP-Seq and RNA-Seq analyses identified some key defense responsive genes (calmodulin, cytochrome p450, chitinase, DNA Pol II, and LRR) and transcription factors (WRKY, bZIP, MYB, HSFB3, GRAS, NAC, and NMRA) in the bean-rust interaction. Differential methylation and acetylation affected a large proportion of stress-responsive genes including resistance (R) proteins, detoxifying enzymes, and genes involved in ion flux and cell death. The genes identified were functionally classified using Gene Ontology (GO) and EuKaryotic Orthologous Groups (KOGs). The Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis identified a putative pathway with ten key genes involved in plant-pathogen interactions. This first report of an integrated analysis of histone modifications and gene expression involved in the bean-rust interaction provides a comprehensive resource for other epigenomic regulation studies in non-model species under stress. PMID:26167691
Dumitriu, Alexandra; Latourelle, Jeanne C; Hadzi, Tiffany C; Pankratz, Nathan; Garza, Dan; Miller, John P; Vance, Jeffery M; Foroud, Tatiana; Beach, Thomas G; Myers, Richard H
2012-06-01
Parkinson disease (PD) is a complex neurodegenerative disorder with largely unknown genetic mechanisms. While the degeneration of dopaminergic neurons in PD mainly takes place in the substantia nigra pars compacta (SN) region, other brain areas, including the prefrontal cortex, develop Lewy bodies, the neuropathological hallmark of PD. We generated and analyzed expression data from the prefrontal cortex Brodmann Area 9 (BA9) of 27 PD and 26 control samples using the 44K One-Color Agilent 60-mer Whole Human Genome Microarray. All samples were male, without significant Alzheimer disease pathology and with extensive pathological annotation available. 507 of the 39,122 analyzed expression probes were different between PD and control samples at false discovery rate (FDR) of 5%. One of the genes with significantly increased expression in PD was the forkhead box O1 (FOXO1) transcription factor. Notably, genes carrying the FoxO1 binding site were significantly enriched in the FDR-significant group of genes (177 genes covered by 189 probes), suggesting a role for FoxO1 upstream of the observed expression changes. Single-nucleotide polymorphisms (SNPs) selected from a recent meta-analysis of PD genome-wide association studies (GWAS) were successfully genotyped in 50 out of the 53 microarray brains, allowing a targeted expression-SNP (eSNP) analysis for 52 SNPs associated with PD affection at genome-wide significance and the 189 probes from FoxO1 regulated genes. A significant association was observed between a SNP in the cyclin G associated kinase (GAK) gene and a probe in the spermine oxidase (SMOX) gene. Further examination of the FOXO1 region in a meta-analysis of six available GWAS showed two SNPs significantly associated with age at onset of PD. These results implicate FOXO1 as a PD-relevant gene and warrant further functional analyses of its transcriptional regulatory mechanisms.
Dumitriu, Alexandra; Latourelle, Jeanne C.; Hadzi, Tiffany C.; Pankratz, Nathan; Garza, Dan; Miller, John P.; Vance, Jeffery M.; Foroud, Tatiana; Beach, Thomas G.; Myers, Richard H.
2012-01-01
Parkinson disease (PD) is a complex neurodegenerative disorder with largely unknown genetic mechanisms. While the degeneration of dopaminergic neurons in PD mainly takes place in the substantia nigra pars compacta (SN) region, other brain areas, including the prefrontal cortex, develop Lewy bodies, the neuropathological hallmark of PD. We generated and analyzed expression data from the prefrontal cortex Brodmann Area 9 (BA9) of 27 PD and 26 control samples using the 44K One-Color Agilent 60-mer Whole Human Genome Microarray. All samples were male, without significant Alzheimer disease pathology and with extensive pathological annotation available. 507 of the 39,122 analyzed expression probes were different between PD and control samples at false discovery rate (FDR) of 5%. One of the genes with significantly increased expression in PD was the forkhead box O1 (FOXO1) transcription factor. Notably, genes carrying the FoxO1 binding site were significantly enriched in the FDR-significant group of genes (177 genes covered by 189 probes), suggesting a role for FoxO1 upstream of the observed expression changes. Single-nucleotide polymorphisms (SNPs) selected from a recent meta-analysis of PD genome-wide association studies (GWAS) were successfully genotyped in 50 out of the 53 microarray brains, allowing a targeted expression-SNP (eSNP) analysis for 52 SNPs associated with PD affection at genome-wide significance and the 189 probes from FoxO1 regulated genes. A significant association was observed between a SNP in the cyclin G associated kinase (GAK) gene and a probe in the spermine oxidase (SMOX) gene. Further examination of the FOXO1 region in a meta-analysis of six available GWAS showed two SNPs significantly associated with age at onset of PD. These results implicate FOXO1 as a PD-relevant gene and warrant further functional analyses of its transcriptional regulatory mechanisms. PMID:22761592
Tsirigos, Aristotelis; Lin, Zhao; Pavlides, Stephanos; Wang, Chengwang; Flomenberg, Neal; Knudsen, Erik S; Howell, Anthony; Pestell, Richard G
2011-01-01
Previously, we showed that high-energy metabolites (lactate and ketones) “fuel” tumor growth and experimental metastasis in an in vivo xenograft model, most likely by driving oxidative mitochondrial metabolism in breast cancer cells. To mechanistically understand how these metabolites affect tumor cell behavior, here we used genome-wide transcriptional profiling. Human breast cancer cells (MCF7) were cultured with lactate or ketones, and then subjected to transcriptional analysis (exon-array). Interestingly, our results show that treatment with these high-energy metabolites increases the transcriptional expression of gene profiles normally associated with “stemness”, including genes upregulated in embryonic stem (ES) cells. Similarly, we observe that lactate and ketones promote the growth of bona fide ES cells, providing functional validation. The lactate- and ketone-induced “gene signatures” were able to predict poor clinical outcome (including recurrence and metastasis) in human breast cancer patients. Taken together, our results are consistent with the idea that lactate and ketone utilization in cancer cells promotes the “cancer stem cell” phenotype, resulting in significant decreases in patient survival. One possible mechanism by which high-energy metabolites might induce stemness is by increasing the pool of Acetyl-CoA, leading to increased histone acetylation and elevated gene expression. Thus, our results mechanistically imply that clinical outcome in breast cancer could simply be determined by epigenetics and energy metabolism, rather than by the accumulation of specific “classical” gene mutations. We also suggest that high-risk cancer patients (identified by the lactate/ketone gene signatures) could be treated with new therapeutics that target oxidative mitochondrial metabolism, such as the anti-oxidant and “mitochondrial poison” metformin.
Finally, we propose that this new approach to personalized cancer medicine be termed “metabolo-genomics,” which incorporates features of both (1) cell metabolism and (2) gene transcriptional profiling. This powerful new approach directly links cancer cell metabolism with clinical outcome, and suggests new therapeutic strategies for inhibiting the TCA cycle and mitochondrial oxidative phosphorylation in cancer cells. PMID:21512313
Caldwell, Julie M.; Collins, Margaret H.; Stucke, Emily M.; Putnam, Philip E.; Franciosi, James P.; Kushner, Jonathan P.; Abonia, J. Pablo; Rothenberg, Marc E.
2014-01-01
Background: The definition of eosinophilic gastritis (EG) is currently limited to histological EG based on the tissue eosinophil count. Objective: We aimed to provide additional fundamental information about the molecular, histopathological, and clinical characteristics of EG. Methods: Genome-wide transcript profiles and histological features of gastric biopsies as well as blood eosinophil numbers were analyzed in EG and control patients (n = 15 each). Results: The peak gastric antrum eosinophil count was 282.7 ± 163.9 eosinophils/400X high-power field (HPF) in EG and 11.0 ± 8.5 eosinophils/HPF in control patients (P = 6.1 × 10^−7). EG patients (87%) had co-existing eosinophilic inflammation in multiple gastrointestinal segments; the esophagus represented the most common secondary site. Elevated peripheral blood eosinophil numbers (EG 1.09 ± 0.88 × 10^3 [K]/μl vs. control 0.09 ± 0.08 K/μl, P = .0027) positively correlated with peak gastric eosinophil counts (Pearson r^2 = .8102, P < .0001). MIB-1+ (proliferating), CD117+ (mast cells), and FOXP3+ cells (regulatory and/or activated T cells) were increased in EG. Transcript profiling revealed changes in 8% of the genome in EG gastric tissue. Only 7% of this EG transcriptome overlapped with the eosinophilic esophagitis (EoE) transcriptome. Significantly increased IL4, IL5, IL13, IL17, CCL26 and mast cell-specific transcripts and decreased IL33 were observed. Conclusion: EG is a systemic disorder involving profound blood and gastrointestinal tract eosinophilia, Th2 immunity, and a conserved gastric transcriptome markedly distinct from the EoE transcriptome. The data herein define germane cellular and molecular pathways of EG and provide a basis for improving diagnosis and treatment. PMID:25234644
A systems analysis of the chemosensitivity of breast cancer cells to the polyamine analogue PG-11047
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kuo, Wen-Lin; Das, Debopriya; Ziyad, Safiyyah
2009-11-14
Polyamines regulate important cellular functions and polyamine dysregulation frequently occurs in cancer. The objective of this study was to use a systems approach to study the relative effects of PG-11047, a polyamine analogue, across breast cancer cells derived from different patients and to identify genetic markers associated with differential cytotoxicity. A panel of 48 breast cell lines that mirror many transcriptional and genomic features present in primary human breast tumours was used to study the antiproliferative activity of PG-11047. Sensitive cell lines were further examined for cell cycle distribution and apoptotic response. Cell line responses, quantified by the GI50 (dose required for 50% relative growth inhibition), were correlated with the omic profiles of the cell lines to identify markers that predict response and cellular functions associated with drug sensitivity. The concentrations of PG-11047 needed to inhibit growth of members of the panel of breast cell lines varied over a wide range, with basal-like cell lines being inhibited at lower concentrations than the luminal cell lines. Sensitive cell lines showed a significant decrease in S phase fraction at doses that produced little apoptosis. Correlation of the GI50 values with the omic profiles of the cell lines identified genomic, transcriptional and proteomic variables associated with response. A 13-gene transcriptional marker set was developed as a predictor of response to PG-11047 that warrants clinical evaluation. Analyses of the pathways, networks and genes associated with response to PG-11047 suggest that response may be influenced by interferon signaling and differential inhibition of aspects of motility and epithelial to mesenchymal transition.
Tu, Ying; Xu, Dan; Feng, Jiaqi; He, Li
2017-01-01
Sensitive skin (SS) is a condition of subjective cutaneous hyper-reactivity. The role of long non-coding RNAs (lncRNAs) in subjects with SS is unclear. Therefore, the aim of the present study was to provide a comprehensive profile of the mRNAs and lncRNAs in subjects with SS. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis presented the characteristics of associated protein-coding genes. In addition, a co-expression network of lncRNA and mRNA was constructed to identify potential underlying regulation targets; the results were verified by quantitative real-time PCR (qRT-PCR) and RNA-seq analyses in patients with SS and normal samples. Compared with the normal skin group, 266 novel lncRNAs and 6750 annotated lncRNAs were identified in the SS group. A total of 71 lncRNA transcripts and 2615 mRNA transcripts were differentially expressed (P < 0.05). The heat signature of the SS samples could be distinguished from the normal skin samples, whereas the majority of the genes that were present in enriched pathways were those that participated in focal adhesion, PI3K-Akt signaling, and cancer-related pathways. Five transcripts were selected for qRT-PCR analysis and the results were consistent with RNA-seq. The results suggested that LNC_000265 may play a role in the epidermal barrier structure of patients with SS. The data suggest novel genes and pathways that may be involved in the pathogenesis of SS and highlight potential targets that could be used for individualized treatment applications. PMID:29383128
Thomas, Rachael; Borst, Luke; Rotroff, Daniel; Motsinger-Reif, Alison; Lindblad-Toh, Kerstin; Modiano, Jaime F.; Breen, Matthew
2017-01-01
Canine hemangiosarcoma is a highly aggressive vascular neoplasm associated with extensive clinical and anatomical heterogeneity and a grave prognosis. Comprehensive molecular characterization of hemangiosarcoma may identify novel therapeutic targets and advanced clinical management strategies, but there are no published reports of tumor-associated genome instability and disrupted gene dosage in this cancer. We performed genome-wide microarray-based somatic DNA copy number profiling of 75 primary intra-abdominal hemangiosarcomas from five popular dog breeds that are highly predisposed to this disease. The cohort exhibited limited global genomic instability, compared to other canine sarcomas studied to date, and DNA copy number aberrations (CNAs) were predominantly of low amplitude. Recurrent imbalances of several key cancer-associated genes were evident; however the global penetrance of any single CNA was low and no distinct hallmark aberrations were evident. Copy number gains of dog chromosomes 13, 24 and 31, and loss of chromosome 16, were the most recurrent CNAs involving large chromosome regions, but their relative distribution within and between cases suggests they most likely represent passenger aberrations. CNAs involving CDKN2A, VEGFA and the SKI oncogene were identified as potential driver aberrations of hemangiosarcoma development, highlighting potential targets for therapeutic modulation. CNA profiles were broadly conserved between the five breeds, although subregional variation was evident, including a near two-fold lower incidence of VEGFA gain in Golden Retrievers versus other breeds (22% versus 40%). These observations support prior transcriptional studies suggesting that the clinical heterogeneity of this cancer may reflect the existence of multiple, molecularly-distinct subtypes of canine hemangiosarcoma. PMID:24599718
Thomas, Rachael; Borst, Luke; Rotroff, Daniel; Motsinger-Reif, Alison; Lindblad-Toh, Kerstin; Modiano, Jaime F; Breen, Matthew
2014-09-01
Canine hemangiosarcoma is a highly aggressive vascular neoplasm associated with extensive clinical and anatomical heterogeneity and a grave prognosis. Comprehensive molecular characterization of hemangiosarcoma may identify novel therapeutic targets and advanced clinical management strategies, but there are no published reports of tumor-associated genome instability and disrupted gene dosage in this cancer. We performed genome-wide microarray-based somatic DNA copy number profiling of 75 primary intra-abdominal hemangiosarcomas from five popular dog breeds that are highly predisposed to this disease. The cohort exhibited limited global genomic instability, compared to other canine sarcomas studied to date, and DNA copy number aberrations (CNAs) were predominantly of low amplitude. Recurrent imbalances of several key cancer-associated genes were evident; however, the global penetrance of any single CNA was low and no distinct hallmark aberrations were evident. Copy number gains of dog chromosomes 13, 24, and 31, and loss of chromosome 16, were the most recurrent CNAs involving large chromosome regions, but their relative distribution within and between cases suggests they most likely represent passenger aberrations. CNAs involving CDKN2A, VEGFA, and the SKI oncogene were identified as potential driver aberrations of hemangiosarcoma development, highlighting potential targets for therapeutic modulation. CNA profiles were broadly conserved between the five breeds, although subregional variation was evident, including a near twofold lower incidence of VEGFA gain in Golden Retrievers versus other breeds (22 versus 40 %). These observations support prior transcriptional studies suggesting that the clinical heterogeneity of this cancer may reflect the existence of multiple, molecularly distinct subtypes of canine hemangiosarcoma.
Genome wide gene expression regulation by HIP1 Protein Interactor, HIPPI: prediction and validation.
Datta, Moumita; Choudhury, Ananyo; Lahiri, Ansuman; Bhattacharyya, Nitai P
2011-09-26
HIP1 Protein Interactor (HIPPI) is a pro-apoptotic protein that induces Caspase8 mediated apoptosis in cell. We have shown earlier that HIPPI could interact with a specific 9 bp sequence motif, defined as the HIPPI binding site (HBS), present in the upstream promoter of Caspase1 gene and regulate its expression. We also have shown that HIPPI, without any known nuclear localization signal, could be transported to the nucleus by HIP1, a NLS containing nucleo-cytoplasmic shuttling protein. Thus our present work aims at the investigation of the role of HIPPI as a global transcription regulator. We carried out genome wide search for the presence of HBS in the upstream sequences of genes. Our result suggests that HBS was predominantly located within 2 Kb upstream from transcription start site. Transcription factors like CREBP1, TBP, OCT1, EVI1 and P53 half site were significantly enriched in the 100 bp vicinity of HBS indicating that they might co-operate with HIPPI for transcription regulation. To illustrate the role of HIPPI on transcriptome, we performed gene expression profiling by microarray. Exogenous expression of HIPPI in HeLa cells resulted in up-regulation of 580 genes (p < 0.05) while 457 genes were down-regulated. Several transcription factors including CBP, REST, C/EBP beta were altered by HIPPI in this study. HIPPI also interacted with P53 in the protein level. This interaction occurred exclusively in the nuclear compartment and was absent in cells where HIP1 was knocked down. HIPPI-P53 interaction was necessary for HIPPI mediated up-regulation of Caspase1 gene. Finally, we analyzed published microarray data obtained with post mortem brains of Huntington's disease (HD) patients to investigate the possible involvement of HIPPI in HD pathogenesis. We observed that along with the transcription factors like CREB, P300, SREBP1, Sp1 etc. which are already known to be involved in HD, HIPPI binding site was also significantly over-represented in the upstream sequences of genes altered in HD. Taken together, the results suggest that HIPPI could act as an important transcription regulator in cell regulating a vast array of genes, particularly transcription factors and at least, in part, play a role in transcription deregulation observed in HD.
Genome wide gene expression regulation by HIP1 Protein Interactor, HIPPI: Prediction and validation
2011-01-01
Background HIP1 Protein Interactor (HIPPI) is a pro-apoptotic protein that induces Caspase8 mediated apoptosis in cell. We have shown earlier that HIPPI could interact with a specific 9 bp sequence motif, defined as the HIPPI binding site (HBS), present in the upstream promoter of Caspase1 gene and regulate its expression. We also have shown that HIPPI, without any known nuclear localization signal, could be transported to the nucleus by HIP1, a NLS containing nucleo-cytoplasmic shuttling protein. Thus our present work aims at the investigation of the role of HIPPI as a global transcription regulator. Results We carried out genome wide search for the presence of HBS in the upstream sequences of genes. Our result suggests that HBS was predominantly located within 2 Kb upstream from transcription start site. Transcription factors like CREBP1, TBP, OCT1, EVI1 and P53 half site were significantly enriched in the 100 bp vicinity of HBS indicating that they might co-operate with HIPPI for transcription regulation. To illustrate the role of HIPPI on transcriptome, we performed gene expression profiling by microarray. Exogenous expression of HIPPI in HeLa cells resulted in up-regulation of 580 genes (p < 0.05) while 457 genes were down-regulated. Several transcription factors including CBP, REST, C/EBP beta were altered by HIPPI in this study. HIPPI also interacted with P53 in the protein level. This interaction occurred exclusively in the nuclear compartment and was absent in cells where HIP1 was knocked down. HIPPI-P53 interaction was necessary for HIPPI mediated up-regulation of Caspase1 gene. Finally, we analyzed published microarray data obtained with post mortem brains of Huntington's disease (HD) patients to investigate the possible involvement of HIPPI in HD pathogenesis. We observed that along with the transcription factors like CREB, P300, SREBP1, Sp1 etc. which are already known to be involved in HD, HIPPI binding site was also significantly over-represented in the upstream sequences of genes altered in HD. Conclusions Taken together, the results suggest that HIPPI could act as an important transcription regulator in cell regulating a vast array of genes, particularly transcription factors and at least, in part, play a role in transcription deregulation observed in HD. PMID:21943362
Epigenetics, chromatin and genome organization: recent advances from the ENCODE project.
Siggens, L; Ekwall, K
2014-09-01
The organization of the genome into functional units, such as enhancers and active or repressed promoters, is associated with distinct patterns of DNA and histone modifications. The Encyclopedia of DNA Elements (ENCODE) project has advanced our understanding of the principles of genome, epigenome and chromatin organization, identifying hundreds of thousands of potential regulatory regions and transcription factor binding sites. Part of the ENCODE consortium, GENCODE, has annotated the human genome with novel transcripts including new noncoding RNAs and pseudogenes, highlighting transcriptional complexity. Many disease variants identified in genome-wide association studies are located within putative enhancer regions defined by the ENCODE project. Understanding the principles of chromatin and epigenome organization will help to identify new disease mechanisms, biomarkers and drug targets, particularly as ongoing epigenome mapping projects generate data for primary human cell types that play important roles in disease. © 2014 The Association for the Publication of the Journal of Internal Medicine.
Transcript profiling reveals expression differences in wild-type and glabrous soybean lines
2011-01-01
Background Trichome hairs affect diverse agronomic characters such as seed weight and yield, prevent insect damage and reduce loss of water but their molecular control has not been extensively studied in soybean. Several detailed models for trichome development have been proposed for Arabidopsis thaliana, but their applicability to important crops such as cotton and soybean is not fully known. Results Two high throughput transcript sequencing methods, Digital Gene Expression (DGE) Tag Profiling and RNA-Seq, were used to compare the transcriptional profiles in wild-type (cv. Clark standard, CS) and a mutant (cv. Clark glabrous, i.e., trichomeless or hairless, CG) soybean isoline that carries the dominant P1 allele. DGE data and RNA-Seq data were mapped to the cDNAs (Glyma models) predicted from the reference soybean genome, Williams 82. Extending the model length by 250 bp at both ends resulted in significantly more matches of authentic DGE tags indicating that many of the predicted gene models are prematurely truncated at the 5' and 3' UTRs. The genome-wide comparative study of the transcript profiles of the wild-type versus mutant line revealed a number of differentially expressed genes. One highly-expressed gene, Glyma04g35130, in wild-type soybean was of interest as it has high homology to the cotton gene GhRDL1 gene that has been identified as being involved in cotton fiber initiation and is a member of the BURP protein family. Sequence comparison of Glyma04g35130 among Williams 82 with our sequences derived from CS and CG isolines revealed various SNPs and indels including addition of one nucleotide C in the CG and insertion of ~60 bp in the third exon of CS that causes a frameshift mutation and premature truncation of peptides in both lines as compared to Williams 82. 
Conclusion Although not a candidate for the P1 locus, a BURP family member (Glyma04g35130) from soybean has been shown to be abundantly expressed in the CS line and very weakly expressed in the glabrous CG line. RNA-Seq and DGE data are compared and provide experimental data on the expression of predicted soybean gene models as well as an overview of the genes expressed in young shoot tips of two closely related isolines. PMID:22029708
PI3K/Akt-dependent functions of TFII-I transcription factors in mouse embryonic stem cells.
Chimge, Nyam-Osor; Makeyev, Aleksandr V; Waigel, Sabine J; Enkhmandakh, Badam; Bayarsaihan, Dashzeveg
2012-04-01
Activation of PI3K/Akt signaling is sufficient to maintain the pluripotency of mouse embryonic stem cells (mESC) and results in down-regulation of Gtf2i and Gtf2ird1 encoding TFII-I family transcription factors. To investigate how these genes might be involved in the process of embryonic stem cell differentiation, we performed expression microarray profiling of mESC upon inhibition of PI3K by LY294002. This analysis revealed significant alterations in expression of genes for specific subsets of chromatin-modifying enzymes. Surprisingly, genome-wide promoter ChIP-chip mapping indicated that the majority of differently expressed genes could be direct targets of TFII-I regulation. The data support the hypothesis that upregulation of TFII-I factors leads to activation of a specific group of developmental genes during mESC differentiation. © 2011 Wiley Periodicals, Inc.
The SAGA/TREX-2 subunit Sus1 binds widely to transcribed genes and affects mRNA turnover globally.
García-Molinero, Varinia; García-Martínez, José; Reja, Rohit; Furió-Tarí, Pedro; Antúnez, Oreto; Vinayachandran, Vinesh; Conesa, Ana; Pugh, B Franklin; Pérez-Ortín, José E; Rodríguez-Navarro, Susana
2018-03-29
Eukaryotic transcription is regulated through two complexes, the general transcription factor IID (TFIID) and the coactivator Spt-Ada-Gcn5 acetyltransferase (SAGA). Recent findings confirm that both TFIID and SAGA contribute to the synthesis of nearly all transcripts and are recruited genome-wide in yeast. However, how this broad recruitment confers selectivity under specific conditions remains an open question. Here we find that the SAGA/TREX-2 subunit Sus1 associates with upstream regulatory regions of many yeast genes and that heat shock drastically changes Sus1 binding. While Sus1 binding to TFIID-dominated genes is not affected by temperature, its recruitment to SAGA-dominated genes and RP genes is significantly disturbed under heat shock, with Sus1 relocated to environmental stress-responsive genes in these conditions. Moreover, in contrast to recent results showing that SAGA deubiquitinating enzyme Ubp8 is dispensable for RNA synthesis, genomic run-on experiments demonstrate that Sus1 contributes to synthesis and stability of a wide range of transcripts. Our study provides support for a model in which SAGA/TREX-2 factor Sus1 acts as a global transcriptional regulator in yeast but has differential activity at yeast genes as a function of their transcription rate or during stress conditions.
Epigenetics of obesity: beyond the genome sequence.
Cordero, Paul; Li, Jiawei; Oben, Jude A
2015-07-01
After the study of the gene code as a trigger for obesity, epigenetic code has appeared as a novel tool in the diagnosis, prognosis and treatment of obesity, and its related comorbidities. This review summarizes the status of the epigenetic field associated with obesity, and the current epigenetic-based approaches for obesity treatment. Thanks to technical advances, novel and key obesity-associated polymorphisms have been described by genome-wide association studies, but there are limitations with their predictive power. Epigenetics is also studied for disease association, which involves decoding of the genome information, transcriptional status and later phenotypes. Obesity could be induced during adult life by feeding and other environmental factors, and there is a strong association between obesity features and specific epigenetic patterns. These patterns could be established during early life stages, and programme the risk of obesity and its comorbidities during adult life. Furthermore, recent studies have shown that DNA methylation profile could be applied as biomarkers of diet-induced weight loss treatment. High-throughput technologies, recently implemented for commercial genetic test panels, could soon lead to the creation of epigenetic test panels for obesity. Nonetheless, epigenetics is a modifiable risk factor, and different dietary patterns or environmental insights during distinct stages of life could lead to rewriting of the epigenetic profile.
Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Mishra, Awdhesh Kumar; Khandelwal, Rohit; Khan, Yusuf; Roy, Riti; Prasad, Manoj
2014-09-01
C2H2 type of zinc finger transcription factors (TFs) play crucial roles in plant stress response and hormone signal transduction. Hence considering its importance, genome-wide investigation and characterization of C2H2 zinc finger proteins were performed in Arabidopsis, rice and poplar but no such study was conducted in foxtail millet which is a C4 Panicoid model crop well known for its abiotic stress tolerance. The present study identified 124 C2H2-type zinc finger TFs in foxtail millet (SiC2H2) and physically mapped them onto the genome. The gene duplication analysis revealed that SiC2H2s primarily expanded in the genome through tandem duplication. The phylogenetic tree classified these TFs into five groups (I-V). Further, miRNAs targeting SiC2H2 transcripts in foxtail millet were identified. Heat map demonstrated differential and tissue-specific expression patterns of these SiC2H2 genes. Comparative physical mapping between foxtail millet SiC2H2 genes and its orthologs of sorghum, maize and rice revealed the evolutionary relationships of C2H2 type of zinc finger TFs. The duplication and divergence data provided novel insight into the evolutionary aspects of these TFs in foxtail millet and related grass species. Expression profiling of candidate SiC2H2 genes in response to salinity, dehydration and cold stress showed differential expression pattern of these genes at different time points of stresses.
A sequence-based survey of the complex structural organization of tumor genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Collins, Colin; Raphael, Benjamin J.; Volik, Stanislav
2008-04-03
The genomes of many epithelial tumors exhibit extensive chromosomal rearrangements. All classes of genome rearrangements can be identified using End Sequencing Profiling (ESP), which relies on paired-end sequencing of cloned tumor genomes. In this study, brain, breast, ovary and prostate tumors along with three breast cancer cell lines were surveyed with ESP yielding the largest available collection of sequence-ready tumor genome breakpoints and providing evidence that some rearrangements may be recurrent. Sequencing and fluorescence in situ hybridization (FISH) confirmed translocations and complex tumor genome structures that include coamplification and packaging of disparate genomic loci with associated molecular heterogeneity. Comparison of the tumor genomes suggests recurrent rearrangements. Some are likely to be novel structural polymorphisms, whereas others may be bona fide somatic rearrangements. A recurrent fusion transcript in breast tumors and a constitutional fusion transcript resulting from a segmental duplication were identified. Analysis of end sequences for single nucleotide polymorphisms (SNPs) revealed candidate somatic mutations and an elevated rate of novel SNPs in an ovarian tumor. These results suggest that the genomes of many epithelial tumors may be far more dynamic and complex than previously appreciated and that genomic fusions including fusion transcripts and proteins may be common, possibly yielding tumor-specific biomarkers and therapeutic targets.
Prielhofer, Roland; Cartwright, Stephanie P; Graf, Alexandra B; Valli, Minoska; Bill, Roslyn M; Mattanovich, Diethard; Gasser, Brigitte
2015-03-11
The methylotrophic, Crabtree-negative yeast Pichia pastoris is widely used as a heterologous protein production host. Strong inducible promoters derived from methanol utilization genes or constitutive glycolytic promoters are typically used to drive gene expression. Notably, genes involved in methanol utilization are not only repressed by the presence of glucose, but also by glycerol. This unusual regulatory behavior prompted us to study the regulation of carbon substrate utilization in different bioprocess conditions on a genome wide scale. We performed microarray analysis on the total mRNA population as well as mRNA that had been fractionated according to ribosome occupancy. Translationally quiescent mRNAs were defined as being associated with single ribosomes (monosomes) and highly-translated mRNAs with multiple ribosomes (polysomes). We found that despite their lower growth rates, global translation was most active in methanol-grown P. pastoris cells, followed by excess glycerol- or glucose-grown cells. Transcript-specific translational responses were found to be minimal, while extensive transcriptional regulation was observed for cells grown on different carbon sources. Due to their respiratory metabolism, cells grown in excess glucose or glycerol had very similar expression profiles. Genes subject to glucose repression were mainly involved in the metabolism of alternative carbon sources including the control of glycerol uptake and metabolism. Peroxisomal and methanol utilization genes were confirmed to be subject to carbon substrate repression in excess glucose or glycerol, but were found to be strongly de-repressed in limiting glucose-conditions (as are often applied in fed batch cultivations) in addition to induction by methanol. P. pastoris cells grown in excess glycerol or glucose have similar transcript profiles in contrast to S. cerevisiae cells, in which the transcriptional response to these carbon sources is very different. 
The main response to different growth conditions in P. pastoris is transcriptional; translational regulation was not transcript-specific. The high proportion of mRNAs associated with polysomes in methanol-grown cells is a major finding of this study; it reveals that high productivity during methanol induction is directly linked to the growth condition and not only to promoter strength.
Genome-wide identification and analysis of the chicken basic helix-loop-helix factors.
Liu, Wu-Yi; Zhao, Chun-Jiang
2010-01-01
Members of the basic helix-loop-helix (bHLH) family of transcription factors play important roles in a wide range of developmental processes. In this study, we conducted a genome-wide survey using the chicken (Gallus gallus) genomic database, and identified 104 bHLH sequences belonging to 42 gene families in an effort to characterize the chicken bHLH transcription factor family. Phylogenetic analyses revealed that chicken has 50, 21, 15, 4, 8, and 3 bHLH members in groups A, B, C, D, E, and F, respectively, while three members belonging to none of these groups were classified as "orphans". A comparison between chicken and human bHLH repertoires suggested that both organisms have a number of lineage-specific bHLH members in the proteomes. Chromosome distribution patterns and phylogenetic analyses strongly suggest that the bHLH members should have arisen through gene duplication at an early date. Gene Ontology (GO) enrichment statistics showed 51 top GO annotations of biological processes counted in the frequency. The present study deepens our understanding of the chicken bHLH transcription factor family and provides much useful information for further studies using chicken as a model system.
ATRX Directs Binding of PRC2 to Xist RNA and Polycomb Targets
Sarma, Kavitha; Cifuentes-Rojas, Catherine; Ergun, Ayla; del Rosario, Amanda; Jeon, Yesu; White, Forest; Sadreyev, Ruslan; Lee, Jeannie T.
2015-01-01
SUMMARY X chromosome inactivation (XCI) depends on the long noncoding RNA Xist and its recruitment of Polycomb Repressive Complex 2 (PRC2). PRC2 is also targeted to other sites throughout the genome to effect transcriptional repression. Using XCI as a model, we apply an unbiased proteomics approach to isolate Xist and PRC2 regulators and identified ATRX. ATRX unexpectedly functions as a high-affinity RNA-binding protein that directly interacts with RepA/Xist RNA to promote loading of PRC2 in vivo. Without ATRX, PRC2 cannot load onto Xist RNA nor spread in cis along the X chromosome. Moreover, epigenomic profiling reveals that genome-wide targeting of PRC2 depends on ATRX, as loss of ATRX leads to spatial redistribution of PRC2 and derepression of Polycomb responsive genes. Thus, ATRX is a required specificity determinant for PRC2 targeting and function. PMID:25417162
Coate, Jeremy E; Doyle, Jeff J
2010-01-01
Evolutionary biologists are increasingly comparing gene expression patterns across species. Due to the way in which expression assays are normalized, such studies provide no direct information about expression per gene copy (dosage responses) or per cell and can give a misleading picture of genes that are differentially expressed. We describe an assay for estimating relative expression per cell. When used in conjunction with transcript profiling data, it is possible to compare the sizes of whole transcriptomes, which in turn makes it possible to compare expression per cell for each gene in the transcript profiling data set. We applied this approach, using quantitative reverse transcriptase-polymerase chain reaction and high throughput RNA sequencing, to a recently formed allopolyploid and showed that its leaf transcriptome was approximately 1.4-fold larger than either progenitor transcriptome (70% of the sum of the progenitor transcriptomes). In contrast, the allopolyploid genome is 94.3% as large as the sum of its progenitor genomes and retains ≥93.5% of the sum of its progenitor gene complements. Thus, "transcriptome downsizing" is greater than genome downsizing. Using this transcriptome size estimate, we inferred dosage responses for several thousand genes and showed that the majority exhibit partial dosage compensation. Homoeologue silencing is nonrandomly distributed across dosage responses, with genes showing extreme responses in either direction significantly more likely to have a silent homoeologue. This experimental approach will add value to transcript profiling experiments involving interspecies and interploidy comparisons by converting expression per transcriptome to expression per genome, eliminating the need for assumptions about transcriptome size.
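The size arithmetic in the abstract above can be checked with a short sketch. It assumes, as the phrase "either progenitor" suggests, that the two progenitor transcriptomes are roughly equal in size and are normalized to 1.0. The function names and the example share value are hypothetical illustrations, not taken from the paper, and the second function shows only one plausible reading of "converting expression per transcriptome" to a per-cell scale.

```python
def fraction_of_progenitor_sum(fold_vs_each_progenitor, n_progenitors=2):
    """Polyploid transcriptome size as a fraction of the summed progenitors.

    Assumption: every progenitor transcriptome has the same size (1.0),
    so their sum is simply n_progenitors.
    """
    return fold_vs_each_progenitor / n_progenitors


def expression_per_cell(share_of_transcriptome, relative_transcriptome_size):
    """Rescale a gene's share of the transcriptome to a per-cell quantity.

    Assumption: per-cell expression is proportional to the gene's share of
    all transcripts multiplied by the relative size of the transcriptome.
    """
    return share_of_transcriptome * relative_transcriptome_size


if __name__ == "__main__":
    # 1.4-fold larger than either progenitor -> 70% of the progenitor sum
    print(f"{fraction_of_progenitor_sum(1.4):.0%}")
    # Hypothetical gene holding 0.1% of transcripts in a 1.4x transcriptome
    print(expression_per_cell(0.001, 1.4))
```

Under these assumptions, a gene with an unchanged share of the transcriptome would still be expressed 1.4-fold higher per cell in the allopolyploid than in a progenitor, which is why the abstract argues that transcriptome size must be estimated rather than assumed.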
Melamed, Anat; Laydon, Daniel J.; Gillet, Nicolas A.; Tanaka, Yuetsu; Taylor, Graham P.; Bangham, Charles R. M.
2013-01-01
The regulation of proviral latency is a central problem in retrovirology. We postulate that the genomic integration site of human T lymphotropic virus type 1 (HTLV-1) determines the pattern of expression of the provirus, which in turn determines the abundance and pathogenic potential of infected T cell clones in vivo. We recently developed a high-throughput method for the genome-wide amplification, identification and quantification of proviral integration sites. Here, we used this protocol to test two hypotheses. First, that binding sites for transcription factors and chromatin remodelling factors in the genome flanking the proviral integration site of HTLV-1 are associated with integration targeting, spontaneous proviral expression, and in vivo clonal abundance. Second, that the transcriptional orientation of the HTLV-1 provirus relative to that of the nearest host gene determines spontaneous proviral expression and in vivo clonal abundance. Integration targeting was strongly associated with the presence of a binding site for specific host transcription factors, especially STAT1 and p53. The presence of the chromatin remodelling factors BRG1 and INI1 and certain host transcription factors either upstream or downstream of the provirus was associated respectively with silencing or spontaneous expression of the provirus. Cells expressing HTLV-1 Tax protein were significantly more frequent in clones of low abundance in vivo. We conclude that transcriptional interference and chromatin remodelling are critical determinants of proviral latency in natural HTLV-1 infection. PMID:23555266
Mlh1 deficiency in normal mouse colon mucosa associates with chromosomally unstable colon cancer
Pussila, Marjaana; Törönen, Petri; Einarsdottir, Elisabet; Katayama, Shintaro; Krjutškov, Kaarel; Holm, Liisa; Kere, Juha; Peltomäki, Päivi; Mäkinen, Markus J; Linden, Jere; Nyström, Minna
2018-01-01
Abstract Colorectal cancer (CRC) genome is unstable and different types of instabilities, such as chromosomal instability (CIN) and microsatellite instability (MSI) are thought to reflect distinct cancer initiating mechanisms. Although 85% of sporadic CRC reveal CIN, 15% reveal mismatch repair (MMR) malfunction and MSI, the hallmarks of Lynch syndrome with inherited heterozygous germline mutations in MMR genes. Our study was designed to comprehensively follow genome-wide expression changes and their implications during colon tumorigenesis. We conducted a long-term feeding experiment in the mouse to address expression changes arising in histologically normal colonic mucosa as putative cancer preceding events, and the effect of inherited predisposition (Mlh1+/−) and Western-style diet (WD) on those. During the 21-month experiment, carcinomas developed mainly in WD-fed mice and were evenly distributed between genotypes. Unexpectedly, the heterozygote (B6.129-Mlh1tm1Rak) mice did not show MSI in their CRCs. Instead, both wildtype and heterozygote CRC mice showed a distinct mRNA expression profile and shortage of several chromosomal segregation gene-specific transcripts (Mlh1, Bub1, Mis18a, Tpx2, Rad9a, Pms2, Cenpe, Ncapd3, Odf2 and Dclre1b) in their colon mucosa, as well as an increased mitotic activity and abundant numbers of unbalanced/atypical mitoses in tumours. Our genome-wide expression profiling experiment demonstrates that cancer preceding changes are already seen in histologically normal colon mucosa and that decreased expressions of Mlh1 and other chromosomal segregation genes may form a field-defect in mucosa, which trigger MMR-proficient, chromosomally unstable CRC. PMID:29701748
Islam, Md Shiful; Choudhury, Mouraj; Majlish, Al-Nahian Khan; Islam, Tahmina; Ghosh, Ajit
2018-01-10
Glutathione S-transferases (GSTs) are ubiquitous enzymes which play versatile functions including cellular detoxification and stress tolerance. In this study, a comprehensive genome-wide identification of GST gene family was carried out in potato (Solanum tuberosum L.). The result demonstrated the presence of at least 90 GST genes in potato which is greater than any other reported species. According to the phylogenetic analyses of Arabidopsis, rice and potato GST members, GSTs could be subdivided into ten different classes and each class is found to be highly conserved. The largest class of potato GST family is tau with 66 members, followed by phi and lambda. The chromosomal localization analysis revealed the highly uneven distribution of StGST genes across the potato genome. Transcript profiling of 55 StGST genes showed the tissue-specific expression for most of the members. Moreover, expression of StGST genes was mainly repressed in response to abiotic stresses, while largely induced in response to biotic and hormonal elicitations. Further analysis of StGST gene promoters identified the presence of various stress responsive cis-regulatory elements. Moreover, one of the highly stress responsive StGST members, StGSTU46, showed strong affinity towards flurazole with lowest binding energy of -7.6 kcal/mol that could be used as antidote to protect crop against herbicides. These findings will facilitate the further functional and evolutionary characterization of GST genes in potato. Copyright © 2017 Elsevier B.V. All rights reserved.
Gene and miRNA expression profiles in autism spectrum disorders.
Ghahramani Seno, Mohammad M; Hu, Pingzhao; Gwadry, Fuad G; Pinto, Dalila; Marshall, Christian R; Casallo, Guillermo; Scherer, Stephen W
2011-03-22
Accumulating data indicate that there is significant genetic heterogeneity underlying the etiology in individuals diagnosed with autism spectrum disorder (ASD). Some rare and highly-penetrant gene variants and copy number variation (CNV) regions including NLGN3, NLGN4, NRXN1, SHANK2, SHANK3, PTCHD1, 1q21.1, maternally-inherited duplication of 15q11-q13, 16p11.2, amongst others, have been identified to be involved in ASD. Genome-wide association studies have identified other apparently low risk loci and in some other cases, ASD arises as a co-morbid phenotype with other medical genetic conditions (e.g. fragile X). The progress studying the genetics of ASD has largely been accomplished using genomic analyses of germline-derived DNA. Here, we used gene and miRNA expression profiling using cell-line derived total RNA to evaluate possible transcripts and networks of molecules involved in ASD. Our analysis identified several novel dysregulated genes and miRNAs in ASD compared with controls, including HEY1, SOX9, miR-486 and miR-181b. All of these are involved in nervous system development and function and some others, for example, are involved in NOTCH signaling networks (e.g. HEY1). Further, we found significant enrichment in molecules associated with neurological disorders such as Rett syndrome and those associated with nervous system development and function including long-term potentiation. Our data will provide a valuable resource for discovery purposes and for comparison to other gene expression-based, genome-wide DNA studies and other functional data. Copyright © 2010 Elsevier B.V. All rights reserved.
USDA-ARS's Scientific Manuscript database
Transcriptional profiles of soybean (Glycine max, L. Merr) near isogenic lines Clark (PI548553, iron efficient) and IsoClark (PI547430, iron inefficient) were analyzed and compared using the Affymetrix® GeneChip® Soybean Genome Array. A comparison of plants grown under Fe-sufficient and Fe-limited ...
Wong, Chui E; Bhalla, Prem L; Ottenhof, Harald; Singh, Mohan B
2008-01-01
Background Despite the importance of the shoot apical meristem (SAM) in plant development and organ formation, our understanding of the molecular mechanisms controlling its function is limited. Genomic tools have the potential to unravel the molecular mysteries of the SAM, and legume systems are increasingly being used in plant-development studies owing to their unique characteristics such as nitrogen fixation, secondary metabolism, and pod development. Garden pea (Pisum sativum) is a well-established classic model species for genetics studies that has been used since the Mendel era. In addition, the availability of a plethora of developmental mutants makes pea an ideal crop legume for genomics studies. This study aims to utilise genomics tools in isolating genes that play potential roles in the regulation of SAM activity. Results In order to identify genes that are differentially expressed in the SAM, we generated 2735 ESTs from three cDNA libraries derived from freshly micro-dissected SAMs from 10-day-old garden peas (Pisum sativum cv Torsdag). Custom-designed oligonucleotide arrays were used to compare the transcriptional profiles of pea SAMs and non-meristematic tissues. A total of 184 and 175 transcripts were significantly up- or down-regulated in the pea SAM, respectively. As expected, close to 61% of the transcripts down-regulated in the SAM were found in the public database, whereas sequences from the same source only comprised 12% of the genes that were expressed at higher levels in the SAM. This highlights the under-representation of transcripts from the meristematic tissues in the current public pea protein database, and demonstrates the utility of our SAM EST collection as an essential genetic resource for revealing further information on the regulation of this developmental process. 
In addition to unknowns, many of the up-regulated transcripts are known to encode products associated with cell division and proliferation, epigenetic regulation, auxin-mediated responses and microRNA regulation. Conclusion The presented data provide a picture of the transcriptional profile of the pea SAM, and reveal possible roles of differentially expressed transcripts in meristem function and maintenance. PMID:18590528
Kirby, Marie K; Ramaker, Ryne C; Roberts, Brian S; Lasseigne, Brittany N; Gunther, David S; Burwell, Todd C; Davis, Nicholas S; Gulzar, Zulfiqar G; Absher, Devin M; Cooper, Sara J; Brooks, James D; Myers, Richard M
2017-04-17
Current diagnostic tools for prostate cancer lack specificity and sensitivity for detecting very early lesions. DNA methylation is a stable genomic modification that is detectable in peripheral patient fluids such as urine and blood plasma that could serve as a non-invasive diagnostic biomarker for prostate cancer. We measured genome-wide DNA methylation patterns in 73 clinically annotated fresh-frozen prostate cancers and 63 benign-adjacent prostate tissues using the Illumina Infinium HumanMethylation450 BeadChip array. We overlaid the most significantly differentially methylated sites in the genome with transcription factor binding sites measured by the Encyclopedia of DNA Elements consortium. We used logistic regression and receiver operating characteristic curves to assess the performance of candidate diagnostic models. We identified methylation patterns that have a high predictive power for distinguishing malignant prostate tissue from benign-adjacent prostate tissue, and these methylation signatures were validated using data from The Cancer Genome Atlas Project. Furthermore, by overlaying ENCODE transcription factor binding data, we observed an enrichment of enhancer of zeste homolog 2 binding in gene regulatory regions with higher DNA methylation in malignant prostate tissues. DNA methylation patterns are greatly altered in prostate cancer tissue in comparison to benign-adjacent tissue. We have discovered patterns of DNA methylation marks that can distinguish prostate cancers with high specificity and sensitivity in multiple patient tissue cohorts, and we have identified transcription factors binding in these differentially methylated regions that may play important roles in prostate cancer development.
Wang, Jun; Sun, Na; Deng, Ting; Zhang, Lida; Zuo, Kaijing
2014-11-06
Heat shock transcriptional factors (Hsfs) play important roles in the processes of biotic and abiotic stresses as well as in plant development. Cotton (Gossypium hirsutum, 2n=4x=(AD)2=52) is an important crop for natural fiber production. Due to continuous high temperature and intermittent drought, heat stress is becoming a handicap to improve cotton yield and lint quality. Recently, the related wild diploid species Gossypium raimondii genome (2n=2x=(D5)2=26) has been fully sequenced. In order to analyze the functions of different Hsfs at the genome-wide level, detailed characterization and analysis of the Hsf gene family in G. hirsutum is indispensable. EST assembly and genome-wide analyses were applied to clone and identify heat shock transcription factor (Hsf) genes in Upland cotton (GhHsf). Forty GhHsf genes were cloned, identified and classified into three main classes (A, B and C) according to the characteristics of their domains. Analysis of gene duplications showed that GhHsfs have occurred more frequently than reported in plant genomes such as Arabidopsis and Populus. Quantitative real-time PCR (qRT-PCR) showed that all GhHsf transcripts are expressed in most cotton plant tissues including roots, stems, leaves and developing fibers, and abundantly in developing ovules. Three expression patterns were confirmed in GhHsfs when cotton plants were exposed to high temperature for 1 h. GhHsf39 exhibited the most immediate response to heat shock. Comparative analysis of Hsfs expression differences between the wild-type and fiberless mutant suggested that Hsfs are involved in fiber development. Comparative genome analysis showed that Upland cotton D-subgenome contains 40 Hsf members, and that the whole genome of Upland cotton contains more than 80 Hsf genes due to genome duplication. The expression patterns in different tissues in response to heat shock showed that GhHsfs are important for heat stress as well as fiber development. 
These results provide an improved understanding of the roles of the Hsf gene family during stress responses and fiber development.
Fang, Jingping; Lin, Aiting; Qiu, Weijing; Cai, Hanyang; Umar, Muhammad; Chen, Rukai; Ming, Ray
2016-01-01
Papaya is a productive and nutritious tropical fruit. Papaya Ringspot Virus (PRSV) is the most devastating pathogen threatening papaya production worldwide. Development of transgenic resistant varieties is the most effective strategy to control this disease. However, little is known about the genome-wide functional changes induced by particle bombardment transformation. We conducted transcriptome sequencing of PRSV-resistant transgenic papaya SunUp and its PRSV-susceptible progenitor Sunset to compare the transcriptional changes in young healthy leaves prior to infection with PRSV. In total, 20,700 transcripts were identified, and 842 differentially expressed genes (DEGs) were randomly distributed among papaya chromosomes. Gene ontology (GO) category analysis revealed that microtubule-related categories were highly enriched among these DEGs. Numerous DEGs related to various transcription factors, transporters and hormone biosynthesis showed clear differences between the two cultivars, and most were up-regulated in transgenic papaya. Many known and novel stress-induced and disease-resistance genes were most highly expressed in SunUp, including MYB, WRKY, ERF, NAC, nitrate and zinc transporters, and genes involved in the abscisic acid, salicylic acid, and ethylene signaling pathways. We also identified 67,686 alternative splicing (AS) events in Sunset and 68,455 AS events in SunUp, mapping to 10,994 and 10,995 papaya annotated genes, respectively. GO enrichment for the genes displaying AS events exclusively in Sunset was significantly different from that for the genes displaying AS events exclusively in SunUp. Transcriptomes in Sunset and transgenic SunUp are very similar with noteworthy differences, which increased PRSV-resistance in transgenic papaya. No detrimental pathways and allergenic or toxic proteins were induced on a genome-wide scale in transgenic SunUp. Our results provide a foundation for unraveling the mechanism of PRSV resistance in transgenic papaya. PMID:27379138
Evaluating the protein coding potential of exonized transposable element sequences
Piriyapongsa, Jittima; Rutledge, Mark T; Patel, Sanil; Borodovsky, Mark; Jordan, I King
2007-01-01
Background Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analysis of complete genome sequences have turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: i-by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and ii-through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons. Results We compared the ability of three classes of sequence similarity search methods to detect TE-derived sequences among data sets of experimentally characterized proteins: 1-a profile-based hidden Markov model (HMM) approach, 2-BLAST methods and 3-RepeatMasker. Profile based methods are more sensitive and more selective than the other methods evaluated. However, the application of profile-based search methods to the detection of TE-derived sequences among well-curated experimentally characterized protein data sets did not turn up many more cases than had been previously detected and nowhere near as many cases as recent genome-wide searches have. We observed that the different search methods used were complementary in the sense that they yielded largely non-overlapping sets of hits and differed in their ability to recover known cases of TE-derived CDS. The probabilistic analysis of TE-derived exon sequences indicates that these sequences have low protein coding potential on average. 
In particular, non-autonomous TEs that do not encode protein sequences, such as Alu elements, are frequently exonized but unlikely to encode protein sequences. Conclusion The exaptation of the numerous TE sequences found in exons as bona fide protein coding sequences may prove to be far less common than has been suggested by the analysis of complete genomes. We hypothesize that many exonized TE sequences actually function as post-transcriptional regulators of gene expression, rather than coding sequences, which may act through a variety of double stranded RNA related regulatory pathways. Indeed, their relatively high copy numbers and similarity to sequences dispersed throughout the genome suggests that exonized TE sequences could serve as master regulators with a wide scope of regulatory influence. Reviewers: This article was reviewed by Itai Yanai, Kateryna D. Makova, Melissa Wilson (nominated by Kateryna D. Makova) and Cedric Feschotte (nominated by John M. Logsdon Jr.). PMID:18036258
Xiao, Yinghua; van Hijum, Sacha A. F. T.; Abee, Tjakko; Wells-Bennik, Marjon H. J.
2015-01-01
The formation of bacterial spores is a highly regulated process and the ultimate properties of the spores are determined during sporulation and subsequent maturation. A wide variety of genes that are expressed during sporulation determine spore properties such as resistance to heat and other adverse environmental conditions, dormancy and germination responses. In this study we characterized the sporulation phases of C. perfringens enterotoxic strain SM101 based on morphological characteristics, biomass accumulation (OD600), the total viable counts of cells plus spores, the viable count of heat resistant spores alone, the pH of the supernatant, enterotoxin production and dipicolinic acid accumulation. Subsequently, whole-genome expression profiling during key phases of the sporulation process was performed using DNA microarrays, and genes were clustered based on their time-course expression profiles during sporulation. The majority of previously characterized C. perfringens germination genes showed upregulated expression profiles in time during sporulation and belonged to two main clusters of genes. These clusters with up-regulated genes contained a large number of C. perfringens genes which are homologs of Bacillus genes with roles in sporulation and germination; this study therefore suggests that those homologs are functional in C. perfringens. A comprehensive homology search revealed that approximately half of the upregulated genes in the two clusters are conserved within a broad range of sporeforming Firmicutes. Another 30% of upregulated genes in the two clusters were found only in Clostridium species, while the remaining 20% appeared to be specific for C. perfringens. These newly identified genes may add to the repertoire of genes with roles in sporulation and determining spore properties including germination behavior. Their exact roles remain to be elucidated in future studies. PMID:25978838
Chymkowitch, Pierre; Nguéa, Aurélie P; Aanes, Håvard; Koehler, Christian J; Thiede, Bernd; Lorenz, Susanne; Meza-Zepeda, Leonardo A; Klungland, Arne; Enserink, Jorrit M
2015-06-01
Transcription factors are abundant Sumo targets, yet the global distribution of Sumo along the chromatin and its physiological relevance in transcription are poorly understood. Using Saccharomyces cerevisiae, we determined the genome-wide localization of Sumo along the chromatin. We discovered that Sumo-enriched genes are almost exclusively involved in translation, such as tRNA genes and ribosomal protein genes (RPGs). Genome-wide expression analysis showed that Sumo positively regulates their transcription. We also discovered that the Sumo consensus motif at RPG promoters is identical to the DNA binding motif of the transcription factor Rap1. We demonstrate that Rap1 is a molecular target of Sumo and that sumoylation of Rap1 is important for cell viability. Furthermore, Rap1 sumoylation promotes recruitment of the basal transcription machinery, and sumoylation of Rap1 cooperates with the target of rapamycin kinase complex 1 (TORC1) pathway to promote RPG transcription. Strikingly, our data reveal that sumoylation of Rap1 functions in a homeostatic feedback loop that sustains RPG transcription during translational stress. Taken together, Sumo regulates the cellular translational capacity by promoting transcription of tRNA genes and RPGs. © 2015 Chymkowitch et al.; Published by Cold Spring Harbor Laboratory Press.
A resource for functional profiling of noncoding RNA in the yeast Saccharomyces cerevisiae.
Parker, Steven; Fraczek, Marcin G; Wu, Jian; Shamsah, Sara; Manousaki, Alkisti; Dungrattanalert, Kobchai; de Almeida, Rogerio Alves; Estrada-Rivadeneyra, Diego; Omara, Walid; Delneri, Daniela; O'Keefe, Raymond T
2017-08-01
Eukaryotic genomes are extensively transcribed, generating many different RNAs with no known function. We have constructed 1502 molecular barcoded ncRNA gene deletion strains encompassing 443 ncRNAs in the yeast Saccharomyces cerevisiae as tools for ncRNA functional analysis. This resource includes deletions of small nuclear RNAs (snRNAs), transfer RNAs (tRNAs), small nucleolar RNAs (snoRNAs), and other annotated ncRNAs as well as the more recently identified stable unannotated transcripts (SUTs) and cryptic unstable transcripts (CUTs) whose functions are largely unknown. Specifically, deletions have been constructed for ncRNAs found in the intergenic regions, not overlapping genes or their promoters (i.e., at least 200 bp minimum distance from the closest gene start codon). The deletion strains carry molecular barcodes designed to be complementary with the protein gene deletion collection enabling parallel analysis experiments. These strains will be useful for the numerous genomic and molecular techniques that utilize deletion strains, including genome-wide phenotypic screens under different growth conditions, pooled chemogenomic screens with drugs or chemicals, synthetic genetic array analysis to uncover novel genetic interactions, and synthetic dosage lethality screens to analyze gene dosage. Overall, we created a valuable resource for the RNA community and for future ncRNA research. © 2017 Parker et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Introns Protect Eukaryotic Genomes from Transcription-Associated Genetic Instability.
Bonnet, Amandine; Grosso, Ana R; Elkaoutari, Abdessamad; Coleno, Emeline; Presle, Adrien; Sridhara, Sreerama C; Janbon, Guilhem; Géli, Vincent; de Almeida, Sérgio F; Palancade, Benoit
2017-08-17
Transcription is a source of genetic instability that can notably result from the formation of genotoxic DNA:RNA hybrids, or R-loops, between the nascent mRNA and its template. Here we report an unexpected function for introns in counteracting R-loop accumulation in eukaryotic genomes. Deletion of endogenous introns increases R-loop formation, while insertion of an intron into an intronless gene suppresses R-loop accumulation and its deleterious impact on transcription and recombination in yeast. Recruitment of the spliceosome onto the mRNA, but not splicing per se, is shown to be critical to attenuate R-loop formation and transcription-associated genetic instability. Genome-wide analyses in a number of distant species differing in their intron content, including human, further revealed that intron-containing genes and the intron-richest genomes are best protected against R-loop accumulation and subsequent genetic instability. Our results thereby provide a possible rationale for the conservation of introns throughout the eukaryotic lineage. Copyright © 2017 Elsevier Inc. All rights reserved.
Schlecht, Ulrich; Erb, Ionas; Demougin, Philippe; Robine, Nicolas; Borde, Valérie; van Nimwegen, Erik; Nicolas, Alain
2008-01-01
The autonomously replicating sequence binding factor 1 (Abf1) was initially identified as an essential DNA replication factor and later shown to be a component of the regulatory network controlling mitotic and meiotic cell cycle progression in budding yeast. The protein is thought to exert its functions via specific interaction with its target site as part of distinct protein complexes, but its roles during mitotic growth and meiotic development are only partially understood. Here, we report a comprehensive approach aiming at the identification of direct Abf1-target genes expressed during fermentation, respiration, and sporulation. Computational prediction of the protein's target sites was integrated with a genome-wide DNA binding assay in growing and sporulating cells. The resulting data were combined with the output of expression profiling studies using wild-type versus temperature-sensitive alleles. This work identified 434 protein-coding loci as being transcriptionally dependent on Abf1. More than 60% of their putative promoter regions contained a computationally predicted Abf1 binding site and/or were bound by Abf1 in vivo, identifying them as direct targets. The present study revealed numerous loci previously unknown to be under Abf1 control, and it yielded evidence for the protein's variable DNA binding pattern during mitotic growth and meiotic development. PMID:18305101
Liu, Zhi-Ping; Wu, Canglin; Miao, Hongyu; Wu, Hulin
2015-01-01
Transcriptional and post-transcriptional regulation of gene expression is of fundamental importance to numerous biological processes. Nowadays, an increasing amount of gene regulatory relationships have been documented in various databases and literature. However, to more efficiently exploit such knowledge for biomedical research and applications, it is necessary to construct a genome-wide regulatory network database to integrate the information on gene regulatory relationships that are widely scattered in many different places. Therefore, in this work, we build a knowledge-based database, named ‘RegNetwork’, of gene regulatory networks for human and mouse by collecting and integrating the documented regulatory interactions among transcription factors (TFs), microRNAs (miRNAs) and target genes from 25 selected databases. Moreover, we also inferred and incorporated potential regulatory relationships based on transcription factor binding site (TFBS) motifs into RegNetwork. As a result, RegNetwork contains a comprehensive set of experimentally observed or predicted transcriptional and post-transcriptional regulatory relationships, and the database framework is flexibly designed for potential extensions to include gene regulatory networks for other organisms in the future. Based on RegNetwork, we characterized the statistical and topological properties of genome-wide regulatory networks for human and mouse, we also extracted and interpreted simple yet important network motifs that involve the interplays between TF-miRNA and their targets. In summary, RegNetwork provides an integrated resource on the prior information for gene regulatory relationships, and it enables us to further investigate context-specific transcriptional and post-transcriptional regulatory interactions based on domain-specific experimental data. Database URL: http://www.regnetworkweb.org PMID:26424082
Du, Jiancan; Hu, Simin; Yu, Qin; Wang, Chongde; Yang, Yunqiang; Sun, Hang; Yang, Yongping; Sun, Xudong
2017-01-01
The teosinte branched1/cycloidea/proliferating cell factor (TCP) gene family encodes plant-specific transcription factors that participate in the control of plant development by regulating cell proliferation. However, no report is currently available about this gene family in turnips (Brassica rapa ssp. rapa). In this study, a genome-wide analysis of TCP genes was performed in turnips. Thirty-nine TCP genes were identified in the turnip genome and found to be distributed on 10 chromosomes. Phylogenetic analysis clearly showed that the family could be classified into two clades: class I and class II. Gene structure and conserved motif analysis showed that genes of the same clade have similar gene structures and conserved motifs. The expression profiles of the 39 TCP genes were determined through quantitative real-time PCR. Most CIN-type BrrTCP genes were highly expressed in leaf. The members of the CYC/TB1 subclade were highly expressed in flower bud and weakly expressed in root. By contrast, the class I clade showed more widespread but less tissue-specific expression patterns. Yeast two-hybrid data showed that BrrTCP proteins preferentially formed heterodimers. The function of BrrTCP2 was confirmed through ectopic expression of BrrTCP2 in wild-type Arabidopsis and in a loss-of-function mutant of its Arabidopsis orthologs. Overexpression of BrrTCP2 in wild-type Arabidopsis resulted in diminished leaf size. Overexpression of BrrTCP2 in the tcp2/4/10 triple mutant restored the leaf phenotype of tcp2/4/10 to that of the wild type. This comprehensive analysis of the turnip TCP gene family provides a foundation for further study of the roles of TCP genes in turnips.
Transcriptome study of differential expression in schizophrenia
Sanders, Alan R.; Göring, Harald H. H.; Duan, Jubao; Drigalenko, Eugene I.; Moy, Winton; Freda, Jessica; He, Deli; Shi, Jianxin; Gejman, Pablo V.
2013-01-01
Schizophrenia genome-wide association studies (GWAS) have identified common SNPs, rare copy number variants (CNVs) and a large polygenic contribution to illness risk, but biological mechanisms remain unclear. Bioinformatic analyses of significantly associated genetic variants point to a large role for regulatory variants. To identify gene expression abnormalities in schizophrenia, we generated whole-genome gene expression profiles using microarrays on lymphoblastoid cell lines (LCLs) from 413 cases and 446 controls. Regression analysis identified 95 transcripts differentially expressed by affection status at a genome-wide false discovery rate (FDR) of 0.05, while simultaneously controlling for confounding effects. These transcripts represented 89 genes with functions such as neurotransmission, gene regulation, cell cycle progression, differentiation, apoptosis, microRNA (miRNA) processing and immunity. This functional diversity is consistent with schizophrenia's likely significant pathophysiological heterogeneity. The overall enrichment of immune-related genes among those differentially expressed by affection status is consistent with hypothesized immune contributions to schizophrenia risk. The observed differential expression of extended major histocompatibility complex (xMHC) region histones (HIST1H2BD, HIST1H2BC, HIST1H2BH, HIST1H2BG and HIST1H4K) converges with the genetic evidence from GWAS, which find the xMHC to be the most significant susceptibility locus. Among the differentially expressed immune-related genes, B3GNT2 is implicated in autoimmune disorders previously tied to schizophrenia risk (rheumatoid arthritis and Graves’ disease), and DICER1 is pivotal in miRNA processing potentially linking to miRNA alterations in schizophrenia (e.g. MIR137, the second strongest GWAS finding). Our analysis provides novel candidate genes for further study to assess their potential contribution to schizophrenia. PMID:23904455
2012-01-01
Background F1 hybrid clones of Eucalyptus grandis and E. urophylla are widely grown for pulp and paper production in tropical and subtropical regions. Volume growth and wood quality are priority objectives in Eucalyptus tree improvement. The molecular basis of quantitative variation and trait expression in eucalypt hybrids, however, remains largely unknown. The recent availability of a draft genome sequence (http://www.phytozome.net) and genome-wide genotyping platforms, combined with high levels of genetic variation and high linkage disequilibrium in hybrid crosses, greatly facilitate the detection of quantitative trait loci (QTLs) as well as underlying candidate genes for growth and wood property traits. In this study, we used Diversity Arrays Technology markers to assess the genetic architecture of volume growth (diameter at breast height, DBH) and wood basic density in four-year-old progeny of an interspecific backcross pedigree of E. grandis and E. urophylla. In addition, we used Illumina RNA-Seq expression profiling in the E. urophylla backcross family to identify cis- and trans-acting polymorphisms (eQTLs) affecting transcript abundance of genes underlying QTLs for wood basic density. Results A total of five QTLs for DBH and 12 for wood basic density were identified in the two backcross families. Individual QTLs for DBH and wood basic density explained 3.1 to 12.2% of phenotypic variation. Candidate genes underlying QTLs for wood basic density on linkage groups 8 and 9 were found to share trans-acting eQTLs located on linkage groups 4 and 10, which in turn coincided with QTLs for wood basic density suggesting that these QTLs represent segregating components of an underlying transcriptional network. Conclusion This is the first demonstration of the use of next-generation expression profiling to quantify transcript abundance in a segregating tree population and identify candidate genes potentially affecting wood property variation. 
The QTLs identified in this study provide a resource for identifying candidate genes and developing molecular markers for marker-assisted breeding of volume growth and wood basic density. Our results suggest that integrated analysis of transcript and trait variation in eucalypt hybrids can be used to dissect the molecular basis of quantitative variation in wood property traits. PMID:22817272
Genomic and transcriptomic predictors of triglyceride response to regular exercise
Sarzynski, Mark A; Davidsen, Peter K; Sung, Yun Ju; Hesselink, Matthijs K C; Schrauwen, Patrick; Rice, Treva K; Rao, D C; Falciani, Francesco; Bouchard, Claude
2015-01-01
Aim We performed genome-wide and transcriptome-wide profiling to identify genes and single nucleotide polymorphisms (SNPs) associated with the response of triglycerides (TG) to exercise training. Methods Plasma TG levels were measured before and after a 20-week endurance training programme in 478 white participants from the HERITAGE Family Study. Illumina HumanCNV370-Quad v3.0 BeadChips were genotyped using the Illumina BeadStation 500GX platform. Affymetrix HG-U133+2 arrays were used to quantitate gene expression levels from baseline muscle biopsies of a subset of participants (N=52). Genome-wide association study (GWAS) analysis was performed using MERLIN, while transcriptomic predictor models were developed using the R-package GALGO. Results The GWAS results showed that eight SNPs were associated with TG training-response (ΔTG) at p<9.9×10⁻⁶, while another 31 SNPs showed p values <1×10⁻⁴. In multivariate regression models, the top 10 SNPs explained 32.0% of the variance in ΔTG, while conditional heritability analysis showed that four SNPs statistically accounted for all of the heritability of ΔTG. A molecular signature based on the baseline expression of 11 genes predicted 27% of ΔTG in HERITAGE, which was validated in an independent study. A composite SNP score based on the top four SNPs, each from the genomic and transcriptomic analyses, was the strongest predictor of ΔTG (R²=0.14, p=3.0×10⁻⁶⁸). Conclusions Our results indicate that skeletal muscle transcript abundance at 11 genes and SNPs at a number of loci contribute to TG response to exercise training. Combining data from genomics and transcriptomics analyses identified a SNP-based gene signature that should be further tested in independent samples. PMID:26491034
Nagalingam, Kumaran; Lorenc, Michał T; Manoli, Sahana; Cameron, Stephen L; Clarke, Anthony R; Dudley, Kevin J
2018-01-01
Interactions between DNA and proteins located in the cell nucleus play an important role in controlling physiological processes by specifying, augmenting and regulating context-specific transcription events. Chromatin immunoprecipitation (ChIP) is a widely used methodology to study DNA-protein interactions and has been successfully used in various cell types for over three decades. More recently, by combining ChIP with genomic screening technologies and Next Generation Sequencing (e.g. ChIP-seq), it has become possible to profile DNA-protein interactions (including covalent histone modifications) across entire genomes. However, the applicability of ChIP-chip and ChIP-seq has rarely been extended to non-model species because of a number of technical challenges. Here we report a method that can be used to identify genome-wide covalent histone modifications in a group of non-model fruit fly species (Diptera: Tephritidae). The method was developed by testing and refining protocols that have been used in model organisms, including Drosophila melanogaster. We demonstrate that this method is suitable for a group of economically important pest fruit fly species, viz., Bactrocera dorsalis, Ceratitis capitata, Zeugodacus cucurbitae and Bactrocera tryoni. We also report an example ChIP-seq dataset for B. tryoni, providing evidence for histone modifications in the genome of a tephritid fruit fly for the first time. Since tephritids are major agricultural pests globally, this methodology will be a valuable resource to study taxon-specific evolutionary questions and to assist with pest management. It also provides a basis for researchers working with other non-model species to undertake genome-wide DNA-protein interaction studies.
Comparative transcriptional profiling identifies takeout as a gene that regulates life span
Bauer, Johannes; Antosh, Michael; Chang, Chengyi; Schorl, Christoph; Kolli, Santharam; Neretti, Nicola; Helfand, Stephen L.
2010-01-01
A major challenge in translating the positive effects of dietary restriction (DR) for the improvement of human health is the development of therapeutic mimics. One approach to finding DR mimics is based upon identification of the proximal effectors of DR life span extension. Whole genome profiling of DR in Drosophila shows a large number of changes in gene expression, making it difficult to establish which changes are involved in life span determination as opposed to other unrelated physiological changes. We used comparative whole genome expression profiling to discover genes whose change in expression is shared between DR and two molecular genetic life span extending interventions related to DR, increased dSir2 and decreased Dmp53 activity. We find twenty-one genes shared among the three related life span extending interventions. One of these genes, takeout, thought to be involved in circadian rhythms, feeding behavior and juvenile hormone binding, is also increased in four other life span extending conditions: Rpd3, Indy, chico and methuselah. We demonstrate that takeout is involved in longevity determination by specifically increasing adult takeout expression and extending life span. These studies demonstrate the power of comparative whole genome transcriptional profiling for identifying specific downstream elements of the DR life span extending pathway. PMID:20519778
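The comparative step described in this abstract, finding genes whose expression change is shared across several interventions, can be illustrated with a set intersection. This is only a minimal sketch: the function name `shared_genes` and all gene identifiers other than takeout are hypothetical placeholders, and the sketch ignores the direction and statistical significance of each change, which the study itself would have had to take into account.

```python
def shared_genes(*gene_sets):
    """Return the genes present in every input collection."""
    if not gene_sets:
        return set()
    result = set(gene_sets[0])
    for genes in gene_sets[1:]:
        result &= set(genes)  # keep only genes seen in all sets so far
    return result


# Placeholder lists of genes changed under each intervention (not real data).
dr_changed = {"takeout", "geneA", "geneB", "geneC"}
dsir2_up_changed = {"takeout", "geneA", "geneD"}
dmp53_down_changed = {"takeout", "geneA", "geneE"}

common = shared_genes(dr_changed, dsir2_up_changed, dmp53_down_changed)
print(sorted(common))
```

Adding further conditions (for example the four additional long-lived lines mentioned above) only requires passing more sets to the same function.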
Integrative Analysis of Many RNA-Seq Datasets to Study Alternative Splicing
Li, Wenyuan; Dai, Chao; Kang, Shuli; Zhou, Xianghong Jasmine
2014-01-01
Alternative splicing is an important gene regulatory mechanism that dramatically increases the complexity of the proteome. However, how alternative splicing is regulated and how transcription and splicing are coordinated are still poorly understood, and functions of transcript isoforms have been studied only in a few limited cases. Nowadays, RNA-seq technology provides an exceptional opportunity to study alternative splicing on genome-wide scales and in an unbiased manner. With the rapid accumulation of data in public repositories, new challenges arise from the urgent need to effectively integrate many different RNA-seq datasets to study alternative splicing. This paper discusses a set of advanced computational methods that can integrate and analyze many RNA-seq datasets to systematically identify splicing modules, unravel the coupling of transcription and splicing, and predict the functions of splicing isoforms on a genome-wide scale. PMID:24583115
FANTOM5 CAGE profiles of human and mouse samples.
Noguchi, Shuhei; Arakawa, Takahiro; Fukuda, Shiro; Furuno, Masaaki; Hasegawa, Akira; Hori, Fumi; Ishikawa-Kato, Sachi; Kaida, Kaoru; Kaiho, Ai; Kanamori-Katayama, Mutsumi; Kawashima, Tsugumi; Kojima, Miki; Kubosaki, Atsutaka; Manabe, Ri-Ichiroh; Murata, Mitsuyoshi; Nagao-Sato, Sayaka; Nakazato, Kenichi; Ninomiya, Noriko; Nishiyori-Sueki, Hiromi; Noma, Shohei; Saijyo, Eri; Saka, Akiko; Sakai, Mizuho; Simon, Christophe; Suzuki, Naoko; Tagami, Michihira; Watanabe, Shoko; Yoshida, Shigehiro; Arner, Peter; Axton, Richard A; Babina, Magda; Baillie, J Kenneth; Barnett, Timothy C; Beckhouse, Anthony G; Blumenthal, Antje; Bodega, Beatrice; Bonetti, Alessandro; Briggs, James; Brombacher, Frank; Carlisle, Ailsa J; Clevers, Hans C; Davis, Carrie A; Detmar, Michael; Dohi, Taeko; Edge, Albert S B; Edinger, Matthias; Ehrlund, Anna; Ekwall, Karl; Endoh, Mitsuhiro; Enomoto, Hideki; Eslami, Afsaneh; Fagiolini, Michela; Fairbairn, Lynsey; Farach-Carson, Mary C; Faulkner, Geoffrey J; Ferrai, Carmelo; Fisher, Malcolm E; Forrester, Lesley M; Fujita, Rie; Furusawa, Jun-Ichi; Geijtenbeek, Teunis B; Gingeras, Thomas; Goldowitz, Daniel; Guhl, Sven; Guler, Reto; Gustincich, Stefano; Ha, Thomas J; Hamaguchi, Masahide; Hara, Mitsuko; Hasegawa, Yuki; Herlyn, Meenhard; Heutink, Peter; Hitchens, Kelly J; Hume, David A; Ikawa, Tomokatsu; Ishizu, Yuri; Kai, Chieko; Kawamoto, Hiroshi; Kawamura, Yuki I; Kempfle, Judith S; Kenna, Tony J; Kere, Juha; Khachigian, Levon M; Kitamura, Toshio; Klein, Sarah; Klinken, S Peter; Knox, Alan J; Kojima, Soichi; Koseki, Haruhiko; Koyasu, Shigeo; Lee, Weonju; Lennartsson, Andreas; Mackay-Sim, Alan; Mejhert, Niklas; Mizuno, Yosuke; Morikawa, Hiromasa; Morimoto, Mitsuru; Moro, Kazuyo; Morris, Kelly J; Motohashi, Hozumi; Mummery, Christine L; Nakachi, Yutaka; Nakahara, Fumio; Nakamura, Toshiyuki; Nakamura, Yukio; Nozaki, Tadasuke; Ogishima, Soichi; Ohkura, Naganari; Ohno, Hiroshi; Ohshima, Mitsuhiro; Okada-Hatakeyama, Mariko; Okazaki, Yasushi; Orlando, Valerio; 
Ovchinnikov, Dmitry A; Passier, Robert; Patrikakis, Margaret; Pombo, Ana; Pradhan-Bhatt, Swati; Qin, Xian-Yang; Rehli, Michael; Rizzu, Patrizia; Roy, Sugata; Sajantila, Antti; Sakaguchi, Shimon; Sato, Hiroki; Satoh, Hironori; Savvi, Suzana; Saxena, Alka; Schmidl, Christian; Schneider, Claudio; Schulze-Tanzil, Gundula G; Schwegmann, Anita; Sheng, Guojun; Shin, Jay W; Sugiyama, Daisuke; Sugiyama, Takaaki; Summers, Kim M; Takahashi, Naoko; Takai, Jun; Tanaka, Hiroshi; Tatsukawa, Hideki; Tomoiu, Andru; Toyoda, Hiroo; van de Wetering, Marc; van den Berg, Linda M; Verardo, Roberto; Vijayan, Dipti; Wells, Christine A; Winteringham, Louise N; Wolvetang, Ernst; Yamaguchi, Yoko; Yamamoto, Masayuki; Yanagi-Mizuochi, Chiyo; Yoneda, Misako; Yonekura, Yohei; Zhang, Peter G; Zucchelli, Silvia; Abugessaisa, Imad; Arner, Erik; Harshbarger, Jayson; Kondo, Atsushi; Lassmann, Timo; Lizio, Marina; Sahin, Serkan; Sengstag, Thierry; Severin, Jessica; Shimoji, Hisashi; Suzuki, Masanori; Suzuki, Harukazu; Kawai, Jun; Kondo, Naoto; Itoh, Masayoshi; Daub, Carsten O; Kasukawa, Takeya; Kawaji, Hideya; Carninci, Piero; Forrest, Alistair R R; Hayashizaki, Yoshihide
2017-08-29
In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at a single base-pair resolution and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousand samples, consisting of a variety of primary cells, tissues, cell lines, and time series samples during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The analysis pipeline started by measuring RNA extracts to assess their quality, and continued to CAGE library production by using a robotic or a manual workflow, single molecule sequencing, and computational processing to generate frequencies of transcription initiation. The resulting data represent the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, respectively, were identified and annotated to provide the precise location of known promoters as well as novel ones, and to quantify their activities.
FANTOM5 CAGE profiles of human and mouse samples
Noguchi, Shuhei; Arakawa, Takahiro; Fukuda, Shiro; Furuno, Masaaki; Hasegawa, Akira; Hori, Fumi; Ishikawa-Kato, Sachi; Kaida, Kaoru; Kaiho, Ai; Kanamori-Katayama, Mutsumi; Kawashima, Tsugumi; Kojima, Miki; Kubosaki, Atsutaka; Manabe, Ri-ichiroh; Murata, Mitsuyoshi; Nagao-Sato, Sayaka; Nakazato, Kenichi; Ninomiya, Noriko; Nishiyori-Sueki, Hiromi; Noma, Shohei; Saijyo, Eri; Saka, Akiko; Sakai, Mizuho; Simon, Christophe; Suzuki, Naoko; Tagami, Michihira; Watanabe, Shoko; Yoshida, Shigehiro; Arner, Peter; Axton, Richard A.; Babina, Magda; Baillie, J. Kenneth; Barnett, Timothy C.; Beckhouse, Anthony G.; Blumenthal, Antje; Bodega, Beatrice; Bonetti, Alessandro; Briggs, James; Brombacher, Frank; Carlisle, Ailsa J.; Clevers, Hans C.; Davis, Carrie A.; Detmar, Michael; Dohi, Taeko; Edge, Albert S.B.; Edinger, Matthias; Ehrlund, Anna; Ekwall, Karl; Endoh, Mitsuhiro; Enomoto, Hideki; Eslami, Afsaneh; Fagiolini, Michela; Fairbairn, Lynsey; Farach-Carson, Mary C.; Faulkner, Geoffrey J.; Ferrai, Carmelo; Fisher, Malcolm E.; Forrester, Lesley M.; Fujita, Rie; Furusawa, Jun-ichi; Geijtenbeek, Teunis B.; Gingeras, Thomas; Goldowitz, Daniel; Guhl, Sven; Guler, Reto; Gustincich, Stefano; Ha, Thomas J.; Hamaguchi, Masahide; Hara, Mitsuko; Hasegawa, Yuki; Herlyn, Meenhard; Heutink, Peter; Hitchens, Kelly J.; Hume, David A.; Ikawa, Tomokatsu; Ishizu, Yuri; Kai, Chieko; Kawamoto, Hiroshi; Kawamura, Yuki I.; Kempfle, Judith S.; Kenna, Tony J.; Kere, Juha; Khachigian, Levon M.; Kitamura, Toshio; Klein, Sarah; Klinken, S. 
Peter; Knox, Alan J.; Kojima, Soichi; Koseki, Haruhiko; Koyasu, Shigeo; Lee, Weonju; Lennartsson, Andreas; Mackay-sim, Alan; Mejhert, Niklas; Mizuno, Yosuke; Morikawa, Hiromasa; Morimoto, Mitsuru; Moro, Kazuyo; Morris, Kelly J.; Motohashi, Hozumi; Mummery, Christine L.; Nakachi, Yutaka; Nakahara, Fumio; Nakamura, Toshiyuki; Nakamura, Yukio; Nozaki, Tadasuke; Ogishima, Soichi; Ohkura, Naganari; Ohno, Hiroshi; Ohshima, Mitsuhiro; Okada-Hatakeyama, Mariko; Okazaki, Yasushi; Orlando, Valerio; Ovchinnikov, Dmitry A.; Passier, Robert; Patrikakis, Margaret; Pombo, Ana; Pradhan-Bhatt, Swati; Qin, Xian-Yang; Rehli, Michael; Rizzu, Patrizia; Roy, Sugata; Sajantila, Antti; Sakaguchi, Shimon; Sato, Hiroki; Satoh, Hironori; Savvi, Suzana; Saxena, Alka; Schmidl, Christian; Schneider, Claudio; Schulze-Tanzil, Gundula G.; Schwegmann, Anita; Sheng, Guojun; Shin, Jay W.; Sugiyama, Daisuke; Sugiyama, Takaaki; Summers, Kim M.; Takahashi, Naoko; Takai, Jun; Tanaka, Hiroshi; Tatsukawa, Hideki; Tomoiu, Andru; Toyoda, Hiroo; van de Wetering, Marc; van den Berg, Linda M.; Verardo, Roberto; Vijayan, Dipti; Wells, Christine A.; Winteringham, Louise N.; Wolvetang, Ernst; Yamaguchi, Yoko; Yamamoto, Masayuki; Yanagi-Mizuochi, Chiyo; Yoneda, Misako; Yonekura, Yohei; Zhang, Peter G.; Zucchelli, Silvia; Abugessaisa, Imad; Arner, Erik; Harshbarger, Jayson; Kondo, Atsushi; Lassmann, Timo; Lizio, Marina; Sahin, Serkan; Sengstag, Thierry; Severin, Jessica; Shimoji, Hisashi; Suzuki, Masanori; Suzuki, Harukazu; Kawai, Jun; Kondo, Naoto; Itoh, Masayoshi; Daub, Carsten O.; Kasukawa, Takeya; Kawaji, Hideya; Carninci, Piero; Forrest, Alistair R.R.; Hayashizaki, Yoshihide
2017-01-01
In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at a single base-pair resolution and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousand samples, consisting of a variety of primary cells, tissues, cell lines, and time series samples during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The analysis pipeline started by measuring RNA extracts to assess their quality, and continued to CAGE library production by using a robotic or a manual workflow, single molecule sequencing, and computational processing to generate frequencies of transcription initiation. The resulting data represent the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, respectively, were identified and annotated to provide the precise location of known promoters as well as novel ones, and to quantify their activities. PMID:28850106
Aoyagi, Luciano N; Lopes-Caitar, Valéria S; de Carvalho, Mayra C C G; Darben, Luana M; Polizel-Podanosqui, Adriana; Kuwahara, Marcia K; Nepomuceno, Alexandre L; Abdelnoor, Ricardo V; Marcelino-Guimarães, Francismar C
2014-12-01
Myb genes constitute one of the largest transcription factor families in the plant kingdom. Soybean MYB transcription factors have been related to the plant response to biotic stresses. Their involvement in response to Phakopsora pachyrhizi infection has been reported by several transcriptional studies. Due to their apparently highly diverse functions, these genes are promising targets for developing crop varieties resistant to diseases. In the present study, the identification and phylogenetic analysis of the soybean R2R3-MYB (GmMYB) transcription factor family was performed and the expression profiles of these genes under biotic stress were determined. GmMYBs were identified from the soybean genome using bioinformatic tools, and their putative functions were determined based on the phylogenetic tree and classified into subfamilies using AtMYBs with known functions as guides. The transcriptional profiles of GmMYBs upon infection with different pathogens were revealed by in vivo and in silico analyses. Selected target genes potentially involved in disease responses were assessed by RT-qPCR at different times after inoculation with P. pachyrhizi using different genetic backgrounds related to resistance genes (Rpp2 and Rpp5). R2R3-MYB transcription factors related to lignin synthesis and genes responsive to chitin were significantly induced in the resistant genotypes.
Martin, Kathleen; Singh, Jugpreet; Hill, John H; Whitham, Steven A; Cannon, Steven B
2016-08-11
Bean common mosaic virus (BCMV) is widespread, with Phaseolus species as the primary host plants. Numerous BCMV strains have been identified on the basis of a panel of bean varieties that distinguish the pathogenicity types with respect to the viral strains. The molecular responses in Phaseolus to BCMV infection have not yet been well characterized. We report the transcriptional responses of a widely susceptible variety of common bean (Phaseolus vulgaris L., cultivar 'Stringless green refugee') to two BCMV strains, in a time-course experiment. We also report the genome sequence of a previously unreported BCMV strain. The interaction with the known strain NL1-Iowa causes moderate symptoms and large transcriptional responses, and the newly identified strain (Strain 2 or S2) causes severe symptoms and moderate transcriptional responses. The transcriptional profiles of host plants infected with the two isolates are distinct, and involve numerous differences in splice forms in particular genes, and pathway-specific expression patterns. We identified a differential host transcriptome response after infection with two different strains of Bean common mosaic virus (BCMV) in common bean (Phaseolus vulgaris L.). Virus infection initiated a suite of changes in gene expression level and patterns in the host plants. Pathways related to defense, gene regulation, metabolic processes, and photosynthesis were specifically altered after virus infection. Results presented in this study can increase the understanding of host-pathogen interactions and provide resources for further investigations of the biological mechanisms in BCMV infection and defense.
Velardo, Margaret J; Burger, Corinna; Williams, Philip R; Baker, Henry V; López, M Cecilia; Mareci, Thomas H; White, Todd E; Muzyczka, Nicholas; Reier, Paul J
2004-09-29
Spinal cord injury (SCI) induces a progressive pathophysiology affecting cell survival and neurological integrity via complex and evolving molecular cascades whose interrelationships are not fully understood. The present experiments were designed to: (1) determine potential functional interactions within transcriptional expression profiles obtained after a clinically relevant SCI and (2) test the consistency of transcript expression after SCI in two genetically and immunologically diverse rat strains characterized by differences in T cell competence and associated inflammatory responses. By interrogating Affymetrix U34A rat genome GeneChip microarrays, we defined the transcriptional expression patterns in midcervical contusion lesion sites between 1 and 90 d postinjury of athymic nude (AN) and Sprague Dawley (SD) strains. Stringent statistical analyses detected significant changes in 3638 probe sets, with 80 genes differing between the AN and SD groups. Subsequent detailed functional categorization of these transcripts unveiled an overall tissue remodeling response that was common to both strains. The functionally organized gene profiles were temporally distinct and correlated with repair indices observed microscopically and by magnetic resonance microimaging. Our molecular and anatomical observations have identified a novel, longitudinal perspective of the post-SCI response, namely, that of a highly orchestrated tissue repair and remodeling repertoire with a prominent cutaneous wound healing signature that is conserved between two widely differing rat strains. These results have significant bearing on the continuing development of cellular and pharmacological therapeutics directed at tissue rescue and neuronal regeneration in the injured spinal cord.
Kuttippurathu, Lakshmi; Patra, Biswanath; Hoek, Jan B; Vadigepalli, Rajanikanth
2016-03-01
Liver regeneration after partial hepatectomy is a clinically important process that is impaired by adaptation to chronic alcohol intake. We focused on the initial time points following partial hepatectomy (PHx) to analyze the genome-wide binding activity of NF-κB, a key immediate early regulator. We investigated the effect of chronic alcohol intake on immediate early NF-κB genome-wide localization, in the adapted state as well as in response to partial hepatectomy, using chromatin immunoprecipitation followed by promoter microarray analysis. We found many ethanol-specific NF-κB binding target promoters in the ethanol-adapted state, corresponding to the regulation of biosynthetic processes, oxidation-reduction and apoptosis. Partial hepatectomy induced a diet-independent shift in NF-κB binding loci relative to the transcription start sites. We employed a novel pattern count analysis to exhaustively enumerate and compare the number of promoters corresponding to the temporal binding patterns in ethanol and pair-fed control groups. The highest pattern count corresponded to promoters with NF-κB binding exclusively in the ethanol group at 1 h post PHx. This set was associated with the regulation of cell death, response to oxidative stress, histone modification, mitochondrial function, and metabolic processes. Integration with the global gene expression profiles to identify putative transcriptional consequences of NF-κB binding patterns revealed that several of the ethanol-specific 1 h binding targets showed ethanol-specific differential expression through 6 h post PHx. Motif analysis yielded co-incident binding loci for STAT3, AP-1, CREB, C/EBP-β, PPAR-γ and C/EBP-α, likely participating in co-regulatory modules with NF-κB in shaping the immediate early response to PHx. We conclude that adaptation to chronic ethanol intake disrupts the NF-κB promoter binding landscape with consequences for the immediate early gene regulatory response to the acute challenge of PHx.
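The pattern count analysis mentioned in this abstract is not specified in detail here, so the following is only a plausible sketch of the general idea: encode each promoter's binding calls over time as a binary tuple, enumerate every possible tuple, and count the promoters that match each one per group. The function name `pattern_counts`, the use of three time points, and the toy promoter labels are all assumptions for illustration, not the authors' implementation.

```python
from collections import Counter
from itertools import product


def pattern_counts(binding, n_timepoints):
    """Count promoters for every possible binary temporal binding pattern.

    binding: dict mapping promoter id -> tuple of 0/1 calls, one per time point.
    Returns a dict covering all 2**n_timepoints patterns, including zero counts.
    """
    observed = Counter(tuple(calls) for calls in binding.values())
    return {
        pattern: observed.get(pattern, 0)
        for pattern in product((0, 1), repeat=n_timepoints)
    }


# Toy binding calls at three hypothetical time points (not data from the study).
ethanol = {"p1": (1, 0, 0), "p2": (1, 0, 0), "p3": (0, 1, 1)}
control = {"p1": (0, 0, 0), "p2": (1, 0, 0), "p3": (0, 1, 1)}

ethanol_counts = pattern_counts(ethanol, 3)
control_counts = pattern_counts(control, 3)

# Compare the two groups pattern by pattern.
for pattern in ethanol_counts:
    print(pattern, ethanol_counts[pattern], control_counts[pattern])
```

Enumerating all patterns up front, rather than only the observed ones, keeps the two groups directly comparable even when a pattern is absent from one of them.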
Ong, Wen Dee; Voo, Lok-Yung Christopher; Kumar, Vijay Subbiah
2012-01-01
Background: Pineapple (Ananas comosus var. comosus) is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanisms and processes underlying fruit ripening would enable scientists to improve quality traits such as flavor, texture, appearance and fruit sweetness. Although the pineapple is an important fruit, insufficient transcriptomic or genomic information is available in public databases. Application of high-throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed. Methodology/Principal Findings: To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 million Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against the non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%), which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown. Conclusions: The unique transcripts derived from this work have substantially increased the number of pineapple fruit mRNA transcripts now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple. PMID:23091603
Hu, Wei; Zuo, Jiao; Hou, Xiaowan; Yan, Yan; Wei, Yunxie; Liu, Juhua; Li, Meiying; Xu, Biyu; Jin, Zhiqiang
2015-01-01
Auxin signaling regulates various auxin-responsive genes via two types of transcriptional regulators, Auxin Response Factors (ARF) and Aux/IAA. ARF transcription factors act as critical components of auxin signaling that play important roles in modulating various biological processes. However, limited information about this gene family in fruit crops is currently available. Herein, 47 ARF genes were identified in banana based on its genome sequence. Phylogenetic analysis of the ARFs from banana, rice, and Arabidopsis suggested that the ARFs could be divided into four subgroups, among which most ARFs from the banana showed a closer relationship with those from rice than those from Arabidopsis. Conserved motif analysis showed that all identified MaARFs had typical DNA-binding and ARF domains, but 12 members lacked the dimerization domain. Gene structure analysis showed that the number of exons in MaARF genes ranged from 5 to 21, suggesting large variation amongst banana ARF genes. The comprehensive expression profiles of MaARF genes yielded useful information about their involvement in diverse tissues, different stages of fruit development and ripening, and responses to abiotic stresses in different varieties. Interaction networks and co-expression assays indicated the strong transcriptional response of banana ARFs and ARF-mediated networks in early fruit development for different varieties. Our systematic analysis of MaARFs revealed robust tissue-specific, development-dependent, and abiotic stress-responsive candidate MaARF genes for further functional assays in planta. These findings could lead to potential applications in the genetic improvement of banana cultivars, and yield new insights into the complexity of the control of MaARF gene expression at the transcriptional level. Finally, they support the hypothesis that ARFs are a crucial component of the auxin signaling pathway, which regulates a wide range of physiological processes. PMID:26442055
Leveraging transcript quantification for fast computation of alternative splicing profiles.
Alamancos, Gael P; Pagès, Amadís; Trincado, Juan L; Bellora, Nicolás; Eyras, Eduardo
2015-09-01
Alternative splicing plays an essential role in many cellular processes and bears major relevance in the understanding of multiple diseases, including cancer. High-throughput RNA sequencing allows genome-wide analyses of splicing across multiple conditions. However, the increasing number of available data sets represents a major challenge in terms of computation time and storage requirements. We describe SUPPA, a computational tool to calculate relative inclusion values of alternative splicing events, exploiting fast transcript quantification. SUPPA accuracy is comparable and sometimes superior to standard methods using simulated as well as real RNA-sequencing data compared with experimentally validated events. We assess the variability in terms of the choice of annotation and provide evidence that using complete transcripts rather than more transcripts per gene provides better estimates. Moreover, SUPPA coupled with de novo transcript reconstruction methods does not achieve accuracies as high as using quantification of known transcripts, but remains comparable to existing methods. Finally, we show that SUPPA is more than 1000 times faster than standard methods. Coupled with fast transcript quantification, SUPPA provides inclusion values at a much higher speed than existing methods without compromising accuracy, thereby facilitating the systematic splicing analysis of large data sets with limited computational resources. The software is implemented in Python 2.7 and is available under the MIT license at https://bitbucket.org/regulatorygenomicsupf/suppa.
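A relative inclusion value of the kind described above can be derived from transcript-level abundances as the abundance of the transcripts that include the event divided by the abundance of all transcripts that define the event. The sketch below illustrates that ratio under stated assumptions; the function name `inclusion_value`, the transcript identifiers and the TPM values are hypothetical, and this is not SUPPA's actual code or command-line interface.

```python
def inclusion_value(tpm, inclusion_ids, event_ids):
    """Relative inclusion of an event from transcript abundances.

    tpm: dict mapping transcript id -> abundance (e.g. TPM).
    inclusion_ids: transcripts containing the inclusion form of the event.
    event_ids: all transcripts defining the event (inclusion and exclusion forms).
    Returns None when no transcript of the event is expressed.
    """
    inclusion_ids = set(inclusion_ids)
    event_ids = set(event_ids)
    if not inclusion_ids <= event_ids:
        raise ValueError("inclusion transcripts must be a subset of event transcripts")
    total = sum(tpm.get(t, 0.0) for t in event_ids)
    if total == 0:
        return None  # undefined ratio: the event is not expressed
    included = sum(tpm.get(t, 0.0) for t in inclusion_ids)
    return included / total


# Hypothetical abundances for three transcripts of one gene.
tpm = {"tx1": 30.0, "tx2": 10.0, "tx3": 5.0}
value = inclusion_value(tpm, inclusion_ids={"tx1"}, event_ids={"tx1", "tx2"})
print(value)
```

Because the calculation only sums precomputed abundances, its cost is independent of the number of reads, which is consistent with the speed argument made in the abstract.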
Yu, Da-Hai; Ware, Carol; Waterland, Robert A.; Zhang, Jiexin; Chen, Miao-Hsueh; Gadkari, Manasi; Kunde-Ramamoorthy, Govindarajan; Nosavanh, Lagina M.
2013-01-01
During development, a small but significant number of CpG islands (CGIs) become methylated. The timing of developmentally programmed CGI methylation and associated mechanisms of transcriptional regulation during cellular differentiation, however, remain poorly characterized. Here, we used genome-wide DNA methylation microarrays to identify epigenetic changes during human embryonic stem cell (hESC) differentiation. We discovered a group of CGIs associated with developmental genes that gain methylation after hESCs differentiate. Conversely, erasure of methylation was observed at the identified CGIs during subsequent reprogramming to induced pluripotent stem cells (iPSCs), further supporting a functional role for the CGI methylation. Both global gene expression profiling and quantitative reverse transcription-PCR (RT-PCR) validation indicated opposing effects of CGI methylation in transcriptional regulation during differentiation, with promoter CGI methylation repressing and 3′ CGI methylation activating transcription. By studying diverse human tissues and mouse models, we further confirmed that developmentally programmed 3′ CGI methylation confers tissue- and cell-type-specific gene activation in vivo. Importantly, luciferase reporter assays provided evidence that 3′ CGI methylation regulates transcriptional activation via a CTCF-dependent enhancer-blocking mechanism. These findings expand the classic view of mammalian CGI methylation as a mechanism for transcriptional silencing and indicate a functional role for 3′ CGI methylation in developmental gene regulation. PMID:23459939
Genome-wide miRNA response to anacardic acid in breast cancer cells
Schultz, David J.; Muluhngwi, Penn; Alizadeh-Rad, Negin; Green, Madelyn A.; Rouchka, Eric C.; Waigel, Sabine J.
2017-01-01
MicroRNAs are biomarkers and potential therapeutic targets for breast cancer. Anacardic acid (AnAc) is a dietary phenolic lipid that inhibits both MCF-7 estrogen receptor α (ERα) positive and MDA-MB-231 triple negative breast cancer (TNBC) cell proliferation with IC50s of 13.5 and 35 μM, respectively. To identify potential mediators of AnAc action in breast cancer, we profiled the genome-wide microRNA transcriptome (microRNAome) in these two cell lines altered by the AnAc 24:1n5 congener. Whole genome expression profiling (RNA-seq) and subsequent network analysis in MetaCore Gene Ontology (GO) algorithm was used to characterize the biological pathways altered by AnAc. In MCF-7 cells, 69 AnAc-responsive miRNAs were identified, e.g., increased let-7a and reduced miR-584. Fewer, i.e., 37 AnAc-responsive miRNAs were identified in MDA-MB-231 cells, e.g., decreased miR-23b and increased miR-1257. Only two miRNAs were increased by AnAc in both cell lines: miR-612 and miR-20b; however, opposite miRNA arm preference was noted: miR-20b-3p and miR-20b-5p were upregulated in MCF-7 and MDA-MB-231, respectively. miR-20b-5p target EFNB2 transcript levels were reduced by AnAc in MDA-MB-231 cells. AnAc reduced miR-378g that targets VIM (vimentin) and VIM mRNA transcript expression was increased in AnAc-treated MCF-7 cells, suggesting a reciprocal relationship. The top three enriched GO terms for AnAc-treated MCF-7 cells were B cell receptor signaling pathway and ribosomal large subunit biogenesis and S-adenosylmethionine metabolic process for AnAc-treated MDA-MB-231 cells. The pathways modulated by these AnAc-regulated miRNAs suggest that key nodal molecules, e.g., Cyclin D1, MYC, c-FOS, PPARγ, and SIN3, are targets of AnAc activity. PMID:28886127
Lata, Charu; Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Khan, Yusuf; Prasad, Manoj
2014-01-01
The APETALA2/ethylene-responsive element binding factor (AP2/ERF) family is one of the largest transcription factor (TF) families in plants that includes four major sub-families, namely AP2, DREB (dehydration responsive element binding), ERF (ethylene responsive factors) and RAV (Related to ABI3/VP). AP2/ERFs are known to play significant roles in various plant processes including growth and development and biotic and abiotic stress responses. Considering this, a comprehensive genome-wide study was conducted in foxtail millet (Setaria italica L.). A total of 171 AP2/ERF genes were identified by systematic sequence analysis and were physically mapped onto nine chromosomes. Phylogenetic analysis grouped AP2/ERF genes into six classes (I to VI). Duplication analysis revealed that 12 (∼7%) SiAP2/ERF genes were tandem repeated and 22 (∼13%) were segmentally duplicated. Comparative physical mapping between foxtail millet AP2/ERF genes and its orthologs of sorghum (18 genes), maize (14 genes), rice (9 genes) and Brachypodium (6 genes) showed the evolutionary insights of AP2/ERF gene family and also the decrease in orthology with increase in phylogenetic distance. The evolutionary significance in terms of gene-duplication and divergence was analyzed by estimating synonymous and non-synonymous substitution rates. Expression profiling of candidate AP2/ERF genes against drought, salt and phytohormones revealed insights into their precise and/or overlapping expression patterns which could be responsible for their functional divergence in foxtail millet. The study showed that the genes SiAP2/ERF-069, SiAP2/ERF-103 and SiAP2/ERF-120 may be considered as potential candidate genes for further functional validation as well for utilization in crop improvement programs for stress resistance since these genes were up-regulated under drought and salinity stresses in ABA dependent manner. 
Altogether the present study provides new insights into evolution, divergence and systematic functional analysis of AP2/ERF gene family at genome level in foxtail millet which may be utilized for improving stress adaptation and tolerance in millets, cereals and bioenergy grasses. PMID:25409524
Yang, Jun; An, Dong; Zhang, Peng
2011-03-01
Mechanisms related to the development of cassava storage roots and starch accumulation remain largely unknown. To evaluate genome-wide expression patterns during tuberization, a 60 mer oligonucleotide microarray representing 20 840 cassava genes was designed to identify differentially expressed transcripts in fibrous roots, developing storage roots and mature storage roots. Using a random variance model and the traditional twofold change method for statistical analysis, 912 and 3 386 upregulated and downregulated genes related to the three developmental phases were identified. Among 25 significantly changed pathways identified, glycolysis/gluconeogenesis was the most evident one. Rate-limiting enzymes were identified from each individual pathway, for example, enolase, L-lactate dehydrogenase and aldehyde dehydrogenase for glycolysis/gluconeogenesis, and ADP-glucose pyrophosphorylase, starch branching enzyme and glucan phosphorylase for sucrose and starch metabolism. This study revealed dynamic changes in at least 16% of the total transcripts, including transcription factors, oxidoreductases/transferases/hydrolases, hormone-related genes, and effectors of homeostasis. The reliability of these differentially expressed genes was verified by quantitative real-time reverse transcription-polymerase chain reaction. This study should facilitate our understanding of storage root formation and support cassava improvement. © 2011 Institute of Botany, Chinese Academy of Sciences.
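As a rough illustration of the twofold-change criterion mentioned in the abstract above, the sketch below flags a gene as up- or downregulated when the ratio of its expression between two stages is at least 2 or at most 0.5. The gene names and intensity values are invented for illustration, and the random variance model used in the study is not reproduced here.

```python
import math

def twofold_changes(baseline, treated, threshold=2.0):
    """Label genes 'up' or 'down' when treated/baseline crosses the fold threshold.

    baseline, treated: dicts mapping gene name -> positive expression value.
    Genes whose ratio stays inside (1/threshold, threshold) are omitted.
    """
    cutoff = math.log2(threshold)
    calls = {}
    for gene, base in baseline.items():
        log_ratio = math.log2(treated[gene] / base)
        if log_ratio >= cutoff:
            calls[gene] = "up"
        elif log_ratio <= -cutoff:
            calls[gene] = "down"
    return calls

# Hypothetical intensities for two developmental stages (not data from the study).
fibrous_root = {"geneA": 100.0, "geneB": 80.0, "geneC": 50.0}
storage_root = {"geneA": 250.0, "geneB": 30.0, "geneC": 60.0}
calls = twofold_changes(fibrous_root, storage_root)
```

Working on the log2 scale makes the up and down cutoffs symmetric, which is why a single threshold covers both directions.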
Murray, Vincent; Chen, Jon K; Galea, Anne M
2014-04-01
The genome-wide pattern of DNA cleavage at transcription start sites (TSSs) for the anti-tumor drug bleomycin was examined in human HeLa cells using next-generation DNA sequencing. It was found that actively transcribed genes were preferentially cleaved compared with non-transcribed genes. The 143,600 identified human TSSs were split into non-transcribed genes (82,596) and transcribed genes (61,004) for HeLa cells. These transcribed genes were further split into quintiles of 12,201 genes comprising the top 20, 20-40, 40-60, 60-80, and 80-100 % of expressed genes. The bleomycin cleavage pattern at highly transcribed gene TSSs was greatly enhanced compared with purified DNA and non-transcribed gene TSSs. The top 20 and 20-40 % quintiles had a very similar enhanced cleavage pattern, the 40-60 % quintile was intermediate, while the 60-80 and 80-100 % quintiles were close to the non-transcribed and purified DNA profiles. The pattern of bleomycin enhanced cleavage had peaks that were approximately 200 bp apart, and this indicated that bleomycin was identifying the presence of phased nucleosomes at TSSs. Hence bleomycin can be utilized to detect chromatin structures that are present at actively transcribed genes. In this study, for the first time, the pattern of DNA damage by a clinically utilized cancer chemotherapeutic agent was examined on a human genome-wide scale at the nucleotide level.
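The quintile split described in the abstract above can be sketched as follows. This is only one plausible binning rule, since the abstract does not state how ties or remainders were handled; the TSS identifiers and expression values are synthetic. Note that 61,004 is not divisible by 5, so under this rule four bins hold 12,201 entries and the last holds 12,200.

```python
def split_into_quintiles(expression):
    """Rank items by expression (highest first) and cut the ranking into five bins.

    expression: dict mapping an identifier -> expression value.
    Any remainder is spread over the leading bins, one extra item each.
    """
    ranked = sorted(expression, key=expression.get, reverse=True)
    base, extra = divmod(len(ranked), 5)
    bins, start = [], 0
    for i in range(5):
        size = base + (1 if i < extra else 0)
        bins.append(ranked[start:start + size])
        start += size
    return bins

# Synthetic stand-in for the 61,004 transcribed TSSs (values are not real data).
expression = {f"tss{i}": float(i) for i in range(61004)}
bins = split_into_quintiles(expression)
sizes = [len(b) for b in bins]  # [12201, 12201, 12201, 12201, 12200]
```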
Si, H; Lu, H; Yang, X; Mattox, A; Jang, M; Bian, Y; Sano, E; Viadiu, H; Yan, B; Yau, C; Ng, S; Lee, S K; Romano, R-A; Davis, S; Walker, R L; Xiao, W; Sun, H; Wei, L; Sinha, S; Benz, C C; Stuart, J M; Meltzer, P S; Van Waes, C; Chen, Z
2016-11-03
The Cancer Genome Atlas (TCGA) network study of 12 cancer types (PanCancer 12) revealed frequent mutation of TP53, and amplification and expression of related TP63 isoform ΔNp63 in squamous cancers. Further, aberrant expression of inflammatory genes and TP53/p63/p73 targets were detected in the PanCancer 12 project, reminiscent of gene programs comodulated by cREL/ΔNp63/TAp73 transcription factors we uncovered in head and neck squamous cell carcinomas (HNSCCs). However, how inflammatory gene signatures and cREL/p63/p73 targets are comodulated genome wide is unclear. Here, we examined how the inflammatory factor tumor necrosis factor-α (TNF-α) broadly modulates redistribution of cREL with ΔNp63α/TAp73 complexes and signatures genome wide in the HNSCC model UM-SCC46 using chromatin immunoprecipitation sequencing (ChIP-seq). TNF-α enhanced genome-wide co-occupancy of cREL with ΔNp63α on TP53/p63 sites, while unexpectedly promoting redistribution of TAp73 from TP53 to activator protein-1 (AP-1) sites. cREL, ΔNp63α and TAp73 binding and oligomerization on NF-κB-, TP53- or AP-1-specific sequences were independently validated by ChIP-qPCR (quantitative PCR), oligonucleotide-binding assays and analytical ultracentrifugation. Function of the binding activity was confirmed using TP53-, AP-1- and NF-κB-specific REs or p21, SERPINE1 and IL-6 promoter luciferase reporter activities. Concurrently, TNF-α regulated a broad gene network with cobinding activities for cREL, ΔNp63α and TAp73 observed upon array profiling and reverse transcription-PCR. Overlapping target gene signatures were observed in squamous cancer subsets and in inflamed skin of transgenic mice overexpressing ΔNp63α. Furthermore, multiple target genes identified in this study were linked to TP63 and TP73 activity and increased gene expression in large squamous cancer samples from PanCancer 12 TCGA by CircleMap. 
PARADIGM inferred pathway analysis revealed the network connection of TP63 and NF-κB complexes through an AP-1 hub, further supporting our findings. Thus, inflammatory cytokine TNF-α mediates genome-wide redistribution of the cREL/p63/p73, and AP-1 interactome, to diminish TAp73 tumor suppressor function and reciprocally activate NF-κB and AP-1 gene programs implicated in malignancy.
A Comparative Encyclopedia of DNA Elements in the Mouse Genome
Yue, Feng; Cheng, Yong; Breschi, Alessandra; Vierstra, Jeff; Wu, Weisheng; Ryba, Tyrone; Sandstrom, Richard; Ma, Zhihai; Davis, Carrie; Pope, Benjamin D.; Shen, Yin; Pervouchine, Dmitri D.; Djebali, Sarah; Thurman, Bob; Kaul, Rajinder; Rynes, Eric; Kirilusha, Anthony; Marinov, Georgi K.; Williams, Brian A.; Trout, Diane; Amrhein, Henry; Fisher-Aylor, Katherine; Antoshechkin, Igor; DeSalvo, Gilberto; See, Lei-Hoon; Fastuca, Meagan; Drenkow, Jorg; Zaleski, Chris; Dobin, Alex; Prieto, Pablo; Lagarde, Julien; Bussotti, Giovanni; Tanzer, Andrea; Denas, Olgert; Li, Kanwei; Bender, M. A.; Zhang, Miaohua; Byron, Rachel; Groudine, Mark T.; McCleary, David; Pham, Long; Ye, Zhen; Kuan, Samantha; Edsall, Lee; Wu, Yi-Chieh; Rasmussen, Matthew D.; Bansal, Mukul S.; Keller, Cheryl A.; Morrissey, Christapher S.; Mishra, Tejaswini; Jain, Deepti; Dogan, Nergiz; Harris, Robert S.; Cayting, Philip; Kawli, Trupti; Boyle, Alan P.; Euskirchen, Ghia; Kundaje, Anshul; Lin, Shin; Lin, Yiing; Jansen, Camden; Malladi, Venkat S.; Cline, Melissa S.; Erickson, Drew T.; Kirkup, Vanessa M; Learned, Katrina; Sloan, Cricket A.; Rosenbloom, Kate R.; de Sousa, Beatriz Lacerda; Beal, Kathryn; Pignatelli, Miguel; Flicek, Paul; Lian, Jin; Kahveci, Tamer; Lee, Dongwon; Kent, W. James; Santos, Miguel Ramalho; Herrero, Javier; Notredame, Cedric; Johnson, Audra; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Canfield, Theresa; Sabo, Peter J.; Wilken, Matthew S.; Reh, Thomas A.; Giste, Erika; Shafer, Anthony; Kutyavin, Tanya; Haugen, Eric; Dunn, Douglas; Reynolds, Alex P.; Neph, Shane; Humbert, Richard; Hansen, R. 
Scott; De Bruijn, Marella; Selleri, Licia; Rudensky, Alexander; Josefowicz, Steven; Samstein, Robert; Eichler, Evan E.; Orkin, Stuart H.; Levasseur, Dana; Papayannopoulou, Thalia; Chang, Kai-Hsin; Skoultchi, Arthur; Gosh, Srikanta; Disteche, Christine; Treuting, Piper; Wang, Yanli; Weiss, Mitchell J.; Blobel, Gerd A.; Good, Peter J.; Lowdon, Rebecca F.; Adams, Leslie B.; Zhou, Xiao-Qiao; Pazin, Michael J.; Feingold, Elise A.; Wold, Barbara; Taylor, James; Kellis, Manolis; Mortazavi, Ali; Weissman, Sherman M.; Stamatoyannopoulos, John; Snyder, Michael P.; Guigo, Roderic; Gingeras, Thomas R.; Gilbert, David M.; Hardison, Ross C.; Beer, Michael A.; Ren, Bing
2014-01-01
Summary As the premier model organism in biomedical research, the laboratory mouse shares the majority of protein-coding genes with humans, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications, and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of other sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases. PMID:25409824
A comparative encyclopedia of DNA elements in the mouse genome.
Yue, Feng; Cheng, Yong; Breschi, Alessandra; Vierstra, Jeff; Wu, Weisheng; Ryba, Tyrone; Sandstrom, Richard; Ma, Zhihai; Davis, Carrie; Pope, Benjamin D; Shen, Yin; Pervouchine, Dmitri D; Djebali, Sarah; Thurman, Robert E; Kaul, Rajinder; Rynes, Eric; Kirilusha, Anthony; Marinov, Georgi K; Williams, Brian A; Trout, Diane; Amrhein, Henry; Fisher-Aylor, Katherine; Antoshechkin, Igor; DeSalvo, Gilberto; See, Lei-Hoon; Fastuca, Meagan; Drenkow, Jorg; Zaleski, Chris; Dobin, Alex; Prieto, Pablo; Lagarde, Julien; Bussotti, Giovanni; Tanzer, Andrea; Denas, Olgert; Li, Kanwei; Bender, M A; Zhang, Miaohua; Byron, Rachel; Groudine, Mark T; McCleary, David; Pham, Long; Ye, Zhen; Kuan, Samantha; Edsall, Lee; Wu, Yi-Chieh; Rasmussen, Matthew D; Bansal, Mukul S; Kellis, Manolis; Keller, Cheryl A; Morrissey, Christapher S; Mishra, Tejaswini; Jain, Deepti; Dogan, Nergiz; Harris, Robert S; Cayting, Philip; Kawli, Trupti; Boyle, Alan P; Euskirchen, Ghia; Kundaje, Anshul; Lin, Shin; Lin, Yiing; Jansen, Camden; Malladi, Venkat S; Cline, Melissa S; Erickson, Drew T; Kirkup, Vanessa M; Learned, Katrina; Sloan, Cricket A; Rosenbloom, Kate R; Lacerda de Sousa, Beatriz; Beal, Kathryn; Pignatelli, Miguel; Flicek, Paul; Lian, Jin; Kahveci, Tamer; Lee, Dongwon; Kent, W James; Ramalho Santos, Miguel; Herrero, Javier; Notredame, Cedric; Johnson, Audra; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Canfield, Theresa; Sabo, Peter J; Wilken, Matthew S; Reh, Thomas A; Giste, Erika; Shafer, Anthony; Kutyavin, Tanya; Haugen, Eric; Dunn, Douglas; Reynolds, Alex P; Neph, Shane; Humbert, Richard; Hansen, R Scott; De Bruijn, Marella; Selleri, Licia; Rudensky, Alexander; Josefowicz, Steven; Samstein, Robert; Eichler, Evan E; Orkin, Stuart H; Levasseur, Dana; Papayannopoulou, Thalia; Chang, Kai-Hsin; Skoultchi, Arthur; Gosh, Srikanta; Disteche, Christine; Treuting, Piper; Wang, Yanli; Weiss, Mitchell J; Blobel, Gerd A; Cao, Xiaoyi; Zhong, Sheng; Wang, Ting; Good, Peter J; 
Lowdon, Rebecca F; Adams, Leslie B; Zhou, Xiao-Qiao; Pazin, Michael J; Feingold, Elise A; Wold, Barbara; Taylor, James; Mortazavi, Ali; Weissman, Sherman M; Stamatoyannopoulos, John A; Snyder, Michael P; Guigo, Roderic; Gingeras, Thomas R; Gilbert, David M; Hardison, Ross C; Beer, Michael A; Ren, Bing
2014-11-20
The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.
Lakhina, Vanisha; Arey, Rachel N.; Kaletsky, Rachel; Kauffman, Amanda; Stein, Geneva; Keyes, William; Xu, Daniel; Murphy, Coleen T.
2014-01-01
SUMMARY Induced CREB activity is a hallmark of long-term memory, but the full repertoire of CREB transcriptional targets required specifically for memory is not known in any system. To obtain a more complete picture of the mechanisms involved in memory, we combined memory training with genome-wide transcriptional analysis of C. elegans CREB mutants. This approach identified 757 significant CREB/memory-induced targets and confirmed the involvement of known memory genes from other organisms, but also suggested new mechanisms and novel components that may be conserved through mammals. CREB mediates distinct basal and memory transcriptional programs at least partially through spatial restriction of CREB activity: basal targets are regulated primarily in nonneuronal tissues, while memory targets are enriched for neuronal expression, emanating from CREB activity in AIM neurons. This suite of novel memory-associated genes will provide a platform for the discovery of orthologous mammalian long-term memory components. PMID:25611510
Mediator links transcription and DNA repair by facilitating Rad2/XPG recruitment.
Eyboulet, Fanny; Cibot, Camille; Eychenne, Thomas; Neil, Helen; Alibert, Olivier; Werner, Michel; Soutourina, Julie
2013-12-01
Mediator is a large multiprotein complex conserved in all eukaryotes. The crucial function of Mediator in transcription is now largely established. However, we found that this complex also plays an important role by connecting transcription with DNA repair. We identified a functional contact between the Med17 Mediator subunit and Rad2/XPG, the 3' endonuclease involved in nucleotide excision DNA repair. Genome-wide location analyses revealed that Rad2 is associated with RNA polymerase II (Pol II)- and Pol III-transcribed genes and telomeric regions in the absence of exogenous genotoxic stress. Rad2 occupancy of Pol II-transcribed genes is transcription-dependent. Genome-wide Rad2 occupancy of class II gene promoters is well correlated with that of Mediator. Furthermore, UV sensitivity of med17 mutants is correlated with reduced Rad2 occupancy of class II genes and concomitant decrease of Mediator interaction with Rad2 protein. Our results suggest that Mediator is involved in DNA repair by facilitating Rad2 recruitment to transcribed genes.
Mediator links transcription and DNA repair by facilitating Rad2/XPG recruitment
Eyboulet, Fanny; Cibot, Camille; Eychenne, Thomas; Neil, Helen; Alibert, Olivier; Werner, Michel; Soutourina, Julie
2013-01-01
Mediator is a large multiprotein complex conserved in all eukaryotes. The crucial function of Mediator in transcription is now largely established. However, we found that this complex also plays an important role by connecting transcription with DNA repair. We identified a functional contact between the Med17 Mediator subunit and Rad2/XPG, the 3′ endonuclease involved in nucleotide excision DNA repair. Genome-wide location analyses revealed that Rad2 is associated with RNA polymerase II (Pol II)- and Pol III-transcribed genes and telomeric regions in the absence of exogenous genotoxic stress. Rad2 occupancy of Pol II-transcribed genes is transcription-dependent. Genome-wide Rad2 occupancy of class II gene promoters is well correlated with that of Mediator. Furthermore, UV sensitivity of med17 mutants is correlated with reduced Rad2 occupancy of class II genes and concomitant decrease of Mediator interaction with Rad2 protein. Our results suggest that Mediator is involved in DNA repair by facilitating Rad2 recruitment to transcribed genes. PMID:24298055
Activity-Dependent Human Brain Coding/Noncoding Gene Regulatory Networks
Lipovich, Leonard; Dachet, Fabien; Cai, Juan; Bagla, Shruti; Balan, Karina; Jia, Hui; Loeb, Jeffrey A.
2012-01-01
While most gene transcription yields RNA transcripts that code for proteins, a sizable proportion of the genome generates RNA transcripts that do not code for proteins, but may have important regulatory functions. The brain-derived neurotrophic factor (BDNF) gene, a key regulator of neuronal activity, is overlapped by a primate-specific, antisense long noncoding RNA (lncRNA) called BDNFOS. We demonstrate reciprocal patterns of BDNF and BDNFOS transcription in highly active regions of human neocortex removed as a treatment for intractable seizures. A genome-wide analysis of activity-dependent coding and noncoding human transcription using a custom lncRNA microarray identified 1288 differentially expressed lncRNAs, of which 26 had expression profiles that matched activity-dependent coding genes and an additional 8 were adjacent to or overlapping with differentially expressed protein-coding genes. The functions of most of these protein-coding partner genes, such as ARC, include long-term potentiation, synaptic activity, and memory. The nuclear lncRNAs NEAT1, MALAT1, and RPPH1, composing an RNAse P-dependent lncRNA-maturation pathway, were also upregulated. As a means to replicate human neuronal activity, repeated depolarization of SY5Y cells resulted in sustained CREB activation and produced an inverse pattern of BDNF-BDNFOS co-expression that was not achieved with a single depolarization. RNAi-mediated knockdown of BDNFOS in human SY5Y cells increased BDNF expression, suggesting that BDNFOS directly downregulates BDNF. Temporal expression patterns of other lncRNA-messenger RNA pairs validated the effect of chronic neuronal activity on the transcriptome and implied various lncRNA regulatory mechanisms. lncRNAs, some of which are unique to primates, thus appear to have potentially important regulatory roles in activity-dependent human brain plasticity. PMID:22960213
Antisense Transcription Is Pervasive but Rarely Conserved in Enteric Bacteria
Raghavan, Rahul; Sloan, Daniel B.; Ochman, Howard
2012-01-01
ABSTRACT Noncoding RNAs, including antisense RNAs (asRNAs) that originate from the complementary strand of protein-coding genes, are involved in the regulation of gene expression in all domains of life. Recent application of deep-sequencing technologies has revealed that the transcription of asRNAs occurs genome-wide in bacteria. Although the role of the vast majority of asRNAs remains unknown, it is often assumed that their presence implies important regulatory functions, similar to those of other noncoding RNAs. Alternatively, many antisense transcripts may be produced by chance transcription events from promoter-like sequences that result from the degenerate nature of bacterial transcription factor binding sites. To investigate the biological relevance of antisense transcripts, we compared genome-wide patterns of asRNA expression in closely related enteric bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, by performing strand-specific transcriptome sequencing. Although antisense transcripts are abundant in both species, less than 3% of asRNAs are expressed at high levels in both species, and only about 14% appear to be conserved among species. And unlike the promoters of protein-coding genes, asRNA promoters show no evidence of sequence conservation between, or even within, species. Our findings suggest that many or even most bacterial asRNAs are nonadaptive by-products of the cell’s transcription machinery. PMID:22872780
USDA-ARS's Scientific Manuscript database
One focus of the Sorghum Translational Genomics Lab (part of sorghum CRIS, PSGD, CSRL, USDA-ARS, Lubbock TX) is to utilize nucleotide variation between sorghum germplasm such as those derived from RNA seq for translation and validation of Single Nucleotide Polymorphism (SNP) into easy access DNA m...
Galloway, Alison; Ahlfors, Helena; Turner, Martin
2016-01-01
The RNA binding proteins Zfp36l1 and Zfp36l2 act redundantly to enforce the β-selection checkpoint during thymopoiesis, yet their molecular targets remain largely unknown. Here, we identify these targets on a genome wide scale in primary mouse thymocytes and show that Zfp36l1/l2 regulate DNA damage response and cell cycle transcripts to ensure proper β-selection. DN3 thymocytes lacking Zfp36l1/l2 share a gene expression profile with post-selected DN3b cells despite the absence of intracellular TCRβ and reduced IL-7 signaling. Our findings show that in addition to controlling the timing of proliferation at β-selection post-transcriptional control by Zfp36l1/l2 limits DNA damage responses which are known to promote thymocyte differentiation. Zfp36l1/l2 therefore act as post-transcriptional safeguards against chromosomal instability and replication stress by integrating pre-TCR and IL-7 signaling with DNA damage and cell cycle control. PMID:27566829
Liu, Shouan; Kracher, Barbara; Ziegler, Jörg; Birkenbihl, Rainer P; Somssich, Imre E
2015-01-01
The Arabidopsis mutant wrky33 is highly susceptible to Botrytis cinerea. We identified >1680 Botrytis-induced WRKY33 binding sites associated with 1576 Arabidopsis genes. Transcriptional profiling defined 318 functional direct target genes at 14 hr post inoculation. Comparative analyses revealed that WRKY33 possesses dual functionality acting either as a repressor or as an activator in a promoter-context dependent manner. We confirmed known WRKY33 targets involved in hormone signaling and phytoalexin biosynthesis, but also uncovered a novel negative role of abscisic acid (ABA) in resistance towards B. cinerea 2100. The ABA biosynthesis genes NCED3 and NCED5 were identified as direct targets required for WRKY33-mediated resistance. Loss-of-WRKY33 function resulted in elevated ABA levels and genetic studies confirmed that WRKY33 acts upstream of NCED3/NCED5 to negatively regulate ABA biosynthesis. This study provides the first detailed view of the genome-wide contribution of a specific plant transcription factor in modulating the transcriptional network associated with plant immunity. DOI: http://dx.doi.org/10.7554/eLife.07295.001 PMID:26076231
Transcriptional risk scores link GWAS to eQTLs and predict complications in Crohn's disease.
Marigorta, Urko M; Denson, Lee A; Hyams, Jeffrey S; Mondal, Kajari; Prince, Jarod; Walters, Thomas D; Griffiths, Anne; Noe, Joshua D; Crandall, Wallace V; Rosh, Joel R; Mack, David R; Kellermayer, Richard; Heyman, Melvin B; Baker, Susan S; Stephens, Michael C; Baldassano, Robert N; Markowitz, James F; Kim, Mi-Ok; Dubinsky, Marla C; Cho, Judy; Aronow, Bruce J; Kugathasan, Subra; Gibson, Greg
2017-10-01
Gene expression profiling can be used to uncover the mechanisms by which loci identified through genome-wide association studies (GWAS) contribute to pathology. Given that most GWAS hits are in putative regulatory regions and transcript abundance is physiologically closer to the phenotype of interest, we hypothesized that summation of risk-allele-associated gene expression, namely a transcriptional risk score (TRS), should provide accurate estimates of disease risk. We integrate summary-level GWAS and expression quantitative trait locus (eQTL) data with RNA-seq data from the RISK study, an inception cohort of pediatric Crohn's disease. We show that TRSs based on genes regulated by variants linked to inflammatory bowel disease (IBD) not only outperform genetic risk scores (GRSs) in distinguishing Crohn's disease from healthy samples, but also serve to identify patients who in time will progress to complicated disease. Our dissection of eQTL effects may be used to distinguish genes whose association with disease is through promotion versus protection, thereby linking statistical association to biological mechanism. The TRS approach constitutes a potential strategy for personalized medicine that enhances inference from static genotypic risk assessment.
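The abstract above describes the transcriptional risk score as a summation of risk-allele-associated gene expression. A minimal sketch of that idea follows, under stated assumptions: expression is z-scored per gene across samples, and each gene is given a sign of +1 or -1 according to whether the risk allele is associated with higher or lower expression. Both choices, along with the gene names and values, are illustrative assumptions and are not taken from the study.

```python
from statistics import mean, pstdev

def transcriptional_risk_scores(expression, risk_direction):
    """Sum sign-polarized, z-scored expression over genes for each sample.

    expression: dict gene -> list of expression values, one per sample.
    risk_direction: dict gene -> +1 if the risk allele raises expression, -1 if it lowers it.
    Returns a list with one score per sample.
    """
    n_samples = len(next(iter(expression.values())))
    scores = [0.0] * n_samples
    for gene, values in expression.items():
        mu, sigma = mean(values), pstdev(values)
        if sigma == 0:
            continue  # a gene with constant expression cannot separate samples
        for i, value in enumerate(values):
            scores[i] += risk_direction[gene] * (value - mu) / sigma
    return scores

# Hypothetical data: three samples, two genes with opposite risk directions.
expression = {"geneX": [1.0, 2.0, 3.0], "geneY": [6.0, 4.0, 2.0]}
risk_direction = {"geneX": +1, "geneY": -1}
scores = transcriptional_risk_scores(expression, risk_direction)
```

In this toy example the third sample has the highest score, because it shows high expression of the gene whose risk allele raises expression and low expression of the gene whose risk allele lowers it.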
Choi, Young-Jun; Fuchs, Jeremy F.; Mayhew, George F.; Yu, Helen E.; Christensen, Bruce M.
2012-01-01
Hemocytes are integral components of mosquito immune mechanisms such as phagocytosis, melanization, and production of antimicrobial peptides. However, our understanding of hemocyte-specific molecular processes and their contribution to shaping the host immune response remains limited. To better understand the immunophysiological features distinctive of hemocytes, we conducted genome-wide analysis of hemocyte-enriched transcripts, and examined how tissue-enriched expression patterns change with the immune status of the host. Our microarray data indicate that the hemocyte-enriched transcriptome is dynamic and context-dependent. Analysis of transcripts enriched after bacterial challenge in circulating hemocytes with respect to carcass added a dimension to evaluating infection-responsive genes and immune-related gene families. We resolved patterns of transcriptional change unique to hemocytes from those that are likely shared by other immune responsive tissues, and identified clusters of genes preferentially induced in hemocytes, likely reflecting their involvement in cell type specific functions. In addition, cross-species meta-analysis of microarray expression data from Anopheles gambiae and Drosophila melanogaster revealed conserved hemocyte-enriched molecular repertoires that might be implicated in core hemocyte function. PMID:22796331
Comparative 454 pyrosequencing of transcripts from two olive genotypes during fruit development
Alagna, Fiammetta; D'Agostino, Nunzio; Torchia, Laura; Servili, Maurizio; Rao, Rosa; Pietrella, Marco; Giuliano, Giovanni; Chiusano, Maria Luisa; Baldoni, Luciana; Perrotta, Gaetano
2009-01-01
Background Despite its primary economic importance, genomic information on olive tree is still lacking. 454 pyrosequencing was used to enrich the very few sequence data currently available for the Olea europaea species and to identify genes involved in expression of fruit quality traits. Results Fruits of Coratina, a widely cultivated variety characterized by a very high phenolic content, and Tendellone, an oleuropein-lacking natural variant, were used as starting material for monitoring the transcriptome. Four different cDNA libraries were sequenced, respectively at the beginning and at the end of drupe development. A total of 261,485 reads were obtained, for an output of about 58 Mb. Raw sequence data were processed using a four step pipeline procedure and data were stored in a relational database with a web interface. Conclusion Massively parallel sequencing of different fruit cDNA collections has provided large scale information about the structure and putative function of gene transcripts accumulated during fruit development. Comparative transcript profiling allowed the identification of differentially expressed genes with potential relevance in regulating the fruit metabolism and phenolic content during ripening. PMID:19709400
Bowman, Megan J.; Park, Wonkeun; Bauer, Philip J.; Udall, Joshua A.; Page, Justin T.; Raney, Joshua; Scheffler, Brian E.; Jones, Don. C.; Campbell, B. Todd
2013-01-01
An RNA-Seq experiment was performed using field grown well-watered and naturally rain fed cotton plants to identify differentially expressed transcripts under water-deficit stress. Our work constitutes the first application of the newly published diploid D5 Gossypium raimondii sequence in the study of tetraploid AD1 upland cotton RNA-seq transcriptome analysis. A total of 1,530 transcripts were differentially expressed between well-watered and water-deficit stressed root tissues, in patterns that confirm the accuracy of this technique for future studies in cotton genomics. Additionally, putative sequence based genome localization of differentially expressed transcripts detected A2 genome specific gene expression under water-deficit stress. These data will facilitate efforts to understand the complex responses governing transcriptomic regulatory mechanisms and to identify candidate genes that may benefit applied plant breeding programs. PMID:24324815
FANTOM5 CAGE profiles of human and mouse reprocessed for GRCh38 and GRCm38 genome assemblies.
Abugessaisa, Imad; Noguchi, Shuhei; Hasegawa, Akira; Harshbarger, Jayson; Kondo, Atsushi; Lizio, Marina; Severin, Jessica; Carninci, Piero; Kawaji, Hideya; Kasukawa, Takeya
2017-08-29
The FANTOM5 consortium described the promoter-level expression atlas of human and mouse by using CAGE (Cap Analysis of Gene Expression) with single molecule sequencing. In the original publications, GRCh37/hg19 and NCBI37/mm9 assemblies were used as the reference genomes of human and mouse respectively; later, the Genome Reference Consortium released newer genome assemblies GRCh38/hg38 and GRCm38/mm10. To increase the utility of the atlas in future research, we reprocessed the data to make them available on the recent genome assemblies. The data include observed frequencies of transcription starting sites (TSSs) based on the realignment of CAGE reads, and TSS peaks that are converted from those based on the previous reference. Annotations of the peak names were also updated based on the latest public databases. The reprocessed results enable us to examine frequencies of transcription initiations on the recent genome assemblies and to refer to promoters with updated information across the genome assemblies consistently.
Genome-wide alterations of the DNA replication program during tumor progression
NASA Astrophysics Data System (ADS)
Arneodo, A.; Goldar, A.; Argoul, F.; Hyrien, O.; Audit, B.
2016-08-01
Oncogenic stress is a major driving force in the early stages of cancer development. Recent experimental findings reveal that, in precancerous lesions and cancers, activated oncogenes may induce stalling and dissociation of DNA replication forks resulting in DNA damage. Replication timing is emerging as an important epigenetic feature that recapitulates several genomic, epigenetic and functional specificities of even closely related cell types. There is increasing evidence that chromosome rearrangements, the hallmark of many cancer genomes, are intimately associated with the DNA replication program and that epigenetic replication timing changes often precede chromosomal rearrangements. The recent development of a novel methodology to map replication fork polarity using deep sequencing of Okazaki fragments has provided new and complementary genome-wide replication profiling data. We review the results of a wavelet-based multi-scale analysis of genomic and epigenetic data including replication profiles along human chromosomes. These results provide new insight into the spatio-temporal replication program and its dynamics during differentiation. Here our goal is to bring to cancer research the experimental protocols and computational methodologies for replication program profiling, and also the modeling of the spatio-temporal replication program. To illustrate our purpose, we report very preliminary results obtained for chronic myelogenous leukemia, the archetype model of cancer. Finally, we discuss promising perspectives on using genome-wide DNA replication profiling as a novel efficient tool for cancer diagnosis, prognosis and personalized treatment.
Orgeur, Mickael; Martens, Marvin; Leonte, Georgeta; Nassari, Sonya; Bonnin, Marie-Ange; Börno, Stefan T; Timmermann, Bernd; Hecht, Jochen; Duprez, Delphine; Stricker, Sigmar
2018-03-29
Connective tissues support organs and play crucial roles in development, homeostasis and fibrosis, yet our understanding of their formation is still limited. To gain insight into the molecular mechanisms of connective tissue specification, we selected five zinc-finger transcription factors - OSR1, OSR2, EGR1, KLF2 and KLF4 - based on their expression patterns and/or known involvement in connective tissue subtype differentiation. RNA-seq and ChIP-seq profiling of chick limb micromass cultures revealed a set of common genes regulated by all five transcription factors, which we describe as a connective tissue core expression set. This common core was enriched with genes associated with axon guidance and myofibroblast signature, including fibrosis-related genes. In addition, each transcription factor regulated a specific set of signalling molecules and extracellular matrix components. This suggests a concept whereby local molecular niches can be created by the expression of specific transcription factors impinging on the specification of local microenvironments. The regulatory network established here identifies common and distinct molecular signatures of limb connective tissue subtypes, provides novel insight into the signalling pathways governing connective tissue specification, and serves as a resource for connective tissue development. © 2018. Published by The Company of Biologists Ltd.
Rangel, Luiz Thibério; Novaes, Jeniffer; Durham, Alan M.; Madeira, Alda Maria B. N.; Gruber, Arthur
2013-01-01
Parasites of the genus Eimeria infect a wide range of vertebrate hosts, including chickens. We have recently reported a comparative analysis of the transcriptomes of Eimeria acervulina, Eimeria maxima and Eimeria tenella, integrating ORESTES data produced by our group and publicly available Expressed Sequence Tags (ESTs). All cDNA reads have been assembled, and the reconstructed transcripts have been submitted to a comprehensive functional annotation pipeline. Additional studies included orthology assignment across apicomplexan parasites and clustering analyses of gene expression profiles among different developmental stages of the parasites. To make all this body of information publicly available, we constructed the Eimeria Transcript Database (EimeriaTDB), a web repository that provides access to sequence data, annotation and comparative analyses. Here, we describe the web interface, available sequence data sets and query tools implemented on the site. The main goal of this work is to offer a public repository of sequence and functional annotation data of reconstructed transcripts of parasites of the genus Eimeria. We believe that EimeriaTDB will represent a valuable and complementary resource for the Eimeria scientific community and for those researchers interested in comparative genomics of apicomplexan parasites. Database URL: http://www.coccidia.icb.usp.br/eimeriatdb/ PMID:23411718
β-adrenergic-stimulated macrophages: Comprehensive localization in the M1–M2 spectrum
Lamkin, Donald M.; Ho, Hsin-Yun; Ong, Tiffany H.; Kawanishi, Carly K.; Stoffers, Victoria L.; Ahlawat, Nivedita; Ma, Jeffrey C.Y.; Arevalo, Jesusa M. G.; Cole, Steve W.; Sloan, Erica K.
2016-01-01
β-adrenergic signaling can regulate macrophage involvement in several diseases and often produces anti-inflammatory properties in macrophages, which are similar to M2 properties in a dichotomous M1 vs. M2 macrophage taxonomy. However, it is not clear that β-adrenergic-stimulated macrophages may be classified strictly as M2. In this in vitro study, we utilized recently published criteria and transcriptome-wide bioinformatics methods to map the relative polarity of murine β-adrenergic-stimulated macrophages within a wider M1–M2 spectrum. Results show that β-adrenergic-stimulated macrophages did not fit entirely into any one predefined category of the M1–M2 spectrum but did express genes that are representative of some M2 side categories. Moreover, transcript origin analysis of genome-wide transcriptional profiles located β-adrenergic-stimulated macrophages firmly on the M2 side of the M1–M2 spectrum and found active suppression of M1 side gene transcripts. The signal transduction pathways involved were mapped through blocking experiments and bioinformatics analysis of transcription factor binding motifs. M2-promoting effects were mediated specifically through β2-adrenergic receptors and were associated with CREB, C/EBPβ, and ATF transcription factor pathways but not with established M1–M2 STAT pathways. Thus, β-adrenergic-signaling induces a macrophage transcriptome that locates on the M2 side of the M1–M2 spectrum but likely accomplishes this effect through a signaling pathway that is atypical for M2-spectrum macrophages. PMID:27485040
β-Adrenergic-stimulated macrophages: Comprehensive localization in the M1-M2 spectrum.
Lamkin, Donald M; Ho, Hsin-Yun; Ong, Tiffany H; Kawanishi, Carly K; Stoffers, Victoria L; Ahlawat, Nivedita; Ma, Jeffrey C Y; Arevalo, Jesusa M G; Cole, Steve W; Sloan, Erica K
2016-10-01
β-Adrenergic signaling can regulate macrophage involvement in several diseases and often produces anti-inflammatory properties in macrophages, which are similar to M2 properties in a dichotomous M1 vs. M2 macrophage taxonomy. However, it is not clear that β-adrenergic-stimulated macrophages may be classified strictly as M2. In this in vitro study, we utilized recently published criteria and transcriptome-wide bioinformatics methods to map the relative polarity of murine β-adrenergic-stimulated macrophages within a wider M1-M2 spectrum. Results show that β-adrenergic-stimulated macrophages did not fit entirely into any one pre-defined category of the M1-M2 spectrum but did express genes that are representative of some M2 side categories. Moreover, transcript origin analysis of genome-wide transcriptional profiles located β-adrenergic-stimulated macrophages firmly on the M2 side of the M1-M2 spectrum and found active suppression of M1 side gene transcripts. The signal transduction pathways involved were mapped through blocking experiments and bioinformatics analysis of transcription factor binding motifs. M2-promoting effects were mediated specifically through β2-adrenergic receptors and were associated with CREB, C/EBPβ, and ATF transcription factor pathways but not with established M1-M2 STAT pathways. Thus, β-adrenergic-signaling induces a macrophage transcriptome that locates on the M2 side of the M1-M2 spectrum but likely accomplishes this effect through a signaling pathway that is atypical for M2-spectrum macrophages. Copyright © 2016 Elsevier Inc. All rights reserved.
Lee, Je Hyuk; Daugharthy, Evan R.; Scheiman, Jonathan; Kalhor, Reza; Ferrante, Thomas C.; Terry, Richard; Turczyk, Brian M.; Yang, Joyce L.; Lee, Ho Suk; Aach, John; Zhang, Kun; Church, George M.
2014-01-01
RNA sequencing measures the quantitative change in gene expression over the whole transcriptome, but it lacks spatial context. On the other hand, in situ hybridization provides the location of gene expression, but only for a small number of genes. Here we detail a protocol for genome-wide profiling of gene expression in situ in fixed cells and tissues, in which RNA is converted into cross-linked cDNA amplicons and sequenced manually on a confocal microscope. Unlike traditional RNA-seq our method enriches for context-specific transcripts over house-keeping and/or structural RNA, and it preserves the tissue architecture for RNA localization studies. Our protocol is written for researchers experienced in cell microscopy with minimal computing skills. Library construction and sequencing can be completed within 14 d, with image analysis requiring an additional 2 d. PMID:25675209
Dosunmu, Remi; Alashwal, Hany; Zawia, Nasser H
2012-06-01
In this study, we assessed global gene expression patterns in adolescent mice exposed to lead (Pb) as infants and their aged siblings to identify reprogrammed genes. Global expression on postnatal day 20 and 700 was analyzed and genes that were down- and up-regulated (≥2 fold) were identified, clustered and analyzed for their relationship to DNA methylation. About 150 genes were differentially expressed in old age. In normal aging, we observed an up-regulation of genes related to the immune response, metal-binding, metabolism and transcription/transduction coupling. Prior exposure to Pb revealed a repression in these genes suggesting that disturbances in developmental stages of the brain compromise the ability to defend against age-related stressors, thus promoting the neurodegenerative process. Overexpression and repression of genes corresponded with their DNA methylation profile. Published by Elsevier Ireland Ltd.
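The ≥2-fold criterion mentioned in the abstract above can be illustrated with a minimal sketch. The function name, gene labels and expression values below are hypothetical and are not taken from the study; the sketch only assumes that the cutoff is applied symmetrically, so that a 2-fold decrease counts as down-regulation in the same way a 2-fold increase counts as up-regulation.

```python
import math


def classify_fold_change(baseline, aged, min_fold=2.0):
    """Label a gene 'up', 'down' or 'unchanged' with a symmetric fold-change cutoff.

    Hypothetical helper: the study only states that a >=2-fold threshold was used.
    """
    if baseline <= 0 or aged <= 0:
        raise ValueError("expression values must be positive")
    log2_fc = math.log2(aged / baseline)
    cutoff = math.log2(min_fold)
    if log2_fc >= cutoff:
        return "up"
    if log2_fc <= -cutoff:
        return "down"
    return "unchanged"


# Made-up expression values (arbitrary units) at postnatal day 20 and day 700.
example = {"geneA": (100.0, 250.0), "geneB": (80.0, 30.0), "geneC": (50.0, 60.0)}
calls = {gene: classify_fold_change(d20, d700) for gene, (d20, d700) in example.items()}
print(calls)
```

Working on the log2 scale keeps the up and down thresholds symmetric: a ratio of 2 and a ratio of 0.5 sit at the same distance from zero.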
Genome-wide identification and expression profiling of dehydrin gene family in Malus domestica.
Liang, Dong; Xia, Hui; Wu, Shan; Ma, Fengwang
2012-12-01
The family of dehydrin genes has important roles in protecting higher plants against abiotic stress, such as drought, salinity and cold. However, knowledge about the apple dehydrin gene family is limited. In the present study, we used a bioinformatics approach to identify members of that family in apple (Malus domestica). A total of 12 apple dehydrin genes (MdDHNs) were identified and located on various chromosomes. All putative proteins from those genes contained a typical K domain. Among 12 MdDHNs, nine were cloned and their expression patterns were investigated. Expression profiling indicated that these nine dehydrin genes display differential expression patterns in various tissues. Moreover, transcript levels of some MdDHNs were up-regulated significantly under drought, low temperature, or ABA treatment, which indicated their important roles during stress adaptation. These results demonstrate that the apple dehydrin gene family may function in tissue development and plant stress responses.
Pede, Valerie; Rombout, Ans; Vermeire, Jolien; Naessens, Evelien; Mestdagh, Pieter; Robberecht, Nore; Vanderstraeten, Hanne; Van Roy, Nadine; Vandesompele, Jo; Speleman, Frank; Philippé, Jan; Verhasselt, Bruno
2013-01-01
Chronic lymphocytic leukemia (CLL) is a disease with variable clinical outcome. Several prognostic factors such as the immunoglobulin heavy chain variable genes (IGHV) mutation status are linked to the B-cell receptor (BCR) complex, supporting a role for triggering the BCR in vivo in the pathogenesis. The miRNA profile upon stimulation and correlation with IGHV mutation status is however unknown. To evaluate the transcriptional response of peripheral blood CLL cells upon BCR stimulation in vitro, miRNA and mRNA expression was measured using hybridization arrays and qPCR. We found both IGHV mutated and unmutated CLL cells to respond with increased expression of MYC and other genes associated with BCR activation, and a phenotype of cell cycle progression. Genome-wide expression studies showed hsa-miR-132-3p/hsa-miR-212 miRNA cluster induction associated with a set of downregulated genes, enriched for genes modulated by BCR activation and amplified by Myc. We conclude that BCR triggering of CLL cells induces a transcriptional response of genes associated with BCR activation, enhanced cell cycle entry and progression and suggest that part of the transcriptional profiles linked to IGHV mutation status observed in isolated peripheral blood are not cell intrinsic but rather secondary to in vivo BCR stimulation. PMID:23560086
Transcriptional Profiling of Nitrogen Fixation in Azotobacter vinelandii
Hamilton, Trinity L.; Ludwig, Marcus; Dixon, Ray; Boyd, Eric S.; Dos Santos, Patricia C.; Setubal, João C.; Bryant, Donald A.; Dean, Dennis R.; Peters, John W.
2011-01-01
Most biological nitrogen (N2) fixation results from the activity of a molybdenum-dependent nitrogenase, a complex iron-sulfur enzyme found associated with a diversity of bacteria and some methanogenic archaea. Azotobacter vinelandii, an obligate aerobe, fixes nitrogen via the oxygen-sensitive Mo nitrogenase but is also able to fix nitrogen through the activities of genetically distinct alternative forms of nitrogenase designated the Vnf and Anf systems when Mo is limiting. The Vnf system appears to replace Mo with V, and the Anf system is thought to contain Fe as the only transition metal within the respective active site metallocofactors. Prior genetic analyses suggest that a number of nif-encoded components are involved in the Vnf and Anf systems. Genome-wide transcription profiling of A. vinelandii cultured under nitrogen-fixing conditions under various metal amendments (e.g., Mo or V) revealed the discrete complement of genes associated with each nitrogenase system and the extent of cross talk between the systems. In addition, changes in transcript levels of genes not directly involved in N2 fixation provided insight into the integration of central metabolic processes and the oxygen-sensitive process of N2 fixation in this obligate aerobe. The results underscored significant differences between Mo-dependent and Mo-independent diazotrophic growth that highlight the significant advantages of diazotrophic growth in the presence of Mo. PMID:21724999
Ruan, Meng-Bin; Guo, Xin; Wang, Bin; Yang, Yi-Ling; Li, Wen-Qi; Yu, Xiao-Ling; Zhang, Peng; Peng, Ming
2017-06-15
The myeloblastosis (MYB) transcription factor superfamily is the largest transcription factor family in plants, playing different roles during stress response. However, abiotic stress-responsive MYB transcription factors have not been systematically studied in cassava (Manihot esculenta), an important tropical tuber root crop. In this study, we used a genome-wide transcriptome analysis to predict 299 putative MeMYB genes in the cassava genome. Under drought and cold stresses, many MeMYB genes exhibited different expression patterns in cassava leaves, indicating that these genes might play a role in abiotic stress responses. We found that several stress-responsive MeMYB genes responded to abscisic acid (ABA) in cassava leaves. We characterize four MeMYBs, namely MeMYB1, MeMYB2, MeMYB4, and MeMYB9, as R2R3-MYB transcription factors. Furthermore, RNAi-driven repression of MeMYB2 resulted in drought and cold tolerance in transgenic cassava. Gene expression assays in wild-type and MeMYB2-RNAi cassava plants revealed that MeMYB2 may affect other MeMYBs as well as MeWRKYs under drought and cold stress, suggesting crosstalk between MYB and WRKY family genes under stress conditions in cassava. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Feng, Sheng Jun; Liu, Xue Song; Tao, Hua; Tan, Shang Kun; Chu, Shan Shan; Oono, Youko; Zhang, Xian Duo; Chen, Jian; Yang, Zhi Min
2016-12-01
We report genome-wide single-base resolution maps of methylated cytosines and transcriptome change in Cd-exposed rice. Widespread differences were identified in CG and non-CG methylation marks between Cd-exposed and Cd-free rice genomes. There are 2320 non-redundant differentially methylated regions detected in the genome. RNA sequencing revealed 2092 DNA methylation-modified genes differentially expressed under Cd exposure. More genes were found hypermethylated than those hypomethylated in CG, CHH and CHG (where H is A, C or T) contexts in upstream, gene body and downstream regions. Many of the genes were involved in stress response, metal transport and transcription factors. Most of the DNA methylation-modified genes were transcriptionally altered under Cd stress. A subset of loss of function mutants defective in DNA methylation and histone modification activities was used to identify transcript abundance of selected genes. Compared with wild type, mutation of MET1 and DRM2 resulted in generally lower transcript levels of the genes under Cd stress. Transcripts of OsIRO2, OsPR1b and Os09g02214 in drm2 were significantly reduced. A commonly used DNA methylation inhibitor 5-azacytidine was employed to investigate whether DNA demethylation affected physiological consequences. 5-azacytidine provision decreased general DNA methylation levels of selected genes, but promoted growth of rice seedlings and Cd accumulation in rice plant. © 2016 John Wiley & Sons Ltd.
Functional regression method for whole genome eQTL epistasis analysis with sequencing data.
Xu, Kelin; Jin, Li; Xiong, Momiao
2017-05-18
Epistasis plays an essential role in understanding the regulation mechanisms and is an essential component of the genetic architecture of the gene expressions. However, interaction analysis of gene expressions remains fundamentally unexplored due to great computational challenges and data availability. Due to variation in splicing, transcription start sites, polyadenylation sites, post-transcriptional RNA editing across the entire gene, and transcription rates of the cells, RNA-seq measurements generate large expression variability and collectively create the observed position level read count curves. A single number for measuring gene expression which is widely used for microarray measured gene expression analysis is highly unlikely to sufficiently account for large expression variation across the gene. Simultaneously analyzing epistatic architecture using the RNA-seq and whole genome sequencing (WGS) data poses enormous challenges. We develop a nonlinear functional regression model (FRGM) with functional responses where the position-level read counts within a gene are taken as a function of genomic position, and functional predictors where genotype profiles are viewed as a function of genomic position, for epistasis analysis with RNA-seq data. Instead of testing the interaction of all possible pairwise SNPs, the FRGM takes a gene as a basic unit for epistasis analysis, which tests for the interaction of all possible pairs of genes and uses all the information that can be accessed to collectively test interaction between all possible pairs of SNPs within two genome regions. By large-scale simulations, we demonstrate that the proposed FRGM for epistasis analysis can achieve the correct type 1 error and has higher power to detect the interactions between genes than the existing methods. The proposed methods are applied to the RNA-seq and WGS data from the 1000 Genome Project.
The numbers of pairs of significantly interacting genes after Bonferroni correction identified using FRGM, RPKM and DESeq were 16,2361, 260 and 51, respectively, from the 350 European samples. The proposed FRGM for epistasis analysis of RNA-seq can capture isoform and position-level information and will have a broad application. Both simulations and real data analysis highlight the potential for the FRGM to be a good choice of the epistatic analysis with sequencing data.
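The contrast drawn above between a single expression number and position-level read counts can be made concrete with a small sketch. It uses the standard RPKM definition (reads per kilobase of transcript per million mapped reads); the two toy profiles, the gene length and the library size are invented for illustration, and treating the per-position counts as summing to the gene's read count is a simplifying assumption, not something the abstract specifies.

```python
def rpkm(read_count, gene_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


# Two made-up position-level read-count profiles for genes of equal length.
# Assumption: each profile sums to the number of reads assigned to the gene.
profile_flat = [10] * 10
profile_skewed = [46, 30, 12, 6, 2, 1, 1, 1, 1, 0]

total_mapped = 1_000_000  # hypothetical library size

rpkm_flat = rpkm(sum(profile_flat), len(profile_flat), total_mapped)
rpkm_skewed = rpkm(sum(profile_skewed), len(profile_skewed), total_mapped)

# The summaries coincide even though the profiles along the gene differ.
print(rpkm_flat, rpkm_skewed, profile_flat != profile_skewed)
```

Both genes receive the same RPKM although their read distributions along the gene differ, which is the kind of information a functional (position-level) response is meant to retain.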
oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes
Ho Sui, Shannan J.; Mortimer, James R.; Arenillas, David J.; Brumm, Jochen; Walsh, Christopher J.; Kennedy, Brian P.; Wasserman, Wyeth W.
2005-01-01
Targeted transcript profiling studies can identify sets of co-expressed genes; however, identification of the underlying functional mechanism(s) is a significant challenge. Established methods for the analysis of gene annotations, particularly those based on the Gene Ontology, can identify functional linkages between genes. Similar methods for the identification of over-represented transcription factor binding sites (TFBSs) have been successful in yeast, but extension to human genomics has largely proved ineffective. Creation of a system for the efficient identification of common regulatory mechanisms in a subset of co-expressed human genes promises to break a roadblock in functional genomics research. We have developed an integrated system that searches for evidence of co-regulation by one or more transcription factors (TFs). oPOSSUM combines a pre-computed database of conserved TFBSs in human and mouse promoters with statistical methods for identification of sites over-represented in a set of co-expressed genes. The algorithm successfully identified mediating TFs in control sets of tissue-specific genes and in sets of co-expressed genes from three transcript profiling studies. Simulation studies indicate that oPOSSUM produces few false positives using empirically defined thresholds and can tolerate up to 50% noise in a set of co-expressed genes. PMID:15933209
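As an illustration of what testing for over-represented binding sites can look like, the sketch below computes a one-sided hypergeometric (Fisher-type) tail probability. The abstract above does not state which statistics oPOSSUM uses, so this is a generic example rather than the tool's actual procedure; the function name and all counts are hypothetical.

```python
from math import comb


def overrepresentation_pvalue(hits_in_set, set_size, hits_in_background, background_size):
    """One-sided hypergeometric tail: P(X >= hits_in_set).

    X is the number of genes carrying the site when `set_size` genes are drawn
    without replacement from a background of `background_size` genes, of which
    `hits_in_background` carry the site.
    """
    max_hits = min(set_size, hits_in_background)
    total = comb(background_size, set_size)
    tail = sum(
        comb(hits_in_background, k)
        * comb(background_size - hits_in_background, set_size - k)
        for k in range(hits_in_set, max_hits + 1)
    )
    return tail / total


# Made-up counts: 100 of 1000 background genes carry a given TFBS,
# and 8 of 20 co-expressed genes carry it.
p_value = overrepresentation_pvalue(8, 20, 100, 1000)
print(p_value)
```

With a 10% background rate, about two of 20 genes would be expected to carry the site by chance, so eight hits gives a small tail probability. In practice such tests are run for many binding-site profiles, and a multiple-testing correction would be needed.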
TFIIS-Dependent Non-coding Transcription Regulates Developmental Genome Rearrangements
Maliszewska-Olejniczak, Kamila; Gruchota, Julita; Gromadka, Robert; Denby Wilkes, Cyril; Arnaiz, Olivier; Mathy, Nathalie; Duharcourt, Sandra; Bétermier, Mireille; Nowak, Jacek K.
2015-01-01
Because of their nuclear dimorphism, ciliates provide a unique opportunity to study the role of non-coding RNAs (ncRNAs) in the communication between germline and somatic lineages. In these unicellular eukaryotes, a new somatic nucleus develops at each sexual cycle from a copy of the zygotic (germline) nucleus, while the old somatic nucleus degenerates. In the ciliate Paramecium tetraurelia, the genome is massively rearranged during this process through the reproducible elimination of repeated sequences and the precise excision of over 45,000 short, single-copy Internal Eliminated Sequences (IESs). Different types of ncRNAs resulting from genome-wide transcription were shown to be involved in the epigenetic regulation of genome rearrangements. To understand how ncRNAs are produced from the entire genome, we have focused on a homolog of the TFIIS elongation factor, which regulates RNA polymerase II transcriptional pausing. Six TFIIS-paralogs, representing four distinct families, can be found in the P. tetraurelia genome. Using RNA interference, we showed that TFIIS4, which encodes a development-specific TFIIS protein, is essential for the formation of a functional somatic genome. Molecular analyses and high-throughput DNA sequencing upon TFIIS4 RNAi demonstrated that TFIIS4 is involved in all kinds of genome rearrangements, including excision of ~48% of IESs. Localization of a GFP-TFIIS4 fusion revealed that TFIIS4 appears specifically in the new somatic nucleus at an early developmental stage, before IES excision. RT-PCR experiments showed that TFIIS4 is necessary for the synthesis of IES-containing non-coding transcripts. We propose that these IES+ transcripts originate from the developing somatic nucleus and serve as pairing substrates for germline-specific short RNAs that target elimination of their homologous sequences.
Our study, therefore, connects the onset of zygotic non-coding transcription to the control of genome plasticity in Paramecium, and establishes for the first time a specific role of TFIIS in non-coding transcription in eukaryotes. PMID:26177014
Pattison, Jillian M.; Posternak, Valeriya; Cole, Michael D.
2016-01-01
It is well established that environmental toxins, such as exposure to arsenic, are risk factors in the development of urinary bladder cancer, yet recent genome-wide association studies (GWAS) provide compelling evidence that there is a strong genetic component associated with disease predisposition. A single nucleotide polymorphism (SNP), rs8102137, was identified on chromosome 19q12, residing 6 kb upstream of the important cell cycle regulator and proto-oncogene, Cyclin E1 (CCNE1). However, the functional role of this variant in bladder cancer predisposition has been unclear since it lies within a non-coding region of the genome. Here, it is demonstrated that bladder cancer cells heterozygous for this SNP exhibit biased allelic expression of CCNE1 with 1.5-fold more transcription occurring from the risk allele. Furthermore, using chromatin immunoprecipitation assays, a novel enhancer element was identified within the first intron of CCNE1 that binds Kruppel-like Factor 5 (KLF5), a known transcriptional activator in bladder cancer. Moreover, the data reveal that the presence of rs200996365, a SNP in high linkage disequilibrium with rs8102137 residing in the center of a KLF5 motif, alters KLF5 binding to this genomic region. Through luciferase assays and CRISPR-Cas9 genome editing, a novel polymorphic intronic regulatory element controlling CCNE1 transcription is characterized. These studies uncover how a cancer-associated polymorphism mechanistically contributes to an increased predisposition for bladder cancer development. Implications A polymorphic KLF5 binding site near the CCNE1 gene explains genetic risk identified through genome wide association studies. PMID:27514407
ChIP-chip versus ChIP-seq: Lessons for experimental design and data analysis
2011-01-01
Background Chromatin immunoprecipitation (ChIP) followed by microarray hybridization (ChIP-chip) or high-throughput sequencing (ChIP-seq) allows genome-wide discovery of protein-DNA interactions such as transcription factor bindings and histone modifications. Previous reports only compared a small number of profiles, and little has been done to compare histone modification profiles generated by the two technologies or to assess the impact of input DNA libraries in ChIP-seq analysis. Here, we performed a systematic analysis of a modENCODE dataset consisting of 31 pairs of ChIP-chip/ChIP-seq profiles of the coactivator CBP, RNA polymerase II (RNA PolII), and six histone modifications across four developmental stages of Drosophila melanogaster. Results Both technologies produce highly reproducible profiles within each platform; ChIP-seq generally produces profiles with a better signal-to-noise ratio and allows detection of more peaks and narrower peaks. The set of peaks identified by the two technologies can be significantly different, but the extent to which they differ varies depending on the factor and the analysis algorithm. Importantly, we found that there is a significant variation among multiple sequencing profiles of input DNA libraries and that this variation most likely arises from both differences in experimental condition and sequencing depth. We further show that using an inappropriate input DNA profile can impact the average signal profiles around genomic features and peak calling results, highlighting the importance of having high quality input DNA data for normalization in ChIP-seq analysis. Conclusions Our findings highlight the biases present in each of the platforms, show the variability that can arise from both technology and analysis methods, and emphasize the importance of obtaining high quality and deeply sequenced input DNA libraries for ChIP-seq analysis. PMID:21356108
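To show why the input DNA profile matters for normalization, here is a minimal sketch of depth-scaled fold enrichment per genomic bin. It is not the procedure used in the study above, and real peak callers use more elaborate background models; the function name, the pseudocount and the toy counts are assumptions made for illustration.

```python
def fold_enrichment(chip_counts, input_counts, pseudocount=1.0):
    """Per-bin ChIP/input ratio after scaling the input to the ChIP depth.

    Hypothetical helper: depth is taken as the sum over the supplied bins,
    and a pseudocount avoids division by zero in empty bins.
    """
    chip_depth = sum(chip_counts)
    input_depth = sum(input_counts)
    if chip_depth == 0 or input_depth == 0:
        raise ValueError("both libraries need at least one read")
    scale = chip_depth / input_depth
    return [
        (c + pseudocount) / (i * scale + pseudocount)
        for c, i in zip(chip_counts, input_counts)
    ]


# Toy read counts for four bins. The last bin is high in ChIP but also high
# in input, as can happen in regions where reads pile up regardless of binding.
chip = [5, 50, 5, 40]
input_dna = [10, 10, 10, 70]
enrichment = fold_enrichment(chip, input_dna)
print([round(x, 2) for x in enrichment])
```

In this toy case the second bin stands out as enriched, while the fourth bin, despite its high raw ChIP count, falls below 1 once the input signal is taken into account. An input profile that misrepresents the background would shift these ratios directly.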
Campos, Bruno; Fletcher, Danielle; Piña, Benjamín; Tauler, Romà; Barata, Carlos
2018-05-18
Unravelling the link between genes and environment across the life cycle is a challenging goal that requires model organisms with well-characterized life-cycles, ecological interactions in nature, tractability in the laboratory, and available genomic tools. Very few well-studied invertebrate model species meet these requirements, the waterflea Daphnia magna being one of them. Here we report a full genome transcription profiling of D. magna during its life-cycle. The study was performed using a new microarray platform designed from the complete set of gene models representing the whole transcribed genome of D. magna. Up to 93% of the existing 41,317 D. magna gene models showed differential transcription patterns across the developmental stages of D. magna, 59% of which were functionally annotated. Embryos showed the highest number of unique transcribed genes, mainly related to DNA, RNA, and ribosome biogenesis, likely related to cellular proliferation and morphogenesis of the several body organs. Adult females showed an enrichment of transcripts for genes involved in reproductive processes. These female-specific transcripts were essentially absent in males, whose transcriptome was enriched in specific male sexual differentiation genes, like doublesex. Our results define major characteristics of transcriptional programs involved in the life-cycle, differentiate males and females, and show that large scale gene-transcription data collected in whole animals can be used to identify genes involved in specific biological and biochemical processes.
De Smet, Lina; De Koker, Dieter; Hawley, Alyse K; Foster, Leonard J; De Vos, Paul; de Graaf, Dirk C
2014-01-01
Paenibacillus larvae, the causal agent of American Foulbrood disease (AFB), affects honey bee health worldwide. The present study investigates the effect of bodily fluids from honey bee larvae on growth velocity and transcription for this Gram-positive, endospore-forming bacterium. It was observed that larval fluids accelerate the growth and lead to higher bacterial densities during stationary phase. The genome-wide transcriptional response of in vitro cultures of P. larvae to larval fluids was studied by microarray technology. Early responses of P. larvae to larval fluids are characterized by a general down-regulation of oligopeptide and sugar transporter genes, as well as by amino acid and carbohydrate metabolic genes, among others. Late responses are dominated by general down-regulation of sporulation genes and up-regulation of phage-related genes. A theoretical mechanism of carbon catabolite repression is discussed.
Genome-wide Association Study Implicates PARD3B-based AIDS Restriction
Nelson, George W.; Lautenberger, James A.; Chinn, Leslie; McIntosh, Carl; Johnson, Randall C.; Sezgin, Efe; Kessing, Bailey; Malasky, Michael; Hendrickson, Sher L.; Pontius, Joan; Tang, Minzhong; An, Ping; Winkler, Cheryl A.; Limou, Sophie; Le Clerc, Sigrid; Delaneau, Olivier; Zagury, Jean-François; Schuitemaker, Hanneke; van Manen, Daniëlle; Bream, Jay H.; Gomperts, Edward D.; Buchbinder, Susan; Goedert, James J.; Kirk, Gregory D.; O'Brien, Stephen J.
2011-01-01
Background. Host genetic variation influences human immunodeficiency virus (HIV) infection and progression to AIDS. Here we used clinically well-characterized subjects from 5 pretreatment HIV/AIDS cohorts for a genome-wide association study to identify gene associations with rate of AIDS progression. Methods. European American HIV seroconverters (n = 755) were interrogated for single-nucleotide polymorphisms (SNPs) (n = 700,022) associated with progression to AIDS 1987 (Cox proportional hazards regression analysis, co-dominant model). Results. Association with slower progression was observed for SNPs in the gene PARD3B. One of these, rs11884476, reached genome-wide significance (relative hazard = 0.3; P = 3.370 × 10−9) after statistical correction for 700,022 SNPs and contributes 4.52% of the overall variance in AIDS progression in this study. Nine of the top-ranked SNPs define a PARD3B haplotype that also displays significant association with progression to AIDS (hazard ratio, 0.3; P = 3.220 × 10−8). One of these SNPs, rs10185378, is a predicted exonic splicing enhancer; significant alteration in the expression profile of PARD3B splicing transcripts was observed in B cell lines with alternate rs10185378 genotypes. This SNP was typed in European cohorts of rapid progressors and was found to be protective for AIDS 1993 definition (odds ratio, 0.43, P = .025). Conclusions. These observations suggest a potential unsuspected pathway of host genetic influence on the dynamics of AIDS progression. PMID:21502085
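The genome-wide significance statement in this abstract can be sanity-checked with a Bonferroni-style threshold. The sketch below is illustrative only: the family-wise error rate of 0.05 is an assumption, as is the reading of the reported P values as uncorrected; the abstract does not describe the correction procedure the study actually used. Only the SNP count and the two P values come from the abstract.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def is_genome_wide_significant(p_value: float, n_tests: int, alpha: float = 0.05) -> bool:
    """True if p_value falls below the Bonferroni-corrected threshold."""
    return p_value < bonferroni_threshold(alpha, n_tests)


N_SNPS = 700_022         # number of SNPs tested, from the abstract
ALPHA = 0.05             # assumed family-wise error rate (not stated in the abstract)
P_RS11884476 = 3.370e-9  # reported P value for rs11884476
P_HAPLOTYPE = 3.220e-8   # reported P value for the PARD3B haplotype

if __name__ == "__main__":
    print(f"threshold = {bonferroni_threshold(ALPHA, N_SNPS):.3e}")
    print("rs11884476 significant:", is_genome_wide_significant(P_RS11884476, N_SNPS, ALPHA))
    print("haplotype significant:", is_genome_wide_significant(P_HAPLOTYPE, N_SNPS, ALPHA))
```

Under these assumptions the threshold is roughly 7.1 × 10−8, so both reported P values would fall below it.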
Genome-wide screen of ovary-specific DNA methylation in polycystic ovary syndrome.
Yu, Ying-Ying; Sun, Cui-Xiang; Liu, Yin-Kun; Li, Yan; Wang, Li; Zhang, Wei
2015-07-01
To compare genome-wide DNA methylation profiles in ovary tissue from women with polycystic ovary syndrome (PCOS) and healthy controls. Case-control study matched for age and body mass index. University-affiliated hospital. Ten women with PCOS who underwent ovarian drilling to induce ovulation and 10 healthy women who were undergoing laparoscopic sterilization, hysterectomy for benign conditions, diagnostic laparoscopy for pelvic pain, or oophorectomy for nonovarian indications. None. Genome-wide DNA methylation patterns determined by immunoprecipitation and microarray (MeDIP-chip) analysis. The methylation levels were statistically significantly higher in CpG island shores (CGI shores), which lie outside of core promoter regions, and lower within gene bodies in women with PCOS relative to the controls. In addition, high CpG content promoters were the most frequently hypermethylated promoters in PCOS ovaries but were more often hypomethylated in controls. Second, 872 CGIs, specifically methylated in PCOS, represented 342 genes that could be associated with various molecular functions, including protein binding, hormone activity, and transcription regulator activity. Finally, methylation differences were validated in seven genes by methylation-specific polymerase chain reaction. These genes correlated to several functional families related to the pathogenesis of PCOS and may be potential biomarkers for this disease. Our results demonstrated that epigenetic modification differs between PCOS and normal ovaries, which may help to further understand the pathophysiology of this disease. Copyright © 2015 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Skelly, Daniel A.; Johansson, Marnie; Madeoy, Jennifer; Wakefield, Jon; Akey, Joshua M.
2011-01-01
Variation in gene expression is thought to make a significant contribution to phenotypic diversity among individuals within populations. Although high-throughput cDNA sequencing offers a unique opportunity to delineate the genome-wide architecture of regulatory variation, new statistical methods need to be developed to capitalize on the wealth of information contained in RNA-seq data sets. To this end, we developed a powerful and flexible hierarchical Bayesian model that combines information across loci to allow both global and locus-specific inferences about allele-specific expression (ASE). We applied our methodology to a large RNA-seq data set obtained in a diploid hybrid of two diverse Saccharomyces cerevisiae strains, as well as to RNA-seq data from an individual human genome. Our statistical framework accurately quantifies levels of ASE with specified false-discovery rates, achieving high reproducibility between independent sequencing platforms. We pinpoint loci that show unusual and biologically interesting patterns of ASE, including allele-specific alternative splicing and transcription termination sites. Our methodology provides a rigorous, quantitative, and high-resolution tool for profiling ASE across whole genomes. PMID:21873452
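The idea of combining information across loci can be illustrated with a much simpler stand-in than the model in this abstract. The sketch below assumes a Beta prior on the reference-allele fraction that is shared by all loci, and returns the posterior mean for each locus given its read counts. The function names, the prior parameters, and the read counts are hypothetical; the prior is fixed here rather than learned from the data, so this is not the authors' hierarchical model.

```python
from typing import List, Tuple


def posterior_allele_fraction(ref_reads: int, alt_reads: int,
                              prior_a: float, prior_b: float) -> float:
    """Posterior mean of the reference-allele fraction under a Beta(prior_a, prior_b)
    prior and a binomial likelihood for the observed read counts."""
    if ref_reads < 0 or alt_reads < 0:
        raise ValueError("read counts must be non-negative")
    return (prior_a + ref_reads) / (prior_a + prior_b + ref_reads + alt_reads)


def shrunken_fractions(counts: List[Tuple[int, int]],
                       prior_a: float = 10.0, prior_b: float = 10.0) -> List[float]:
    """Apply the same (assumed) shared prior to every locus."""
    return [posterior_allele_fraction(r, a, prior_a, prior_b) for r, a in counts]


if __name__ == "__main__":
    # Hypothetical loci with the same raw ratio (0.75) but different coverage.
    loci = [(3, 1), (300, 100)]
    for (ref, alt), est in zip(loci, shrunken_fractions(loci)):
        print(f"ref={ref} alt={alt} raw={ref / (ref + alt):.2f} shrunken={est:.3f}")
```

With the symmetric prior assumed here, the low-coverage locus is pulled strongly toward 0.5 while the high-coverage locus stays close to its raw ratio, which is the qualitative behaviour that sharing information across loci is meant to provide.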
EBR1 genomic expansion and its role in virulence of Fusarium species
USDA-ARS's Scientific Manuscript database
Genome sequencing of Fusarium oxysporum revealed that pathogenic forms of this fungus harbor supernumerary chromosomes with a wide variety of genes, many of which likely encode traits required for pathogenicity or niche specialization. Specific transcription factor (TF) gene families are expanded on...
2010-01-01
Background Pollen development from the microspore involves a series of coordinated cellular events, and the resulting mature pollen has a specialized function to quickly germinate, produce a polar-growth pollen tube derived from the vegetative cell, and deliver two sperm cells into the embryo sac for double fertilization. The gene expression profiles of developing and germinated pollen have been characterised by use of the eudicot model plant Arabidopsis. Rice, one of the most important cereal crops, has been used as an excellent monocot model. A comprehensive analysis of transcriptome profiles of developing and germinated pollen in rice is important to understand the conserved and diverse mechanism underlying pollen development and germination in eudicots and monocots. Results We used Affymetrix GeneChip® Rice Genome Array to comprehensively analyze the dynamic changes in the transcriptomes of rice pollen at five sequential developmental stages from microspores to germinated pollen. Among the 51,279 transcripts on the array, we found 25,062 pollen-preferential transcripts, among which 2,203 were development stage-enriched. The diversity of transcripts decreased greatly from microspores to mature and germinated pollen, whereas the number of stage-enriched transcripts displayed a "U-type" change, with the lowest at the bicellular pollen stage; and a transition of overrepresented stage-enriched transcript groups associated with different functional categories, which indicates a shift in gene expression program at the bicellular pollen stage. About 54% of the now-annotated rice F-box protein genes were expressed preferentially in pollen. The transcriptome profile of germinated pollen was significantly and positively correlated with that of mature pollen.
Analysis of expression profiles and coexpressed features of the pollen-preferential transcripts related to cell cycle, transcription, the ubiquitin/26S proteasome system, phytohormone signalling, the kinase system and defense/stress response revealed five expression patterns, which are compatible with changes in major cellular events during pollen development and germination. A comparison of pollen transcriptomes between rice and Arabidopsis revealed that 56.6% of the rice pollen preferential genes had homologs in Arabidopsis genome, but 63.4% of these homologs were expressed, with a small proportion being expressed preferentially, in Arabidopsis pollen. Rice and Arabidopsis pollen had non-conservative transcription factors each. Conclusions Our results demonstrated that rice pollen expressed a set of reduced but specific transcripts in comparison with vegetative tissues, and the number of stage-enriched transcripts displayed a "U-type" change during pollen development, with the lowest at the bicellular pollen stage. These features are conserved in rice and Arabidopsis. The shift in gene expression program at the bicellular pollen stage may be important to the transition from earlier cell division to later pollen maturity. Pollen at maturity pre-synthesized transcripts needed for germination and early pollen tube growth. The transcription regulation associated with pollen development would have divergence between the two species. Our results also provide novel insights into the molecular program and key components of the regulatory network regulating pollen development and germination. PMID:20507633
Zhao, Junliang; Zhang, Shaohong; Yang, Tifeng; Zeng, Zichong; Huang, Zhanghui; Liu, Qing; Wang, Xiaofei; Leach, Jan; Leung, Hei; Liu, Bin
2015-07-01
Gene expression profiling under severe cold stress (4°C) has been conducted in plants including rice. However, rice seedlings are frequently exposed to milder cold stresses under natural environments. To understand the responses of rice to milder cold stress, a moderately low temperature (8°C) was used for cold treatment prior to genome-wide profiling of gene expression in a cold-tolerant japonica variety, Lijiangxintuanheigu (LTH). A total of 5557 differentially expressed genes (DEGs) were found at four time points during moderate cold stress. Both the DEGs and differentially expressed transcription factor genes were clustered into two groups based on their expression, suggesting a two-phase response to cold stress and a determinative role of transcription factors in the regulation of stress response. The induction of OsDREB2A under cold stress is reported for the first time in this study. Among the anti-oxidant enzyme genes, glutathione peroxidase (GPX) and glutathione S-transferase (GST) were upregulated, suggesting that the glutathione system may serve as the main reactive oxygen species (ROS) scavenger in LTH. Changes in expression of genes in signal transduction pathways for auxin, abscisic acid (ABA) and salicylic acid (SA) imply their involvement in cold stress responses. The induction of ABA response genes and detection of enriched cis-elements in DEGs suggest that ABA signaling pathway plays a dominant role in the cold stress response. Our results suggest that rice responses to cold stress vary with the specific temperature imposed and the rice genotype. © 2014 Scandinavian Plant Physiology Society.
Beird, Hannah C.; Wu, Chia-Chin; Ingram, Davis R.; Wang, Wei-Lien; Alimohamed, Asrar; Gumbs, Curtis; Little, Latasha; Song, Xingzhi; Feig, Barry W.; Roland, Christina L.; Zhang, Jianhua; Benjamin, Robert S.; Hwu, Patrick; Lazar, Alexander J.; Futreal, P. Andrew; Somaiah, Neeta
2018-01-01
Well-differentiated (WD) liposarcoma is a low-grade mesenchymal tumor with features of mature adipocytes and high propensity for local recurrence. Often, WD patients present with or later progress to a higher-grade nonlipogenic form known as dedifferentiated (DD) liposarcoma. These DD tumors behave more aggressively and can metastasize. Both WD and DD liposarcomas harbor neochromosomes formed from amplifications and rearrangements of Chr 12q that encode oncogenes (MDM2, CDK4, and YEATS2) and adipocytic differentiation factors (HMGA2 and CPM). However, genomic changes associated with progression from WD to DD have not been well-defined. Therefore, we selected patients with matched WD and DD tumors for extensive genomic profiling in order to understand their clonal relationships and to delineate any defining alterations for each entity. Exome and transcriptomic sequencing was performed for 17 patients with both WD and DD diagnoses. Somatic point and copy-number alterations were integrated with transcriptional analyses to determine subtype-associated genomic features and pathways. On average, only 8.3% of somatic mutations in WD liposarcoma were shared with their cognate DD component. DD tumors had higher numbers of somatic copy-number losses, amplifications involving Chr 12q, and fusion transcripts than WD tumors. HMGA2 and CPM rearrangements occur more frequently in DD components. The shared somatic mutations indicate a clonal origin for matched WD and DD tumors and show early divergence with ongoing genomic instability due to continual generation and selection of neochromosomes. Stochastic generation and subsequent expression of fusion transcripts from the neochromosome that involve adipogenesis genes such as HMGA2 and CPM may influence the differentiation state of the subsequent tumor. PMID:29610390
Transcriptional and chromatin regulation during fasting – The genomic era
Goldstein, Ido; Hager, Gordon L.
2015-01-01
An elaborate metabolic response to fasting is orchestrated by the liver and is heavily reliant upon transcriptional regulation. In response to hormones (glucagon, glucocorticoids) many transcription factors (TFs) are activated and regulate various genes involved in metabolic pathways aimed at restoring homeostasis: gluconeogenesis, fatty acid oxidation, ketogenesis and amino acid shuttling. We summarize the recent discoveries regarding fasting-related TFs with an emphasis on genome-wide binding patterns. Collectively, the summarized findings reveal a large degree of co-operation between TFs during fasting which occurs at motif-rich DNA sites bound by a combination of TFs. These new findings implicate transcriptional and chromatin regulation as major determinants of the response to fasting and unravel the complex, multi-TF nature of this response. PMID:26520657
Genome-wide Gene Expression Profiling of Acute Metal Exposures in Male Zebrafish
2014-10-23
Data in Brief Genome-wide gene expression profiling of acute metal exposures in male zebrafish Christine E. Baer a,⁎, Danielle L. Ippolito b, Naissan... Zebrafish Whole organism Nickel Chromium Cobalt Toxicogenomics To capture global responses to metal poisoning and mechanistic insights into metal...toxicity, gene expression changes were evaluated in whole adult male zebrafish following acute 24 h high dose exposure to three metals with known human
Filatov, Victor; Dowdle, John; Smirnoff, Nicholas; Ford-Lloyd, Brian; Newbury, H John; Macnair, Mark R
2006-09-01
One of the challenges of comparative genomics is to identify specific genetic changes associated with the evolution of a novel adaptation or trait. We need to be able to disassociate the genes involved with a particular character from all the other genetic changes that take place as lineages diverge. Here we show that by comparing the transcriptional profile of segregating families with that of parent species differing in a novel trait, it is possible to narrow down substantially the list of potential target genes. In addition, by assuming synteny with a related model organism for which the complete genome sequence is available, it is possible to use the cosegregation of markers differing in transcription level to identify regions of the genome which probably contain quantitative trait loci (QTLs) for the character. This novel combination of genomics and classical genetics provides a very powerful tool to identify candidate genes. We use this methodology to investigate zinc hyperaccumulation in Arabidopsis halleri, the sister species to the model plant, Arabidopsis thaliana. We compare the transcriptional profile of A. halleri with that of its sister nonaccumulator species, Arabidopsis petraea, and between accumulator and nonaccumulator F(3)s derived from the cross between the two species. We identify eight genes which consistently show greater expression in accumulator phenotypes in both roots and shoots, including two metal transporter genes (NRAMP3 and ZIP6), and cytoplasmic aconitase, a gene involved in iron homeostasis in mammals. We also show that there appear to be two QTLs for zinc accumulation, on chromosomes 3 and 7.
TEs or not TEs? That is the evolutionary question.
Vaknin, Keren; Goren, Amir; Ast, Gil
2009-10-23
Transposable elements (TEs) have contributed a wide range of functional sequences to their host genomes. A recent paper in BMC Molecular Biology discusses the creation of new transcripts by transposable element insertion upstream of retrocopies and the involvement of such insertions in tissue-specific post-transcriptional regulation.
USDA-ARS's Scientific Manuscript database
StuA, first discovered in Aspergillus nidulans and a member of the APSES class of transcription factors, regulates several essential developmental stages in fungi such as virulence, sporulation and toxin production in phytopathogenic fungi. Fusarium verticillioides (Fv), a maize phytopathogen, produ...
Identification of hypertension-related genes through an integrated genomic-transcriptomic approach.
Yagil, Chana; Hubner, Norbert; Monti, Jan; Schulz, Herbert; Sapojnikov, Marina; Luft, Friedrich C; Ganten, Detlev; Yagil, Yoram
2005-04-01
In search for the genetic basis of hypertension, we applied an integrated genomic-transcriptomic approach to identify genes involved in the pathogenesis of hypertension in the Sabra rat model of salt-susceptibility. In the genomic arm of the project, we previously detected in male rats two salt-susceptibility QTLs on chromosome 1, SS1a (D1Mgh2-D1Mit11; span 43.1 cM) and SS1b (D1Mit11-D1Mit4; span 18 cM). In the transcriptomic arm, we studied differential gene expression in kidneys of SBH/y and SBN/y rats that had been fed regular diet or salt-loaded. We used the Affymetrix Rat Genome RAE230 GeneChip and probed >30,000 transcripts. The research algorithm called for an initial genome-wide screen for differentially expressed transcripts between the study groups. This step was followed by cluster analysis based on 2x2 ANOVA to identify transcripts that were of relevance specifically to salt-sensitivity and hypertension and to salt-resistance. The two arms of the project were integrated by identifying those differentially expressed transcripts that showed an allele-specific hypertensive effect on salt-loading and that mapped within the defined boundaries of the salt-susceptibility QTLs on chromosome 1. The differentially expressed transcripts were confirmed by RT-PCR. Of the 2933 genes annotated to rat chromosome 1, 1102 genes were identified within the boundaries of the two blood pressure QTLs. The microarray identified 2470 transcripts that were differentially expressed between the study groups. Cluster analysis identified genome-wide 192 genes that were relevant to salt-susceptibility and/or hypertension, 19 of which mapped to chromosome 1. Eight of these genes mapped within the boundaries of QTLs SS1a and SS1b. RT-PCR confirmed 7 genes, leaving TcTex1, Myadm, Lisch7, Axl-like, Fah, PRC1-like, and Serpinh1. None of these genes has been implicated in hypertension before. 
These genes henceforth become targets for our continuing search for the genetic basis of hypertension.
Harris, R. Alan; Wang, Ting; Coarfa, Cristian; Nagarajan, Raman P.; Hong, Chibo; Downey, Sara L.; Johnson, Brett E.; Fouse, Shaun D.; Delaney, Allen; Zhao, Yongjun; Olshen, Adam; Ballinger, Tracy; Zhou, Xin; Forsberg, Kevin J.; Gu, Junchen; Echipare, Lorigail; O’Geen, Henriette; Lister, Ryan; Pelizzola, Mattia; Xi, Yuanxin; Epstein, Charles B.; Bernstein, Bradley E.; Hawkins, R. David; Ren, Bing; Chung, Wen-Yu; Gu, Hongcang; Bock, Christoph; Gnirke, Andreas; Zhang, Michael Q.; Haussler, David; Ecker, Joseph; Li, Wei; Farnham, Peggy J.; Waterland, Robert A.; Meissner, Alexander; Marra, Marco A.; Hirst, Martin; Milosavljevic, Aleksandar; Costello, Joseph F.
2010-01-01
Sequencing-based DNA methylation profiling methods are comprehensive and, as accuracy and affordability improve, will increasingly supplant microarrays for genome-scale analyses. Here, four sequencing-based methodologies were applied to biological replicates of human embryonic stem cells to compare their CpG coverage genome-wide and in transposons, resolution, cost, concordance and its relationship with CpG density and genomic context. The two bisulfite methods reached concordance of 82% for CpG methylation levels and 99% for non-CpG cytosine methylation levels. Using binary methylation calls, two enrichment methods were 99% concordant, while regions assessed by all four methods were 97% concordant. To achieve comprehensive methylome coverage while reducing cost, an approach integrating two complementary methods was examined. The integrative methylome profile along with histone methylation, RNA, and SNP profiles derived from the sequence reads allowed genome-wide assessment of allele-specific epigenetic states, identifying most known imprinted regions and new loci with monoallelic epigenetic marks and monoallelic expression. PMID:20852635
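The concordance figures for binary methylation calls in this abstract can be read as the fraction of commonly assessed regions on which two methods agree. The sketch below assumes exactly that definition; the abstract does not state how concordance was computed, and the function name, region identifiers, and calls are hypothetical.

```python
from typing import Dict


def binary_concordance(calls_a: Dict[str, bool], calls_b: Dict[str, bool]) -> float:
    """Fraction of regions assessed by both methods that receive the same
    binary call (True = methylated, False = unmethylated)."""
    shared = calls_a.keys() & calls_b.keys()
    if not shared:
        raise ValueError("the two methods share no assessed regions")
    agree = sum(1 for region in shared if calls_a[region] == calls_b[region])
    return agree / len(shared)


if __name__ == "__main__":
    # Hypothetical calls from two methods; only r1, r2 and r3 are assessed by both.
    method_a = {"r1": True, "r2": False, "r3": True, "r4": True}
    method_b = {"r1": True, "r2": False, "r3": False, "r5": True}
    print(f"concordance = {binary_concordance(method_a, method_b):.3f}")
```

Restricting the comparison to shared regions matters because the methods differ in coverage; regions seen by only one method carry no information about agreement.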
USDA-ARS's Scientific Manuscript database
Common bean (Phaseolus vulgaris) and soybean (Glycine max) both belong to the Phaseoleae tribe and share significant coding sequence homology. This suggests that the GeneChip(R) Soybean Genome Array (soybean GeneChip) may be used for gene expression studies using common bean. To evaluate the utility...
USDA-ARS's Scientific Manuscript database
This study reports generation of large-scale genomic resources for pigeonpea, a so-called ‘orphan crop species’ of the semi-arid tropic regions. Roche FLX/454 sequencing was carried out on a normalized cDNA pool prepared from 31 tissues produced 494,353 short transcript reads (STRs). Cluster analysi...
Wang, Ping; Lin, Mingyan; Pedrosa, Erika; Hrabovsky, Anastasia; Zhang, Zheng; Guo, Wenjun; Lachman, Herbert M; Zheng, Deyou
2015-01-01
Disruptive mutation in the CHD8 gene is one of the top genetic risk factors in autism spectrum disorders (ASDs). Previous analyses of genome-wide CHD8 occupancy and reduced expression of CHD8 by shRNA knockdown in committed neural cells showed that CHD8 regulates multiple cell processes critical for neural functions, and its targets are enriched with ASD-associated genes. To further understand the molecular links between CHD8 functions and ASD, we have applied the CRISPR/Cas9 technology to knockout one copy of CHD8 in induced pluripotent stem cells (iPSCs) to better mimic the loss-of-function status that would exist in the developing human embryo prior to neuronal differentiation. We then carried out transcriptomic and bioinformatic analyses of neural progenitors and neurons derived from the CHD8 mutant iPSCs. Transcriptome profiling revealed that CHD8 hemizygosity (CHD8 (+/-)) affected the expression of several thousands of genes in neural progenitors and early differentiating neurons. The differentially expressed genes were enriched for functions of neural development, β-catenin/Wnt signaling, extracellular matrix, and skeletal system development. They also exhibited significant overlap with genes previously associated with autism and schizophrenia, as well as the downstream transcriptional targets of multiple genes implicated in autism. Providing important insight into how CHD8 mutations might give rise to macrocephaly, we found that seven of the twelve genes associated with human brain volume or head size by genome-wide association studies (e.g., HGMA2) were dysregulated in CHD8 (+/-) neural progenitors or neurons. We have established a renewable source of CHD8 (+/-) iPSC lines that would be valuable for investigating the molecular and cellular functions of CHD8. Transcriptomic profiling showed that CHD8 regulates multiple genes implicated in ASD pathogenesis and genes associated with brain volume.
Zhu, Bin; Shao, Yujiao; Pan, Qi; Ge, Xianhong; Li, Zaiyun
2015-01-01
Aneuploidy with loss of entire chromosomes from the normal complement disrupts the balanced genome and is tolerated only by polyploid plants. In this study, monosomic and nullisomic plants losing one or two copies of the C2 chromosome from allotetraploid Brassica napus L. (2n = 38, AACC) were produced and compared for their phenotype and transcriptome. The monosomics gave a plant phenotype very similar to the original donor, but the nullisomics had a much smaller stature and a shorter growth period. Comparative analyses of the global transcript profiles against the euploid donor revealed genome-wide alterations in gene expression in the two aneuploids, and the majority of their differentially expressed genes (DEGs) resulted from the trans-acting effects of having zero or one copy of the C2 chromosome. The higher number of up-regulated genes than down-regulated genes on other chromosomes suggested that the genome responded to the C2 loss via enhancing the expression of certain genes. Particularly, more DEGs were detected in the monosomics than in the nullisomics, contrasting with their phenotypes. The gene expression of the other chromosomes was differently affected, and several dysregulated domains in which up- or downregulated genes obviously clustered were identifiable. But the mean gene expression (MGE) for homoeologous chromosome A2 was reduced with the C2 loss. Some genes and their expressions on C2 were correlated with the phenotype deviations in the aneuploids. These results provided new insights into the transcriptomic perturbation of the allopolyploid genome elicited by the loss of an individual chromosome. PMID:26442076
Nilsson, Emil K; Boström, Adrian E; Mwinyi, Jessica; Schiöth, Helgi B
2016-06-01
Despite an established link between sleep deprivation and epigenetic processes in humans, it remains unclear to what extent sleep deprivation modulates DNA methylation. We performed a within-subject randomized blinded study with 16 healthy subjects to examine the effect of one night of total sleep deprivation (TSD) on the genome-wide methylation profile in blood compared with that in normal sleep. Genome-wide differences in methylation between both conditions were assessed by applying a paired regression model that corrected for monocyte subpopulations. In addition, the correlations between the methylation of genes detected to be modulated by TSD and gene expression were examined in a separate, publicly available cohort of 10 healthy male donors (E-GEOD-49065). Sleep deprivation significantly affected the DNA methylation profile both independently of and dependent on shifts in monocyte composition. Our study detected differential methylation of 269 probes. Notably, one CpG site was located 69 bp upstream of ING5, which has been shown to be differentially expressed after sleep deprivation. Gene set enrichment analysis detected the Notch and Wnt signaling pathways to be enriched among the differentially methylated genes. These results provide evidence that total acute sleep deprivation alters the methylation profile in healthy human subjects. This is, to our knowledge, the first study that systematically investigated the impact of total acute sleep deprivation on genome-wide DNA methylation profiles in blood and related the epigenomic findings to the expression data.
Dynamic maps of UV damage formation and repair for the human genome
Hu, Jinchuan; Adebali, Ogun; Adar, Sheera; Sancar, Aziz
2017-01-01
Formation and repair of UV-induced DNA damage in human cells are affected by cellular context. To study factors influencing damage formation and repair genome-wide, we developed a highly sensitive single-nucleotide resolution damage mapping method [high-sensitivity damage sequencing (HS–Damage-seq)]. Damage maps of both cyclobutane pyrimidine dimers (CPDs) and pyrimidine-pyrimidone (6-4) photoproducts [(6-4)PPs] from UV-irradiated cellular and naked DNA revealed that the effect of transcription factor binding on bulky adducts formation varies, depending on the specific transcription factor, damage type, and strand. We also generated time-resolved UV damage maps of both CPDs and (6-4)PPs by HS–Damage-seq and compared them to the complementary repair maps of the human genome obtained by excision repair sequencing to gain insight into factors that affect UV-induced DNA damage and repair and ultimately UV carcinogenesis. The combination of the two methods revealed that, whereas UV-induced damage is virtually uniform throughout the genome, repair is affected by chromatin states, transcription, and transcription factor binding, in a manner that depends on the type of DNA damage. PMID:28607063