Inferring genome-wide interplay landscape between DNA methylation and transcriptional regulation.
Tang, Binhua; Wang, Xin
2015-01-01
DNA methylation and transcriptional regulation play important roles in cancer cell development and differentiation processes. Based on the currently available cell line profiling information from the ENCODE Consortium, we propose a Bayesian inference model to infer and construct genome-wide interaction landscape between DNA methylation and transcriptional regulation, which sheds light on the underlying complex functional mechanisms important within the human cancer and disease context. For the first time, we select all the currently available cell lines (>=20) and transcription factors (>=80) profiling information from the ENCODE Consortium portal. Through the integration of those genome-wide profiling sources, our genome-wide analysis detects multiple functional loci of interest, and indicates that DNA methylation is cell- and region-specific, due to the interplay mechanisms with transcription regulatory activities. We validate our analysis results with the corresponding RNA-sequencing technique for those detected genomic loci. Our results provide novel and meaningful insights for the interplay mechanisms of transcriptional regulation and gene expression for the human cancer and disease studies.
Huang, Lulin; Cheng, Tingcai; Xu, Pingzhen; Fang, Ting; Xia, Qingyou
2012-01-01
Transcription factors are present in all living organisms, and play vital roles in a wide range of biological processes. Studies of transcription factors will help reveal the complex regulation mechanism of organisms. So far, hundreds of domains have been identified that show transcription factor activity. Here, 281 reported transcription factor domains were used as seeds to search the transcription factors in genomes of Bombyx mori L. (Lepidoptera: Bombycidae) and four other model insects. Overall, 666 transcription factors including 36 basal factors and 630 other factors were identified in B. mori genome, which accounted for 4.56% of its genome. The silkworm transcription factors' expression profiles were investigated in relation to multiple tissues, developmental stages, sexual dimorphism, and responses to oral infection by pathogens and direct bacterial injection. These all provided rich clues for revealing the transcriptional regulation mechanism of silkworm organ differentiation, growth and development, sexual dimorphism, and response to pathogen infection. PMID:22943524
Global Identification and Characterization of Transcriptionally Active Regions in the Rice Genome
Stolc, Viktor; Deng, Wei; He, Hang; Korbel, Jan; Chen, Xuewei; Tongprasit, Waraporn; Ronald, Pamela; Chen, Runsheng; Gerstein, Mark; Wang Deng, Xing
2007-01-01
Genome tiling microarray studies have consistently documented rich transcriptional activity beyond the annotated genes. However, systematic characterization and transcriptional profiling of the putative novel transcripts on the genome scale are still lacking. We report here the identification of 25,352 and 27,744 transcriptionally active regions (TARs) not encoded by annotated exons in the rice (Oryza. sativa) subspecies japonica and indica, respectively. The non-exonic TARs account for approximately two thirds of the total TARs detected by tiling arrays and represent transcripts likely conserved between japonica and indica. Transcription of 21,018 (83%) japonica non-exonic TARs was verified through expression profiling in 10 tissue types using a re-array in which annotated genes and TARs were each represented by five independent probes. Subsequent analyses indicate that about 80% of the japonica TARs that were not assigned to annotated exons can be assigned to various putatively functional or structural elements of the rice genome, including splice variants, uncharacterized portions of incompletely annotated genes, antisense transcripts, duplicated gene fragments, and potential non-coding RNAs. These results provide a systematic characterization of non-exonic transcripts in rice and thus expand the current view of the complexity and dynamics of the rice transcriptome. PMID:17372628
Transcription facilitated genome-wide recruitment of topoisomerase I and DNA gyrase.
Ahmed, Wareed; Sala, Claudia; Hegde, Shubhada R; Jha, Rajiv Kumar; Cole, Stewart T; Nagaraja, Valakunja
2017-05-01
Movement of the transcription machinery along a template alters DNA topology resulting in the accumulation of supercoils in DNA. The positive supercoils generated ahead of transcribing RNA polymerase (RNAP) and the negative supercoils accumulating behind impose severe topological constraints impeding transcription process. Previous studies have implied the role of topoisomerases in the removal of torsional stress and the maintenance of template topology but the in vivo interaction of functionally distinct topoisomerases with heterogeneous chromosomal territories is not deciphered. Moreover, how the transcription-induced supercoils influence the genome-wide recruitment of DNA topoisomerases remains to be explored in bacteria. Using ChIP-Seq, we show the genome-wide occupancy profile of both topoisomerase I and DNA gyrase in conjunction with RNAP in Mycobacterium tuberculosis taking advantage of minimal topoisomerase representation in the organism. The study unveils the first in vivo genome-wide interaction of both the topoisomerases with the genomic regions and establishes that transcription-induced supercoils govern their recruitment at genomic sites. Distribution profiles revealed co-localization of RNAP and the two topoisomerases on the active transcriptional units (TUs). At a given locus, topoisomerase I and DNA gyrase were localized behind and ahead of RNAP, respectively, correlating with the twin-supercoiled domains generated. The recruitment of topoisomerases was higher at the genomic loci with higher transcriptional activity and/or at regions under high torsional stress compared to silent genomic loci. Importantly, the occupancy of DNA gyrase, sole type II topoisomerase in Mtb, near the Ter domain of the Mtb chromosome validates its function as a decatenase.
Transcription facilitated genome-wide recruitment of topoisomerase I and DNA gyrase
Ahmed, Wareed; Sala, Claudia; Hegde, Shubhada R.; Jha, Rajiv Kumar
2017-01-01
Movement of the transcription machinery along a template alters DNA topology resulting in the accumulation of supercoils in DNA. The positive supercoils generated ahead of transcribing RNA polymerase (RNAP) and the negative supercoils accumulating behind impose severe topological constraints impeding transcription process. Previous studies have implied the role of topoisomerases in the removal of torsional stress and the maintenance of template topology but the in vivo interaction of functionally distinct topoisomerases with heterogeneous chromosomal territories is not deciphered. Moreover, how the transcription-induced supercoils influence the genome-wide recruitment of DNA topoisomerases remains to be explored in bacteria. Using ChIP-Seq, we show the genome-wide occupancy profile of both topoisomerase I and DNA gyrase in conjunction with RNAP in Mycobacterium tuberculosis taking advantage of minimal topoisomerase representation in the organism. The study unveils the first in vivo genome-wide interaction of both the topoisomerases with the genomic regions and establishes that transcription-induced supercoils govern their recruitment at genomic sites. Distribution profiles revealed co-localization of RNAP and the two topoisomerases on the active transcriptional units (TUs). At a given locus, topoisomerase I and DNA gyrase were localized behind and ahead of RNAP, respectively, correlating with the twin-supercoiled domains generated. The recruitment of topoisomerases was higher at the genomic loci with higher transcriptional activity and/or at regions under high torsional stress compared to silent genomic loci. Importantly, the occupancy of DNA gyrase, sole type II topoisomerase in Mtb, near the Ter domain of the Mtb chromosome validates its function as a decatenase. PMID:28463980
Guo, D; Li, H L; Tang, X; Peng, S Q
2014-12-18
In plants, homeodomain proteins play a critical role in regulating various aspects of plant growth and development. KNOX proteins are members of the homeodomain protein family. The KNOX transcription factors have been reported from Arabidopsis, rice, and other higher plants. The recent publication of the draft genome sequence of cassava (Manihot esculenta Krantz) has allowed a genome-wide search for M. esculenta KNOX (MeKNOX) transcription factors and the comparison of these positively identified proteins with their homologs in model plants. In the present study, we identified 12 MeKNOX genes in the cassava genome and grouped them into two distinct subfamilies based on their domain composition and phylogenetic analysis. Furthermore, semi-quantitative reverse transcription polymerase chain reaction analysis was performed to elucidate the expression profiles of these genes in different tissues and during various stages of root development. The analysis of MeKNOX expression profiles of indicated that 12 MeKNOX genes display differential expressions either in their transcript abundance or expression patterns.
Transcription regulation by distal enhancers
Stadhouders, Ralph; van den Heuvel, Anita; Kolovos, Petros; Jorna, Ruud; Leslie, Kris; Grosveld, Frank; Soler, Eric
2012-01-01
Genome-wide chromatin profiling efforts have shown that enhancers are often located at large distances from gene promoters within the noncoding genome. Whereas enhancers can stimulate transcription initiation by communicating with promoters via chromatin looping mechanisms, we propose that enhancers may also stimulate transcription elongation by physical interactions with intronic elements. We review here recent findings derived from the study of the hematopoietic system. PMID:22771987
2014-01-01
Background Cis-regulatory modules (CRMs), or the DNA sequences required for regulating gene expression, play the central role in biological researches on transcriptional regulation in metazoan species. Nowadays, the systematic understanding of CRMs still mainly resorts to computational methods due to the time-consuming and small-scale nature of experimental methods. But the accuracy and reliability of different CRM prediction tools are still unclear. Without comparative cross-analysis of the results and combinatorial consideration with extra experimental information, there is no easy way to assess the confidence of the predicted CRMs. This limits the genome-wide understanding of CRMs. Description It is known that transcription factor binding and epigenetic profiles tend to determine functions of CRMs in gene transcriptional regulation. Thus integration of the genome-wide epigenetic profiles with systematically predicted CRMs can greatly help researchers evaluate and decipher the prediction confidence and possible transcriptional regulatory functions of these potential CRMs. However, these data are still fragmentary in the literatures. Here we performed the computational genome-wide screening for potential CRMs using different prediction tools and constructed the pioneer database, cisMEP (cis-regulatory module epigenetic profile database), to integrate these computationally identified CRMs with genomic epigenetic profile data. cisMEP collects the literature-curated TFBS location data and nine genres of epigenetic data for assessing the confidence of these potential CRMs and deciphering the possible CRM functionality. Conclusions cisMEP aims to provide a user-friendly interface for researchers to assess the confidence of different potential CRMs and to understand the functions of CRMs through experimentally-identified epigenetic profiles. The deposited potential CRMs and experimental epigenetic profiles for confidence assessment provide experimentally testable hypotheses for the molecular mechanisms of metazoan gene regulation. We believe that the information deposited in cisMEP will greatly facilitate the comparative usage of different CRM prediction tools and will help biologists to study the modular regulatory mechanisms between different TFs and their target genes. PMID:25521507
Transcription regulation by distal enhancers: who's in the loop?
Stadhouders, Ralph; van den Heuvel, Anita; Kolovos, Petros; Jorna, Ruud; Leslie, Kris; Grosveld, Frank; Soler, Eric
2012-01-01
Genome-wide chromatin profiling efforts have shown that enhancers are often located at large distances from gene promoters within the noncoding genome. Whereas enhancers can stimulate transcription initiation by communicating with promoters via chromatin looping mechanisms, we propose that enhancers may also stimulate transcription elongation by physical interactions with intronic elements. We review here recent findings derived from the study of the hematopoietic system.
Marcon, Helena Sanches; Domingues, Douglas Silva; Silva, Juliana Costa; Borges, Rafael Junqueira; Matioli, Fábio Filippi; Fontes, Marcos Roberto de Mattos; Marino, Celso Luis
2015-08-14
In Eucalyptus genus, studies on genome composition and transposable elements (TEs) are particularly scarce. Nearly half of the recently released Eucalyptus grandis genome is composed by retrotransposons and this data provides an important opportunity to understand TE dynamics in Eucalyptus genome and transcriptome. We characterized nine families of transcriptionally active LTR retrotransposons from Copia and Gypsy superfamilies in Eucalyptus grandis genome and we depicted genomic distribution and copy number in two Eucalyptus species. We also evaluated genomic polymorphism and transcriptional profile in three organs of five Eucalyptus species. We observed contrasting genomic and transcriptional behavior in the same family among different species. RLC_egMax_1 was the most prevalent family and RLC_egAngela_1 was the family with the lowest copy number. Most families of both superfamilies have their insertions occurring <3 million years, except one Copia family, RLC_egBianca_1. Protein theoretical models suggest different properties between Copia and Gypsy domains. IRAP and REMAP markers suggested genomic polymorphisms among Eucalyptus species. Using EST analysis and qRT-PCRs, we observed transcriptional activity in several tissues and in all evaluated species. In some families, osmotic stress increases transcript values. Our strategy was successful in isolating transcriptionally active retrotransposons in Eucalyptus, and each family has a particular genomic and transcriptional pattern. Overall, our results show that retrotransposon activity have differentially affected genome and transcriptome among Eucalyptus species.
Identification and expression profiles of the WRKY transcription factor family in Ricinus communis.
Li, Hui-Liang; Zhang, Liang-Bo; Guo, Dong; Li, Chang-Zhu; Peng, Shi-Qing
2012-07-25
In plants, WRKY proteins constitute a large family of transcription factors. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. A large number of WRKY transcription factors have been reported from Arabidopsis, rice, and other higher plants. The recent publication of the draft genome sequence of castor bean (Ricinus communis) has allowed a genome-wide search for R. communis WRKY (RcWRKY) transcription factors and the comparison of these positively identified proteins with their homologs in model plants. A total of 47 WRKY genes were identified in the castor bean genome. According to the structural features of the WRKY domain, the RcWRKY are classified into seven main phylogenetic groups. Furthermore, putative orthologs of RcWRKY proteins in Arabidopsis and rice could now be assigned. An analysis of expression profiles of RcWRKY genes indicates that 47 WRKY genes display differential expressions either in their transcript abundance or expression patterns under normal growth conditions. Copyright © 2012 Elsevier B.V. All rights reserved.
NASA Technical Reports Server (NTRS)
Stolc, Viktor; Samanta, Manoj Pratim; Tongprasit, Waraporn; Marshall, Wallace F.
2005-01-01
The important role that cilia and flagella play in human disease creates an urgent need to identify genes involved in ciliary assembly and function. The strong and specific induction of flagellar-coding genes during flagellar regeneration in Chlamydomonas reinhardtii suggests that transcriptional profiling of such cells would reveal new flagella-related genes. We have conducted a genome-wide analysis of RNA transcript levels during flagellar regeneration in Chlamydomonas by using maskless photolithography method-produced DNA oligonucleotide microarrays with unique probe sequences for all exons of the 19,803 predicted genes. This analysis represents previously uncharacterized whole-genome transcriptional activity profiling study in this important model organism. Analysis of strongly induced genes reveals a large set of known flagellar components and also identifies a number of important disease-related proteins as being involved with cilia and flagella, including the zebrafish polycystic kidney genes Qilin, Reptin, and Pontin, as well as the testis-expressed tubby-like protein TULP2.
Hilson, Pierre; Allemeersch, Joke; Altmann, Thomas; Aubourg, Sébastien; Avon, Alexandra; Beynon, Jim; Bhalerao, Rishikesh P.; Bitton, Frédérique; Caboche, Michel; Cannoot, Bernard; Chardakov, Vasil; Cognet-Holliger, Cécile; Colot, Vincent; Crowe, Mark; Darimont, Caroline; Durinck, Steffen; Eickhoff, Holger; de Longevialle, Andéol Falcon; Farmer, Edward E.; Grant, Murray; Kuiper, Martin T.R.; Lehrach, Hans; Léon, Céline; Leyva, Antonio; Lundeberg, Joakim; Lurin, Claire; Moreau, Yves; Nietfeld, Wilfried; Paz-Ares, Javier; Reymond, Philippe; Rouzé, Pierre; Sandberg, Goran; Segura, Maria Dolores; Serizet, Carine; Tabrett, Alexandra; Taconnat, Ludivine; Thareau, Vincent; Van Hummelen, Paul; Vercruysse, Steven; Vuylsteke, Marnik; Weingartner, Magdalena; Weisbeek, Peter J.; Wirta, Valtteri; Wittink, Floyd R.A.; Zabeau, Marc; Small, Ian
2004-01-01
Microarray transcript profiling and RNA interference are two new technologies crucial for large-scale gene function studies in multicellular eukaryotes. Both rely on sequence-specific hybridization between complementary nucleic acid strands, inciting us to create a collection of gene-specific sequence tags (GSTs) representing at least 21,500 Arabidopsis genes and which are compatible with both approaches. The GSTs were carefully selected to ensure that each of them shared no significant similarity with any other region in the Arabidopsis genome. They were synthesized by PCR amplification from genomic DNA. Spotted microarrays fabricated from the GSTs show good dynamic range, specificity, and sensitivity in transcript profiling experiments. The GSTs have also been transferred to bacterial plasmid vectors via recombinational cloning protocols. These cloned GSTs constitute the ideal starting point for a variety of functional approaches, including reverse genetics. We have subcloned GSTs on a large scale into vectors designed for gene silencing in plant cells. We show that in planta expression of GST hairpin RNA results in the expected phenotypes in silenced Arabidopsis lines. These versatile GST resources provide novel and powerful tools for functional genomics. PMID:15489341
A transcriptional profile of the decidua in preeclampsia
LØSET, Mari; MUNDAL, Siv B.; JOHNSON, Matthew P.; FENSTAD, Mona H.; FREED, Katherine A.; LIAN, Ingrid A.; EIDE, Irina P.; BJØRGE, Line; BLANGERO, John; MOSES, Eric K.; AUSTGULEN, Rigmor
2010-01-01
OBJECTIVE To obtain insight into possible mechanisms underlying preeclampsia using genome-wide transcriptional profiling in decidua basalis. STUDY DESIGN Genome-wide transcriptional profiling was performed on decidua basalis tissue from preeclamptic (n = 37) and normal pregnancies (n = 58). Differentially expressed genes were identified and merged into canonical pathways and networks. RESULTS Of the 26,504 expressed transcripts detected, 455 were differentially expressed (P <0.05, FDR P <0.1). Both novel (ARL5B, SLITRK4) and previously reported preeclampsia-associated genes (PLA2G7, HMOX1) were identified. Pathway analysis revealed that ‘tryptophan metabolism’, ‘endoplasmic reticulum stress’, ‘linoleic acid metabolism’, ‘notch signaling’, ‘fatty acid metabolism’, ‘arachidonic acid metabolism’ and ‘NRF2-mediated oxidative stress response’ were overrepresented canonical pathways. CONCLUSION In the present study single genes, canonical pathways and gene-gene networks that are likely to play an important role in the pathogenesis of preeclampsia, have been identified. Future functional studies are needed to accomplish a greater understanding of the mechanisms involved. PMID:20934677
Saponaro, Marco; Kantidakis, Theodoros; Mitter, Richard; Kelly, Gavin P.; Heron, Mark; Williams, Hannah; Söding, Johannes; Stewart, Aengus; Svejstrup, Jesper Q.
2014-01-01
Summary RECQL5 is the sole member of the RECQ family of helicases associated with RNA polymerase II (RNAPII). We now show that RECQL5 is a general elongation factor that is important for preserving genome stability during transcription. Depletion or overexpression of RECQL5 results in corresponding shifts in the genome-wide RNAPII density profile. Elongation is particularly affected, with RECQL5 depletion causing a striking increase in the average rate, concurrent with increased stalling, pausing, arrest, and/or backtracking (transcription stress). RECQL5 therefore controls the movement of RNAPII across genes. Loss of RECQL5 also results in the loss or gain of genomic regions, with the breakpoints of lost regions located in genes and common fragile sites. The chromosomal breakpoints overlap with areas of elevated transcription stress, suggesting that RECQL5 suppresses such stress and its detrimental effects, and thereby prevents genome instability in the transcribed region of genes. PMID:24836610
Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets
Macosko, Evan Z.; Basu, Anindita; Satija, Rahul; Nemesh, James; Shekhar, Karthik; Goldman, Melissa; Tirosh, Itay; Bialas, Allison R.; Kamitaki, Nolan; Martersteck, Emily M.; Trombetta, John J.; Weitz, David A.; Sanes, Joshua R.; Shalek, Alex K.; Regev, Aviv; McCarroll, Steven A.
2015-01-01
Summary Cells, the basic units of biological structure and function, vary broadly in type and state. Single-cell genomics can characterize cell identity and function, but limitations of ease and scale have prevented its broad application. Here we describe Drop-Seq, a strategy for quickly profiling thousands of individual cells by separating them into nanoliter-sized aqueous droplets, associating a different barcode with each cell’s RNAs, and sequencing them all together. Drop-Seq analyzes mRNA transcripts from thousands of individual cells simultaneously while remembering transcripts’ cell of origin. We analyzed transcriptomes from 44,808 mouse retinal cells and identified 39 transcriptionally distinct cell populations, creating a molecular atlas of gene expression for known retinal cell classes and novel candidate cell subtypes. Drop-Seq will accelerate biological discovery by enabling routine transcriptional profiling at single-cell resolution. PMID:26000488
Transcript copy number estimation using a mouse whole-genome oligonucleotide microarray
Carter, Mark G; Sharov, Alexei A; VanBuren, Vincent; Dudekula, Dawood B; Carmack, Condie E; Nelson, Charlie; Ko, Minoru SH
2005-01-01
The ability to quantitatively measure the expression of all genes in a given tissue or cell with a single assay is an exciting promise of gene-expression profiling technology. An in situ-synthesized 60-mer oligonucleotide microarray designed to detect transcripts from all mouse genes was validated, as well as a set of exogenous RNA controls derived from the yeast genome (made freely available without restriction), which allow quantitative estimation of absolute endogenous transcript abundance. PMID:15998450
Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project.
Gerstein, Mark B; Lu, Zhi John; Van Nostrand, Eric L; Cheng, Chao; Arshinoff, Bradley I; Liu, Tao; Yip, Kevin Y; Robilotto, Rebecca; Rechtsteiner, Andreas; Ikegami, Kohta; Alves, Pedro; Chateigner, Aurelien; Perry, Marc; Morris, Mitzi; Auerbach, Raymond K; Feng, Xin; Leng, Jing; Vielle, Anne; Niu, Wei; Rhrissorrakrai, Kahn; Agarwal, Ashish; Alexander, Roger P; Barber, Galt; Brdlik, Cathleen M; Brennan, Jennifer; Brouillet, Jeremy Jean; Carr, Adrian; Cheung, Ming-Sin; Clawson, Hiram; Contrino, Sergio; Dannenberg, Luke O; Dernburg, Abby F; Desai, Arshad; Dick, Lindsay; Dosé, Andréa C; Du, Jiang; Egelhofer, Thea; Ercan, Sevinc; Euskirchen, Ghia; Ewing, Brent; Feingold, Elise A; Gassmann, Reto; Good, Peter J; Green, Phil; Gullier, Francois; Gutwein, Michelle; Guyer, Mark S; Habegger, Lukas; Han, Ting; Henikoff, Jorja G; Henz, Stefan R; Hinrichs, Angie; Holster, Heather; Hyman, Tony; Iniguez, A Leo; Janette, Judith; Jensen, Morten; Kato, Masaomi; Kent, W James; Kephart, Ellen; Khivansara, Vishal; Khurana, Ekta; Kim, John K; Kolasinska-Zwierz, Paulina; Lai, Eric C; Latorre, Isabel; Leahey, Amber; Lewis, Suzanna; Lloyd, Paul; Lochovsky, Lucas; Lowdon, Rebecca F; Lubling, Yaniv; Lyne, Rachel; MacCoss, Michael; Mackowiak, Sebastian D; Mangone, Marco; McKay, Sheldon; Mecenas, Desirea; Merrihew, Gennifer; Miller, David M; Muroyama, Andrew; Murray, John I; Ooi, Siew-Loon; Pham, Hoang; Phippen, Taryn; Preston, Elicia A; Rajewsky, Nikolaus; Rätsch, Gunnar; Rosenbaum, Heidi; Rozowsky, Joel; Rutherford, Kim; Ruzanov, Peter; Sarov, Mihail; Sasidharan, Rajkumar; Sboner, Andrea; Scheid, Paul; Segal, Eran; Shin, Hyunjin; Shou, Chong; Slack, Frank J; Slightam, Cindie; Smith, Richard; Spencer, William C; Stinson, E O; Taing, Scott; Takasaki, Teruaki; Vafeados, Dionne; Voronina, Ksenia; Wang, Guilin; Washington, Nicole L; Whittle, Christina M; Wu, Beijing; Yan, Koon-Kiu; Zeller, Georg; Zha, Zheng; Zhong, Mei; Zhou, Xingliang; Ahringer, Julie; Strome, Susan; Gunsalus, Kristin C; Micklem, Gos; Liu, X Shirley; Reinke, Valerie; Kim, Stuart K; Hillier, LaDeana W; Henikoff, Steven; Piano, Fabio; Snyder, Michael; Stein, Lincoln; Lieb, Jason D; Waterston, Robert H
2010-12-24
We systematically generated large-scale data sets to improve genome annotation for the nematode Caenorhabditis elegans, a key model organism. These data sets include transcriptome profiling across a developmental time course, genome-wide identification of transcription factor-binding sites, and maps of chromatin organization. From this, we created more complete and accurate gene models, including alternative splice forms and candidate noncoding RNAs. We constructed hierarchical networks of transcription factor-binding and microRNA interactions and discovered chromosomal locations bound by an unusually large number of transcription factors. Different patterns of chromatin composition and histone modification were revealed between chromosome arms and centers, with similarly prominent differences between autosomes and the X chromosome. Integrating data types, we built statistical models relating chromatin, transcription factor binding, and gene expression. Overall, our analyses ascribed putative functions to most of the conserved genome.
2017-01-01
Recent advances in next-generation sequencing approaches have revolutionized our understanding of transcriptional expression in diverse systems. However, measurements of transcription do not necessarily reflect gene translation, the process of ultimate importance in understanding cellular function. To circumvent this limitation, biochemical tagging of ribosome subunits to isolate ribosome-associated mRNA has been developed. However, this approach, called TRAP, lacks quantitative resolution compared to a superior technology, ribosome profiling. Here, we report the development of an optimized ribosome profiling approach in Drosophila. We first demonstrate successful ribosome profiling from a specific tissue, larval muscle, with enhanced resolution compared to conventional TRAP approaches. We next validate the ability of this technology to define genome-wide translational regulation. This technology is leveraged to test the relative contributions of transcriptional and translational mechanisms in the postsynaptic muscle that orchestrate the retrograde control of presynaptic function at the neuromuscular junction. Surprisingly, we find no evidence that significant changes in the transcription or translation of specific genes are necessary to enable retrograde homeostatic signaling, implying that post-translational mechanisms ultimately gate instructive retrograde communication. Finally, we show that a global increase in translation induces adaptive responses in both transcription and translation of protein chaperones and degradation factors to promote cellular proteostasis. Together, this development and validation of tissue-specific ribosome profiling enables sensitive and specific analysis of translation in Drosophila. PMID:29194454
Model-based redesign of global transcription regulation
Carrera, Javier; Rodrigo, Guillermo; Jaramillo, Alfonso
2009-01-01
Synthetic biology aims to the design or redesign of biological systems. In particular, one possible goal could be the rewiring of the transcription regulation network by exchanging the endogenous promoters. To achieve this objective, we have adapted current methods to the inference of a model based on ordinary differential equations that is able to predict the network response after a major change in its topology. Our procedure utilizes microarray data for training. We have experimentally validated our inferred global regulatory model in Escherichia coli by predicting transcriptomic profiles under new perturbations. We have also tested our methodology in silico by providing accurate predictions of the underlying networks from expression data generated with artificial genomes. In addition, we have shown the predictive power of our methodology by obtaining the gene profile in experimental redesigns of the E. coli genome, where rewiring the transcriptional network by means of knockouts of master regulators or by upregulating transcription factors controlled by different promoters. Our approach is compatible with most network inference methods, allowing to explore computationally future genome-wide redesign experiments in synthetic biology. PMID:19188257
USDA-ARS?s Scientific Manuscript database
Common bean (Phaseolus vulgaris) and soybean (Glycine max) both belong to the Phaseoleae tribe and share significant coding sequence homology. To evaluate the utility of the soybean GeneChip for transcript profiling of common bean, we hybridized cRNAs purified from nodule, leaf, and root of common b...
2012-01-01
Background The molecular mechanisms altered by the traditional mutation and screening approach during the improvement of antibiotic-producing microorganisms are still poorly understood although this information is essential to design rational strategies for industrial strain improvement. In this study, we applied comparative genomics to identify all genetic changes occurring during the development of an erythromycin overproducer obtained using the traditional mutate-and- screen method. Results Compared with the parental Saccharopolyspora erythraea NRRL 2338, the genome of the overproducing strain presents 117 deletion, 78 insertion and 12 transposition sites, with 71 insertion/deletion sites mapping within coding sequences (CDSs) and generating frame-shift mutations. Single nucleotide variations are present in 144 CDSs. Overall, the genomic variations affect 227 proteins of the overproducing strain and a considerable number of mutations alter genes of key enzymes in the central carbon and nitrogen metabolism and in the biosynthesis of secondary metabolites, resulting in the redirection of common precursors toward erythromycin biosynthesis. Interestingly, several mutations inactivate genes coding for proteins that play fundamental roles in basic transcription and translation machineries including the transcription anti-termination factor NusB and the transcription elongation factor Efp. These mutations, along with those affecting genes coding for pleiotropic or pathway-specific regulators, affect global expression profile as demonstrated by a comparative analysis of the parental and overproducer expression profiles. Genomic data, finally, suggest that the mutate-and-screen process might have been accelerated by mutations in DNA repair genes. Conclusions This study helps to clarify the mechanisms underlying antibiotic overproduction providing valuable information about new possible molecular targets for rationale strain improvement. PMID:22401291
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-01-01
Objective This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Methods Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Results Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification (P=0.009) or deletion (P=0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis pinpointed cell cycle pathways was significantly (P=1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Conclusion Chromosomal CNVs may contribute to their transcript expression in cervical cancer. PMID:29312578
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma.
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-12-12
This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification ( P =0.009) or deletion ( P =0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis pinpointed cell cycle pathways was significantly ( P =1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Chromosomal CNVs may contribute to their transcript expression in cervical cancer.
Moskvin, Oleg V; Bolotin, Dmitry; Wang, Andrew; Ivanov, Pavel S; Gomelsky, Mark
2011-02-01
We present Rhodobase, a web-based meta-analytical tool for analysis of transcriptional regulation in a model anoxygenic photosynthetic bacterium, Rhodobacter sphaeroides. The gene association meta-analysis is based on the pooled data from 100 of R. sphaeroides whole-genome DNA microarrays. Gene-centric regulatory networks were visualized using the StarNet approach (Jupiter, D.C., VanBuren, V., 2008. A visual data mining tool that facilitates reconstruction of transcription regulatory networks. PLoS ONE 3, e1717) with several modifications. We developed a means to identify and visualize operons and superoperons. We designed a framework for the cross-genome search for transcription factor binding sites that takes into account high GC-content and oligonucleotide usage profile characteristic of the R. sphaeroides genome. To facilitate reconstruction of directional relationships between co-regulated genes, we screened upstream sequences (-400 to +20bp from start codons) of all genes for putative binding sites of bacterial transcription factors using a self-optimizing search method developed here. To test performance of the meta-analysis tools and transcription factor site predictions, we reconstructed selected nodes of the R. sphaeroides transcription factor-centric regulatory matrix. The test revealed regulatory relationships that correlate well with the experimentally derived data. The database of transcriptional profile correlations, the network visualization engine and the optimized search engine for transcription factor binding sites analysis are available at http://rhodobase.org. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Regulatory variation: an emerging vantage point for cancer biology.
Li, Luolan; Lorzadeh, Alireza; Hirst, Martin
2014-01-01
Transcriptional regulation involves complex and interdependent interactions of noncoding and coding regions of the genome with proteins that interact and modify them. Genetic variation/mutation in coding and noncoding regions of the genome can drive aberrant transcription and disease. In spite of accounting for nearly 98% of the genome comparatively little is known about the contribution of noncoding DNA elements to disease. Genome-wide association studies of complex human diseases including cancer have revealed enrichment for variants in the noncoding genome. A striking finding of recent cancer genome re-sequencing efforts has been the previously underappreciated frequency of mutations in epigenetic modifiers across a wide range of cancer types. Taken together these results point to the importance of dysregulation in transcriptional regulatory control in genesis of cancer. Powered by recent technological advancements in functional genomic profiling, exploration of normal and transformed regulatory networks will provide novel insight into the initiation and progression of cancer and open new windows to future prognostic and diagnostic tools. © 2013 Wiley Periodicals, Inc.
Gao, Qingqing; Xia, Le; Liu, Juanhua; Wang, Xiaobo; Gao, Song; Liu, Xiufan
2016-11-01
Avian pathogenic Escherichia coli (APEC) cause typical extraintestinal infections in poultry, including acute fatal septicemia, subacute pericarditis, and airsacculitis. These bacteria most often infect chickens, turkeys, ducks, and other avian species, and therefore pose a significant economic burden on the poultry industry worldwide. Few studies have analyzed the genome-wide transcriptional profile of APEC during infection in vivo. In this study, we examined the genome-wide transcriptional response of APEC O2 strain E058 in an in vivo chicken infection model to better understand the factors necessary for APEC colonization, growth, and survival in vivo. An Affymetrix multigenome DNA microarray, which contains most of the genomic open reading frames of E. coli K-12 strain MG1655, uropathogenic E. coli strain CFT073, and E. coli O157:H7 strain EDL 933, was used to profile the gene expression in APEC E058. We identified the in vivo transcriptional response of APEC E058 bacteria collected directly from the blood of infected chickens. Significant differences in expression levels were detected between the in vivo expression profile and the in vitro expression profile in LB medium. The genes highly expressed during infection were involved in metabolism, iron acquisition or transport, virulence, response to stress, and biological regulation. The reliability of the microarray data was confirmed by performing quantitative real-time PCR on 12 representative genes. Moreover, several significantly upregulated genes, including yjiY, sodA, phoB and spy, were selected to study their role in APEC pathogenesis. The data will help to better understand the mechanisms of APEC pathogenesis. Copyright © 2016 Elsevier Ltd. All rights reserved.
Global effects of the CSR-1 RNA interference pathway on the transcriptional landscape.
Cecere, Germano; Hoersch, Sebastian; O'Keeffe, Sean; Sachidanandam, Ravi; Grishok, Alla
2014-04-01
Argonaute proteins and their small RNA cofactors short interfering RNAs are known to inhibit gene expression at the transcriptional and post-transcriptional levels. In Caenorhabditis elegans, the Argonaute CSR-1 binds thousands of endogenous siRNAs (endo-siRNAs) that are antisense to germline transcripts. However, its role in gene expression regulation remains controversial. Here we used genome-wide profiling of nascent RNA transcripts and found that the CSR-1 RNA interference pathway promoted sense-oriented RNA polymerase II transcription. Moreover, a loss of CSR-1 function resulted in global increase in antisense transcription and ectopic transcription of silent chromatin domains, which led to reduced chromatin incorporation of centromere-specific histone H3. On the basis of these findings, we propose that the CSR-1 pathway helps maintain the directionality of active transcription, thereby propagating the distinction between transcriptionally active and silent genomic regions.
Gilardi, Federica; Liechti, Robin; Martin, Olivier; Harshman, Keith; Delorenzi, Mauro; Desvergne, Béatrice; Herr, Winship; Deplancke, Bart; Schibler, Ueli; Rougemont, Jacques; Guex, Nicolas; Hernandez, Nouria; Naef, Felix
2012-01-01
Interactions of cell-autonomous circadian oscillators with diurnal cycles govern the temporal compartmentalization of cell physiology in mammals. To understand the transcriptional and epigenetic basis of diurnal rhythms in mouse liver genome-wide, we generated temporal DNA occupancy profiles by RNA polymerase II (Pol II) as well as profiles of the histone modifications H3K4me3 and H3K36me3. We used these data to quantify the relationships of phases and amplitudes between different marks. We found that rhythmic Pol II recruitment at promoters rather than rhythmic transition from paused to productive elongation underlies diurnal gene transcription, a conclusion further supported by modeling. Moreover, Pol II occupancy preceded mRNA accumulation by 3 hours, consistent with mRNA half-lives. Both methylation marks showed that the epigenetic landscape is highly dynamic and globally remodeled during the 24-hour cycle. While promoters of transcribed genes had tri-methylated H3K4 even at their trough activity times, tri-methylation levels reached their peak, on average, 1 hour after Pol II. Meanwhile, rhythms in tri-methylation of H3K36 lagged transcription by 3 hours. Finally, modeling profiles of Pol II occupancy and mRNA accumulation identified three classes of genes: one showing rhythmicity both in transcriptional and mRNA accumulation, a second class with rhythmic transcription but flat mRNA levels, and a third with constant transcription but rhythmic mRNAs. The latter class emphasizes widespread temporally gated posttranscriptional regulation in the mouse liver. PMID:23209382
Decoherence in yeast cell populations and its implications for genome-wide expression noise.
Briones, M R S; Bosco, F
2009-01-20
Gene expression "noise" is commonly defined as the stochastic variation of gene expression levels in different cells of the same population under identical growth conditions. Here, we tested whether this "noise" is amplified with time, as a consequence of decoherence in global gene expression profiles (genome-wide microarrays) of synchronized cells. The stochastic component of transcription causes fluctuations that tend to be amplified as time progresses, leading to a decay of correlations of expression profiles, in perfect analogy with elementary relaxation processes. Measuring decoherence, defined here as a decay in the auto-correlation function of yeast genome-wide expression profiles, we found a slowdown in the decay of correlations, opposite to what would be expected if, as in mixing systems, correlations decay exponentially as the equilibrium state is reached. Our results indicate that the populational variation in gene expression (noise) is a consequence of temporal decoherence, in which the slow decay of correlations is a signature of strong interdependence of the transcription dynamics of different genes.
Alexandrov, Boian S; Fukuyo, Yayoi; Lange, Martin; Horikoshi, Nobuo; Gelev, Vladimir; Rasmussen, Kim Ø; Bishop, Alan R; Usheva, Anny
2012-11-01
The genome-wide mapping of the major gene expression regulators, the transcription factors (TFs) and their DNA binding sites, is of great importance for describing cellular behavior and phenotypic diversity. Presently, the methods for prediction of genomic TF binding produce a large number of false positives, most likely due to insufficient description of the physiochemical mechanisms of protein-DNA binding. Growing evidence suggests that, in the cell, the double-stranded DNA (dsDNA) is subject to local transient strands separations (breathing) that contribute to genomic functions. By using site-specific chromatin immunopecipitations, gel shifts, BIOBASE data, and our model that accurately describes the melting behavior and breathing dynamics of dsDNA we report a specific DNA breathing profile found at YY1 binding sites in cells. We find that the genomic flanking sequence variations and SNPs, may exert long-range effects on DNA dynamics and predetermine YY1 binding. The ubiquitous TF YY1 has a fundamental role in essential biological processes by activating, initiating or repressing transcription depending upon the sequence context it binds. We anticipate that consensus binding sequences together with the related DNA dynamics profile may significantly improve the accuracy of genomic TF binding sites and TF binding-related functional SNPs.
Camps, Jordi; Nguyen, Quang Tri; Padilla-Nash, Hesed M.; Knutsen, Turid; McNeil, Nicole E.; Wangsa, Danny; Hummon, Amanda B.; Grade, Marian; Ried, Thomas; Difilippantonio, Michael J.
2016-01-01
To evaluate the mechanisms and consequences of chromosomal aberrations in colorectal cancer (CRC), we used a combination of spectral karyotyping, array comparative genomic hybridization (aCGH), and array-based global gene expression profiling on 31 primary carcinomas and 15 established cell lines. Importantly, aCGH showed that the genomic profiles of primary tumors are recapitulated in the cell lines. We revealed a preponderance of chromosome breakpoints at sites of copy number variants (CNVs) in the CRC cell lines, a novel mechanism of DNA breakage in cancer. The integration of gene expression and aCGH led to the identification of 157 genes localized within high-level copy number changes whose transcriptional deregulation was significantly affected across all of the samples, thereby suggesting that these genes play a functional role in CRC. Genomic amplification at 8q24 was the most recurrent event and led to the overexpression of MYC and FAM84B. Copy number dependent gene expression resulted in deregulation of known cancer genes such as APC, FGFR2, and ERBB2. The identification of only 36 genes whose localization near a breakpoint could account for their observed deregulated expression demonstrates that the major mechanism for transcriptional deregulation in CRC is genomic copy number changes resulting from chromosomal aberrations. PMID:19691111
Staying alive in adversity: transcriptome dynamics in the stress-resistant dauer larva.
Holt, Suzan J
2006-10-01
In response to food depletion and overcrowding, the soil nematode Caenorhabditis elegans can arrest development and form an alternate third larval stage called the dauer. Though nonfeeding, the dauer larva is long lived and stress resistant. Metabolic and transcription rates are lowered but the transcriptome of the dauer is complex. In this study, distribution analysis of transcript profiles generated by Serial Analysis of Gene Expression (SAGE) in dauer larvae and in mixed developmental stages is presented. An inverse relationship was observed between frequency and abundance/copy number of SAGE tag types (transcripts) in both profiles. In the dauer profile, a relatively greater proportion of highly abundant transcripts was counterbalanced by a smaller fraction of low to moderately abundant transcripts. Comparisons of abundant tag counts between the two profiles revealed relative enrichment in the dauer profile of transcripts with predicted or known involvement in ribosome biogenesis and protein synthesis, membrane transport, and immune responses. Translation-coupled mRNA decay is proposed as part of an immune-like stress response in the dauer larva. An influence of genomic region on transcript level may reflect the coordination of transcription and mRNA turnover.
Lohmann, Ingrid
2012-01-01
In multi-cellular organisms, spatiotemporal activity of cis-regulatory DNA elements depends on their occupancy by different transcription factors (TFs). In recent years, genome-wide ChIP-on-Chip, ChIP-Seq and DamID assays have been extensively used to unravel the combinatorial interaction of TFs with cis-regulatory modules (CRMs) in the genome. Even though genome-wide binding profiles are increasingly becoming available for different TFs, single TF binding profiles are in most cases not sufficient for dissecting complex regulatory networks. Thus, potent computational tools detecting statistically significant and biologically relevant TF-motif co-occurrences in genome-wide datasets are essential for analyzing context-dependent transcriptional regulation. We have developed COPS (Co-Occurrence Pattern Search), a new bioinformatics tool based on a combination of association rules and Markov chain models, which detects co-occurring TF binding sites (BSs) on genomic regions of interest. COPS scans DNA sequences for frequent motif patterns using a Frequent-Pattern tree based data mining approach, which allows efficient performance of the software with respect to both data structure and implementation speed, in particular when mining large datasets. Since transcriptional gene regulation very often relies on the formation of regulatory protein complexes mediated by closely adjoining TF binding sites on CRMs, COPS additionally detects preferred short distance between co-occurring TF motifs. The performance of our software with respect to biological significance was evaluated using three published datasets containing genomic regions that are independently bound by several TFs involved in a defined biological process. In sum, COPS is a fast, efficient and user-friendly tool mining statistically and biologically significant TFBS co-occurrences and therefore allows the identification of TFs that combinatorially regulate gene expression. PMID:23272209
Replication-associated mutational asymmetry in the human genome.
Chen, Chun-Long; Duquenne, Lauranne; Audit, Benjamin; Guilbaud, Guillaume; Rappailles, Aurélien; Baker, Antoine; Huvet, Maxime; d'Aubenton-Carafa, Yves; Hyrien, Olivier; Arneodo, Alain; Thermes, Claude
2011-08-01
During evolution, mutations occur at rates that can differ between the two DNA strands. In the human genome, nucleotide substitutions occur at different rates on the transcribed and non-transcribed strands that may result from transcription-coupled repair. These mutational asymmetries generate transcription-associated compositional skews. To date, the existence of such asymmetries associated with replication has not yet been established. Here, we compute the nucleotide substitution matrices around replication initiation zones identified as sharp peaks in replication timing profiles and associated with abrupt jumps in the compositional skew profile. We show that the substitution matrices computed in these regions fully explain the jumps in the compositional skew profile when crossing initiation zones. In intergenic regions, we observe mutational asymmetries measured as differences between complementary substitution rates; their sign changes when crossing initiation zones. These mutational asymmetries are unlikely to result from cryptic transcription but can be explained by a model based on replication errors and strand-biased repair. In transcribed regions, mutational asymmetries associated with replication superimpose on the previously described mutational asymmetries associated with transcription. We separate the substitution asymmetries associated with both mechanisms, which allows us to determine for the first time in eukaryotes, the mutational asymmetries associated with replication and to reevaluate those associated with transcription. Replication-associated mutational asymmetry may result from unequal rates of complementary base misincorporation by the DNA polymerases coupled with DNA mismatch repair (MMR) acting with different efficiencies on the leading and lagging strands. Replication, acting in germ line cells during long evolutionary times, contributed equally with transcription to produce the present abrupt jumps in the compositional skew. These results demonstrate that DNA replication is one of the major processes that shape human genome composition.
Genome-wide transcription start site profiling in biofilm-grown Burkholderia cenocepacia J2315.
Sass, Andrea M; Van Acker, Heleen; Förstner, Konrad U; Van Nieuwerburgh, Filip; Deforce, Dieter; Vogel, Jörg; Coenye, Tom
2015-10-13
Burkholderia cenocepacia is a soil-dwelling Gram-negative Betaproteobacterium with an important role as opportunistic pathogen in humans. Infections with B. cenocepacia are very difficult to treat due to their high intrinsic resistance to most antibiotics. Biofilm formation further adds to their antibiotic resistance. B. cenocepacia harbours a large, multi-replicon genome with a high GC-content, the reference genome of strain J2315 includes 7374 annotated genes. This study aims to annotate transcription start sites and identify novel transcripts on a whole genome scale. RNA extracted from B. cenocepacia J2315 biofilms was analysed by differential RNA-sequencing and the resulting dataset compared to data derived from conventional, global RNA-sequencing. Transcription start sites were annotated and further analysed according to their position relative to annotated genes. Four thousand ten transcription start sites were mapped over the whole B. cenocepacia genome and the primary transcription start site of 2089 genes expressed in B. cenocepacia biofilms were defined. For 64 genes a start codon alternative to the annotated one was proposed. Substantial antisense transcription for 105 genes and two novel protein coding sequences were identified. The distribution of internal transcription start sites can be used to identify genomic islands in B. cenocepacia. A potassium pump strongly induced only under biofilm conditions was found and 15 non-coding small RNAs highly expressed in biofilms were discovered. Mapping transcription start sites across the B. cenocepacia genome added relevant information to the J2315 annotation. Genes and novel regulatory RNAs putatively involved in B. cenocepacia biofilm formation were identified. These findings will help in understanding regulation of B. cenocepacia biofilm formation.
Chacon, Diego; Beck, Dominik; Perera, Dilmi; Wong, Jason W H; Pimanda, John E
2014-01-01
The BloodChIP database (http://www.med.unsw.edu.au/CRCWeb.nsf/page/BloodChIP) supports exploration and visualization of combinatorial transcription factor (TF) binding at a particular locus in human CD34-positive and other normal and leukaemic cells or retrieval of target gene sets for user-defined combinations of TFs across one or more cell types. Increasing numbers of genome-wide TF binding profiles are being added to public repositories, and this trend is likely to continue. For the power of these data sets to be fully harnessed by experimental scientists, there is a need for these data to be placed in context and easily accessible for downstream applications. To this end, we have built a user-friendly database that has at its core the genome-wide binding profiles of seven key haematopoietic TFs in human stem/progenitor cells. These binding profiles are compared with binding profiles in normal differentiated and leukaemic cells. We have integrated these TF binding profiles with chromatin marks and expression data in normal and leukaemic cell fractions. All queries can be exported into external sites to construct TF-gene and protein-protein networks and to evaluate the association of genes with cellular processes and tissue expression.
DNMT1-interacting RNAs block gene specific DNA methylation
Di Ruscio, Annalisa; Ebralidze, Alexander K.; Benoukraf, Touati; Amabile, Giovanni; Goff, Loyal A.; Terragni, Joylon; Figueroa, Maria Eugenia; De Figureido Pontes, Lorena Lobo; Alberich-Jorda, Meritxell; Zhang, Pu; Wu, Mengchu; D’Alò, Francesco; Melnick, Ari; Leone, Giuseppe; Ebralidze, Konstantin K.; Pradhan, Sriharsa; Rinn, John L.; Tenen, Daniel G.
2013-01-01
Summary DNA methylation was described almost a century ago. However, the rules governing its establishment and maintenance remain elusive. Here, we present data demonstrating that active transcription regulates levels of genomic methylation. We identified a novel RNA arising from the CEBPA gene locus critical in regulating the local DNA methylation profile. This RNA binds to DNMT1 and prevents CEBPA gene locus methylation. Deep sequencing of transcripts associated with DNMT1 combined with genome-scale methylation and expression profiling extended the generality of this finding to numerous gene loci. Collectively, these results delineate the nature of DNMT1-RNA interactions and suggest strategies for gene selective demethylation of therapeutic targets in disease. PMID:24107992
High-resolution mapping of transcription factor binding sites on native chromatin
Kasinathan, Sivakanthan; Orsi, Guillermo A.; Zentner, Gabriel E.; Ahmad, Kami; Henikoff, Steven
2014-01-01
Sequence-specific DNA-binding proteins including transcription factors (TFs) are key determinants of gene regulation and chromatin architecture. Formaldehyde cross-linking and sonication followed by Chromatin ImmunoPrecipitation (X-ChIP) is widely used for profiling of TF binding, but is limited by low resolution and poor specificity and sensitivity. We present a simple protocol that starts with micrococcal nuclease-digested uncross-linked chromatin and is followed by affinity purification of TFs and paired-end sequencing. The resulting ORGANIC (Occupied Regions of Genomes from Affinity-purified Naturally Isolated Chromatin) profiles of Saccharomyces cerevisiae Abf1 and Reb1 provide highly accurate base-pair resolution maps that are not biased toward accessible chromatin, and do not require input normalization. We also demonstrate the high specificity of our method when applied to larger genomes by profiling Drosophila melanogaster GAGA Factor and Pipsqueak. Our results suggest that ORGANIC profiling is a widely applicable high-resolution method for sensitive and specific profiling of direct protein-DNA interactions. PMID:24336359
Publication Abstract: Philadelphia chromosome-like acute lymphoblastic leukemia (Ph-like ALL) is characterized by a gene-expression profile similar to that of BCR-ABL1-positive ALL, alterations of lymphoid transcription factor genes, and a poor outcome. The frequency and spectrum of genetic alterations in Ph-like ALL and its responsiveness to tyrosine kinase inhibition are undefined, especially in adolescents and adults. We performed genomic profiling of 1725 patients with precursor B-cell ALL and detailed genomic analysis of 154 patients with Ph-like ALL.
Vivek-Ananth, R P; Samal, Areejit
2016-09-01
A major goal of systems biology is to build predictive computational models of cellular metabolism. Availability of complete genome sequences and wealth of legacy biochemical information has led to the reconstruction of genome-scale metabolic networks in the last 15 years for several organisms across the three domains of life. Due to paucity of information on kinetic parameters associated with metabolic reactions, the constraint-based modelling approach, flux balance analysis (FBA), has proved to be a vital alternative to investigate the capabilities of reconstructed metabolic networks. In parallel, advent of high-throughput technologies has led to the generation of massive amounts of omics data on transcriptional regulation comprising mRNA transcript levels and genome-wide binding profile of transcriptional regulators. A frontier area in metabolic systems biology has been the development of methods to integrate the available transcriptional regulatory information into constraint-based models of reconstructed metabolic networks in order to increase the predictive capabilities of computational models and understand the regulation of cellular metabolism. Here, we review the existing methods to integrate transcriptional regulatory information into constraint-based models of metabolic networks. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Genome-wide transcriptional profiling of human glioblastoma cells in response to ITE treatment.
Kang, Bo; Zhou, Yanwen; Zheng, Min; Wang, Ying-Jie
2015-09-01
A ligand-activated transcription factor aryl hydrocarbon receptor (AhR) is recently revealed to play a key role in embryogenesis and tumorigenesis (Feng et al. [1], Safe et al. [2]) and 2-(1'H-indole-3'-carbonyl)-thiazole-4-carboxylic acid methyl ester (ITE) (Song et al. [3]) is an endogenous AhR ligand that possesses anti-tumor activity. In order to gain insights into how ITE acts via the AhR in embryogenesis and tumorigenesis, we analyzed the genome-wide transcriptional profiles of the following three groups of cells: the human glioblastoma U87 parental cells, U87 tumor sphere cells treated with vehicle (DMSO) and U87 tumor sphere cells treated with ITE. Here, we provide the details of the sample gathering strategy and show the quality controls and the analyses associated with our gene array data deposited into the Gene Expression Omnibus (GEO) under the accession code of GSE67986.
Mikhaylova, Lyudmila; Zhang, Yiming; Kobzik, Lester; Fedulov, Alexey V
2013-01-01
We investigated the link between epigenome-wide methylation aberrations at birth and genomic transcriptional changes upon allergen sensitization that occur in the neonatal dendritic cells (DC) due to maternal asthma. We previously demonstrated that neonates of asthmatic mothers are born with a functional skew in splenic DCs that can be seen even in allergen-naïve pups and can convey allergy responses to normal recipients. However, minimal-to-no transcriptional or phenotypic changes were found to explain this alteration. Here we provide in-depth analysis of genome-wide DNA methylation profiles and RNA transcriptional (microarray) profiles before and after allergen sensitization. We identified differentially methylated and differentially expressed loci and performed manually-curated matching of methylation status of the key regulatory sequences (promoters and CpG islands) to expression of their respective transcripts before and after sensitization. We found that while allergen-naive DCs from asthma-at-risk neonates have minimal transcriptional change compared to controls, the methylation changes are extensive. The substantial transcriptional change only becomes evident upon allergen sensitization, when it occurs in multiple genes with the pre-existing epigenetic alterations. We demonstrate that maternal asthma leads to both hyper- and hypomethylation in neonatal DCs, and that both types of events at various loci significantly overlap with transcriptional responses to allergen. Pathway analysis indicates that approximately 1/2 of differentially expressed and differentially methylated genes directly interact in known networks involved in allergy and asthma processes. We conclude that congenital epigenetic changes in DCs are strongly linked to altered transcriptional responses to allergen and to early-life asthma origin. The findings are consistent with the emerging paradigm that asthma is a disease with underlying epigenetic changes.
Principles of regulatory information conservation between mouse and human.
Cheng, Yong; Ma, Zhihai; Kim, Bong-Hyun; Wu, Weisheng; Cayting, Philip; Boyle, Alan P; Sundaram, Vasavi; Xing, Xiaoyun; Dogan, Nergiz; Li, Jingjing; Euskirchen, Ghia; Lin, Shin; Lin, Yiing; Visel, Axel; Kawli, Trupti; Yang, Xinqiong; Patacsil, Dorrelyn; Keller, Cheryl A; Giardine, Belinda; Kundaje, Anshul; Wang, Ting; Pennacchio, Len A; Weng, Zhiping; Hardison, Ross C; Snyder, Michael P
2014-11-20
To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human-mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.
Strand-specific transcriptome profiling with directly labeled RNA on genomic tiling microarrays
2011-01-01
Background With lower manufacturing cost, high spot density, and flexible probe design, genomic tiling microarrays are ideal for comprehensive transcriptome studies. Typically, transcriptome profiling using microarrays involves reverse transcription, which converts RNA to cDNA. The cDNA is then labeled and hybridized to the probes on the arrays, thus the RNA signals are detected indirectly. Reverse transcription is known to generate artifactual cDNA, in particular the synthesis of second-strand cDNA, leading to false discovery of antisense RNA. To address this issue, we have developed an effective method using RNA that is directly labeled, thus by-passing the cDNA generation. This paper describes this method and its application to the mapping of transcriptome profiles. Results RNA extracted from laboratory cultures of Porphyromonas gingivalis was fluorescently labeled with an alkylation reagent and hybridized directly to probes on genomic tiling microarrays specifically designed for this periodontal pathogen. The generated transcriptome profile was strand-specific and produced signals close to background level in most antisense regions of the genome. In contrast, high levels of signal were detected in the antisense regions when the hybridization was done with cDNA. Five antisense areas were tested with independent strand-specific RT-PCR and none to negligible amplification was detected, indicating that the strong antisense cDNA signals were experimental artifacts. Conclusions An efficient method was developed for mapping transcriptome profiles specific to both coding strands of a bacterial genome. This method chemically labels and uses extracted RNA directly in microarray hybridization. The generated transcriptome profile was free of cDNA artifactual signals. In addition, this method requires fewer processing steps and is potentially more sensitive in detecting small amount of RNA compared to conventional end-labeling methods due to the incorporation of more fluorescent molecules per RNA fragment. PMID:21235785
USDA-ARS?s Scientific Manuscript database
Colony development, which includes hyphal extension, branching, anastomosis and asexual sporulation are fundamental aspects of the lifecycle of filamentous fungi; genetic mechanisms underlying these phenomena are poorly understood. We conducted transcriptional profiling during colony development of...
Damienikan, Aliaksandr U.
2016-01-01
The majority of bacterial genome annotations are currently automated and based on a ‘gene by gene’ approach. Regulatory signals and operon structures are rarely taken into account which often results in incomplete and even incorrect gene function assignments. Here we present SigmoID, a cross-platform (OS X, Linux and Windows) open-source application aiming at simplifying the identification of transcription regulatory sites (promoters, transcription factor binding sites and terminators) in bacterial genomes and providing assistance in correcting annotations in accordance with regulatory information. SigmoID combines a user-friendly graphical interface to well known command line tools with a genome browser for visualising regulatory elements in genomic context. Integrated access to online databases with regulatory information (RegPrecise and RegulonDB) and web-based search engines speeds up genome analysis and simplifies correction of genome annotation. We demonstrate some features of SigmoID by constructing a series of regulatory protein binding site profiles for two groups of bacteria: Soft Rot Enterobacteriaceae (Pectobacterium and Dickeya spp.) and Pseudomonas spp. Furthermore, we inferred over 900 transcription factor binding sites and alternative sigma factor promoters in the annotated genome of Pectobacterium atrosepticum. These regulatory signals control putative transcription units covering about 40% of the P. atrosepticum chromosome. Reviewing the annotation in cases where it didn’t fit with regulatory information allowed us to correct product and gene names for over 300 loci. PMID:27257541
RNA-Seq for gene identification and transcript profiling of three Stevia rebaudiana genotypes.
Chen, Junwen; Hou, Kai; Qin, Peng; Liu, Hongchang; Yi, Bin; Yang, Wenting; Wu, Wei
2014-07-07
Stevia (Stevia rebaudiana) is an important medicinal plant that yields diterpenoid steviol glycosides (SGs). SGs are currently used in the preparation of medicines, food products and neutraceuticals because of its sweetening property (zero calories and about 300 times sweeter than sugar). Recently, some progress has been made in understanding the biosynthesis of SGs in Stevia, but little is known about the molecular mechanisms underlying this process. Additionally, the genomics of Stevia, a non-model species, remains uncharacterized. The recent advent of RNA-Seq, a next generation sequencing technology, provides an opportunity to expand the identification of Stevia genes through in-depth transcript profiling. We present a comprehensive landscape of the transcriptome profiles of three genotypes of Stevia with divergent SG compositions characterized using RNA-seq. 191,590,282 high-quality reads were generated and then assembled into 171,837 transcripts with an average sequence length of 969 base pairs. A total of 80,160 unigenes were annotated, and 14,211 of the unique sequences were assigned to specific metabolic pathways by the Kyoto Encyclopedia of Genes and Genomes. Gene sequences of all enzymes known to be involved in SG synthesis were examined. A total of 143 UDP-glucosyltransferase (UGT) unigenes were identified, some of which might be involved in SG biosynthesis. The expression patterns of eight of these genes were further confirmed by RT-QPCR. RNA-seq analysis identified candidate genes encoding enzymes responsible for the biosynthesis of SGs in Stevia, a non-model plant without a reference genome. The transcriptome data from this study yielded new insights into the process of SG accumulation in Stevia. Our results demonstrate that RNA-Seq can be successfully used for gene identification and transcript profiling in a non-model species.
Global effects of the CSR-1 RNA interference pathway on transcriptional landscape
Cecere, Germano; Hoersch, Sebastian; O’Keeffe, Sean; Sachidanandam, Ravi; Grishok, Alla
2014-01-01
Argonaute proteins and their small RNA co-factors short interfering RNAs (siRNAs) are known to inhibit gene expression at the transcriptional and post-transcriptional levels. In Caenorhabditis elegans, the Argonaute CSR-1 binds thousands of endogenous siRNAs (endo-siRNAs) antisense to germline transcripts and associates with chromatin in a siRNA-dependent manner. However, its role in gene expression regulation remains controversial. Here, we used a genome-wide profiling of nascent RNA transcripts to demonstrate that the CSR-1 RNAi pathway promotes sense-oriented Pol II transcription. Moreover, a loss of CSR-1 function resulted in global increase in antisense transcription and ectopic transcription of silent chromatin domains, which led to reduced chromatin incorporation of centromere-specific histone H3. Based on these findings, we propose that the CSR-1 pathway has a role in maintaining the directionality of active transcription thereby propagating the distinction between transcriptionally active and silent genomic regions. PMID:24681887
Structural properties of prokaryotic promoter regions correlate with functional features.
Meysman, Pieter; Collado-Vides, Julio; Morett, Enrique; Viola, Roberto; Engelen, Kristof; Laukens, Kris
2014-01-01
The structural properties of the DNA molecule are known to play a critical role in transcription. In this paper, the structural profiles of promoter regions were studied within the context of their diversity and their function for eleven prokaryotic species; Escherichia coli, Klebsiella pneumoniae, Salmonella Typhimurium, Pseudomonas auroginosa, Geobacter sulfurreducens Helicobacter pylori, Chlamydophila pneumoniae, Synechocystis sp., Synechoccocus elongates, Bacillus anthracis, and the archaea Sulfolobus solfataricus. The main anchor point for these promoter regions were transcription start sites identified through high-throughput experiments or collected within large curated databases. Prokaryotic promoter regions were found to be less stable and less flexible than the genomic mean across all studied species. However, direct comparison between species revealed differences in their structural profiles that can not solely be explained by the difference in genomic GC content. In addition, comparison with functional data revealed that there are patterns in the promoter structural profiles that can be linked to specific functional loci, such as sigma factor regulation or transcription factor binding. Interestingly, a novel structural element clearly visible near the transcription start site was found in genes associated with essential cellular functions and growth in several species. Our analyses reveals the great diversity in promoter structural profiles both between and within prokaryotic species. We observed relationships between structural diversity and functional features that are interesting prospects for further research to yet uncharacterized functional loci defined by DNA structural properties.
Grade, Marian; Hörmann, Patrick; Becker, Sandra; Hummon, Amanda B.; Wangsa, Danny; Varma, Sudhir; Simon, Richard; Liersch, Torsten; Becker, Heinz; Difilippantonio, Michael J.; Ghadimi, B. Michael; Ried, Thomas
2016-01-01
To characterize patterns of global transcriptional deregulation in primary colon carcinomas, we did gene expression profiling of 73 tumors [Unio Internationale Contra Cancrum stage II (n = 33) and stage III (n = 40)] using oligonucleotide microarrays. For 30 of the tumors, expression profiles were compared with those from matched normal mucosa samples. We identified a set of 1,950 genes with highly significant deregulation between tumors and mucosa samples (P < 1e–7). A significant proportion of these genes mapped to chromosome 20 (P = 0.01). Seventeen genes had a >5-fold average expression difference between normal colon mucosa and carcinomas, including up-regulation of MYC and of HMGA1, a putative oncogene. Furthermore, we identified 68 genes that were significantly differentially expressed between lymph node–negative and lymph node–positive tumors (P < 0.001), the functional annotation of which revealed a preponderance of genes that play a role in cellular immune response and surveillance. The microarray-derived gene expression levels of 20 deregulated genes were validated using quantitative real-time reverse transcription-PCR in >40 tumor and normal mucosa samples with good concordance between the techniques. Finally, we established a relationship between specific genomic imbalances, which were mapped for 32 of the analyzed colon tumors by comparative genomic hybridization, and alterations of global transcriptional activity. Previously, we had conducted a similar analysis of primary rectal carcinomas. The systematic comparison of colon and rectal carcinomas revealed a significant overlap of genomic imbalances and transcriptional deregulation, including activation of the Wnt/β-catenin signaling cascade, suggesting similar pathogenic pathways. PMID:17210682
Grade, Marian; Hörmann, Patrick; Becker, Sandra; Hummon, Amanda B; Wangsa, Danny; Varma, Sudhir; Simon, Richard; Liersch, Torsten; Becker, Heinz; Difilippantonio, Michael J; Ghadimi, B Michael; Ried, Thomas
2007-01-01
To characterize patterns of global transcriptional deregulation in primary colon carcinomas, we did gene expression profiling of 73 tumors [Unio Internationale Contra Cancrum stage II (n = 33) and stage III (n = 40)] using oligonucleotide microarrays. For 30 of the tumors, expression profiles were compared with those from matched normal mucosa samples. We identified a set of 1,950 genes with highly significant deregulation between tumors and mucosa samples (P < 1e-7). A significant proportion of these genes mapped to chromosome 20 (P = 0.01). Seventeen genes had a >5-fold average expression difference between normal colon mucosa and carcinomas, including up-regulation of MYC and of HMGA1, a putative oncogene. Furthermore, we identified 68 genes that were significantly differentially expressed between lymph node-negative and lymph node-positive tumors (P < 0.001), the functional annotation of which revealed a preponderance of genes that play a role in cellular immune response and surveillance. The microarray-derived gene expression levels of 20 deregulated genes were validated using quantitative real-time reverse transcription-PCR in >40 tumor and normal mucosa samples with good concordance between the techniques. Finally, we established a relationship between specific genomic imbalances, which were mapped for 32 of the analyzed colon tumors by comparative genomic hybridization, and alterations of global transcriptional activity. Previously, we had conducted a similar analysis of primary rectal carcinomas. The systematic comparison of colon and rectal carcinomas revealed a significant overlap of genomic imbalances and transcriptional deregulation, including activation of the Wnt/beta-catenin signaling cascade, suggesting similar pathogenic pathways.
An integrated workflow for analysis of ChIP-chip data.
Weigelt, Karin; Moehle, Christoph; Stempfl, Thomas; Weber, Bernhard; Langmann, Thomas
2008-08-01
Although ChIP-chip is a powerful tool for genome-wide discovery of transcription factor target genes, the steps involving raw data analysis, identification of promoters, and correlation with binding sites are still laborious processes. Therefore, we report an integrated workflow for the analysis of promoter tiling arrays with the Genomatix ChipInspector system. We compare this tool with open-source software packages to identify PU.1 regulated genes in mouse macrophages. Our results suggest that ChipInspector data analysis, comparative genomics for binding site prediction, and pathway/network modeling significantly facilitate and enhance whole-genome promoter profiling to reveal in vivo sites of transcription factor-DNA interactions.
Kim, Ji Eun; Lee, Min Hee; Cho, Eun Ju; Kim, Ji Hong; Chung, Byung Yeoup; Kim, Jin-Hong
2013-12-01
Ionizing radiation causes various epigenetic changes, as well as a variety of DNA lesions such as strand breaks, cross-links, oxidative damages, etc., in genomes. However, radiation-induced epigenetic changes have rarely been substantiated in plant genomes. The current study investigates whether DNA methylation of Arabidopsis thaliana genome is altered by gamma rays. We found that genomic DNA methylation decreased in wild-type plants with increasing doses of gamma rays (5, 50 and 200 Gy). Irradiation with 200 Gy significantly increased the expression of transcriptionally inactive centromeric 180-bp (CEN) and transcriptionally silent information (TSI) repeats. This increase suggested that there was a substantial release of transcriptional gene silencing by gamma rays, probably by induction of DNA hypomethylation. High expression of the DNA demethylase ROS1 and low expression of the DNA methyltransferase CMT3 supported this hypothesis. Moreover, Southern blot analysis following digestion of genomic DNA with methylation-sensitive enzymes revealed that the DNA hypomethylation occured preferentially at CHG or CHH sites rather than CG sites, depending on the radiation dose. Unlike CEN and TSI repeats, the number of Ta3, AtSN1 and FWA repeats decreased in transcription but increased in non-CG methylation. In addition, the cmt3-11 mutant showed neither DNA hypomethylation nor transcriptional activation of silenced repeats upon gamma irradiation. Furthermore, profiles of genome-wide transcriptomes in response to gamma rays differed between the wild-type and cmt3-11 mutant. These results suggest that gamma irradiation induced DNA hypomethylation preferentially at non-CG sites of transcriptionally inactive repeats in a locus-specific manner, which depends on CMT3 activity.
Sunkel, Benjamin; Wu, Dayong; Chen, Zhong; Wang, Chiou-Miin; Liu, Xiangtao; Ye, Zhenqing; Horning, Aaron M; Liu, Joseph; Mahalingam, Devalingam; Lopez-Nicora, Horacio; Lin, Chun-Lin; Goodfellow, Paul J; Clinton, Steven K; Jin, Victor X; Chen, Chun-Liang; Huang, Tim H-M; Wang, Qianben
2016-05-19
Identifying prostate cancer-driving transcription factors (TFs) in addition to the androgen receptor promises to improve our ability to effectively diagnose and treat this disease. We employed an integrative genomics analysis of master TFs CREB1 and FoxA1 in androgen-dependent prostate cancer (ADPC) and castration-resistant prostate cancer (CRPC) cell lines, primary prostate cancer tissues and circulating tumor cells (CTCs) to investigate their role in defining prostate cancer gene expression profiles. Combining genome-wide binding site and gene expression profiles we define CREB1 as a critical driver of pro-survival, cell cycle and metabolic transcription programs. We show that CREB1 and FoxA1 co-localize and mutually influence each other's binding to define disease-driving transcription profiles associated with advanced prostate cancer. Gene expression analysis in human prostate cancer samples found that CREB1/FoxA1 target gene panels predict prostate cancer recurrence. Finally, we showed that this signaling pathway is sensitive to compounds that inhibit the transcription co-regulatory factor MED1. These findings not only reveal a novel, global transcriptional co-regulatory function of CREB1 and FoxA1, but also suggest CREB1/FoxA1 signaling is a targetable driver of prostate cancer progression and serves as a biomarker of poor clinical outcomes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
A sequence-based survey of the complex structural organization of tumor genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Collins, Colin; Raphael, Benjamin J.; Volik, Stanislav
2008-04-03
The genomes of many epithelial tumors exhibit extensive chromosomal rearrangements. All classes of genome rearrangements can be identified using End Sequencing Profiling (ESP), which relies on paired-end sequencing of cloned tumor genomes. In this study, brain, breast, ovary and prostate tumors along with three breast cancer cell lines were surveyed with ESP yielding the largest available collection of sequence-ready tumor genome breakpoints and providing evidence that some rearrangements may be recurrent. Sequencing and fluorescence in situ hybridization (FISH) confirmed translocations and complex tumor genome structures that include coamplification and packaging of disparate genomic loci with associated molecular heterogeneity. Comparison ofmore » the tumor genomes suggests recurrent rearrangements. Some are likely to be novel structural polymorphisms, whereas others may be bona fide somatic rearrangements. A recurrent fusion transcript in breast tumors and a constitutional fusion transcript resulting from a segmental duplication were identified. Analysis of end sequences for single nucleotide polymorphisms (SNPs) revealed candidate somatic mutations and an elevated rate of novel SNPs in an ovarian tumor. These results suggest that the genomes of many epithelial tumors may be far more dynamic and complex than previously appreciated and that genomic fusions including fusion transcripts and proteins may be common, possibly yielding tumor-specific biomarkers and therapeutic targets.« less
Coate, Jeremy E; Doyle, Jeff J
2010-01-01
Evolutionary biologists are increasingly comparing gene expression patterns across species. Due to the way in which expression assays are normalized, such studies provide no direct information about expression per gene copy (dosage responses) or per cell and can give a misleading picture of genes that are differentially expressed. We describe an assay for estimating relative expression per cell. When used in conjunction with transcript profiling data, it is possible to compare the sizes of whole transcriptomes, which in turn makes it possible to compare expression per cell for each gene in the transcript profiling data set. We applied this approach, using quantitative reverse transcriptase-polymerase chain reaction and high throughput RNA sequencing, to a recently formed allopolyploid and showed that its leaf transcriptome was approximately 1.4-fold larger than either progenitor transcriptome (70% of the sum of the progenitor transcriptomes). In contrast, the allopolyploid genome is 94.3% as large as the sum of its progenitor genomes and retains > or =93.5% of the sum of its progenitor gene complements. Thus, "transcriptome downsizing" is greater than genome downsizing. Using this transcriptome size estimate, we inferred dosage responses for several thousand genes and showed that the majority exhibit partial dosage compensation. Homoeologue silencing is nonrandomly distributed across dosage responses, with genes showing extreme responses in either direction significantly more likely to have a silent homoeologue. This experimental approach will add value to transcript profiling experiments involving interspecies and interploidy comparisons by converting expression per transcriptome to expression per genome, eliminating the need for assumptions about transcriptome size.
Bloom, Chloe I; Graham, Christine M; Berry, Matthew P R; Rozakeas, Fotini; Redford, Paul S; Wang, Yuanyuan; Xu, Zhaohui; Wilkinson, Katalin A; Wilkinson, Robert J; Kendrick, Yvonne; Devouassoux, Gilles; Ferry, Tristan; Miyara, Makoto; Bouvry, Diane; Valeyre, Dominique; Dominique, Valeyre; Gorochov, Guy; Blankenship, Derek; Saadatian, Mitra; Vanhems, Phillip; Beynon, Huw; Vancheeswaran, Rama; Wickremasinghe, Melissa; Chaussabel, Damien; Banchereau, Jacques; Pascual, Virginia; Ho, Ling-Pei; Lipman, Marc; O'Garra, Anne
2013-01-01
New approaches to define factors underlying the immunopathogenesis of pulmonary diseases including sarcoidosis and tuberculosis are needed to develop new treatments and biomarkers. Comparing the blood transcriptional response of tuberculosis to other similar pulmonary diseases will advance knowledge of disease pathways and help distinguish diseases with similar clinical presentations. To determine the factors underlying the immunopathogenesis of the granulomatous diseases, sarcoidosis and tuberculosis, by comparing the blood transcriptional responses in these and other pulmonary diseases. We compared whole blood genome-wide transcriptional profiles in pulmonary sarcoidosis, pulmonary tuberculosis, to community acquired pneumonia and primary lung cancer and healthy controls, before and after treatment, and in purified leucocyte populations. An Interferon-inducible neutrophil-driven blood transcriptional signature was present in both sarcoidosis and tuberculosis, with a higher abundance and expression in tuberculosis. Heterogeneity of the sarcoidosis signature correlated significantly with disease activity. Transcriptional profiles in pneumonia and lung cancer revealed an over-abundance of inflammatory transcripts. After successful treatment the transcriptional activity in tuberculosis and pneumonia patients was significantly reduced. However the glucocorticoid-responsive sarcoidosis patients showed a significant increase in transcriptional activity. 144-blood transcripts were able to distinguish tuberculosis from other lung diseases and controls. Tuberculosis and sarcoidosis revealed similar blood transcriptional profiles, dominated by interferon-inducible transcripts, while pneumonia and lung cancer showed distinct signatures, dominated by inflammatory genes. There were also significant differences between tuberculosis and sarcoidosis in the degree of their transcriptional activity, the heterogeneity of their profiles and their transcriptional response to treatment.
Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data
Liu, Zhi-Ping
2015-01-01
Transcriptional regulation plays vital roles in many fundamental biological processes. Reverse engineering of genome-wide regulatory networks from high-throughput transcriptomic data provides a promising way to characterize the global scenario of regulatory relationships between regulators and their targets. In this review, we summarize and categorize the main frameworks and methods currently available for inferring transcriptional regulatory networks from microarray gene expression profiling data. We overview each of strategies and introduce representative methods respectively. Their assumptions, advantages, shortcomings, and possible improvements and extensions are also clarified and commented. PMID:25937810
USDA-ARS?s Scientific Manuscript database
Transcriptional profiles of soybean (Glycine max, L. Merr) near isogenic lines Clark (PI548553, iron efficient) and IsoClark (PI547430, iron inefficient) were analyzed and compared using the Affymetrix® GeneChip® Soybean Genome Array. A comparison of plants grown under Fe-sufficient and Fe-limited ...
Wong, Chui E; Bhalla, Prem L; Ottenhof, Harald; Singh, Mohan B
2008-01-01
Background Despite the importance of the shoot apical meristem (SAM) in plant development and organ formation, our understanding of the molecular mechanisms controlling its function is limited. Genomic tools have the potential to unravel the molecular mysteries of the SAM, and legume systems are increasingly being used in plant-development studies owing to their unique characteristics such as nitrogen fixation, secondary metabolism, and pod development. Garden pea (Pisum sativum) is a well-established classic model species for genetics studies that has been used since the Mendel era. In addition, the availability of a plethora of developmental mutants makes pea an ideal crop legume for genomics studies. This study aims to utilise genomics tools in isolating genes that play potential roles in the regulation of SAM activity. Results In order to identify genes that are differentially expressed in the SAM, we generated 2735 ESTs from three cDNA libraries derived from freshly micro-dissected SAMs from 10-day-old garden peas (Pisum sativum cv Torsdag). Custom-designed oligonucleotide arrays were used to compare the transcriptional profiles of pea SAMs and non-meristematic tissues. A total of 184 and 175 transcripts were significantly up- or down-regulated in the pea SAM, respectively. As expected, close to 61% of the transcripts down-regulated in the SAM were found in the public database, whereas sequences from the same source only comprised 12% of the genes that were expressed at higher levels in the SAM. This highlights the under-representation of transcripts from the meristematic tissues in the current public pea protein database, and demonstrates the utility of our SAM EST collection as an essential genetic resource for revealing further information on the regulation of this developmental process. In addition to unknowns, many of the up-regulated transcripts are known to encode products associated with cell division and proliferation, epigenetic regulation, auxin-mediated responses and microRNA regulation. Conclusion The presented data provide a picture of the transcriptional profile of the pea SAM, and reveal possible roles of differentially expressed transcripts in meristem function and maintenance. PMID:18590528
Chatterjee, Sumantra; Sivakamasundari, V; Yap, Sook Peng; Kraus, Petra; Kumar, Vibhor; Xing, Xing; Lim, Siew Lan; Sng, Joel; Prabhakar, Shyam; Lufkin, Thomas
2014-12-05
Vertebrate organogenesis is a highly complex process involving sequential cascades of transcription factor activation or repression. Interestingly a single developmental control gene can occasionally be essential for the morphogenesis and differentiation of tissues and organs arising from vastly disparate embryological lineages. Here we elucidated the role of the mammalian homeobox gene Bapx1 during the embryogenesis of five distinct organs at E12.5 - vertebral column, spleen, gut, forelimb and hindlimb - using expression profiling of sorted wildtype and mutant cells combined with genome wide binding site analysis. Furthermore we analyzed the development of the vertebral column at the molecular level by combining transcriptional profiling and genome wide binding data for Bapx1 with similarly generated data sets for Sox9 to assemble a detailed gene regulatory network revealing genes previously not reported to be controlled by either of these two transcription factors. The gene regulatory network appears to control cell fate decisions and morphogenesis in the vertebral column along with the prevention of premature chondrocyte differentiation thus providing a detailed molecular view of vertebral column development.
Genome wide interactions of wild-type and activator bypass forms of σ54.
Schaefer, Jorrit; Engl, Christoph; Zhang, Nan; Lawton, Edward; Buck, Martin
2015-09-03
Enhancer-dependent transcription involving the promoter specificity factor σ(54) is widely distributed amongst bacteria and commonly associated with cell envelope function. For transcription initiation, σ(54)-RNA polymerase yields open promoter complexes through its remodelling by cognate AAA+ ATPase activators. Since activators can be bypassed in vitro, bypass transcription in vivo could be a source of emergent gene expression along evolutionary pathways yielding new control networks and transcription patterns. At a single test promoter in vivo bypass transcription was not observed. We now use genome-wide transcription profiling, genome-wide mutagenesis and gene over-expression strategies in Escherichia coli, to (i) scope the range of bypass transcription in vivo and (ii) identify genes which might alter bypass transcription in vivo. We find little evidence for pervasive bypass transcription in vivo with only a small subset of σ(54) promoters functioning without activators. Results also suggest no one gene limits bypass transcription in vivo, arguing bypass transcription is strongly kept in check. Promoter sequences subject to repression by σ(54) were evident, indicating loss of rpoN (encoding σ(54)) rather than creating rpoN bypass alleles would be one evolutionary route for new gene expression patterns. Finally, cold-shock promoters showed unusual σ(54)-dependence in vivo not readily correlated with conventional σ(54) binding-sites. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genome wide interactions of wild-type and activator bypass forms of σ54
Schaefer, Jorrit; Engl, Christoph; Zhang, Nan; Lawton, Edward; Buck, Martin
2015-01-01
Enhancer-dependent transcription involving the promoter specificity factor σ54 is widely distributed amongst bacteria and commonly associated with cell envelope function. For transcription initiation, σ54-RNA polymerase yields open promoter complexes through its remodelling by cognate AAA+ ATPase activators. Since activators can be bypassed in vitro, bypass transcription in vivo could be a source of emergent gene expression along evolutionary pathways yielding new control networks and transcription patterns. At a single test promoter in vivo bypass transcription was not observed. We now use genome-wide transcription profiling, genome-wide mutagenesis and gene over-expression strategies in Escherichia coli, to (i) scope the range of bypass transcription in vivo and (ii) identify genes which might alter bypass transcription in vivo. We find little evidence for pervasive bypass transcription in vivo with only a small subset of σ54 promoters functioning without activators. Results also suggest no one gene limits bypass transcription in vivo, arguing bypass transcription is strongly kept in check. Promoter sequences subject to repression by σ54 were evident, indicating loss of rpoN (encoding σ54) rather than creating rpoN bypass alleles would be one evolutionary route for new gene expression patterns. Finally, cold-shock promoters showed unusual σ54-dependence in vivo not readily correlated with conventional σ54 binding-sites. PMID:26082500
Comparative transcriptional profiling identifies takeout as a gene that regulates life span
Bauer, Johannes; Antosh, Michael; Chang, Chengyi; Schorl, Christoph; Kolli, Santharam; Neretti, Nicola; Helfand, Stephen L.
2010-01-01
A major challenge in translating the positive effects of dietary restriction (DR) for the improvement of human health is the development of therapeutic mimics. One approach to finding DR mimics is based upon identification of the proximal effectors of DR life span extension. Whole genome profiling of DR in Drosophila shows a large number of changes in gene expression, making it difficult to establish which changes are involved in life span determination as opposed to other unrelated physiological changes. We used comparative whole genome expression profiling to discover genes whose change in expression is shared between DR and two molecular genetic life span extending interventions related to DR, increased dSir2 and decreased Dmp53 activity. We find twenty-one genes shared among the three related life span extending interventions. One of these genes, takeout, thought to be involved in circadian rhythms, feeding behavior and juvenile hormone binding is also increased in four other life span extending conditions: Rpd3, Indy, chico and methuselah. We demonstrate takeout is involved in longevity determination by specifically increasing adult takeout expression and extending life span. These studies demonstrate the power of comparative whole genome transcriptional profiling for identifying specific downstream elements of the DR life span extending pathway. PMID:20519778
Genome-wide transcriptional profiling of human glioblastoma cells in response to ITE treatment
Kang, Bo; Zhou, Yanwen; Zheng, Min; Wang, Ying-Jie
2015-01-01
A ligand-activated transcription factor aryl hydrocarbon receptor (AhR) is recently revealed to play a key role in embryogenesis and tumorigenesis (Feng et al. [1], Safe et al. [2]) and 2-(1′H-indole-3′-carbonyl)-thiazole-4-carboxylic acid methyl ester (ITE) (Song et al. [3]) is an endogenous AhR ligand that possesses anti-tumor activity. In order to gain insights into how ITE acts via the AhR in embryogenesis and tumorigenesis, we analyzed the genome-wide transcriptional profiles of the following three groups of cells: the human glioblastoma U87 parental cells, U87 tumor sphere cells treated with vehicle (DMSO) and U87 tumor sphere cells treated with ITE. Here, we provide the details of the sample gathering strategy and show the quality controls and the analyses associated with our gene array data deposited into the Gene Expression Omnibus (GEO) under the accession code of GSE67986. PMID:26484269
Pellacani, Davide; Bilenky, Misha; Kannan, Nagarajan; Heravi-Moussavi, Alireza; Knapp, David J H F; Gakkhar, Sitanshu; Moksa, Michelle; Carles, Annaick; Moore, Richard; Mungall, Andrew J; Marra, Marco A; Jones, Steven J M; Aparicio, Samuel; Hirst, Martin; Eaves, Connie J
2016-11-15
The normal adult human mammary gland is a continuous bilayered epithelial system. Bipotent and myoepithelial progenitors are prominent and unique components of the outer (basal) layer. The inner (luminal) layer includes both luminal-restricted progenitors and a phenotypically separable fraction that lacks progenitor activity. We now report an epigenomic comparison of these three subsets with one another, with their associated stromal cells, and with three immortalized, non-tumorigenic human mammary cell lines. Each genome-wide analysis contains profiles for six histone marks, methylated DNA, and RNA transcripts. Analysis of these datasets shows that each cell type has unique features, primarily within genomic regulatory regions, and that the cell lines group together. Analyses of the promoter and enhancer profiles place the luminal progenitors in between the basal cells and the non-progenitor luminal subset. Integrative analysis reveals networks of subset-specific transcription factors. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
FANTOM5 CAGE profiles of human and mouse samples.
Noguchi, Shuhei; Arakawa, Takahiro; Fukuda, Shiro; Furuno, Masaaki; Hasegawa, Akira; Hori, Fumi; Ishikawa-Kato, Sachi; Kaida, Kaoru; Kaiho, Ai; Kanamori-Katayama, Mutsumi; Kawashima, Tsugumi; Kojima, Miki; Kubosaki, Atsutaka; Manabe, Ri-Ichiroh; Murata, Mitsuyoshi; Nagao-Sato, Sayaka; Nakazato, Kenichi; Ninomiya, Noriko; Nishiyori-Sueki, Hiromi; Noma, Shohei; Saijyo, Eri; Saka, Akiko; Sakai, Mizuho; Simon, Christophe; Suzuki, Naoko; Tagami, Michihira; Watanabe, Shoko; Yoshida, Shigehiro; Arner, Peter; Axton, Richard A; Babina, Magda; Baillie, J Kenneth; Barnett, Timothy C; Beckhouse, Anthony G; Blumenthal, Antje; Bodega, Beatrice; Bonetti, Alessandro; Briggs, James; Brombacher, Frank; Carlisle, Ailsa J; Clevers, Hans C; Davis, Carrie A; Detmar, Michael; Dohi, Taeko; Edge, Albert S B; Edinger, Matthias; Ehrlund, Anna; Ekwall, Karl; Endoh, Mitsuhiro; Enomoto, Hideki; Eslami, Afsaneh; Fagiolini, Michela; Fairbairn, Lynsey; Farach-Carson, Mary C; Faulkner, Geoffrey J; Ferrai, Carmelo; Fisher, Malcolm E; Forrester, Lesley M; Fujita, Rie; Furusawa, Jun-Ichi; Geijtenbeek, Teunis B; Gingeras, Thomas; Goldowitz, Daniel; Guhl, Sven; Guler, Reto; Gustincich, Stefano; Ha, Thomas J; Hamaguchi, Masahide; Hara, Mitsuko; Hasegawa, Yuki; Herlyn, Meenhard; Heutink, Peter; Hitchens, Kelly J; Hume, David A; Ikawa, Tomokatsu; Ishizu, Yuri; Kai, Chieko; Kawamoto, Hiroshi; Kawamura, Yuki I; Kempfle, Judith S; Kenna, Tony J; Kere, Juha; Khachigian, Levon M; Kitamura, Toshio; Klein, Sarah; Klinken, S Peter; Knox, Alan J; Kojima, Soichi; Koseki, Haruhiko; Koyasu, Shigeo; Lee, Weonju; Lennartsson, Andreas; Mackay-Sim, Alan; Mejhert, Niklas; Mizuno, Yosuke; Morikawa, Hiromasa; Morimoto, Mitsuru; Moro, Kazuyo; Morris, Kelly J; Motohashi, Hozumi; Mummery, Christine L; Nakachi, Yutaka; Nakahara, Fumio; Nakamura, Toshiyuki; Nakamura, Yukio; Nozaki, Tadasuke; Ogishima, Soichi; Ohkura, Naganari; Ohno, Hiroshi; Ohshima, Mitsuhiro; Okada-Hatakeyama, Mariko; Okazaki, Yasushi; Orlando, Valerio; Ovchinnikov, Dmitry A; Passier, Robert; Patrikakis, Margaret; Pombo, Ana; Pradhan-Bhatt, Swati; Qin, Xian-Yang; Rehli, Michael; Rizzu, Patrizia; Roy, Sugata; Sajantila, Antti; Sakaguchi, Shimon; Sato, Hiroki; Satoh, Hironori; Savvi, Suzana; Saxena, Alka; Schmidl, Christian; Schneider, Claudio; Schulze-Tanzil, Gundula G; Schwegmann, Anita; Sheng, Guojun; Shin, Jay W; Sugiyama, Daisuke; Sugiyama, Takaaki; Summers, Kim M; Takahashi, Naoko; Takai, Jun; Tanaka, Hiroshi; Tatsukawa, Hideki; Tomoiu, Andru; Toyoda, Hiroo; van de Wetering, Marc; van den Berg, Linda M; Verardo, Roberto; Vijayan, Dipti; Wells, Christine A; Winteringham, Louise N; Wolvetang, Ernst; Yamaguchi, Yoko; Yamamoto, Masayuki; Yanagi-Mizuochi, Chiyo; Yoneda, Misako; Yonekura, Yohei; Zhang, Peter G; Zucchelli, Silvia; Abugessaisa, Imad; Arner, Erik; Harshbarger, Jayson; Kondo, Atsushi; Lassmann, Timo; Lizio, Marina; Sahin, Serkan; Sengstag, Thierry; Severin, Jessica; Shimoji, Hisashi; Suzuki, Masanori; Suzuki, Harukazu; Kawai, Jun; Kondo, Naoto; Itoh, Masayoshi; Daub, Carsten O; Kasukawa, Takeya; Kawaji, Hideya; Carninci, Piero; Forrest, Alistair R R; Hayashizaki, Yoshihide
2017-08-29
In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at a single base-pair resolution and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousands of samples, consisting of a variety of primary cells, tissues, cell lines, and time series samples during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The analysis pipeline started by measuring RNA extracts to assess their quality, and continued to CAGE library production by using a robotic or a manual workflow, single molecule sequencing, and computational processing to generate frequencies of transcription initiation. Resulting data represents the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, were identified and annotated to provide precise location of known promoters as well as novel ones, and to quantify their activities.
FANTOM5 CAGE profiles of human and mouse samples
Noguchi, Shuhei; Arakawa, Takahiro; Fukuda, Shiro; Furuno, Masaaki; Hasegawa, Akira; Hori, Fumi; Ishikawa-Kato, Sachi; Kaida, Kaoru; Kaiho, Ai; Kanamori-Katayama, Mutsumi; Kawashima, Tsugumi; Kojima, Miki; Kubosaki, Atsutaka; Manabe, Ri-ichiroh; Murata, Mitsuyoshi; Nagao-Sato, Sayaka; Nakazato, Kenichi; Ninomiya, Noriko; Nishiyori-Sueki, Hiromi; Noma, Shohei; Saijyo, Eri; Saka, Akiko; Sakai, Mizuho; Simon, Christophe; Suzuki, Naoko; Tagami, Michihira; Watanabe, Shoko; Yoshida, Shigehiro; Arner, Peter; Axton, Richard A.; Babina, Magda; Baillie, J. Kenneth; Barnett, Timothy C.; Beckhouse, Anthony G.; Blumenthal, Antje; Bodega, Beatrice; Bonetti, Alessandro; Briggs, James; Brombacher, Frank; Carlisle, Ailsa J.; Clevers, Hans C.; Davis, Carrie A.; Detmar, Michael; Dohi, Taeko; Edge, Albert S.B.; Edinger, Matthias; Ehrlund, Anna; Ekwall, Karl; Endoh, Mitsuhiro; Enomoto, Hideki; Eslami, Afsaneh; Fagiolini, Michela; Fairbairn, Lynsey; Farach-Carson, Mary C.; Faulkner, Geoffrey J.; Ferrai, Carmelo; Fisher, Malcolm E.; Forrester, Lesley M.; Fujita, Rie; Furusawa, Jun-ichi; Geijtenbeek, Teunis B.; Gingeras, Thomas; Goldowitz, Daniel; Guhl, Sven; Guler, Reto; Gustincich, Stefano; Ha, Thomas J.; Hamaguchi, Masahide; Hara, Mitsuko; Hasegawa, Yuki; Herlyn, Meenhard; Heutink, Peter; Hitchens, Kelly J.; Hume, David A.; Ikawa, Tomokatsu; Ishizu, Yuri; Kai, Chieko; Kawamoto, Hiroshi; Kawamura, Yuki I.; Kempfle, Judith S.; Kenna, Tony J.; Kere, Juha; Khachigian, Levon M.; Kitamura, Toshio; Klein, Sarah; Klinken, S. Peter; Knox, Alan J.; Kojima, Soichi; Koseki, Haruhiko; Koyasu, Shigeo; Lee, Weonju; Lennartsson, Andreas; Mackay-sim, Alan; Mejhert, Niklas; Mizuno, Yosuke; Morikawa, Hiromasa; Morimoto, Mitsuru; Moro, Kazuyo; Morris, Kelly J.; Motohashi, Hozumi; Mummery, Christine L.; Nakachi, Yutaka; Nakahara, Fumio; Nakamura, Toshiyuki; Nakamura, Yukio; Nozaki, Tadasuke; Ogishima, Soichi; Ohkura, Naganari; Ohno, Hiroshi; Ohshima, Mitsuhiro; Okada-Hatakeyama, Mariko; Okazaki, Yasushi; Orlando, Valerio; Ovchinnikov, Dmitry A.; Passier, Robert; Patrikakis, Margaret; Pombo, Ana; Pradhan-Bhatt, Swati; Qin, Xian-Yang; Rehli, Michael; Rizzu, Patrizia; Roy, Sugata; Sajantila, Antti; Sakaguchi, Shimon; Sato, Hiroki; Satoh, Hironori; Savvi, Suzana; Saxena, Alka; Schmidl, Christian; Schneider, Claudio; Schulze-Tanzil, Gundula G.; Schwegmann, Anita; Sheng, Guojun; Shin, Jay W.; Sugiyama, Daisuke; Sugiyama, Takaaki; Summers, Kim M.; Takahashi, Naoko; Takai, Jun; Tanaka, Hiroshi; Tatsukawa, Hideki; Tomoiu, Andru; Toyoda, Hiroo; van de Wetering, Marc; van den Berg, Linda M.; Verardo, Roberto; Vijayan, Dipti; Wells, Christine A.; Winteringham, Louise N.; Wolvetang, Ernst; Yamaguchi, Yoko; Yamamoto, Masayuki; Yanagi-Mizuochi, Chiyo; Yoneda, Misako; Yonekura, Yohei; Zhang, Peter G.; Zucchelli, Silvia; Abugessaisa, Imad; Arner, Erik; Harshbarger, Jayson; Kondo, Atsushi; Lassmann, Timo; Lizio, Marina; Sahin, Serkan; Sengstag, Thierry; Severin, Jessica; Shimoji, Hisashi; Suzuki, Masanori; Suzuki, Harukazu; Kawai, Jun; Kondo, Naoto; Itoh, Masayoshi; Daub, Carsten O.; Kasukawa, Takeya; Kawaji, Hideya; Carninci, Piero; Forrest, Alistair R.R.; Hayashizaki, Yoshihide
2017-01-01
In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at a single base-pair resolution and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousands of samples, consisting of a variety of primary cells, tissues, cell lines, and time series samples during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The analysis pipeline started by measuring RNA extracts to assess their quality, and continued to CAGE library production by using a robotic or a manual workflow, single molecule sequencing, and computational processing to generate frequencies of transcription initiation. Resulting data represents the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, were identified and annotated to provide precise location of known promoters as well as novel ones, and to quantify their activities. PMID:28850106
Aoyagi, Luciano N; Lopes-Caitar, Valéria S; de Carvalho, Mayra C C G; Darben, Luana M; Polizel-Podanosqui, Adriana; Kuwahara, Marcia K; Nepomuceno, Alexandre L; Abdelnoor, Ricardo V; Marcelino-Guimarães, Francismar C
2014-12-01
Myb genes constitute one of the largest transcription factor families in the plant kingdom. Soybean MYB transcription factors have been related to the plant response to biotic stresses. Their involvement in response to Phakopsora pachyrhizi infection has been reported by several transcriptional studies. Due to their apparently highly diverse functions, these genes are promising targets for developing crop varieties resistant to diseases. In the present study, the identification and phylogenetic analysis of the soybean R2R3-MYB (GmMYB) transcription factor family was performed and the expression profiles of these genes under biotic stress were determined. GmMYBs were identified from the soybean genome using bioinformatic tools, and their putative functions were determined based on the phylogenetic tree and classified into subfamilies using guides AtMYBs describing known functions. The transcriptional profiles of GmMYBs upon infection with different pathogen were revealed by in vivo and in silico analyses. Selected target genes potentially involved in disease responses were assessed by RT-qPCR after different times of inoculation with P. pachyrhizi using different genetic backgrounds related to resistance genes (Rpp2 and Rpp5). R2R3-MYB transcription factors related to lignin synthesis and genes responsive to chitin were significantly induced in the resistant genotypes. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Ong, Wen Dee; Voo, Lok-Yung Christopher; Kumar, Vijay Subbiah
2012-01-01
Pineapple (Ananas comosus var. comosus), is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanism and processes underlying fruit ripening would enable scientists to enhance the improvement of quality traits such as, flavor, texture, appearance and fruit sweetness. Although, the pineapple is an important fruit, there is insufficient transcriptomic or genomic information that is available in public databases. Application of high throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed. To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 millions Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%) which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown. The unique transcripts derived from this work have rapidly increased of the number of the pineapple fruit mRNA transcripts as it is now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple.
Ong, Wen Dee; Voo, Lok-Yung Christopher; Kumar, Vijay Subbiah
2012-01-01
Background Pineapple (Ananas comosus var. comosus), is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanism and processes underlying fruit ripening would enable scientists to enhance the improvement of quality traits such as, flavor, texture, appearance and fruit sweetness. Although, the pineapple is an important fruit, there is insufficient transcriptomic or genomic information that is available in public databases. Application of high throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed. Methodology/Principal Findings To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 millions Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%) which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown. Conclusions The unique transcripts derived from this work have rapidly increased of the number of the pineapple fruit mRNA transcripts as it is now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple. PMID:23091603
Principles of regulatory information conservation between mouse and human
Cheng, Yong; Ma, Zhihai; Kim, Bong-Hyun; ...
2014-11-19
To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human–mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and withmore » genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Lastly, single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences.« less
2010-01-01
Background The identification of non-coding transcripts in human, mouse, and Escherichia coli has revealed their widespread occurrence and functional importance in both eukaryotic and prokaryotic life. In prokaryotes, studies have shown that non-coding transcripts participate in a broad range of cellular functions like gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Streptococcus pneumoniae (pneumococcus), an obligate human respiratory pathogen responsible for significant worldwide morbidity and mortality. Tiling microarrays enable genome wide mRNA profiling as well as identification of novel transcripts at a high-resolution. Results Here, we describe a high-resolution transcription map of the S. pneumoniae clinical isolate TIGR4 using genomic tiling arrays. Our results indicate that approximately 66% of the genome is expressed under our experimental conditions. We identified a total of 50 non-coding small RNAs (sRNAs) from the intergenic regions, of which 36 had no predicted function. Half of the identified sRNA sequences were found to be unique to S. pneumoniae genome. We identified eight overrepresented sequence motifs among sRNA sequences that correspond to sRNAs in different functional categories. Tiling arrays also identified approximately 202 operon structures in the genome. Conclusions In summary, the pneumococcal operon structures and novel sRNAs identified in this study enhance our understanding of the complexity and extent of the pneumococcal 'expressed' genome. Furthermore, the results of this study open up new avenues of research for understanding the complex RNA regulatory network governing S. pneumoniae physiology and virulence. PMID:20525227
USDA-ARS?s Scientific Manuscript database
One focus of the Sorghum Translational Genomics Lab (part of sorghum CRIS, PSGD, CSRL, USDA-ARS, Lubbock TX) is to utilize nucleotide variation between sorghum germplasm such as those derived from RNA seq for translation and validation of Single Nucleotide Polymorphism (SNP) into easy access DNA m...
Bowman, Megan J.; Park, Wonkeun; Bauer, Philip J.; Udall, Joshua A.; Page, Justin T.; Raney, Joshua; Scheffler, Brian E.; Jones, Don. C.; Campbell, B. Todd
2013-01-01
An RNA-Seq experiment was performed using field grown well-watered and naturally rain fed cotton plants to identify differentially expressed transcripts under water-deficit stress. Our work constitutes the first application of the newly published diploid D5 Gossypium raimondii sequence in the study of tetraploid AD1 upland cotton RNA-seq transcriptome analysis. A total of 1,530 transcripts were differentially expressed between well-watered and water-deficit stressed root tissues, in patterns that confirm the accuracy of this technique for future studies in cotton genomics. Additionally, putative sequence based genome localization of differentially expressed transcripts detected A2 genome specific gene expression under water-deficit stress. These data will facilitate efforts to understand the complex responses governing transcriptomic regulatory mechanisms and to identify candidate genes that may benefit applied plant breeding programs. PMID:24324815
FANTOM5 CAGE profiles of human and mouse reprocessed for GRCh38 and GRCm38 genome assemblies.
Abugessaisa, Imad; Noguchi, Shuhei; Hasegawa, Akira; Harshbarger, Jayson; Kondo, Atsushi; Lizio, Marina; Severin, Jessica; Carninci, Piero; Kawaji, Hideya; Kasukawa, Takeya
2017-08-29
The FANTOM5 consortium described the promoter-level expression atlas of human and mouse by using CAGE (Cap Analysis of Gene Expression) with single molecule sequencing. In the original publications, GRCh37/hg19 and NCBI37/mm9 assemblies were used as the reference genomes of human and mouse respectively; later, the Genome Reference Consortium released newer genome assemblies GRCh38/hg38 and GRCm38/mm10. To increase the utility of the atlas in forthcoming researches, we reprocessed the data to make them available on the recent genome assemblies. The data include observed frequencies of transcription starting sites (TSSs) based on the realignment of CAGE reads, and TSS peaks that are converted from those based on the previous reference. Annotations of the peak names were also updated based on the latest public databases. The reprocessed results enable us to examine frequencies of transcription initiations on the recent genome assemblies and to refer promoters with updated information across the genome assemblies consistently.
2012-01-01
Background The biphasic life cycle with pelagic larva and benthic adult stages is widely observed in the animal kingdom, including the Porifera (sponges), which are the earliest branching metazoans. The demosponge, Amphimedon queenslandica, undergoes metamorphosis from a free-swimming larva into a sessile adult that bears no morphological resemblance to other animals. While the genome of A. queenslandica contains an extensive repertoire of genes very similar to that of complex bilaterians, it is as yet unclear how this is drawn upon to coordinate changing morphological features and ecological demands throughout the sponge life cycle. Results To identify genome-wide events that accompany the pelagobenthic transition in A. queenslandica, we compared global gene expression profiles at four key developmental stages by sequencing the poly(A) transcriptome using SOLiD technology. Large-scale changes in transcription were observed as sponge larvae settled on the benthos and began metamorphosis. Although previous systematics suggest that the only clear homology between Porifera and other animals is in the embryonic and larval stages, we observed extensive use of genes involved in metazoan-associated cellular processes throughout the sponge life cycle. Sponge-specific transcripts are not over-represented in the morphologically distinct adult; rather, many genes that encode typical metazoan features, such as cell adhesion and immunity, are upregulated. Our analysis further revealed gene families with candidate roles in competence, settlement, and metamorphosis in the sponge, including transcription factors, G-protein coupled receptors and other signaling molecules. Conclusions This first genome-wide study of the developmental transcriptome in an early branching metazoan highlights major transcriptional events that accompany the pelagobenthic transition and point to a network of regulatory mechanisms that coordinate changes in morphology with shifting environmental demands. Metazoan developmental and structural gene orthologs are well-integrated into the expression profiles at every stage of sponge development, including the adult. The utilization of genes involved in metazoan-associated processes throughout sponge development emphasizes the potential of the genome of the last common ancestor of animals to generate phenotypic complexity. PMID:22646746
Bloom, Chloe I.; Graham, Christine M.; Berry, Matthew P. R.; Rozakeas, Fotini; Redford, Paul S.; Wang, Yuanyuan; Xu, Zhaohui; Wilkinson, Katalin A.; Wilkinson, Robert J.; Kendrick, Yvonne; Devouassoux, Gilles; Ferry, Tristan; Miyara, Makoto; Bouvry, Diane; Dominique, Valeyre; Gorochov, Guy; Blankenship, Derek; Saadatian, Mitra; Vanhems, Phillip; Beynon, Huw; Vancheeswaran, Rama; Wickremasinghe, Melissa; Chaussabel, Damien; Banchereau, Jacques; Pascual, Virginia; Ho, Ling-pei; Lipman, Marc; O’Garra, Anne
2013-01-01
Rationale New approaches to define factors underlying the immunopathogenesis of pulmonary diseases including sarcoidosis and tuberculosis are needed to develop new treatments and biomarkers. Comparing the blood transcriptional response of tuberculosis to other similar pulmonary diseases will advance knowledge of disease pathways and help distinguish diseases with similar clinical presentations. Objectives To determine the factors underlying the immunopathogenesis of the granulomatous diseases, sarcoidosis and tuberculosis, by comparing the blood transcriptional responses in these and other pulmonary diseases. Methods We compared whole blood genome-wide transcriptional profiles in pulmonary sarcoidosis, pulmonary tuberculosis, to community acquired pneumonia and primary lung cancer and healthy controls, before and after treatment, and in purified leucocyte populations. Measurements and Main Results An Interferon-inducible neutrophil-driven blood transcriptional signature was present in both sarcoidosis and tuberculosis, with a higher abundance and expression in tuberculosis. Heterogeneity of the sarcoidosis signature correlated significantly with disease activity. Transcriptional profiles in pneumonia and lung cancer revealed an over-abundance of inflammatory transcripts. After successful treatment the transcriptional activity in tuberculosis and pneumonia patients was significantly reduced. However the glucocorticoid-responsive sarcoidosis patients showed a significant increase in transcriptional activity. 144-blood transcripts were able to distinguish tuberculosis from other lung diseases and controls. Conclusions Tuberculosis and sarcoidosis revealed similar blood transcriptional profiles, dominated by interferon-inducible transcripts, while pneumonia and lung cancer showed distinct signatures, dominated by inflammatory genes. There were also significant differences between tuberculosis and sarcoidosis in the degree of their transcriptional activity, the heterogeneity of their profiles and their transcriptional response to treatment. PMID:23940611
oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes
Ho Sui, Shannan J.; Mortimer, James R.; Arenillas, David J.; Brumm, Jochen; Walsh, Christopher J.; Kennedy, Brian P.; Wasserman, Wyeth W.
2005-01-01
Targeted transcript profiling studies can identify sets of co-expressed genes; however, identification of the underlying functional mechanism(s) is a significant challenge. Established methods for the analysis of gene annotations, particularly those based on the Gene Ontology, can identify functional linkages between genes. Similar methods for the identification of over-represented transcription factor binding sites (TFBSs) have been successful in yeast, but extension to human genomics has largely proved ineffective. Creation of a system for the efficient identification of common regulatory mechanisms in a subset of co-expressed human genes promises to break a roadblock in functional genomics research. We have developed an integrated system that searches for evidence of co-regulation by one or more transcription factors (TFs). oPOSSUM combines a pre-computed database of conserved TFBSs in human and mouse promoters with statistical methods for identification of sites over-represented in a set of co-expressed genes. The algorithm successfully identified mediating TFs in control sets of tissue-specific genes and in sets of co-expressed genes from three transcript profiling studies. Simulation studies indicate that oPOSSUM produces few false positives using empirically defined thresholds and can tolerate up to 50% noise in a set of co-expressed genes. PMID:15933209
Campos, Bruno; Fletcher, Danielle; Piña, Benjamín; Tauler, Romà; Barata, Carlos
2018-05-18
Unravelling the link between genes and environment across the life cycle is a challenging goal that requires model organisms with well-characterized life-cycles, ecological interactions in nature, tractability in the laboratory, and available genomic tools. Very few well-studied invertebrate model species meet these requirements, being the waterflea Daphnia magna one of them. Here we report a full genome transcription profiling of D. magna during its life-cycle. The study was performed using a new microarray platform designed from the complete set of gene models representing the whole transcribed genome of D. magna. Up to 93% of the existing 41,317 D. magna gene models showed differential transcription patterns across the developmental stages of D. magna, 59% of which were functionally annotated. Embryos showed the highest number of unique transcribed genes, mainly related to DNA, RNA, and ribosome biogenesis, likely related to cellular proliferation and morphogenesis of the several body organs. Adult females showed an enrichment of transcripts for genes involved in reproductive processes. These female-specific transcripts were essentially absent in males, whose transcriptome was enriched in specific genes of male sexual differentiation genes, like doublesex. Our results define major characteristics of transcriptional programs involved in the life-cycle, differentiate males and females, and show that large scale gene-transcription data collected in whole animals can be used to identify genes involved in specific biological and biochemical processes.
2010-01-01
Background Pollen development from the microspore involves a series of coordinated cellular events, and the resulting mature pollen has a specialized function to quickly germinate, produce a polar-growth pollen tube derived from the vegetative cell, and deliver two sperm cells into the embryo sac for double fertilization. The gene expression profiles of developing and germinated pollen have been characterised by use of the eudicot model plant Arabidopsis. Rice, one of the most important cereal crops, has been used as an excellent monocot model. A comprehensive analysis of transcriptome profiles of developing and germinated pollen in rice is important to understand the conserved and diverse mechanism underlying pollen development and germination in eudicots and monocots. Results We used Affymetrix GeneChip® Rice Genome Array to comprehensively analyzed the dynamic changes in the transcriptomes of rice pollen at five sequential developmental stages from microspores to germinated pollen. Among the 51,279 transcripts on the array, we found 25,062 pollen-preferential transcripts, among which 2,203 were development stage-enriched. The diversity of transcripts decreased greatly from microspores to mature and germinated pollen, whereas the number of stage-enriched transcripts displayed a "U-type" change, with the lowest at the bicellular pollen stage; and a transition of overrepresented stage-enriched transcript groups associated with different functional categories, which indicates a shift in gene expression program at the bicellular pollen stage. About 54% of the now-annotated rice F-box protein genes were expressed preferentially in pollen. The transcriptome profile of germinated pollen was significantly and positively correlated with that of mature pollen. Analysis of expression profiles and coexpressed features of the pollen-preferential transcripts related to cell cycle, transcription, the ubiquitin/26S proteasome system, phytohormone signalling, the kinase system and defense/stress response revealed five expression patterns, which are compatible with changes in major cellular events during pollen development and germination. A comparison of pollen transcriptomes between rice and Arabidopsis revealed that 56.6% of the rice pollen preferential genes had homologs in Arabidopsis genome, but 63.4% of these homologs were expressed, with a small proportion being expressed preferentially, in Arabidopsis pollen. Rice and Arabidopsis pollen had non-conservative transcription factors each. Conclusions Our results demonstrated that rice pollen expressed a set of reduced but specific transcripts in comparison with vegetative tissues, and the number of stage-enriched transcripts displayed a "U-type" change during pollen development, with the lowest at the bicellular pollen stage. These features are conserved in rice and Arabidopsis. The shift in gene expression program at the bicellular pollen stage may be important to the transition from earlier cell division to later pollen maturity. Pollen at maturity pre-synthesized transcripts needed for germination and early pollen tube growth. The transcription regulation associated with pollen development would have divergence between the two species. Our results also provide novel insights into the molecular program and key components of the regulatory network regulating pollen development and germination. PMID:20507633
Beird, Hannah C.; Wu, Chia-Chin; Ingram, Davis R.; Wang, Wei-Lien; Alimohamed, Asrar; Gumbs, Curtis; Little, Latasha; Song, Xingzhi; Feig, Barry W.; Roland, Christina L.; Zhang, Jianhua; Benjamin, Robert S.; Hwu, Patrick; Lazar, Alexander J.; Futreal, P. Andrew; Somaiah, Neeta
2018-01-01
Well-differentiated (WD) liposarcoma is a low-grade mesenchymal tumor with features of mature adipocytes and high propensity for local recurrence. Often, WD patients present with or later progress to a higher-grade nonlipogenic form known as dedifferentiated (DD) liposarcoma. These DD tumors behave more aggressively and can metastasize. Both WD and DD liposarcomas harbor neochromosomes formed from amplifications and rearrangements of Chr 12q that encode oncogenes (MDM2, CDK4, and YEATS2) and adipocytic differentiation factors (HMGA2 and CPM). However, genomic changes associated with progression from WD to DD have not been well-defined. Therefore, we selected patients with matched WD and DD tumors for extensive genomic profiling in order to understand their clonal relationships and to delineate any defining alterations for each entity. Exome and transcriptomic sequencing was performed for 17 patients with both WD and DD diagnoses. Somatic point and copy-number alterations were integrated with transcriptional analyses to determine subtype-associated genomic features and pathways. The results were, on average, that only 8.3% of somatic mutations in WD liposarcoma were shared with their cognate DD component. DD tumors had higher numbers of somatic copy-number losses, amplifications involving Chr 12q, and fusion transcripts than WD tumors. HMGA2 and CPM rearrangements occur more frequently in DD components. The shared somatic mutations indicate a clonal origin for matched WD and DD tumors and show early divergence with ongoing genomic instability due to continual generation and selection of neochromosomes. Stochastic generation and subsequent expression of fusion transcripts from the neochromosome that involve adipogenesis genes such as HMGA2 and CPM may influence the differentiation state of the subsequent tumor. PMID:29610390
Filatov, Victor; Dowdle, John; Smirnoff, Nicholas; Ford-Lloyd, Brian; Newbury, H John; Macnair, Mark R
2006-09-01
One of the challenges of comparative genomics is to identify specific genetic changes associated with the evolution of a novel adaptation or trait. We need to be able to disassociate the genes involved with a particular character from all the other genetic changes that take place as lineages diverge. Here we show that by comparing the transcriptional profile of segregating families with that of parent species differing in a novel trait, it is possible to narrow down substantially the list of potential target genes. In addition, by assuming synteny with a related model organism for which the complete genome sequence is available, it is possible to use the cosegregation of markers differing in transcription level to identify regions of the genome which probably contain quantitative trait loci (QTLs) for the character. This novel combination of genomics and classical genetics provides a very powerful tool to identify candidate genes. We use this methodology to investigate zinc hyperaccumulation in Arabidopsis halleri, the sister species to the model plant, Arabidopsis thaliana. We compare the transcriptional profile of A. halleri with that of its sister nonaccumulator species, Arabidopsis petraea, and between accumulator and nonaccumulator F(3)s derived from the cross between the two species. We identify eight genes which consistently show greater expression in accumulator phenotypes in both roots and shoots, including two metal transporter genes (NRAMP3 and ZIP6), and cytoplasmic aconitase, a gene involved in iron homeostasis in mammals. We also show that there appear to be two QTLs for zinc accumulation, on chromosomes 3 and 7.
Yu, Hong; Soler, Marçal; Mila, Isabelle; San Clemente, Hélène; Savelli, Bruno; Dunand, Christophe; Paiva, Jorge A. P.; Myburg, Alexander A.; Bouzayen, Mondher; Grima-Pettenati, Jacqueline; Cassan-Wang, Hua
2014-01-01
Auxin is a central hormone involved in a wide range of developmental processes including the specification of vascular stem cells. Auxin Response Factors (ARF) are important actors of the auxin signalling pathway, regulating the transcription of auxin-responsive genes through direct binding to their promoters. The recent availability of the Eucalyptus grandis genome sequence allowed us to examine the characteristics and evolutionary history of this gene family in a woody plant of high economic importance. With 17 members, the E. grandis ARF gene family is slightly contracted, as compared to those of most angiosperms studied hitherto, lacking traces of duplication events. In silico analysis of alternative transcripts and gene truncation suggested that these two mechanisms were preeminent in shaping the functional diversity of the ARF family in Eucalyptus. Comparative phylogenetic analyses with genomes of other taxonomic lineages revealed the presence of a new ARF clade found preferentially in woody and/or perennial plants. High-throughput expression profiling among different organs and tissues and in response to environmental cues highlighted genes expressed in vascular cambium and/or developing xylem, responding dynamically to various environmental stimuli. Finally, this study allowed identification of three ARF candidates potentially involved in the auxin-regulated transcriptional program underlying wood formation. PMID:25269088
USDA-ARS?s Scientific Manuscript database
Common bean (Phaseolus vulgaris) and soybean (Glycine max) both belong to the Phaseoleae tribe and share significant coding sequence homology. This suggests that the GeneChip(R) Soybean Genome Array (soybean GeneChip) may be used for gene expression studies using common bean. To evaluate the utility...
USDA-ARS?s Scientific Manuscript database
This study reports generation of large-scale genomic resources for pigeonpea, a so-called ‘orphan crop species’ of the semi-arid tropic regions. Roche FLX/454 sequencing was carried out on a normalized cDNA pool prepared from 31 tissues produced 494,353 short transcript reads (STRs). Cluster analysi...
Miller, Gregory E; Chen, Edith; Shalowitz, Madeleine U; Story, Rachel E; Leigh, Adam K K; Ham, Paula; Arevalo, Jesusa M G; Cole, Steve W
2018-06-01
There are marked socioeconomic disparities in pediatric asthma control, but the molecular origins of these disparities are not well understood. To fill this gap, we performed genome-wide expression profiling of monocytes and T-helper cells from pediatric asthma patients of lower and higher socioeconomic status (SES). Ninety-nine children with asthma participated in a cross-sectional assessment. Out of which 87% were atopic, and most had disease of mild (54%) or moderate (29%) severity. Children were from lower-SES (n = 49; household income <$50 000) or higher-SES (n = 50; household income >$140 000) families. Peripheral blood monocytes and T-helper cells were isolated for genome-wide expression profiling of mRNA. Lower-SES children had worse asthma quality of life relative to higher-SES children, by both their own and their parents' reports. Although the groups had similar disease severity and potential confounds were controlled, their transcriptional profiles differed notably. The monocytes of lower-SES children showed transcriptional indications of up-regulated anti-microbial and pro-inflammatory activity. The T-helper cells of lower-SES children also had comparatively reduced expression of genes encoding γ-interferon and tumor necrosis factor-α, cytokines that orchestrate Type 1 responses. They also showed up-regulated activity of transcription factors that polarize cells towards Type 2 responses and promote Th17 cell maturation. Collectively, these patterns implicate pro-inflammatory monocytes and Type 2 cytokine activity as mechanisms contributing to worse asthma control among lower-SES children. © 2018 Wiley Periodicals, Inc.
GWIPS-viz: development of a ribo-seq genome browser
Michel, Audrey M.; Fox, Gearoid; M. Kiran, Anmol; De Bo, Christof; O’Connor, Patrick B. F.; Heaphy, Stephen M.; Mullan, James P. A.; Donohue, Claire A.; Higgins, Desmond G.; Baranov, Pavel V.
2014-01-01
We describe the development of GWIPS-viz (http://gwips.ucc.ie), an online genome browser for viewing ribosome profiling data. Ribosome profiling (ribo-seq) is a recently developed technique that provides genome-wide information on protein synthesis (GWIPS) in vivo. It is based on the deep sequencing of ribosome-protected messenger RNA (mRNA) fragments, which allows the ribosome density along all mRNA transcripts present in the cell to be quantified. Since its inception, ribo-seq has been carried out in a number of eukaryotic and prokaryotic organisms. Owing to the increasing interest in ribo-seq, there is a pertinent demand for a dedicated ribo-seq genome browser. GWIPS-viz is based on The University of California Santa Cruz (UCSC) Genome Browser. Ribo-seq tracks, coupled with mRNA-seq tracks, are currently available for several genomes: human, mouse, zebrafish, nematode, yeast, bacteria (Escherichia coli K12, Bacillus subtilis), human cytomegalovirus and bacteriophage lambda. Our objective is to continue incorporating published ribo-seq data sets so that the wider community can readily view ribosome profiling information from multiple studies without the need to carry out computational processing. PMID:24185699
Kuuluvainen, Emilia; Hakala, Heini; Havula, Essi; Sahal Estimé, Michelle; Rämet, Mika; Hietakangas, Ville; Mäkelä, Tomi P
2014-06-06
The Cdk8 (cyclin-dependent kinase 8) module of Mediator integrates regulatory cues from transcription factors to RNA polymerase II. It consists of four subunits where Med12 and Med13 link Cdk8 and cyclin C (CycC) to core Mediator. Here we have investigated the contributions of the Cdk8 module subunits to transcriptional regulation using RNA interference in Drosophila cells. Genome-wide expression profiling demonstrated separation of Cdk8-CycC and Med12-Med13 profiles. However, transcriptional regulation by Cdk8-CycC was dependent on Med12-Med13. This observation also revealed that Cdk8-CycC and Med12-Med13 often have opposite transcriptional effects. Interestingly, Med12 and Med13 profiles overlapped significantly with that of the GATA factor Serpent. Accordingly, mutational analyses indicated that GATA sites are required for Med12-Med13 regulation of Serpent-dependent genes. Med12 and Med13 were also found to be required for Serpent-activated innate immunity genes in defense to bacterial infection. The results reveal a novel role for the Cdk8 module in Serpent-dependent transcription and innate immunity. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.
Genes on B chromosomes: old questions revisited with new tools.
Banaei-Moghaddam, Ali M; Martis, Mihaela M; Macas, Jiří; Gundlach, Heidrun; Himmelbach, Axel; Altschmied, Lothar; Mayer, Klaus F X; Houben, Andreas
2015-01-01
B chromosomes are supernumerary dispensable parts of the karyotype which appear in some individuals of some populations in some species. Often, they have been considered as 'junk DNA' or genomic parasites without functional genes. Due to recent advances in sequencing technologies, it became possible to investigate their DNA composition, transcriptional activity and effects on the host transcriptome profile in detail. Here, we review the most recent findings regarding the gene content of B chromosomes and their transcriptional activities and discuss these findings in the context of comparable biological phenomena, like sex chromosomes, aneuploidy and pseudogenes. Recent data suggest that B chromosomes carry transcriptionally active genic sequences which could affect the transcriptome profile of their host genome. These findings are gradually changing our view that B chromosomes are solely genetically inert selfish elements without any functional genes. This at one side could partly explain the deleterious effects which are associated with their presence. On the other hand it makes B chromosome a nice model for studying regulatory mechanisms of duplicated genes and their evolutionary consequences. Copyright © 2014 Elsevier B.V. All rights reserved.
Moqtaderi, Zarmik; Wang, Jie; Raha, Debasish; White, Robert J; Snyder, Michael; Weng, Zhiping; Struhl, Kevin
2010-05-01
Genome-wide occupancy profiles of five components of the RNA polymerase III (Pol III) machinery in human cells identified the expected tRNA and noncoding RNA targets and revealed many additional Pol III-associated loci, mostly near short interspersed elements (SINEs). Several genes are targets of an alternative transcription factor IIIB (TFIIIB) containing Brf2 instead of Brf1 and have extremely low levels of TFIIIC. Strikingly, expressed Pol III genes, unlike nonexpressed Pol III genes, are situated in regions with a pattern of histone modifications associated with functional Pol II promoters. TFIIIC alone associates with numerous ETC loci, via the B box or a novel motif. ETCs are often near CTCF binding sites, suggesting a potential role in chromosome organization. Our results suggest that human Pol III complexes associate preferentially with regions near functional Pol II promoters and that TFIIIC-mediated recruitment of TFIIIB is regulated in a locus-specific manner.
Tomlinson, Gillian S.; Booth, Helen; Petit, Sarah J.; Potton, Elspeth; Towers, Greg J.; Miller, Robert F.; Chain, Benjamin M.; Noursadeghi, Mahdad
2012-01-01
Alveolar macrophages (AM) are thought to have a key role in the immunopathogenesis of respiratory diseases. We sought to test the hypothesis that human AM exhibit an anti-inflammatory bias by making genome-wide comparisons with monocyte derived macrophages (MDM). Adherent AM obtained by bronchoalveolar lavage of patients under investigation for haemoptysis, but found to have no respiratory pathology, were compared to MDM from healthy volunteers by whole genome transcriptional profiling before and after innate immune stimulation. We found that freshly isolated AM exhibited a marked pro-inflammatory transcriptional signature. High levels of basal pro-inflammatory gene expression gave the impression of attenuated responses to lipopolysaccharide (LPS) and the RNA analogue, poly IC, but in rested cells pro-inflammatory gene expression declined and transcriptional responsiveness to these stimuli was restored. In comparison to MDM, both freshly isolated and rested AM showed upregulation of MHC class II molecules. In most experimental paradigms ex vivo adherent AM are used immediately after isolation. Therefore, the confounding effects of their pro-inflammatory profile at baseline need careful consideration. Moreover, despite the prevailing view that AM have an anti-inflammatory bias, our data clearly show that they can adopt a striking pro-inflammatory phenotype, and may have greater capacity for presentation of exogenous antigens than MDM. PMID:22768282
Transcriptome profiling reveals mosaic genomic origins of modern cultivated barley.
Dai, Fei; Chen, Zhong-Hua; Wang, Xiaolei; Li, Zefeng; Jin, Gulei; Wu, Dezhi; Cai, Shengguan; Wang, Ning; Wu, Feibo; Nevo, Eviatar; Zhang, Guoping
2014-09-16
The domestication of cultivated barley has been used as a model system for studying the origins and early spread of agrarian culture. Our previous results indicated that the Tibetan Plateau and its vicinity is one of the centers of domestication of cultivated barley. Here we reveal multiple origins of domesticated barley using transcriptome profiling of cultivated and wild-barley genotypes. Approximately 48-Gb of clean transcript sequences in 12 Hordeum spontaneum and 9 Hordeum vulgare accessions were generated. We reported 12,530 de novo assembled transcripts in all of the 21 samples. Population structure analysis showed that Tibetan hulless barley (qingke) might have existed in the early stage of domestication. Based on the large number of unique genomic regions showing the similarity between cultivated and wild-barley groups, we propose that the genomic origin of modern cultivated barley is derived from wild-barley genotypes in the Fertile Crescent (mainly in chromosomes 1H, 2H, and 3H) and Tibet (mainly in chromosomes 4H, 5H, 6H, and 7H). This study indicates that the domestication of barley may have occurred over time in geographically distinct regions.
GST-PRIME: an algorithm for genome-wide primer design.
Leister, Dario; Varotto, Claudio
2007-01-01
The profiling of mRNA expression based on DNA arrays has become a powerful tool to study genome-wide transcription of genes in a number of organisms. GST-PRIME is a software package created to facilitate large-scale primer design for the amplification of probes to be immobilized on arrays for transcriptome analyses, even though it can be also applied in low-throughput approaches. GST-PRIME allows highly efficient, direct amplification of gene-sequence tags (GSTs) from genomic DNA (gDNA), starting from annotated genome or transcript sequences. GST-PRIME provides a customer-friendly platform for automatic primer design, and despite the relative simplicity of the algorithm, experimental tests in the model plant species Arabidopsis thaliana confirmed the reliability of the software. This chapter describes the algorithm used for primer design, its input and output files, and the installation of the standalone package and its use.
Weissgerber, Thomas; Dobler, Nadine; Polen, Tino; Latus, Jeanette; Stockdreher, Yvonne
2013-01-01
The purple sulfur bacterium Allochromatium vinosum DSM 180T is one of the best-studied sulfur-oxidizing anoxygenic phototrophic bacteria, and it has been developed into a model organism for laboratory-based studies of oxidative sulfur metabolism. Here, we took advantage of the organism's high metabolic versatility and performed whole-genome transcriptional profiling to investigate the response of A. vinosum cells upon exposure to sulfide, thiosulfate, elemental sulfur, or sulfite compared to photoorganoheterotrophic growth on malate. Differential expression of 1,178 genes was observed, corresponding to 30% of the A. vinosum genome. Relative transcription of 551 genes increased significantly during growth on one of the different sulfur sources, while the relative transcript abundance of 627 genes decreased. A significant number of genes that revealed strongly enhanced relative transcription levels have documented sulfur metabolism-related functions. Among these are the dsr genes, including dsrAB for dissimilatory sulfite reductase, and the sgp genes for the proteins of the sulfur globule envelope, thus confirming former results. In addition, we identified new genes encoding proteins with appropriate subcellular localization and properties to participate in oxidative dissimilatory sulfur metabolism. Those four genes for hypothetical proteins that exhibited the strongest increases of mRNA levels on sulfide and elemental sulfur, respectively, were chosen for inactivation and phenotypic analyses of the respective mutant strains. This approach verified the importance of the encoded proteins for sulfur globule formation during the oxidation of sulfide and thiosulfate and thereby also documented the suitability of comparative transcriptomics for the identification of new sulfur-related genes in anoxygenic phototrophic sulfur bacteria. PMID:23873913
Genome-wide profiling of PRC1 and PRC2 Polycomb chromatin binding in Drosophila melanogaster.
Tolhuis, Bas; de Wit, Elzo; Muijrers, Inhua; Teunissen, Hans; Talhout, Wendy; van Steensel, Bas; van Lohuizen, Maarten
2006-06-01
Polycomb group (PcG) proteins maintain transcriptional repression of developmentally important genes and have been implicated in cell proliferation and stem cell self-renewal. We used a genome-wide approach to map binding patterns of PcG proteins (Pc, esc and Sce) in Drosophila melanogaster Kc cells. We found that Pc associates with large genomic regions of up to approximately 150 kb in size, hereafter referred to as 'Pc domains'. Sce and esc accompany Pc in most of these domains. PcG-bound chromatin is trimethylated at histone H3 Lys27 and is generally transcriptionally silent. Furthermore, PcG proteins preferentially bind to developmental genes. Many of these encode transcriptional regulators and key components of signal transduction pathways, including Wingless, Hedgehog, Notch and Delta. We also identify several new putative functions of PcG proteins, such as in steroid hormone biosynthesis. These results highlight the extensive involvement of PcG proteins in the coordination of development through the formation of large repressive chromatin domains.
Brasa, Sarah; Teo, Soon-Siong; Roloff, Tim-Christoph; Morawiec, Laurent; Zamurovic, Natasa; Vicart, Axel; Funhoff, Enrico; Couttet, Philippe; Schübeler, Dirk; Grenet, Olivier; Marlowe, Jennifer; Moggs, Jonathan; Terranova, Rémi
2011-01-01
Evidence suggests that epigenetic perturbations are involved in the adverse effects associated with some drugs and toxicants, including certain classes of non-genotoxic carcinogens. Such epigenetic changes (altered DNA methylation and covalent histone modifications) may take place at the earliest stages of carcinogenesis and their identification holds great promise for biomedical research. Here, we evaluate the sensitivity and specificity of genome-wide epigenomic and transcriptomic profiling in phenobarbital (PB)-treated B6C3F1 mice, a well-characterized rodent model of non-genotoxic liver carcinogenesis. Methylated DNA Immunoprecipitation (MeDIP)-coupled microarray profiling of 17,967 promoter regions and 4,566 intergenic CpG islands was combined with genome-wide mRNA expression profiling to identify liver tissue-specific PB-mediated DNA methylation and transcriptional alterations. Only a limited number of significant anti-correlations were observed between PB-induced transcriptional and promoter-based DNA methylation perturbations. However, the constitutive androstane receptor (CAR) target gene Cyp2b10 was found to be concomitantly hypomethylated and transcriptionally activated in a liver tissue-specific manner following PB treatment. Furthermore, analysis of active and repressive histone modifications using chromatin immunoprecipitation revealed a strong PB-mediated epigenetic switch at the Cyp2b10 promoter. Our data reveal that PB-induced transcriptional perturbations are not generally associated with broad changes in the DNA methylation status at proximal promoters and suggest that the drug-inducible CAR pathway regulates an epigenetic switch from repressive to active chromatin at the target gene Cyp2b10. This study demonstrates the utility of integrated epigenomic and transcriptomic profiling for elucidating early mechanisms and biomarkers of non-genotoxic carcinogenesis. PMID:21455306
Lempiäinen, Harri; Müller, Arne; Brasa, Sarah; Teo, Soon-Siong; Roloff, Tim-Christoph; Morawiec, Laurent; Zamurovic, Natasa; Vicart, Axel; Funhoff, Enrico; Couttet, Philippe; Schübeler, Dirk; Grenet, Olivier; Marlowe, Jennifer; Moggs, Jonathan; Terranova, Rémi
2011-03-24
Evidence suggests that epigenetic perturbations are involved in the adverse effects associated with some drugs and toxicants, including certain classes of non-genotoxic carcinogens. Such epigenetic changes (altered DNA methylation and covalent histone modifications) may take place at the earliest stages of carcinogenesis and their identification holds great promise for biomedical research. Here, we evaluate the sensitivity and specificity of genome-wide epigenomic and transcriptomic profiling in phenobarbital (PB)-treated B6C3F1 mice, a well-characterized rodent model of non-genotoxic liver carcinogenesis. Methylated DNA Immunoprecipitation (MeDIP)-coupled microarray profiling of 17,967 promoter regions and 4,566 intergenic CpG islands was combined with genome-wide mRNA expression profiling to identify liver tissue-specific PB-mediated DNA methylation and transcriptional alterations. Only a limited number of significant anti-correlations were observed between PB-induced transcriptional and promoter-based DNA methylation perturbations. However, the constitutive androstane receptor (CAR) target gene Cyp2b10 was found to be concomitantly hypomethylated and transcriptionally activated in a liver tissue-specific manner following PB treatment. Furthermore, analysis of active and repressive histone modifications using chromatin immunoprecipitation revealed a strong PB-mediated epigenetic switch at the Cyp2b10 promoter. Our data reveal that PB-induced transcriptional perturbations are not generally associated with broad changes in the DNA methylation status at proximal promoters and suggest that the drug-inducible CAR pathway regulates an epigenetic switch from repressive to active chromatin at the target gene Cyp2b10. This study demonstrates the utility of integrated epigenomic and transcriptomic profiling for elucidating early mechanisms and biomarkers of non-genotoxic carcinogenesis.
Mahajan, Prashant; Kuppermann, Nathan; Suarez, Nicolas; Mejias, Asuncion; Casper, Charlie; Dean, J Michael; Ramilo, Octavio
2015-01-01
To develop the infrastructure and demonstrate the feasibility of conducting microarray-based RNA transcriptional profile analyses for the diagnosis of serious bacterial infections in febrile infants 60 days and younger in a multicenter pediatric emergency research network. We designed a prospective multicenter cohort study with the aim of enrolling more than 4000 febrile infants 60 days and younger. To ensure success of conducting complex genomic studies in emergency department (ED) settings, we established an infrastructure within the Pediatric Emergency Care Applied Research Network, including 21 sites, to evaluate RNA transcriptional profiles in young febrile infants. We developed a comprehensive manual of operations and trained site investigators to obtain and process blood samples for RNA extraction and genomic analyses. We created standard operating procedures for blood sample collection, processing, storage, shipping, and analyses. We planned to prospectively identify, enroll, and collect 1 mL blood samples for genomic analyses from eligible patients to identify logistical issues with study procedures. Finally, we planned to batch blood samples and determined RNA quantity and quality at the central microarray laboratory and organized data analysis with the Pediatric Emergency Care Applied Research Network data coordinating center. Below we report on establishment of the infrastructure and the feasibility success in the first year based on the enrollment of a limited number of patients. We successfully established the infrastructure at 21 EDs. Over the first 5 months we enrolled 79% (74 of 94) of eligible febrile infants. We were able to obtain and ship 1 mL of blood from 74% (55 of 74) of enrolled participants, with at least 1 sample per participating ED. The 55 samples were shipped and evaluated at the microarray laboratory, and 95% (52 of 55) of blood samples were of adequate quality and contained sufficient RNA for expression analysis. It is possible to create a robust infrastructure to conduct genomic studies in young febrile infants in the context of a multicenter pediatric ED research setting. The sufficient quantity and high quality of RNA obtained suggests that whole blood transcriptional profile analysis for the diagnostic evaluation of young febrile infants can be successfully performed in this setting.
Yoo, Eung Jae; Cajiao, Isabela; Kim, Jeong-Seon; Kimura, Atsushi P.; Zhang, Aiwen; Cooke, Nancy E.; Liebhaber, Stephen A.
2006-01-01
Random assortment within mammalian genomes juxtaposes genes with distinct expression profiles. This organization, along with the prevalence of long-range regulatory controls, generates a potential for aberrant transcriptional interactions. The human CD79b/GH locus contains six tightly linked genes with three mutually exclusive tissue specificities and interdigitated control elements. One consequence of this compact organization is that the pituitarycell-specific transcriptional events that activate hGH-N also trigger ectopic activation of CD79b. However, the B-cell-specific events that activate CD79b do not trigger reciprocal activation of hGH-N. Here we utilized DNase I hypersensitive site mapping, chromatin immunoprecipitation, and transgenic models to explore the basis for this asymmetric relationship. The results reveal tissue-specific patterns of chromatin structures and transcriptional controls at the CD79b/GH locus in B cells distinct from those in the pituitary gland and placenta. These three unique transcriptional environments suggest a set of corresponding gene expression pathways and transcriptional interactions that are likely to be found juxtaposed at multiple sites within the eukaryotic genome. PMID:16847312
SOX2 and p63 colocalize at genetic loci in squamous cell carcinomas
Watanabe, Hideo; Ma, Qiuping; Peng, Shouyong; Adelmant, Guillaume; Swain, Danielle; Song, Wenyu; Fox, Cameron; Francis, Joshua M.; Pedamallu, Chandra Sekhar; DeLuca, David S.; Brooks, Angela N.; Wang, Su; Que, Jianwen; Rustgi, Anil K.; Wong, Kwok-kin; Ligon, Keith L.; Liu, X. Shirley; Marto, Jarrod A.; Meyerson, Matthew; Bass, Adam J.
2014-01-01
The transcription factor SOX2 is an essential regulator of pluripotent stem cells and promotes development and maintenance of squamous epithelia. We previously reported that SOX2 is an oncogene and subject to highly recurrent genomic amplification in squamous cell carcinomas (SCCs). Here, we have further characterized the function of SOX2 in SCC. Using ChIP-seq analysis, we compared SOX2-regulated gene profiles in multiple SCC cell lines to ES cell profiles and determined that SOX2 binds to distinct genomic loci in SCCs. In SCCs, SOX2 preferentially interacts with the transcription factor p63, as opposed to the transcription factor OCT4, which is the preferred SOX2 binding partner in ES cells. SOX2 and p63 exhibited overlapping genomic occupancy at a large number of loci in SCCs; however, coordinate binding of SOX2 and p63 was absent in ES cells. We further demonstrated that SOX2 and p63 jointly regulate gene expression, including the oncogene ETV4, which was essential for SOX2-amplified SCC cell survival. Together, these findings demonstrate that the action of SOX2 in SCC differs substantially from its role in pluripotency. The identification of the SCC-associated interaction between SOX2 and p63 will enable deeper characterization the downstream targets of this interaction in SCC and normal squamous epithelial physiology. PMID:24590290
SOX2 and p63 colocalize at genetic loci in squamous cell carcinomas.
Watanabe, Hideo; Ma, Qiuping; Peng, Shouyong; Adelmant, Guillaume; Swain, Danielle; Song, Wenyu; Fox, Cameron; Francis, Joshua M; Pedamallu, Chandra Sekhar; DeLuca, David S; Brooks, Angela N; Wang, Su; Que, Jianwen; Rustgi, Anil K; Wong, Kwok-kin; Ligon, Keith L; Liu, X Shirley; Marto, Jarrod A; Meyerson, Matthew; Bass, Adam J
2014-04-01
The transcription factor SOX2 is an essential regulator of pluripotent stem cells and promotes development and maintenance of squamous epithelia. We previously reported that SOX2 is an oncogene and subject to highly recurrent genomic amplification in squamous cell carcinomas (SCCs). Here, we have further characterized the function of SOX2 in SCC. Using ChIP-seq analysis, we compared SOX2-regulated gene profiles in multiple SCC cell lines to ES cell profiles and determined that SOX2 binds to distinct genomic loci in SCCs. In SCCs, SOX2 preferentially interacts with the transcription factor p63, as opposed to the transcription factor OCT4, which is the preferred SOX2 binding partner in ES cells. SOX2 and p63 exhibited overlapping genomic occupancy at a large number of loci in SCCs; however, coordinate binding of SOX2 and p63 was absent in ES cells. We further demonstrated that SOX2 and p63 jointly regulate gene expression, including the oncogene ETV4, which was essential for SOX2-amplified SCC cell survival. Together, these findings demonstrate that the action of SOX2 in SCC differs substantially from its role in pluripotency. The identification of the SCC-associated interaction between SOX2 and p63 will enable deeper characterization the downstream targets of this interaction in SCC and normal squamous epithelial physiology.
Characterization of a Genomic Signature of Pregnancy in the Breast
Belitskaya-Lévy, Ilana; Zeleniuch-Jacquotte, Anne; Russo, Jose; Russo, Irma H.; Bordás, Pal; Åhman, Janet; Afanasyeva, Yelena; Johansson, Robert; Lenner, Per; Li, Xiaochun; de Cicco, Ricardo López; Peri, Suraj; Ross, Eric; Russo, Patricia A.; Santucci-Pereira, Julia; Sheriff, Fathima S.; Slifker, Michael; Hallmans, Göran; Toniolo, Paolo; Arslan, Alan A.
2012-01-01
The objective of the current study was to comprehensively compare the genomic profiles in the breast of parous and nulliparous postmenopausal women to identify genes that permanently change their expression following pregnancy. The study was designed as a two-phase approach. In the discovery phase, we compared breast genomic profiles of 37 parous with 18 nulliparous postmenopausal women. In the validation phase, confirmation of the genomic patterns observed in the discovery phase was sought in an independent set of 30 parous and 22 nulliparous postmenopausal women. RNA was hybridized to Affymetrix HG_U133 Plus 2.0 oligonucleotide arrays containing probes to 54,675 transcripts; scanned and the images analyzed using Affymetrix GCOS software. Surrogate variable analysis, logistic regression and significance analysis for microarrays were used to identify statistically significant differences in expression of genes. The False Discovery Rate (FDR) approach was used to control for multiple comparisons. We found that 208 genes (305 probe sets) were differentially expressed between parous and nulliparous women in both discovery and validation phases of the study at a FDR of 10% and with at least a 1.25-fold change. These genes are involved in regulation of transcription, centrosome organization, RNA splicing, cell cycle control, adhesion and differentiation. The results provide persuasive evidence that full-term pregnancy induces long-term genomic changes in the breast. The genomic signature of pregnancy could be used as an intermediate marker to assess potential chemopreventive interventions with hormones mimicking the effects of pregnancy for prevention of breast cancer. PMID:21622728
Repurposing Archival Samples for Investigating Toxicological Modes of Action
Little is known about formalin fixation induced genomic artifacts, limiting the use of formalin-fixed paraffin-embedded (FFPE) samples in toxicological and clinical studies. Previously, we identified a consistent shift in transcriptional profiles between paired frozen and FFPE sa...
Decomposing genomic variance using information from GWA, GWE and eQTL analysis.
Ehsani, A; Janss, L; Pomp, D; Sørensen, P
2016-04-01
A commonly used procedure in genome-wide association (GWA), genome-wide expression (GWE) and expression quantitative trait locus (eQTL) analyses is based on a bottom-up experimental approach that attempts to individually associate molecular variants with complex traits. Top-down modeling of the entire set of genomic data and partitioning of the overall variance into subcomponents may provide further insight into the genetic basis of complex traits. To test this approach, we performed a whole-genome variance components analysis and partitioned the genomic variance using information from GWA, GWE and eQTL analyses of growth-related traits in a mouse F2 population. We characterized the mouse trait genetic architecture by ordering single nucleotide polymorphisms (SNPs) based on their P-values and studying the areas under the curve (AUCs). The observed traits were found to have a genomic variance profile that differed significantly from that expected of a trait under an infinitesimal model. This situation was particularly true for both body weight and body fat, for which the AUCs were much higher compared with that of glucose. In addition, SNPs with a high degree of trait-specific regulatory potential (SNPs associated with subset of transcripts that significantly associated with a specific trait) explained a larger proportion of the genomic variance than did SNPs with high overall regulatory potential (SNPs associated with transcripts using traditional eQTL analysis). We introduced AUC measures of genomic variance profiles that can be used to quantify relative importance of SNPs as well as degree of deviation of a trait's inheritance from an infinitesimal model. The shape of the curve aids global understanding of traits: The steeper the left-hand side of the curve, the fewer the number of SNPs controlling most of the phenotypic variance. © 2015 Stichting International Foundation for Animal Genetics.
Macqueen, Daniel J; Kristjánsson, Bjarni K; Johnston, Ian A
2010-06-01
Metazoan akirin genes regulate innate immunity, myogenesis, and carcinogenesis. Invertebrates typically have one family member, while most tetrapod and teleost vertebrates have one to three. We demonstrate an expanded repertoire of eight family members in genomes of four salmonid fishes, owing to paralog preservation after three tetraploidization events. Retention of paralogs secondarily lost in other teleosts may be related to functional diversification and posttranslational regulation. We hypothesized that salmonid akirins would be transcriptionally regulated in fast-twitch skeletal muscle during activation of conserved pathways governing catabolism and growth. The in vivo nutritional state of Arctic charr (Salvelinus alpinus L.) was experimentally manipulated, and transcript levels for akirin family members and 26 other genes were measured by quantitative real-time PCR (qPCR), allowing the establishment of a similarity network of expression profiles. In fasted muscle, a class of akirins was upregulated, with one family member showing high coexpression with catabolic genes coding the NF-kappaB p65 subunit, E2 ubiquitin-conjugating enzymes, E3 ubiquitin ligases, and IGF-I receptors. Another class of akirin was upregulated with subsequent feeding, coexpressed with 14-3-3 protein genes. There was no similarity between expression profiles of akirins with IGF hormones or binding protein genes. The level of phylogenetic relatedness of akirin family members was not a strong predictor of transcriptional responses to nutritional state, or differences in transcript abundance levels, indicating a complex pattern of regulatory evolution. The salmonid akirins epitomize the complexity linking the genome to physiological phenotypes of vertebrates with a history of tetraploidization.
Adenosine-to-inosine (A-to-I) RNA editing is a widespread post-transcriptional mechanism, but its genomic landscape and clinical relevance in cancer have not been investigated systematically. We characterized the global A-to-I RNA editing profiles of 6,236 patient samples of 17 cancer types from The Cancer Genome Atlas and revealed a striking diversity of altered RNA-editing patterns in tumors relative to normal tissues. We identified an appreciable number of clinically relevant editing events, many of which are in noncoding regions.
Xu, Yaomin; Guo, Xingyi; Sun, Jiayang; Zhao, Zhongming
2015-01-01
Motivation: Large-scale cancer genomic studies, such as The Cancer Genome Atlas (TCGA), have profiled multidimensional genomic data, including mutation and expression profiles on a variety of cancer cell types, to uncover the molecular mechanism of cancerogenesis. More than a hundred driver mutations have been characterized that confer the advantage of cell growth. However, how driver mutations regulate the transcriptome to affect cellular functions remains largely unexplored. Differential analysis of gene expression relative to a driver mutation on patient samples could provide us with new insights in understanding driver mutation dysregulation in tumor genome and developing personalized treatment strategies. Results: Here, we introduce the Snowball approach as a highly sensitive statistical analysis method to identify transcriptional signatures that are affected by a recurrent driver mutation. Snowball utilizes a resampling-based approach and combines a distance-based regression framework to assign a robust ranking index of genes based on their aggregated association with the presence of the mutation, and further selects the top significant genes for downstream data analyses or experiments. In our application of the Snowball approach to both synthesized and TCGA data, we demonstrated that it outperforms the standard methods and provides more accurate inferences to the functional effects and transcriptional dysregulation of driver mutations. Availability and implementation: R package and source code are available from CRAN at http://cran.r-project.org/web/packages/DESnowball, and also available at http://bioinfo.mc.vanderbilt.edu/DESnowball/. Contact: zhongming.zhao@vanderbilt.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25192743
The root transcriptome for North American ginseng assembled and profiled across seasonal development
2013-01-01
Background Ginseng including North American ginseng (Panax quinquefolius L.) is one of the most widely used medicinal plants. Its success is thought to be due to a diverse collection of ginsenosides that serve as its major bioactive compounds. However, few genomic resources exist and the details concerning its various biosynthetic pathways remain poorly understood. As the root is the primary tissue harvested commercially for ginsenosides, next generation sequencing was applied to the characterization and assembly of the root transcriptome throughout seasonal development. Transcripts showing homology to ginsenoside biosynthesis enzymes were profiled in greater detail. Results RNA extracts from root samples from seven development stages of North American ginseng were subjected to 454 sequencing, filtered for quality and used in the de novo assembly of a collective root reference transcriptome consisting of 41,623 transcripts. Annotation efforts using a number of public databases resulted in detailed annotation information for 34,801 (84%) transcripts. In addition, 3,955 genes were assigned to metabolic pathways using the Kyoto Encyclopedia of Genes and Genomes. Among our results, we found all of the known enzymes involved in the ginsenoside backbone biosynthesis and used co-expression analysis to identify a number of candidate sequences involved in the latter stages ginsenoside biosynthesis pathway. Transcript profiles suggest ginsenoside biosynthesis occurs at distinct stages of development. Conclusions The assembly generated provides a comprehensive annotated reference for future transcriptomic study of North American ginseng. A collection of putative ginsenoside biosynthesis genes were identified and candidate genes predicted from the lesser understood downstream stages of biosynthesis. Transcript expression profiles across seasonal development suggest a primary dammarane-type ginsenoside biosynthesis occurs just prior to plant senescence, with secondary ginsenoside production occurring throughout development. Data from the study provide a valuable resource for conducting future ginsenoside biosynthesis research in this important medicinal plant. PMID:23957709
Ulianov, Sergey V; Galitsyna, Aleksandra A; Flyamer, Ilya M; Golov, Arkadiy K; Khrameeva, Ekaterina E; Imakaev, Maxim V; Abdennur, Nezar A; Gelfand, Mikhail S; Gavrilov, Alexey A; Razin, Sergey V
2017-07-11
In homeotherms, the alpha-globin gene clusters are located within permanently open genome regions enriched in housekeeping genes. Terminal erythroid differentiation results in dramatic upregulation of alpha-globin genes making their expression comparable to the rRNA transcriptional output. Little is known about the influence of the erythroid-specific alpha-globin gene transcription outburst on adjacent, widely expressed genes and large-scale chromatin organization. Here, we have analyzed the total transcription output, the overall chromatin contact profile, and CTCF binding within the 2.7 Mb segment of chicken chromosome 14 harboring the alpha-globin gene cluster in cultured lymphoid cells and cultured erythroid cells before and after induction of terminal erythroid differentiation. We found that, similarly to mammalian genome, the chicken genomes is organized in TADs and compartments. Full activation of the alpha-globin gene transcription in differentiated erythroid cells is correlated with upregulation of several adjacent housekeeping genes and the emergence of abundant intergenic transcription. An extended chromosome region encompassing the alpha-globin cluster becomes significantly decompacted in differentiated erythroid cells, and depleted in CTCF binding and CTCF-anchored chromatin loops, while the sub-TAD harboring alpha-globin gene cluster and the upstream major regulatory element (MRE) becomes highly enriched with chromatin interactions as compared to lymphoid and proliferating erythroid cells. The alpha-globin gene domain and the neighboring loci reside within the A-like chromatin compartment in both lymphoid and erythroid cells and become further segregated from the upstream gene desert upon terminal erythroid differentiation. Our findings demonstrate that the effects of tissue-specific transcription activation are not restricted to the host genomic locus but affect the overall chromatin structure and transcriptional output of the encompassing topologically associating domain.
Tabassum, Rubina; Sivadas, Ambily; Agrawal, Vartika; Tian, Haozheng; Arafat, Dalia; Gibson, Greg
2015-08-13
Personalized medicine is predicated on the notion that individual biochemical and genomic profiles are relatively constant in times of good health and to some extent predictive of disease or therapeutic response. We report a pilot study quantifying gene expression and methylation profile consistency over time, addressing the reasons for individual uniqueness, and its relation to N = 1 phenotypes. Whole blood samples from four African American women, four Caucasian women, and four Caucasian men drawn from the Atlanta Center for Health Discovery and Well Being study at three successive 6-month intervals were profiled by RNA-Seq, miRNA-Seq, and Illumina Methylation 450 K arrays. Standard regression approaches were used to evaluate the proportion of variance for each type of omic measure among individuals, and to quantify correlations among measures and with clinical attributes related to wellness. Longitudinal omic profiles were in general highly consistent over time, with an average of 67 % variance in transcript abundance, 42 % in CpG methylation level (but 88 % for the most differentiated CpG per gene), and 50 % in miRNA abundance among individuals, which are all comparable to 74 % variance among individuals for 74 clinical traits. One third of the variance could be attributed to differential blood cell type abundance, which was also fairly stable over time, and a lesser amount to expression quantitative trait loci (eQTL) effects. Seven conserved axes of covariance that capture diverse aspects of immune function explained over half of the variance. These axes also explained a considerable proportion of individually extreme transcript abundance, namely approximately 100 genes that were significantly up-regulated or down-regulated in each person and were in some cases enriched for relevant gene activities that plausibly associate with clinical attributes. A similar fraction of genes had individually divergent methylation levels, but these did not overlap with the transcripts, and fewer than 20 % of genes had significantly correlated methylation and gene expression. People express an "omic personality" consisting of peripheral blood transcriptional and epigenetic profiles that are constant over the course of a year and reflect various types of immune activity. Baseline genomic profiles can provide a window into the molecular basis of traits that might be useful for explaining medical conditions or guiding personalized health decisions.
Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B., Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj
2013-01-01
The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants. PMID:23691254
Puranik, Swati; Sahu, Pranav Pankaj; Mandal, Sambhu Nath; B, Venkata Suresh; Parida, Swarup Kumar; Prasad, Manoj
2013-01-01
The NAC proteins represent a major plant-specific transcription factor family that has established enormously diverse roles in various plant processes. Aided by the availability of complete genomes, several members of this family have been identified in Arabidopsis, rice, soybean and poplar. However, no comprehensive investigation has been presented for the recently sequenced, naturally stress tolerant crop, Setaria italica (foxtail millet) that is famed as a model crop for bioenergy research. In this study, we identified 147 putative NAC domain-encoding genes from foxtail millet by systematic sequence analysis and physically mapped them onto nine chromosomes. Genomic organization suggested that inter-chromosomal duplications may have been responsible for expansion of this gene family in foxtail millet. Phylogenetically, they were arranged into 11 distinct sub-families (I-XI), with duplicated genes fitting into one cluster and possessing conserved motif compositions. Comparative mapping with other grass species revealed some orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of genes. The evolutionary significance as duplication and divergence of NAC genes based on their amino acid substitution rates was understood. Expression profiling against various stresses and phytohormones provides novel insights into specific and/or overlapping expression patterns of SiNAC genes, which may be responsible for functional divergence among individual members in this crop. Further, we performed structure modeling and molecular simulation of a stress-responsive protein, SiNAC128, proffering an initial framework for understanding its molecular function. Taken together, this genome-wide identification and expression profiling unlocks new avenues for systematic functional analysis of novel NAC gene family candidates which may be applied for improvising stress adaption in plants.
Genome-wide transcriptional profiling by microarrays provides a powerful platform for gene expression-based biomarker discovery. After their wide acceptance in human disease diagnosis, prognosis, and drug discovery, these gene signatures are increasingly being adopted for environ...
Genome-wide transcriptional profiling by microarrays provides a powerful platform for gene expression-based biomarker discovery. After their wide acceptance in human disease diagnosis, prognosis, and drug discovery, these gene signatures are increasingly being adopted for environ...
Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Lu, Kun; Xu, Xinfu; Wang, Rui; Li, Jiana; Qu, Cunmin
2017-10-24
The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed ( Brassica napus ). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B . napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B . napus and its parental lines and for molecular breeding studies of bZIP genes in B . napus .
Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Xu, Xinfu; Wang, Rui; Li, Jiana
2017-01-01
The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed (Brassica napus). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B. napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B. napus and its parental lines and for molecular breeding studies of bZIP genes in B. napus. PMID:29064393
Prakash, Celine; Haeseler, Arndt Von
2017-03-01
RNA sequencing (RNA-seq) has emerged as the method of choice for measuring the expression of RNAs in a given cell population. In most RNA-seq technologies, sequencing the full length of RNA molecules requires fragmentation into smaller pieces. Unfortunately, the issue of nonuniform sequencing coverage across a genomic feature has been a concern in RNA-seq and is attributed to biases for certain fragments in RNA-seq library preparation and sequencing. To investigate the expected coverage obtained from fragmentation, we develop a simple fragmentation model that is independent of bias from the experimental method and is not specific to the transcript sequence. Essentially, we enumerate all configurations for maximal placement of a given fragment length, F, on transcript length, T, to represent every possible fragmentation pattern, from which we compute the expected coverage profile across a transcript. We extend this model to incorporate general empirical attributes such as read length, fragment length distribution, and number of molecules of the transcript. We further introduce the fragment starting-point, fragment coverage, and read coverage profiles. We find that the expected profiles are not uniform and that factors such as fragment length to transcript length ratio, read length to fragment length ratio, fragment length distribution, and number of molecules influence the variability of coverage across a transcript. Finally, we explore a potential application of the model where, with simulations, we show that it is possible to correctly estimate the transcript copy number for any transcript in the RNA-seq experiment.
Haeseler, Arndt Von
2017-01-01
Abstract RNA sequencing (RNA-seq) has emerged as the method of choice for measuring the expression of RNAs in a given cell population. In most RNA-seq technologies, sequencing the full length of RNA molecules requires fragmentation into smaller pieces. Unfortunately, the issue of nonuniform sequencing coverage across a genomic feature has been a concern in RNA-seq and is attributed to biases for certain fragments in RNA-seq library preparation and sequencing. To investigate the expected coverage obtained from fragmentation, we develop a simple fragmentation model that is independent of bias from the experimental method and is not specific to the transcript sequence. Essentially, we enumerate all configurations for maximal placement of a given fragment length, F, on transcript length, T, to represent every possible fragmentation pattern, from which we compute the expected coverage profile across a transcript. We extend this model to incorporate general empirical attributes such as read length, fragment length distribution, and number of molecules of the transcript. We further introduce the fragment starting-point, fragment coverage, and read coverage profiles. We find that the expected profiles are not uniform and that factors such as fragment length to transcript length ratio, read length to fragment length ratio, fragment length distribution, and number of molecules influence the variability of coverage across a transcript. Finally, we explore a potential application of the model where, with simulations, we show that it is possible to correctly estimate the transcript copy number for any transcript in the RNA-seq experiment. PMID:27661099
Shang, Haihong; Li, Wei; Zou, Changsong; Yuan, Youlu
2013-07-01
NAC domain proteins are plant-specific transcription factors known to play diverse roles in various plant developmental processes. In the present study, we performed the first comprehensive study of the NAC gene family in Gossypium raimondii Ulbr., incorporating phylogenetic, chromosomal location, gene structure, conserved motif, and expression profiling analyses. We identified 145 NAC transcription factor (NAC-TF) genes that were phylogenetically clustered into 18 distinct subfamilies. Of these, 127 NAC-TF genes were distributed across the 13 chromosomes, 80 (55%) were preferentially retained duplicates located in both duplicated regions and six were located in triplicated chromosomal regions. The majority of NAC-TF genes showed temporal-, spatial-, and tissue-specific expression patterns based on transcriptomic and qRT-PCR analyses. However, the expression patterns of several duplicate genes were partially redundant, suggesting the occurrence of sub-functionalization during their evolution. Based on their genomic organization, we concluded that genomic duplications contributed significantly to the expansion of the NAC-TF gene family in G. raimondii. Comprehensive analysis of their expression profiles could provide novel insights into the functional divergence among members of the NAC gene family in G. raimondii. © 2013 Institute of Botany, Chinese Academy of Sciences.
Accurate Prediction of Inducible Transcription Factor Binding Intensities In Vivo
Siepel, Adam; Lis, John T.
2012-01-01
DNA sequence and local chromatin landscape act jointly to determine transcription factor (TF) binding intensity profiles. To disentangle these influences, we developed an experimental approach, called protein/DNA binding followed by high-throughput sequencing (PB–seq), that allows the binding energy landscape to be characterized genome-wide in the absence of chromatin. We applied our methods to the Drosophila Heat Shock Factor (HSF), which inducibly binds a target DNA sequence element (HSE) following heat shock stress. PB–seq involves incubating sheared naked genomic DNA with recombinant HSF, partitioning the HSF–bound and HSF–free DNA, and then detecting HSF–bound DNA by high-throughput sequencing. We compared PB–seq binding profiles with ones observed in vivo by ChIP–seq and developed statistical models to predict the observed departures from idealized binding patterns based on covariates describing the local chromatin environment. We found that DNase I hypersensitivity and tetra-acetylation of H4 were the most influential covariates in predicting changes in HSF binding affinity. We also investigated the extent to which DNA accessibility, as measured by digital DNase I footprinting data, could be predicted from MNase–seq data and the ChIP–chip profiles for many histone modifications and TFs, and found GAGA element associated factor (GAF), tetra-acetylation of H4, and H4K16 acetylation to be the most predictive covariates. Lastly, we generated an unbiased model of HSF binding sequences, which revealed distinct biophysical properties of the HSF/HSE interaction and a previously unrecognized substructure within the HSE. These findings provide new insights into the interplay between the genomic sequence and the chromatin landscape in determining transcription factor binding intensity. PMID:22479205
George, Julie; Walter, Vonn; Peifer, Martin; Alexandrov, Ludmil B; Seidel, Danila; Leenders, Frauke; Maas, Lukas; Müller, Christian; Dahmen, Ilona; Delhomme, Tiffany M; Ardin, Maude; Leblay, Noemie; Byrnes, Graham; Sun, Ruping; De Reynies, Aurélien; McLeer-Florin, Anne; Bosco, Graziella; Malchers, Florian; Menon, Roopika; Altmüller, Janine; Becker, Christian; Nürnberg, Peter; Achter, Viktor; Lang, Ulrich; Schneider, Peter M; Bogus, Magdalena; Soloway, Matthew G; Wilkerson, Matthew D; Cun, Yupeng; McKay, James D; Moro-Sibilot, Denis; Brambilla, Christian G; Lantuejoul, Sylvie; Lemaitre, Nicolas; Soltermann, Alex; Weder, Walter; Tischler, Verena; Brustugun, Odd Terje; Lund-Iversen, Marius; Helland, Åslaug; Solberg, Steinar; Ansén, Sascha; Wright, Gavin; Solomon, Benjamin; Roz, Luca; Pastorino, Ugo; Petersen, Iver; Clement, Joachim H; Sänger, Jörg; Wolf, Jürgen; Vingron, Martin; Zander, Thomas; Perner, Sven; Travis, William D; Haas, Stefan A; Olivier, Magali; Foll, Matthieu; Büttner, Reinhard; Hayes, David Neil; Brambilla, Elisabeth; Fernandez-Cuesta, Lynnette; Thomas, Roman K
2018-03-13
Pulmonary large-cell neuroendocrine carcinomas (LCNECs) have similarities with other lung cancers, but their precise relationship has remained unclear. Here we perform a comprehensive genomic (n = 60) and transcriptomic (n = 69) analysis of 75 LCNECs and identify two molecular subgroups: "type I LCNECs" with bi-allelic TP53 and STK11/KEAP1 alterations (37%), and "type II LCNECs" enriched for bi-allelic inactivation of TP53 and RB1 (42%). Despite sharing genomic alterations with adenocarcinomas and squamous cell carcinomas, no transcriptional relationship was found; instead LCNECs form distinct transcriptional subgroups with closest similarity to SCLC. While type I LCNECs and SCLCs exhibit a neuroendocrine profile with ASCL1 high /DLL3 high /NOTCH low , type II LCNECs bear TP53 and RB1 alterations and differ from most SCLC tumors with reduced neuroendocrine markers, a pattern of ASCL1 low /DLL3 low /NOTCH high , and an upregulation of immune-related pathways. In conclusion, LCNECs comprise two molecularly defined subgroups, and distinguishing them from SCLC may allow stratified targeted treatment of high-grade neuroendocrine lung tumors.
Kravatsky, Yuri V; Chechetkin, Vladimir R; Tchurikov, Nikolai A; Kravatskaya, Galina I
2015-02-01
The broad class of tasks in genetics and epigenetics can be reduced to the study of various features that are distributed over the genome (genome tracks). The rapid and efficient processing of the huge amount of data stored in the genome-scale databases cannot be achieved without the software packages based on the analytical criteria. However, strong inhomogeneity of genome tracks hampers the development of relevant statistics. We developed the criteria for the assessment of genome track inhomogeneity and correlations between two genome tracks. We also developed a software package, Genome Track Analyzer, based on this theory. The theory and software were tested on simulated data and were applied to the study of correlations between CpG islands and transcription start sites in the Homo sapiens genome, between profiles of protein-binding sites in chromosomes of Drosophila melanogaster, and between DNA double-strand breaks and histone marks in the H. sapiens genome. Significant correlations between transcription start sites on the forward and the reverse strands were observed in genomes of D. melanogaster, Caenorhabditis elegans, Mus musculus, H. sapiens, and Danio rerio. The observed correlations may be related to the regulation of gene expression in eukaryotes. Genome Track Analyzer is freely available at http://ancorr.eimb.ru/. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C; Zhang, Baohong
2014-10-16
Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence.
Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C.; Zhang, Baohong
2014-01-01
Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors play versatile functions in multiple aspects of plant growth and development. However, no systematical study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence. PMID:25322260
Reprogramming cell fate with a genome-scale library of artificial transcription factors.
Eguchi, Asuka; Wleklinski, Matthew J; Spurgat, Mackenzie C; Heiderscheit, Evan A; Kropornicka, Anna S; Vu, Catherine K; Bhimsaria, Devesh; Swanson, Scott A; Stewart, Ron; Ramanathan, Parameswaran; Kamp, Timothy J; Slukvin, Igor; Thomson, James A; Dutton, James R; Ansari, Aseem Z
2016-12-20
Artificial transcription factors (ATFs) are precision-tailored molecules designed to bind DNA and regulate transcription in a preprogrammed manner. Libraries of ATFs enable the high-throughput screening of gene networks that trigger cell fate decisions or phenotypic changes. We developed a genome-scale library of ATFs that display an engineered interaction domain (ID) to enable cooperative assembly and synergistic gene expression at targeted sites. We used this ATF library to screen for key regulators of the pluripotency network and discovered three combinations of ATFs capable of inducing pluripotency without exogenous expression of Oct4 (POU domain, class 5, TF 1). Cognate site identification, global transcriptional profiling, and identification of ATF binding sites reveal that the ATFs do not directly target Oct4; instead, they target distinct nodes that converge to stimulate the endogenous pluripotency network. This forward genetic approach enables cell type conversions without a priori knowledge of potential key regulators and reveals unanticipated gene network dynamics that drive cell fate choices.
Rodrigues, Raquel; Grosso, Ana Rita; Moita, Luís
2013-01-01
The immune system relies on the plasticity of its components to produce appropriate responses to frequent environmental challenges. Dendritic cells (DCs) are critical initiators of innate immunity and orchestrate the later and more specific adaptive immunity. The generation of diversity in transcriptional programs is central for effective immune responses. Alternative splicing is widely considered a key generator of transcriptional and proteomic complexity, but its role has been rarely addressed systematically in immune cells. Here we used splicing-sensitive arrays to assess genome-wide gene- and exon-level expression profiles in human DCs in response to a bacterial challenge. We find widespread alternative splicing events and splicing factor transcriptional signatures induced by an E. coli challenge to human DCs. Alternative splicing acts in concert with transcriptional modulation, but these two mechanisms of gene regulation affect primarily distinct functional gene groups. Alternative splicing is likely to have an important role in DC immunobiology because it affects genes known to be involved in DC development, endocytosis, antigen presentation and cell cycle arrest.
Reprogramming cell fate with a genome-scale library of artificial transcription factors
Eguchi, Asuka; Wleklinski, Matthew J.; Spurgat, Mackenzie C.; Heiderscheit, Evan A.; Kropornicka, Anna S.; Vu, Catherine K.; Bhimsaria, Devesh; Swanson, Scott A.; Stewart, Ron; Ramanathan, Parameswaran; Kamp, Timothy J.; Slukvin, Igor; Thomson, James A.; Dutton, James R.; Ansari, Aseem Z.
2016-01-01
Artificial transcription factors (ATFs) are precision-tailored molecules designed to bind DNA and regulate transcription in a preprogrammed manner. Libraries of ATFs enable the high-throughput screening of gene networks that trigger cell fate decisions or phenotypic changes. We developed a genome-scale library of ATFs that display an engineered interaction domain (ID) to enable cooperative assembly and synergistic gene expression at targeted sites. We used this ATF library to screen for key regulators of the pluripotency network and discovered three combinations of ATFs capable of inducing pluripotency without exogenous expression of Oct4 (POU domain, class 5, TF 1). Cognate site identification, global transcriptional profiling, and identification of ATF binding sites reveal that the ATFs do not directly target Oct4; instead, they target distinct nodes that converge to stimulate the endogenous pluripotency network. This forward genetic approach enables cell type conversions without a priori knowledge of potential key regulators and reveals unanticipated gene network dynamics that drive cell fate choices. PMID:27930301
Jing, Zhaobin; Liu, Zhande
2018-04-01
As one of the largest transcriptional factor families in plants, WRKY transcription factors play important roles in various biotic and abiotic stress responses. To date, WRKY genes in kiwifruit (Actinidia spp.) remain poorly understood. In our study, o total of 97 AcWRKY genes have been identified in the kiwifruit genome. An overview of these AcWRKY genes is analyzed, including the phylogenetic relationships, exon-intron structures, synteny and expression profiles. The 97 AcWRKY genes were divided into three groups based on the conserved WRKY domain. Synteny analysis indicated that segmental duplication events contributed to the expansion of the kiwifruit AcWRKY family. In addition, the synteny analysis between kiwifruit and Arabidopsis suggested that some of the AcWRKY genes were derived from common ancestors before the divergence of these two species. Conserved motifs outside the AcWRKY domain may reflect their functional conservation. Genome-wide segmental and tandem duplication were found, which may contribute to the expansion of AcWRKY genes. Furthermore, the analysis of selected AcWRKY genes showed a variety of expression patterns in five different organs as well as during biotic and abiotic stresses. The genome-wide identification and characterization of kiwifruit WRKY transcription factors provides insight into the evolutionary history and is a useful resource for further functional analyses of kiwifruit.
Retrotransposon profiling of RNA polymerase III initiation sites.
Qi, Xiaojie; Daily, Kenneth; Nguyen, Kim; Wang, Haoyi; Mayhew, David; Rigor, Paul; Forouzan, Sholeh; Johnston, Mark; Mitra, Robi David; Baldi, Pierre; Sandmeyer, Suzanne
2012-04-01
Although retroviruses are relatively promiscuous in choice of integration sites, retrotransposons can display marked integration specificity. In yeast and slime mold, some retrotransposons are associated with tRNA genes (tDNAs). In the Saccharomyces cerevisiae genome, the long terminal repeat retrotransposon Ty3 is found at RNA polymerase III (Pol III) transcription start sites of tDNAs. Ty1, 2, and 4 elements also cluster in the upstream regions of these genes. To determine the extent to which other Pol III-transcribed genes serve as genomic targets for Ty3, a set of 10,000 Ty3 genomic retrotranspositions were mapped using high-throughput DNA sequencing. Integrations occurred at all known tDNAs, two tDNA relics (iYGR033c and ZOD1), and six non-tDNA, Pol III-transcribed types of genes (RDN5, SNR6, SNR52, RPR1, RNA170, and SCR1). Previous work in vitro demonstrated that the Pol III transcription factor (TF) IIIB is important for Ty3 targeting. However, seven loci that bind the TFIIIB loader, TFIIIC, were not targeted, underscoring the unexplained absence of TFIIIB at those sites. Ty3 integrations also occurred in two open reading frames not previously associated with Pol III transcription, suggesting the existence of a small number of additional sites in the yeast genome that interact with Pol III transcription complexes.
Shamimuzzaman, Md.
2018-01-01
To understand translational capacity on a genome-wide scale across three developmental stages of immature soybean seed cotyledons, ribosome profiling was performed in combination with RNA sequencing and cluster analysis. Transcripts representing 216 unique genes demonstrated a higher level of translational activity in at least one stage by exhibiting higher translational efficiencies (TEs) in which there were relatively more ribosome footprint sequence reads mapping to the transcript than were present in the control total RNA sample. The majority of these transcripts were more translationally active at the early stage of seed development and included 12 unique serine or cysteine proteases and 16 2S albumin and low molecular weight cysteine-rich proteins that may serve as substrates for turnover and mobilization early in seed development. It would appear that the serine proteases and 2S albumins play a vital role in the early stages. In contrast, our investigation of profiles of 19 genes encoding high abundance seed storage proteins, such as glycinins, beta-conglycinins, lectin, and Kunitz trypsin inhibitors, showed that they all had similar patterns in which the TE values started at low levels and increased approximately 2 to 6-fold during development. The highest levels of these seed protein transcripts were found at the mid-developmental stage, whereas the highest ribosome footprint levels of only up to 1.6 TE were found at the late developmental stage. These experimental findings suggest that the major seed storage protein coding genes are primarily regulated at the transcriptional level during normal soybean cotyledon development. Finally, our analyses also identified a total of 370 unique gene models that showed very low TE values including over 48 genes encoding ribosomal family proteins and 95 gene models that are related to energy and photosynthetic functions, many of which have homology to the chloroplast genome. Additionally, we showed that genes of the chloroplast were relatively translationally inactive during seed development. PMID:29570733
A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing.
Chen, Shi-Yi; Deng, Feilong; Jia, Xianbo; Li, Cao; Lai, Song-Jia
2017-08-09
It is widely acknowledged that transcriptional diversity largely contributes to biological regulation in eukaryotes. Since the advent of second-generation sequencing technologies, a large number of RNA sequencing studies have considerably improved our understanding of transcriptome complexity. However, it still remains a huge challenge for obtaining full-length transcripts because of difficulties in the short read-based assembly. In the present study we employ PacBio single-molecule long-read sequencing technology for whole-transcriptome profiling in rabbit (Oryctolagus cuniculus). We totally obtain 36,186 high-confidence transcripts from 14,474 genic loci, among which more than 23% of genic loci and 66% of isoforms have not been annotated yet within the current reference genome. Furthermore, about 17% of transcripts are computationally revealed to be non-coding RNAs. Up to 24,797 alternative splicing (AS) and 11,184 alternative polyadenylation (APA) events are detected within this de novo constructed transcriptome, respectively. The results provide a comprehensive set of reference transcripts and hence contribute to the improved annotation of rabbit genome.
Masulis, Irina S.; Babaeva, Zaira Sh.; Chernyshov, Sergey V.; Ozoline, Olga N.
2015-01-01
Mosaic pattern of transcription in alternating directions is a common feature of prokaryotic and eukaryotic genomes which rationality and origin remain enigmatic. In Escherichia coli approximately 25% of genes comprise pairs of topologically linked divergently transcribed units. Given that transcriptional complex formation at each promoter in the pair induces topological changes and is itself sensitive to DNA structural perturbations, study of the functional anatomy in such areas requires special approaches. Here we suggested the dual-colour promoter probe vector which may become an ideal tool for divergent transcription profiling. The vector was used to characterize the specific genomic region nearby appY with multiple bidirectional promoters predicted in silico. Only three promoters of this region were shown to be engaged in the transcription initiation resulting in the expression of reporter genes. RNA product transcribed in antisense direction is suggested as a novel RNA. Nalidixin-induced topological modulation differentially affected transcription in sense and antisense directions thus exemplifying anticooperative mode in the response to topological alterations. PMID:26081797
BLISS is a versatile and quantitative method for genome-wide profiling of DNA double-strand breaks.
Yan, Winston X; Mirzazadeh, Reza; Garnerone, Silvano; Scott, David; Schneider, Martin W; Kallas, Tomasz; Custodio, Joaquin; Wernersson, Erik; Li, Yinqing; Gao, Linyi; Federova, Yana; Zetsche, Bernd; Zhang, Feng; Bienko, Magda; Crosetto, Nicola
2017-05-12
Precisely measuring the location and frequency of DNA double-strand breaks (DSBs) along the genome is instrumental to understanding genomic fragility, but current methods are limited in versatility, sensitivity or practicality. Here we present Breaks Labeling In Situ and Sequencing (BLISS), featuring the following: (1) direct labelling of DSBs in fixed cells or tissue sections on a solid surface; (2) low-input requirement by linear amplification of tagged DSBs by in vitro transcription; (3) quantification of DSBs through unique molecular identifiers; and (4) easy scalability and multiplexing. We apply BLISS to profile endogenous and exogenous DSBs in low-input samples of cancer cells, embryonic stem cells and liver tissue. We demonstrate the sensitivity of BLISS by assessing the genome-wide off-target activity of two CRISPR-associated RNA-guided endonucleases, Cas9 and Cpf1, observing that Cpf1 has higher specificity than Cas9. Our results establish BLISS as a versatile, sensitive and efficient method for genome-wide DSB mapping in many applications.
Dong, Chen; Hu, Huigang; Xie, Jianghui
2016-12-01
DNA-binding with one finger (Dof) domain proteins are a multigene family of plant-specific transcription factors involved in numerous aspects of plant growth and development. In this study, we report a genome-wide search for Musa acuminata Dof (MaDof) genes and their expression profiles at different developmental stages and in response to various abiotic stresses. In addition, a complete overview of the Dof gene family in bananas is presented, including the gene structures, chromosomal locations, cis-regulatory elements, conserved protein domains, and phylogenetic inferences. Based on the genome-wide analysis, we identified 74 full-length protein-coding MaDof genes unevenly distributed on 11 chromosomes. Phylogenetic analysis with Dof members from diverse plant species showed that MaDof genes can be classified into four subgroups (StDof I, II, III, and IV). The detailed genomic information of the MaDof gene homologs in the present study provides opportunities for functional analyses to unravel the exact role of the genes in plant growth and development.
Pothi, Radhika; Hesketh, Andrew; Möller-Levet, Carla; Hodgson, David A; Laing, Emma E; Stewart, Graham R; Smith, Colin P
2018-01-01
Abstract Stress-induced adaptations require multiple levels of regulation in all organisms to repair cellular damage. In the present study we evaluated the genome-wide transcriptional and translational changes following heat stress exposure in the soil-dwelling model actinomycete bacterium, Streptomyces coelicolor. The combined analysis revealed an unprecedented level of translational control of gene expression, deduced through polysome profiling, in addition to transcriptional changes. Our data show little correlation between the transcriptome and ‘translatome’; while an obvious downward trend in genome wide transcription was observed, polysome associated transcripts following heat-shock showed an opposite upward trend. A handful of key protein players, including the major molecular chaperones and proteases were highly induced at both the transcriptional and translational level following heat-shock, a phenomenon known as ‘potentiation’. Many other transcripts encoding cold-shock proteins, ABC-transporter systems, multiple transcription factors were more highly polysome-associated following heat stress; interestingly, these protein families were not induced at the transcriptional level and therefore were not previously identified as part of the stress response. Thus, stress coping mechanisms at the level of gene expression in this bacterium go well beyond the induction of a relatively small number of molecular chaperones and proteases in order to ensure cellular survival at non-physiological temperatures. PMID:29746664
Bucca, Giselda; Pothi, Radhika; Hesketh, Andrew; Möller-Levet, Carla; Hodgson, David A; Laing, Emma E; Stewart, Graham R; Smith, Colin P
2018-05-09
Stress-induced adaptations require multiple levels of regulation in all organisms to repair cellular damage. In the present study we evaluated the genome-wide transcriptional and translational changes following heat stress exposure in the soil-dwelling model actinomycete bacterium, Streptomyces coelicolor. The combined analysis revealed an unprecedented level of translational control of gene expression, deduced through polysome profiling, in addition to transcriptional changes. Our data show little correlation between the transcriptome and 'translatome'; while an obvious downward trend in genome wide transcription was observed, polysome associated transcripts following heat-shock showed an opposite upward trend. A handful of key protein players, including the major molecular chaperones and proteases were highly induced at both the transcriptional and translational level following heat-shock, a phenomenon known as 'potentiation'. Many other transcripts encoding cold-shock proteins, ABC-transporter systems, multiple transcription factors were more highly polysome-associated following heat stress; interestingly, these protein families were not induced at the transcriptional level and therefore were not previously identified as part of the stress response. Thus, stress coping mechanisms at the level of gene expression in this bacterium go well beyond the induction of a relatively small number of molecular chaperones and proteases in order to ensure cellular survival at non-physiological temperatures.
Dynamic regulation of VEGF-inducible genes by an ERK/ERG/p300 transcriptional network.
Fish, Jason E; Cantu Gutierrez, Manuel; Dang, Lan T; Khyzha, Nadiya; Chen, Zhiqi; Veitch, Shawn; Cheng, Henry S; Khor, Melvin; Antounians, Lina; Njock, Makon-Sébastien; Boudreau, Emilie; Herman, Alexander M; Rhyner, Alexander M; Ruiz, Oscar E; Eisenhoffer, George T; Medina-Rivera, Alejandra; Wilson, Michael D; Wythe, Joshua D
2017-07-01
The transcriptional pathways activated downstream of vascular endothelial growth factor (VEGF) signaling during angiogenesis remain incompletely characterized. By assessing the signals responsible for induction of the Notch ligand delta-like 4 (DLL4) in endothelial cells, we find that activation of the MAPK/ERK pathway mirrors the rapid and dynamic induction of DLL4 transcription and that this pathway is required for DLL4 expression. Furthermore, VEGF/ERK signaling induces phosphorylation and activation of the ETS transcription factor ERG, a prerequisite for DLL4 induction. Transcription of DLL4 coincides with dynamic ERG-dependent recruitment of the transcriptional co-activator p300. Genome-wide gene expression profiling identified a network of VEGF-responsive and ERG-dependent genes, and ERG chromatin immunoprecipitation (ChIP)-seq revealed the presence of conserved ERG-bound putative enhancer elements near these target genes. Functional experiments performed in vitro and in vivo confirm that this network of genes requires ERK, ERG and p300 activity. Finally, genome-editing and transgenic approaches demonstrate that a highly conserved ERG-bound enhancer located upstream of HLX (which encodes a transcription factor implicated in sprouting angiogenesis) is required for its VEGF-mediated induction. Collectively, these findings elucidate a novel transcriptional pathway contributing to VEGF-dependent angiogenesis. © 2017. Published by The Company of Biologists Ltd.
Genome-wide bisulfite sensitivity profiling of yeast suggests bisulfite inhibits transcription.
Segovia, Romulo; Mathew, Veena; Tam, Annie S; Stirling, Peter C
2017-09-01
Bisulfite, in the form of sodium bisulfite or metabisulfite, is used commercially as a food preservative. Bisulfite is used in the laboratory as a single-stranded DNA mutagen in epigenomic analyses of DNA methylation. Recently it has also been used on whole yeast cells to induce mutations in exposed single-stranded regions in vivo. To understand the effects of bisulfite on live cells we conducted a genome-wide screen for bisulfite sensitive mutants in yeast. Screening the deletion mutant array, and collections of essential gene mutants we define a genetic network of bisulfite sensitive mutants. Validation of screen hits revealed hyper-sensitivity of transcription and RNA processing mutants, rather than DNA repair pathways and follow-up analyses support a role in perturbation of RNA transactions. We propose a model in which bisulfite-modified nucleotides may interfere with transcription or RNA metabolism when used in vivo. Copyright © 2017 Elsevier B.V. All rights reserved.
De Novo Transcriptome Analysis of Medicinally Important Plantago ovata Using RNA-Seq
Kotwal, Shivanjali; Kaul, Sanjana; Sharma, Pooja; Gupta, Mehak; Shankar, Rama; Jain, Mukesh; Dhar, Manoj K.
2016-01-01
Plantago ovata is an economically and medicinally important plant of the family Plantaginaceae. It is used extensively for the production of seed husk for its application in pharmaceutical, food and cosmetic industries. In the present study, the transcriptome of P. ovata ovary was sequenced using Illumina Genome Analyzer platform to characterize the mucilage biosynthesis pathway in the plant. De novo assembly was carried out using Oases followed by velvet. A total of 46,955 non-redundant transcripts (≥100 bp) using ~29 million high-quality paired end reads were generated. Functional categorization of these transcripts revealed the presence of several genes involved in various biological processes like metabolic pathways, mucilage biosynthesis, biosynthesis of secondary metabolites and antioxidants. In addition, simple sequence-repeat motifs, non-coding RNAs and transcription factors were also identified. Expression profiling of some genes involved in mucilage biosynthetic pathway was performed in different tissues of P. ovata using Real time PCR analysis. The study has resulted in a valuable resource for further studies on gene expression, genomics and functional genomics in P. ovata. PMID:26943165
Characterizing genomic alterations in cancer by complementary functional associations.
Kim, Jong Wook; Botvinnik, Olga B; Abudayyeh, Omar; Birger, Chet; Rosenbluh, Joseph; Shrestha, Yashaswi; Abazeed, Mohamed E; Hammerman, Peter S; DiCara, Daniel; Konieczkowski, David J; Johannessen, Cory M; Liberzon, Arthur; Alizad-Rahvar, Amir Reza; Alexe, Gabriela; Aguirre, Andrew; Ghandi, Mahmoud; Greulich, Heidi; Vazquez, Francisca; Weir, Barbara A; Van Allen, Eliezer M; Tsherniak, Aviad; Shao, Diane D; Zack, Travis I; Noble, Michael; Getz, Gad; Beroukhim, Rameen; Garraway, Levi A; Ardakani, Masoud; Romualdi, Chiara; Sales, Gabriele; Barbie, David A; Boehm, Jesse S; Hahn, William C; Mesirov, Jill P; Tamayo, Pablo
2016-05-01
Systematic efforts to sequence the cancer genome have identified large numbers of mutations and copy number alterations in human cancers. However, elucidating the functional consequences of these variants, and their interactions to drive or maintain oncogenic states, remains a challenge in cancer research. We developed REVEALER, a computational method that identifies combinations of mutually exclusive genomic alterations correlated with functional phenotypes, such as the activation or gene dependency of oncogenic pathways or sensitivity to a drug treatment. We used REVEALER to uncover complementary genomic alterations associated with the transcriptional activation of β-catenin and NRF2, MEK-inhibitor sensitivity, and KRAS dependency. REVEALER successfully identified both known and new associations, demonstrating the power of combining functional profiles with extensive characterization of genomic alterations in cancer genomes.
Yan, Qian; Liu, Hou-Sheng; Yao, Dan; Li, Xin; Chen, Han; Dou, Yang; Wang, Yi; Pei, Yan; Xiao, Yue-Hua
2015-01-01
Basic/helix-loop-helix (bHLH) proteins comprise one of the largest transcription factor families and play important roles in diverse cellular and molecular processes. Comprehensive analyses of the composition and evolution of the bHLH family in cotton are essential to elucidate their functions and the molecular basis of cotton development. By searching bHLH homologous genes in sequenced diploid cotton genomes (Gossypium raimondii and G. arboreum), a set of cotton bHLH reference genes containing 289 paralogs were identified and named as GobHLH001-289. Based on their phylogenetic relationships, these cotton bHLH proteins were clustered into 27 subfamilies. Compared to those in Arabidopsis and cacao, cotton bHLH proteins generally increased in number, but unevenly in different subfamilies. To further uncover evolutionary changes of bHLH genes during tetraploidization of cotton, all genes of S5a and S5b subfamilies in upland cotton and its diploid progenitors were cloned and compared, and their transcript profiles were determined in upland cotton. A total of 10 genes of S5a and S5b subfamilies (doubled from A- and D-genome progenitors) maintained in tetraploid cottons. The major sequence changes in upland cotton included a 15-bp in-frame deletion in GhbHLH130D and a long terminal repeat retrotransposon inserted in GhbHLH062A, which eliminated GhbHLH062A expression in various tissues. The S5a and S5b bHLH genes of A and D genomes (except GobHLH062) showed similar transcription patterns in various tissues including roots, stems, leaves, petals, ovules, and fibers, while the A- and D-genome genes of GobHLH110 and GobHLH130 displayed clearly different transcript profiles during fiber development. In total, this study represented a genome-wide analysis of cotton bHLH family, and revealed significant changes in sequence and expression of these genes in tetraploid cottons, which paved the way for further functional analyses of bHLH genes in the cotton genus. PMID:25992947
Yan, Qian; Liu, Hou-Sheng; Yao, Dan; Li, Xin; Chen, Han; Dou, Yang; Wang, Yi; Pei, Yan; Xiao, Yue-Hua
2015-01-01
Basic/helix-loop-helix (bHLH) proteins comprise one of the largest transcription factor families and play important roles in diverse cellular and molecular processes. Comprehensive analyses of the composition and evolution of the bHLH family in cotton are essential to elucidate their functions and the molecular basis of cotton development. By searching bHLH homologous genes in sequenced diploid cotton genomes (Gossypium raimondii and G. arboreum), a set of cotton bHLH reference genes containing 289 paralogs were identified and named as GobHLH001-289. Based on their phylogenetic relationships, these cotton bHLH proteins were clustered into 27 subfamilies. Compared to those in Arabidopsis and cacao, cotton bHLH proteins generally increased in number, but unevenly in different subfamilies. To further uncover evolutionary changes of bHLH genes during tetraploidization of cotton, all genes of S5a and S5b subfamilies in upland cotton and its diploid progenitors were cloned and compared, and their transcript profiles were determined in upland cotton. A total of 10 genes of S5a and S5b subfamilies (doubled from A- and D-genome progenitors) maintained in tetraploid cottons. The major sequence changes in upland cotton included a 15-bp in-frame deletion in GhbHLH130D and a long terminal repeat retrotransposon inserted in GhbHLH062A, which eliminated GhbHLH062A expression in various tissues. The S5a and S5b bHLH genes of A and D genomes (except GobHLH062) showed similar transcription patterns in various tissues including roots, stems, leaves, petals, ovules, and fibers, while the A- and D-genome genes of GobHLH110 and GobHLH130 displayed clearly different transcript profiles during fiber development. In total, this study represented a genome-wide analysis of cotton bHLH family, and revealed significant changes in sequence and expression of these genes in tetraploid cottons, which paved the way for further functional analyses of bHLH genes in the cotton genus.
2013-01-01
Background Alloplasmic lines provide a unique tool to study nuclear-cytoplasmic interactions. Three alloplasmic lines, with nuclear genomes from Triticum aestivum and harboring cytoplasm from Aegilops uniaristata, Aegilops tauschii and Hordeum chilense, were investigated by transcript and metabolite profiling to identify the effects of cytoplasmic substitution on nuclear-cytoplasmic signaling mechanisms. Results In combining the wheat nuclear genome with a cytoplasm of H. chilense, 540 genes were significantly altered, whereas 11 and 28 genes were significantly changed in the alloplasmic lines carrying the cytoplasm of Ae. uniaristata or Ae. tauschii, respectively. We identified the RNA maturation-related process as one of the most sensitive to a perturbation of the nuclear-cytoplasmic interaction. Several key components of the ROS chloroplast retrograde signaling, together with the up-regulation of the ROS scavenging system, showed that changes in the chloroplast genome have a direct impact on nuclear-cytoplasmic cross-talk. Remarkably, the H. chilense alloplasmic line down-regulated some genes involved in the determination of cytoplasmic male sterility without expressing the male sterility phenotype. Metabolic profiling showed a comparable response of the central metabolism of the alloplasmic and euplasmic lines to light, while exposing larger metabolite alterations in the H. chilense alloplasmic line as compared with the Aegilops lines, in agreement with the transcriptomic data. Several stress-related metabolites, remarkably raffinose, were altered in content in the H. chilense alloplasmic line when exposed to high light, while amino acids, as well as organic acids were significantly decreased. Alterations in the levels of transcript, related to raffinose, and the photorespiration-related metabolisms were associated with changes in the level of related metabolites. Conclusion The replacement of a wheat cytoplasm with the cytoplasm of a related species affects the nuclear-cytoplasmic cross-talk leading to transcript and metabolite alterations. The extent of these modifications was limited in the alloplasmic lines with Aegilops cytoplasm, and more evident in the alloplasmic line with H. chilense cytoplasm. We consider that, this finding might be linked to the phylogenetic distance of the genomes. PMID:24320731
Sajuthi, Satria P; Sharma, Neeraj K; Comeau, Mary E; Chou, Jeff W; Bowden, Donald W; Freedman, Barry I; Langefeld, Carl D; Parks, John S; Das, Swapan K
2017-10-20
Dyslipidemia is a major contributor to the increased cardiovascular disease and mortality associated with obesity and type 2 diabetes. We hypothesized that variation in expression of adipose tissue transcripts is associated with serum lipid concentrations in African Americans (AAs), and common genetic variants regulate expression levels of these transcripts. Fasting serum lipid levels, genome-wide transcript expression profiles of subcutaneous adipose tissue, and genome-wide SNP genotypes were analyzed in a cohort of non-diabetic AAs (N=250). Serum triglyceride (TRIG) and high density lipoprotein-cholesterol (HDL-C) levels were associated (FDR<0.01) with expression level of 1021 and 1875 adipose tissue transcripts, respectively, but none associated with total cholesterol or LDL-C levels. Serum HDL-C-associated transcripts were enriched for salient biological pathways, including branched-chain amino acid degradation, and oxidative phosphorylation. Genes in immuno-inflammatory pathways were activated among individuals with higher serum TRIG levels. We identified significant cis-regulatory SNPs (cis-eSNPs) for 449 serum lipid-associated transcripts in adipose tissue. The cis-eSNPs of 12 genes were nominally associated (p<0.001) with serum lipid level in genome wide association studies in Global Lipids Genetics Consortium (GLGC) cohorts. Allelic effect direction of cis-eSNPs on expression of MARCH2, BEST1 and TMEM258 matched with effect direction of these SNP alleles on serum TRIG or HDL-C levels in GLGC cohorts. These data suggest that expressions of serum lipid-associated transcripts in adipose tissue are dependent on common cis-eSNPs in African Americans. Thus, genetically-mediated transcriptional regulation in adipose tissue may play a role in reducing HDL-C and increasing TRIG in serum. Copyright © 2017 Elsevier B.V. All rights reserved.
2010-01-01
Background De novo assembly of transcript sequences produced by short-read DNA sequencing technologies offers a rapid approach to obtain expressed gene catalogs for non-model organisms. A draft genome sequence will be produced in 2010 for a Eucalyptus tree species (E. grandis) representing the most important hardwood fibre crop in the world. Genome annotation of this valuable woody plant and genetic dissection of its superior growth and productivity will be greatly facilitated by the availability of a comprehensive collection of expressed gene sequences from multiple tissues and organs. Results We present an extensive expressed gene catalog for a commercially grown E. grandis × E. urophylla hybrid clone constructed using only Illumina mRNA-Seq technology and de novo assembly. A total of 18,894 transcript-derived contigs, a large proportion of which represent full-length protein coding genes were assembled and annotated. Analysis of assembly quality, length and diversity show that this dataset represent the most comprehensive expressed gene catalog for any Eucalyptus tree. mRNA-Seq analysis furthermore allowed digital expression profiling of all of the assembled transcripts across diverse xylogenic and non-xylogenic tissues, which is invaluable for ascribing putative gene functions. Conclusions De novo assembly of Illumina mRNA-Seq reads is an efficient approach for transcriptome sequencing and profiling in Eucalyptus and other non-model organisms. The transcriptome resource (Eucspresso, http://eucspresso.bi.up.ac.za/) generated by this study will be of value for genomic analysis of woody biomass production in Eucalyptus and for comparative genomic analysis of growth and development in woody and herbaceous plants. PMID:21122097
Yao, Bing; Lin, Li; Street, R Craig; Zalewski, Zachary A; Galloway, Jocelyn N; Wu, Hao; Nelson, David L; Jin, Peng
2014-02-15
Fragile X-associated tremor/ataxia syndrome (FXTAS) is a late-onset neurodegenerative disorder in which patients carry premutation alleles of 55-200 CGG repeats in the FMR1 gene. To date, whether alterations in epigenetic regulation modulate FXTAS has gone unexplored. 5-Hydroxymethylcytosine (5hmC) converted from 5-methylcytosine (5mC) by the ten-eleven translocation (TET) family of proteins has been found recently to play key roles in neuronal functions. Here, we undertook genome-wide profiling of cerebellar 5hmC in a FXTAS mouse model (rCGG mice) and found that rCGG mice at 16 weeks showed overall reduced 5hmC levels genome-wide compared with age-matched wild-type littermates. However, we also observed gain-of-5hmC regions in repetitive elements, as well as in cerebellum-specific enhancers, but not in general enhancers. Genomic annotation and motif prediction of wild-type- and rCGG-specific differential 5-hydroxymethylated regions (DhMRs) revealed their high correlation with genes and transcription factors that are important in neuronal developmental and functional pathways. DhMR-associated genes partially overlapped with genes that were differentially associated with ribosomes in CGG mice identified by bacTRAP ribosomal profiling. Taken together, our data strongly indicate a functional role for 5hmC-mediated epigenetic modulation in the etiology of FXTAS, possibly through the regulation of transcription.
Russell, Scott D; Gou, Xiaoping; Wong, Chui E; Wang, Xinkun; Yuan, Tong; Wei, Xiaoping; Bhalla, Prem L; Singh, Mohan B
2012-08-01
Genomic assay of sperm cell RNA provides insight into functional control, modes of regulation, and contributions of male gametes to double fertilization. Sperm cells of rice (Oryza sativa) were isolated from field-grown, disease-free plants and RNA was processed for use with the full-genome Affymetrix microarray. Comparison with Gene Expression Omnibus (GEO) reference arrays confirmed expressionally distinct gene profiles. A total of 10,732 distinct gene sequences were detected in sperm cells, of which 1668 were not expressed in pollen or seedlings. Pathways enriched in male germ cells included ubiquitin-mediated pathways, pathways involved in chromatin modeling including histones, histone modification and nonhistone epigenetic modification, and pathways related to RNAi and gene silencing. Genome-wide expression patterns in angiosperm sperm cells indicate common and divergent themes in the male germline that appear to be largely self-regulating through highly up-regulated chromatin modification pathways. A core of highly conserved genes appear common to all sperm cells, but evidence is still emerging that another class of genes have diverged in expression between monocots and dicots since their divergence. Sperm cell transcripts present at fusion may be transmitted through plasmogamy during double fertilization to effect immediate post-fertilization expression of early embryo and (or) endosperm development. © 2012 The Authors. New Phytologist © 2012 New Phytologist Trust.
Zou, Zhi; Yang, Lifu; Wang, Danhua; Huang, Qixing; Mo, Yeyong; Xie, Guishui
2016-01-01
WRKY proteins comprise one of the largest transcription factor families in plants and form key regulators of many plant processes. This study presents the characterization of 58 WRKY genes from the castor bean (Ricinus communis L., Euphorbiaceae) genome. Compared with the automatic genome annotation, one more WRKY-encoding locus was identified and 20 out of the 57 predicted gene models were manually corrected. All RcWRKY genes were shown to contain at least one intron in their coding sequences. According to the structural features of the present WRKY domains, the identified RcWRKY genes were assigned to three previously defined groups (I-III). Although castor bean underwent no recent whole-genome duplication event like physic nut (Jatropha curcas L., Euphorbiaceae), comparative genomics analysis indicated that one gene loss, one intron loss and one recent proximal duplication occurred in the RcWRKY gene family. The expression of all 58 RcWRKY genes was supported by ESTs and/or RNA sequencing reads derived from roots, leaves, flowers, seeds and endosperms. Further global expression profiles with RNA sequencing data revealed diverse expression patterns among various tissues. Results obtained from this study not only provide valuable information for future functional analysis and utilization of the castor bean WRKY genes, but also provide a useful reference to investigate the gene family expansion and evolution in Euphorbiaceus plants.
Deppdb--DNA electrostatic potential properties database: electrostatic properties of genome DNA.
Osypov, Alexander A; Krutinin, Gleb G; Kamzolova, Svetlana G
2010-06-01
The electrostatic properties of genome DNA influence its interactions with different proteins, in particular, the regulation of transcription by RNA-polymerases. DEPPDB--DNA Electrostatic Potential Properties Database--was developed to hold and provide all available information on the electrostatic properties of genome DNA combined with its sequence and annotation of biological and structural properties of genome elements and whole genomes. Genomes in DEPPDB are organized on a taxonomical basis. Currently, the database contains all the completely sequenced bacterial and viral genomes according to NCBI RefSeq. General properties of the genome DNA electrostatic potential profile and principles of its formation are revealed. This potential correlates with the GC content but does not correspond to it exactly and strongly depends on both the sequence arrangement and its context (flanking regions). Analysis of the promoter regions for bacterial and viral RNA polymerases revealed a correspondence between the scale of these proteins' physical properties and electrostatic profile patterns. We also discovered a direct correlation between the potential value and the binding frequency of RNA polymerase to DNA, supporting the idea of the role of electrostatics in these interactions. This matches a pronounced tendency of the promoter regions to possess higher values of the electrostatic potential.
Nascent Transcription Affected by RNA Polymerase IV in Zea mays
Erhard, Karl F.; Talbot, Joy-El R. B.; Deans, Natalie C.; McClish, Allison E.; Hollick, Jay B.
2015-01-01
All eukaryotes use three DNA-dependent RNA polymerases (RNAPs) to create cellular RNAs from DNA templates. Plants have additional RNAPs related to Pol II, but their evolutionary role(s) remain largely unknown. Zea mays (maize) RNA polymerase D1 (RPD1), the largest subunit of RNA polymerase IV (Pol IV), is required for normal plant development, paramutation, transcriptional repression of certain transposable elements (TEs), and transcriptional regulation of specific alleles. Here, we define the nascent transcriptomes of rpd1 mutant and wild-type (WT) seedlings using global run-on sequencing (GRO-seq) to identify the broader targets of RPD1-based regulation. Comparisons of WT and rpd1 mutant GRO-seq profiles indicate that Pol IV globally affects transcription at both transcriptional start sites and immediately downstream of polyadenylation addition sites. We found no evidence of divergent transcription from gene promoters as seen in mammalian GRO-seq profiles. Statistical comparisons identify genes and TEs whose transcription is affected by RPD1. Most examples of significant increases in genic antisense transcription appear to be initiated by 3ʹ-proximal long terminal repeat retrotransposons. These results indicate that maize Pol IV specifies Pol II-based transcriptional regulation for specific regions of the maize genome including genes having developmental significance. PMID:25653306
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hirsch, Candice N.; Hirsch, Cory D.; Brohammer, Alex B.
Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison ofmore » these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools.« less
Hirsch, Candice N.; Hirsch, Cory D.; Brohammer, Alex B.; ...
2016-11-01
Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison ofmore » these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools.« less
Soifer, Ilya; Barad, Omer; Shem-Tov, Doron; Baruch, Kobi; Lu, Fei; Hernandez, Alvaro G.; Wright, Chris L.; Koehler, Klaus; Buell, C. Robin; de Leon, Natalia
2016-01-01
Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison of these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools. PMID:27803309
Li, You; Heavican, Tayla B.; Vellichirammal, Neetha N.; Iqbal, Javeed
2017-01-01
Abstract The RNA-Seq technology has revolutionized transcriptome characterization not only by accurately quantifying gene expression, but also by the identification of novel transcripts like chimeric fusion transcripts. The ‘fusion’ or ‘chimeric’ transcripts have improved the diagnosis and prognosis of several tumors, and have led to the development of novel therapeutic regimen. The fusion transcript detection is currently accomplished by several software packages, primarily relying on sequence alignment algorithms. The alignment of sequencing reads from fusion transcript loci in cancer genomes can be highly challenging due to the incorrect mapping induced by genomic alterations, thereby limiting the performance of alignment-based fusion transcript detection methods. Here, we developed a novel alignment-free method, ChimeRScope that accurately predicts fusion transcripts based on the gene fingerprint (as k-mers) profiles of the RNA-Seq paired-end reads. Results on published datasets and in-house cancer cell line datasets followed by experimental validations demonstrate that ChimeRScope consistently outperforms other popular methods irrespective of the read lengths and sequencing depth. More importantly, results on our in-house datasets show that ChimeRScope is a better tool that is capable of identifying novel fusion transcripts with potential oncogenic functions. ChimeRScope is accessible as a standalone software at (https://github.com/ChimeRScope/ChimeRScope/wiki) or via the Galaxy web-interface at (https://galaxy.unmc.edu/). PMID:28472320
Regulating RNA polymerase pausing and transcription elongation in embryonic stem cells
Min, Irene M.; Waterfall, Joshua J.; Core, Leighton J.; Munroe, Robert J.; Schimenti, John; Lis, John T.
2011-01-01
Transitions between pluripotent stem cells and differentiated cells are executed by key transcription regulators. Comparative measurements of RNA polymerase distribution over the genome's primary transcription units in different cell states can identify the genes and steps in the transcription cycle that are regulated during such transitions. To identify the complete transcriptional profiles of RNA polymerases with high sensitivity and resolution, as well as the critical regulated steps upon which regulatory factors act, we used genome-wide nuclear run-on (GRO-seq) to map the density and orientation of transcriptionally engaged RNA polymerases in mouse embryonic stem cells (ESCs) and mouse embryonic fibroblasts (MEFs). In both cell types, progression of a promoter-proximal, paused RNA polymerase II (Pol II) into productive elongation is a rate-limiting step in transcription of ∼40% of mRNA-encoding genes. Importantly, quantitative comparisons between cell types reveal that transcription is controlled frequently at paused Pol II's entry into elongation. Furthermore, “bivalent” ESC genes (exhibiting both active and repressive histone modifications) bound by Polycomb group complexes PRC1 (Polycomb-repressive complex 1) and PRC2 show dramatically reduced levels of paused Pol II at promoters relative to an average gene. In contrast, bivalent promoters bound by only PRC2 allow Pol II pausing, but it is confined to extremely 5′ proximal regions. Altogether, these findings identify rate-limiting targets for transcription regulation during cell differentiation. PMID:21460038
Genome wide transcriptional profile analysis of Vitis amurensis in response to cold stress
USDA-ARS?s Scientific Manuscript database
Grape is one of the most important fruit crops worldwide and is cultivated on all of the continents except Antarctica. However, low temperatures can limit the geographical locations and productivity of grapes. Vitis amurensis is a wild grapevine species with remarkable cold-tolerance, exceeding th...
Herbicides are structurally diverse chemicals that inhibit plant-specific targets, however their off-target and potentially differentiating side-effects are less well defined. In this study, genome-wide expression profiling based on Affymetrix AtH1 arrays was used to identify dis...
In this study, genome-wide expression profiling based on Affymetrix ATH1 arrays was used to identify discriminating responses of Arabidopsis thaliana to five herbicides, which contain active ingredients targeting two different branches of amino acid biosynthesis. One herbicide co...
Establishment and functions of DNA methylation in the germline
Stewart, Kathleen R; Veselovska, Lenka; Kelsey, Gavin
2016-01-01
Epigenetic modifications established during gametogenesis regulate transcription and other nuclear processes in gametes, but also have influences in the zygote, embryo and postnatal life. This is best understood for DNA methylation which, established at discrete regions of the oocyte and sperm genomes, governs genomic imprinting. In this review, we describe how imprinting has informed our understanding of de novo DNA methylation mechanisms, highlight how recent genome-wide profiling studies have provided unprecedented insights into establishment of the sperm and oocyte methylomes and consider the fate and function of gametic methylation and other epigenetic modifications after fertilization. PMID:27659720
The histone H3 variant H3.3 regulates gene body DNA methylation in Arabidopsis thaliana.
Wollmann, Heike; Stroud, Hume; Yelagandula, Ramesh; Tarutani, Yoshiaki; Jiang, Danhua; Jing, Li; Jamge, Bhagyshree; Takeuchi, Hidenori; Holec, Sarah; Nie, Xin; Kakutani, Tetsuji; Jacobsen, Steven E; Berger, Frédéric
2017-05-18
Gene bodies of vertebrates and flowering plants are occupied by the histone variant H3.3 and DNA methylation. The origin and significance of these profiles remain largely unknown. DNA methylation and H3.3 enrichment profiles over gene bodies are correlated and both have a similar dependence on gene transcription levels. This suggests a mechanistic link between H3.3 and gene body methylation. We engineered an H3.3 knockdown in Arabidopsis thaliana and observed transcription reduction that predominantly affects genes responsive to environmental cues. When H3.3 levels are reduced, gene bodies show a loss of DNA methylation correlated with transcription levels. To study the origin of changes in DNA methylation profiles when H3.3 levels are reduced, we examined genome-wide distributions of several histone H3 marks, H2A.Z, and linker histone H1. We report that in the absence of H3.3, H1 distribution increases in gene bodies in a transcription-dependent manner. We propose that H3.3 prevents recruitment of H1, inhibiting H1's promotion of chromatin folding that restricts access to DNA methyltransferases responsible for gene body methylation. Thus, gene body methylation is likely shaped by H3.3 dynamics in conjunction with transcriptional activity.
GATA3 acts upstream of FOXA1 in mediating ESR1 binding by shaping enhancer accessibility.
Theodorou, Vasiliki; Stark, Rory; Menon, Suraj; Carroll, Jason S
2013-01-01
Estrogen receptor (ESR1) drives growth in the majority of human breast cancers by binding to regulatory elements and inducing transcription events that promote tumor growth. Differences in enhancer occupancy by ESR1 contribute to the diverse expression profiles and clinical outcome observed in breast cancer patients. GATA3 is an ESR1-cooperating transcription factor mutated in breast tumors; however, its genomic properties are not fully defined. In order to investigate the composition of enhancers involved in estrogen-induced transcription and the potential role of GATA3, we performed extensive ChIP-sequencing in unstimulated breast cancer cells and following estrogen treatment. We find that GATA3 is pivotal in mediating enhancer accessibility at regulatory regions involved in ESR1-mediated transcription. GATA3 silencing resulted in a global redistribution of cofactors and active histone marks prior to estrogen stimulation. These global genomic changes altered the ESR1-binding profile that subsequently occurred following estrogen, with events exhibiting both loss and gain in binding affinity, implying a GATA3-mediated redistribution of ESR1 binding. The GATA3-mediated redistributed ESR1 profile correlated with changes in gene expression, suggestive of its functionality. Chromatin loops at the TFF locus involving ESR1-bound enhancers occurred independently of ESR1 when GATA3 was silenced, indicating that GATA3, when present on the chromatin, may serve as a licensing factor for estrogen-ESR1-mediated interactions between cis-regulatory elements. Together, these experiments suggest that GATA3 directly impacts ESR1 enhancer accessibility, and may potentially explain the contribution of mutant-GATA3 in the heterogeneity of ESR1+ breast cancer.
GATA3 acts upstream of FOXA1 in mediating ESR1 binding by shaping enhancer accessibility
Theodorou, Vasiliki; Stark, Rory; Menon, Suraj; Carroll, Jason S.
2013-01-01
Estrogen receptor (ESR1) drives growth in the majority of human breast cancers by binding to regulatory elements and inducing transcription events that promote tumor growth. Differences in enhancer occupancy by ESR1 contribute to the diverse expression profiles and clinical outcome observed in breast cancer patients. GATA3 is an ESR1-cooperating transcription factor mutated in breast tumors; however, its genomic properties are not fully defined. In order to investigate the composition of enhancers involved in estrogen-induced transcription and the potential role of GATA3, we performed extensive ChIP-sequencing in unstimulated breast cancer cells and following estrogen treatment. We find that GATA3 is pivotal in mediating enhancer accessibility at regulatory regions involved in ESR1-mediated transcription. GATA3 silencing resulted in a global redistribution of cofactors and active histone marks prior to estrogen stimulation. These global genomic changes altered the ESR1-binding profile that subsequently occurred following estrogen, with events exhibiting both loss and gain in binding affinity, implying a GATA3-mediated redistribution of ESR1 binding. The GATA3-mediated redistributed ESR1 profile correlated with changes in gene expression, suggestive of its functionality. Chromatin loops at the TFF locus involving ESR1-bound enhancers occurred independently of ESR1 when GATA3 was silenced, indicating that GATA3, when present on the chromatin, may serve as a licensing factor for estrogen–ESR1-mediated interactions between cis-regulatory elements. Together, these experiments suggest that GATA3 directly impacts ESR1 enhancer accessibility, and may potentially explain the contribution of mutant-GATA3 in the heterogeneity of ESR1+ breast cancer. PMID:23172872
Tran, Frances; Penniket, Carolyn; Patel, Rohan V; Provart, Nicholas J; Laroche, André; Rowland, Owen; Robert, Laurian S
2013-06-01
Despite their importance, there remains a paucity of large-scale gene expression-based studies of reproductive development in species belonging to the Triticeae. As a first step to address this deficiency, a gene expression atlas of triticale reproductive development was generated using the 55K Affymetrix GeneChip(®) wheat genome array. The global transcriptional profiles of the anther/pollen, ovary and stigma were analyzed at concurrent developmental stages, and co-expressed as well as preferentially expressed genes were identified. Data analysis revealed both novel and conserved regulatory factors underlying Triticeae floral development and function. This comprehensive resource rests upon detailed gene annotations, and the expression profiles are readily accessible via a web browser. © 2013 Her Majesty the Queen in Right of Canada as represented by the Minister of Agriculture and Agri-Food Canada.
Schwartz, Thomas; Armant, Olivier; Bretschneider, Nancy; Hahn, Alexander; Kirchen, Silke; Seifert, Martin; Dötsch, Andreas
2015-01-01
The fitness of sensitive and resistant Pseudomonas aeruginosa in different aquatic environments depends on genetic capacities and transcriptional regulation. Therefore, an antibiotic-sensitive isolate PA30 and a multi-resistant isolate PA49 originating from waste waters were compared via whole genome and transcriptome Illumina sequencing after exposure to municipal waste water and tap water. A number of different genomic islands (e.g. PAGIs, PAPIs) were identified in the two environmental isolates beside the highly conserved core genome. Exposure to tap water and waste water exhibited similar transcriptional impacts on several gene clusters (antibiotic and metal resistance, genetic mobile elements, efflux pumps) in both environmental P. aeruginosa isolates. The MexCD-OprJ efflux pump was overexpressed in PA49 in response to waste water. The expression of resistance genes, genetic mobile elements in PA49 was independent from the water matrix. Consistently, the antibiotic sensitive strain PA30 did not show any difference in expression of the intrinsic resistance determinants and genetic mobile elements. Thus, the exposure of both isolates to polluted waste water and oligotrophic tap water resulted in similar expression profiles of mentioned genes. However, changes in environmental milieus resulted in rather unspecific transcriptional responses than selected and stimuli-specific gene regulation. PMID:25186059
Kuhl, U; Lassner, D; Dorner, A; Rohde, M; Escher, F; Seeberg, B; Hertel, E; Tschope, C; Skurk, C; Gross, U M; Schultheiss, H-P; Poller, W
2013-09-01
Recent studies have detected erythrovirus genomes in the hearts of cardiomyopathy and cardiac transplant patients. Assessment of the functional status of viruses may provide clinically important information beyond detection of the viral genomes. Here, we report transcriptional activation of cardiotropic erythrovirus to be associated with strongly altered myocardial gene expression in a distinct subgroup of cardiomyopathy patients. Endomyocardial biopsies (EMBs) from 415 consecutive cardiac erythrovirus (B19V)-positive patients with clinically suspected cardiomyopathy were screened for virus-encoded VP1/VP2 mRNA indicating transcriptional activation of the virus, and correlated with cardiac host gene expression patterns in transcriptionally active versus latent infections, and in virus-free control hearts. Transcriptional activity was detected in baseline biopsies of only 66/415 patients (15.9 %) harbouring erythrovirus. At the molecular level, significant differences between cardiac B19V-positive patients with transcriptionally active versus latent virus were revealed by expression profiling of EMBs. Importantly, latent B19V infection was indistinguishable from controls. Genes involved encode proteins of antiviral immune response, B19V receptor complex, and mitochondrial energy metabolism. Thus, functional mapping of erythrovirus allows definition of a subgroup of B19V-infected cardiomyopathy patients characterized by virus-encoded VP1/VP2 transcripts and anomalous host myocardial transcriptomes. Cardiac B19V reactivation from latency, as reported here for the first time, is a key factor required for erythrovirus to induce altered cardiac gene expression in a subgroup of cardiomyopathy patients. Virus genome detection is insufficient to assess pathogenic potential, but additional transcriptional mapping should be incorporated into future pathogenetic and therapeutic studies both in cardiology and transplantation medicine.
Transcriptional Profiling of Human Endogenous Retrovirus Group HERV-K(HML-2) Loci in Melanoma
Schmitt, Katja; Reichrath, Jörg; Roesch, Alexander; Meese, Eckart; Mayer, Jens
2013-01-01
Recent studies suggested a role for the human endogenous retrovirus (HERV) group HERV-K(HML-2) in melanoma because of upregulated transcription and expression of HERV-K(HML-2)-encoded proteins. Very little is known about which HML-2 loci are transcribed in melanoma. We assigned >1,400 HML-2 cDNA sequences generated from various melanoma and related samples to genomic HML-2 loci, identifying a total of 23 loci as transcribed. Transcription profiles of loci differed significantly between samples. One locus was found transcribed only in melanoma-derived samples but not in melanocytes and might represent a marker for melanoma. Several of the transcribed loci harbor ORFs for retroviral Gag and/or Env proteins. Env-encoding loci were transcribed only in melanoma. Specific investigation of rec and np9 transcripts indicated transcription of protein encoding loci in melanoma and melanocytes hinting at the relevance of Rec and Np9 in melanoma. UVB irradiation changed transcription profiles of loci and overall transcript levels decreased in melanoma and melanocytes. We further identified transcribed HML-2 loci formed by reverse transcription of spliced HML-2 transcripts by L1 machinery or in a retroviral fashion, with loci potentially encoding HML-2-like proteins. We reveal complex, sample-specific transcription of HML-2 loci in melanoma and related samples. Identified HML-2 loci and proteins encoded by those loci are particularly relevant for further studying the role of HML-2 in melanoma. Transcription of HERVs appears as a complex mechanism requiring specific studies to elucidate which HERV loci are transcribed and how transcribed HERVs may be involved in disease. PMID:23338945
Tang, Guo-Qing; Maxwell, E. Stuart
2008-01-01
The amphibian Xenopus provides a model organism for investigating microRNA expression during vertebrate embryogenesis and development. Searching available Xenopus genome databases using known human pre-miRNAs as query sequences, more than 300 genes encoding 142 Xenopus tropicalis miRNAs were identified. Analysis of Xenopus tropicalis miRNA genes revealed a predominate positioning within introns of protein-coding and nonprotein-coding RNA Pol II-transcribed genes. MiRNA genes were also located in pre-mRNA exons and positioned intergenically between known protein-coding genes. Many miRNA species were found in multiple locations and in more than one genomic context. MiRNA genes were also clustered throughout the genome, indicating the potential for the cotranscription and coordinate expression of miRNAs located in a given cluster. Northern blot analysis confirmed the expression of many identified miRNAs in both X. tropicalis and X. laevis. Comparison of X. tropicalis and X. laevis blots revealed comparable expression profiles, although several miRNAs exhibited species-specific expression in different tissues. More detailed analysis revealed that for some miRNAs, the tissue-specific expression profile of the pri-miRNA precursor was distinctly different from that of the mature miRNA profile. Differential miRNA precursor processing in both the nucleus and cytoplasm was implicated in the observed tissue-specific differences. These observations indicated that post-transcriptional processing plays an important role in regulating miRNA expression in the amphibian Xenopus. PMID:18032731
Jung, Ki-Hong; Dardick, Christopher; Bartley, Laura E; Cao, Peijian; Phetsom, Jirapa; Canlas, Patrick; Seo, Young-Su; Shultz, Michael; Ouyang, Shu; Yuan, Qiaoping; Frank, Bryan C; Ly, Eugene; Zheng, Li; Jia, Yi; Hsia, An-Ping; An, Kyungsook; Chou, Hui-Hsien; Rocke, David; Lee, Geun Cheol; Schnable, Patrick S; An, Gynheung; Buell, C Robin; Ronald, Pamela C
2008-10-06
Studies of gene function are often hampered by gene-redundancy, especially in organisms with large genomes such as rice (Oryza sativa). We present an approach for using transcriptomics data to focus functional studies and address redundancy. To this end, we have constructed and validated an inexpensive and publicly available rice oligonucleotide near-whole genome array, called the rice NSF45K array. We generated expression profiles for light- vs. dark-grown rice leaf tissue and validated the biological significance of the data by analyzing sources of variation and confirming expression trends with reverse transcription polymerase chain reaction. We examined trends in the data by evaluating enrichment of gene ontology terms at multiple false discovery rate thresholds. To compare data generated with the NSF45K array with published results, we developed publicly available, web-based tools (www.ricearray.org). The Oligo and EST Anatomy Viewer enables visualization of EST-based expression profiling data for all genes on the array. The Rice Multi-platform Microarray Search Tool facilitates comparison of gene expression profiles across multiple rice microarray platforms. Finally, we incorporated gene expression and biochemical pathway data to reduce the number of candidate gene products putatively participating in the eight steps of the photorespiration pathway from 52 to 10, based on expression levels of putatively functionally redundant genes. We confirmed the efficacy of this method to cope with redundancy by correctly predicting participation in photorespiration of a gene with five paralogs. Applying these methods will accelerate rice functional genomics.
Single-cell transcriptional analysis of taste sensory neuron pair in Caenorhabditis elegans.
Takayama, Jun; Faumont, Serge; Kunitomo, Hirofumi; Lockery, Shawn R; Iino, Yuichi
2010-01-01
The nervous system is composed of a wide variety of neurons. A description of the transcriptional profiles of each neuron would yield enormous information about the molecular mechanisms that define morphological or functional characteristics. Here we show that RNA isolation from single neurons is feasible by using an optimized mRNA tagging method. This method extracts transcripts in the target cells by co-immunoprecipitation of the complexes of RNA and epitope-tagged poly(A) binding protein expressed specifically in the cells. With this method and genome-wide microarray, we compared the transcriptional profiles of two functionally different neurons in the main C. elegans gustatory neuron class ASE. Eight of the 13 known subtype-specific genes were successfully detected. Additionally, we identified nine novel genes including a receptor guanylyl cyclase, secreted proteins, a TRPC channel and uncharacterized genes conserved among nematodes, suggesting the two neurons are substantially different than previously thought. The expression of these novel genes was controlled by the previously known regulatory network for subtype differentiation. We also describe unique motif organization within individual gene groups classified by the expression patterns in ASE. Our study paves the way to the complete catalog of the expression profiles of individual C. elegans neurons.
Stojnic, Robert; Fu, Audrey Qiuyan; Adryan, Boris
2012-01-01
Inferring the combinatorial regulatory code of transcription factors (TFs) from genome-wide TF binding profiles is challenging. A major reason is that TF binding profiles significantly overlap and are therefore highly correlated. Clustered occurrence of multiple TFs at genomic sites may arise from chromatin accessibility and local cooperation between TFs, or binding sites may simply appear clustered if the profiles are generated from diverse cell populations. Overlaps in TF binding profiles may also result from measurements taken at closely related time intervals. It is thus of great interest to distinguish TFs that directly regulate gene expression from those that are indirectly associated with gene expression. Graphical models, in particular Bayesian networks, provide a powerful mathematical framework to infer different types of dependencies. However, existing methods do not perform well when the features (here: TF binding profiles) are highly correlated, when their association with the biological outcome is weak, and when the sample size is small. Here, we develop a novel computational method, the Neighbourhood Consistent PC (NCPC) algorithms, which deal with these scenarios much more effectively than existing methods do. We further present a novel graphical representation, the Direct Dependence Graph (DDGraph), to better display the complex interactions among variables. NCPC and DDGraph can also be applied to other problems involving highly correlated biological features. Both methods are implemented in the R package ddgraph, available as part of Bioconductor (http://bioconductor.org/packages/2.11/bioc/html/ddgraph.html). Applied to real data, our method identified TFs that specify different classes of cis-regulatory modules (CRMs) in Drosophila mesoderm differentiation. Our analysis also found depletion of the early transcription factor Twist binding at the CRMs regulating expression in visceral and somatic muscle cells at later stages, which suggests a CRM-specific repression mechanism that so far has not been characterised for this class of mesodermal CRMs. PMID:23144600
Aberrantly Expressed OTX Homeobox Genes Deregulate B-Cell Differentiation in Hodgkin Lymphoma.
Nagel, Stefan; Ehrentraut, Stefan; Meyer, Corinna; Kaufmann, Maren; Drexler, Hans G; MacLeod, Roderick A F
2015-01-01
In Hodgkin lymphoma (HL) we recently reported that deregulated homeobox gene MSX1 mediates repression of the B-cell specific transcription factor ZHX2. In this study we investigated regulation of MSX1 in this B-cell malignancy. Accordingly, we analyzed expression and function of OTX homeobox genes which activate MSX1 transcription during embryonal development in the neural plate border region. Our data demonstrate that OTX1 and OTX2 are aberrantly expressed in both HL patients and cell lines. Moreover, both OTX loci are targeted by genomic gains in overexpressing cell lines. Comparative expression profiling and subsequent pathway modulations in HL cell lines indicated that aberrantly enhanced FGF2-signalling activates the expression of OTX2. Downstream analyses of OTX2 demonstrated transcriptional activation of genes encoding transcription factors MSX1, FOXC1 and ZHX1. Interestingly, examination of the physiological expression profile of ZHX1 in normal hematopoietic cells revealed elevated levels in T-cells and reduced expression in B-cells, indicating a discriminatory role in lymphopoiesis. Furthermore, two OTX-negative HL cell lines overexpressed ZHX1 in correlation with genomic amplification of its locus at chromosomal band 8q24, supporting the oncogenic potential of this gene in HL. Taken together, our data demonstrate that deregulated homeobox genes MSX1 and OTX2 respectively impact transcriptional inhibition of (B-cell specific) ZHX2 and activation of (T-cell specific) ZHX1. Thus, we show how reactivation of a specific embryonal gene regulatory network promotes disturbed B-cell differentiation in HL.
DNA methylation profiles of donor nuclei cells and tissues of cloned bovine fetuses.
Kremenskoy, Maksym; Kremenska, Yuliya; Suzuki, Masako; Imai, Kei; Takahashi, Seiya; Hashizume, Kazuyoshi; Yagi, Shintaro; Shiota, Kunio
2006-04-01
Methylation of DNA in CpG islands plays an important role during fetal development and differentiation because CpG islands are preferentially located in upstream regions of mammalian genomic DNA, including the transcription start site of housekeeping genes and are also associated with tissue-specific genes. Somatic nuclear transfer (NT) technology has been used to generate live clones in numerous mammalian species, but only a low percentage of nuclear transferred animals develop to term. Abnormal epigenetic changes in the CpG islands of donor nuclei after nuclear transfer could contribute to a high rate of abortion during early gestation and increase perinatal death. These changes have yet to be explored. Thus, we investigated the genome-wide DNA methylation profiles of CpG islands in nuclei donor cells and NT animals. Using Restriction Landmark Genomic Scanning (RLGS), we showed, for the first time, the epigenetic profile formation of tissues from NT bovine fetuses produced from cumulus cells. From approximately 2600 unmethylated NotI sites visualized on the RLGS profile, at least 35 NotI sites showed different methylation statuses. Moreover, we proved that fetal and placental tissues from artificially inseminated and cloned cattle have tissue-specific differences in the genome-wide methylation profiles of the CpG islands. We also found that possible abnormalities occurred in the fetal brain and placental tissues of cloned animals.
Lindgren, Emma; Hägg, Sara; Giordano, Fosco; Björkegren, Johan; Ström, Lena
2014-01-01
Genome integrity is fundamental for cell survival and cell cycle progression. Important mechanisms for keeping the genome intact are proper sister chromatid segregation, correct gene regulation and efficient repair of damaged DNA. Cohesin and its DNA loader, the Scc2/4 complex have been implicated in all these cellular actions. The gene regulation role has been described in several organisms. In yeast it has been suggested that the proteins in the cohesin network would effect transcription based on its role as insulator. More recently, data are emerging indicating direct roles for gene regulation also in yeast. Here we extend these studies by investigating whether the cohesin loader Scc2 is involved in regulation of gene expression. We performed global gene expression profiling in the absence and presence of DNA damage, in wild type and Scc2 deficient G2/M arrested cells, when it is known that Scc2 is important for DNA double strand break repair and formation of damage induced cohesion. We found that not only the DNA damage specific transcriptional response is distorted after inactivation of Scc2 but also the overall transcription profile. Interestingly, these alterations did not correlate with changes in cohesin binding. PMID:25483075
Genome-wide transcriptome and expression profile analysis of Phalaenopsis during explant browning.
Xu, Chuanjun; Zeng, Biyu; Huang, Junmei; Huang, Wen; Liu, Yumei
2015-01-01
Explant browning presents a major problem for in vitro culture, and can lead to the death of the explant and failure of regeneration. Considerable work has examined the physiological mechanisms underlying Phalaenopsis leaf explant browning, but the molecular mechanisms of browning remain elusive. In this study, we used whole genome RNA sequencing to examine Phalaenopsis leaf explant browning at genome-wide level. We first used Illumina high-throughput technology to sequence the transcriptome of Phalaenopsis and then performed de novo transcriptome assembly. We assembled 79,434,350 clean reads into 31,708 isogenes and generated 26,565 annotated unigenes. We assigned Gene Ontology (GO) terms, Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations, and potential Pfam domains to each transcript. Using the transcriptome data as a reference, we next analyzed the differential gene expression of explants cultured for 0, 3, and 6 d, respectively. We then identified differentially expressed genes (DEGs) before and after Phalaenopsis explant browning. We also performed GO, KEGG functional enrichment and Pfam analysis of all DEGs. Finally, we selected 11 genes for quantitative real-time PCR (qPCR) analysis to confirm the expression profile analysis. Here, we report the first comprehensive analysis of transcriptome and expression profiles during Phalaenopsis explant browning. Our results suggest that Phalaenopsis explant browning may be due in part to gene expression changes that affect the secondary metabolism, such as: phenylpropanoid pathway and flavonoid biosynthesis. Genes involved in photosynthesis and ATPase activity have been found to be changed at transcription level; these changes may perturb energy metabolism and thus lead to the decay of plant cells and tissues. This study provides comprehensive gene expression data for Phalaenopsis browning. Our data constitute an important resource for further functional studies to prevent explant browning.
Multi-targeted priming for genome-wide gene expression assays.
Adomas, Aleksandra B; Lopez-Giraldez, Francesc; Clark, Travis A; Wang, Zheng; Townsend, Jeffrey P
2010-08-17
Complementary approaches to assaying global gene expression are needed to assess gene expression in regions that are poorly assayed by current methodologies. A key component of nearly all gene expression assays is the reverse transcription of transcribed sequences that has traditionally been performed by priming the poly-A tails on many of the transcribed genes in eukaryotes with oligo-dT, or by priming RNA indiscriminately with random hexamers. We designed an algorithm to find common sequence motifs that were present within most protein-coding genes of Saccharomyces cerevisiae and of Neurospora crassa, but that were not present within their ribosomal RNA or transfer RNA genes. We then experimentally tested whether degenerately priming these motifs with multi-targeted primers improved the accuracy and completeness of transcriptomic assays. We discovered two multi-targeted primers that would prime a preponderance of genes in the genomes of Saccharomyces cerevisiae and Neurospora crassa while avoiding priming ribosomal RNA or transfer RNA. Examining the response of Saccharomyces cerevisiae to nitrogen deficiency and profiling Neurospora crassa early sexual development, we demonstrated that using multi-targeted primers in reverse transcription led to superior performance of microarray profiling and next-generation RNA tag sequencing. Priming with multi-targeted primers in addition to oligo-dT resulted in higher sensitivity, a larger number of well-measured genes and greater power to detect differences in gene expression. Our results provide the most complete and detailed expression profiles of the yeast nitrogen starvation response and N. crassa early sexual development to date. Furthermore, our multi-targeting priming methodology for genome-wide gene expression assays provides selective targeting of multiple sequences and counter-selection against undesirable sequences, facilitating a more complete and precise assay of the transcribed sequences within the genome.
Genome-Wide Transcriptome and Expression Profile Analysis of Phalaenopsis during Explant Browning
Xu, Chuanjun; Zeng, Biyu; Huang, Junmei; Huang, Wen; Liu, Yumei
2015-01-01
Background Explant browning presents a major problem for in vitro culture, and can lead to the death of the explant and failure of regeneration. Considerable work has examined the physiological mechanisms underlying Phalaenopsis leaf explant browning, but the molecular mechanisms of browning remain elusive. In this study, we used whole genome RNA sequencing to examine Phalaenopsis leaf explant browning at genome-wide level. Methodology/Principal Findings We first used Illumina high-throughput technology to sequence the transcriptome of Phalaenopsis and then performed de novo transcriptome assembly. We assembled 79,434,350 clean reads into 31,708 isogenes and generated 26,565 annotated unigenes. We assigned Gene Ontology (GO) terms, Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations, and potential Pfam domains to each transcript. Using the transcriptome data as a reference, we next analyzed the differential gene expression of explants cultured for 0, 3, and 6 d, respectively. We then identified differentially expressed genes (DEGs) before and after Phalaenopsis explant browning. We also performed GO, KEGG functional enrichment and Pfam analysis of all DEGs. Finally, we selected 11 genes for quantitative real-time PCR (qPCR) analysis to confirm the expression profile analysis. Conclusions/Significance Here, we report the first comprehensive analysis of transcriptome and expression profiles during Phalaenopsis explant browning. Our results suggest that Phalaenopsis explant browning may be due in part to gene expression changes that affect the secondary metabolism, such as: phenylpropanoid pathway and flavonoid biosynthesis. Genes involved in photosynthesis and ATPase activity have been found to be changed at transcription level; these changes may perturb energy metabolism and thus lead to the decay of plant cells and tissues. This study provides comprehensive gene expression data for Phalaenopsis browning. Our data constitute an important resource for further functional studies to prevent explant browning. PMID:25874455
The rubber tree genome shows expansion of gene family associated with rubber biosynthesis.
Lau, Nyok-Sean; Makita, Yuko; Kawashima, Mika; Taylor, Todd D; Kondo, Shinji; Othman, Ahmad Sofiman; Shu-Chien, Alexander Chong; Matsui, Minami
2016-06-24
Hevea brasiliensis Muell. Arg, a member of the family Euphorbiaceae, is the sole natural resource exploited for commercial production of high-quality natural rubber. The properties of natural rubber latex are almost irreplaceable by synthetic counterparts for many industrial applications. A paucity of knowledge on the molecular mechanisms of rubber biosynthesis in high yield traits still persists. Here we report the comprehensive genome-wide analysis of the widely planted H. brasiliensis clone, RRIM 600. The genome was assembled based on ~155-fold combined coverage with Illumina and PacBio sequence data and has a total length of 1.55 Gb with 72.5% comprising repetitive DNA sequences. A total of 84,440 high-confidence protein-coding genes were predicted. Comparative genomic analysis revealed strong synteny between H. brasiliensis and other Euphorbiaceae genomes. Our data suggest that H. brasiliensis's capacity to produce high levels of latex can be attributed to the expansion of rubber biosynthesis-related genes in its genome and the high expression of these genes in latex. Using cap analysis gene expression data, we illustrate the tissue-specific transcription profiles of rubber biosynthesis-related genes, revealing alternative means of transcriptional regulation. Our study adds to the understanding of H. brasiliensis biology and provides valuable genomic resources for future agronomic-related improvement of the rubber tree.
O'Brien, M.A.; Costin, B.N.; Miles, M.F.
2014-01-01
Postgenomic studies of the function of genes and their role in disease have now become an area of intense study since efforts to define the raw sequence material of the genome have largely been completed. The use of whole-genome approaches such as microarray expression profiling and, more recently, RNA-sequence analysis of transcript abundance has allowed an unprecedented look at the workings of the genome. However, the accurate derivation of such high-throughput data and their analysis in terms of biological function has been critical to truly leveraging the postgenomic revolution. This chapter will describe an approach that focuses on the use of gene networks to both organize and interpret genomic expression data. Such networks, derived from statistical analysis of large genomic datasets and the application of multiple bioinformatics data resources, poten-tially allow the identification of key control elements for networks associated with human disease, and thus may lead to derivation of novel therapeutic approaches. However, as discussed in this chapter, the leveraging of such networks cannot occur without a thorough understanding of the technical and statistical factors influencing the derivation of genomic expression data. Thus, while the catch phrase may be “it's the network … stupid,” the understanding of factors extending from RNA isolation to genomic profiling technique, multivariate statistics, and bioinformatics are all critical to defining fully useful gene networks for study of complex biology. PMID:23195313
Qin, Jin-Hong; Zhang, Qing; Zhang, Zhi-Ming; Zhong, Yi; Yang, Yang; Hu, Bao-Yu; Zhao, Guo-Ping; Guo, Xiao-Kui
2008-06-01
DNA microarray analysis was used to compare the differential gene expression profiles between Leptospira interrogans serovar Lai type strain 56601 and its corresponding attenuated strain IPAV. A 22-kb genomic island covering a cluster of 34 genes (i.e., genes LA0186 to LA0219) was actively expressed in both strains but concomitantly upregulated in strain 56601 in contrast to that of IPAV. Reverse transcription-PCR assays proved that the gene cluster comprised five transcripts. Gene annotation of this cluster revealed characteristics of a putative prophage-like remnant with at least 8 of 34 sequences encoding prophage-like proteins, of which the LA0195 protein is probably a putative prophage CI-like regulator. The transcription initiation activities of putative promoter-regulatory sequences of transcripts I, II, and III, all proximal to the LA0195 gene, were further analyzed in the Escherichia coli promoter probe vector pKK232-8 by assaying the reporter chloramphenicol acetyltransferase (CAT) activities. The strong promoter activities of both transcripts I and II indicated by the E. coli CAT assay were well correlated with the in vitro sequence-specific binding of the recombinant LA0195 protein to the corresponding promoter probes detected by the electrophoresis mobility shift assay. On the other hand, the promoter activity of transcript III was very low in E. coli and failed to show active binding to the LA0195 protein in vitro. These results suggested that the LA0195 protein is likely involved in the transcription of transcripts I and II. However, the identical complete DNA sequences of this prophage remnant from these two strains strongly suggests that possible regulatory factors or signal transduction systems residing outside of this region within the genome may be responsible for the differential expression profiling in these two strains.
2010-01-01
Background Systematic research on fish immunogenetics is indispensable in understanding the origin and evolution of immune systems. This has long been a challenging task because of the limited number of deep sequencing technologies and genome backgrounds of non-model fish available. The newly developed Solexa/Illumina RNA-seq and Digital gene expression (DGE) are high-throughput sequencing approaches and are powerful tools for genomic studies at the transcriptome level. This study reports the transcriptome profiling analysis of bacteria-challenged Lateolabrax japonicus using RNA-seq and DGE in an attempt to gain insights into the immunogenetics of marine fish. Results RNA-seq analysis generated 169,950 non-redundant consensus sequences, among which 48,987 functional transcripts with complete or various length encoding regions were identified. More than 52% of these transcripts are possibly involved in approximately 219 known metabolic or signalling pathways, while 2,673 transcripts were associated with immune-relevant genes. In addition, approximately 8% of the transcripts appeared to be fish-specific genes that have never been described before. DGE analysis revealed that the host transcriptome profile of Vibrio harveyi-challenged L. japonicus is considerably altered, as indicated by the significant up- or down-regulation of 1,224 strong infection-responsive transcripts. Results indicated an overall conservation of the components and transcriptome alterations underlying innate and adaptive immunity in fish and other vertebrate models. Analysis suggested the acquisition of numerous fish-specific immune system components during early vertebrate evolution. Conclusion This study provided a global survey of host defence gene activities against bacterial challenge in a non-model marine fish. Results can contribute to the in-depth study of candidate genes in marine fish immunity, and help improve current understanding of host-pathogen interactions and evolutionary history of immunogenetics from fish to mammals. PMID:20707909
Dalla Rosa, Ilaria; Zhang, Hongliang; Khiati, Salim; Wu, Xiaolin; Pommier, Yves
2017-12-08
Mitochondrial DNA (mtDNA) is essential for cell viability because it encodes subunits of the respiratory chain complexes. Mitochondrial topoisomerase IB (TOP1MT) facilitates mtDNA replication by removing DNA topological tensions produced during mtDNA transcription, but it appears to be dispensable. To test whether cells lacking TOP1MT have aberrant mtDNA transcription, we performed mitochondrial transcriptome profiling. To that end, we designed and implemented a customized tiling array, which enabled genome-wide, strand-specific, and simultaneous detection of all mitochondrial transcripts. Our technique revealed that Top1mt KO mouse cells process the mitochondrial transcripts normally but that protein-coding mitochondrial transcripts are elevated. Moreover, we found discrete long noncoding RNAs produced by H-strand transcription and encompassing the noncoding regulatory region of mtDNA in human and murine cells and tissues. Of note, these noncoding RNAs were strongly up-regulated in the absence of TOP1MT. In contrast, 7S DNA, produced by mtDNA replication, was reduced in the Top1mt KO cells. We propose that the long noncoding RNA species in the D-loop region are generated by the extension of H-strand transcripts beyond their canonical stop site and that TOP1MT acts as a topological barrier and regulator for mtDNA transcription and D-loop formation.
Bencun, Maja; Klinke, Olaf; Hotz-Wagenblatt, Agnes; Klaus, Severina; Tsai, Ming-Han; Poirey, Remy; Delecluse, Henri-Jacques
2018-04-06
The Epstein-Barr virus (EBV) genome encodes several hundred transcripts. We have used ribosome profiling to characterize viral translation in infected cells and map new translation initiation sites. We show here that EBV transcripts are translated with highly variable efficiency, owing to variable transcription and translation rates, variable ribosome recruitment to the leader region and coverage by monosomes versus polysomes. Some transcripts were hardly translated, others mainly carried monosomes, showed ribosome accumulation in leader regions and most likely represent non-coding RNAs. A similar process was visible for a subset of lytic genes including the key transactivators BZLF1 and BRLF1 in cells infected with weakly replicating EBV strains. This suggests that ribosome trapping, particularly in the leader region, represents a new checkpoint for the repression of lytic replication. We could identify 25 upstream open reading frames (uORFs) located upstream of coding transcripts that displayed 5' leader ribosome trapping, six of which were located in the leader region shared by many latent transcripts. These uORFs repressed viral translation and are likely to play an important role in the regulation of EBV translation.
Gilad, Yoav; Pritchard, Jonathan K.; Stephens, Matthew
2015-01-01
Understanding global gene regulation depends critically on accurate annotation of regulatory elements that are functional in a given cell type. CENTIPEDE, a powerful, probabilistic framework for identifying transcription factor binding sites from tissue-specific DNase I cleavage patterns and genomic sequence content, leverages the hypersensitivity of factor-bound chromatin and the information in the DNase I spatial cleavage profile characteristic of each DNA binding protein to accurately infer functional factor binding sites. However, the model for the spatial profile in this framework fails to account for the substantial variation in the DNase I cleavage profiles across different binding sites. Neither does it account for variation in the profiles at the same binding site across multiple replicate DNase I experiments, which are increasingly available. In this work, we introduce new methods, based on multi-scale models for inhomogeneous Poisson processes, to account for such variation in DNase I cleavage patterns both within and across binding sites. These models account for the spatial structure in the heterogeneity in DNase I cleavage patterns for each factor. Using DNase-seq measurements assayed in a lymphoblastoid cell line, we demonstrate the improved performance of this model for several transcription factors by comparing against the Chip-seq peaks for those factors. Finally, we explore the effects of DNase I sequence bias on inference of factor binding using a simple extension to our framework that allows for a more flexible background model. The proposed model can also be easily applied to paired-end ATAC-seq and DNase-seq data. msCentipede, a Python implementation of our algorithm, is available at http://rajanil.github.io/msCentipede. PMID:26406244
Raj, Anil; Shim, Heejung; Gilad, Yoav; Pritchard, Jonathan K; Stephens, Matthew
2015-01-01
Understanding global gene regulation depends critically on accurate annotation of regulatory elements that are functional in a given cell type. CENTIPEDE, a powerful, probabilistic framework for identifying transcription factor binding sites from tissue-specific DNase I cleavage patterns and genomic sequence content, leverages the hypersensitivity of factor-bound chromatin and the information in the DNase I spatial cleavage profile characteristic of each DNA binding protein to accurately infer functional factor binding sites. However, the model for the spatial profile in this framework fails to account for the substantial variation in the DNase I cleavage profiles across different binding sites. Neither does it account for variation in the profiles at the same binding site across multiple replicate DNase I experiments, which are increasingly available. In this work, we introduce new methods, based on multi-scale models for inhomogeneous Poisson processes, to account for such variation in DNase I cleavage patterns both within and across binding sites. These models account for the spatial structure in the heterogeneity in DNase I cleavage patterns for each factor. Using DNase-seq measurements assayed in a lymphoblastoid cell line, we demonstrate the improved performance of this model for several transcription factors by comparing against the Chip-seq peaks for those factors. Finally, we explore the effects of DNase I sequence bias on inference of factor binding using a simple extension to our framework that allows for a more flexible background model. The proposed model can also be easily applied to paired-end ATAC-seq and DNase-seq data. msCentipede, a Python implementation of our algorithm, is available at http://rajanil.github.io/msCentipede.
Baev, Vesselin; Milev, Ivan; Naydenov, Mladen; Vachev, Tihomir; Apostolova, Elena; Mehterov, Nikolay; Gozmanva, Mariyana; Minkov, Georgi; Sablok, Gaurav; Yahubyan, Galina
2014-11-01
Small RNA profiling and assessing its dependence on changing environmental factors have expanded our understanding of the transcriptional and post-transcriptional regulation of plant stress responses. Insufficient data have been documented earlier to depict the profiling of small RNA classes in temperature-associated stress which has a wide implication for climate change biology. In the present study, we report a comparative assessment of the genome-wide profiling of small RNAs in Arabidopsis thaliana using two conditional responses, induced by high- and low-temperature. Genome-wide profiling of small RNAs revealed an abundance of 21 nt small RNAs at low temperature, while high temperature showed an abundance of 21 nt and 24 nt small RNAs. The two temperature treatments altered the expression of a specific subset of mature miRNAs and displayed differential expression of a number of miRNA isoforms (isomiRs). Comparative analysis demonstrated that a large number of protein-coding genes can give rise to differentially expressed small RNAs following temperature shifts. Low temperature caused accumulation of small RNAs, corresponding to the sense strand of a number of cold-responsive genes. In contrast, high temperature stimulated the production of small RNAs of both polarities from genes encoding functionally diverse proteins. Copyright © 2014 Elsevier Masson SAS. All rights reserved.
Lee, Ann-Ying; Chen, Chun-Yi; Chang, Yao-Chien Alex; Chao, Ya-Ting; Shih, Ming-Che
2013-01-01
Previously we developed genomic resources for orchids, including transcriptomic analyses using next-generation sequencing techniques and construction of a web-based orchid genomic database. Here, we report a modified molecular model of flower development in the Orchidaceae based on functional analysis of gene expression profiles in Phalaenopsis aphrodite (a moth orchid) that revealed novel roles for the transcription factors involved in floral organ pattern formation. Phalaenopsis orchid floral organ-specific genes were identified by microarray analysis. Several critical transcription factors including AP3, PI, AP1 and AGL6, displayed distinct spatial distribution patterns. Phylogenetic analysis of orchid MADS box genes was conducted to infer the evolutionary relationship among floral organ-specific genes. The results suggest that gene duplication MADS box genes in orchid may have resulted in their gaining novel functions during evolution. Based on these analyses, a modified model of orchid flowering was proposed. Comparison of the expression profiles of flowers of a peloric mutant and wild-type Phalaenopsis orchid further identified genes associated with lip morphology and peloric effects. Large scale investigation of gene expression profiles revealed that homeotic genes from the ABCDE model of flower development classes A and B in the Phalaenopsis orchid have novel functions due to evolutionary diversification, and display differential expression patterns. PMID:24265826
An anatomically comprehensive atlas of the adult human brain transcriptome
Guillozet-Bongaarts, Angela L.; Shen, Elaine H.; Ng, Lydia; Miller, Jeremy A.; van de Lagemaat, Louie N.; Smith, Kimberly A.; Ebbert, Amanda; Riley, Zackery L.; Abajian, Chris; Beckmann, Christian F.; Bernard, Amy; Bertagnolli, Darren; Boe, Andrew F.; Cartagena, Preston M.; Chakravarty, M. Mallar; Chapin, Mike; Chong, Jimmy; Dalley, Rachel A.; David Daly, Barry; Dang, Chinh; Datta, Suvro; Dee, Nick; Dolbeare, Tim A.; Faber, Vance; Feng, David; Fowler, David R.; Goldy, Jeff; Gregor, Benjamin W.; Haradon, Zeb; Haynor, David R.; Hohmann, John G.; Horvath, Steve; Howard, Robert E.; Jeromin, Andreas; Jochim, Jayson M.; Kinnunen, Marty; Lau, Christopher; Lazarz, Evan T.; Lee, Changkyu; Lemon, Tracy A.; Li, Ling; Li, Yang; Morris, John A.; Overly, Caroline C.; Parker, Patrick D.; Parry, Sheana E.; Reding, Melissa; Royall, Joshua J.; Schulkin, Jay; Sequeira, Pedro Adolfo; Slaughterbeck, Clifford R.; Smith, Simon C.; Sodt, Andy J.; Sunkin, Susan M.; Swanson, Beryl E.; Vawter, Marquis P.; Williams, Derric; Wohnoutka, Paul; Zielke, H. Ronald; Geschwind, Daniel H.; Hof, Patrick R.; Smith, Stephen M.; Koch, Christof; Grant, Seth G. N.; Jones, Allan R.
2014-01-01
Neuroanatomically precise, genome-wide maps of transcript distributions are critical resources to complement genomic sequence data and to correlate functional and genetic brain architecture. Here we describe the generation and analysis of a transcriptional atlas of the adult human brain, comprising extensive histological analysis and comprehensive microarray profiling of ~900 neuroanatomically precise subdivisions in two individuals. Transcriptional regulation varies enormously by anatomical location, with different regions and their constituent cell types displaying robust molecular signatures that are highly conserved between individuals. Analysis of differential gene expression and gene co-expression relationships demonstrates that brain-wide variation strongly reflects the distributions of major cell classes such as neurons, oligodendrocytes, astrocytes and microglia. Local neighbourhood relationships between fine anatomical subdivisions are associated with discrete neuronal subtypes and genes involved with synaptic transmission. The neocortex displays a relatively homogeneous transcriptional pattern, but with distinct features associated selectively with primary sensorimotor cortices and with enriched frontal lobe expression. Notably, the spatial topography of the neocortex is strongly reflected in its molecular topography— the closer two cortical regions, the more similar their transcriptomes. This freely accessible online data resource forms a high-resolution transcriptional baseline for neurogenetic studies of normal and abnormal human brain function. PMID:22996553
Charfeddine, Mariam; Saïdi, Mohamed Najib; Charfeddine, Safa; Hammami, Asma; Gargouri Bouzid, Radhia
2015-04-01
The ERF transcription factors belong to the AP2/ERF superfamily, one of the largest transcription factor families in plants. They play important roles in plant development processes, as well as in the response to biotic, abiotic, and hormone signaling. In the present study, 155 putative ERF transcription factor genes were identified from the potato (Solanum tuberosum) genome database, and compared with those from Arabidopsis thaliana. The StERF proteins are divided into ten phylogenetic groups. Expression analyses of five StERFs were carried out by semi-quantitative RT-PCR and compared with published RNA-seq data. These latter analyses were used to distinguish tissue-specific, biotic, and abiotic stress genes as well as hormone-responsive StERF genes. The results are of interest to better understand the role of the AP2/ERF genes in response to diverse types of stress in potatoes. A comprehensive analysis of the physiological functions and biological roles of the ERF family genes in S. tuberosum is required to understand crop stress tolerance mechanisms.
De novo-based transcriptome profiling of male-sterile and fertile watermelon lines
Seo, Minseok; Jang, Yoon Jeong; Sim, Tae Yong; Cho, Seoae; Han, Sang-Wook
2017-01-01
The whole-genome sequence of watermelon (Citrullus lanatus (Thunb.) Matsum. & Nakai), a valuable horticultural crop worldwide, was released in 2013. Here, we compared a de novo-based approach (DBA) to a reference-based approach (RBA) using RNA-seq data, to aid in efforts to improve the annotation of the watermelon reference genome and to obtain biological insight into male-sterility in watermelon. We applied these techniques to available data from two watermelon lines: the male-sterile line DAH3615-MS and the male-fertile line DAH3615. Using DBA, we newly annotated 855 watermelon transcripts, and found gene functional clusters predicted to be related to stimulus responses, nucleic acid binding, transmembrane transport, homeostasis, and Golgi/vesicles. Among the DBA-annotated transcripts, 138 de novo-exclusive differentially-expressed genes (DEDEGs) related to male sterility were detected. Out of 33 randomly selected newly annotated transcripts and DEDEGs, 32 were validated by RT-qPCR. This study demonstrates the usefulness and reliability of the de novo transcriptome assembly in watermelon, and provides new insights for researchers exploring transcriptional blueprints with regard to the male sterility. PMID:29095876
Lam, L T; Pickeral, O K; Peng, A C; Rosenwald, A; Hurt, E M; Giltnane, J M; Averett, L M; Zhao, H; Davis, R E; Sathyamoorthy, M; Wahl, L M; Harris, E D; Mikovits, J A; Monks, A P; Hollingshead, M G; Sausville, E A; Staudt, L M
2001-01-01
Flavopiridol, a flavonoid currently in cancer clinical trials, inhibits cyclin-dependent kinases (CDKs) by competitively blocking their ATP-binding pocket. However, the mechanism of action of flavopiridol as an anti-cancer agent has not been fully elucidated. Using DNA microarrays, we found that flavopiridol inhibited gene expression broadly, in contrast to two other CDK inhibitors, roscovitine and 9-nitropaullone. The gene expression profile of flavopiridol closely resembled the profiles of two transcription inhibitors, actinomycin D and 5,6-dichloro-1-beta-D-ribofuranosyl-benzimidazole (DRB), suggesting that flavopiridol inhibits transcription globally. We were therefore able to use flavopiridol to measure mRNA turnover rates comprehensively and we found that different functional classes of genes had distinct distributions of mRNA turnover rates. In particular, genes encoding apoptosis regulators frequently had very short half-lives, as did several genes encoding key cell-cycle regulators. Strikingly, genes that were transcriptionally inducible were disproportionately represented in the class of genes with rapid mRNA turnover. The present genomic-scale measurement of mRNA turnover uncovered a regulatory logic that links gene function with mRNA half-life. The observation that transcriptionally inducible genes often have short mRNA half-lives demonstrates that cells have a coordinated strategy to rapidly modulate the mRNA levels of these genes. In addition, the present results suggest that flavopiridol may be more effective against types of cancer that are highly dependent on genes with unstable mRNAs.
Genome-Wide Expression Profiling of Complex Regional Pain Syndrome
Jin, Eun-Heui; Zhang, Enji; Ko, Youngkwon; Sim, Woo Seog; Moon, Dong Eon; Yoon, Keon Jung; Hong, Jang Hee; Lee, Won Hyung
2013-01-01
Complex regional pain syndrome (CRPS) is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II) and 5 controls (cut-off value: 1.5-fold change and p<0.05). Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1), matrix metalloproteinase 9 (MMP9), alanine aminopeptidase N (ANPEP), l-histidine decarboxylase (HDC), granulocyte colony-stimulating factor 3 receptor (G-CSF3R), and signal transducer and activator of transcription 3 (STAT3) genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10−4). The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression. PMID:24244504
Divergent transcription is associated with promoters of transcriptional regulators
2013-01-01
Background Divergent transcription is a wide-spread phenomenon in mammals. For instance, short bidirectional transcripts are a hallmark of active promoters, while longer transcripts can be detected antisense from active genes in conditions where the RNA degradation machinery is inhibited. Moreover, many described long non-coding RNAs (lncRNAs) are transcribed antisense from coding gene promoters. However, the general significance of divergent lncRNA/mRNA gene pair transcription is still poorly understood. Here, we used strand-specific RNA-seq with high sequencing depth to thoroughly identify antisense transcripts from coding gene promoters in primary mouse tissues. Results We found that a substantial fraction of coding-gene promoters sustain divergent transcription of long non-coding RNA (lncRNA)/mRNA gene pairs. Strikingly, upstream antisense transcription is significantly associated with genes related to transcriptional regulation and development. Their promoters share several characteristics with those of transcriptional developmental genes, including very large CpG islands, high degree of conservation and epigenetic regulation in ES cells. In-depth analysis revealed a unique GC skew profile at these promoter regions, while the associated coding genes were found to have large first exons, two genomic features that might enforce bidirectional transcription. Finally, genes associated with antisense transcription harbor specific H3K79me2 epigenetic marking and RNA polymerase II enrichment profiles linked to an intensified rate of early transcriptional elongation. Conclusions We concluded that promoters of a class of transcription regulators are characterized by a specialized transcriptional control mechanism, which is directly coupled to relaxed bidirectional transcription. PMID:24365181
Guo, Yong; Qiu, Li-Juan
2013-01-01
The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.
Single-cell epigenomics: powerful new methods for understanding gene regulation and cell identity.
Clark, Stephen J; Lee, Heather J; Smallwood, Sébastien A; Kelsey, Gavin; Reik, Wolf
2016-04-18
Emerging single-cell epigenomic methods are being developed with the exciting potential to transform our knowledge of gene regulation. Here we review available techniques and future possibilities, arguing that the full potential of single-cell epigenetic studies will be realized through parallel profiling of genomic, transcriptional, and epigenetic information.
Zhao, Feng; Yan, Chao; Wang, Xuan; Yang, Yang; Wang, Guangyin; Lee, Wenhui; Xiang, Yang; Zhang, Yun
2014-01-01
Amphibians occupy a key phylogenetic position in vertebrates and evolution of the immune system. But, the resources of its transcriptome or genome are still little now. Bombina maxima possess strong ability to survival in very harsh environment with a more mature immune system. We obtained a comprehensive transcriptome by RNA-sequencing technology. 14.3% of transcripts were identified to be skin-specific genes, most of which were not isolated from skin secretion in previous works or novel non-coding RNAs. 27.9% of transcripts were mapped into 242 predicted KEGG pathways and 6.16% of transcripts related to human disease and cancer. Of 39 448 transcripts with the coding sequence, at least 1501 transcripts (570 genes) related to the immune system process. The molecules of immune signalling pathway were almost presented, several transcripts with high expression in skin and stomach. Experiments showed that lipopolysaccharide or bacteria challenge stimulated pro-inflammatory cytokine production and activation of pro-inflammatory caspase-1. These frog's data can remarkably expand the existing genome or transcriptome resources of amphibians, especially immunity data. The entity of the data provides a valuable platform for further investigation on more detailed immune response in B. maxima and a comparative study with other amphibians. PMID:23942912
Nascent-Seq reveals novel features of mouse circadian transcriptional regulation
Menet, Jerome S; Rodriguez, Joseph; Abruzzi, Katharine C; Rosbash, Michael
2012-01-01
A substantial fraction of the metazoan transcriptome undergoes circadian oscillations in many cells and tissues. Based on the transcription feedback loops important for circadian timekeeping, it is commonly assumed that this mRNA cycling reflects widespread transcriptional regulation. To address this issue, we directly measured the circadian dynamics of mouse liver transcription using Nascent-Seq (genome-wide sequencing of nascent RNA). Although many genes are rhythmically transcribed, many rhythmic mRNAs manifest poor transcriptional rhythms, indicating a prominent contribution of post-transcriptional regulation to circadian mRNA expression. This analysis of rhythmic transcription also showed that the rhythmic DNA binding profile of the transcription factors CLOCK and BMAL1 does not determine the transcriptional phase of most target genes. This likely reflects gene-specific collaborations of CLK:BMAL1 with other transcription factors. These insights from Nascent-Seq indicate that it should have broad applicability to many other gene expression regulatory issues. DOI: http://dx.doi.org/10.7554/eLife.00011.001 PMID:23150795
Pandey, Ashutosh; Alok, Anshu; Lakhwani, Deepika; Singh, Jagdeep; Asif, Mehar H.; Trivedi, Prabodh K.
2016-01-01
Flavonoid biosynthesis is largely regulated at the transcriptional level due to the modulated expression of genes related to the phenylpropanoid pathway in plants. Although accumulation of different flavonoids has been reported in banana, a staple fruit crop, no detailed information is available on regulation of the biosynthesis in this important plant. We carried out genome-wide analysis of banana (Musa acuminata, AAA genome) and identified 28 genes belonging to 9 gene families associated with flavonoid biosynthesis. Expression analysis suggested spatial and temporal regulation of the identified genes in different tissues of banana. Analysis revealed enhanced expression of genes related to flavonol and proanthocyanidin (PA) biosynthesis in peel and pulp at the early developmental stages of fruit. Genes involved in anthocyanin biosynthesis were highly expressed during banana fruit ripening. In general, higher accumulation of metabolites was observed in the peel as compared to pulp tissue. A correlation between expression of genes and metabolite content was observed at the early stage of fruit development. Furthermore, this study also suggests regulation of flavonoid biosynthesis, at transcriptional level, under light and dark exposures as well as methyl jasmonate (MJ) treatment in banana. PMID:27539368
Pandey, Ashutosh; Alok, Anshu; Lakhwani, Deepika; Singh, Jagdeep; Asif, Mehar H; Trivedi, Prabodh K
2016-08-19
Flavonoid biosynthesis is largely regulated at the transcriptional level due to the modulated expression of genes related to the phenylpropanoid pathway in plants. Although accumulation of different flavonoids has been reported in banana, a staple fruit crop, no detailed information is available on regulation of the biosynthesis in this important plant. We carried out genome-wide analysis of banana (Musa acuminata, AAA genome) and identified 28 genes belonging to 9 gene families associated with flavonoid biosynthesis. Expression analysis suggested spatial and temporal regulation of the identified genes in different tissues of banana. Analysis revealed enhanced expression of genes related to flavonol and proanthocyanidin (PA) biosynthesis in peel and pulp at the early developmental stages of fruit. Genes involved in anthocyanin biosynthesis were highly expressed during banana fruit ripening. In general, higher accumulation of metabolites was observed in the peel as compared to pulp tissue. A correlation between expression of genes and metabolite content was observed at the early stage of fruit development. Furthermore, this study also suggests regulation of flavonoid biosynthesis, at transcriptional level, under light and dark exposures as well as methyl jasmonate (MJ) treatment in banana.
Stevenson, Clare E. M.; Assaad, Aoun; Chandra, Govind; Le, Tung B. K.; Greive, Sandra J.; Bibb, Mervyn J.; Lawson, David M.
2013-01-01
Consistent with their complex lifestyles and rich secondary metabolite profiles, the genomes of streptomycetes encode a plethora of transcription factors, the vast majority of which are uncharacterized. Herein, we use Surface Plasmon Resonance (SPR) to identify and delineate putative operator sites for SCO3205, a MarR family transcriptional regulator from Streptomyces coelicolor that is well represented in sequenced actinomycete genomes. In particular, we use a novel SPR footprinting approach that exploits indirect ligand capture to vastly extend the lifetime of a standard streptavidin SPR chip. We define two operator sites upstream of sco3205 and a pseudopalindromic consensus sequence derived from these enables further potential operator sites to be identified in the S. coelicolor genome. We evaluate each of these through SPR and test the importance of the conserved bases within the consensus sequence. Informed by these results, we determine the crystal structure of a SCO3205-DNA complex at 2.8 Å resolution, enabling molecular level rationalization of the SPR data. Taken together, our observations support a DNA recognition mechanism involving both direct and indirect sequence readout. PMID:23748564
2014-01-01
Background Genome-wide microarrays have been useful for predicting chemical-genetic interactions at the gene level. However, interpreting genome-wide microarray results can be overwhelming due to the vast output of gene expression data combined with off-target transcriptional responses many times induced by a drug treatment. This study demonstrates how experimental and computational methods can interact with each other, to arrive at more accurate predictions of drug-induced perturbations. We present a two-stage strategy that links microarray experimental testing and network training conditions to predict gene perturbations for a drug with a known mechanism of action in a well-studied organism. Results S. cerevisiae cells were treated with the antifungal, fluconazole, and expression profiling was conducted under different biological conditions using Affymetrix genome-wide microarrays. Transcripts were filtered with a formal network-based method, sparse simultaneous equation models and Lasso regression (SSEM-Lasso), under different network training conditions. Gene expression results were evaluated using both gene set and single gene target analyses, and the drug’s transcriptional effects were narrowed first by pathway and then by individual genes. Variables included: (i) Testing conditions – exposure time and concentration and (ii) Network training conditions – training compendium modifications. Two analyses of SSEM-Lasso output – gene set and single gene – were conducted to gain a better understanding of how SSEM-Lasso predicts perturbation targets. Conclusions This study demonstrates that genome-wide microarrays can be optimized using a two-stage strategy for a more in-depth understanding of how a cell manifests biological reactions to a drug treatment at the transcription level. Additionally, a more detailed understanding of how the statistical model, SSEM-Lasso, propagates perturbations through a network of gene regulatory interactions is achieved. PMID:24444313
Wan, Cen; Lees, Jonathan G; Minneci, Federico; Orengo, Christine A; Jones, David T
2017-10-01
Accurate gene or protein function prediction is a key challenge in the post-genome era. Most current methods perform well on molecular function prediction, but struggle to provide useful annotations relating to biological process functions due to the limited power of sequence-based features in that functional domain. In this work, we systematically evaluate the predictive power of temporal transcription expression profiles for protein function prediction in Drosophila melanogaster. Our results show significantly better performance on predicting protein function when transcription expression profile-based features are integrated with sequence-derived features, compared with the sequence-derived features alone. We also observe that the combination of expression-based and sequence-based features leads to further improvement of accuracy on predicting all three domains of gene function. Based on the optimal feature combinations, we then propose a novel multi-classifier-based function prediction method for Drosophila melanogaster proteins, FFPred-fly+. Interpreting our machine learning models also allows us to identify some of the underlying links between biological processes and developmental stages of Drosophila melanogaster.
Angelastro, James M.; Klimaschewski, Lars; Tang, Song; Vitolo, Ottavio V.; Weissman, Tamily A.; Donlin, Laura T.; Shelanski, Michael L.; Greene, Lloyd A.
2000-01-01
Neurotrophic factors such as nerve growth factor (NGF) promote a wide variety of responses in neurons, including differentiation, survival, plasticity, and repair. Such actions often require changes in gene expression. To identify the regulated genes and thereby to more fully understand the NGF mechanism, we carried out serial analysis of gene expression (SAGE) profiling of transcripts derived from rat PC12 cells before and after NGF-promoted neuronal differentiation. Multiple criteria supported the reliability of the profile. Approximately 157,000 SAGE tags were analyzed, representing at least 21,000 unique transcripts. Of these, nearly 800 were regulated by 6-fold or more in response to NGF. Approximately 150 of the regulated transcripts have been matched to named genes, the majority of which were not previously known to be NGF-responsive. Functional categorization of the regulated genes provides insight into the complex, integrated mechanism by which NGF promotes its multiple actions. It is anticipated that as genomic sequence information accrues the data derived here will continue to provide information about neurotrophic factor mechanisms. PMID:10984536
Bajaj, Deepak; Upadhyaya, Hari D.; Khan, Yusuf; Das, Shouvik; Badoni, Saurabh; Shree, Tanima; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Singh, Sube; Sharma, Shivali; Tyagi, Akhilesh K.; Chattopdhyay, Debasis; Parida, Swarup K.
2015-01-01
High experimental validation/genotyping success rate (94–96%) and intra-specific polymorphic potential (82–96%) of 1536 SNP and 472 SSR markers showing in silico polymorphism between desi ICC 4958 and kabuli ICC 12968 chickpea was obtained in a 190 mapping population (ICC 4958 × ICC 12968) and 92 diverse desi and kabuli genotypes. A high-density 2001 marker-based intra-specific genetic linkage map comprising of eight LGs constructed is comparatively much saturated (mean map-density: 0.94 cM) in contrast to existing intra-specific genetic maps in chickpea. Fifteen robust QTLs (PVE: 8.8–25.8% with LOD: 7.0–13.8) associated with pod and seed number/plant (PN and SN) and 100 seed weight (SW) were identified and mapped on 10 major genomic regions of eight LGs. One of 126.8 kb major genomic region harbouring a strong SW-associated robust QTL (Caq'SW1.1: 169.1–171.3 cM) has been delineated by integrating high-resolution QTL mapping with comprehensive marker-based comparative genome mapping and differential expression profiling. This identified one potential regulatory SNP (G/A) in the cis-acting element of candidate ERF (ethylene responsive factor) TF (transcription factor) gene governing seed weight in chickpea. The functionally relevant molecular tags identified have potential to be utilized for marker-assisted genetic improvement of chickpea. PMID:25786576
A genome-wide SNP scan accelerates trait-regulatory genomic loci identification in chickpea
Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C.L.L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
We identified 44844 high-quality SNPs by sequencing 92 diverse chickpea accessions belonging to a seed and pod trait-specific association panel using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays. A GWAS (genome-wide association study) in an association panel of 211, including the 92 sequenced accessions, identified 22 major genomic loci showing significant association (explaining 23–47% phenotypic variation) with pod and seed number/plant and 100-seed weight. Eighteen trait-regulatory major genomic loci underlying 13 robust QTLs were validated and mapped on an intra-specific genetic linkage map by QTL mapping. A combinatorial approach of GWAS, QTL mapping and gene haplotype-specific LD mapping and transcript profiling uncovered one superior haplotype and favourable natural allelic variants in the upstream regulatory region of a CesA-type cellulose synthase (Ca_Kabuli_CesA3) gene regulating high pod and seed number/plant (explaining 47% phenotypic variation) in chickpea. The up-regulation of this superior gene haplotype correlated with increased transcript expression of Ca_Kabuli_CesA3 gene in the pollen and pod of high pod/seed number accession, resulting in higher cellulose accumulation for normal pollen and pollen tube growth. A rapid combinatorial genome-wide SNP genotyping-based approach has potential to dissect complex quantitative agronomic traits and delineate trait-regulatory genomic loci (candidate genes) for genetic enhancement in crop plants, including chickpea. PMID:26058368
LeBlanc, Chantal; Lee, Tae-Jin; Mulvaney, Patrick; Allen, George C.; Martienssen, Robert A.; Thompson, William F.
2017-01-01
All plants and animals must replicate their DNA, using a regulated process to ensure that their genomes are completely and accurately replicated. DNA replication timing programs have been extensively studied in yeast and animal systems, but much less is known about the replication programs of plants. We report a novel adaptation of the “Repli-seq” assay for use in intact root tips of maize (Zea mays) that includes several different cell lineages and present whole-genome replication timing profiles from cells in early, mid, and late S phase of the mitotic cell cycle. Maize root tips have a complex replication timing program, including regions of distinct early, mid, and late S replication that each constitute between 20 and 24% of the genome, as well as other loci corresponding to ∼32% of the genome that exhibit replication activity in two different time windows. Analyses of genomic, transcriptional, and chromatin features of the euchromatic portion of the maize genome provide evidence for a gradient of early replicating, open chromatin that transitions gradually to less open and less transcriptionally active chromatin replicating in mid S phase. Our genomic level analysis also demonstrated that the centromere core replicates in mid S, before heavily compacted classical heterochromatin, including pericentromeres and knobs, which replicate during late S phase. PMID:28842533
Audit, Benjamin; Zaghloul, Lamia; Baker, Antoine; Arneodo, Alain; Chen, Chun-Long; d'Aubenton-Carafa, Yves; Thermes, Claude
2013-01-01
In higher eukaryotes, the absence of specific sequence motifs, marking the origins of replication has been a serious hindrance to the understanding of (i) the mechanisms that regulate the spatio-temporal replication program, and (ii) the links between origins activation, chromatin structure and transcription. In this chapter, we review the partitioning of the human genome into megabased-size replication domains delineated as N-shaped motifs in the strand compositional asymmetry profiles. They collectively span 28.3% of the genome and are bordered by more than 1,000 putative replication origins. We recapitulate the comparison of this partition of the human genome with high-resolution experimental data that confirms that replication domain borders are likely to be preferential replication initiation zones in the germline. In addition, we highlight the specific distribution of experimental and numerical chromatin marks along replication domains. Domain borders correspond to particular open chromatin regions, possibly encoded in the DNA sequence, and around which replication and transcription are highly coordinated. These regions also present a high evolutionary breakpoint density, suggesting that susceptibility to breakage might be linked to local open chromatin fiber state. Altogether, this chapter presents a compartmentalization of the human genome into replication domains that are landmarks of the human genome organization and are likely to play a key role in genome dynamics during evolution and in pathological situations.
Obermeier, Christian; Hosseini, Bashir; Friedt, Wolfgang; Snowdon, Rod
2009-01-01
Background Serial analysis of gene expression (LongSAGE) was applied for gene expression profiling in seeds of oilseed rape (Brassica napus ssp. napus). The usefulness of this technique for detailed expression profiling in a non-model organism was demonstrated for the highly complex, neither fully sequenced nor annotated genome of B. napus by applying a tag-to-gene matching strategy based on Brassica ESTs and the annotated proteome of the closely related model crucifer A. thaliana. Results Transcripts from 3,094 genes were detected at two time-points of seed development, 23 days and 35 days after pollination (DAP). Differential expression showed a shift from gene expression involved in diverse developmental processes including cell proliferation and seed coat formation at 23 DAP to more focussed metabolic processes including storage protein accumulation and lipid deposition at 35 DAP. The most abundant transcripts at 23 DAP were coding for diverse protease inhibitor proteins and proteases, including cysteine proteases involved in seed coat formation and a number of lipid transfer proteins involved in embryo pattern formation. At 35 DAP, transcripts encoding napin, cruciferin and oleosin storage proteins were most abundant. Over both time-points, 18.6% of the detected genes were matched by Brassica ESTs identified by LongSAGE tags in antisense orientation. This suggests a strong involvement of antisense transcript expression in regulatory processes during B. napus seed development. Conclusion This study underlines the potential of transcript tagging approaches for gene expression profiling in Brassica crop species via EST matching to annotated A. thaliana genes. Limits of tag detection for low-abundance transcripts can today be overcome by ultra-high throughput sequencing approaches, so that tag-based gene expression profiling may soon become the method of choice for global expression profiling in non-model species. PMID:19575793
Khan, Aziz; Fornes, Oriol; Stigliani, Arnaud; Gheorghe, Marius; Castro-Mondragon, Jaime A; van der Lee, Robin; Bessy, Adrien; Chèneby, Jeanne; Kulkarni, Shubhada R; Tan, Ge; Baranasic, Damir; Arenillas, David J; Sandelin, Albin; Vandepoele, Klaas; Lenhard, Boris; Ballester, Benoît; Wasserman, Wyeth W; Parcy, François; Mathelier, Anthony
2018-01-04
JASPAR (http://jaspar.genereg.net) is an open-access database of curated, non-redundant transcription factor (TF)-binding profiles stored as position frequency matrices (PFMs) and TF flexible models (TFFMs) for TFs across multiple species in six taxonomic groups. In the 2018 release of JASPAR, the CORE collection has been expanded with 322 new PFMs (60 for vertebrates and 262 for plants) and 33 PFMs were updated (24 for vertebrates, 8 for plants and 1 for insects). These new profiles represent a 30% expansion compared to the 2016 release. In addition, we have introduced 316 TFFMs (95 for vertebrates, 218 for plants and 3 for insects). This release incorporates clusters of similar PFMs in each taxon and each TF class per taxon. The JASPAR 2018 CORE vertebrate collection of PFMs was used to predict TF-binding sites in the human genome. The predictions are made available to the scientific community through a UCSC Genome Browser track data hub. Finally, this update comes with a new web framework with an interactive and responsive user-interface, along with new features. All the underlying data can be retrieved programmatically using a RESTful API and through the JASPAR 2018 R/Bioconductor package. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Bastarrachea, Raúl A.; Gallegos-Cabriales, Esther C.; Nava-González, Edna J.; Haack, Karin; Voruganti, V. Saroja; Charlesworth, Jac; Laviada-Molina, Hugo A.; Veloz-Garza, Rosa A.; Cardenas-Villarreal, Velia Margarita; Valdovinos-Chavez, Salvador B.; Gomez-Aguilar, Patricia; Meléndez, Guillermo; López-Alvarenga, Juan Carlos; Göring, Harald H. H.; Cole, Shelley A.; Blangero, John; Comuzzie, Anthony G.; Kent, Jack W.
2012-01-01
Whole-transcriptome expression profiling provides novel phenotypes for analysis of complex traits. Gene expression measurements reflect quantitative variation in transcript-specific messenger RNA levels and represent phenotypes lying close to the action of genes. Understanding the genetic basis of gene expression will provide insight into the processes that connect genotype to clinically significant traits representing a central tenet of system biology. Synchronous in vivo expression profiles of lymphocytes, muscle, and subcutaneous fat were obtained from healthy Mexican men. Most genes were expressed at detectable levels in multiple tissues, and RNA levels were correlated between tissue types. A subset of transcripts with high reliability of expression across tissues (estimated by intraclass correlation coefficients) was enriched for cis-regulated genes, suggesting that proximal sequence variants may influence expression similarly in different cellular environments. This integrative global gene expression profiling approach is proving extremely useful for identifying genes and pathways that contribute to complex clinical traits. Clearly, the coincidence of clinical trait quantitative trait loci and expression quantitative trait loci can help in the prioritization of positional candidate genes. Such data will be crucial for the formal integration of positional and transcriptomic information characterized as genetical genomics. PMID:22797999
Menzel, Ralph; Swain, Suresh C; Hoess, Sebastian; Claus, Evelyn; Menzel, Stefanie; Steinberg, Christian EW; Reifferscheid, Georg; Stürzenbaum, Stephen R
2009-01-01
Background Traditionally, toxicity of river sediments is assessed using whole sediment tests with benthic organisms. The challenge, however, is the differentiation between multiple effects caused by complex contaminant mixtures and the unspecific toxicity endpoints such as survival, growth or reproduction. The use of gene expression profiling facilitates the identification of transcriptional changes at the molecular level that are specific to the bio-available fraction of pollutants. Results In this pilot study, we exposed the nematode Caenorhabditis elegans to three sediments of German rivers with varying (low, medium and high) levels of heavy metal and organic contamination. Beside chemical analysis, three standard bioassays were performed: reproduction of C. elegans, genotoxicity (Comet assay) and endocrine disruption (YES test). Gene expression was profiled using a whole genome DNA-microarray approach to identify overrepresented functional gene categories and derived cellular processes. Disaccharide and glycogen metabolism were found to be affected, whereas further functional pathways, such as oxidative phosphorylation, ribosome biogenesis, metabolism of xenobiotics, aging and several developmental processes were found to be differentially regulated only in response to the most contaminated sediment. Conclusion This study demonstrates how ecotoxicogenomics can identify transcriptional responses in complex mixture scenarios to distinguish different samples of river sediments. PMID:19366437
Almstrup, Kristian; Hoei-Hansen, Christina E; Wirkner, Ute; Blake, Jonathon; Schwager, Christian; Ansorge, Wilhelm; Nielsen, John E; Skakkebaek, Niels E; Rajpert-De Meyts, Ewa; Leffers, Henrik
2004-07-15
Carcinoma in situ (CIS) is the common precursor of histologically heterogeneous testicular germ cell tumors (TGCTs), which in recent decades have markedly increased and now are the most common malignancy of young men. Using genome-wide gene expression profiling, we identified >200 genes highly expressed in testicular CIS, including many never reported in testicular neoplasms. Expression was further verified by semiquantitative reverse transcription-PCR and in situ hybridization. Among the highest expressed genes were NANOG and POU5F1, and reverse transcription-PCR revealed possible changes in their stoichiometry on progression into embryonic carcinoma. We compared the CIS expression profile with patterns reported in embryonic stem cells (ESCs), which revealed a substantial overlap that may be as high as 50%. We also demonstrated an over-representation of expressed genes in regions of 17q and 12, reported as unstable in cultured ESCs. The close similarity between CIS and ESCs explains the pluripotency of CIS. Moreover, the findings are consistent with an early prenatal origin of TGCTs and thus suggest that etiologic factors operating in utero are of primary importance for the incidence trends of TGCTs. Finally, some of the highly expressed genes identified in this study are promising candidates for new diagnostic markers for CIS and/or TGCTs.
Fornes, Oriol; Stigliani, Arnaud; Gheorghe, Marius; Castro-Mondragon, Jaime A; Bessy, Adrien; Chèneby, Jeanne; Kulkarni, Shubhada R; Tan, Ge; Baranasic, Damir; Arenillas, David J; Vandepoele, Klaas; Parcy, François
2018-01-01
Abstract JASPAR (http://jaspar.genereg.net) is an open-access database of curated, non-redundant transcription factor (TF)-binding profiles stored as position frequency matrices (PFMs) and TF flexible models (TFFMs) for TFs across multiple species in six taxonomic groups. In the 2018 release of JASPAR, the CORE collection has been expanded with 322 new PFMs (60 for vertebrates and 262 for plants) and 33 PFMs were updated (24 for vertebrates, 8 for plants and 1 for insects). These new profiles represent a 30% expansion compared to the 2016 release. In addition, we have introduced 316 TFFMs (95 for vertebrates, 218 for plants and 3 for insects). This release incorporates clusters of similar PFMs in each taxon and each TF class per taxon. The JASPAR 2018 CORE vertebrate collection of PFMs was used to predict TF-binding sites in the human genome. The predictions are made available to the scientific community through a UCSC Genome Browser track data hub. Finally, this update comes with a new web framework with an interactive and responsive user-interface, along with new features. All the underlying data can be retrieved programmatically using a RESTful API and through the JASPAR 2018 R/Bioconductor package. PMID:29140473
Zhang, Yunzeng; Barthe, Gary; Grosser, Jude W; Wang, Nian
2016-07-08
Citrus blight is a citrus tree overall decline disease and causes serious losses in the citrus industry worldwide. Although it was described more than one hundred years ago, its causal agent remains unknown and its pathophysiology is not well determined, which hampers our understanding of the disease and design of suitable disease management. In this study, we sequenced and assembled the draft genome for Swingle citrumelo, one important citrus rootstock. The draft genome is approximately 280 Mb, which covers 74 % of the estimated Swingle citrumelo genome and the average coverage is around 15X. The draft genome of Swingle citrumelo enabled us to conduct transcriptome analysis of roots of blight and healthy Swingle citrumelo using RNA-seq. The RNA-seq was reliable as evidenced by the high consistence of RNA-seq analysis and quantitative reverse transcription PCR results (R(2) = 0.966). Comparison of the gene expression profiles between blight and healthy root samples revealed the molecular mechanism underneath the characteristic blight phenotypes including decline, starch accumulation, and drought stress. The JA and ET biosynthesis and signaling pathways showed decreased transcript abundance, whereas SA-mediated defense-related genes showed increased transcript abundance in blight trees, suggesting unclassified biotrophic pathogen was involved in this disease. Overall, the Swingle citrumelo draft genome generated in this study will advance our understanding of plant biology and contribute to the citrus breeding. Transcriptome analysis of blight and healthy trees deepened our understanding of the pathophysiology of citrus blight.
Non-equivalent contributions of maternal and paternal genomes to early plant embryogenesis.
Del Toro-De León, Gerardo; García-Aguilar, Marcelina; Gillmor, C Stewart
2014-10-30
Zygotic genome activation in metazoans typically occurs several hours to a day after fertilization, and thus maternal RNAs and proteins drive early animal embryo development. In plants, despite several molecular studies of post-fertilization transcriptional activation, the timing of zygotic genome activation remains a matter of debate. For example, two recent reports that used different hybrid ecotype combinations for RNA sequence profiling of early Arabidopsis embryo transcriptomes came to divergent conclusions. One identified paternal contributions that varied by gene, but with overall maternal dominance, while the other found that the maternal and paternal genomes are transcriptionally equivalent. Here we assess paternal gene activation functionally in an isogenic background, by performing a large-scale genetic analysis of 49 EMBRYO DEFECTIVE genes and testing the ability of wild-type paternal alleles to complement phenotypes conditioned by mutant maternal alleles. Our results demonstrate that wild-type paternal alleles for nine of these genes are completely functional 2 days after pollination, with the remaining 40 genes showing partial activity beginning at 2, 3 or 5 days after pollination. Using our functional assay, we also demonstrate that different hybrid combinations exhibit significant variation in paternal allele activation, reconciling the apparently contradictory results of previous transcriptional studies. The variation in timing of gene function that we observe confirms that paternal genome activation does not occur in one early discrete step, provides large-scale functional evidence that maternal and paternal genomes make non-equivalent contributions to early plant embryogenesis, and uncovers an unexpectedly profound effect of hybrid genetic backgrounds on paternal gene activity.
Transcriptome profile of a bovine respiratory disease pathogen: Mannheimia haemolytica PHL213
2012-01-01
Background Computational methods for structural gene annotation have propelled gene discovery but face certain drawbacks with regards to prokaryotic genome annotation. Identification of transcriptional start sites, demarcating overlapping gene boundaries, and identifying regulatory elements such as small RNA are not accurate using these approaches. In this study, we re-visit the structural annotation of Mannheimia haemolytica PHL213, a bovine respiratory disease pathogen. M. haemolytica is one of the causative agents of bovine respiratory disease that results in about $3 billion annual losses to the cattle industry. We used RNA-Seq and analyzed the data using freely-available computational methods and resources. The aim was to identify previously unannotated regions of the genome using RNA-Seq based expression profile to complement the existing annotation of this pathogen. Results Using the Illumina Genome Analyzer, we generated 9,055,826 reads (average length ~76 bp) and aligned them to the reference genome using Bowtie. The transcribed regions were analyzed using SAMTOOLS and custom Perl scripts in conjunction with BLAST searches and available gene annotation information. The single nucleotide resolution map enabled the identification of 14 novel protein coding regions as well as 44 potential novel sRNA. The basal transcription profile revealed that 2,506 of the 2,837 annotated regions were expressed in vitro, at 95.25% coverage, representing all broad functional gene categories in the genome. The expression profile also helped identify 518 potential operon structures involving 1,086 co-expressed pairs. We also identified 11 proteins with mutated/alternate start codons. Conclusions The application of RNA-Seq based transcriptome profiling to structural gene annotation helped correct existing annotation errors and identify potential novel protein coding regions and sRNA. We used computational tools to predict regulatory elements such as promoters and terminators associated with the novel expressed regions for further characterization of these novel functional elements. Our study complements the existing structural annotation of Mannheimia haemolytica PHL213 based on experimental evidence. Given the role of sRNA in virulence gene regulation and stress response, potential novel sRNA described in this study can form the framework for future studies to determine the role of sRNA, if any, in M. haemolytica pathogenesis. PMID:23046475
Quantitative Profiling of Peptides from RNAs classified as non-coding
Prabakaran, Sudhakaran; Hemberg, Martin; Chauhan, Ruchi; Winter, Dominic; Tweedie-Cullen, Ry Y.; Dittrich, Christian; Hong, Elizabeth; Gunawardena, Jeremy; Steen, Hanno; Kreiman, Gabriel; Steen, Judith A.
2014-01-01
Only a small fraction of the mammalian genome codes for messenger RNAs destined to be translated into proteins, and it is generally assumed that a large portion of transcribed sequences - including introns and several classes of non-coding RNAs (ncRNAs) do not give rise to peptide products. A systematic examination of translation and physiological regulation of ncRNAs has not been conducted. Here, we use computational methods to identify the products of non-canonical translation in mouse neurons by analyzing unannotated transcripts in combination with proteomic data. This study supports the existence of non-canonical translation products from both intragenic and extragenic genomic regions, including peptides derived from anti-sense transcripts and introns. Moreover, the studied novel translation products exhibit temporal regulation similar to that of proteins known to be involved in neuronal activity processes. These observations highlight a potentially large and complex set of biologically regulated translational events from transcripts formerly thought to lack coding potential. PMID:25403355
Transcriptional profiling of Medicago truncatula meristematic root cells
Holmes, Peta; Goffard, Nicolas; Weiller, Georg F; Rolfe, Barry G; Imin, Nijat
2008-01-01
Background The root apical meristem of crop and model legume Medicago truncatula is a significantly different stem cell system to that of the widely studied model plant species Arabidopsis thaliana. In this study we used the Affymetrix Medicago GeneChip® to compare the transcriptomes of meristem and non-meristematic root to identify root meristem specific candidate genes. Results Using mRNA from root meristem and non-meristem we were able to identify 324 and 363 transcripts differentially expressed from the two regions. With bioinformatics tools developed to functionally annotate the Medicago genome array we could identify significant changes in metabolism, signalling and the differentially expression of 55 transcription factors in meristematic and non-meristematic roots. Conclusion This is the first comprehensive analysis of M. truncatula root meristem cells using this genome array. This data will facilitate the mapping of regulatory and metabolic networks involved in the open root meristem of M. truncatula and provides candidates for functional analysis. PMID:18302802
Jensen, Philip J; Fazio, Gennaro; Altman, Naomi; Praul, Craig; McNellis, Timothy W
2014-04-04
Apple tree breeding is slow and difficult due to long generation times, self-incompatibility, and complex genetics. The identification of molecular markers linked to traits of interest is a way to expedite the breeding process. In the present study, we aimed to identify genes whose steady-state transcript abundance was associated with inheritance of specific traits segregating in an apple (Malus × domestica) rootstock F1 breeding population, including resistance to powdery mildew (Podosphaera leucotricha) disease and woolly apple aphid (Eriosoma lanigerum). Transcription profiling was performed for 48 individual F1 apple trees from a cross of two highly heterozygous parents, using RNA isolated from healthy, actively-growing shoot tips and a custom apple DNA oligonucleotide microarray representing 26,000 unique transcripts. Genome-wide expression profiles were not clear indicators of powdery mildew or woolly apple aphid resistance phenotype. However, standard differential gene expression analysis between phenotypic groups of trees revealed relatively small sets of genes with trait-associated expression levels. For example, thirty genes were identified that were differentially expressed between trees resistant and susceptible to powdery mildew. Interestingly, the genes encoding twenty-four of these transcripts were physically clustered on chromosome 12. Similarly, seven genes were identified that were differentially expressed between trees resistant and susceptible to woolly apple aphid, and the genes encoding five of these transcripts were also clustered, this time on chromosome 17. In each case, the gene clusters were in the vicinity of previously identified major quantitative trait loci for the corresponding trait. Similar results were obtained for a series of molecular traits. Several of the differentially expressed genes were used to develop DNA polymorphism markers linked to powdery mildew disease and woolly apple aphid resistance. Gene expression profiling and trait-associated transcript analysis using an apple F1 population readily identified genes physically linked to powdery mildew disease resistance and woolly apple aphid resistance loci. This result was especially useful in apple, where extreme levels of heterozygosity make the development of reliable DNA markers quite difficult. The results suggest that this approach could prove effective in crops with complicated genetics, or for which few genomic information resources are available.
GDA, a web-based tool for Genomics and Drugs integrated analysis.
Caroli, Jimmy; Sorrentino, Giovanni; Forcato, Mattia; Del Sal, Giannino; Bicciato, Silvio
2018-05-25
Several major screenings of genetic profiling and drug testing in cancer cell lines proved that the integration of genomic portraits and compound activities is effective in discovering new genetic markers of drug sensitivity and clinically relevant anticancer compounds. Despite most genetic and drug response data are publicly available, the availability of user-friendly tools for their integrative analysis remains limited, thus hampering an effective exploitation of this information. Here, we present GDA, a web-based tool for Genomics and Drugs integrated Analysis that combines drug response data for >50 800 compounds with mutations and gene expression profiles across 73 cancer cell lines. Genomic and pharmacological data are integrated through a modular architecture that allows users to identify compounds active towards cancer cell lines bearing a specific genomic background and, conversely, the mutational or transcriptional status of cells responding or not-responding to a specific compound. Results are presented through intuitive graphical representations and supplemented with information obtained from public repositories. As both personalized targeted therapies and drug-repurposing are gaining increasing attention, GDA represents a resource to formulate hypotheses on the interplay between genomic traits and drug response in cancer. GDA is freely available at http://gda.unimore.it/.
Characterization of Equine Infectious Anemia Virus Integration in the Horse Genome.
Liu, Qiang; Wang, Xue-Feng; Ma, Jian; He, Xi-Jun; Wang, Xiao-Jun; Zhou, Jian-Hua
2015-06-19
Human immunodeficiency virus (HIV)-1 has a unique integration profile in the human genome relative to murine and avian retroviruses. Equine infectious anemia virus (EIAV) is another well-studied lentivirus that can also be used as a promising retro-transfection vector, but its integration into its native host has not been characterized. In this study, we mapped 477 integration sites of the EIAV strain EIAVFDDV13 in fetal equine dermal (FED) cells during in vitro infection. Published integration sites of EIAV and HIV-1 in the human genome were also analyzed as references. Our results demonstrated that EIAVFDDV13 tended to integrate into genes and AT-rich regions, and it avoided integrating into transcription start sites (TSS), which is consistent with EIAV and HIV-1 integration in the human genome. Notably, the integration of EIAVFDDV13 favored long interspersed elements (LINEs) and DNA transposons in the horse genome, whereas the integration of HIV-1 favored short interspersed elements (SINEs) in the human genome. The chromosomal environment near LINEs or DNA transposons potentially influences viral transcription and may be related to the unique EIAV latency states in equids. The data on EIAV integration in its natural host will facilitate studies on lentiviral infection and lentivirus-based therapeutic vectors.
Characterization of Equine Infectious Anemia Virus Integration in the Horse Genome
Liu, Qiang; Wang, Xue-Feng; Ma, Jian; He, Xi-Jun; Wang, Xiao-Jun; Zhou, Jian-Hua
2015-01-01
Human immunodeficiency virus (HIV)-1 has a unique integration profile in the human genome relative to murine and avian retroviruses. Equine infectious anemia virus (EIAV) is another well-studied lentivirus that can also be used as a promising retro-transfection vector, but its integration into its native host has not been characterized. In this study, we mapped 477 integration sites of the EIAV strain EIAVFDDV13 in fetal equine dermal (FED) cells during in vitro infection. Published integration sites of EIAV and HIV-1 in the human genome were also analyzed as references. Our results demonstrated that EIAVFDDV13 tended to integrate into genes and AT-rich regions, and it avoided integrating into transcription start sites (TSS), which is consistent with EIAV and HIV-1 integration in the human genome. Notably, the integration of EIAVFDDV13 favored long interspersed elements (LINEs) and DNA transposons in the horse genome, whereas the integration of HIV-1 favored short interspersed elements (SINEs) in the human genome. The chromosomal environment near LINEs or DNA transposons potentially influences viral transcription and may be related to the unique EIAV latency states in equids. The data on EIAV integration in its natural host will facilitate studies on lentiviral infection and lentivirus-based therapeutic vectors. PMID:26102582
This data set contains the transcriptional profiles of 20 dorsal skin samples from eight-week-old mice. Mice were generated by crossing FVB/N to Mus spretus mice to generate F1 mice, and then crossing F1 mice back to the FVB/N strain. 10 FVB/N mice lacking Hras1 (aka HrasKO, Hras-/-) and 10 FVB/N mice with wild-type Hras1 were generated. Read the abstract.
Paul, Sujay
2017-06-01
MicroRNAs (miRNAs) are endogenous, short (~21-nucleotide), non-coding RNA molecules that play pivotal roles in plant growth, development, and stress response signaling. In this study using recently published draft genome sequence of a high-altitude plant maca (Lepidium meyenii Walp) and applying genome-wide computational-based approaches, a total of 62 potentially conserved miRNAs belonging to 28 families were identified and four (lme-miR160a, lme-miR164c, lme-miR 166a, and lme-miR 319a) of them further validated by RT-PCR. Deploying psRNATarget tool a total of 99 potential miRNA target transcripts were also identified in maca. Targets include a number of transcription factors like Squamosa promoter-binding, NAC, MYB, auxin response factor, APETALA, WRKY, and F-box protein. To the best of my knowledge, this is the first genome-based miRNA profiling of a high-altitude plant.
Yan, Bin; Yang, Xinping; Lee, Tin-Lap; Friedman, Jay; Tang, Jun; Van Waes, Carter; Chen, Zhong
2007-01-01
Background Differentially expressed gene profiles have previously been observed among pathologically defined cancers by microarray technologies, including head and neck squamous cell carcinomas (HNSCCs). However, the molecular expression signatures and transcriptional regulatory controls that underlie the heterogeneity in HNSCCs are not well defined. Results Genome-wide cDNA microarray profiling of ten HNSCC cell lines revealed novel gene expression signatures that distinguished cancer cell subsets associated with p53 status. Three major clusters of over-expressed genes (A to C) were defined through hierarchical clustering, Gene Ontology, and statistical modeling. The promoters of genes in these clusters exhibited different patterns and prevalence of transcription factor binding sites for p53, nuclear factor-κB (NF-κB), activator protein (AP)-1, signal transducer and activator of transcription (STAT)3 and early growth response (EGR)1, as compared with the frequency in vertebrate promoters. Cluster A genes involved in chromatin structure and function exhibited enrichment for p53 and decreased AP-1 binding sites, whereas clusters B and C, containing cytokine and antiapoptotic genes, exhibited a significant increase in prevalence of NF-κB binding sites. An increase in STAT3 and EGR1 binding sites was distributed among the over-expressed clusters. Novel regulatory modules containing p53 or NF-κB concomitant with other transcription factor binding motifs were identified, and experimental data supported the predicted transcriptional regulation and binding activity. Conclusion The transcription factors p53, NF-κB, and AP-1 may be important determinants of the heterogeneous pattern of gene expression, whereas STAT3 and EGR1 may broadly enhance gene expression in HNSCCs. Defining these novel gene signatures and regulatory mechanisms will be important for establishing new molecular classifications and subtyping, which in turn will promote development of targeted therapeutics for HNSCC. PMID:17498291
Singh, Anil Kumar; Sharma, Vishal; Pal, Awadhesh Kumar; Acharya, Vishal; Ahuja, Paramvir Singh
2013-08-01
NAC [no apical meristem (NAM), Arabidopsis thaliana transcription activation factor [ATAF1/2] and cup-shaped cotyledon (CUC2)] proteins belong to one of the largest plant-specific transcription factor (TF) families and play important roles in plant development processes, response to biotic and abiotic cues and hormone signalling. Our genome-wide analysis identified 110 StNAC genes in potato encoding for 136 proteins, including 14 membrane-bound TFs. The physical map positions of StNAC genes on 12 potato chromosomes were non-random, and 40 genes were found to be distributed in 16 clusters. The StNAC proteins were phylogenetically clustered into 12 subgroups. Phylogenetic analysis of StNACs along with their Arabidopsis and rice counterparts divided these proteins into 18 subgroups. Our comparative analysis has also identified 36 putative TNAC proteins, which appear to be restricted to Solanaceae family. In silico expression analysis, using Illumina RNA-seq transcriptome data, revealed tissue-specific, biotic, abiotic stress and hormone-responsive expression profile of StNAC genes. Several StNAC genes, including StNAC072 and StNAC101that are orthologs of known stress-responsive Arabidopsis RESPONSIVE TO DEHYDRATION 26 (RD26) were identified as highly abiotic stress responsive. Quantitative real-time polymerase chain reaction analysis largely corroborated the expression profile of StNAC genes as revealed by the RNA-seq data. Taken together, this analysis indicates towards putative functions of several StNAC TFs, which will provide blue-print for their functional characterization and utilization in potato improvement.
Single-cell Genomics using Droplet-based Microfluidics
NASA Astrophysics Data System (ADS)
Basu, Anindita; Macosko, Evan; Shalek, Alex; McCarroll, Steven; Regev, Aviv; Weitz, Dave
2014-03-01
We develop a system to profile the transcriptome of mammalian cells in isolation using reverse emulsion droplet-based microfluidic techniques. This is accomplished by (a) encapsulating and lysing one cell per emulsion droplet, and (b) uniquely barcoding the RNA contents from each cell using unique DNA-barcoded microgel beads. This enables us to study the transcriptional behavior of a large number of cells at single-cell resolution. We then use these techniques to study transcriptional responses of isolated immune cells to precisely controlled chemical and pathological stimuli provided in the emulsion droplet.
High-Resolution Analysis of Coronavirus Gene Expression by RNA Sequencing and Ribosome Profiling
Jones, Joshua D.; Chung, Betty Y.-W.; Siddell, Stuart G.; Brierley, Ian
2016-01-01
Members of the family Coronaviridae have the largest genomes of all RNA viruses, typically in the region of 30 kilobases. Several coronaviruses, such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and Middle East respiratory syndrome-related coronavirus (MERS-CoV), are of medical importance, with high mortality rates and, in the case of SARS-CoV, significant pandemic potential. Other coronaviruses, such as Porcine epidemic diarrhea virus and Avian coronavirus, are important livestock pathogens. Ribosome profiling is a technique which exploits the capacity of the translating ribosome to protect around 30 nucleotides of mRNA from ribonuclease digestion. Ribosome-protected mRNA fragments are purified, subjected to deep sequencing and mapped back to the transcriptome to give a global “snap-shot” of translation. Parallel RNA sequencing allows normalization by transcript abundance. Here we apply ribosome profiling to cells infected with Murine coronavirus, mouse hepatitis virus, strain A59 (MHV-A59), a model coronavirus in the same genus as SARS-CoV and MERS-CoV. The data obtained allowed us to study the kinetics of virus transcription and translation with exquisite precision. We studied the timecourse of positive and negative-sense genomic and subgenomic viral RNA production and the relative translation efficiencies of the different virus ORFs. Virus mRNAs were not found to be translated more efficiently than host mRNAs; rather, virus translation dominates host translation at later time points due to high levels of virus transcripts. Triplet phasing of the profiling data allowed precise determination of translated reading frames and revealed several translated short open reading frames upstream of, or embedded within, known virus protein-coding regions. Ribosome pause sites were identified in the virus replicase polyprotein pp1a ORF and investigated experimentally. Contrary to expectations, ribosomes were not found to pause at the ribosomal frameshift site. To our knowledge this is the first application of ribosome profiling to an RNA virus. PMID:26919232
Whole-genome expression analysis of mammalian-wide interspersed repeat elements in human cell lines.
Carnevali, Davide; Conti, Anastasia; Pellegrini, Matteo; Dieci, Giorgio
2017-02-01
With more than 500,000 copies, mammalian-wide interspersed repeats (MIRs), a sub-group of SINEs, represent ∼2.5% of the human genome and one of the most numerous family of potential targets for the RNA polymerase (Pol) III transcription machinery. Since MIR elements ceased to amplify ∼130 myr ago, previous studies primarily focused on their genomic impact, while the issue of their expression has not been extensively addressed. We applied a dedicated bioinformatic pipeline to ENCODE RNA-Seq datasets of seven human cell lines and, for the first time, we were able to define the Pol III-driven MIR transcriptome at single-locus resolution. While the majority of Pol III-transcribed MIR elements are cell-specific, we discovered a small set of ubiquitously transcribed MIRs mapping within Pol II-transcribed genes in antisense orientation that could influence the expression of the overlapping gene. We also identified novel Pol III-transcribed ncRNAs, deriving from transcription of annotated MIR fragments flanked by unique MIR-unrelated sequences, and confirmed the role of Pol III-specific internal promoter elements in MIR transcription. Besides demonstrating widespread transcription at these retrotranspositionally inactive elements in human cells, the ability to profile MIR expression at single-locus resolution will facilitate their study in different cell types and states including pathological alterations. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Strategies for the acquisition of transcriptional and epigenetic information in single cells.
Li, Guang; Dzilic, Elda; Flores, Nick; Shieh, Alice; Wu, Sean M
2017-03-01
As the basic unit of living organisms, each single cell has unique molecular signatures and functions. Our ability to uncover the transcriptional and epigenetic signature of single cells has been hampered by the lack of tools to explore this area of research. The advent of microfluidic single cell technology along with single cell genome-wide DNA amplification methods had greatly improved our understanding of the expression variation in single cells. Transcriptional expression profile by multiplex qPCR or genome-wide RNA sequencing has enabled us to examine genes expression in single cells in different tissues. With the new tools, the identification of new cellular heterogeneity, novel marker genes, unique subpopulations, and spatial locations of each single cell can be acquired successfully. Epigenetic modifications for each single cell can also be obtained via similar methods. Based on single cell genome sequencing, single cell epigenetic information including histone modifications, DNA methylation, and chromatin accessibility have been explored and provided valuable insights regarding gene regulation and disease prognosis. In this article, we review the development of strategies to obtain single cell transcriptional and epigenetic data. Furthermore, we discuss ways in which single cell studies may help to provide greater understanding of the mechanisms of basic cardiovascular biology that will eventually lead to improvement in our ability to diagnose disease and develop new therapies.
Resolving Heart Regeneration by Replacement Histone Profiling.
Goldman, Joseph Aaron; Kuzu, Guray; Lee, Nutishia; Karasik, Jaclyn; Gemberling, Matthew; Foglia, Matthew J; Karra, Ravi; Dickson, Amy L; Sun, Fei; Tolstorukov, Michael Y; Poss, Kenneth D
2017-02-27
Chromatin regulation is a principal mechanism governing animal development, yet it is unclear to what extent structural changes in chromatin underlie tissue regeneration. Non-mammalian vertebrates such as zebrafish activate cardiomyocyte (CM) division after tissue damage to regenerate lost heart muscle. Here, we generated transgenic zebrafish expressing a biotinylatable H3.3 histone variant in CMs and derived cell-type-specific profiles of histone replacement. We identified an emerging program of putative enhancers that revise H3.3 occupancy during regeneration, overlaid upon a genome-wide reduction of H3.3 from promoters. In transgenic reporter lines, H3.3-enriched elements directed gene expression in subpopulations of CMs. Other elements increased H3.3 enrichment and displayed enhancer activity in settings of injury- and/or Neuregulin1-elicited CM proliferation. Dozens of consensus sequence motifs containing predicted transcription factor binding sites were enriched in genomic regions with regeneration-responsive H3.3 occupancy. Thus, cell-type-specific regulatory programs of tissue regeneration can be revealed by genome-wide H3.3 profiling. Copyright © 2017 Elsevier Inc. All rights reserved.
Haralambieva, Iana H.; Oberg, Ann L.; Ovsyannikova, Inna G.; Kennedy, Richard B.; Grill, Diane E.; Middha, Sumit; Bot, Brian M.; Wang, Vivian W.; Smith, David I.; Jacobson, Robert M.; Poland, Gregory A.
2013-01-01
Immune responses to current rubella vaccines demonstrate significant inter-individual variability. We performed mRNA-Seq profiling on PBMCs from high and low antibody responders to rubella vaccination to delineate transcriptional differences upon viral stimulation. Generalized linear models were used to assess the per gene fold change (FC) for stimulated versus unstimulated samples or the interaction between outcome and stimulation. Model results were evaluated by both FC and p-value. Pathway analysis and self-contained gene set tests were performed for assessment of gene group effects. Of 17,566 detected genes, we identified 1,080 highly significant differentially expressed genes upon viral stimulation (p<1.00E−15, FDR<1.00E−14), including various immune function and inflammation-related genes, genes involved in cell signaling, cell regulation and transcription, and genes with unknown function. Analysis by immune outcome and stimulation status identified 27 genes (p≤0.0006 and FDR≤0.30) that responded differently to viral stimulation in high vs. low antibody responders, including major histocompatibility complex (MHC) class I genes (HLA-A, HLA-B and B2M with p = 0.0001, p = 0.0005 and p = 0.0002, respectively), and two genes related to innate immunity and inflammation (EMR3 and MEFV with p = 1.46E−08 and p = 0.0004, respectively). Pathway and gene set analysis also revealed transcriptional differences in antigen presentation and innate/inflammatory gene sets and pathways between high and low responders. Using mRNA-Seq genome-wide transcriptional profiling, we identified antigen presentation and innate/inflammatory genes that may assist in explaining rubella vaccine-induced immune response variations. Such information may provide new scientific insights into vaccine-induced immunity useful in rational vaccine development and immune response monitoring. PMID:23658707
Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui; ...
2014-10-02
Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui
Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less
The Long Noncoding RNA Transcriptome of Dictyostelium discoideum Development.
Rosengarten, Rafael D; Santhanam, Balaji; Kokosar, Janez; Shaulsky, Gad
2017-02-09
Dictyostelium discoideum live in the soil as single cells, engulfing bacteria and growing vegetatively. Upon starvation, tens of thousands of amoebae enter a developmental program that includes aggregation, multicellular differentiation, and sporulation. Major shifts across the protein-coding transcriptome accompany these developmental changes. However, no study has presented a global survey of long noncoding RNAs (ncRNAs) in D. discoideum To characterize the antisense and long intergenic noncoding RNA (lncRNA) transcriptome, we analyzed previously published developmental time course samples using an RNA-sequencing (RNA-seq) library preparation method that selectively depletes ribosomal RNAs (rRNAs). We detected the accumulation of transcripts for 9833 protein-coding messenger RNAs (mRNAs), 621 lncRNAs, and 162 putative antisense RNAs (asRNAs). The noncoding RNAs were interspersed throughout the genome, and were distinct in expression level, length, and nucleotide composition. The noncoding transcriptome displayed a temporal profile similar to the coding transcriptome, with stages of gradual change interspersed with larger leaps. The transcription profiles of some noncoding RNAs were strongly correlated with known differentially expressed coding RNAs, hinting at a functional role for these molecules during development. Examining the mitochondrial transcriptome, we modeled two novel antisense transcripts. We applied yet another ribosomal depletion method to a subset of the samples to better retain transfer RNA (tRNA) transcripts. We observed polymorphisms in tRNA anticodons that suggested a post-transcriptional means by which D. discoideum compensates for codons missing in the genomic complement of tRNAs. We concluded that the prevalence and characteristics of long ncRNAs indicate that these molecules are relevant to the progression of molecular and cellular phenotypes during development. Copyright © 2017 Rosengarten et al.
Schwartz, Thomas; Armant, Olivier; Bretschneider, Nancy; Hahn, Alexander; Kirchen, Silke; Seifert, Martin; Dötsch, Andreas
2015-01-01
The fitness of sensitive and resistant Pseudomonas aeruginosa in different aquatic environments depends on genetic capacities and transcriptional regulation. Therefore, an antibiotic-sensitive isolate PA30 and a multi-resistant isolate PA49 originating from waste waters were compared via whole genome and transcriptome Illumina sequencing after exposure to municipal waste water and tap water. A number of different genomic islands (e.g. PAGIs, PAPIs) were identified in the two environmental isolates beside the highly conserved core genome. Exposure to tap water and waste water exhibited similar transcriptional impacts on several gene clusters (antibiotic and metal resistance, genetic mobile elements, efflux pumps) in both environmental P. aeruginosa isolates. The MexCD-OprJ efflux pump was overexpressed in PA49 in response to waste water. The expression of resistance genes, genetic mobile elements in PA49 was independent from the water matrix. Consistently, the antibiotic sensitive strain PA30 did not show any difference in expression of the intrinsic resistance determinants and genetic mobile elements. Thus, the exposure of both isolates to polluted waste water and oligotrophic tap water resulted in similar expression profiles of mentioned genes. However, changes in environmental milieus resulted in rather unspecific transcriptional responses than selected and stimuli-specific gene regulation. © 2014 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Best practices for mapping replication origins in eukaryotic chromosomes.
Besnard, Emilie; Desprat, Romain; Ryan, Michael; Kahli, Malik; Aladjem, Mirit I; Lemaitre, Jean-Marc
2014-09-02
Understanding the regulatory principles ensuring complete DNA replication in each cell division is critical for deciphering the mechanisms that maintain genomic stability. Recent advances in genome sequencing technology facilitated complete mapping of DNA replication sites and helped move the field from observing replication patterns at a handful of single loci to analyzing replication patterns genome-wide. These advances address issues, such as the relationship between replication initiation events, transcription, and chromatin modifications, and identify potential replication origin consensus sequences. This unit summarizes the technological and fundamental aspects of replication profiling and briefly discusses novel insights emerging from mining large datasets, published in the last 3 years, and also describes DNA replication dynamics on a whole-genome scale. Copyright © 2014 John Wiley & Sons, Inc.
Schadt, Eric E; Edwards, Stephen W; GuhaThakurta, Debraj; Holder, Dan; Ying, Lisa; Svetnik, Vladimir; Leonardson, Amy; Hart, Kyle W; Russell, Archie; Li, Guoya; Cavet, Guy; Castle, John; McDonagh, Paul; Kan, Zhengyan; Chen, Ronghua; Kasarskis, Andrew; Margarint, Mihai; Caceres, Ramon M; Johnson, Jason M; Armour, Christopher D; Garrett-Engele, Philip W; Tsinoremas, Nicholas F; Shoemaker, Daniel D
2004-01-01
Background Computational and microarray-based experimental approaches were used to generate a comprehensive transcript index for the human genome. Oligonucleotide probes designed from approximately 50,000 known and predicted transcript sequences from the human genome were used to survey transcription from a diverse set of 60 tissues and cell lines using ink-jet microarrays. Further, expression activity over at least six conditions was more generally assessed using genomic tiling arrays consisting of probes tiled through a repeat-masked version of the genomic sequence making up chromosomes 20 and 22. Results The combination of microarray data with extensive genome annotations resulted in a set of 28,456 experimentally supported transcripts. This set of high-confidence transcripts represents the first experimentally driven annotation of the human genome. In addition, the results from genomic tiling suggest that a large amount of transcription exists outside of annotated regions of the genome and serves as an example of how this activity could be measured on a genome-wide scale. Conclusions These data represent one of the most comprehensive assessments of transcriptional activity in the human genome and provide an atlas of human gene expression over a unique set of gene predictions. Before the annotation of the human genome is considered complete, however, the previously unannotated transcriptional activity throughout the genome must be fully characterized. PMID:15461792
Xin, Haiping; Zhu, Wei; Wang, Lina; Xiang, Yue; Fang, Linchuan; Li, Jitao; Sun, Xiaoming; Wang, Nian; Londo, Jason P.; Li, Shaohua
2013-01-01
Grape is one of the most important fruit crops worldwide. The suitable geographical locations and productivity of grapes are largely limited by temperature. Vitis amurensis is a wild grapevine species with remarkable cold-tolerance, exceeding that of Vitis vinifera, the dominant cultivated species of grapevine. However, the molecular mechanisms that contribute to the enhanced freezing tolerance of V. amurensis remain unknown. Here we used deep sequencing data from restriction endonuclease-generated cDNA fragments to evaluate the whole genome wide modification of transcriptome of V. amurensis under cold treatment. Vitis vinifera cv. Muscat of Hamburg was used as control to help investigate the distinctive features of V. amruensis in responding to cold stress. Approximately 9 million tags were sequenced from non-cold treatment (NCT) and cold treatment (CT) cDNA libraries in each species of grapevine sampled from shoot apices. Alignment of tags into V. vinifera cv. Pinot noir (PN40024) annotated genome identified over 15,000 transcripts in each library in V. amruensis and more than 16,000 in Muscat of Hamburg. Comparative analysis between NCT and CT libraries indicate that V. amurensis has fewer differential expressed genes (DEGs, 1314 transcripts) than Muscat of Hamburg (2307 transcripts) when exposed to cold stress. Common DEGs (408 transcripts) suggest that some genes provide fundamental roles during cold stress in grapes. The most robust DEGs (more than 20-fold change) also demonstrated significant differences between two kinds of grapevine, indicating that cold stress may trigger species specific pathways in V. amurensis. Functional categories of DEGs indicated that the proportion of up-regulated transcripts related to metabolism, transport, signal transduction and transcription were more abundant in V. amurensis. Several highly expressed transcripts that were found uniquely accumulated in V. amurensis are discussed in detail. This subset of unique candidate transcripts may contribute to the excellent cold-hardiness of V. amurensis. PMID:23516547
van den Brink, Joost; Daran-Lapujade, Pascale; Pronk, Jack T; de Winde, Johannes H
2008-01-01
Background The capacity of respiring cultures of Saccharomyces cerevisiae to immediately switch to fast alcoholic fermentation upon a transfer to anaerobic sugar-excess conditions is a key characteristic of Saccharomyces cerevisiae in many of its industrial applications. This transition was studied by exposing aerobic glucose-limited chemostat cultures grown at a low specific growth rate to two simultaneous perturbations: oxygen depletion and relief of glucose limitation. Results The shift towards fully fermentative conditions caused a massive transcriptional reprogramming, where one third of all genes within the genome were transcribed differentially. The changes in transcript levels were mostly driven by relief from glucose-limitation. After an initial strong response to the addition of glucose, the expression profile of most transcriptionally regulated genes displayed a clear switch at 30 minutes. In this respect, a striking difference was observed between the transcript profiles of genes encoding ribosomal proteins and those encoding ribosomal biogenesis components. Not all regulated genes responded with this binary profile. A group of 87 genes showed a delayed and steady increase in expression that specifically responded to anaerobiosis. Conclusion Our study demonstrated that, despite the complexity of this multiple-input perturbation, the transcriptional responses could be categorized and biologically interpreted. By comparing this study with public datasets representing dynamic and steady conditions, 14 up-regulated and 11 down-regulated genes were determined to be anaerobic specific. Therefore, these can be seen as true "signature" transcripts for anaerobicity under dynamic as well as under steady state conditions. PMID:18304306
Genomic Signal Processing: Predicting Basic Molecular Biological Principles
NASA Astrophysics Data System (ADS)
Alter, Orly
2005-03-01
Advances in high-throughput technologies enable acquisition of different types of molecular biological data, monitoring the flow of biological information as DNA is transcribed to RNA, and RNA is translated to proteins, on a genomic scale. Future discovery in biology and medicine will come from the mathematical modeling of these data, which hold the key to fundamental understanding of life on the molecular level, as well as answers to questions regarding diagnosis, treatment and drug development. Recently we described data-driven models for genome-scale molecular biological data, which use singular value decomposition (SVD) and the comparative generalized SVD (GSVD). Now we describe an integrative data-driven model, which uses pseudoinverse projection (1). We also demonstrate the predictive power of these matrix algebra models (2). The integrative pseudoinverse projection model formulates any number of genome-scale molecular biological data sets in terms of one chosen set of data samples, or of profiles extracted mathematically from data samples, designated the ``basis'' set. The mathematical variables of this integrative model, the pseudoinverse correlation patterns that are uncovered in the data, represent independent processes and corresponding cellular states (such as observed genome-wide effects of known regulators or transcription factors, the biological components of the cellular machinery that generate the genomic signals, and measured samples in which these regulators or transcription factors are over- or underactive). Reconstruction of the data in the basis simulates experimental observation of only the cellular states manifest in the data that correspond to those of the basis. Classification of the data samples according to their reconstruction in the basis, rather than their overall measured profiles, maps the cellular states of the data onto those of the basis, and gives a global picture of the correlations and possibly also causal coordination of these two sets of states. Mapping genome-scale protein binding data using pseudoinverse projection onto patterns of RNA expression data that had been extracted by SVD and GSVD, a novel correlation between DNA replication initiation and RNA transcription during the cell cycle in yeast, that might be due to a previously unknown mechanism of regulation, is predicted. (1) Alter & Golub, Proc. Natl. Acad. Sci. USA 101, 16577 (2004). (2) Alter, Golub, Brown & Botstein, Miami Nat. Biotechnol. Winter Symp. 2004 (www.med.miami.edu/mnbws/alter-.pdf)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Castanera, Raul; Lopez-Varas, Leticia; Borgognone, Alessandra
Transposable elements (TEs) are exceptional contributors to eukaryotic genome diversity. Their ubiquitous presence impacts the genomes of nearly all species and mediates genome evolution by causing mutations and chromosomal rearrangements and by modulating gene expression. We performed an exhaustive analysis of the TE content in 18 fungal genomes, including strains of the same species and species of the same genera. Our results depicted a scenario of exceptional variability, with species having 0.02 to 29.8% of their genome consisting of transposable elements. A detailed analysis performed on two strains of Pleurotus ostreatus uncovered a genome that is populated mainly by Classmore » I elements, especially LTR-retrotransposons amplified in recent bursts from 0 to 2 million years (My) ago. The preferential accumulation of TEs in clusters led to the presence of genomic regions that lacked intra- and inter-specific conservation. In addition, we investigated the effect of TE insertions on the expression of their nearby upstream and downstream genes. Our results showed that an important number of genes under TE influence are significantly repressed, with stronger repression when genes are localized within transposon clusters. Our transcriptional analysis performed in four additional fungal models revealed that this TE-mediated silencing was present only in species with active cytosine methylation machinery. We hypothesize that this phenomenon is related to epigenetic defense mechanisms that are aimed to suppress TE expression and control their proliferation.« less
Castanera, Raul; Lopez-Varas, Leticia; Borgognone, Alessandra; ...
2016-06-13
Transposable elements (TEs) are exceptional contributors to eukaryotic genome diversity. Their ubiquitous presence impacts the genomes of nearly all species and mediates genome evolution by causing mutations and chromosomal rearrangements and by modulating gene expression. We performed an exhaustive analysis of the TE content in 18 fungal genomes, including strains of the same species and species of the same genera. Our results depicted a scenario of exceptional variability, with species having 0.02 to 29.8% of their genome consisting of transposable elements. A detailed analysis performed on two strains of Pleurotus ostreatus uncovered a genome that is populated mainly by Classmore » I elements, especially LTR-retrotransposons amplified in recent bursts from 0 to 2 million years (My) ago. The preferential accumulation of TEs in clusters led to the presence of genomic regions that lacked intra- and inter-specific conservation. In addition, we investigated the effect of TE insertions on the expression of their nearby upstream and downstream genes. Our results showed that an important number of genes under TE influence are significantly repressed, with stronger repression when genes are localized within transposon clusters. Our transcriptional analysis performed in four additional fungal models revealed that this TE-mediated silencing was present only in species with active cytosine methylation machinery. We hypothesize that this phenomenon is related to epigenetic defense mechanisms that are aimed to suppress TE expression and control their proliferation.« less
Transposon identification using profile HMMs
2010-01-01
Background Transposons are "jumping genes" that account for large quantities of repetitive content in genomes. They are known to affect transcriptional regulation in several different ways, and are implicated in many human diseases. Transposons are related to microRNAs and viruses, and many genes, pseudogenes, and gene promoters are derived from transposons or have origins in transposon-induced duplication. Modeling transposon-derived genomic content is difficult because they are poorly conserved. Profile hidden Markov models (profile HMMs), widely used for protein sequence family modeling, are rarely used for modeling DNA sequence families. The algorithm commonly used to estimate the parameters of profile HMMs, Baum-Welch, is prone to prematurely converge to local optima. The DNA domain is especially problematic for the Baum-Welch algorithm, since it has only four letters as opposed to the twenty residues of the amino acid alphabet. Results We demonstrate with a simulation study and with an application to modeling the MIR family of transposons that two recently introduced methods, Conditional Baum-Welch and Dynamic Model Surgery, achieve better estimates of the parameters of profile HMMs across a range of conditions. Conclusions We argue that these new algorithms expand the range of potential applications of profile HMMs to many important DNA sequence family modeling problems, including that of searching for and modeling the virus-like transposons that are found in all known genomes. PMID:20158867
The rubber tree genome shows expansion of gene family associated with rubber biosynthesis
Lau, Nyok-Sean; Makita, Yuko; Kawashima, Mika; Taylor, Todd D.; Kondo, Shinji; Othman, Ahmad Sofiman; Shu-Chien, Alexander Chong; Matsui, Minami
2016-01-01
Hevea brasiliensis Muell. Arg, a member of the family Euphorbiaceae, is the sole natural resource exploited for commercial production of high-quality natural rubber. The properties of natural rubber latex are almost irreplaceable by synthetic counterparts for many industrial applications. A paucity of knowledge on the molecular mechanisms of rubber biosynthesis in high yield traits still persists. Here we report the comprehensive genome-wide analysis of the widely planted H. brasiliensis clone, RRIM 600. The genome was assembled based on ~155-fold combined coverage with Illumina and PacBio sequence data and has a total length of 1.55 Gb with 72.5% comprising repetitive DNA sequences. A total of 84,440 high-confidence protein-coding genes were predicted. Comparative genomic analysis revealed strong synteny between H. brasiliensis and other Euphorbiaceae genomes. Our data suggest that H. brasiliensis’s capacity to produce high levels of latex can be attributed to the expansion of rubber biosynthesis-related genes in its genome and the high expression of these genes in latex. Using cap analysis gene expression data, we illustrate the tissue-specific transcription profiles of rubber biosynthesis-related genes, revealing alternative means of transcriptional regulation. Our study adds to the understanding of H. brasiliensis biology and provides valuable genomic resources for future agronomic-related improvement of the rubber tree. PMID:27339202
2010-01-01
Background Cutaneous mycoses are common human infections among healthy and immunocompromised hosts, and the anthropophilic fungus Trichophyton rubrum is the most prevalent microorganism isolated from such clinical cases worldwide. The aim of this study was to determine the transcriptional profile of T. rubrum exposed to various stimuli in order to obtain insights into the responses of this pathogen to different environmental challenges. Therefore, we generated an expressed sequence tag (EST) collection by constructing one cDNA library and nine suppression subtractive hybridization libraries. Results The 1388 unigenes identified in this study were functionally classified based on the Munich Information Center for Protein Sequences (MIPS) categories. The identified proteins were involved in transcriptional regulation, cellular defense and stress, protein degradation, signaling, transport, and secretion, among other functions. Analysis of these unigenes revealed 575 T. rubrum sequences that had not been previously deposited in public databases. Conclusion In this study, we identified novel T. rubrum genes that will be useful for ORF prediction in genome sequencing and facilitating functional genome analysis. Annotation of these expressed genes revealed metabolic adaptations of T. rubrum to carbon sources, ambient pH shifts, and various antifungal drugs used in medical practice. Furthermore, challenging T. rubrum with cytotoxic drugs and ambient pH shifts extended our understanding of the molecular events possibly involved in the infectious process and resistance to antifungal drugs. PMID:20144196
Minas, Giorgos; Momiji, Hiroshi; Jenkins, Dafyd J; Costa, Maria J; Rand, David A; Finkenstädt, Bärbel
2017-06-26
Given the development of high-throughput experimental techniques, an increasing number of whole genome transcription profiling time series data sets, with good temporal resolution, are becoming available to researchers. The ReTrOS toolbox (Reconstructing Transcription Open Software) provides MATLAB-based implementations of two related methods, namely ReTrOS-Smooth and ReTrOS-Switch, for reconstructing the temporal transcriptional activity profile of a gene from given mRNA expression time series or protein reporter time series. The methods are based on fitting a differential equation model incorporating the processes of transcription, translation and degradation. The toolbox provides a framework for model fitting along with statistical analyses of the model with a graphical interface and model visualisation. We highlight several applications of the toolbox, including the reconstruction of the temporal cascade of transcriptional activity inferred from mRNA expression data and protein reporter data in the core circadian clock in Arabidopsis thaliana, and how such reconstructed transcription profiles can be used to study the effects of different cell lines and conditions. The ReTrOS toolbox allows users to analyse gene and/or protein expression time series where, with appropriate formulation of prior information about a minimum of kinetic parameters, in particular rates of degradation, users are able to infer timings of changes in transcriptional activity. Data from any organism and obtained from a range of technologies can be used as input due to the flexible and generic nature of the model and implementation. The output from this software provides a useful analysis of time series data and can be incorporated into further modelling approaches or in hypothesis generation.
QuickMap: a public tool for large-scale gene therapy vector insertion site mapping and analysis.
Appelt, J-U; Giordano, F A; Ecker, M; Roeder, I; Grund, N; Hotz-Wagenblatt, A; Opelz, G; Zeller, W J; Allgayer, H; Fruehauf, S; Laufs, S
2009-07-01
Several events of insertional mutagenesis in pre-clinical and clinical gene therapy studies have created intense interest in assessing the genomic insertion profiles of gene therapy vectors. For the construction of such profiles, vector-flanking sequences detected by inverse PCR, linear amplification-mediated-PCR or ligation-mediated-PCR need to be mapped to the host cell's genome and compared to a reference set. Although remarkable progress has been achieved in mapping gene therapy vector insertion sites, public reference sets are lacking, as are the possibilities to quickly detect non-random patterns in experimental data. We developed a tool termed QuickMap, which uniformly maps and analyzes human and murine vector-flanking sequences within seconds (available at www.gtsg.org). Besides information about hits in chromosomes and fragile sites, QuickMap automatically determines insertion frequencies in +/- 250 kb adjacency to genes, cancer genes, pseudogenes, transcription factor and (post-transcriptional) miRNA binding sites, CpG islands and repetitive elements (short interspersed nuclear elements (SINE), long interspersed nuclear elements (LINE), Type II elements and LTR elements). Additionally, all experimental frequencies are compared with the data obtained from a reference set, containing 1 000 000 random integrations ('random set'). Thus, for the first time a tool allowing high-throughput profiling of gene therapy vector insertion sites is available. It provides a basis for large-scale insertion site analyses, which is now urgently needed to discover novel gene therapy vectors with 'safe' insertion profiles.
Davey, Mark W; Graham, Neil S; Vanholme, Bartel; Swennen, Rony; May, Sean T; Keulemans, Johan
2009-01-01
Background 'Systems-wide' approaches such as microarray RNA-profiling are ideally suited to the study of the complex overlapping responses of plants to biotic and abiotic stresses. However, commercial microarrays are only available for a limited number of plant species and development costs are so substantial as to be prohibitive for most research groups. Here we evaluate the use of cross-hybridisation to Affymetrix oligonucleotide GeneChip® microarrays to profile the response of the banana (Musa spp.) leaf transcriptome to drought stress using a genomic DNA (gDNA)-based probe-selection strategy to improve the efficiency of detection of differentially expressed Musa transcripts. Results Following cross-hybridisation of Musa gDNA to the Rice GeneChip® Genome Array, ~33,700 gene-specific probe-sets had a sufficiently high degree of homology to be retained for transcriptomic analyses. In a proof-of-concept approach, pooled RNA representing a single biological replicate of control and drought stressed leaves of the Musa cultivar 'Cachaco' were hybridised to the Affymetrix Rice Genome Array. A total of 2,910 Musa gene homologues with a >2-fold difference in expression levels were subsequently identified. These drought-responsive transcripts included many functional classes associated with plant biotic and abiotic stress responses, as well as a range of regulatory genes known to be involved in coordinating abiotic stress responses. This latter group included members of the ERF, DREB, MYB, bZIP and bHLH transcription factor families. Fifty-two of these drought-sensitive Musa transcripts were homologous to genes underlying QTLs for drought and cold tolerance in rice, including in 2 instances QTLs associated with a single underlying gene. The list of drought-responsive transcripts also included genes identified in publicly-available comparative transcriptomics experiments. Conclusion Our results demonstrate that despite the general paucity of nucleotide sequence data in Musa and only distant phylogenetic relations to rice, gDNA probe-based cross-hybridisation to the Rice GeneChip® is a highly promising strategy to study complex biological responses and illustrates the potential of such strategies for gene discovery in non-model species. PMID:19758430
Sex-specific hippocampal 5-hydroxymethylcytosine is disrupted in response to acute stress
Papale, Ligia A.; Li, Sisi; Madrid, Andy; Zhang, Qi; Chen, Li; Chopra, Pankaj; Jin, Peng; Keleş, Sündüz; Alisch, Reid S.
2016-01-01
Environmental stress is among the most important contributors to increased susceptibility to develop psychiatric disorders. While it is well known that acute environmental stress alters gene expression, the molecular mechanisms underlying these changes remain largely unknown. 5-hydroxymethylcytosine (5hmC) is a novel environmentally sensitive epigenetic modification that is highly enriched in neurons and is associated with active neuronal transcription. Recently, we reported a genome-wide disruption of hippocampal 5hmC in male mice following acute stress that was correlated to altered transcript levels of genes in known stress related pathways. Since sex-specific endocrine mechanisms respond to environmental stimulus by altering the neuronal epigenome, we examined the genome-wide profile of hippocampal 5hmC in female mice following exposure to acute stress and identified 363 differentially hydroxymethylated regions (DhMRs) linked to known (e.g., Nr3c1 and Ntrk2) and potentially novel genes associated with stress response and psychiatric disorders. Integration of hippocampal expression data from the same female mice found stress-related hydroxymethylation correlated to altered transcript levels. Finally, characterization of stress-induced sex-specific 5hmC profiles in the hippocampus revealed 778 sex-specific acute stress-induced DhMRs some of which were correlated to altered transcript levels that produce sex-specific isoforms in response to stress. Together, the alterations in 5hmC presented here provide a possible molecular mechanism for the adaptive sex-specific response to stress that may augment the design of novel therapeutic agents that will have optimal effectiveness in each sex. PMID:27576189
Lalli, Matthew A; Jang, Jiwon; Park, Joo-Hye C; Wang, Yidi; Guzman, Elmer; Zhou, Hongjun; Audouard, Morgane; Bridges, Daniel; Tovar, Kenneth R; Papuc, Sorina M; Tutulan-Cunita, Andreea C; Huang, Yadong; Budisteanu, Magdalena; Arghir, Aurora; Kosik, Kenneth S
2016-04-01
Williams syndrome (WS) is a neurodevelopmental disorder caused by a genomic deletion of ∼28 genes that results in a cognitive and behavioral profile marked by overall intellectual impairment with relative strength in expressive language and hypersocial behavior. Advancements in protocols for neuron differentiation from induced pluripotent stem cells allowed us to elucidate the molecular circuitry underpinning the ontogeny of WS. In patient-derived stem cells and neurons, we determined the expression profile of the Williams-Beuren syndrome critical region-deleted genes and the genome-wide transcriptional consequences of the hemizygous genomic microdeletion at chromosome 7q11.23. Derived neurons displayed disease-relevant hallmarks and indicated novel aberrant pathways in WS neurons including over-activated Wnt signaling accompanying an incomplete neurogenic commitment. We show that haploinsufficiency of the ATP-dependent chromatin remodeler, BAZ1B, which is deleted in WS, significantly contributes to this differentiation defect. Chromatin-immunoprecipitation (ChIP-seq) revealed BAZ1B target gene functions are enriched for neurogenesis, neuron differentiation and disease-relevant phenotypes. BAZ1B haploinsufficiency caused widespread gene expression changes in neural progenitor cells, and together with BAZ1B ChIP-seq target genes, explained 42% of the transcriptional dysregulation in WS neurons. BAZ1B contributes to regulating the balance between neural precursor self-renewal and differentiation and the differentiation defect caused by BAZ1B haploinsufficiency can be rescued by mitigating over-active Wnt signaling in neural stem cells. Altogether, these results reveal a pivotal role for BAZ1B in neurodevelopment and implicate its haploinsufficiency as a likely contributor to the neurological phenotypes in WS. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Genome-wide expression analyses of the stationary phase model of ageing in yeast.
Wanichthanarak, Kwanjeera; Wongtosrad, Nutvadee; Petranovic, Dina
2015-07-01
Ageing processes involved in replicative lifespan (RLS) and chronological lifespan (CLS) have been found to be conserved among many organisms, including in unicellular Eukarya such as yeast Saccharomyces cerevisiae. Here we performed an integrated approach of genome wide expression profiles of yeast at different time points, during growth and starvation. The aim of the study was to identify transcriptional changes in those conditions by using several different computational analyses in order to propose transcription factors, biological networks and metabolic pathways that seem to be relevant during the process of chronological ageing in yeast. Specifically, we performed differential gene expression analysis, gene-set enrichment analysis and network-based analysis, and we identified pathways affected in the stationary phase and specific transcription factors driving transcriptional adaptations. The results indicate signal propagation from G protein-coupled receptors through signaling pathway components and other stress and nutrient-induced transcription factors resulting in adaptation of yeast cells to the lack of nutrients by activating metabolism associated with aerobic metabolism of carbon sources such as ethanol, glycerol and fatty acids. In addition, we found STE12, XBP1 and TOS8 as highly connected nodes in the subnetworks of ageing yeast. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Identification of functional elements and regulatory circuits by Drosophila modENCODE
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roy, Sushmita; Ernst, Jason; Kharchenko, Peter V.
2010-12-22
To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- andmore » tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation. Several years after the complete genetic sequencing of many species, it is still unclear how to translate genomic information into a functional map of cellular and developmental programs. The Encyclopedia of DNA Elements (ENCODE) (1) and model organism ENCODE (modENCODE) (2) projects use diverse genomic assays to comprehensively annotate the Homo sapiens (human), Drosophila melanogaster (fruit fly), and Caenorhabditis elegans (worm) genomes, through systematic generation and computational integration of functional genomic data sets. Previous genomic studies in flies have made seminal contributions to our understanding of basic biological mechanisms and genome functions, facilitated by genetic, experimental, computational, and manual annotation of the euchromatic and heterochromatic genome (3), small genome size, short life cycle, and a deep knowledge of development, gene function, and chromosome biology. The functions of {approx}40% of the protein and nonprotein-coding genes [FlyBase 5.12 (4)] have been determined from cDNA collections (5, 6), manual curation of gene models (7), gene mutations and comprehensive genome-wide RNA interference screens (8-10), and comparative genomic analyses (11, 12). The Drosophila modENCODE project has generated more than 700 data sets that profile transcripts, histone modifications and physical nucleosome properties, general and specific transcription factors (TFs), and replication programs in cell lines, isolated tissues, and whole organisms across several developmental stages (Fig. 1). Here, we computationally integrate these data sets and report (i) improved and additional genome annotations, including full-length proteincoding genes and peptides as short as 21 amino acids; (ii) noncoding transcripts, including 132 candidate structural RNAs and 1608 nonstructural transcripts; (iii) additional Argonaute (Ago)-associated small RNA genes and pathways, including new microRNAs (miRNAs) encoded within protein-coding exons and endogenous small interfering RNAs (siRNAs) from 3-inch untranslated regions; (iv) chromatin 'states' defined by combinatorial patterns of 18 chromatin marks that are associated with distinct functions and properties; (v) regions of high TF occupancy and replication activity with likely epigenetic regulation; (vi)mixed TF and miRNA regulatory networks with hierarchical structure and enriched feed-forward loops; (vii) coexpression- and co-regulation-based functional annotations for nearly 3000 genes; (viii) stage- and tissue-specific regulators; and (ix) predictive models of gene expression levels and regulator function.« less
Malinowski, Douglas P
2007-05-01
In recent years, the application of genomic and proteomic technologies to the problem of breast cancer prognosis and the prediction of therapy response have begun to yield encouraging results. Independent studies employing transcriptional profiling of primary breast cancer specimens using DNA microarrays have identified gene expression profiles that correlate with clinical outcome in primary breast biopsy specimens. Recent advances in microarray technology have demonstrated reproducibility, making clinical applications more achievable. In this regard, one such DNA microarray device based upon a 70-gene expression signature was recently cleared by the US FDA for application to breast cancer prognosis. These DNA microarrays often employ at least 70 gene targets for transcriptional profiling and prognostic assessment in breast cancer. The use of PCR-based methods utilizing a small subset of genes has recently demonstrated the ability to predict the clinical outcome in early-stage breast cancer. Furthermore, protein-based immunohistochemistry methods have progressed from using gene clusters and gene expression profiling to smaller subsets of expressed proteins to predict prognosis in early-stage breast cancer. Beyond prognostic applications, DNA microarray-based transcriptional profiling has demonstrated the ability to predict response to chemotherapy in early-stage breast cancer patients. In this review, recent advances in the use of multiple markers for prognosis of disease recurrence in early-stage breast cancer and the prediction of therapy response will be discussed.
A HaemAtlas: characterizing gene expression in differentiated human blood cells.
Watkins, Nicholas A; Gusnanto, Arief; de Bono, Bernard; De, Subhajyoti; Miranda-Saavedra, Diego; Hardie, Debbie L; Angenent, Will G J; Attwood, Antony P; Ellis, Peter D; Erber, Wendy; Foad, Nicola S; Garner, Stephen F; Isacke, Clare M; Jolley, Jennifer; Koch, Kerstin; Macaulay, Iain C; Morley, Sarah L; Rendon, Augusto; Rice, Kate M; Taylor, Niall; Thijssen-Timmer, Daphne C; Tijssen, Marloes R; van der Schoot, C Ellen; Wernisch, Lorenz; Winzer, Thilo; Dudbridge, Frank; Buckley, Christopher D; Langford, Cordelia F; Teichmann, Sarah; Göttgens, Berthold; Ouwehand, Willem H
2009-05-07
Hematopoiesis is a carefully controlled process that is regulated by complex networks of transcription factors that are, in part, controlled by signals resulting from ligand binding to cell-surface receptors. To further understand hematopoiesis, we have compared gene expression profiles of human erythroblasts, megakaryocytes, B cells, cytotoxic and helper T cells, natural killer cells, granulocytes, and monocytes using whole genome microarrays. A bioinformatics analysis of these data was performed focusing on transcription factors, immunoglobulin superfamily members, and lineage-specific transcripts. We observed that the numbers of lineage-specific genes varies by 2 orders of magnitude, ranging from 5 for cytotoxic T cells to 878 for granulocytes. In addition, we have identified novel coexpression patterns for key transcription factors involved in hematopoiesis (eg, GATA3-GFI1 and GATA2-KLF1). This study represents the most comprehensive analysis of gene expression in hematopoietic cells to date and has identified genes that play key roles in lineage commitment and cell function. The data, which are freely accessible, will be invaluable for future studies on hematopoiesis and the role of specific genes and will also aid the understanding of the recent genome-wide association studies.
A HaemAtlas: characterizing gene expression in differentiated human blood cells
Gusnanto, Arief; de Bono, Bernard; De, Subhajyoti; Miranda-Saavedra, Diego; Hardie, Debbie L.; Angenent, Will G. J.; Attwood, Antony P.; Ellis, Peter D.; Erber, Wendy; Foad, Nicola S.; Garner, Stephen F.; Isacke, Clare M.; Jolley, Jennifer; Koch, Kerstin; Macaulay, Iain C.; Morley, Sarah L.; Rendon, Augusto; Rice, Kate M.; Taylor, Niall; Thijssen-Timmer, Daphne C.; Tijssen, Marloes R.; van der Schoot, C. Ellen; Wernisch, Lorenz; Winzer, Thilo; Dudbridge, Frank; Buckley, Christopher D.; Langford, Cordelia F.; Teichmann, Sarah; Göttgens, Berthold; Ouwehand, Willem H.
2009-01-01
Hematopoiesis is a carefully controlled process that is regulated by complex networks of transcription factors that are, in part, controlled by signals resulting from ligand binding to cell-surface receptors. To further understand hematopoiesis, we have compared gene expression profiles of human erythroblasts, megakaryocytes, B cells, cytotoxic and helper T cells, natural killer cells, granulocytes, and monocytes using whole genome microarrays. A bioinformatics analysis of these data was performed focusing on transcription factors, immunoglobulin superfamily members, and lineage-specific transcripts. We observed that the numbers of lineage-specific genes varies by 2 orders of magnitude, ranging from 5 for cytotoxic T cells to 878 for granulocytes. In addition, we have identified novel coexpression patterns for key transcription factors involved in hematopoiesis (eg, GATA3-GFI1 and GATA2-KLF1). This study represents the most comprehensive analysis of gene expression in hematopoietic cells to date and has identified genes that play key roles in lineage commitment and cell function. The data, which are freely accessible, will be invaluable for future studies on hematopoiesis and the role of specific genes and will also aid the understanding of the recent genome-wide association studies. PMID:19228925
A novel non-thermostable deuterolysin from Aspergillus oryzae.
Maeda, Hiroshi; Katase, Toru; Sakai, Daisuke; Takeuchi, Michio; Kusumoto, Ken-Ichi; Amano, Hitoshi; Ishida, Hiroki; Abe, Keietsu; Yamagata, Youhei
2016-09-01
Three putative deuterolysin (EC 3.4.24.29) genes (deuA, deuB, and deuC) were found in the Aspergillus oryzae genome database ( http://www.bio.nite.go.jp/dogan/project/view/AO ). One of these genes, deuA, was corresponding to NpII gene, previously reported. DeuA and DeuB were overexpressed by recombinant A. oryzae and were purified. The degradation profiles against protein substrates of both enzymes were similar, but DeuB showed wider substrate specificity against peptidyl MCA-substrates compared with DeuA. Enzymatic profiles of DeuB except for thermostability also resembled those of DeuA. DeuB was inactivated by heat treatment above 80° C, different from thermostable DeuA. Transcription analysis in wild type A. oryzae showed only deuB was expressed in liquid culture, and the addition of the proteinous substrate upregulated the transcription. Furthermore, the NaNO3 addition seems to eliminate the effect of proteinous substrate for the transcription of deuB.
Wang, Lili; Fan, Jean; Francis, Joshua M.; Georghiou, George; Hergert, Sarah; Li, Shuqiang; Gambe, Rutendo; Zhou, Chensheng W.; Yang, Chunxiao; Xiao, Sheng; Cin, Paola Dal; Bowden, Michaela; Kotliar, Dylan; Shukla, Sachet A.; Brown, Jennifer R.; Neuberg, Donna; Alessi, Dario R.; Zhang, Cheng-Zhong; Kharchenko, Peter V.; Livak, Kenneth J.; Wu, Catherine J.
2017-01-01
Intra-tumoral genetic heterogeneity has been characterized across cancers by genome sequencing of bulk tumors, including chronic lymphocytic leukemia (CLL). In order to more accurately identify subclones, define phylogenetic relationships, and probe genotype–phenotype relationships, we developed methods for targeted mutation detection in DNA and RNA isolated from thousands of single cells from five CLL samples. By clearly resolving phylogenic relationships, we uncovered mutated LCP1 and WNK1 as novel CLL drivers, supported by functional evidence demonstrating their impact on CLL pathways. Integrative analysis of somatic mutations with transcriptional states prompts the idea that convergent evolution generates phenotypically similar cells in distinct genetic branches, thus creating a cohesive expression profile in each CLL sample despite the presence of genetic heterogeneity. Our study highlights the potential for single-cell RNA-based targeted analysis to sensitively determine transcriptional and mutational profiles of individual cancer cells, leading to increased understanding of driving events in malignancy. PMID:28679620
Transcriptional architecture of the primate neocortex.
Bernard, Amy; Lubbers, Laura S; Tanis, Keith Q; Luo, Rui; Podtelezhnikov, Alexei A; Finney, Eva M; McWhorter, Mollie M E; Serikawa, Kyle; Lemon, Tracy; Morgan, Rebecca; Copeland, Catherine; Smith, Kimberly; Cullen, Vivian; Davis-Turak, Jeremy; Lee, Chang-Kyu; Sunkin, Susan M; Loboda, Andrey P; Levine, David M; Stone, David J; Hawrylycz, Michael J; Roberts, Christopher J; Jones, Allan R; Geschwind, Daniel H; Lein, Ed S
2012-03-22
Genome-wide transcriptional profiling was used to characterize the molecular underpinnings of neocortical organization in rhesus macaque, including cortical areal specialization and laminar cell-type diversity. Microarray analysis of individual cortical layers across sensorimotor and association cortices identified robust and specific molecular signatures for individual cortical layers and areas, prominently involving genes associated with specialized neuronal function. Overall, transcriptome-based relationships were related to spatial proximity, being strongest between neighboring cortical areas and between proximal layers. Primary visual cortex (V1) displayed the most distinctive gene expression compared to other cortical regions in rhesus and human, both in the specialized layer 4 as well as other layers. Laminar patterns were more similar between macaque and human compared to mouse, as was the unique V1 profile that was not observed in mouse. These data provide a unique resource detailing neocortical transcription patterns in a nonhuman primate with great similarity in gene expression to human. Copyright © 2012 Elsevier Inc. All rights reserved.
Analysis of genomic regions of Trichoderma harzianum IOC-3844 related to biomass degradation.
Crucello, Aline; Sforça, Danilo Augusto; Horta, Maria Augusta Crivelente; dos Santos, Clelton Aparecido; Viana, Américo José Carvalho; Beloti, Lilian Luzia; de Toledo, Marcelo Augusto Szymanski; Vincentz, Michel; Kuroshu, Reginaldo Massanobu; de Souza, Anete Pereira
2015-01-01
Trichoderma harzianum IOC-3844 secretes high levels of cellulolytic-active enzymes and is therefore a promising strain for use in biotechnological applications in second-generation bioethanol production. However, the T. harzianum biomass degradation mechanism has not been well explored at the genetic level. The present work investigates six genomic regions (~150 kbp each) in this fungus that are enriched with genes related to biomass conversion. A BAC library consisting of 5,760 clones was constructed, with an average insert length of 90 kbp. The assembled BAC sequences revealed 232 predicted genes, 31.5% of which were related to catabolic pathways, including those involved in biomass degradation. An expression profile analysis based on RNA-Seq data demonstrated that putative regulatory elements, such as membrane transport proteins and transcription factors, are located in the same genomic regions as genes related to carbohydrate metabolism and exhibit similar expression profiles. Thus, we demonstrate a rapid and efficient tool that focuses on specific genomic regions by combining a BAC library with transcriptomic data. This is the first BAC-based structural genomic study of the cellulolytic fungus T. harzianum, and its findings provide new perspectives regarding the use of this species in biomass degradation processes.
Analysis of Genomic Regions of Trichoderma harzianum IOC-3844 Related to Biomass Degradation
Crucello, Aline; Sforça, Danilo Augusto; Horta, Maria Augusta Crivelente; dos Santos, Clelton Aparecido; Viana, Américo José Carvalho; Beloti, Lilian Luzia; de Toledo, Marcelo Augusto Szymanski; Vincentz, Michel; Kuroshu, Reginaldo Massanobu; de Souza, Anete Pereira
2015-01-01
Trichoderma harzianum IOC-3844 secretes high levels of cellulolytic-active enzymes and is therefore a promising strain for use in biotechnological applications in second-generation bioethanol production. However, the T. harzianum biomass degradation mechanism has not been well explored at the genetic level. The present work investigates six genomic regions (~150 kbp each) in this fungus that are enriched with genes related to biomass conversion. A BAC library consisting of 5,760 clones was constructed, with an average insert length of 90 kbp. The assembled BAC sequences revealed 232 predicted genes, 31.5% of which were related to catabolic pathways, including those involved in biomass degradation. An expression profile analysis based on RNA-Seq data demonstrated that putative regulatory elements, such as membrane transport proteins and transcription factors, are located in the same genomic regions as genes related to carbohydrate metabolism and exhibit similar expression profiles. Thus, we demonstrate a rapid and efficient tool that focuses on specific genomic regions by combining a BAC library with transcriptomic data. This is the first BAC-based structural genomic study of the cellulolytic fungus T. harzianum, and its findings provide new perspectives regarding the use of this species in biomass degradation processes. PMID:25836973
An ensemble model of competitive multi-factor binding of the genome
Wasson, Todd; Hartemink, Alexander J.
2009-01-01
Hundreds of different factors adorn the eukaryotic genome, binding to it in large number. These DNA binding factors (DBFs) include nucleosomes, transcription factors (TFs), and other proteins and protein complexes, such as the origin recognition complex (ORC). DBFs compete with one another for binding along the genome, yet many current models of genome binding do not consider different types of DBFs together simultaneously. Additionally, binding is a stochastic process that results in a continuum of binding probabilities at any position along the genome, but many current models tend to consider positions as being either binding sites or not. Here, we present a model that allows a multitude of DBFs, each at different concentrations, to compete with one another for binding sites along the genome. The result is an “occupancy profile,” a probabilistic description of the DNA occupancy of each factor at each position. We implement our model efficiently as the software package COMPETE. We demonstrate genome-wide and at specific loci how modeling nucleosome binding alters TF binding, and vice versa, and illustrate how factor concentration influences binding occupancy. Binding cooperativity between nearby TFs arises implicitly via mutual competition with nucleosomes. Our method applies not only to TFs, but also recapitulates known occupancy profiles of a well-studied replication origin with and without ORC binding. Importantly, the sequence preferences our model takes as input are derived from in vitro experiments. This ensures that the calculated occupancy profiles are the result of the forces of competition represented explicitly in our model and the inherent sequence affinities of the constituent DBFs. PMID:19720867
Bajaj, Deepak; Das, Shouvik; Upadhyaya, Hari D.; Ranjan, Rajeev; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. Laxmipathi; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
The study identified 9045 high-quality SNPs employing both genome-wide GBS- and candidate gene-based SNP genotyping assays in 172, including 93 cultivated (desi and kabuli) and 79 wild chickpea accessions. The GWAS in a structured population of 93 sequenced accessions detected 15 major genomic loci exhibiting significant association with seed coat color. Five seed color-associated major genomic loci underlying robust QTLs mapped on a high-density intra-specific genetic linkage map were validated by QTL mapping. The integration of association and QTL mapping with gene haplotype-specific LD mapping and transcript profiling identified novel allelic variants (non-synonymous SNPs) and haplotypes in a MATE secondary transporter gene regulating light/yellow brown and beige seed coat color differentiation in chickpea. The down-regulation and decreased transcript expression of beige seed coat color-associated MATE gene haplotype was correlated with reduced proanthocyanidins accumulation in the mature seed coats of beige than light/yellow brown seed colored desi and kabuli accessions for their coloration/pigmentation. This seed color-regulating MATE gene revealed strong purifying selection pressure primarily in LB/YB seed colored desi and wild Cicer reticulatum accessions compared with the BE seed colored kabuli accessions. The functionally relevant molecular tags identified have potential to decipher the complex transcriptional regulatory gene function of seed coat coloration and for understanding the selective sweep-based seed color trait evolutionary pattern in cultivated and wild accessions during chickpea domestication. The genome-wide integrated approach employed will expedite marker-assisted genetic enhancement for developing cultivars with desirable seed coat color types in chickpea. PMID:26635822
Li, LiQi; Jothi, Raja; Cui, Kairong; Lee, Jan Y; Cohen, Tsadok; Gorivodsky, Marat; Tzchori, Itai; Zhao, Yangu; Hayes, Sandra M; Bresnick, Emery H; Zhao, Keji; Westphal, Heiner; Love, Paul E
2013-01-01
The nuclear adaptor Ldb1 functions as a core component of multiprotein transcription complexes that regulate differentiation in diverse cell types. In the hematopoietic lineage, Ldb1 forms a complex with the non–DNA-binding adaptor Lmo2 and the transcription factors E2A, Scl and GATA-1 (or GATA-2). Here we demonstrate a critical and continuous requirement for Ldb1 in the maintenance of both fetal and adult mouse hematopoietic stem cells (HSCs). Deletion of Ldb1 in hematopoietic progenitors resulted in the downregulation of many transcripts required for HSC maintenance. Genome-wide profiling by chromatin immunoprecipitation followed by sequencing (ChIP-Seq) identified Ldb1 complex–binding sites at highly conserved regions in the promoters of genes involved in HSC maintenance. Our results identify a central role for Ldb1 in regulating the transcriptional program responsible for the maintenance of HSCs. PMID:21186366
Li, LiQi; Jothi, Raja; Cui, Kairong; Lee, Jan Y; Cohen, Tsadok; Gorivodsky, Marat; Tzchori, Itai; Zhao, Yangu; Hayes, Sandra M; Bresnick, Emery H; Zhao, Keji; Westphal, Heiner; Love, Paul E
2011-02-01
The nuclear adaptor Ldb1 functions as a core component of multiprotein transcription complexes that regulate differentiation in diverse cell types. In the hematopoietic lineage, Ldb1 forms a complex with the non-DNA-binding adaptor Lmo2 and the transcription factors E2A, Scl and GATA-1 (or GATA-2). Here we demonstrate a critical and continuous requirement for Ldb1 in the maintenance of both fetal and adult mouse hematopoietic stem cells (HSCs). Deletion of Ldb1 in hematopoietic progenitors resulted in the downregulation of many transcripts required for HSC maintenance. Genome-wide profiling by chromatin immunoprecipitation followed by sequencing (ChIP-Seq) identified Ldb1 complex-binding sites at highly conserved regions in the promoters of genes involved in HSC maintenance. Our results identify a central role for Ldb1 in regulating the transcriptional program responsible for the maintenance of HSCs.
USDA-ARS?s Scientific Manuscript database
The stem cell walls of alfalfa [Medicago sativa (L.) ssp. sativa] genotype 252 have high cellulose and lignin concentrations, while stem cell walls of genotype 1283 have low cellulose and lignin concentrations. The GeneChip® Medicago Genome Array, developed for Medicago truncatula, is a suitable pla...
USDA-ARS?s Scientific Manuscript database
Background: The objective of this study was to acquire a broader, more comprehensive picture of the transcriptional changes in the L. Thoracis muscle (LT) and subcutaneous fat (SF) of lambs supplemented with vitamin E. Furthermore, we aimed to identify novel genes involved in the metabolism of vitam...
Li, Guoqing; Shao, Jinhui; Liu, Cong; Lu, Jun; Zhao, Xiaodong
2017-01-01
Alternative polyadenylation (APA) plays an important role in regulation of genes expression and is involved in many biological processes. As eukaryotic cells receive a variety of external signals, genes produce diverse transcriptional isoforms and exhibit different translation efficiency. The traditional Chinese medicine (TCM) Jinfukang (JFK) has been effectively used for lung cancer treatment. In this study, we investigated whether JFK exerts its antitumor effect by modulating APA patterns in lung cancer cells. We performed a genome-wide APA site profiling analysis in JFK treated lung cancer cells A549 with 3T-seq approach that we reported previously. Comparing with those in untreated A549, in JFK treated A549 we observed APA-mediated 3′ UTRs alterations in 310 genes including 77 genes with shortened 3′ UTRs. In particular, we identified TMEM123, a gene involved in oncotic cell death, which produced transcripts with shortened 3′ UTR and thus was upregulated upon JFK treatment. Taken together, our studies suggest that APA might be one of the antitumor mechanisms of JFK and provide a new insight for the understanding of TCM against cancer. PMID:29234412
Kou, Yao; Li, Guoqing; Shao, Jinhui; Liu, Cong; Wu, Jun; Lu, Jun; Zhao, Xiaodong; Tian, Jing
2017-01-01
Alternative polyadenylation (APA) plays an important role in regulation of genes expression and is involved in many biological processes. As eukaryotic cells receive a variety of external signals, genes produce diverse transcriptional isoforms and exhibit different translation efficiency. The traditional Chinese medicine (TCM) Jinfukang (JFK) has been effectively used for lung cancer treatment. In this study, we investigated whether JFK exerts its antitumor effect by modulating APA patterns in lung cancer cells. We performed a genome-wide APA site profiling analysis in JFK treated lung cancer cells A549 with 3T-seq approach that we reported previously. Comparing with those in untreated A549, in JFK treated A549 we observed APA-mediated 3' UTRs alterations in 310 genes including 77 genes with shortened 3' UTRs. In particular, we identified TMEM123 , a gene involved in oncotic cell death, which produced transcripts with shortened 3' UTR and thus was upregulated upon JFK treatment. Taken together, our studies suggest that APA might be one of the antitumor mechanisms of JFK and provide a new insight for the understanding of TCM against cancer.
Chai, Wenbo; Si, Weina; Ji, Wei; Qin, Qianqian; Zhao, Manli; Jiang, Haiyang
2018-01-01
HD-Zip proteins represent the major transcription factors in higher plants, playing essential roles in plant development and stress responses. Foxtail millet is a crop to investigate the systems biology of millet and biofuel grasses and the HD-Zip gene family has not been studied in foxtail millet. For further investigation of the expression profile of the HD-Zip gene family in foxtail millet, a comprehensive genome-wide expression analysis was conducted in this study. We found 47 protein-encoding genes in foxtail millet using BLAST search tools; the putative proteins were classified into four subfamilies, namely, subfamilies I, II, III, and IV. Gene structure and motif analysis indicate that the genes in one subfamily were conserved. Promotor analysis showed that HD-Zip gene was involved in abiotic stress. Duplication analysis revealed that 8 (~17%) hdz genes were tandemly duplicated and 28 (58%) were segmentally duplicated; purifying duplication plays important roles in gene expansion. Microsynteny analysis revealed the maximum relationship in foxtail millet-sorghum and foxtail millet-rice. Expression profiling upon the abiotic stresses of drought and high salinity and the biotic stress of ABA revealed that some genes regulated responses to drought and salinity stresses via an ABA-dependent process, especially sihdz29 and sihdz45. Our study provides new insight into evolutionary and functional analyses of HD-Zip genes involved in environmental stress responses in foxtail millet.
Pervasive, Genome-Wide Transcription in the Organelle Genomes of Diverse Plastid-Bearing Protists.
Sanitá Lima, Matheus; Smith, David Roy
2017-11-06
Organelle genomes are among the most sequenced kinds of chromosome. This is largely because they are small and widely used in molecular studies, but also because next-generation sequencing technologies made sequencing easier, faster, and cheaper. However, studies of organelle RNA have not kept pace with those of DNA, despite huge amounts of freely available eukaryotic RNA-sequencing (RNA-seq) data. Little is known about organelle transcription in nonmodel species, and most of the available eukaryotic RNA-seq data have not been mined for organelle transcripts. Here, we use publicly available RNA-seq experiments to investigate organelle transcription in 30 diverse plastid-bearing protists with varying organelle genomic architectures. Mapping RNA-seq data to organelle genomes revealed pervasive, genome-wide transcription, regardless of the taxonomic grouping, gene organization, or noncoding content. For every species analyzed, transcripts covered ≥85% of the mitochondrial and/or plastid genomes (all of which were ≤105 kb), indicating that most of the organelle DNA-coding and noncoding-is transcriptionally active. These results follow earlier studies of model species showing that organellar transcription is coupled and ubiquitous across the genome, requiring significant downstream processing of polycistronic transcripts. Our findings suggest that noncoding organelle DNA can be transcriptionally active, raising questions about the underlying function of these transcripts and underscoring the utility of publicly available RNA-seq data for recovering complete genome sequences. If pervasive transcription is also found in bigger organelle genomes (>105 kb) and across a broader range of eukaryotes, this could indicate that noncoding organelle RNAs are regulating fundamental processes within eukaryotic cells. Copyright © 2017 Sanitá Lima and Smith.
The Innate Immune Database (IIDB)
Korb, Martin; Rust, Aistair G; Thorsson, Vesteinn; Battail, Christophe; Li, Bin; Hwang, Daehee; Kennedy, Kathleen A; Roach, Jared C; Rosenberger, Carrie M; Gilchrist, Mark; Zak, Daniel; Johnson, Carrie; Marzolf, Bruz; Aderem, Alan; Shmulevich, Ilya; Bolouri, Hamid
2008-01-01
Background As part of a National Institute of Allergy and Infectious Diseases funded collaborative project, we have performed over 150 microarray experiments measuring the response of C57/BL6 mouse bone marrow macrophages to toll-like receptor stimuli. These microarray expression profiles are available freely from our project web site . Here, we report the development of a database of computationally predicted transcription factor binding sites and related genomic features for a set of over 2000 murine immune genes of interest. Our database, which includes microarray co-expression clusters and a host of web-based query, analysis and visualization facilities, is available freely via the internet. It provides a broad resource to the research community, and a stepping stone towards the delineation of the network of transcriptional regulatory interactions underlying the integrated response of macrophages to pathogens. Description We constructed a database indexed on genes and annotations of the immediate surrounding genomic regions. To facilitate both gene-specific and systems biology oriented research, our database provides the means to analyze individual genes or an entire genomic locus. Although our focus to-date has been on mammalian toll-like receptor signaling pathways, our database structure is not limited to this subject, and is intended to be broadly applicable to immunology. By focusing on selected immune-active genes, we were able to perform computationally intensive expression and sequence analyses that would currently be prohibitive if applied to the entire genome. Using six complementary computational algorithms and methodologies, we identified transcription factor binding sites based on the Position Weight Matrices available in TRANSFAC. For one example transcription factor (ATF3) for which experimental data is available, over 50% of our predicted binding sites coincide with genome-wide chromatin immnuopreciptation (ChIP-chip) results. Our database can be interrogated via a web interface. Genomic annotations and binding site predictions can be automatically viewed with a customized version of the Argo genome browser. Conclusion We present the Innate Immune Database (IIDB) as a community resource for immunologists interested in gene regulatory systems underlying innate responses to pathogens. The database website can be freely accessed at . PMID:18321385
University of California San Francisco (UCSF-2): Gene Expression Profiling of Normal Mouse Skin, Hras WT and Hras -/- This data set contains the transcriptional profiles of 20 dorsal skin samples from eight-week-old mice. Mice were generated by crossing FVB/N to Mus spretus mice to generate F1 mice, and then crossing F1 mice back to the FVB/N strain. 10 FVB/N mice lacking Hras1 (aka HrasKO, Hras-/-) and 10 FVB/N mice with wild-type Hras1 were generated. Read the abstract.
Iandolino, Alberto; Nobuta, Kan; da Silva, Francisco Goes; Cook, Douglas R; Meyers, Blake C
2008-05-12
Vitis vinifera (V. vinifera) is the primary grape species cultivated for wine production, with an industry valued annually in the billions of dollars worldwide. In order to sustain and increase grape production, it is necessary to understand the genetic makeup of grape species. Here we performed mRNA profiling using Massively Parallel Signature Sequencing (MPSS) and combined it with available Expressed Sequence Tag (EST) data. These tag-based technologies, which do not require a priori knowledge of genomic sequence, are well-suited for transcriptional profiling. The sequence depth of MPSS allowed us to capture and quantify almost all the transcripts at a specific stage in the development of the grape berry. The number and relative abundance of transcripts from stage II grape berries was defined using Massively Parallel Signature Sequencing (MPSS). A total of 2,635,293 17-base and 2,259,286 20-base signatures were obtained, representing at least 30,737 and 26,878 distinct sequences. The average normalized abundance per signature was approximately 49 TPM (Transcripts Per Million). Comparisons of the MPSS signatures with available Vitis species' ESTs and a unigene set demonstrated that 6,430 distinct contigs and 2,190 singletons have a perfect match to at least one MPSS signature. Among the matched sequences, ESTs were identified from tissues other than berries or from berries at different developmental stages. Additional MPSS signatures not matching to known grape ESTs can extend our knowledge of the V. vinifera transcriptome, particularly when these data are used to assist in annotation of whole genome sequences from Vitis vinifera. The MPSS data presented here not only achieved a higher level of saturation than previous EST based analyses, but in doing so, expand the known set of transcripts of grape berries during the unique stage in development that immediately precedes the onset of ripening. The MPSS dataset also revealed evidence of antisense expression not previously reported in grapes but comparable to that reported in other plant species. Finally, we developed a novel web-based, public resource for utilization of the grape MPSS data [1].
Transcriptional analysis of phloem-associated cells of potato.
Lin, Tian; Lashbrook, Coralie C; Cho, Sung Ki; Butler, Nathaniel M; Sharma, Pooja; Muppirala, Usha; Severin, Andrew J; Hannapel, David J
2015-09-03
Numerous signal molecules, including proteins and mRNAs, are transported through the architecture of plants via the vascular system. As the connection between leaves and other organs, the petiole and stem are especially important in their transport function, which is carried out by the phloem and xylem, especially by the sieve elements in the phloem system. The phloem is an important conduit for transporting photosynthate and signal molecules like metabolites, proteins, small RNAs, and full-length mRNAs. Phloem sap has been used as an unadulterated source to profile phloem proteins and RNAs, but unfortunately, pure phloem sap cannot be obtained in most plant species. Here we make use of laser capture microdissection (LCM) and RNA-seq for an in-depth transcriptional profile of phloem-associated cells of both petioles and stems of potato. To expedite our analysis, we have taken advantage of the potato genome that has recently been fully sequenced and annotated. Out of the 27 k transcripts assembled that we identified, approximately 15 k were present in phloem-associated cells of petiole and stem with greater than ten reads. Among these genes, roughly 10 k are affected by photoperiod. Several RNAs from this day length-regulated group are also abundant in phloem cells of petioles and encode for proteins involved in signaling or transcriptional control. Approximately 22 % of the transcripts in phloem cells contained at least one binding motif for Pumilio, Nova, or polypyrimidine tract-binding proteins in their downstream sequences. Highlighting the predominance of binding processes identified in the gene ontology analysis of active genes from phloem cells, 78 % of the 464 RNA-binding proteins present in the potato genome were detected in our phloem transcriptome. As a reasonable alternative when phloem sap collection is not possible, LCM can be used to isolate RNA from specific cell types, and along with RNA-seq, provides practical access to expression profiles of phloem tissue. The combination of these techniques provides a useful approach to the study of phloem and a comprehensive picture of the mechanisms associated with long-distance signaling. The data presented here provide valuable insights into potentially novel phloem-mobile mRNAs and phloem-associated RNA-binding proteins.
Griffin, Nicole G; Wang, Yu; Hulette, Christine M; Halvorsen, Matt; Cronin, Kenneth D; Walley, Nicole M; Haglund, Michael M; Radtke, Rodney A; Skene, J H Pate; Sinha, Saurabh R; Heinzen, Erin L
2016-03-01
Hippocampal sclerosis is the most common neuropathologic finding in cases of medically intractable mesial temporal lobe epilepsy. In this study, we analyzed the gene expression profiles of dentate granule cells of patients with mesial temporal lobe epilepsy with and without hippocampal sclerosis to show that next-generation sequencing methods can produce interpretable genomic data from RNA collected from small homogenous cell populations, and to shed light on the transcriptional changes associated with hippocampal sclerosis. RNA was extracted, and complementary DNA (cDNA) was prepared and amplified from dentate granule cells that had been harvested by laser capture microdissection from surgically resected hippocampi from patients with mesial temporal lobe epilepsy with and without hippocampal sclerosis. Sequencing libraries were sequenced, and the resulting sequencing reads were aligned to the reference genome. Differential expression analysis was used to ascertain expression differences between patients with and without hippocampal sclerosis. Greater than 90% of the RNA-Seq reads aligned to the reference. There was high concordance between transcriptional profiles obtained for duplicate samples. Principal component analysis revealed that the presence or absence of hippocampal sclerosis was the main determinant of the variance within the data. Among the genes up-regulated in the hippocampal sclerosis samples, there was significant enrichment for genes involved in oxidative phosphorylation. By analyzing the gene expression profiles of dentate granule cells from surgically resected hippocampal specimens from patients with mesial temporal lobe epilepsy with and without hippocampal sclerosis, we have demonstrated the utility of next-generation sequencing methods for producing biologically relevant results from small populations of homogeneous cells, and have provided insight on the transcriptional changes associated with this pathology. Wiley Periodicals, Inc. © 2016 International League Against Epilepsy.
Kimura, Shinzo; Ishidou, Emi; Kurita, Sakiko; Suzuki, Yoshiteru; Shibato, Junko; Rakwal, Randeep; Iwahashi, Hitoshi
2006-07-21
Ionizing radiation (IR) is the most enigmatic of genotoxic stress inducers in our environment that has been around from the eons of time. IR is generally considered harmful, and has been the subject of numerous studies, mostly looking at the DNA damaging effects in cells and the repair mechanisms therein. Moreover, few studies have focused on large-scale identification of cellular responses to IR, and to this end, we describe here an initial study on the transcriptional responses of the unicellular genome model, yeast (Saccharomyces cerevisiae strain S288C), by cDNA microarray. The effect of two different IR, X-rays, and gamma (gamma)-rays, was investigated by irradiating the yeast cells cultured in YPD medium with 50 Gy doses of X- and gamma-rays, followed by resuspension of the cells in YPD for time-course experiments. The samples were collected for microarray analysis at 20, 40, and 80 min after irradiation. Microarray analysis revealed a time-course transcriptional profile of changed gene expressions. Up-regulated genes belonged to the functional categories mainly related to cell cycle and DNA processing, cell rescue defense and virulence, protein and cell fate, and metabolism (X- and gamma-rays). Similarly, for X- and gamma-rays, the down-regulated genes belonged to mostly transcription and protein synthesis, cell cycle and DNA processing, control of cellular organization, cell fate, and C-compound and carbohydrate metabolism categories, respectively. This study provides for the first time a snapshot of the genome-wide mRNA expression profiles in X- and gamma-ray post-irradiated yeast cells and comparatively interprets/discusses the changed gene functional categories as effects of these two radiations vis-à-vis their energy levels.
Di, Li-Jun; Byun, Jung S; Wong, Madeline M; Wakano, Clay; Taylor, Tara; Bilke, Sven; Baek, Songjoon; Hunter, Kent; Yang, Howard; Lee, Maxwell; Zvosec, Cecilia; Khramtsova, Galina; Cheng, Fan; Perou, Charles M; Miller, C Ryan; Raab, Rachel; Olopade, Olufunmilayo I; Gardner, Kevin
2013-01-01
The C-terminal binding protein (CtBP) is a NADH-dependent transcriptional repressor that links carbohydrate metabolism to epigenetic regulation by recruiting diverse histone-modifying complexes to chromatin. Here global profiling of CtBP in breast cancer cells reveals that it drives epithelial-to-mesenchymal transition, stem cell pathways and genome instability. CtBP expression induces mesenchymal and stem cell-like features, whereas CtBP depletion or caloric restriction reverses gene repression and increases DNA repair. Multiple members of the CtBP-targeted gene network are selectively downregulated in aggressive breast cancer subtypes. Differential expression of CtBP-targeted genes predicts poor clinical outcome in breast cancer patients, and elevated levels of CtBP in patient tumours predict shorter median survival. Finally, both CtBP promoter targeting and gene repression can be reversed by small molecule inhibition. These findings define broad roles for CtBP in breast cancer biology and suggest novel chromatin-based strategies for pharmacologic and metabolic intervention in cancer.
Yuan, Xiao-Long; Gao, Ning; Xing, Yan; Zhang, Hai-Bin; Zhang, Ai-Ling; Liu, Jing; He, Jin-Long; Xu, Yuan; Lin, Wen-Mian; Chen, Zan-Mou; Zhang, Hao; Zhang, Zhe; Li, Jia-Qi
2016-02-25
Substantial evidence has shown that DNA methylation regulates the initiation of ovarian and sexual maturation. Here, we investigated the genome-wide profile of DNA methylation in porcine ovaries at single-base resolution using reduced representation bisulfite sequencing. The biological variation was minimal among the three ovarian replicates. We found hypermethylation frequently occurred in regions with low gene abundance, while hypomethylation in regions with high gene abundance. The DNA methylation around transcriptional start sites was negatively correlated with their own CpG content. Additionally, the methylation level in the bodies of genes was higher than that in their 5' and 3' flanking regions. The DNA methylation pattern of the low CpG content promoter genes differed obviously from that of the high CpG content promoter genes. The DNA methylation level of the porcine ovary was higher than that of the porcine intestine. Analyses of the genome-wide DNA methylation in porcine ovaries would advance the knowledge and understanding of the porcine ovarian methylome.
Høgslund, Niels; Radutoiu, Simona; Krusell, Lene; Voroshilova, Vera; Hannah, Matthew A.; Goffard, Nicolas; Sanchez, Diego H.; Lippold, Felix; Ott, Thomas; Sato, Shusei; Tabata, Satoshi; Liboriussen, Poul; Lohmann, Gitte V.; Schauser, Leif; Weiller, Georg F.; Udvardi, Michael K.; Stougaard, Jens
2009-01-01
Genetic analyses of plant symbiotic mutants has led to the identification of key genes involved in Rhizobium-legume communication as well as in development and function of nitrogen fixing root nodules. However, the impact of these genes in coordinating the transcriptional programs of nodule development has only been studied in limited and isolated studies. Here, we present an integrated genome-wide analysis of transcriptome landscapes in Lotus japonicus wild-type and symbiotic mutant plants. Encompassing five different organs, five stages of the sequentially developed determinate Lotus root nodules, and eight mutants impaired at different stages of the symbiotic interaction, our data set integrates an unprecedented combination of organ- or tissue-specific profiles with mutant transcript profiles. In total, 38 different conditions sampled under the same well-defined growth regimes were included. This comprehensive analysis unravelled new and unexpected patterns of transcriptional regulation during symbiosis and organ development. Contrary to expectations, none of the previously characterized nodulins were among the 37 genes specifically expressed in nodules. Another surprise was the extensive transcriptional response in whole root compared to the susceptible root zone where the cellular response is most pronounced. A large number of transcripts predicted to encode transcriptional regulators, receptors and proteins involved in signal transduction, as well as many genes with unknown function, were found to be regulated during nodule organogenesis and rhizobial infection. Combining wild type and mutant profiles of these transcripts demonstrates the activation of a complex genetic program that delineates symbiotic nitrogen fixation. The complete data set was organized into an indexed expression directory that is accessible from a resource database, and here we present selected examples of biological questions that can be addressed with this comprehensive and powerful gene expression data set. PMID:19662091
Sun, Xiaoji; Wang, Xuya; Tang, Zuojian; Grivainis, Mark; Kahler, David; Yun, Chi; Mita, Paolo; Fenyö, David
2018-01-01
Transposable elements (TEs) represent a substantial fraction of many eukaryotic genomes, and transcriptional regulation of these factors is important to determine TE activities in human cells. However, due to the repetitive nature of TEs, identifying transcription factor (TF)-binding sites from ChIP-sequencing (ChIP-seq) datasets is challenging. Current algorithms are focused on subtle differences between TE copies and thus bias the analysis to relatively old and inactive TEs. Here we describe an approach termed “MapRRCon” (mapping repeat reads to a consensus) which allows us to identify proteins binding to TE DNA sequences by mapping ChIP-seq reads to the TE consensus sequence after whole-genome alignment. Although this method does not assign binding sites to individual insertions in the genome, it provides a landscape of interacting TFs by capturing factors that bind to TEs under various conditions. We applied this method to screen TFs’ interaction with L1 in human cells/tissues using ENCODE ChIP-seq datasets and identified 178 of the 512 TFs tested as bound to L1 in at least one biological condition with most of them (138) localized to the promoter. Among these L1-binding factors, we focused on Myc and CTCF, as they play important roles in cancer progression and 3D chromatin structure formation. Furthermore, we explored the transcriptomes of The Cancer Genome Atlas breast and ovarian tumor samples in which a consistent anti-/correlation between L1 and Myc/CTCF expression was observed, suggesting that these two factors may play roles in regulating L1 transcription during the development of such tumors. PMID:29802231
Washington, Shannan D; Musarrat, Farhana; Ertel, Monica K; Backes, Gregory L; Neumann, Donna M
2018-04-15
There are seven conserved CTCF binding domains in the herpes simplex virus 1 (HSV-1) genome. These binding sites individually flank the latency-associated transcript (LAT) and the immediate early (IE) gene regions, suggesting that CTCF insulators differentially control transcriptional domains in HSV-1 latency. In this work, we show that two CTCF binding motifs in HSV-1 display enhancer blocking in a cell-type-specific manner. We found that CTCF binding to the latent HSV-1 genome was LAT dependent and that the quantity of bound CTCF was site specific. Following reactivation, CTCF eviction was dynamic, suggesting that each CTCF site was independently regulated. We explored whether CTCF sites recruit the polycomb-repressive complex 2 (PRC2) to establish repressive domains through a CTCF-Suz12 interaction and found that Suz12 colocalized to the CTCF insulators flanking the ICP0 and ICP4 regions and, conversely, was removed at early times postreactivation. Collectively, these data support the idea that CTCF sites in HSV-1 are independently regulated and may contribute to lytic-latent HSV-1 control in a site-specific manner. IMPORTANCE The role of chromatin insulators in DNA viruses is an area of interest. It has been shown in several beta- and gammaherpesviruses that insulators likely control the lytic transcriptional profile through protein recruitment and through the formation of three-dimensional (3D) chromatin loops. The ability of insulators to regulate alphaherpesviruses has been understudied to date. The alphaherpesvirus HSV-1 has seven conserved insulator binding motifs that flank regions of the genome known to contribute to the establishment of latency. Our work presented here contributes to the understanding of how insulators control transcription of HSV-1. Copyright © 2018 American Society for Microbiology.
Decoding the non-coding genome: elucidating genetic risk outside the coding genome.
Barr, C L; Misener, V L
2016-01-01
Current evidence emerging from genome-wide association studies indicates that the genetic underpinnings of complex traits are likely attributable to genetic variation that changes gene expression, rather than (or in combination with) variation that changes protein-coding sequences. This is particularly compelling with respect to psychiatric disorders, as genetic changes in regulatory regions may result in differential transcriptional responses to developmental cues and environmental/psychosocial stressors. Until recently, however, the link between transcriptional regulation and psychiatric genetic risk has been understudied. Multiple obstacles have contributed to the paucity of research in this area, including challenges in identifying the positions of remote (distal from the promoter) regulatory elements (e.g. enhancers) and their target genes and the underrepresentation of neural cell types and brain tissues in epigenome projects - the availability of high-quality brain tissues for epigenetic and transcriptome profiling, particularly for the adolescent and developing brain, has been limited. Further challenges have arisen in the prediction and testing of the functional impact of DNA variation with respect to multiple aspects of transcriptional control, including regulatory-element interaction (e.g. between enhancers and promoters), transcription factor binding and DNA methylation. Further, the brain has uncommon DNA-methylation marks with unique genomic distributions not found in other tissues - current evidence suggests the involvement of non-CG methylation and 5-hydroxymethylation in neurodevelopmental processes but much remains unknown. We review here knowledge gaps as well as both technological and resource obstacles that will need to be overcome in order to elucidate the involvement of brain-relevant gene-regulatory variants in genetic risk for psychiatric disorders. © 2015 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
Lumsden, Amanda L; Ma, Yuefang; Ashander, Liam M; Stempel, Andrew J; Keating, Damien J; Smith, Justine R; Appukuttan, Binoy
2018-05-09
Regulation of intercellular adhesion molecule (ICAM)-1 in retinal endothelial cells is a promising druggable target for retinal vascular diseases. The ICAM-1-related (ICR) long non-coding RNA stabilizes ICAM-1 transcript, increasing protein expression. However, studies of ICR involvement in disease have been limited as the promoter is uncharacterized. To address this issue, we undertook a comprehensive in silico analysis of the human ICR gene promoter region. We used genomic evolutionary rate profiling to identify a 115 base pair (bp) sequence within 500 bp upstream of the transcription start site of the annotated human ICR gene that was conserved across 25 eutherian genomes. A second constrained sequence upstream of the orthologous mouse gene (68 bp; conserved across 27 Eutherian genomes including human) was also discovered. Searching these elements identified 33 matrices predictive of binding sites for transcription factors known to be responsive to a broad range of pathological stimuli, including hypoxia, and metabolic and inflammatory proteins. Five phenotype-associated single nucleotide polymorphisms (SNPs) in the immediate vicinity of these elements included four SNPs (i.e. rs2569693, rs281439, rs281440 and rs11575074) predicted to impact binding motifs of transcription factors, and thus the expression of ICR and ICAM-1 genes, with potential to influence disease susceptibility. We verified that human retinal endothelial cells expressed ICR, and observed induction of expression by tumor necrosis factor-α.
Gessner, Denise K; Winkler, Anne; Koch, Christian; Dusel, Georg; Liebisch, Gerhard; Ringseis, Robert; Eder, Klaus
2017-03-23
It was recently reported that dairy cows fed a polyphenol-rich grape seed and grape marc meal extract (GSGME) during the transition period had an increased milk yield, but the underlying reasons remained unclear. As polyphenols exert a broad spectrum of metabolic effects, we hypothesized that feeding of GSGME influences metabolic pathways in the liver which could account for the positive effects of GSGME in dairy cows. In order to identify these pathways, we performed genome-wide transcript profiling in the liver and lipid profiling in plasma of dairy cows fed GSGME during the transition period at 1 week postpartum. Transcriptomic analysis of the liver revealed 207 differentially expressed transcripts, from which 156 were up- and 51 were down-regulated, between cows fed GSGME and control cows. Gene set enrichment analysis of the 155 up-regulated mRNAs showed that the most enriched gene ontology (GO) biological process terms were dealing with cell cycle regulation and the most enriched Kyoto Encyclopedia of Genes and Genomes pathways were p53 signaling and cell cycle. Functional analysis of the 43 down-regulated mRNAs revealed that a great part of these genes are involved in endoplasmic reticulum (ER) stress-induced unfolded protein response (UPR) and inflammatory processes. Accordingly, protein folding, response to unfolded protein, unfolded protein binding, chemokine activity and heat shock protein binding were identified as one of the most enriched GO biological process and molecular function terms assigned to the down-regulated genes. In line with the transcriptomics data the plasma concentrations of the acute phase proteins serum amyloid A (SAA) and haptoglobin were reduced in cows fed GSGME compared to control cows. Lipidomic analysis of plasma revealed no differences in the concentrations of individual species of major and minor lipid classes between cows fed GSGME and control cows. Analysis of hepatic transcript profile in cows fed GSGME during the transition period at 1 week postpartum indicates that polyphenol-rich feed components are able to inhibit ER stress-induced UPR and inflammatory processes, both of which are considered to contribute to liver-associated diseases and to impair milk performance in dairy cows, in the liver of dairy cows during early lactation.
Khan, Meraj A; Sengupta, Jayasree; Mittal, Suneeta; Ghosh, Debabrata
2012-09-24
In order to obtain a lead of the pathophysiology of endometriosis, genome-wide expressional analyses of eutopic and ectopic endometrium have earlier been reported, however, the effects of stages of severity and phases of menstrual cycle on expressional profiles have not been examined. The effect of genetic heterogeneity and fertility history on transcriptional activity was also not considered. In the present study, a genome-wide expression analysis of autologous, paired eutopic and ectopic endometrial samples obtained from fertile women (n=18) suffering from moderate (stage 3; n=8) or severe (stage 4; n=10) ovarian endometriosis during proliferative (n=13) and secretory (n=5) phases of menstrual cycle was performed. Individual pure RNA samples were subjected to Agilent's Whole Human Genome 44K microarray experiments. Microarray data were validated (P<0.01) by estimating transcript copy numbers by performing real time RT-PCR of seven (7) arbitrarily selected genes in all samples. The data obtained were subjected to differential expression (DE) and differential co-expression (DC) analyses followed by networks and enrichment analysis, and gene set enrichment analysis (GSEA). The reproducibility of prediction based on GSEA implementation of DC results was assessed by examining the relative expressions of twenty eight (28) selected genes in RNA samples obtained from fresh pool of eutopic and ectopic samples from confirmed ovarian endometriosis patients with stages 3 and 4 (n=4/each) during proliferative and secretory (n=4/each) phases. Higher clustering effect of pairing (cluster distance, cd=0.1) in samples from same individuals on expressional arrays among eutopic and ectopic samples was observed as compared to that of clinical stages of severity (cd=0.5) and phases of menstrual cycle (cd=0.6). Post hoc analysis revealed anomaly in the expressional profiles of several genes associated with immunological, neuracrine and endocrine functions and gynecological cancers however with no overt oncogenic potential in endometriotic tissue. Dys-regulation of three (CLOCK, ESR1, and MYC) major transcription factors appeared to be significant causative factors in the pathogenesis of ovarian endometriosis. A novel cohort of twenty-eight (28) genes representing potential marker for ovarian endometriosis in fertile women was discovered. Dysfunctional expression of immuno-neuro-endocrine behaviour in endometrium appeared critical to endometriosis. Although no overt oncogenic potential was evident, several genes associated with gynecological cancers were observed to be high in the expressional profiles in endometriotic tissue.
Targeting Transcriptional Regulators of CD8+ T Cell Dysfunction to Boost Anti-Tumor Immunity
Waugh, Katherine A.; Leach, Sonia M.; Slansky, Jill E.
2015-01-01
Transcription is a dynamic process influenced by the cellular environment: healthy, transformed, and otherwise. Genome-wide mRNA expression profiles reflect the collective impact of pathways modulating cell function under different conditions. In this review we focus on the transcriptional pathways that control tumor infiltrating CD8+ T cell (TIL) function. Simultaneous restraint of overlapping inhibitory pathways may confer TIL resistance to multiple mechanisms of suppression traditionally referred to as exhaustion, tolerance, or anergy. Although decades of work have laid a solid foundation of altered transcriptional networks underlying various subsets of hypofunctional or “dysfunctional” CD8+ T cells, an understanding of the relevance in TIL has just begun. With recent technological advances, it is now feasible to further elucidate and utilize these pathways in immunotherapy platforms that seek to increase TIL function. PMID:26393659
JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles
Portales-Casamar, Elodie; Thongjuea, Supat; Kwon, Andrew T.; Arenillas, David; Zhao, Xiaobei; Valen, Eivind; Yusuf, Dimas; Lenhard, Boris; Wasserman, Wyeth W.; Sandelin, Albin
2010-01-01
JASPAR (http://jaspar.genereg.net) is the leading open-access database of matrix profiles describing the DNA-binding patterns of transcription factors (TFs) and other proteins interacting with DNA in a sequence-specific manner. Its fourth major release is the largest expansion of the core database to date: the database now holds 457 non-redundant, curated profiles. The new entries include the first batch of profiles derived from ChIP-seq and ChIP-chip whole-genome binding experiments, and 177 yeast TF binding profiles. The introduction of a yeast division brings the convenience of JASPAR to an active research community. As binding models are refined by newer data, the JASPAR database now uses versioning of matrices: in this release, 12% of the older models were updated to improved versions. Classification of TF families has been improved by adopting a new DNA-binding domain nomenclature. A curated catalog of mammalian TFs is provided, extending the use of the JASPAR profiles to additional TFs belonging to the same structural family. The changes in the database set the system ready for more rapid acquisition of new high-throughput data sources. Additionally, three new special collections provide matrix profile data produced by recent alternative high-throughput approaches. PMID:19906716
Nash, Claire; Boufaied, Nadia; Mills, Ian G; Franco, Omar E; Hayward, Simon W; Thomson, Axel A
2017-05-05
The androgen receptor (AR) is a transcription factor, and key regulator of prostate development and cancer, which has discrete functions in stromal versus epithelial cells. AR expressed in mesenchyme is necessary and sufficient for prostate development while loss of stromal AR is predictive of prostate cancer progression. Many studies have characterized genome-wide binding of AR in prostate tumour cells but none have used primary mesenchyme or stroma. We applied ChIPseq to identify genomic AR binding sites in primary human fetal prostate fibroblasts and patient derived cancer associated fibroblasts, as well as the WPMY1 cell line overexpressing AR. We identified AR binding sites that were specific to fetal prostate fibroblasts (7534), cancer fibroblasts (629), WPMY1-AR (2561) as well as those common among all (783). Primary fibroblasts had a distinct AR binding profile versus prostate cancer cell lines and tissue, and showed a localisation to gene promoter binding sites 1 kb upstream of the transcriptional start site, as well as non-classical AR binding sequence motifs. We used RNAseq to define transcribed genes associated with AR binding sites and derived cistromes for embryonic and cancer fibroblasts as well as a cistrome common to both. These were compared to several in vivo ChIPseq and transcript expression datasets; which identified subsets of AR targets that were expressed in vivo and regulated by androgens. This analysis enabled us to deconvolute stromal AR targets active in stroma within tumour samples. Taken together, our data suggest that the AR shows significantly different genomic binding site locations in primary prostate fibroblasts compared to that observed in tumour cells. Validation of our AR binding site data with transcript expression in vitro and in vivo suggests that the AR target genes we have identified in primary fibroblasts may contribute to clinically significant and biologically important AR-regulated changes in prostate tissue. Copyright © 2017. Published by Elsevier B.V.
Mariani, Luca; Weinand, Kathryn; Vedenko, Anastasia; Barrera, Luis A; Bulyk, Martha L
2017-09-27
Transcription factors (TFs) control cellular processes by binding specific DNA motifs to modulate gene expression. Motif enrichment analysis of regulatory regions can identify direct and indirect TF binding sites. Here, we created a glossary of 108 non-redundant TF-8mer "modules" of shared specificity for 671 metazoan TFs from publicly available and new universal protein binding microarray data. Analysis of 239 ENCODE TF chromatin immunoprecipitation sequencing datasets and associated RNA sequencing profiles suggest the 8mer modules are more precise than position weight matrices in identifying indirect binding motifs and their associated tethering TFs. We also developed GENRE (genomically equivalent negative regions), a tunable tool for construction of matched genomic background sequences for analysis of regulatory regions. GENRE outperformed four state-of-the-art approaches to background sequence construction. We used our TF-8mer glossary and GENRE in the analysis of the indirect binding motifs for the co-occurrence of tethering factors, suggesting novel TF-TF interactions. We anticipate that these tools will aid in elucidating tissue-specific gene-regulatory programs. Copyright © 2017 Elsevier Inc. All rights reserved.
Vitamin D receptor signaling and its therapeutic implications: Genome-wide and structural view.
Carlberg, Carsten; Molnár, Ferdinand
2015-05-01
Vitamin D3 is one of the few natural compounds that has, via its metabolite 1α,25-dihydroxyvitamin D3 (1,25(OH)2D3) and the transcription factor vitamin D receptor (VDR), a direct effect on gene regulation. For efficiently applying the therapeutic and disease-preventing potential of 1,25(OH)2D3 and its synthetic analogs, the key steps in vitamin D signaling need to be understood. These are the different types of molecular interactions with the VDR, such as (i) the complex formation of VDR with genomic DNA, (ii) the interaction of VDR with its partner transcription factors, (iii) the binding of 1,25(OH)2D3 or its synthetic analogs within the ligand-binding pocket of the VDR, and (iv) the resulting conformational change on the surface of the VDR leading to a change of the protein-protein interaction profile of the receptor with other proteins. This review will present the latest genome-wide insight into vitamin D signaling, and will discuss its therapeutic implications.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Neuhof, Torsten; Seibold, Michael; Thewes, Sascha
This is First report on the antifungal effects of the new glycolipopeptide hassallidin A. Due to related molecular structure moieties between hassallidin A and the established antifungal drug caspofungin we assumed parallels in the effects on cell viability. Therefore we compared hassallidin A with caspofungin by antifungal susceptibility testing and by analysing the genome-wide transcriptional profile of Candida albicans. Furthermore, we examined modifications in ultracellular structure due to hassallidin A treatment by electron microscopy. Hassallidin A was found to be fungicidal against all tested Candida species and Cryptococcus neoformans isolates. MICs ranged from 4 to 8 {mu}g/ml, independently from themore » species. Electron microscopy revealed noticeable ultrastructural changes in C. albicans cells exposed to hassallidin A. Comparing the transcriptional profile of C. albicans cells treated with hassallidin A to that of cells exposed to caspofungin, only 20 genes were found to be similarly up- or down-regulated in both assays, while 227 genes were up- or down-regulated induced by hassallidin A specifically. Genes up-regulated in cells exposed to hassallidin A included metabolic and mitotic genes, while genes involved in DNA repair, vesicle docking, and membrane fusion were down-regulated. In summary, our data suggest that, although hassallidin A and caspofungin have similar structures, however, the effects on susceptibility and transcriptional response to yeasts seem to be different.« less
Castanera, Raúl; Pérez, Gúmer; Omarini, Alejandra; Alfaro, Manuel; Pisabarro, Antonio G.; Faraco, Vincenza; Amore, Antonella
2012-01-01
The genome of the white rot basidiomycete Pleurotus ostreatus includes 12 phenol oxidase (laccase) genes. In this study, we examined their expression profiles in different fungal strains under different culture conditions (submerged and solid cultures) and in the presence of a wheat straw extract, which was used as an inducer of the laccase gene family. We used a reverse transcription-quantitative PCR (RT-qPCR)-based approach and focused on determining the reaction parameters (in particular, the reference gene set for the normalization and reaction efficiency determinations) used to achieve an accurate estimation of the relative gene expression values. The results suggested that (i) laccase gene transcription is upregulated in the induced submerged fermentation (iSmF) cultures but downregulated in the solid fermentation (SSF) cultures, (ii) the Lacc2 and Lacc10 genes are the main sources of laccase activity in the iSmF cultures upon induction with water-soluble wheat straw extracts, and (iii) an additional, as-yet-uncharacterized activity (Unk1) is specifically induced in SSF cultures that complements the activity of Lacc2 and Lacc10. Moreover, both the enzymatic laccase activities and the Lacc gene family transcription profiles greatly differ between closely related strains. These differences can be targeted for biotechnological breeding programs for enzyme production in submerged fermentation reactors. PMID:22467498
2014-01-01
Background Apple tree breeding is slow and difficult due to long generation times, self-incompatibility, and complex genetics. The identification of molecular markers linked to traits of interest is a way to expedite the breeding process. In the present study, we aimed to identify genes whose steady-state transcript abundance was associated with inheritance of specific traits segregating in an apple (Malus × domestica) rootstock F1 breeding population, including resistance to powdery mildew (Podosphaera leucotricha) disease and woolly apple aphid (Eriosoma lanigerum). Results Transcription profiling was performed for 48 individual F1 apple trees from a cross of two highly heterozygous parents, using RNA isolated from healthy, actively-growing shoot tips and a custom apple DNA oligonucleotide microarray representing 26,000 unique transcripts. Genome-wide expression profiles were not clear indicators of powdery mildew or woolly apple aphid resistance phenotype. However, standard differential gene expression analysis between phenotypic groups of trees revealed relatively small sets of genes with trait-associated expression levels. For example, thirty genes were identified that were differentially expressed between trees resistant and susceptible to powdery mildew. Interestingly, the genes encoding twenty-four of these transcripts were physically clustered on chromosome 12. Similarly, seven genes were identified that were differentially expressed between trees resistant and susceptible to woolly apple aphid, and the genes encoding five of these transcripts were also clustered, this time on chromosome 17. In each case, the gene clusters were in the vicinity of previously identified major quantitative trait loci for the corresponding trait. Similar results were obtained for a series of molecular traits. Several of the differentially expressed genes were used to develop DNA polymorphism markers linked to powdery mildew disease and woolly apple aphid resistance. Conclusions Gene expression profiling and trait-associated transcript analysis using an apple F1 population readily identified genes physically linked to powdery mildew disease resistance and woolly apple aphid resistance loci. This result was especially useful in apple, where extreme levels of heterozygosity make the development of reliable DNA markers quite difficult. The results suggest that this approach could prove effective in crops with complicated genetics, or for which few genomic information resources are available. PMID:24708064
Transcriptional Profiling Identifies Functional Interactions of TGFβ and PPARβ/δ Signaling
Kaddatz, Kerstin; Adhikary, Till; Finkernagel, Florian; Meissner, Wolfgang; Müller-Brüsselbach, Sabine; Müller, Rolf
2010-01-01
Peroxisome proliferator-activated receptors (PPARs) not only play a key role in regulating metabolic pathways but also modulate inflammatory processes, pointing to a functional interaction between PPAR and cytokine signaling pathways. In this study, we show by genome-wide transcriptional profiling that PPARβ/δ and transforming growth factor-β (TGFβ) pathways functionally interact in human myofibroblasts and that a subset of these genes is cooperatively activated by TGFβ and PPARβ/δ. Using the angiopoietin-like 4 (ANGPTL4) gene as a model, we demonstrate that two enhancer regions cooperate to mediate the observed synergistic response. A TGFβ-responsive enhancer located ∼8 kb upstream of the transcriptional start site is regulated by a mechanism involving SMAD3, ETS1, RUNX, and AP-1 transcription factors that interact with multiple contiguous binding sites. A second enhancer (PPAR-E) consisting of three juxtaposed PPAR response elements is located in the third intron ∼3.5 kb downstream of the transcriptional start site. The PPAR-E is strongly activated by all three PPAR subtypes, with a novel type of PPAR response element motif playing a central role. Although the PPAR-E is not regulated by TGFβ, it interacts with SMAD3, ETS1, RUNX2, and AP-1 in vivo, providing a possible mechanistic explanation for the observed synergism. PMID:20595396
Lorz, Corina; García-Escudero, Ramón; Segrelles, Carmen; Garín, Marina I.; Ariza, José M.; Santos, Mirentxu; Ruiz, Sergio; Lara, María F.; Martínez-Cruz, Ana B.; Costa, Clotilde; Buitrago-Pérez, Águeda; Saiz-Ladera, Cristina; Dueñas, Marta
2010-01-01
Continuous cell renewal in mouse epidermis is at the expense of a pool of pluripotent cells that lie in a well defined niche in the hair follicle known as the bulge. To identify mechanisms controlling hair follicle stem cell homeostasis, we developed a strategy to isolate adult bulge stem cells in mice and to define their transcriptional profile. We observed that a large number of transcripts are underexpressed in hair follicle stem cells when compared to non-stem cells. Importantly, the majority of these downregulated genes are involved in cell cycle. Using bioinformatics tools, we identified the E2F transcription factor family as a potential element involved in the regulation of these transcripts. To determine their functional role, we used engineered mice lacking Rb gene in epidermis, which showed increased expression of most E2F family members and increased E2F transcriptional activity. Experiments designed to analyze epidermal stem cell functionality (i.e.: hair regrowth and wound healing) imply a role of the Rb-E2F axis in the control of stem cell quiescence in epidermis. Electronic supplementary material The online version of this article (doi:10.1007/s12015-010-9139-0) contains supplementary material, which is available to authorized users. PMID:20376578
Integrative Analysis of Sex-Specific microRNA Networks Following Stress in Mouse Nucleus Accumbens.
Pfau, Madeline L; Purushothaman, Immanuel; Feng, Jian; Golden, Sam A; Aleyasin, Hossein; Lorsch, Zachary S; Cates, Hannah M; Flanigan, Meghan E; Menard, Caroline; Heshmati, Mitra; Wang, Zichen; Ma'ayan, Avi; Shen, Li; Hodes, Georgia E; Russo, Scott J
2016-01-01
Adult women are twice as likely as men to suffer from affective and anxiety disorders, although the mechanisms underlying heightened female stress susceptibility are incompletely understood. Recent findings in mouse Nucleus Accumbens (NAc) suggest a role for DNA methylation-driven sex differences in genome-wide transcriptional profiles. However, the role of another epigenetic process-microRNA (miR) regulation-has yet to be explored. We exposed male and female mice to Subchronic Variable Stress (SCVS), a stress paradigm that produces depression-like behavior in female, but not male, mice, and performed next generation mRNA and miR sequencing on NAc tissue. We applied a combination of differential expression, miR-mRNA network and functional enrichment analyses to characterize the transcriptional and post-transcriptional landscape of sex differences in NAc stress response. We find that male and female mice exhibit largely non-overlapping miR and mRNA profiles following SCVS. The two sexes also show enrichment of different molecular pathways and functions. Collectively, our results suggest that males and females mount fundamentally different transcriptional and post-transcriptional responses to SCVS and engage sex-specific molecular processes following stress. These findings have implications for the pathophysiology and treatment of stress-related disorders in women.
Tsimberidou, Apostolia-Maria
In the last decade, breakthroughs in technology have improved our understanding of genomic, transcriptional, proteomic, epigenetic aberrations and immune mechanisms in carcinogenesis. Genomics and model systems have enabled the validation of novel therapeutic strategies. Based on these developments, in 2007, we initiated the IMPACT (Initiative for Molecular Profiling and Advanced Cancer Therapy) study, the first personalized medicine program for patients with advanced cancer at The University of Texas MD Anderson Cancer Center. We demonstrated that in patients referred for Phase I clinical trials, the use of tumor molecular profiling and treatment with matched targeted therapy was associated with encouraging rates of response, progression-free survival and overall survival compared to non-matched therapy. We are currently conducting IMPACT2, a randomized study evaluating molecular profiling and targeted agents in patients with metastatic cancer. Optimization of innovative biomarker-driven clinical trials that include targeted therapy and/or immunotherapeutic approaches for carefully selected patients will accelerate the development of novel drugs and the implementation of precision medicine. Copyright © 2017 Elsevier Inc. All rights reserved.
Burger, N F V; Venter, E; Botha, A-M
2017-04-01
The intimate relationship between an aphid and its host is mediated by the composition of the secreted saliva. In the present study, aphid heads were sampled and transcript profiling conducted after aphids were fed on their preference host and transferred to a variety of preference and nonpreference hosts. It was found that the virulent Diuraphis noxia (Kurdjumov) (Hemiptera: Aphididae) biotype SAM was able to selectively up-regulate more transcripts when confronted with feeding on a variety of hosts, than was the case with the less virulent D. noxia biotype SA1, suggesting increased genomic regulation when coping with a stressful environment. Collectively, the observed transcriptomic changes are supported by previous findings that host changes induce significant changes in the proteome of phytophagous hemipterans, unlike in many other entomophagous generalist species. The current data suggest that highly specialized hemipterans may be able to counter plant defenses with inducible salivary transcripts with resulting protein biosynthesis, as demonstrated here. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Transcriptional profiling: a potential anti-doping strategy.
Rupert, J L
2009-12-01
Evolving challenges require evolving responses. The use of illicit performance enhancing drugs by athletes permeates the reality and the perception of elite sports. New drugs with ergogenic or masking potential are quickly adopted, driven by a desire to win and the necessity of avoiding detection. To counter this trend, anti-doping authorities are continually refining existing assays and developing new testing strategies. In the post-genome era, genetic- and molecular-based tests are being evaluated as potential approaches to detect new and sophisticated forms of doping. Transcriptome analysis, in which a tissue's complement of mRNA transcripts is characterized, is one such method. The quantity and composition of a tissue's transcriptome is highly reflective of milieu and metabolic activity. There is much interest in transcriptional profiling in medical diagnostics and, as transcriptional information can be obtained from a variety of easily accessed tissues, similar approaches could be used in doping control. This article briefly reviews current understanding of the transcriptome, common methods of global analysis of gene expression and non-invasive sample sources. While the focus of this article is on anti-doping, the principles and methodology described could be applied to any research in which non-invasive, yet biologically informative sampling is desired.
Defining pancreatic endocrine precursors and their descendants.
White, Peter; May, Catherine Lee; Lamounier, Rodrigo N; Brestelli, John E; Kaestner, Klaus H
2008-03-01
The global incidence of diabetes continues to increase. Cell replacement therapy and islet transplantation offer hope, especially for severely affected patients. Efforts to differentiate insulin-producing beta-cells from progenitor or stem cells require knowledge of the transcriptional programs that regulate the development of the endocrine pancreas. Differentiation toward the endocrine lineage is dependent on the transcription factor Neurogenin 3 (Neurog3, Ngn3). We utilize a Neurog3-enhanced green fluorescent protein knock-in mouse model to isolate endocrine progenitor cells from embryonic pancreata (embryonic day [E]13.5 through E17.5). Using advanced genomic approaches, we generate a comprehensive gene expression profile of these progenitors and their immediate descendants. A total of 1,029 genes were identified as being temporally regulated in the endocrine lineage during fetal development, 237 of which are transcriptional regulators. Through pathway analysis, we have modeled regulatory networks involving these proteins that highlight the complex transcriptional hierarchy governing endocrine differentiation. We have been able to accurately capture the gene expression profile of the pancreatic endocrine progenitors and their descendants. The list of temporally regulated genes identified in fetal endocrine precursors and their immediate descendants provides a novel and important resource for developmental biologists and diabetes researchers alike.
Kuang, Zheng; Ji, Zhicheng
2018-01-01
Abstract Biological processes are usually associated with genome-wide remodeling of transcription driven by transcription factors (TFs). Identifying key TFs and their spatiotemporal binding patterns are indispensable to understanding how dynamic processes are programmed. However, most methods are designed to predict TF binding sites only. We present a computational method, dynamic motif occupancy analysis (DynaMO), to infer important TFs and their spatiotemporal binding activities in dynamic biological processes using chromatin profiling data from multiple biological conditions such as time-course histone modification ChIP-seq data. In the first step, DynaMO predicts TF binding sites with a random forests approach. Next and uniquely, DynaMO infers dynamic TF binding activities at predicted binding sites using their local chromatin profiles from multiple biological conditions. Another landmark of DynaMO is to identify key TFs in a dynamic process using a clustering and enrichment analysis of dynamic TF binding patterns. Application of DynaMO to the yeast ultradian cycle, mouse circadian clock and human neural differentiation exhibits its accuracy and versatility. We anticipate DynaMO will be generally useful for elucidating transcriptional programs in dynamic processes. PMID:29325176
2013-01-01
Background Chrysanthemum is one of the most important ornamental crops in the world and drought stress seriously limits its production and distribution. In order to generate a functional genomics resource and obtain a deeper understanding of the molecular mechanisms regarding chrysanthemum responses to dehydration stress, we performed large-scale transcriptome sequencing of chrysanthemum plants under dehydration stress using the Illumina sequencing technology. Results Two cDNA libraries constructed from mRNAs of control and dehydration-treated seedlings were sequenced by Illumina technology. A total of more than 100 million reads were generated and de novo assembled into 98,180 unique transcripts which were further extensively annotated by comparing their sequencing to different protein databases. Biochemical pathways were predicted from these transcript sequences. Furthermore, we performed gene expression profiling analysis upon dehydration treatment in chrysanthemum and identified 8,558 dehydration-responsive unique transcripts, including 307 transcription factors and 229 protein kinases and many well-known stress responsive genes. Gene ontology (GO) term enrichment and biochemical pathway analyses showed that dehydration stress caused changes in hormone response, secondary and amino acid metabolism, and light and photoperiod response. These findings suggest that drought tolerance of chrysanthemum plants may be related to the regulation of hormone biosynthesis and signaling, reduction of oxidative damage, stabilization of cell proteins and structures, and maintenance of energy and carbon supply. Conclusions Our transcriptome sequences can provide a valuable resource for chrysanthemum breeding and research and novel insights into chrysanthemum responses to dehydration stress and offer candidate genes or markers that can be used to guide future studies attempting to breed drought tolerant chrysanthemum cultivars. PMID:24074255
Sex-specific hippocampal 5-hydroxymethylcytosine is disrupted in response to acute stress.
Papale, Ligia A; Li, Sisi; Madrid, Andy; Zhang, Qi; Chen, Li; Chopra, Pankaj; Jin, Peng; Keleş, Sündüz; Alisch, Reid S
2016-12-01
Environmental stress is among the most important contributors to increased susceptibility to develop psychiatric disorders. While it is well known that acute environmental stress alters gene expression, the molecular mechanisms underlying these changes remain largely unknown. 5-hydroxymethylcytosine (5hmC) is a novel environmentally sensitive epigenetic modification that is highly enriched in neurons and is associated with active neuronal transcription. Recently, we reported a genome-wide disruption of hippocampal 5hmC in male mice following acute stress that was correlated to altered transcript levels of genes in known stress related pathways. Since sex-specific endocrine mechanisms respond to environmental stimulus by altering the neuronal epigenome, we examined the genome-wide profile of hippocampal 5hmC in female mice following exposure to acute stress and identified 363 differentially hydroxymethylated regions (DhMRs) linked to known (e.g., Nr3c1 and Ntrk2) and potentially novel genes associated with stress response and psychiatric disorders. Integration of hippocampal expression data from the same female mice found stress-related hydroxymethylation correlated to altered transcript levels. Finally, characterization of stress-induced sex-specific 5hmC profiles in the hippocampus revealed 778 sex-specific acute stress-induced DhMRs some of which were correlated to altered transcript levels that produce sex-specific isoforms in response to stress. Together, the alterations in 5hmC presented here provide a possible molecular mechanism for the adaptive sex-specific response to stress that may augment the design of novel therapeutic agents that will have optimal effectiveness in each sex. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Lim, Sun-Hyung; Kim, Jae Kwang; Ha, Sun-Hwa
2015-01-01
Light quality is an important signaling component upon which plants orchestrate various morphological processes, including seed germination and seedling photomorphogenesis. However, it is still unclear how plants, especially food crops, sense various light qualities and modulate their cellular growth and other developmental processes. Therefore, in this work, we initially profiled the transcripts of a model crop, rice (Oryza sativa), under four different light treatments (blue, green, red, and white) as well as in the dark. Concurrently, we reconstructed a fully compartmentalized genome-scale metabolic model of rice cells, iOS2164, containing 2,164 unique genes, 2,283 reactions, and 1,999 metabolites. We then combined the model with transcriptome profiles to elucidate the light-specific transcriptional signatures of rice metabolism. Clearly, light signals mediated rice gene expressions, differentially regulating numerous metabolic pathways: photosynthesis and secondary metabolism were up-regulated in blue light, whereas reserve carbohydrates degradation was pronounced in the dark. The topological analysis of gene expression data with the rice genome-scale metabolic model further uncovered that phytohormones, such as abscisate, ethylene, gibberellin, and jasmonate, are the key biomarkers of light-mediated regulation, and subsequent analysis of the associated genes’ promoter regions identified several light-specific transcription factors. Finally, the transcriptional control of rice metabolism by red and blue light signals was assessed by integrating the transcriptome and metabolome data with constraint-based modeling. The biological insights gained from this integrative systems biology approach offer several potential applications, such as improving the agronomic traits of food crops and designing light-specific synthetic gene circuits in microbial and mammalian systems. PMID:26453433
Chakraborty, Sandeep; Britton, Monica; Martínez-García, P J; Dandekar, Abhaya M
2016-03-01
Deep RNA-Seq profiling, a revolutionary method used for quantifying transcriptional levels, often includes non-specific transcripts from other co-existing organisms in spite of stringent protocols. Using the recently published walnut genome sequence as a filter, we present a broad analysis of the RNA-Seq derived transcriptome profiles obtained from twenty different tissues to extract the biodiversity and possible plant-microbe interactions in the walnut ecosystem in California. Since the residual nature of the transcripts being analyzed does not provide sufficient information to identify the exact strain, inferences made are constrained to the genus level. The presence of the pathogenic oomycete Phytophthora was detected in the root through the presence of a glyceraldehyde-3-phosphate dehydrogenase. Cryptococcus, the causal agent of cryptococcosis, was found in the catkins and vegetative buds, corroborating previous work indicating that the plant surface supported the sexual cycle of this human pathogen. The RNA-Seq profile revealed several species of the endophytic nitrogen fixing Actinobacteria. Another bacterial species implicated in aerobic biodegradation of methyl tert-butyl ether (Methylibium petroleiphilum) is also found in the root. RNA encoding proteins from the pea aphid were found in the leaves and vegetative buds, while a serine protease from mosquito with significant homology to a female reproductive tract protease from Drosophila mojavensis in the vegetative bud suggests egg-laying activities. The comprehensive analysis of RNA-seq data present also unraveled detailed, tissue-specific information of ~400 transcripts encoded by the largest family of resistance (R) genes (NBS-LRR), which possibly rationalizes the resistance of the specific walnut plant to the pathogens detected. Thus, we elucidate the biodiversity and possible plant-microbe interactions in several walnut (Juglans regia) tissues in California using deep RNA-Seq profiling.
MicroRNA expression profiling of human breast cancer identifies new markers of tumor subtype.
Blenkiron, Cherie; Goldstein, Leonard D; Thorne, Natalie P; Spiteri, Inmaculada; Chin, Suet-Feung; Dunning, Mark J; Barbosa-Morais, Nuno L; Teschendorff, Andrew E; Green, Andrew R; Ellis, Ian O; Tavaré, Simon; Caldas, Carlos; Miska, Eric A
2007-01-01
MicroRNAs (miRNAs), a class of short non-coding RNAs found in many plants and animals, often act post-transcriptionally to inhibit gene expression. Here we report the analysis of miRNA expression in 93 primary human breast tumors, using a bead-based flow cytometric miRNA expression profiling method. Of 309 human miRNAs assayed, we identify 133 miRNAs expressed in human breast and breast tumors. We used mRNA expression profiling to classify the breast tumors as luminal A, luminal B, basal-like, HER2+ and normal-like. A number of miRNAs are differentially expressed between these molecular tumor subtypes and individual miRNAs are associated with clinicopathological factors. Furthermore, we find that miRNAs could classify basal versus luminal tumor subtypes in an independent data set. In some cases, changes in miRNA expression correlate with genomic loss or gain; in others, changes in miRNA expression are likely due to changes in primary transcription and or miRNA biogenesis. Finally, the expression of DICER1 and AGO2 is correlated with tumor subtype and may explain some of the changes in miRNA expression observed. This study represents the first integrated analysis of miRNA expression, mRNA expression and genomic changes in human breast cancer and may serve as a basis for functional studies of the role of miRNAs in the etiology of breast cancer. Furthermore, we demonstrate that bead-based flow cytometric miRNA expression profiling might be a suitable platform to classify breast cancer into prognostic molecular subtypes.
O'Rourke, Jamie A; Fu, Fengli; Bucciarelli, Bruna; Yang, S Sam; Samac, Deborah A; Lamb, JoAnn F S; Monteros, Maria J; Graham, Michelle A; Gronwald, John W; Krom, Nick; Li, Jun; Dai, Xinbin; Zhao, Patrick X; Vance, Carroll P
2015-07-07
Alfalfa (Medicago sativa L.) is the primary forage legume crop species in the United States and plays essential economic and ecological roles in agricultural systems across the country. Modern alfalfa is the result of hybridization between tetraploid M. sativa ssp. sativa and M. sativa ssp. falcata. Due to its large and complex genome, there are few genomic resources available for alfalfa improvement. A de novo transcriptome assembly from two alfalfa subspecies, M. sativa ssp. sativa (B47) and M. sativa ssp. falcata (F56) was developed using Illumina RNA-seq technology. Transcripts from roots, nitrogen-fixing root nodules, leaves, flowers, elongating stem internodes, and post-elongation stem internodes were assembled into the Medicago sativa Gene Index 1.2 (MSGI 1.2) representing 112,626 unique transcript sequences. Nodule-specific and transcripts involved in cell wall biosynthesis were identified. Statistical analyses identified 20,447 transcripts differentially expressed between the two subspecies. Pair-wise comparisons of each tissue combination identified 58,932 sequences differentially expressed in B47 and 69,143 sequences differentially expressed in F56. Comparing transcript abundance in floral tissues of B47 and F56 identified expression differences in sequences involved in anthocyanin and carotenoid synthesis, which determine flower pigmentation. Single nucleotide polymorphisms (SNPs) unique to each M. sativa subspecies (110,241) were identified. The Medicago sativa Gene Index 1.2 increases the expressed sequence data available for alfalfa by ninefold and can be expanded as additional experiments are performed. The MSGI 1.2 transcriptome sequences, annotations, expression profiles, and SNPs were assembled into the Alfalfa Gene Index and Expression Database (AGED) at http://plantgrn.noble.org/AGED/ , a publicly available genomic resource for alfalfa improvement and legume research.
Ultraconserved regions encoding ncRNAs are altered in human leukemias and carcinomas.
Calin, George A; Liu, Chang-gong; Ferracin, Manuela; Hyslop, Terry; Spizzo, Riccardo; Sevignani, Cinzia; Fabbri, Muller; Cimmino, Amelia; Lee, Eun Joo; Wojcik, Sylwia E; Shimizu, Masayoshi; Tili, Esmerina; Rossi, Simona; Taccioli, Cristian; Pichiorri, Flavia; Liu, Xiuping; Zupo, Simona; Herlea, Vlad; Gramantieri, Laura; Lanza, Giovanni; Alder, Hansjuerg; Rassenti, Laura; Volinia, Stefano; Schmittgen, Thomas D; Kipps, Thomas J; Negrini, Massimo; Croce, Carlo M
2007-09-01
Noncoding RNA (ncRNA) transcripts are thought to be involved in human tumorigenesis. We report that a large fraction of genomic ultraconserved regions (UCRs) encode a particular set of ncRNAs whose expression is altered in human cancers. Genome-wide profiling revealed that UCRs have distinct signatures in human leukemias and carcinomas. UCRs are frequently located at fragile sites and genomic regions involved in cancers. We identified certain UCRs whose expression may be regulated by microRNAs abnormally expressed in human chronic lymphocytic leukemia, and we proved that the inhibition of an overexpressed UCR induces apoptosis in colon cancer cells. Our findings argue that ncRNAs and interaction between noncoding genes are involved in tumorigenesis to a greater extent than previously thought.
Care, Matthew A.; Cocco, Mario; Laye, Jon P.; Barnes, Nicholas; Huang, Yuanxue; Wang, Ming; Barrans, Sharon; Du, Ming; Jack, Andrew; Westhead, David R.; Doody, Gina M.; Tooze, Reuben M.
2014-01-01
Interferon regulatory factor 4 (IRF4) is central to the transcriptional network of activated B-cell-like diffuse large B-cell lymphoma (ABC-DLBCL), an aggressive lymphoma subgroup defined by gene expression profiling. Since cofactor association modifies transcriptional regulatory input by IRF4, we assessed genome occupancy by IRF4 and endogenous cofactors in ABC-DLBCL cell lines. IRF4 partners with SPIB, PU.1 and BATF genome-wide, but SPIB provides the dominant IRF4 partner in this context. Upon SPIB knockdown IRF4 occupancy is depleted and neither PU.1 nor BATF acutely compensates. Integration with ENCODE data from lymphoblastoid cell line GM12878, demonstrates that IRF4 adopts either SPIB- or BATF-centric genome-wide distributions in related states of post-germinal centre B-cell transformation. In primary DLBCL high-SPIB and low-BATF or the reciprocal low-SPIB and high-BATF mRNA expression links to differential gene expression profiles across nine data sets, identifying distinct associations with SPIB occupancy, signatures of B-cell differentiation stage and potential pathogenetic mechanisms. In a population-based patient cohort, SPIBhigh/BATFlow-ABC-DLBCL is enriched for mutation of MYD88, and SPIBhigh/BATFlow-ABC-DLBCL with MYD88-L265P mutation identifies a small subgroup of patients among this otherwise aggressive disease subgroup with distinct favourable outcome. We conclude that differential expression of IRF4 cofactors SPIB and BATF identifies biologically and clinically significant heterogeneity among ABC-DLBCL. PMID:24875472
Pócsi, István; Miskei, Márton; Karányi, Zsolt; Emri, Tamás; Ayoubi, Patricia; Pusztahelyi, Tünde; Balla, György; Prade, Rolf A
2005-01-01
Background In addition to their cytotoxic nature, reactive oxygen species (ROS) are also signal molecules in diverse cellular processes in eukaryotic organisms. Linking genome-wide transcriptional changes to cellular physiology in oxidative stress-exposed Aspergillus nidulans cultures provides the opportunity to estimate the sizes of peroxide (O22-), superoxide (O2•-) and glutathione/glutathione disulphide (GSH/GSSG) redox imbalance responses. Results Genome-wide transcriptional changes triggered by diamide, H2O2 and menadione in A. nidulans vegetative tissues were recorded using DNA microarrays containing 3533 unique PCR-amplified probes. Evaluation of LOESS-normalized data indicated that 2499 gene probes were affected by at least one stress-inducing agent. The stress induced by diamide and H2O2 were pulse-like, with recovery after 1 h exposure time while no recovery was observed with menadione. The distribution of stress-responsive gene probes among major physiological functional categories was approximately the same for each agent. The gene group sizes solely responsive to changes in intracellular O22-, O2•- concentrations or to GSH/GSSG redox imbalance were estimated at 7.7, 32.6 and 13.0 %, respectively. Gene groups responsive to diamide, H2O2 and menadione treatments and gene groups influenced by GSH/GSSG, O22- and O2•- were only partly overlapping with distinct enrichment profiles within functional categories. Changes in the GSH/GSSG redox state influenced expression of genes coding for PBS2 like MAPK kinase homologue, PSK2 kinase homologue, AtfA transcription factor, and many elements of ubiquitin tagging, cell division cycle regulators, translation machinery proteins, defense and stress proteins, transport proteins as well as many enzymes of the primary and secondary metabolisms. Meanwhile, a separate set of genes encoding transport proteins, CpcA and JlbA amino acid starvation-responsive transcription factors, and some elements of sexual development and sporulation was ROS responsive. Conclusion The existence of separate O22-, O2•- and GSH/GSSG responsive gene groups in a eukaryotic genome has been demonstrated. Oxidant-triggered, genome-wide transcriptional changes should be analyzed considering changes in oxidative stress-responsive physiological conditions and not correlating them directly to the chemistry and concentrations of the oxidative stress-inducing agent. PMID:16368011
Expression profiling of Yersinia pestis during mouse pulmonary infection.
Lawson, Jonathan N; Lyons, C Rick; Johnston, Stephen Albert
2006-11-01
Yersinia pestis, the causative agent of plague, can be transmitted by infected flea bite or inhaled aerosol. Both routes of infection have a high mortality rate, and pneumonic infections of Y. pestis represent a significant concern as a tool of bioterrorism. Understanding the transcriptional program of this pathogen during pulmonary infection should be valuable in understanding plague pathogenesis, as well as potentially offering insights into new vaccines and therapeutics. Toward this goal we developed a long oligonucleotide microarray to the plague bacillus and evaluated the expression profiles of Y. pestis in vitro and in the mouse pulmonary infection model in vivo. The in vitro analysis compared expression patterns at 27 versus 37 degrees C, as a surrogate of the transition from the flea to the mammalian host. The in vivo analysis used intranasal challenge to the mouse lung. By amplifying the Y. pestis RNA from individual mouse lungs we were able to map the transcriptional profile of plague at postinfection days 1 to 3. Our data present a very different transcriptional profile between in vivo and in vitro expression, suggesting Y. pestis responds to a variety of host signals during infection. Of note was the number of genes found in genomic regions with altered %GC content that are upregulated within the mouse lung environment. These data suggest these regions may provide particularly promising targets for both vaccines and therapeutics.
Pal, Arnab; Srivastava, Tapasya; Sharma, Manish K; Mehndiratta, Mohit; Das, Prerna; Sinha, Subrata; Chattopadhyay, Parthaprasad
2010-11-01
Hypoxia is an integral part of tumorigenesis and contributes extensively to the neoplastic phenotype including drug resistance and genomic instability. It has also been reported that hypoxia results in global demethylation. Because a majority of the cytosine-phosphate-guanine (CpG) islands are found within the repeat elements of DNA, and are usually methylated under normoxic conditions, we suggested that retrotransposable Alu or short interspersed nuclear elements (SINEs) which show altered methylation and associated changes of gene expression during hypoxia, could be associated with genomic instability. U87MG glioblastoma cells were cultured in 0.1% O₂ for 6 weeks and compared with cells cultured in 21% O₂ for the same duration. Real-time PCR analysis showed a significant increase in SINE and reverse transcriptase coding long interspersed nuclear element (LINE) transcripts during hypoxia. Sequencing of bisulphite treated DNA as well as the Combined Bisulfite Restriction Analysis (COBRA) assay showed that the SINE loci studied underwent significant hypomethylation though there was patchy hypermethylation at a few sites. The inter-alu PCR profile of DNA from cells cultured under 6-week hypoxia, its 4-week revert back to normoxia and 6-week normoxia showed several changes in the band pattern indicating increased alu mediated genomic alteration. Our results show that aberrant methylation leading to increased transcription of SINE and reverse transcriptase associated LINE elements could lead to increased genomic instability in hypoxia. This might be a cause of genetic heterogeneity in tumours especially in variegated hypoxic environment and lead to a development of foci of more aggressive tumour cells. © 2009 The Authors Journal compilation © 2010 Foundation for Cellular and Molecular Medicine/Blackwell Publishing Ltd.
Genome-wide analysis of WRKY gene family in Cucumis sativus
2011-01-01
Background WRKY proteins are a large family of transcriptional regulators in higher plant. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. Results We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stresses (cold, drought or salinity). The expression profile of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Conclusions Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes. PMID:21955985
Genome-wide analysis of WRKY gene family in Cucumis sativus.
Ling, Jian; Jiang, Weijie; Zhang, Ying; Yu, Hongjun; Mao, Zhenchuan; Gu, Xingfang; Huang, Sanwen; Xie, Bingyan
2011-09-28
WRKY proteins are a large family of transcriptional regulators in higher plant. They are involved in many biological processes, such as plant development, metabolism, and responses to biotic and abiotic stresses. Prior to the present study, only one full-length cucumber WRKY protein had been reported. The recent publication of the draft genome sequence of cucumber allowed us to conduct a genome-wide search for cucumber WRKY proteins, and to compare these positively identified proteins with their homologs in model plants, such as Arabidopsis. We identified a total of 55 WRKY genes in the cucumber genome. According to structural features of their encoded proteins, the cucumber WRKY (CsWRKY) genes were classified into three groups (group 1-3). Analysis of expression profiles of CsWRKY genes indicated that 48 WRKY genes display differential expression either in their transcript abundance or in their expression patterns under normal growth conditions, and 23 WRKY genes were differentially expressed in response to at least one abiotic stresses (cold, drought or salinity). The expression profile of stress-inducible CsWRKY genes were correlated with those of their putative Arabidopsis WRKY (AtWRKY) orthologs, except for the group 3 WRKY genes. Interestingly, duplicated group 3 AtWRKY genes appear to have been under positive selection pressure during evolution. In contrast, there was no evidence of recent gene duplication or positive selection pressure among CsWRKY group 3 genes, which may have led to the expressional divergence of group 3 orthologs. Fifty-five WRKY genes were identified in cucumber and the structure of their encoded proteins, their expression, and their evolution were examined. Considering that there has been extensive expansion of group 3 WRKY genes in angiosperms, the occurrence of different evolutionary events could explain the functional divergence of these genes.
Liu, Chaoyang; Xie, Tao; Chen, Chenjie; Luan, Aiping; Long, Jianmei; Li, Chuhao; Ding, Yaqi; He, Yehua
2017-07-01
The MYB proteins comprise one of the largest families of plant transcription factors, which are involved in various plant physiological and biochemical processes. Pineapple (Ananas comosus) is one of three most important tropical fruits worldwide. The completion of pineapple genome sequencing provides a great opportunity to investigate the organization and evolutionary traits of pineapple MYB genes at the genome-wide level. In the present study, a total of 94 pineapple R2R3-MYB genes were identified and further phylogenetically classified into 26 subfamilies, as supported by the conserved gene structures and motif composition. Collinearity analysis indicated that the segmental duplication events played a crucial role in the expansion of pineapple MYB gene family. Further comparative phylogenetic analysis suggested that there have been functional divergences of MYB gene family during plant evolution. RNA-seq data from different tissues and developmental stages revealed distinct temporal and spatial expression profiles of the AcMYB genes. Further quantitative expression analysis showed the specific expression patterns of the selected putative stress-related AcMYB genes in response to distinct abiotic stress and hormonal treatments. The comprehensive expression analysis of the pineapple MYB genes, especially the tissue-preferential and stress-responsive genes, could provide valuable clues for further function characterization. In this work, we systematically identified AcMYB genes by analyzing the pineapple genome sequence using a set of bioinformatics approaches. Our findings provide a global insight into the organization, phylogeny and expression patterns of the pineapple R2R3-MYB genes, and hence contribute to the greater understanding of their biological roles in pineapple.
Chen, Josephine; Zhao, Po; Massaro, Donald; Clerch, Linda B; Almon, Richard R; DuBois, Debra C; Jusko, William J; Hoffman, Eric P
2004-01-01
Publicly accessible DNA databases (genome browsers) are rapidly accelerating post-genomic research (see http://www.genome.ucsc.edu/), with integrated genomic DNA, gene structure, EST/ splicing and cross-species ortholog data. DNA databases have relatively low dimensionality; the genome is a linear code that anchors all associated data. In contrast, RNA expression and protein databases need to be able to handle very high dimensional data, with time, tissue, cell type and genes, as interrelated variables. The high dimensionality of microarray expression profile data, and the lack of a standard experimental platform have complicated the development of web-accessible databases and analytical tools. We have designed and implemented a public resource of expression profile data containing 1024 human, mouse and rat Affymetrix GeneChip expression profiles, generated in the same laboratory, and subject to the same quality and procedural controls (Public Expression Profiling Resource; PEPR). Our Oracle-based PEPR data warehouse includes a novel time series query analysis tool (SGQT), enabling dynamic generation of graphs and spreadsheets showing the action of any transcript of interest over time. In this report, we demonstrate the utility of this tool using a 27 time point, in vivo muscle regeneration series. This data warehouse and associated analysis tools provides access to multidimensional microarray data through web-based interfaces, both for download of all types of raw data for independent analysis, and also for straightforward gene-based queries. Planned implementations of PEPR will include web-based remote entry of projects adhering to quality control and standard operating procedure (QC/SOP) criteria, and automated output of alternative probe set algorithms for each project (see http://microarray.cnmcresearch.org/pgadatatable.asp).
Chen, Josephine; Zhao, Po; Massaro, Donald; Clerch, Linda B.; Almon, Richard R.; DuBois, Debra C.; Jusko, William J.; Hoffman, Eric P.
2004-01-01
Publicly accessible DNA databases (genome browsers) are rapidly accelerating post-genomic research (see http://www.genome.ucsc.edu/), with integrated genomic DNA, gene structure, EST/ splicing and cross-species ortholog data. DNA databases have relatively low dimensionality; the genome is a linear code that anchors all associated data. In contrast, RNA expression and protein databases need to be able to handle very high dimensional data, with time, tissue, cell type and genes, as interrelated variables. The high dimensionality of microarray expression profile data, and the lack of a standard experimental platform have complicated the development of web-accessible databases and analytical tools. We have designed and implemented a public resource of expression profile data containing 1024 human, mouse and rat Affymetrix GeneChip expression profiles, generated in the same laboratory, and subject to the same quality and procedural controls (Public Expression Profiling Resource; PEPR). Our Oracle-based PEPR data warehouse includes a novel time series query analysis tool (SGQT), enabling dynamic generation of graphs and spreadsheets showing the action of any transcript of interest over time. In this report, we demonstrate the utility of this tool using a 27 time point, in vivo muscle regeneration series. This data warehouse and associated analysis tools provides access to multidimensional microarray data through web-based interfaces, both for download of all types of raw data for independent analysis, and also for straightforward gene-based queries. Planned implementations of PEPR will include web-based remote entry of projects adhering to quality control and standard operating procedure (QC/SOP) criteria, and automated output of alternative probe set algorithms for each project (see http://microarray.cnmcresearch.org/pgadatatable.asp). PMID:14681485
The Complex Transcriptional Response of Acaryochloris marina to Different Oxygen Levels.
Hernández-Prieto, Miguel A; Lin, Yuankui; Chen, Min
2017-02-09
Ancient oxygenic photosynthetic prokaryotes produced oxygen as a waste product, but existed for a long time under an oxygen-free (anoxic) atmosphere, before an oxic atmosphere emerged. The change in oxygen levels in the atmosphere influenced the chemistry and structure of many enzymes that contained prosthetic groups that were inactivated by oxygen. In the genome of Acaryochloris marina , multiple gene copies exist for proteins that are normally encoded by a single gene copy in other cyanobacteria. Using high throughput RNA sequencing to profile transcriptome responses from cells grown under microoxic and hyperoxic conditions, we detected 8446 transcripts out of the 8462 annotated genes in the Cyanobase database. Two-thirds of the 50 most abundant transcripts are key proteins in photosynthesis. Microoxic conditions negatively affected the levels of expression of genes encoding photosynthetic complexes, with the exception of some subunits. In addition to the known regulation of the multiple copies of psbA , we detected a similar transcriptional pattern for psbJ and psbU , which might play a key role in the altered components of photosystem II. Furthermore, regulation of genes encoding proteins important for reactive oxygen species-scavenging is discussed at genome level, including, for the first time, specific small RNAs having possible regulatory roles under varying oxygen levels. Copyright © 2017 Hernandez-Prieto et al.
The Complex Transcriptional Response of Acaryochloris marina to Different Oxygen Levels
Hernández-Prieto, Miguel A.; Lin, Yuankui; Chen, Min
2016-01-01
Ancient oxygenic photosynthetic prokaryotes produced oxygen as a waste product, but existed for a long time under an oxygen-free (anoxic) atmosphere, before an oxic atmosphere emerged. The change in oxygen levels in the atmosphere influenced the chemistry and structure of many enzymes that contained prosthetic groups that were inactivated by oxygen. In the genome of Acaryochloris marina, multiple gene copies exist for proteins that are normally encoded by a single gene copy in other cyanobacteria. Using high throughput RNA sequencing to profile transcriptome responses from cells grown under microoxic and hyperoxic conditions, we detected 8446 transcripts out of the 8462 annotated genes in the Cyanobase database. Two-thirds of the 50 most abundant transcripts are key proteins in photosynthesis. Microoxic conditions negatively affected the levels of expression of genes encoding photosynthetic complexes, with the exception of some subunits. In addition to the known regulation of the multiple copies of psbA, we detected a similar transcriptional pattern for psbJ and psbU, which might play a key role in the altered components of photosystem II. Furthermore, regulation of genes encoding proteins important for reactive oxygen species-scavenging is discussed at genome level, including, for the first time, specific small RNAs having possible regulatory roles under varying oxygen levels. PMID:27974439
The Landscape of Somatic Chromosomal Copy Number Aberrations in GEM Models of Prostate Carcinoma
Bianchi-Frias, Daniella; Hernandez, Susana A.; Coleman, Roger; Wu, Hong; Nelson, Peter S.
2015-01-01
Human prostate cancer (PCa) is known to harbor recurrent genomic aberrations consisting of chromosomal losses, gains, rearrangements and mutations that involve oncogenes and tumor suppressors. Genetically engineered mouse (GEM) models have been constructed to assess the causal role of these putative oncogenic events and provide molecular insight into disease pathogenesis. While GEM models generally initiate neoplasia by manipulating a single gene, expression profiles of GEM tumors typically comprise hundreds of transcript alterations. It is unclear whether these transcriptional changes represent the pleiotropic effects of single oncogenes, and/or cooperating genomic or epigenomic events. Therefore, it was determined if structural chromosomal alterations occur in GEM models of PCa and whether the changes are concordant with human carcinomas. Whole genome array-based comparative genomic hybridization (CGH) was used to identify somatic chromosomal copy number aberrations (SCNAs) in the widely used TRAMP, Hi-Myc, Pten-null and LADY GEM models. Interestingly, very few SCNAs were identified and the genomic architecture of Hi-Myc, Pten-null and LADY tumors were essentially identical to the germline. TRAMP neuroendocrine carcinomas contained SCNAs, which comprised three recurrent aberrations including a single copy loss of chromosome 19 (encoding Pten). In contrast, cell lines derived from the TRAMP, Hi-Myc, and Pten-null tumors were notable for numerous SCNAs that included copy gains of chromosome 15 (encoding Myc) and losses of chromosome 11 (encoding p53). PMID:25298407
Liu, Chaoyang; Wang, Xia; Xu, Yuantao; Deng, Xiuxin; Xu, Qiang
2014-10-01
MYB transcription factor represents one of the largest gene families in plant genomes. Sweet orange (Citrus sinensis) is one of the most important fruit crops worldwide, and recently the genome has been sequenced. This provides an opportunity to investigate the organization and evolutionary characteristics of sweet orange MYB genes from whole genome view. In the present study, we identified 100 R2R3-MYB genes in the sweet orange genome. A comprehensive analysis of this gene family was performed, including the phylogeny, gene structure, chromosomal localization and expression pattern analyses. The 100 genes were divided into 29 subfamilies based on the sequence similarity and phylogeny, and the classification was also well supported by the highly conserved exon/intron structures and motif composition. The phylogenomic comparison of MYB gene family among sweet orange and related plant species, Arabidopsis, cacao and papaya suggested the existence of functional divergence during evolution. Expression profiling indicated that sweet orange R2R3-MYB genes exhibited distinct temporal and spatial expression patterns. Our analysis suggested that the sweet orange MYB genes may play important roles in different plant biological processes, some of which may be potentially involved in citrus fruit quality. These results will be useful for future functional analysis of the MYB gene family in sweet orange.
High-Throughput Sequencing Reveals Principles of Adeno-Associated Virus Serotype 2 Integration
Janovitz, Tyler; Klein, Isaac A.; Oliveira, Thiago; Mukherjee, Piali; Nussenzweig, Michel C.; Sadelain, Michel
2013-01-01
Viral integrations are important in human biology, yet genome-wide integration profiles have not been determined for many viruses. Adeno-associated virus (AAV) infects most of the human population and is a prevalent gene therapy vector. AAV integrates into the human genome with preference for a single locus, termed AAVS1. However, the genome-wide integration of AAV has not been defined, and the principles underlying this recombination remain unclear. Using a novel high-throughput approach, integrant capture sequencing, nearly 12 million AAV junctions were recovered from a human cell line, providing five orders of magnitude more data than were previously available. Forty-five percent of integrations occurred near AAVS1, and several thousand novel integration hotspots were identified computationally. Most of these occurred in genes, with dozens of hotspots targeting known oncogenes. Viral replication protein binding sites (RBS) and transcriptional activity were major factors favoring integration. In a first for eukaryotic viruses, the data reveal a unique asymmetric integration profile with distinctive directional orientation of viral genomes. These studies provide a new understanding of AAV integration biology through the use of unbiased high-throughput data acquisition and bioinformatics. PMID:23720718
2013-01-01
Background MADS-domain transcription factors play important roles during plant development. The Arabidopsis MADS-box gene SHORT VEGETATIVE PHASE (SVP) is a key regulator of two developmental phases. It functions as a repressor of the floral transition during the vegetative phase and later it contributes to the specification of floral meristems. How these distinct activities are conferred by a single transcription factor is unclear, but interactions with other MADS domain proteins which specify binding to different genomic regions is likely one mechanism. Results To compare the genome-wide DNA binding profile of SVP during vegetative and reproductive development we performed ChIP-seq analyses. These ChIP-seq data were combined with tiling array expression analysis, induction experiments and qRT-PCR to identify biologically relevant binding sites. In addition, we compared genome-wide target genes of SVP with those published for the MADS domain transcription factors FLC and AP1, which interact with SVP during the vegetative and reproductive phases, respectively. Conclusions Our analyses resulted in the identification of pathways that are regulated by SVP including those controlling meristem development during vegetative growth and flower development whereas floral transition pathways and hormonal signaling were regulated predominantly during the vegetative phase. Thus, SVP regulates many developmental pathways, some of which are common to both of its developmental roles whereas others are specific to only one of them. PMID:23759218
Jin, Hyun Mi; Jeong, Hye Im; Kim, Kyung Hyun; Hahn, Yoonsoo; Madsen, Eugene L; Jeon, Che Ok
2016-02-18
A genome-wide transcriptional analysis of Alteromonas naphthalenivorans SN2 was performed to investigate its ecophysiological behavior in contaminated tidal flats and seawater. The experimental design mimicked these habitats that either added naphthalene or pyruvate; tidal flat-naphthalene (TF-N), tidal flat-pyruvate (TF-P), seawater-naphthalene (SW-N), and seawater-pyruvate (SW-P). The transcriptional profiles clustered by habitat (TF-N/TF-P and SW-N/SW-P), rather than carbon source, suggesting that the former may exert a greater influence on genome-wide expression in strain SN2 than the latter. Metabolic mapping of cDNA reads from strain SN2 based on KEGG pathway showed that metabolic and regulatory genes associated with energy metabolism, translation, and cell motility were highly expressed in all four test conditions, probably highlighting the copiotrophic properties of strain SN2 as an opportunistic marine r-strategist. Differential gene expression analysis revealed that strain SN2 displayed specific cellular responses to environmental variables (tidal flat, seawater, naphthalene, and pyruvate) and exhibited certain ecological fitness traits -- its notable PAH degradation capability in seasonally cold tidal flat might be reflected in elevated expression of stress response and chaperone proteins, while fast growth in nitrogen-deficient and aerobic seawater probably correlated with high expression of glutamine synthetase, enzymes utilizing nitrite/nitrate, and those involved in the removal of reactive oxygen species.
Jin, Hyun Mi; Jeong, Hye Im; Kim, Kyung Hyun; Hahn, Yoonsoo; Madsen, Eugene L.; Jeon, Che Ok
2016-01-01
A genome-wide transcriptional analysis of Alteromonas naphthalenivorans SN2 was performed to investigate its ecophysiological behavior in contaminated tidal flats and seawater. The experimental design mimicked these habitats that either added naphthalene or pyruvate; tidal flat-naphthalene (TF-N), tidal flat-pyruvate (TF-P), seawater-naphthalene (SW-N), and seawater-pyruvate (SW-P). The transcriptional profiles clustered by habitat (TF-N/TF-P and SW-N/SW-P), rather than carbon source, suggesting that the former may exert a greater influence on genome-wide expression in strain SN2 than the latter. Metabolic mapping of cDNA reads from strain SN2 based on KEGG pathway showed that metabolic and regulatory genes associated with energy metabolism, translation, and cell motility were highly expressed in all four test conditions, probably highlighting the copiotrophic properties of strain SN2 as an opportunistic marine r-strategist. Differential gene expression analysis revealed that strain SN2 displayed specific cellular responses to environmental variables (tidal flat, seawater, naphthalene, and pyruvate) and exhibited certain ecological fitness traits –– its notable PAH degradation capability in seasonally cold tidal flat might be reflected in elevated expression of stress response and chaperone proteins, while fast growth in nitrogen-deficient and aerobic seawater probably correlated with high expression of glutamine synthetase, enzymes utilizing nitrite/nitrate, and those involved in the removal of reactive oxygen species. PMID:26887987
What the Aspergillus genomes have told us.
Nierman, W C; May, G; Kim, H S; Anderson, M J; Chen, D; Denning, D W
2005-05-01
The sequencing and annotation of the genomes of the first strains of Aspergillus nidulans, Aspergillus oryzae, and Aspergillus fumigatus will be seen in retrospect as a transformational event in Aspergillus biology. With this event the entire genetic composition of A. nidulans, the sexual experimental model organism of the genus Aspergillus, A. oryzae, the food biotechnology organism which is the product of centuries of cultivation, and A. fumigatus, the most common causative agent of invasive aspergillosis is now revealed to the extent that we are at present able to understand. Each genome exhibits a large set of genes common to the three as well as a much smaller set of genes unique to each. Moreover, these sequences serve as resources providing the major tool to expanding our understanding of the biology of each. Transcription profiling of A. fumigatus at high temperatures and comparative genomic hybridization between A. fumigatus and a closely related Aspergillus species provides microarray based examples of the beginning of functional analysis of the genomes of these organisms going forward from the genome sequence.
Male reproductive development: gene expression profiling of maize anther and pollen ontogeny
Ma, Jiong; Skibbe, David S; Fernandes, John; Walbot, Virginia
2008-01-01
Background During flowering, central anther cells switch from mitosis to meiosis, ultimately forming pollen containing haploid sperm. Four rings of surrounding somatic cells differentiate to support first meiosis and later pollen dispersal. Synchronous development of many anthers per tassel and within each anther facilitates dissection of carefully staged maize anthers for transcriptome profiling. Results Global gene expression profiles of 7 stages representing 29 days of anther development are analyzed using a 44 K oligonucleotide array querying approximately 80% of maize protein-coding genes. Mature haploid pollen containing just two cell types expresses 10,000 transcripts. Anthers contain 5 major cell types and express >24,000 transcript types: each anther stage expresses approximately 10,000 constitutive and approximately 10,000 or more transcripts restricted to one or a few stages. The lowest complexity is present during meiosis. Large suites of stage-specific and co-expressed genes are identified through Gene Ontology and clustering analyses as functional classes for pre-meiotic, meiotic, and post-meiotic anther development. MADS box and zinc finger transcription factors with constitutive and stage-limited expression are identified. Conclusions We propose that the extensive gene expression of anther cells and pollen represents the key test of maize genome fitness, permitting strong selection against deleterious alleles in diploid anthers and haploid pollen. Because flowering plants show a substantial bias for male-sterile compared to female-sterile mutations, we propose that this fitness test is general. Because both somatic and germinal cells are transcriptionally quiescent during meiosis, we hypothesize that successful completion of meiosis is required to trigger maturation of anther somatic cells. PMID:19099579
Zhu, Jufen; Yu, Xinxu; Xie, Baogui; Gu, Xiaokui; Zhang, Zhenying; Li, Shaojie
2013-06-01
To gain insight into the regulatory mechanisms of oxidative stress responses in filamentous fungi, the genome-wide transcriptional response of Neurospora crassa to menadione was analysed by digital gene expression (DGE) profiling, which identified 779 upregulated genes and 576 downregulated genes. Knockout mutants affecting 130 highly-upregulated genes were tested for menadione sensitivity, which revealed that loss of the transcription factor siderophore regulation (SRE) (a transcriptional repressor for siderophore biosynthesis), catatase-3, cytochrome c peroxidase or superoxide dismutase 1 copper chaperone causes hypersensitivity to menadione. Deletion of sre dramatically increased transcription of the siderophore biosynthesis gene ono and the siderophore iron transporter gene sit during menadione stress, suggesting that SRE is required for repression of iron uptake under oxidative stress conditions. Contrary to its phenotype, the sre deletion mutant showed higher transcriptional levels of genes encoding reactive oxygen species (ROS) scavengers than wild type during menadione stress, which implies that the mutant suffers a higher level of oxidative stress than wild type. Uncontrolled iron uptake in the sre mutant might exacerbate cellular oxidative stress. This is the first report of a negative regulator of iron assimilation participating in the fungal oxidative stress response. In addition to SRE, eight other transcription factor genes were also menadione-responsive but their single gene knockout mutants showed wild-type menadione sensitivity. Two of them, named as mit-2 (menadione induced transcription factor-2) and mit-4 (menadione induced transcription factor-4), were selected for double mutant analysis. The double mutant was hypersensitive to menadione. Similarly, the double mutation of mit-2 and sre also had additive effects on menadione sensitivity, suggesting multiple transcription factors mediate oxidative stress resistance in an additive manner. Copyright © 2013 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Terrados, Gloria; Finkernagel, Florian; Stielow, Bastian; Sadic, Dennis; Neubert, Juliane; Herdt, Olga; Krause, Michael; Scharfe, Maren; Jarek, Michael; Suske, Guntram
2012-01-01
The transcription factor Sp2 is essential for early mouse development and for proliferation of mouse embryonic fibroblasts in culture. Yet its mechanisms of action and its target genes are largely unknown. In this study, we have combined RNA interference, in vitro DNA binding, chromatin immunoprecipitation sequencing and global gene-expression profiling to investigate the role of Sp2 for cellular functions, to define target sites and to identify genes regulated by Sp2. We show that Sp2 is important for cellular proliferation that it binds to GC-boxes and occupies proximal promoters of genes essential for vital cellular processes including gene expression, replication, metabolism and signalling. Moreover, we identified important key target genes and cellular pathways that are directly regulated by Sp2. Most significantly, Sp2 binds and activates numerous sequence-specific transcription factor and co-activator genes, and represses the whole battery of cholesterol synthesis genes. Our results establish Sp2 as a sequence-specific regulator of vitally important genes. PMID:22684502
Adamopoulos, Panagiotis G; Kontos, Christos K; Scorilas, Andreas
2018-03-31
Tissue kallikrein, kallikrein-related peptidases (KLKs), and plasma kallikrein form the largest group of serine proteases in the human genome, sharing many structural and functional properties. Several KLK transcripts have been found aberrantly expressed in numerous human malignancies, confirming their prognostic or/and diagnostic values. However, the process of alternative splicing can now be studied in-depth due to the development of Next-Generation Sequencing (NGS). In the present study, we used NGS to discover novel transcripts of the KLK1 and KLK2 genes, after nested touchdown PCR. Bioinformatics analysis and PCR experiments revealed a total of eleven novel KLK transcripts (two KLK1 and nine KLK2 transcripts). In addition, the expression profiles of each novel transcript were investigated with nested PCR experiments using variant-specific primers. Since KLKs are implicated in human malignancies, qualifying as potential biomarkers, the quantification of the presented novel transcripts in human samples may have clinical applications in different types of cancer. Copyright © 2018. Published by Elsevier Inc.
Grosso, Ana R; Leite, Ana P; Carvalho, Sílvia; Matos, Mafalda R; Martins, Filipa B; Vítor, Alexandra C; Desterro, Joana MP; Carmo-Fonseca, Maria; de Almeida, Sérgio F
2015-01-01
Aberrant expression of cancer genes and non-canonical RNA species is a hallmark of cancer. However, the mechanisms driving such atypical gene expression programs are incompletely understood. Here, our transcriptional profiling of a cohort of 50 primary clear cell renal cell carcinoma (ccRCC) samples from The Cancer Genome Atlas (TCGA) reveals that transcription read-through beyond the termination site is a source of transcriptome diversity in cancer cells. Amongst the genes most frequently mutated in ccRCC, we identified SETD2 inactivation as a potent enhancer of transcription read-through. We further show that invasion of neighbouring genes and generation of RNA chimeras are functional outcomes of transcription read-through. We identified the BCL2 oncogene as one of such invaded genes and detected a novel chimera, the CTSC-RAB38, in 20% of ccRCC samples. Collectively, our data highlight a novel link between transcription read-through and aberrant expression of oncogenes and chimeric transcripts that is prevalent in cancer. DOI: http://dx.doi.org/10.7554/eLife.09214.001 PMID:26575290
Trumbić, Željka; Bekaert, Michaël; Taggart, John B; Bron, James E; Gharbi, Karim; Mladineo, Ivona
2015-11-25
The largest of the tuna species, Atlantic bluefin tuna (Thunnus thynnus), inhabits the North Atlantic Ocean and the Mediterranean Sea and is considered to be an endangered species, largely a consequence of overfishing. T. thynnus aquaculture, referred to as fattening or farming, is a capture based activity dependent on yearly renewal from the wild. Thus, the development of aquaculture practices independent of wild resources can provide an important contribution towards ensuring security and sustainability of this species in the longer-term. The development of such practices is today greatly assisted by large scale transcriptomic studies. We have used pyrosequencing technology to sequence a mixed-tissue normalised cDNA library, derived from adult T. thynnus. A total of 976,904 raw sequence reads were assembled into 33,105 unique transcripts having a mean length of 893 bases and an N50 of 870. Of these, 33.4% showed similarity to known proteins or gene transcripts and 86.6% of them were matched to the congeneric Pacific bluefin tuna (Thunnus orientalis) genome, compared to 70.3% for the more distantly related Nile tilapia (Oreochromis niloticus) genome. Transcript sequences were used to develop a novel 15 K Agilent oligonucleotide DNA microarray for T. thynnus and comparative tissue gene expression profiles were inferred for gill, heart, liver, ovaries and testes. Functional contrasts were strongest between gills and ovaries. Gills were particularly associated with immune system, signal transduction and cell communication, while ovaries displayed signatures of glycan biosynthesis, nucleotide metabolism, transcription, translation, replication and repair. Sequence data generated from a novel mixed-tissue T. thynnus cDNA library provide an important transcriptomic resource that can be further employed for study of various aspects of T. thynnus ecology and genomics, with strong applications in aquaculture. Tissue-specific gene expression profiles inferred through the use of novel oligo-microarray can serve in the design of new and more focused transcriptomic studies for future research of tuna physiology and assessment of the welfare in a production environment.
Digital gene expression for non-model organisms
Hong, Lewis Z.; Li, Jun; Schmidt-Küntzel, Anne; Warren, Wesley C.; Barsh, Gregory S.
2011-01-01
Next-generation sequencing technologies offer new approaches for global measurements of gene expression but are mostly limited to organisms for which a high-quality assembled reference genome sequence is available. We present a method for gene expression profiling called EDGE, or EcoP15I-tagged Digital Gene Expression, based on ultra-high-throughput sequencing of 27-bp cDNA fragments that uniquely tag the corresponding gene, thereby allowing direct quantification of transcript abundance. We show that EDGE is capable of assaying for expression in >99% of genes in the genome and achieves saturation after 6–8 million reads. EDGE exhibits very little technical noise, reveals a large (106) dynamic range of gene expression, and is particularly suited for quantification of transcript abundance in non-model organisms where a high-quality annotated genome is not available. In a direct comparison with RNA-seq, both methods provide similar assessments of relative transcript abundance, but EDGE does better at detecting gene expression differences for poorly expressed genes and does not exhibit transcript length bias. Applying EDGE to laboratory mice, we show that a loss-of-function mutation in the melanocortin 1 receptor (Mc1r), recognized as a Mendelian determinant of yellow hair color in many different mammals, also causes reduced expression of genes involved in the interferon response. To illustrate the application of EDGE to a non-model organism, we examine skin biopsy samples from a cheetah (Acinonyx jubatus) and identify genes likely to control differences in the color of spotted versus non-spotted regions. PMID:21844123
Qi, Yanxiang; Liu, Xiaomei; Pu, Jinji
2018-01-01
The NAC transcription factors involved plant development and response to various stress stimuli. However, little information is available concerning the NAC family in the woodland strawberry. Herein, 37 NAC genes were identified from the woodland strawberry genome and were classified into 13 groups based on phylogenetic analysis. And further analyses of gene structure and conserved motifs showed closer relationship of them in every subgroup. Quantitative real-time PCR evaluation different tissues revealed distinct spatial expression profiles of the FvNAC genes. The comprehensive expression of FvNAC genes revealed under abiotic stress (cold, heat, drought, salt), signal molecule treatments (H2O2, ABA, melatonin, rapamycin), biotic stress (Colletotrichum gloeosporioides and Ralstonia solanacearum). Expression profiles derived from quantitative real-time PCR suggested that 5 FvNAC genes responded dramatically to the various abiotic and biotic stresses, indicating their contribution to abiotic and biotic stresses resistance in woodland strawberry. Interestingly, FvNAC genes showed greater extent responded to the cold treatment than other abiotic stress, and H2O2 exhibited a greater response than ABA, melatonin, and rapamycin. For biotic stresses, 3 FvNAC genes were up-regulated during infection with C. gloeosporioides, while 6 FvNAC genes were down-regulated during infection with R. solanacearum. In conclusion, this study identified candidate FvNAC genes to be used for the genetic improvement of abiotic and biotic stress tolerance in woodland strawberry. PMID:29897926
Giltnane, Jennifer M; Hutchinson, Katherine E; Stricker, Thomas P; Formisano, Luigi; Young, Christian D; Estrada, Monica V; Nixon, Mellissa J; Du, Liping; Sanchez, Violeta; Ericsson, Paula Gonzalez; Kuba, Maria G; Sanders, Melinda E; Mu, Xinmeng J; Van Allen, Eliezer M; Wagle, Nikhil; Mayer, Ingrid A; Abramson, Vandana; Gόmez, Henry; Rizzo, Monica; Toy, Weiyi; Chandarlapaty, Sarat; Mayer, Erica L; Christiansen, Jason; Murphy, Danielle; Fitzgerald, Kerry; Wang, Kai; Ross, Jeffrey S; Miller, Vincent A; Stephens, Phillip J; Yelensky, Roman; Garraway, Levi; Shyr, Yu; Meszoely, Ingrid; Balko, Justin M; Arteaga, Carlos L
2017-08-09
Inhibition of proliferation in estrogen receptor-positive (ER + ) breast cancers after short-term antiestrogen therapy correlates with long-term patient outcome. We profiled 155 ER + /human epidermal growth factor receptor 2-negative (HER2 - ) early breast cancers from 143 patients treated with the aromatase inhibitor letrozole for 10 to 21 days before surgery. Twenty-one percent of tumors remained highly proliferative, suggesting that these tumors harbor alterations associated with intrinsic endocrine therapy resistance. Whole-exome sequencing revealed a correlation between 8p11-12 and 11q13 gene amplifications, including FGFR1 and CCND1 , respectively, and high Ki67. We corroborated these findings in a separate cohort of serial pretreatment, postneoadjuvant chemotherapy, and recurrent ER + tumors. Combined inhibition of FGFR1 and CDK4/6 reversed antiestrogen resistance in ER + FGFR1 / CCND1 coamplified CAMA1 breast cancer cells. RNA sequencing of letrozole-treated tumors revealed the existence of intrachromosomal ESR1 fusion transcripts and increased expression of gene signatures indicative of enhanced E2F-mediated transcription and cell cycle processes in cancers with high Ki67. These data suggest that short-term preoperative estrogen deprivation followed by genomic profiling can be used to identify druggable alterations that may cause intrinsic endocrine therapy resistance. Copyright © 2017 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
Identification of transcriptional regulators in the mouse immune system
Jojic, Vladimir; Shay, Tal; Sylvia, Katelyn; Zuk, Or; Sun, Xin; Kang, Joonsoo; Regev, Aviv; Koller, Daphne
2013-01-01
The differentiation of hematopoietic stem cells into immune cells has been extensively studied in mammals, but the transcriptional circuitry controlling it is still only partially understood. Here, the Immunological Genome Project gene expression profiles across mouse immune lineages allowed us to systematically analyze these circuits. Using a computational algorithm called Ontogenet, we uncovered differentiation-stage specific regulators of mouse hematopoiesis, identifying many known hematopoietic regulators, and 175 new candidate regulators, their target genes, and the cell types in which they act. Among the novel regulators, we highlight the role of ETV5 in γδT cells differntiation. Since the transcriptional program of human and mouse cells is highly conserved1, it is likely that many lessons learned from the mouse model apply to humans. PMID:23624555
Pujolar, J M; Milan, M; Marino, I A M; Capoccioni, F; Ciccotti, E; Belpaire, C; Covaci, A; Malarvannan, G; Patarnello, T; Bargelloni, L; Zane, L; Maes, G E
2013-05-15
The European eel illustrates an example of a critically endangered fish species strongly affected by human stressors throughout its life cycle, in which pollution is considered to be one of the factors responsible for the decline of the stock. The objective of our study was to better understand the transcriptional response of European eels chronically exposed to pollutants in their natural environment. A total of 42 pre-migrating (silver) female eels from lowly, highly and extremely polluted environments in Belgium and, for comparative purposes, a lowly polluted habitat in Italy were measured for polychlorinated biphenyls (PCBs), organochlorine pesticides (OCPs) and brominated flame retardants (BFRs). Multipollutant level of bioaccumulation was linked to their genome-wide gene transcription using an eel-specific array of 14,913 annotated cDNAs. Shared responses to pollutant exposure were observed when comparing the highly polluted site in Belgium with the relatively clean sites in Belgium and Italy. First, an altered pattern of transcription of genes was associated with detoxification, with a novel European eel CYP3A gene and gluthatione S-transferase transcriptionally up-regulated. Second, an altered pattern of transcription of genes associated with the oxidative phosphorylation pathway, with the following genes involved in the generation of ATP being transcriptionally down-regulated in individuals from the highly polluted site: NADH dehydrogenase, succinate dehydrogenase, ubiquinol-cytochrome c reductase, cytochrome c oxidase and ATP synthase. Although we did not measure metabolism directly, seeing that the transcription level of many genes encoding enzymes involved in the mitochondrial respiratory chain and oxidative phosphorylation were down-regulated in the highly polluted site suggests that pollutants may have a significant effect on energy metabolism in these fish. Copyright © 2013 Elsevier B.V. All rights reserved.
Li, Quan-Zhen; Li, Ping; Garcia, Gabriela E; Johnson, Richard J; Feng, Lili
2005-02-01
The great similarity of the genomes of humans and other species stimulated us to search for genes regulated by elements associated with human uniqueness, such as the mind-body interaction. DNA microarray technology offers the advantage of analyzing thousands of genes simultaneously, with the potential to determine healthy phenotypic changes in gene expression. The aim of this study was to determine the genomic profile and function of neutrophils in Falun Gong (FLG, an ancient Chinese Qigong) practitioners, with healthy subjects as controls. Six (6) Asian FLG practitioners and 6 Asian normal healthy controls were recruited for our study. The practitioners have practiced FLG for at least 1 year (range, 1-5 years). The practice includes daily reading of FLG books and daily practice of exercises lasting 1-2 hours. Selected normal healthy controls did not perform Qigong, yoga, t'ai chi, or any other type of mind-body practice, and had not followed any conventional physical exercise program for at least 1 year. Neutrophils were isolated from fresh blood and assayed for gene expression, using microarrays and RNase protection assay (RPA), as well as for function (phagocytosis) and survival (apoptosis). The changes in gene expression of FLG practitioners in contrast to normal healthy controls were characterized by enhanced immunity, downregulation of cellular metabolism, and alteration of apoptotic genes in favor of a rapid resolution of inflammation. The lifespan of normal neutrophils was prolonged, while the inflammatory neutrophils displayed accelerated cell death in FLG practitioners as determined by enzyme-linked immunosorbent assay. Correlating with enhanced immunity reflected by microarray data, neutrophil phagocytosis was significantly increased in Qigong practitioners. Some of the altered genes observed by microarray were confirmed by RPA. Qigong practice may regulate immunity, metabolic rate, and cell death, possibly at the transcriptional level. Our pilot study provides the first evidence that Qigong practice may exert transcriptional regulation at a genomic level. New approaches are needed to study how genes are regulated by elements associated with human uniqueness, such as consciousness, cognition, and spirituality.
Tai, Phillip W L; Wu, Hai; van Wijnen, André J; Stein, Gary S; Stein, Janet L; Lian, Jane B
2017-01-01
The ability to discover regulatory sequences that control bone-related genes during development has been greatly improved by massively parallel sequencing methodologies. To expand our understanding of cis-regulatory regions critical to the control of gene expression during osteoblastogenesis, we probed the presence of open chromatin states across the osteoblast genome using global DNase hypersensitivity (DHS) mapping. Our profiling of MC3T3 mouse pre-osteoblasts during differentiation has identified more than 224,000 unique DHS sites. Approximately 65% of these sites are dynamic during temporal stages of osteoblastogenesis, and a majority of them are located within non-promoter (intergenic and intronic) regions. Nearly half of all DHS sites (both constitutive and dynamic) overlap binding events of the bone-essential RUNX2 and/or the chromatin-related CTCF transcription factors. This finding reinforces the role of these regulatory proteins as essential components of the bone gene regulome. We observe a reduction in chromatin accessibility throughout the genome between pre-osteoblast and early osteoblasts. Our analysis also defined a class of differentially expressed genes that harbor DHS peaks centered within 1 kb downstream of transcriptional end sites (TES). These DHSs at the 3'-flanks of genes exhibit dynamic changes during differentiation that may impact regulation of the osteoblast genome. Taken together, the distribution of DHS regions within non-promoter locations harboring osteoblast and chromatin related transcription factor binding motifs, reflect novel cis-regulatory requirements to support temporal gene expression in differentiating osteoblasts.
Khan, Sumbul Jawed; Abidi, Syeda Nayab Fatima; Skinner, Andrea; Tian, Yuan; Smith-Bolton, Rachel K
2017-07-01
Regenerating tissue must initiate the signaling that drives regenerative growth, and sustain that signaling long enough for regeneration to complete. How these key signals are sustained is unclear. To gain a comprehensive view of the changes in gene expression that occur during regeneration, we performed whole-genome mRNAseq of actively regenerating tissue from damaged Drosophila wing imaginal discs. We used genetic tools to ablate the wing primordium to induce regeneration, and carried out transcriptional profiling of the regeneration blastema by fluorescently labeling and sorting the blastema cells, thus identifying differentially expressed genes. Importantly, by using genetic mutants of several of these differentially expressed genes we have confirmed that they have roles in regeneration. Using this approach, we show that high expression of the gene moladietz (mol), which encodes the Duox-maturation factor NIP, is required during regeneration to produce reactive oxygen species (ROS), which in turn sustain JNK signaling during regeneration. We also show that JNK signaling upregulates mol expression, thereby activating a positive feedback signal that ensures the prolonged JNK activation required for regenerative growth. Thus, by whole-genome transcriptional profiling of regenerating tissue we have identified a positive feedback loop that regulates the extent of regenerative growth.
Skinner, Andrea; Tian, Yuan
2017-01-01
Regenerating tissue must initiate the signaling that drives regenerative growth, and sustain that signaling long enough for regeneration to complete. How these key signals are sustained is unclear. To gain a comprehensive view of the changes in gene expression that occur during regeneration, we performed whole-genome mRNAseq of actively regenerating tissue from damaged Drosophila wing imaginal discs. We used genetic tools to ablate the wing primordium to induce regeneration, and carried out transcriptional profiling of the regeneration blastema by fluorescently labeling and sorting the blastema cells, thus identifying differentially expressed genes. Importantly, by using genetic mutants of several of these differentially expressed genes we have confirmed that they have roles in regeneration. Using this approach, we show that high expression of the gene moladietz (mol), which encodes the Duox-maturation factor NIP, is required during regeneration to produce reactive oxygen species (ROS), which in turn sustain JNK signaling during regeneration. We also show that JNK signaling upregulates mol expression, thereby activating a positive feedback signal that ensures the prolonged JNK activation required for regenerative growth. Thus, by whole-genome transcriptional profiling of regenerating tissue we have identified a positive feedback loop that regulates the extent of regenerative growth. PMID:28753614
2012-01-01
Background The Azadirachta indica (neem) tree is a source of a wide number of natural products, including the potent biopesticide azadirachtin. In spite of its widespread applications in agriculture and medicine, the molecular aspects of the biosynthesis of neem terpenoids remain largely unexplored. The current report describes the draft genome and four transcriptomes of A. indica and attempts to contextualise the sequence information in terms of its molecular phylogeny, transcript expression and terpenoid biosynthesis pathways. A. indica is the first member of the family Meliaceae to be sequenced using next generation sequencing approach. Results The genome and transcriptomes of A. indica were sequenced using multiple sequencing platforms and libraries. The A. indica genome is AT-rich, bears few repetitive DNA elements and comprises about 20,000 genes. The molecular phylogenetic analyses grouped A. indica together with Citrus sinensis from the Rutaceae family validating its conventional taxonomic classification. Comparative transcript expression analysis showed either exclusive or enhanced expression of known genes involved in neem terpenoid biosynthesis pathways compared to other sequenced angiosperms. Genome and transcriptome analyses in A. indica led to the identification of repeat elements, nucleotide composition and expression profiles of genes in various organs. Conclusions This study on A. indica genome and transcriptomes will provide a model for characterization of metabolic pathways involved in synthesis of bioactive compounds, comparative evolutionary studies among various Meliaceae family members and help annotate their genomes. A better understanding of molecular pathways involved in the azadirachtin synthesis in A. indica will pave ways for bulk production of environment friendly biopesticides. PMID:22958331
Ni, Julie Zhouli; Kalinava, Natallia; Chen, Esteban; Huang, Alex; Trinh, Thi; Gu, Sam Guoping
2016-01-01
Environmental stress-induced transgenerational epigenetic effects have been observed in various model organisms and human. The capacity and mechanism of such phenomena are poorly understood. In C. elegans, siRNA mediates transgenerational gene silencing through the germline nuclear RNAi pathway. This pathway is also required to maintain the germline immortality when C. elegans is under heat stress. However, the underlying molecular mechanism is unknown. In this study, we investigated the impact of heat stress on chromatin, transcription, and siRNAs at the whole-genome level, and whether any of the heat-induced effects is transgenerationally heritable in either the wild-type or the germline nuclear RNAi mutant animals. We performed 12-generation temperature-shift experiments using the wild-type C. elegans and a mutant strain that lacks the germline-specific nuclear Argonaute protein HRDE-1/WAGO-9. By examining the mRNA, small RNA, RNA polymerase II, and H3K9 trimethylation profiles at the whole-genome level, we revealed an epigenetic role of HRDE-1 in repressing heat stress-induced transcriptional activation of over 280 genes. Many of these genes are in or near LTR (long-terminal repeat) retrotransposons. Strikingly, for some of these genes, the heat stress-induced transcriptional activation in the hrde-1 mutant intensifies in the late generations under the heat stress and is heritable for at least two generations after the mutant animals are shifted back to lower temperature. hrde-1 mutation also leads to siRNA expression changes of many genes. This effect on siRNA is dependent on both the temperature and generation. Our study demonstrated that a large number of the endogenous targets of the germline nuclear RNAi pathway in C. elegans are sensitive to heat-induced transcriptional activation. This effect at certain genomic loci including LTR retrotransposons is transgenerational. Germline nuclear RNAi antagonizes this temperature effect at the transcriptional level and therefore may play a key role in heat stress response in C. elegans.
Moqtaderi, Zarmik; Wang, Jie; Raha, Debasish; White, Robert J.; Snyder, Michael; Weng, Zhiping; Struhl, Kevin
2012-01-01
Genome-wide occupancy profiles of five components of the RNA Polymerase III (Pol III) machinery in human cells identified the expected tRNA and non-coding RNA targets and revealed many additional Pol III-associated loci, mostly near SINEs. Several genes are targets of an alternative TFIIIB containing Brf2 instead of Brf1 and have extremely low levels of TFIIIC. Strikingly, expressed Pol III genes, unlike non-expressed Pol III genes, are situated in regions with a pattern of histone modifications associated with functional Pol II promoters. TFIIIC alone associates with numerous ETC loci, via the B box or a novel motif. ETCs are often near CTCF binding sites, suggesting a potential role in chromosome organization. Our results suggest that human Pol III complexes associate preferentially with regions near functional Pol II promoters and that TFIIIC-mediated recruitment of TFIIIB is regulated in a locus-specific manner. PMID:20418883
Fortes, Ana M; Agudelo-Romero, Patricia; Silva, Marta S; Ali, Kashif; Sousa, Lisete; Maltese, Federica; Choi, Young H; Grimplet, Jerome; Martinez-Zapater, José M; Verpoorte, Robert; Pais, Maria S
2011-11-02
Grapes (Vitis vinifera L.) are economically the most important fruit crop worldwide. However, the complexity of molecular and biochemical events that lead to the onset of ripening of nonclimacteric fruits is not fully understood which is further complicated in grapes due to seasonal and cultivar specific variation. The Portuguese wine variety Trincadeira gives rise to high quality wines but presents extremely irregular berry ripening among seasons probably due to high susceptibility to abiotic and biotic stresses. Ripening of Trincadeira grapes was studied taking into account the transcriptional and metabolic profilings complemented with biochemical data. The mRNA expression profiles of four time points spanning developmental stages from pea size green berries, through véraison and mature berries (EL 32, EL 34, EL 35 and EL 36) and in two seasons (2007 and 2008) were compared using the Affymetrix GrapeGen® genome array containing 23096 probesets corresponding to 18726 unique sequences. Over 50% of these probesets were significantly differentially expressed (1.5 fold) between at least two developmental stages. A common set of modulated transcripts corresponding to 5877 unigenes indicates the activation of common pathways between years despite the irregular development of Trincadeira grapes. These unigenes were assigned to the functional categories of "metabolism", "development", "cellular process", "diverse/miscellanenous functions", "regulation overview", "response to stimulus, stress", "signaling", "transport overview", "xenoprotein, transposable element" and "unknown". Quantitative RT-PCR validated microarrays results being carried out for eight selected genes and five developmental stages (EL 32, EL 34, EL 35, EL 36 and EL 38). Metabolic profiling using 1H NMR spectroscopy associated to two-dimensional techniques showed the importance of metabolites related to oxidative stress response, amino acid and sugar metabolism as well as secondary metabolism. These results were integrated with transcriptional profiling obtained using genome array to provide new information regarding the network of events leading to grape ripening. Altogether the data obtained provides the most extensive survey obtained so far for gene expression and metabolites accumulated during grape ripening. Moreover, it highlighted information obtained in a poorly known variety exhibiting particular characteristics that may be cultivar specific or dependent upon climatic conditions. Several genes were identified that had not been previously reported in the context of grape ripening namely genes involved in carbohydrate and amino acid metabolisms as well as in growth regulators; metabolism, epigenetic factors and signaling pathways. Some of these genes were annotated as receptors, transcription factors, and kinases and constitute good candidates for functional analysis in order to establish a model for ripening control of a non-climacteric fruit.
Kirchner, E A; Bornkamm, G W; Polack, A
1991-10-01
We have studied the relative rate of transcription across the Epstein-Barr virus genome in the Burkitt's lymphoma cell line Raji by nuclear run-on analysis during latency and after induction of an abortive lytic cycle with 12-0-tetradecanoylphorbol 13-acetate (TPA) and 5-iodo-2'-deoxyuridine (IUdR). During latency the entire, or almost the entire, viral genome was found to be transcriptionally active to a low or intermediate extent, with some variation in activity along the genome. The fragment with the highest transcriptional activity was EcoRI J, which contains the genes encoding the small nuclear RNAs EBER1 and -2, transcribed predominantly by RNA polymerase III. An intermediate level of transcription was observed between positions 10 and 138 (kb), with areas of slightly higher activity on the large internal repeats and the left duplicated region (DL). The remaining part of the viral genome, between position 138 and the termini, and the termini and position 10 (kb) (with the exception of the EcoRI J fragment), showed very little transcriptional activity, except for the intermediately active regions carrying the righthand oriLyt (DR) and the terminal repeats. Upon induction of the viral genome with TPA and IUdR, the viral genome was transcriptionally active at a rate at least tenfold that seen during latency. Polymerases were not equally distributed along the genome after induction; the highest density was found in regions 48 to 58 kb, 82 to 84 kb, 102 to 104 kb, 118 to 122 kb and 142 to 145 kb of the viral genome. High transcriptional activity correlated with distinct transcription units in some cases, i.e. BamHI H1LF1 (DL), BamHI MLF1, BamHI ZLF1/BamHI RLF1 and BamHI X (thymidine kinase), but not in others (BamHI H2). Besides initiation of transcription, other regulatory processes such as stabilization and processing of primary transcripts may also contribute to regulation of virus gene expression. Addition of cycloheximide completely abolished the transcriptional activation of the genome mediated by TPA and IUdR.
Zhou, Yunying; Zhang, Qishu; Gao, Ge; Zhang, Xiaoli; Liu, Yafei; Yuan, Shoudao
2016-01-01
ABSTRACT The E7 oncoprotein of the high-risk human papillomavirus (HPV) plays a major role in HPV-induced carcinogenesis. E7 abrogates the G1 cell cycle checkpoint and induces genomic instability, but the mechanism is not fully understood. In this study, we performed RNA sequencing (RNA-seq) to characterize the transcriptional profile of keratinocytes expressing HPV 16 (HPV-16) E7. At the transcriptome level, 236 genes were differentially expressed between E7 and vector control cells. A subset of the differentially expressed genes, most of them novel to E7-expressing cells, was further confirmed by real-time PCR. Of interest, the activities of multiple transcription factors were altered in E7-expressing cells. Through bioinformatics analysis, pathways altered in E7-expressing cells were investigated. The upregulated genes were enriched in cell cycle and DNA replication, as well as in the DNA metabolic process, transcription, DNA damage, DNA repair, and nucleotide metabolism. Specifically, we focused our studies on the gene encoding WDHD1 (WD repeat and high mobility group [HMG]-box DNA-binding protein), one of the genes that was upregulated in E7-expressing cells. WDHD1 is a component of the replisome that regulates DNA replication. Recent studies suggest that WDHD1 may also function as a DNA replication initiation factor as well as a G1 checkpoint regulator. We found that in E7-expressing cells, the steady-state level of WDHD1 protein was increased along with the half-life. Moreover, downregulation of WDHD1 reduced E7-induced G1 checkpoint abrogation and rereplication, demonstrating a novel function for WDHD1. These studies shed light on mechanisms by which HPV induces genomic instability and have therapeutic implications. IMPORTANCE The high-risk HPV types induce cervical cancer and encode an E7 oncoprotein that plays a major role in HPV-induced carcinogenesis. However, the mechanism by which E7 induces carcinogenesis is not fully understood; specific anti-HPV agents are not available. In this study, we performed RNA-seq to characterize transcriptional profiling of keratinocytes expressing HPV-16 E7 and identified more than 200 genes that were differentially expressed between E7 and vector control cells. Through bioinformatics analysis, pathways altered in E7-expressing cells were identified. Significantly, the WDHD1 gene, one of the genes that is upregulated in E7-expressing cells, was found to play an important role in E7-induced G1 checkpoint abrogation and rereplication. These studies shed light on mechanisms by which HPV induces genomic instability and have therapeutic implications. PMID:27099318
Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.
Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G
2000-12-15
The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.
Dieterich, Christine; Puey, Angela; Lin, Sylvia; Lyn, Sylvia; Swezey, Robert; Furimsky, Anna; Fairchild, David; Mirsalis, Jon C; Ng, Hanna H
2009-01-01
Vancomycin, one of few effective treatments against methicillin-resistant Staphylococcus aureus, is nephrotoxic. The goals of this study were to (1) gain insights into molecular mechanisms of nephrotoxicity at the genomic level, (2) evaluate gene markers of vancomycin-induced kidney injury, and (3) compare gene expression responses after iv and ip administration. Groups of six female BALB/c mice were treated with seven daily iv or ip doses of vancomycin (50, 200, and 400 mg/kg) or saline, and sacrificed on day 8. Clinical chemistry and histopathology demonstrated kidney injury at 400 mg/kg only. Hierarchical clustering analysis revealed that kidney gene expression profiles of all mice treated at 400 mg/kg clustered with those of mice administered 200 mg/kg iv. Transcriptional profiling might thus be more sensitive than current clinical markers for detecting kidney damage, though the profiles can differ with the route of administration. Analysis of transcripts whose expression was changed by at least twofold compared with vehicle saline after high iv and ip doses of vancomycin suggested the possibility of oxidative stress and mitochondrial damage in vancomycin-induced toxicity. In addition, our data showed changes in expression of several transcripts from the complement and inflammatory pathways. Such expression changes were confirmed by relative real-time reverse transcription-polymerase chain reaction. Finally, our results further substantiate the use of gene markers of kidney toxicity such as KIM-1/Havcr1, as indicators of renal injury.
Howell, Kate Joanne; Kraiczy, Judith; Nayak, Komal M; Gasparetto, Marco; Ross, Alexander; Lee, Claire; Mak, Tim N; Koo, Bon-Kyoung; Kumar, Nitin; Lawley, Trevor; Sinha, Anupam; Rosenstiel, Philip; Heuschkel, Robert; Stegle, Oliver; Zilbauer, Matthias
2018-02-01
We analyzed DNA methylation patterns and transcriptomes of primary intestinal epithelial cells (IEC) of children newly diagnosed with inflammatory bowel diseases (IBD) to learn more about pathogenesis. We obtained mucosal biopsies (N = 236) collected from terminal ileum and ascending and sigmoid colons of children (median age 13 years) newly diagnosed with IBD (43 with Crohn's disease [CD], 23 with ulcerative colitis [UC]), and 30 children without IBD (controls). Patients were recruited and managed at a hospital in the United Kingdom from 2013 through 2016. We also obtained biopsies collected at later stages from a subset of patients. IECs were purified and analyzed for genome-wide DNA methylation patterns and gene expression profiles. Adjacent microbiota were isolated from biopsies and analyzed by 16S gene sequencing. We generated intestinal organoid cultures from a subset of samples and genome-wide DNA methylation analysis was performed. We found gut segment-specific differences in DNA methylation and transcription profiles of IECs from children with IBD vs controls; some were independent of mucosal inflammation. Changes in gut microbiota between IBD and control groups were not as large and were difficult to assess because of large amounts of intra-individual variation. Only IECs from patients with CD had changes in DNA methylation and transcription patterns in terminal ileum epithelium, compared with controls. Colon epithelium from patients with CD and from patients with ulcerative colitis had distinct changes in DNA methylation and transcription patterns, compared with controls. In IECs from patients with IBD, changes in DNA methylation, compared with controls, were stable over time and were partially retained in ex-vivo organoid cultures. Statistical analyses of epithelial cell profiles allowed us to distinguish children with CD or UC from controls; profiles correlated with disease outcome parameters, such as the requirement for treatment with biologic agents. We identified specific changes in DNA methylation and transcriptome patterns in IECs from pediatric patients with IBD compared with controls. These data indicate that IECs undergo changes during IBD development and could be involved in pathogenesis. Further analyses of primary IECs from patients with IBD could improve our understanding of the large variations in disease progression and outcomes. Copyright © 2018 AGA Institute. Published by Elsevier Inc. All rights reserved.
Transcript profiling reveals expression differences in wild-type and glabrous soybean lines
2011-01-01
Background Trichome hairs affect diverse agronomic characters such as seed weight and yield, prevent insect damage and reduce loss of water but their molecular control has not been extensively studied in soybean. Several detailed models for trichome development have been proposed for Arabidopsis thaliana, but their applicability to important crops such as cotton and soybean is not fully known. Results Two high throughput transcript sequencing methods, Digital Gene Expression (DGE) Tag Profiling and RNA-Seq, were used to compare the transcriptional profiles in wild-type (cv. Clark standard, CS) and a mutant (cv. Clark glabrous, i.e., trichomeless or hairless, CG) soybean isoline that carries the dominant P1 allele. DGE data and RNA-Seq data were mapped to the cDNAs (Glyma models) predicted from the reference soybean genome, Williams 82. Extending the model length by 250 bp at both ends resulted in significantly more matches of authentic DGE tags indicating that many of the predicted gene models are prematurely truncated at the 5' and 3' UTRs. The genome-wide comparative study of the transcript profiles of the wild-type versus mutant line revealed a number of differentially expressed genes. One highly-expressed gene, Glyma04g35130, in wild-type soybean was of interest as it has high homology to the cotton gene GhRDL1 gene that has been identified as being involved in cotton fiber initiation and is a member of the BURP protein family. Sequence comparison of Glyma04g35130 among Williams 82 with our sequences derived from CS and CG isolines revealed various SNPs and indels including addition of one nucleotide C in the CG and insertion of ~60 bp in the third exon of CS that causes a frameshift mutation and premature truncation of peptides in both lines as compared to Williams 82. Conclusion Although not a candidate for the P1 locus, a BURP family member (Glyma04g35130) from soybean has been shown to be abundantly expressed in the CS line and very weakly expressed in the glabrous CG line. RNA-Seq and DGE data are compared and provide experimental data on the expression of predicted soybean gene models as well as an overview of the genes expressed in young shoot tips of two closely related isolines. PMID:22029708
Gandhi, Deepa; Tarale, Prashant; Naoghare, Pravin K; Bafana, Amit; Kannan, Krishnamurthi; Sivanesan, Saravanadevi
2016-01-01
Endosulfan, an organochlorine pesticide, is known to induce multiple disorders/abnormalities including neuro-degenerative disorders in many animal species. However, the molecular mechanism of endosulfan induced neuronal alterations is still not well understood. In the present study, the effect of sub-lethal concentration of endosulfan (3 μM) on human neuroblastoma cells (SH-SY5Y) was investigated using genomic and proteomic approaches. Microarray and 2D-PAGE followed by MALDI-TOF-MS analysis revealed differential expression of 831 transcripts and 16 proteins in exposed cells. A gene ontology enrichment analysis revealed that the differentially expressed genes and proteins were involved in variety of cellular events such as neuronal developmental pathway, immune response, cell differentiation, apoptosis, transmission of nerve impulse, axonogenesis, etc. The present study attempted to explore the possible molecular mechanism of endosulfan induced neuronal alterations in SH-SY5Y cells using an integrated genomic and proteomic approach. Based on the gene and protein profile possible mechanisms underlying endosulfan neurotoxicity were predicted. Copyright © 2015 Elsevier B.V. All rights reserved.
Conditional Selection of Genomic Alterations Dictates Cancer Evolution and Oncogenic Dependencies.
Mina, Marco; Raynaud, Franck; Tavernari, Daniele; Battistello, Elena; Sungalee, Stephanie; Saghafinia, Sadegh; Laessle, Titouan; Sanchez-Vega, Francisco; Schultz, Nikolaus; Oricchio, Elisa; Ciriello, Giovanni
2017-08-14
Cancer evolves through the emergence and selection of molecular alterations. Cancer genome profiling has revealed that specific events are more or less likely to be co-selected, suggesting that the selection of one event depends on the others. However, the nature of these evolutionary dependencies and their impact remain unclear. Here, we designed SELECT, an algorithmic approach to systematically identify evolutionary dependencies from alteration patterns. By analyzing 6,456 genomes from multiple tumor types, we constructed a map of oncogenic dependencies associated with cellular pathways, transcriptional readouts, and therapeutic response. Finally, modeling of cancer evolution shows that alteration dependencies emerge only under conditional selection. These results provide a framework for the design of strategies to predict cancer progression and therapeutic response. Copyright © 2017 Elsevier Inc. All rights reserved.
Li, Caiqin; Wang, Yan; Ying, Peiyuan; Ma, Wuqiang; Li, Jianguo
2015-01-01
The high level of physiological fruitlet abscission in litchi (Litchi chinensis Sonn.) causes severe yield loss. Cell separation occurs at the fruit abscission zone (FAZ) and can be triggered by ethylene. However, a deep knowledge of the molecular events occurring in the FAZ is still unknown. Here, genome-wide digital transcript abundance (DTA) analysis of putative fruit abscission related genes regulated by ethephon in litchi were studied. More than 81 million high quality reads from seven ethephon treated and untreated control libraries were obtained by high-throughput sequencing. Through DTA profile analysis in combination with Gene Ontology and KEGG pathway enrichment analyses, a total of 2730 statistically significant candidate genes were involved in the ethephon-promoted litchi fruitlet abscission. Of these, there were 1867 early-responsive genes whose expressions were up- or down-regulated from 0 to 1 d after treatment. The most affected genes included those related to ethylene biosynthesis and signaling, auxin transport and signaling, transcription factors (TFs), protein ubiquitination, ROS response, calcium signal transduction, and cell wall modification. These genes could be clustered into four groups and 13 subgroups according to their similar expression patterns. qRT-PCR displayed the expression pattern of 41 selected candidate genes, which proved the accuracy of our DTA data. Ethephon treatment significantly increased fruit abscission and ethylene production of fruitlet. The possible molecular events to control the ethephon-promoted litchi fruitlet abscission were prompted out. The increased ethylene evolution in fruitlet would suppress the synthesis and polar transport of auxin and trigger abscission signaling. To the best of our knowledge, it is the first time to monitor the gene expression profile occurring in the FAZ-enriched pedicel during litchi fruit abscission induced by ethephon on the genome-wide level. This study will contribute to a better understanding for the molecular regulatory mechanism of fruit abscission in litchi. PMID:26217356
Licciardello, Concetta; D'Agostino, Nunzio; Traini, Alessandra; Recupero, Giuseppe Reforgiato; Frusciante, Luigi; Chiusano, Maria Luisa
2014-02-03
Glutathione S-transferases (GSTs) represent a ubiquitous gene family encoding detoxification enzymes able to recognize reactive electrophilic xenobiotic molecules as well as compounds of endogenous origin. Anthocyanin pigments require GSTs for their transport into the vacuole since their cytoplasmic retention is toxic to the cell. Anthocyanin accumulation in Citrus sinensis (L.) Osbeck fruit flesh determines different phenotypes affecting the typical pigmentation of Sicilian blood oranges. In this paper we describe: i) the characterization of the GST gene family in C. sinensis through a systematic EST analysis; ii) the validation of the EST assembly by exploiting the genome sequences of C. sinensis and C. clementina and their genome annotations; iii) GST gene expression profiling in six tissues/organs and in two different sweet orange cultivars, Cadenera (common) and Moro (pigmented). We identified 61 GST transcripts, described the full- or partial-length nature of the sequences and assigned to each sequence the GST class membership exploiting a comparative approach and the classification scheme proposed for plant species. A total of 23 full-length sequences were defined. Fifty-four of the 61 transcripts were successfully aligned to the C. sinensis and C. clementina genomes. Tissue specific expression profiling demonstrated that the expression of some GST transcripts was 'tissue-affected' and cultivar specific. A comparative analysis of C. sinensis GSTs with those from other plant species was also considered. Data from the current analysis are accessible at http://biosrv.cab.unina.it/citrusGST/, with the aim to provide a reference resource for C. sinensis GSTs. This study aimed at the characterization of the GST gene family in C. sinensis. Based on expression patterns from two different cultivars and on sequence-comparative analyses, we also highlighted that two sequences, a Phi class GST and a Mapeg class GST, could be involved in the conjugation of anthocyanin pigments and in their transport into the vacuole, specifically in fruit flesh of the pigmented cultivar.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Adamopoulos, Panagiotis G.; Kontos, Christos K.; Scorilas, Andreas
Tissue kallikrein and kallikrein-related peptidases (KLKs) form the largest group of serine proteases in the human genome, sharing many structural and functional characteristics. Multiple alternative transcripts have been reported for the most human KLK genes, while many of them are aberrantly expressed in various malignancies, thus possessing significant prognostic and/or diagnostic value. Alternative splicing of cancer-related genes is a common cellular mechanism accounting for cancer cell transcriptome complexity, as it affects cell cycle control, proliferation, apoptosis, invasion, and metastasis. In this study, we describe the identification and molecular cloning of eight novel transcripts of the human KLK10 gene using 3′more » rapid amplification of cDNA ends (3′ RACE) and next-generation sequencing (NGS), as well as their expression analysis in a wide panel of cell lines, originating from several distinct cancerous and normal tissues. Bioinformatic analysis revealed that the novel KLK10 transcripts contain new alternative splicing events between already annotated exons as well as novel exons. In addition, investigation of their expression profile in a wide panel of cell lines was performed with nested RT-PCR using variant-specific pairs of primers. Since many KLK mRNA transcripts possess clinical value, these newly discovered alternatively spliced KLK10 transcripts appear as new potential biomarkers for diagnostic and/or prognostic purposes or as targets for therapeutic strategies. - Highlights: • NGS was used to identify novel transcripts of the human KLK10 gene. • 8 novel KLK10 transcripts were identified. • A novel 3′UTR was detected and characterized. • The expression profiles of all 8 novel KLK10 transcripts were identified.« less
Aptazyme-embedded guide RNAs enable ligand-responsive genome editing and transcriptional activation
Tang, Weixin; Hu, Johnny H.; Liu, David R.
2017-01-01
Programmable sequence-specific genome editing agents such as CRISPR-Cas9 have greatly advanced our ability to manipulate the human genome. Although canonical forms of genome-editing agents and programmable transcriptional regulators are constitutively active, precise temporal and spatial control over genome editing and transcriptional regulation activities would enable the more selective and potentially safer use of these powerful technologies. Here, by incorporating ligand-responsive self-cleaving catalytic RNAs (aptazymes) into guide RNAs, we developed a set of aptazyme-embedded guide RNAs that enable small molecule-controlled nuclease-mediated genome editing and small molecule-controlled base editing, as well as small molecule-dependent transcriptional activation in mammalian cells. PMID:28656978
Martín-Subero, José I; Kreuz, Markus; Bibikova, Marina; Bentink, Stefan; Ammerpohl, Ole; Wickham-Garcia, Eliza; Rosolowski, Maciej; Richter, Julia; Lopez-Serra, Lidia; Ballestar, Esteban; Berger, Hilmar; Agirre, Xabier; Bernd, Heinz-Wolfram; Calvanese, Vincenzo; Cogliatti, Sergio B; Drexler, Hans G; Fan, Jian-Bing; Fraga, Mario F; Hansmann, Martin L; Hummel, Michael; Klapper, Wolfram; Korn, Bernhard; Küppers, Ralf; Macleod, Roderick A F; Möller, Peter; Ott, German; Pott, Christiane; Prosper, Felipe; Rosenwald, Andreas; Schwaenen, Carsten; Schübeler, Dirk; Seifert, Marc; Stürzenhofecker, Benjamin; Weber, Michael; Wessendorf, Swen; Loeffler, Markus; Trümper, Lorenz; Stein, Harald; Spang, Rainer; Esteller, Manel; Barker, David; Hasenclever, Dirk; Siebert, Reiner
2009-03-12
Lymphomas are assumed to originate at different stages of lymphocyte development through chromosomal aberrations. Thus, different lymphomas resemble lymphocytes at distinct differentiation stages and show characteristic morphologic, genetic, and transcriptional features. Here, we have performed a microarray-based DNA methylation profiling of 83 mature aggressive B-cell non-Hodgkin lymphomas (maB-NHLs) characterized for their morphologic, genetic, and transcriptional features, including molecular Burkitt lymphomas and diffuse large B-cell lymphomas. Hierarchic clustering indicated that methylation patterns in maB-NHLs were not strictly associated with morphologic, genetic, or transcriptional features. By supervised analyses, we identified 56 genes de novo methylated in all lymphoma subtypes studied and 22 methylated in a lymphoma subtype-specific manner. Remarkably, the group of genes de novo methylated in all lymphoma subtypes was significantly enriched for polycomb targets in embryonic stem cells. De novo methylated genes in all maB-NHLs studied were expressed at low levels in lymphomas and normal hematopoietic tissues but not in nonhematopoietic tissues. These findings, especially the enrichment for polycomb targets in stem cells, indicate that maB-NHLs with different morphologic, genetic, and transcriptional background share a similar stem cell-like epigenetic pattern. This suggests that maB-NHLs originate from cells with stem cell features or that stemness was acquired during lymphomagenesis by epigenetic remodeling.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Royland, Joyce E.; Kodavanti, Prasada Rao S.
2008-09-01
Epidemiological studies indicate that low levels of polychlorinated biphenyl (PCB) exposure can adversely affect neurocognitive development. In animal models, perturbations in calcium signaling, neurotransmitters, and thyroid hormones have been postulated as potential mechanisms for PCB-induced developmental neurotoxicity. In order to understand the role of these proposed mechanisms and to identify other mechanisms in PCB-induced neurotoxicity, we have chosen a global approach utilizing oligonucleotide microarrays to examine gene expression profiles in the brain following developmental exposure to Aroclor 1254 (0 or 6 mg/kg/day from gestation day 6 through postnatal day (PND) 21) in Long-Evans rats. Gene expression levels in the cerebellummore » and hippocampus from PNDs 7 and 14 animals were determined on Affymetrix rat 230A{sub 2}.0 chips. In the cerebellum, 87 transcripts were altered at PND7 compared to 27 transcripts at PND14 by Aroclor 1254 exposure, with only one transcript affected at both ages. In hippocampus, 175 transcripts and 50 transcripts were altered at PND7 and PND14, respectively, by Aroclor 1254 exposure with five genes commonly affected. Functional analysis suggests that pathways related to calcium homeostasis (Gng3, Ryr2, Trdn, Cacna1a), intracellular signaling (Camk2d, Stk17b, Pacsin2, Ryr2, Trio, Fert2, Ptk2b), axonal guidance (Lum, Mxd3, Akap11, Gucy1b3), aryl hydrocarbon receptor signaling (Nfia, Col1a2), and transcripts involved in cell proliferation (Gspt2, Cdkn1c, Ptk2b) and differentiation (Ifitm31, Hpca, Zfp260, Igsf4a, Hes5) leading to the development of nervous system were significantly altered by Aroclor 1254 exposure. Of the two brain regions examined, Aroclor 1254-induced genomic changes were greater in the hippocampus than the cerebellum. The genomic data suggests that PCB-induced neurotoxic effects were due to disruption of normal ontogenetic pattern of nervous system growth and development by altering intracellular signaling pathways but not by endocrine disruption.« less
Transcriptional atlas of cardiogenesis maps congenital heart disease interactome.
Li, Xing; Martinez-Fernandez, Almudena; Hartjes, Katherine A; Kocher, Jean-Pierre A; Olson, Timothy M; Terzic, Andre; Nelson, Timothy J
2014-07-01
Mammalian heart development is built on highly conserved molecular mechanisms with polygenetic perturbations resulting in a spectrum of congenital heart diseases (CHD). However, knowledge of cardiogenic ontogeny that regulates proper cardiogenesis remains largely based on candidate-gene approaches. Mapping the dynamic transcriptional landscape of cardiogenesis from a genomic perspective is essential to integrate the knowledge of heart development into translational applications that accelerate disease discovery efforts toward mechanistic-based treatment strategies. Herein, we designed a time-course transcriptome analysis to investigate the genome-wide dynamic expression landscape of innate murine cardiogenesis ranging from embryonic stem cells to adult cardiac structures. This comprehensive analysis generated temporal and spatial expression profiles, revealed stage-specific gene functions, and mapped the dynamic transcriptome of cardiogenesis to curated pathways. Reconciling known genetic underpinnings of CHD, we deconstructed a disease-centric dynamic interactome encoded within this cardiogenic atlas to identify stage-specific developmental disturbances clustered on regulation of epithelial-to-mesenchymal transition (EMT), BMP signaling, NF-AT signaling, TGFb-dependent EMT, and Notch signaling. Collectively, this cardiogenic transcriptional landscape defines the time-dependent expression of cardiac ontogeny and prioritizes regulatory networks at the interface between health and disease. Copyright © 2014 the American Physiological Society.
RPO41-independent maintenance of [rho-] mitochondrial DNA in Saccharomyces cerevisiae.
Fangman, W L; Henly, J W; Brewer, B J
1990-01-01
A subset of promoters in the mitochondrial DNA (mtDNA) of the yeast Saccharomyces cerevisiae has been proposed to participate in replication initiation, giving rise to a primer through site-specific cleavage of an RNA transcript. To test whether transcription is essential for mtDNA maintenance, we examined two simple mtDNA deletion ([rho-]) genomes in yeast cells. One genome (HS3324) contains a consensus promoter (ATATAAGTA) for the mitochondrial RNA polymerase encoded by the nuclear gene RPO41, and the other genome (4a) does not. As anticipated, in RPO41 cells transcripts from the HS3324 genome were more abundant than were transcripts from the 4a genome. When the RPO41 gene was disrupted, both [rho-] genomes were efficiently maintained. The level of transcripts from HS3324 mtDNA was decreased greater than 400-fold in cells carrying the RPO41 disrupted gene; however, the low-level transcripts from 4a mtDNA were undiminished. These results indicate that replication of [rho-] genomes can be initiated in the absence of wild-type levels of the RPO41-encoded RNA polymerase.
The Role of Genome Accessibility in Transcription Factor Binding in Bacteria.
Gomes, Antonio L C; Wang, Harris H
2016-04-01
ChIP-seq enables genome-scale identification of regulatory regions that govern gene expression. However, the biological insights generated from ChIP-seq analysis have been limited to predictions of binding sites and cooperative interactions. Furthermore, ChIP-seq data often poorly correlate with in vitro measurements or predicted motifs, highlighting that binding affinity alone is insufficient to explain transcription factor (TF)-binding in vivo. One possibility is that binding sites are not equally accessible across the genome. A more comprehensive biophysical representation of TF-binding is required to improve our ability to understand, predict, and alter gene expression. Here, we show that genome accessibility is a key parameter that impacts TF-binding in bacteria. We developed a thermodynamic model that parameterizes ChIP-seq coverage in terms of genome accessibility and binding affinity. The role of genome accessibility is validated using a large-scale ChIP-seq dataset of the M. tuberculosis regulatory network. We find that accounting for genome accessibility led to a model that explains 63% of the ChIP-seq profile variance, while a model based in motif score alone explains only 35% of the variance. Moreover, our framework enables de novo ChIP-seq peak prediction and is useful for inferring TF-binding peaks in new experimental conditions by reducing the need for additional experiments. We observe that the genome is more accessible in intergenic regions, and that increased accessibility is positively correlated with gene expression and anti-correlated with distance to the origin of replication. Our biophysically motivated model provides a more comprehensive description of TF-binding in vivo from first principles towards a better representation of gene regulation in silico, with promising applications in systems biology.
Schmidt, Martin; Van Bel, Michiel; Woloszynska, Magdalena; Slabbinck, Bram; Martens, Cindy; De Block, Marc; Coppens, Frederik; Van Lijsebettens, Mieke
2017-07-06
Cytosine methylation in plant genomes is important for the regulation of gene transcription and transposon activity. Genome-wide methylomes are studied upon mutation of the DNA methyltransferases, adaptation to environmental stresses or during development. However, from basic biology to breeding programs, there is a need to monitor multiple samples to determine transgenerational methylation inheritance or differential cytosine methylation. Methylome data obtained by sodium hydrogen sulfite (bisulfite)-conversion and next-generation sequencing (NGS) provide genome-wide information on cytosine methylation. However, a profiling method that detects cytosine methylation state dispersed over the genome would allow high-throughput analysis of multiple plant samples with distinct epigenetic signatures. We use specific restriction endonucleases to enrich for cytosine coverage in a bisulfite and NGS-based profiling method, which was compared to whole-genome bisulfite sequencing of the same plant material. We established an effective methylome profiling method in plants, termed plant-reduced representation bisulfite sequencing (plant-RRBS), using optimized double restriction endonuclease digestion, fragment end repair, adapter ligation, followed by bisulfite conversion, PCR amplification and NGS. We report a performant laboratory protocol and a straightforward bioinformatics data analysis pipeline for plant-RRBS, applicable for any reference-sequenced plant species. As a proof of concept, methylome profiling was performed using an Oryza sativa ssp. indica pure breeding line and a derived epigenetically altered line (epiline). Plant-RRBS detects methylation levels at tens of millions of cytosine positions deduced from bisulfite conversion in multiple samples. To evaluate the method, the coverage of cytosine positions, the intra-line similarity and the differential cytosine methylation levels between the pure breeding line and the epiline were determined. Plant-RRBS reproducibly covers commonly up to one fourth of the cytosine positions in the rice genome when using MspI-DpnII within a group of five biological replicates of a line. The method predominantly detects cytosine methylation in putative promoter regions and not-annotated regions in rice. Plant-RRBS offers high-throughput and broad, genome-dispersed methylation detection by effective read number generation obtained from reproducibly covered genome fractions using optimized endonuclease combinations, facilitating comparative analyses of multi-sample studies for cytosine methylation and transgenerational stability in experimental material and plant breeding populations.
The protein expression landscape of mitosis and meiosis in diploid budding yeast.
Becker, Emmanuelle; Com, Emmanuelle; Lavigne, Régis; Guilleux, Marie-Hélène; Evrard, Bertrand; Pineau, Charles; Primig, Michael
2017-03-06
Saccharomyces cerevisiae is an established model organism for the molecular analysis of fundamental biological processes. The genomes of numerous strains have been sequenced, and the transcriptome and proteome ofmajor phases during the haploid and diploid yeast life cycle have been determined. However, much less is known about dynamic changes of the proteome when cells switch from mitotic growth to meiotic development. We report a quantitative protein profiling analysis of yeast cell division and differentiation based on mass spectrometry. Information about protein levels was integrated with strand-specific tiling array expression data. We identified a total of 2366 proteins in at least one condition, including 175 proteins showing a statistically significant>5-fold change across the sample set, and 136 proteins detectable in sporulating but not respiring cells. We correlate protein expression patterns with biological processes and molecular function by Gene Ontology term enrichment, chemoprofiling, transcription interference and the formation of double stranded RNAs by overlapping sense/antisense transcripts. Our work provides initial quantitative insight into protein expression in diploid respiring and differentiating yeast cells. Critically, it associates developmentally regulated induction of antisense long noncoding RNAs and double stranded RNAs with fluctuating protein concentrations during growth and development. This integrated genomics analysis helps better understand how the transcriptome and the proteome correlate in diploid yeast cells undergoing mitotic growth in the presence of acetate (respiration) versus meiotic differentiation (Meiosis I and II). The study (i) provides quantitative expression data for 2366 proteins and their cognate mRNAs in at least one sample, (ii) shows strongly fluctuating protein levels during growth and differentiation for 175 cases, and (iii) identifies 136 proteins absent in mitotic but present in meiotic yeast cells. We have integrated protein profiling data using mass spectrometry with tiling array RNA profiling data and information on double-stranded RNAs (dsRNAs) by overlapping sense/antisense transcripts from an RNA-Sequencing experiment. This work therefore provides quantitative insight into protein expression during cell division and development and associates changing protein levels with developmental stage specific induction of antisense transcripts and the formation of dsRNAs. Copyright © 2017 Elsevier B.V. All rights reserved.
Tsirigos, Aristotelis; Lin, Zhao; Pavlides, Stephanos; Wang, Chengwang; Flomenberg, Neal; Knudsen, Erik S; Howell, Anthony; Pestell, Richard G
2011-01-01
Previously, we showed that high-energy metabolites (lactate and ketones) “fuel” tumor growth and experimental metastasis in an in vivo xenograft model, most likely by driving oxidative mitochondrial metabolism in breast cancer cells. To mechanistically understand how these metabolites affect tumor cell behavior, here we used genome-wide transcriptional profiling. Human breast cancer cells (MCF7) were cultured with lactate or ketones, and then subjected to transcriptional analysis (exon-array). Interestingly, our results show that treatment with these high-energy metabolites increases the transcriptional expression of gene profiles normally associated with “stemness”, including genes upregulated in embryonic stem (ES) cells. Similarly, we observe that lactate and ketones promote the growth of bonafide ES cells, providing functional validation. The lactate- and ketone-induced “gene signatures” were able to predict poor clinical outcome (including recurrence and metastasis) in human breast cancer patients. Taken together, our results are consistent with the idea that lactate and ketone utilization in cancer cells promotes the “cancer stem cell” phenotype, resulting in significant decreases in patient survival. One possible mechanism by which high-energy metabolites might induce stemness is by increasing the pool of Acetyl-CoA, leading to increased histone acetylation and elevated gene expression. Thus, our results mechanistically imply that clinical outcome in breast cancer could simply be determined by epigenetics and energy metabolism, rather than by the accumulation of specific “classical” gene mutations. We also suggest that high-risk cancer patients (identified by the lactate/ketone gene signatures) could be treated with new therapeutics that target oxidative mitochondrial metabolism, such as the anti-oxidant and “mitochondrial poison” metformin. Finally, we propose that this new approach to personalized cancer medicine be termed “metabolo-genomics,” which incorporates features of both (1) cell metabolism and (2) gene transcriptional profiling. This powerful new approach directly links cancer cell metabolism with clinical outcome, and suggests new therapeutic strategies for inhibiting the TCA cycle and mitochondrial oxidative phosphorylation in cancer cells. PMID:21512313
Zeng, Shaohua; Xiao, Gong; Wang, Gan; Wang, Ying; Peng, Ming; Huang, Hongwen
2015-01-01
Red-fleshed kiwifruit (Actinidia chinensis Planch. ‘Hongyang’) is a promising commercial cultivar due to its nutritious value and unique flesh color, derived from vitamin C and anthocyanins. In this study, we obtained transcriptome data of ‘Hongyang’ from seven developmental stages using Illumina sequencing. We mapped 39–54 million reads to the recently sequenced kiwifruit genome and other databases to define gene structure, to analyze alternative splicing, and to quantify gene transcript abundance at different developmental stages. The transcript profiles throughout red kiwifruit development were constructed and analyzed, with a focus on the biosynthesis and metabolism of compounds such as phytohormones, sugars, starch and L-ascorbic acid, which are indispensable for the development and formation of quality fruit. Candidate genes for these pathways were identified through MapMan and phylogenetic analysis. The transcript levels of genes involved in sucrose and starch metabolism were consistent with the change in soluble sugar and starch content throughout kiwifruit development. The metabolism of L-ascorbic acid was very active, primarily through the L-galactose pathway. The genes responsible for the accumulation of anthocyanin in red kiwifruit were identified, and their expression levels were investigated during kiwifruit development. This survey of gene expression during kiwifruit development paves the way for further investigation of the development of this uniquely colored and nutritious fruit and reveals which factors are needed for high quality fruit formation. This transcriptome data and its analysis will be useful for improving kiwifruit genome annotation, for basic fruit molecular biology research, and for kiwifruit breeding and improvement. PMID:26301713
Tiengwe, Calvin; Marcello, Lucio; Farr, Helen; Dickens, Nicholas; Kelly, Steven; Swiderski, Michal; Vaughan, Diane; Gull, Keith; Barry, J. David; Bell, Stephen D.; McCulloch, Richard
2012-01-01
Summary Identification of replication initiation sites, termed origins, is a crucial step in understanding genome transmission in any organism. Transcription of the Trypanosoma brucei genome is highly unusual, with each chromosome comprising a few discrete transcription units. To understand how DNA replication occurs in the context of such organization, we have performed genome-wide mapping of the binding sites of the replication initiator ORC1/CDC6 and have identified replication origins, revealing that both localize to the boundaries of the transcription units. A remarkably small number of active origins is seen, whose spacing is greater than in any other eukaryote. We show that replication and transcription in T. brucei have a profound functional overlap, as reducing ORC1/CDC6 levels leads to genome-wide increases in mRNA levels arising from the boundaries of the transcription units. In addition, ORC1/CDC6 loss causes derepression of silent Variant Surface Glycoprotein genes, which are critical for host immune evasion. PMID:22840408
Ribosome profiling-guided depletion of an mRNA increases cell growth rate and protein secretion
NASA Astrophysics Data System (ADS)
Kallehauge, Thomas Beuchert; Li, Shangzhong; Pedersen, Lasse Ebdrup; Ha, Tae Kwang; Ley, Daniel; Andersen, Mikael Rørdam; Kildegaard, Helene Faustrup; Lee, Gyun Min; Lewis, Nathan E.
2017-01-01
Recombinant protein production coopts the host cell machinery to provide high protein yields of industrial enzymes or biotherapeutics. However, since protein translation is energetically expensive and tightly controlled, it is unclear if highly expressed recombinant genes are translated as efficiently as host genes. Furthermore, it is unclear how the high expression impacts global translation. Here, we present the first genome-wide view of protein translation in an IgG-producing CHO cell line, measured with ribosome profiling. Through this we found that our recombinant mRNAs were translated as efficiently as the host cell transcriptome, and sequestered up to 15% of the total ribosome occupancy. During cell culture, changes in recombinant mRNA translation were consistent with changes in transcription, demonstrating that transcript levels influence specific productivity. Using this information, we identified the unnecessary resistance marker NeoR to be a highly transcribed and translated gene. Through siRNA knock-down of NeoR, we improved the production- and growth capacity of the host cell. Thus, ribosomal profiling provides valuable insights into translation in CHO cells and can guide efforts to enhance protein production.
Transcriptome of interstitial cells of Cajal reveals unique and selective gene signatures
Park, Paul J.; Fuchs, Robert; Wei, Lai; Jorgensen, Brian G.; Redelman, Doug; Ward, Sean M.; Sanders, Kenton M.
2017-01-01
Transcriptome-scale data can reveal essential clues into understanding the underlying molecular mechanisms behind specific cellular functions and biological processes. Transcriptomics is a continually growing field of research utilized in biomarker discovery. The transcriptomic profile of interstitial cells of Cajal (ICC), which serve as slow-wave electrical pacemakers for gastrointestinal (GI) smooth muscle, has yet to be uncovered. Using copGFP-labeled ICC mice and flow cytometry, we isolated ICC populations from the murine small intestine and colon and obtained their transcriptomes. In analyzing the transcriptome, we identified a unique set of ICC-restricted markers including transcription factors, epigenetic enzymes/regulators, growth factors, receptors, protein kinases/phosphatases, and ion channels/transporters. This analysis provides new and unique insights into the cellular and biological functions of ICC in GI physiology. Additionally, we constructed an interactive ICC genome browser (http://med.unr.edu/physio/transcriptome) based on the UCSC genome database. To our knowledge, this is the first online resource that provides a comprehensive library of all known genetic transcripts expressed in primary ICC. Our genome browser offers a new perspective into the alternative expression of genes in ICC and provides a valuable reference for future functional studies. PMID:28426719
Reanalysis of RNA-Sequencing Data Reveals Several Additional Fusion Genes with Multiple Isoforms
Kangaspeska, Sara; Hultsch, Susanne; Edgren, Henrik; Nicorici, Daniel; Murumägi, Astrid; Kallioniemi, Olli
2012-01-01
RNA-sequencing and tailored bioinformatic methodologies have paved the way for identification of expressed fusion genes from the chaotic genomes of solid tumors. We have recently successfully exploited RNA-sequencing for the discovery of 24 novel fusion genes in breast cancer. Here, we demonstrate the importance of continuous optimization of the bioinformatic methodology for this purpose, and report the discovery and experimental validation of 13 additional fusion genes from the same samples. Integration of copy number profiling with the RNA-sequencing results revealed that the majority of the gene fusions were promoter-donating events that occurred at copy number transition points or involved high-level DNA-amplifications. Sequencing of genomic fusion break points confirmed that DNA-level rearrangements underlie selected fusion transcripts. Furthermore, a significant portion (>60%) of the fusion genes were alternatively spliced. This illustrates the importance of reanalyzing sequencing data as gene definitions change and bioinformatic methods improve, and highlights the previously unforeseen isoform diversity among fusion transcripts. PMID:23119097
Reanalysis of RNA-sequencing data reveals several additional fusion genes with multiple isoforms.
Kangaspeska, Sara; Hultsch, Susanne; Edgren, Henrik; Nicorici, Daniel; Murumägi, Astrid; Kallioniemi, Olli
2012-01-01
RNA-sequencing and tailored bioinformatic methodologies have paved the way for identification of expressed fusion genes from the chaotic genomes of solid tumors. We have recently successfully exploited RNA-sequencing for the discovery of 24 novel fusion genes in breast cancer. Here, we demonstrate the importance of continuous optimization of the bioinformatic methodology for this purpose, and report the discovery and experimental validation of 13 additional fusion genes from the same samples. Integration of copy number profiling with the RNA-sequencing results revealed that the majority of the gene fusions were promoter-donating events that occurred at copy number transition points or involved high-level DNA-amplifications. Sequencing of genomic fusion break points confirmed that DNA-level rearrangements underlie selected fusion transcripts. Furthermore, a significant portion (>60%) of the fusion genes were alternatively spliced. This illustrates the importance of reanalyzing sequencing data as gene definitions change and bioinformatic methods improve, and highlights the previously unforeseen isoform diversity among fusion transcripts.
Genome-wide expression profile of first trimester villous and extravillous human trophoblast cells
Apps, R.; Sharkey, A.; Gardner, L.; Male, V.; Trotter, M.; Miller, N.; North, R.; Founds, S.; Moffett, A.
2011-01-01
We have examined the transcriptional changes associated with differentiation from villous to extravillous trophoblast using a whole genome microarray. Villous trophoblast (VT) is in contact with maternal blood and mediates nutrient exchange whereas extravillous trophoblast (EVT) invades the decidua and remodels uterine arteries. Using highly purified first trimester trophoblast we identified over 3000 transcripts that are differentially expressed. Many of these transcripts represent novel functions and pathways that show co-ordinated up-regulation in VT or EVT. In addition we identify new players in established functions such as migration, immune modulation and cytokine or angiogenic factor secretion by EVT. The transition from VT to EVT is also characterised by alterations in transcription factors such as STAT4 and IRF9, which may co-ordinate these changes. Transcripts encoding several members of the immunoglobulin-superfamily, which are normally expressed on leukocytes, were highly transcribed in EVT but not expressed as protein, indicating specific control of translation in EVT. Interactions of trophoblast with decidual leukocytes are involved in regulating EVT invasion. We show that decidual T-cells, macrophages and NK cells express the inhibitory collagen receptor LAIR-1 and that EVT secrete LAIR-2, which can block this interaction. This represents a new mechanism by which EVT can modulate leukocyte function in the decidua. Since LAIR-2 is detectable in the urine of pregnant, but not non-pregnant women, trophoblast-derived LAIR-2 may also have systemic effects during pregnancy. PMID:21075446
Castaneda, Francisco; Rosin-Steiner, Sigrid; Jung, Klaus
2006-12-21
We previously found that ethanol at millimolar level (1 mM) activates the expression of transcription factors with subsequent regulation of apoptotic genes in human hepatocellular carcinoma (HCC) HepG2 cells. However, the role of ethanol on the expression of genes implicated in transcriptional and translational processes remains unknown. Therefore, the aim of this study was to characterize the effect of low concentration of ethanol on gene expression profiling in HepG2 cells using cDNA microarrays with especial interest in genes with transcriptional and translational function. The gene expression pattern observed in the ethanol-treated HepG2 cells revealed a relatively similar pattern to that found in the untreated control cells. The pairwise comparison analysis demonstrated four significantly up-regulated (COBRA1, ITGB4, STAU2, and HMGN3) genes and one down-regulated (ANK3) gene. All these genes exert their function on transcriptional and translational processes and until now none of these genes have been associated with ethanol. This functional genomic analysis demonstrates the reported interaction between ethanol and ethanol-regulated genes. Moreover, it confirms the relationship between ethanol-regulated genes and various signaling pathways associated with ethanol-induced apoptosis. The data presented in this study represents an important contribution toward the understanding of the molecular mechanisms of ethanol at low concentration in HepG2 cells, a HCC-derived cell line.
Castaneda, Francisco; Rosin-Steiner, Sigrid; Jung, Klaus
2007-01-01
We previously found that ethanol at millimolar level (1 mM) activates the expression of transcription factors with subsequent regulation of apoptotic genes in human hepatocellular carcinoma (HCC) HepG2 cells. However, the role of ethanol on the expression of genes implicated in transcriptional and translational processes remains unknown. Therefore, the aim of this study was to characterize the effect of low concentration of ethanol on gene expression profiling in HepG2 cells using cDNA microarrays with especial interest in genes with transcriptional and translational function. The gene expression pattern observed in the ethanol-treated HepG2 cells revealed a relatively similar pattern to that found in the untreated control cells. The pairwise comparison analysis demonstrated four significantly up-regulated (COBRA1, ITGB4, STAU2, and HMGN3) genes and one down-regulated (ANK3) gene. All these genes exert their function on transcriptional and translational processes and until now none of these genes have been associated with ethanol. This functional genomic analysis demonstrates the reported interaction between ethanol and ethanol-regulated genes. Moreover, it confirms the relationship between ethanol-regulated genes and various signaling pathways associated with ethanol-induced apoptosis. The data presented in this study represents an important contribution toward the understanding of the molecular mechanisms of ethanol at low concentration in HepG2 cells, a HCC-derived cell line. PMID:17211498
Kamber, Tim; Buchmann, Jan P; Pothier, Joël F; Smits, Theo H M; Wicker, Thomas; Duffy, Brion
2016-02-17
The molecular basis of resistance and susceptibility of host plants to fire blight, a major disease threat to pome fruit production globally, is largely unknown. RNA-sequencing data from challenged and mock-inoculated flowers were analyzed to assess the susceptible response of apple to the fire blight pathogen Erwinia amylovora. In presence of the pathogen 1,080 transcripts were differentially expressed at 48 h post inoculation. These included putative disease resistance, stress, pathogen related, general metabolic, and phytohormone related genes. Reads, mapped to regions on the apple genome where no genes were assigned, were used to identify potential novel genes and open reading frames. To identify transcripts specifically expressed in response to E. amylovora, RT-PCRs were conducted and compared to the expression patterns of the fire blight biocontrol agent Pantoea vagans strain C9-1, another apple pathogen Pseudomonas syringae pv. papulans, and mock inoculated apple flowers. This led to the identification of a peroxidase superfamily gene that was lower expressed in response to E. amylovora suggesting a potential role in the susceptibility response. Overall, this study provides the first transcriptional profile by RNA-seq of the host plant during fire blight disease and insights into the response of susceptible apple plants to E. amylovora.
Kamber, Tim; Buchmann, Jan P.; Pothier, Joël F.; Smits, Theo H. M.; Wicker, Thomas; Duffy, Brion
2016-01-01
The molecular basis of resistance and susceptibility of host plants to fire blight, a major disease threat to pome fruit production globally, is largely unknown. RNA-sequencing data from challenged and mock-inoculated flowers were analyzed to assess the susceptible response of apple to the fire blight pathogen Erwinia amylovora. In presence of the pathogen 1,080 transcripts were differentially expressed at 48 h post inoculation. These included putative disease resistance, stress, pathogen related, general metabolic, and phytohormone related genes. Reads, mapped to regions on the apple genome where no genes were assigned, were used to identify potential novel genes and open reading frames. To identify transcripts specifically expressed in response to E. amylovora, RT-PCRs were conducted and compared to the expression patterns of the fire blight biocontrol agent Pantoea vagans strain C9-1, another apple pathogen Pseudomonas syringae pv. papulans, and mock inoculated apple flowers. This led to the identification of a peroxidase superfamily gene that was lower expressed in response to E. amylovora suggesting a potential role in the susceptibility response. Overall, this study provides the first transcriptional profile by RNA-seq of the host plant during fire blight disease and insights into the response of susceptible apple plants to E. amylovora. PMID:26883568
Hemschemeier, Anja; Casero, David; Liu, Bensheng; Benning, Christoph; Pellegrini, Matteo; Happe, Thomas; Merchant, Sabeeha S.
2013-01-01
Anaerobiosis is a stress condition for aerobic organisms and requires extensive acclimation responses. We used RNA-Seq for a whole-genome view of the acclimation of Chlamydomonas reinhardtii to anoxic conditions imposed simultaneously with transfer to the dark. Nearly 1.4 × 103 genes were affected by hypoxia. Comparing transcript profiles from early (hypoxic) with those from late (anoxic) time points indicated that cells activate oxidative energy generation pathways before employing fermentation. Probable substrates include amino acids and fatty acids (FAs). Lipid profiling of the C. reinhardtii cells revealed that they degraded FAs but also accumulated triacylglycerols (TAGs). In contrast with N-deprived cells, the TAGs in hypoxic cells were enriched in desaturated FAs, suggesting a distinct pathway for TAG accumulation. To distinguish transcriptional responses dependent on COPPER RESPONSE REGULATOR1 (CRR1), which is also involved in hypoxic gene regulation, we compared the transcriptomes of crr1 mutants and complemented strains. In crr1 mutants, ∼40 genes were aberrantly regulated, reaffirming the importance of CRR1 for the hypoxic response, but indicating also the contribution of additional signaling strategies to account for the remaining differentially regulated transcripts. Based on transcript patterns and previous results, we conclude that nitric oxide–dependent signaling cascades operate in anoxic C. reinhardtii cells. PMID:24014546
Guan, Zhiyong; Feng, Yitong; Song, Aiping; Shi, Xiaomeng; Mao, Yachao; Chen, Sumei; Jiang, Jiafu; Ding, Lian; Chen, Fadi
2017-01-01
Chrysanthemum crassum is a decaploid species of Chrysanthemum with high stress tolerance that allows survival under salinity stress while maintaining a relatively ideal growth rate. We previously recorded morphological changes after salt treatment, such as the expansion of leaf cells. To explore the underlying salinity tolerance mechanisms, we used an Illumina platform and obtained three sequencing libraries from samples collected after 0 h, 12 h and 24 h of salt treatment. Following de novo assembly, 154,944 transcripts were generated, and 97,833 (63.14%) transcripts were annotated, including 55 Gene Ontology (GO) terms and 128 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. The expression profile of C. crassum was globally altered after salt treatment. We selected functional genes and pathways that may contribute to salinity tolerance and identified some factors involved in the salinity tolerance strategies of C. crassum, such as signal transduction, transcription factors and plant hormone regulation, enhancement of energy metabolism, functional proteins and osmolyte synthesis, reactive oxygen species (ROS) scavenging, photosystem protection and recovery, and cell wall protein modifications. Forty-six genes were selected for quantitative real-time polymerase chain reaction detection, and their expression patterns were shown to be consistent with the changes in their transcript abundance determined by RNA sequencing. PMID:28437448
Kuang, Zheng; Ji, Zhicheng; Boeke, Jef D; Ji, Hongkai
2018-01-09
Biological processes are usually associated with genome-wide remodeling of transcription driven by transcription factors (TFs). Identifying key TFs and their spatiotemporal binding patterns are indispensable to understanding how dynamic processes are programmed. However, most methods are designed to predict TF binding sites only. We present a computational method, dynamic motif occupancy analysis (DynaMO), to infer important TFs and their spatiotemporal binding activities in dynamic biological processes using chromatin profiling data from multiple biological conditions such as time-course histone modification ChIP-seq data. In the first step, DynaMO predicts TF binding sites with a random forests approach. Next and uniquely, DynaMO infers dynamic TF binding activities at predicted binding sites using their local chromatin profiles from multiple biological conditions. Another landmark of DynaMO is to identify key TFs in a dynamic process using a clustering and enrichment analysis of dynamic TF binding patterns. Application of DynaMO to the yeast ultradian cycle, mouse circadian clock and human neural differentiation exhibits its accuracy and versatility. We anticipate DynaMO will be generally useful for elucidating transcriptional programs in dynamic processes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Lee, Soohyun; Seo, Chae Hwa; Alver, Burak Han; Lee, Sanghyuk; Park, Peter J
2015-09-03
RNA-seq has been widely used for genome-wide expression profiling. RNA-seq data typically consists of tens of millions of short sequenced reads from different transcripts. However, due to sequence similarity among genes and among isoforms, the source of a given read is often ambiguous. Existing approaches for estimating expression levels from RNA-seq reads tend to compromise between accuracy and computational cost. We introduce a new approach for quantifying transcript abundance from RNA-seq data. EMSAR (Estimation by Mappability-based Segmentation And Reclustering) groups reads according to the set of transcripts to which they are mapped and finds maximum likelihood estimates using a joint Poisson model for each optimal set of segments of transcripts. The method uses nearly all mapped reads, including those mapped to multiple genes. With an efficient transcriptome indexing based on modified suffix arrays, EMSAR minimizes the use of CPU time and memory while achieving accuracy comparable to the best existing methods. EMSAR is a method for quantifying transcripts from RNA-seq data with high accuracy and low computational cost. EMSAR is available at https://github.com/parklab/emsar.
Cognitive-behavioral stress management reverses anxiety-related leukocyte transcriptional dynamics
Antoni, Michael H.; Lutgendorf, Susan K.; Blomberg, Bonnie; Carver, Charles S.; Lechner, Suzanne; Diaz, Alain; Stagl, Jamie; Arevalo, Jesusa M.G.; Cole, Steven W.
2011-01-01
Background Chronic threat and anxiety are associated with pro-inflammatory transcriptional profiles in circulating leukocytes, but the causal direction of that relationship has not been established. This study tested whether a Cognitive-Behavioral Stress Management (CBSM) intervention targeting negative affect and cognition might counteract anxiety-related transcriptional alterations in people confronting a major medical threat. Methods 199 women undergoing primary treatment of Stage 0–III breast cancer were randomized to a 10-week CBSM protocol or an active control condition. 79 provided peripheral blood leukocyte samples for genome-wide transcriptional profiling and bioinformatic analyses at baseline, 6-, and 12-month follow-ups. Results Baseline negative affect was associated with > 50% differential expression of 201 leukocyte transcripts, including up-regulated expression of pro-inflammatory and metastasis-related genes. CBSM altered leukocyte expression of 91 genes by > 50% at follow-up (Group × Time interaction), including down-regulation of pro-inflammatory and metastasis-related genes and up-regulation of Type I interferon response genes. Promoter-based bioinformatic analyses implicated decreased activity of NF-κB/Rel and GATA family transcription factors and increased activity of Interferon Response Factors and the Glucocorticoid Receptor (GR) as potential mediators of CBSM-induced transcriptional alterations. Conclusions In early stage breast cancer patients, a 10-week CBSM intervention can reverse anxiety-related up-regulation of pro-inflammatory gene expression in circulating leukocytes. These findings clarify the molecular signaling pathways by which behavioral interventions can influence physical health and alter peripheral inflammatory processes that may reciprocally affect brain affective and cognitive processes. PMID:22088795
Kuang, Jian-Fei; Chen, Jian-Ye; Liu, Xun-Cheng; Han, Yan-Chao; Xiao, Yun-Yi; Shan, Wei; Tang, Yang; Wu, Ke-Qiang; He, Jun-Xian; Lu, Wang-Jin
2017-04-01
Fruit ripening is a complex, genetically programmed process involving the action of critical transcription factors (TFs). Despite the established significance of dehydration-responsive element binding (DREB) TFs in plant abiotic stress responses, the involvement of DREBs in fruit ripening is yet to be determined. Here, we identified four genes encoding ripening-regulated DREB TFs in banana (Musa acuminata), MaDREB1, MaDREB2, MaDREB3, and MaDREB4, and demonstrated that they play regulatory roles in fruit ripening. We showed that MaDREB1-MaDREB4 are nucleus-localized, induced by ethylene and encompass transcriptional activation activities. We performed a genome-wide chromatin immunoprecipitation and high-throughput sequencing (ChIP-Seq) experiment for MaDREB2 and identified 697 genomic regions as potential targets of MaDREB2. MaDREB2 binds to hundreds of loci with diverse functions and its binding sites are distributed in the promoter regions proximal to the transcriptional start site (TSS). Most of the MaDREB2-binding targets contain the conserved (A/G)CC(G/C)AC motif and MaDREB2 appears to directly regulate the expression of a number of genes involved in fruit ripening. In combination with transcriptome profiling (RNA sequencing) data, our results indicate that MaDREB2 may serve as both transcriptional activator and repressor during banana fruit ripening. In conclusion, our study suggests a hierarchical regulatory model of fruit ripening in banana and that the MaDREB TFs may act as transcriptional regulators in the regulatory network. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Lavysh, Daria; Sokolova, Maria; Slashcheva, Marina; Förstner, Konrad U; Severinov, Konstantin
2017-02-14
Bacteriophage AR9 is a recently sequenced jumbo phage that encodes two multisubunit RNA polymerases. Here we investigated the AR9 transcription strategy and the effect of AR9 infection on the transcription of its host, Bacillus subtilis Analysis of whole-genome transcription revealed early, late, and continuously expressed AR9 genes. Alignment of sequences upstream of the 5' ends of AR9 transcripts revealed consensus sequences that define early and late phage promoters. Continuously expressed AR9 genes have both early and late promoters in front of them. Early AR9 transcription is independent of protein synthesis and must be determined by virion RNA polymerase injected together with viral DNA. During infection, the overall amount of host mRNAs is significantly decreased. Analysis of relative amounts of host transcripts revealed notable differences in the levels of some mRNAs. The physiological significance of up- or downregulation of host genes for AR9 phage infection remains to be established. AR9 infection is significantly affected by rifampin, an inhibitor of host RNA polymerase transcription. The effect is likely caused by the antibiotic-induced killing of host cells, while phage genome transcription is solely performed by viral RNA polymerases. IMPORTANCE Phages regulate the timing of the expression of their own genes to coordinate processes in the infected cell and maximize the release of viral progeny. Phages also alter the levels of host transcripts. Here we present the results of a temporal analysis of the host and viral transcriptomes of Bacillus subtilis infected with a giant phage, AR9. We identify viral promoters recognized by two virus-encoded RNA polymerases that are a unique feature of the phiKZ-related group of phages to which AR9 belongs. Our results set the stage for future analyses of highly unusual RNA polymerases encoded by AR9 and other phiKZ-related phages. Copyright © 2017 Lavysh et al.
King, Benjamin R; Samacoits, Aubin; Eisenhauer, Philip L; Ziegler, Christopher M; Bruce, Emily A; Zenklusen, Daniel; Zimmer, Christophe; Mueller, Florian; Botten, Jason
2018-06-15
Lymphocytic choriomeningitis mammarenavirus (LCMV) is an enveloped, negative-strand RNA virus that causes serious disease in humans but establishes an asymptomatic, lifelong infection in reservoir rodents. Different models have been proposed to describe how arenaviruses regulate the replication and transcription of their bisegmented, single-stranded RNA genomes, particularly during persistent infection. However, these models were based largely on viral RNA profiling data derived from entire populations of cells. To better understand LCMV replication and transcription at the single-cell level, we established a high-throughput, single-molecule fluorescence in situ hybridization (smFISH) image acquisition and analysis pipeline and examined viral RNA species at discrete time points from virus entry through the late stages of persistent infection in vitro We observed the transcription of viral nucleoprotein and polymerase mRNAs from the incoming S and L segment genomic RNAs, respectively, within 1 h of infection, whereas the transcription of glycoprotein mRNA from the S segment antigenome required ∼4 to 6 h. This confirms the temporal separation of viral gene expression expected due to the ambisense coding strategy of arenaviruses and also suggests that antigenomic RNA contained in virions is not transcriptionally active upon entry. Viral replication and transcription peaked at 36 h postinfection, followed by a progressive loss of viral RNAs over the next several days. During persistence, the majority of cells showed repeating cyclical waves of viral transcription and replication followed by the clearance of viral RNA. Thus, our data support a model of LCMV persistence whereby infected cells can spontaneously clear infection and become reinfected by viral reservoir cells that remain in the population. IMPORTANCE Arenaviruses are human pathogens that can establish asymptomatic, lifelong infections in their rodent reservoirs. Several models have been proposed to explain how arenavirus spread is restricted within host rodents, including the periodic accumulation and loss of replication-competent, but transcriptionally incompetent, viral genomes. A limitation of previous studies was the inability to enumerate viral RNA species at the single-cell level. We developed a high-throughput, smFISH assay and used it to quantitate lymphocytic choriomeningitis mammarenavirus (LCMV) replicative and transcriptional RNA species in individual cells at distinct time points following infection. Our findings support a model whereby productively infected cells can clear infection, including viral RNAs and antigen, and later be reinfected. This information improves our understanding of the timing and possible regulation of LCMV genome replication and transcription during infection. Importantly, the smFISH assay and data analysis pipeline developed here is easily adaptable to other RNA viruses. Copyright © 2018 American Society for Microbiology.
2014-01-01
Background Basic leucine zipper (bZIP) transcription factor gene family is one of the largest and most diverse families in plants. Current studies have shown that the bZIP proteins regulate numerous growth and developmental processes and biotic and abiotic stress responses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant bZIP family members remains very limited. Results We identified 55 bZIP transcription factor-encoding genes in the grapevine (Vitis vinifera) genome, and divided them into 10 groups according to the phylogenetic relationship with those in Arabidopsis. The chromosome distribution and the collinearity analyses suggest that expansion of the grapevine bZIP (VvbZIP) transcription factor family was greatly contributed by the segment/chromosomal duplications, which may be associated with the grapevine genome fusion events. Nine intron/exon structural patterns within the bZIP domain and the additional conserved motifs were identified among all VvbZIP proteins, and showed a high group-specificity. The predicted specificities on DNA-binding domains indicated that some highly conserved amino acid residues exist across each major group in the tree of land plant life. The expression patterns of VvbZIP genes across the grapevine gene expression atlas, based on microarray technology, suggest that VvbZIP genes are involved in grapevine organ development, especially seed development. Expression analysis based on qRT-PCR indicated that VvbZIP genes are extensively involved in drought- and heat-responses, with possibly different mechanisms. Conclusions The genome-wide identification, chromosome organization, gene structures, evolutionary and expression analyses of grapevine bZIP genes provide an overall insight of this gene family and their potential involvement in growth, development and stress responses. This will facilitate further research on the bZIP gene family regarding their evolutionary history and biological functions. PMID:24725365
2013-01-01
Background The brown planthopper (Nilaparvata lugens) is one of the most serious rice plant pests in Asia. N. lugens causes extensive rice damage by sucking rice phloem sap, which results in stunted plant growth and the transmission of plant viruses. Despite the importance of this insect pest, little is known about the immunological mechanisms occurring in this hemimetabolous insect species. Results In this study, we performed a genome- and transcriptome-wide analysis aiming at the immune-related genes. The transcriptome datasets include the N. lugens intestine, the developmental stage, wing formation, and sex-specific expression information that provided useful gene expression sequence data for the genome-wide analysis. As a result, we identified a large number of genes encoding N. lugens pattern recognition proteins, modulation proteins in the prophenoloxidase (proPO) activating cascade, immune effectors, and the signal transduction molecules involved in the immune pathways, including the Toll, Immune deficiency (Imd) and Janus kinase signal transducers and activators of transcription (JAK-STAT) pathways. The genome scale analysis revealed detailed information of the gene structure, distribution and transcription orientations in scaffolds. A comparison of the genome-available hemimetabolous and metabolous insect species indicate the differences in the immune-related gene constitution. We investigated the gene expression profiles with regards to how they responded to bacterial infections and tissue, as well as development and sex expression specificity. Conclusions The genome- and transcriptome-wide analysis of immune-related genes including pattern recognition and modulation molecules, immune effectors, and the signal transduction molecules involved in the immune pathways is an important step in determining the overall architecture and functional network of the immune components in N. lugens. Our findings provide the comprehensive gene sequence resource and expression profiles of the immune-related genes of N. lugens, which could facilitate the understanding of the innate immune mechanisms in the hemimetabolous insect species. These data give insight into clarifying the potential functional roles of the immune-related genes involved in the biological processes of development, reproduction, and virus transmission in N. lugens. PMID:23497397
Transcriptome profiling of Zymomonas mobilis under furfural stress.
He, Ming-xiong; Wu, Bo; Shui, Zong-xia; Hu, Qi-chun; Wang, Wen-guo; Tan, Fu-rong; Tang, Xiao-yu; Zhu, Qi-li; Pan, Ke; Li, Qing; Su, Xiao-hong
2012-07-01
Furfural from lignocellulosic hydrolysates is the prevalent inhibitor to microorganisms during cellulosic ethanol production, but the molecular mechanisms of tolerance to this inhibitor in Zymomonas mobilis are still unclear. In this study, genome-wide transcriptional responses to furfural were investigated in Z. mobilis using microarray analysis. We found that 433 genes were differentially expressed in response to furfural. Furfural up- or down-regulated genes related to cell wall/membrane biogenesis, metabolism, and transcription. However, furfural has a subtle negative effect on Entner-Doudoroff pathway mRNAs. Our results revealed that furfural had effects on multiple aspects of cellular metabolism at the transcriptional level and that membrane might play important roles in response to furfural. This research has provided insights into the molecular response to furfural in Z. mobilis, and it will be helpful to construct more furfural-resistant strains for cellulosic ethanol production.
Basnet, Ram Kumar; Moreno-Pachon, Natalia; Lin, Ke; Bucher, Johan; Visser, Richard G F; Maliepaard, Chris; Bonnema, Guusje
2013-12-01
Brassica seeds are important as basic units of plant growth and sources of vegetable oil. Seed development is regulated by many dynamic metabolic processes controlled by complex networks of spatially and temporally expressed genes. We conducted a global microarray gene co-expression analysis by measuring transcript abundance of developing seeds from two diverse B. rapa morphotypes: a pak choi (leafy-type) and a yellow sarson (oil-type), and two of their doubled haploid (DH) progenies, (1) to study the timing of metabolic processes in developing seeds, (2) to explore the major transcriptional differences in developing seeds of the two morphotypes, and (3) to identify the optimum stage for a genetical genomics study in B. rapa seed. Seed developmental stages were similar in developing seeds of pak choi and yellow sarson of B. rapa; however, the colour of embryo and seed coat differed among these two morphotypes. In this study, most transcriptional changes occurred between 25 and 35 DAP, which shows that the timing of seed developmental processes in B. rapa is at later developmental stages than in the related species B. napus. Using a Weighted Gene Co-expression Network Analysis (WGCNA), we identified 47 "gene modules", of which 27 showed a significant association with temporal and/or genotypic variation. An additional hierarchical cluster analysis identified broad spectra of gene expression patterns during seed development. The predominant variation in gene expression was according to developmental stages rather than morphotype differences. Since lipids are the major storage compounds of Brassica seeds, we investigated in more detail the regulation of lipid metabolism. Four co-regulated gene clusters were identified with 17 putative cis-regulatory elements predicted in their 1000 bp upstream region, either specific or common to different lipid metabolic pathways. This is the first study of genome-wide profiling of transcript abundance during seed development in B. rapa. The identification of key physiological events, major expression patterns, and putative cis-regulatory elements provides useful information to construct gene regulatory networks in B. rapa developing seeds and provides a starting point for a genetical genomics study of seed quality traits.
Temporal Expression Profiling Identifies Pathways Mediating Effect of Causal Variant on Phenotype
Gupta, Saumya; Radhakrishnan, Aparna; Raharja-Liu, Pandu; Lin, Gen; Steinmetz, Lars M.; Gagneur, Julien; Sinha, Himanshu
2015-01-01
Even with identification of multiple causal genetic variants for common human diseases, understanding the molecular processes mediating the causal variants’ effect on the disease remains a challenge. This understanding is crucial for the development of therapeutic strategies to prevent and treat disease. While static profiling of gene expression is primarily used to get insights into the biological bases of diseases, it makes differentiating the causative from the correlative effects difficult, as the dynamics of the underlying biological processes are not monitored. Using yeast as a model, we studied genome-wide gene expression dynamics in the presence of a causal variant as the sole genetic determinant, and performed allele-specific functional validation to delineate the causal effects of the genetic variant on the phenotype. Here, we characterized the precise genetic effects of a functional MKT1 allelic variant in sporulation efficiency variation. A mathematical model describing meiotic landmark events and conditional activation of MKT1 expression during sporulation specified an early meiotic role of this variant. By analyzing the early meiotic genome-wide transcriptional response, we demonstrate an MKT1-dependent role of novel modulators, namely, RTG1/3, regulators of mitochondrial retrograde signaling, and DAL82, regulator of nitrogen starvation, in additively effecting sporulation efficiency. In the presence of functional MKT1 allele, better respiration during early sporulation was observed, which was dependent on the mitochondrial retrograde regulator, RTG3. Furthermore, our approach showed that MKT1 contributes to sporulation independent of Puf3, an RNA-binding protein that steady-state transcription profiling studies have suggested to mediate MKT1-pleiotropic effects during mitotic growth. These results uncover interesting regulatory links between meiosis and mitochondrial retrograde signaling. In this study, we highlight the advantage of analyzing allele-specific transcriptional dynamics of mediating genes. Applications in higher eukaryotes can be valuable for inferring causal molecular pathways underlying complex dynamic processes, such as development, physiology and disease progression. PMID:26039065
Pervasive Transcription of a Herpesvirus Genome Generates Functionally Important RNAs
Canny, Susan P.; Reese, Tiffany A.; Johnson, L. Steven; Zhang, Xin; Kambal, Amal; Duan, Erning; Liu, Catherine Y.; Virgin, Herbert W.
2014-01-01
ABSTRACT Pervasive transcription is observed in a wide range of organisms, including humans, mice, and viruses, but the functional significance of the resulting transcripts remains uncertain. Current genetic approaches are often limited by their emphasis on protein-coding open reading frames (ORFs). We previously identified extensive pervasive transcription from the murine gammaherpesvirus 68 (MHV68) genome outside known ORFs and antisense to known genes (termed expressed genomic regions [EGRs]). Similar antisense transcripts have been identified in many other herpesviruses, including Kaposi’s sarcoma-associated herpesvirus and human and murine cytomegalovirus. Despite their prevalence, whether these RNAs have any functional importance in the viral life cycle is unknown, and one interpretation is that these are merely “noise” generated by functionally unimportant transcriptional events. To determine whether pervasive transcription of a herpesvirus genome generates RNA molecules that are functionally important, we used a strand-specific functional approach to target transcripts from thirteen EGRs in MHV68. We found that targeting transcripts from six EGRs reduced viral protein expression, proving that pervasive transcription can generate functionally important RNAs. We characterized transcripts emanating from EGRs 26 and 27 in detail using several methods, including RNA sequencing, and identified several novel polyadenylated transcripts that were enriched in the nuclei of infected cells. These data provide the first evidence of the functional importance of regions of pervasive transcription emanating from MHV68 EGRs. Therefore, studies utilizing mutation of a herpesvirus genome must account for possible effects on RNAs generated by pervasive transcription. PMID:24618256
Pan, Zhiqiang; Agarwal, Ameeta K; Xu, Tao; Feng, Qin; Baerson, Scott R; Duke, Stephen O; Rimando, Agnes M
2008-01-01
Background Pterostilbene, a naturally occurring phenolic compound produced by agronomically important plant genera such as Vitis and Vacciunium, is a phytoalexin exhibiting potent antifungal activity. Additionally, recent studies have demonstrated several important pharmacological properties associated with pterostilbene. Despite this, a systematic study of the effects of pterostilbene on eukaryotic cells at the molecular level has not been previously reported. Thus, the aim of the present study was to identify the cellular pathways affected by pterostilbene by performing transcript profiling studies, employing the model yeast Saccharomyces cerevisiae. Methods S. cerevisiae strain S288C was exposed to pterostilbene at the IC50 concentration (70 μM) for one generation (3 h). Transcript profiling experiments were performed on three biological replicate samples using the Affymetrix GeneChip Yeast Genome S98 Array. The data were analyzed using the statistical methods available in the GeneSifter microarray data analysis system. To validate the results, eleven differentially expressed genes were further examined by quantitative real-time RT-PCR, and S. cerevisiae mutant strains with deletions in these genes were analyzed for altered sensitivity to pterostilbene. Results Transcript profiling studies revealed that pterostilbene exposure significantly down-regulated the expression of genes involved in methionine metabolism, while the expression of genes involved in mitochondrial functions, drug detoxification, and transcription factor activity were significantly up-regulated. Additional analyses revealed that a large number of genes involved in lipid metabolism were also affected by pterostilbene treatment. Conclusion Using transcript profiling, we have identified the cellular pathways targeted by pterostilbene, an analog of resveratrol. The observed response in lipid metabolism genes is consistent with its known hypolipidemic properties, and the induction of mitochondrial genes is consistent with its demonstrated role in apoptosis in human cancer cell lines. Furthermore, our data show that pterostilbene has a significant effect on methionine metabolism, a previously unreported effect for this compound. PMID:18366703
TRANSFAC: an integrated system for gene expression regulation.
Wingender, E; Chen, X; Hehl, R; Karas, H; Liebich, I; Matys, V; Meinhardt, T; Prüss, M; Reuter, I; Schacherer, F
2000-01-01
TRANSFAC is a database on transcription factors, their genomic binding sites and DNA-binding profiles (http://transfac.gbf.de/TRANSFAC/). Its content has been enhanced, in particular by information about training sequences used for the construction of nucleotide matrices as well as by data on plant sites and factors. Moreover, TRANSFAC has been extended by two new modules: PathoDB provides data on pathologically relevant mutations in regulatory regions and transcription factor genes, whereas S/MARt DB compiles features of scaffold/matrix attached regions (S/MARs) and the proteins binding to them. Additionally, the databases TRANSPATH, about signal transduction, and CYTOMER, about organs and cell types, have been extended and are increasingly integrated with the TRANSFAC data sources.
Trichostatin A effects on gene expression in the protozoan parasite Entamoeba histolytica
Ehrenkaufer, Gretchen M; Eichinger, Daniel J; Singh, Upinder
2007-01-01
Background Histone modification regulates chromatin structure and influences gene expression associated with diverse biological functions including cellular differentiation, cancer, maintenance of genome architecture, and pathogen virulence. In Entamoeba, a deep-branching eukaryote, short chain fatty acids (SCFA) affect histone acetylation and parasite development. Additionally, a number of active histone modifying enzymes have been identified in the parasite genome. However, the overall extent of gene regulation tied to histone acetylation is not known. Results In order to identify the genome-wide effects of histone acetylation in regulating E. histolytica gene expression, we used whole-genome expression profiling of parasites treated with SCFA and Trichostatin A (TSA). Despite significant changes in histone acetylation patterns, exposure of parasites to SCFA resulted in minimal transcriptional changes (11 out of 9,435 genes transcriptionally regulated). In contrast, exposure to TSA, a more specific inhibitor of histone deacetylases, significantly affected transcription of 163 genes (122 genes upregulated and 41 genes downregulated). Genes modulated by TSA were not regulated by treatment with 5-Azacytidine, an inhibitor of DNA-methyltransferase, indicating that in E. histolytica the crosstalk between DNA methylation and histone modification is not substantial. However, the set of genes regulated by TSA overlapped substantially with genes regulated during parasite development: 73/122 genes upregulated by TSA exposure were upregulated in E. histolytica cysts (p-value = 6 × 10-53) and 15/41 genes downregulated by TSA exposure were downregulated in E. histolytica cysts (p-value = 3 × 10-7). Conclusion This work represents the first genome-wide analysis of histone acetylation and its effects on gene expression in E. histolytica. The data indicate that SCFAs, despite their ability to influence histone acetylation, have minimal effects on gene transcription in cultured parasites. In contrast, the effect of TSA on E. histolytica gene expression is more substantial and includes genes involved in the encystation pathway. These observations will allow further dissection of the effects of histone acetylation and the genetic pathways regulating stage conversion in this pathogenic parasite. PMID:17612405
Genome-wide analysis of miRNA and mRNA transcriptomes during amelogenesis.
Yin, Kaifeng; Hacia, Joseph G; Zhong, Zhe; Paine, Michael L
2014-11-19
In the rodent incisor during amelogenesis, as ameloblast cells transition from secretory stage to maturation stage, their morphology and transcriptome profiles change dramatically. Prior whole genome transcriptome analysis has given a broad picture of the molecular activities dominating both stages of amelogenesis, but this type of analysis has not included miRNA transcript profiling. In this study, we set out to document which miRNAs and corresponding target genes change significantly as ameloblasts transition from secretory- to maturation-stage amelogenesis. Total RNA samples from both secretory- and maturation-stage rat enamel organs were subjected to genome-wide miRNA and mRNA transcript profiling. We identified 59 miRNAs that were differentially expressed at the maturation stage relative to the secretory stage of enamel development (False Discovery Rate (FDR)<0.05, fold change (FC)≥1.8). In parallel, transcriptome profiling experiments identified 1,729 mRNA transcripts that were differentially expressed in the maturation stage compared to the secretory stage (FDR<0.05, FC≥1.8). Based on bioinformatics analyses, 5.8% (629 total) of these differentially expressed genes (DEGS) were highlighted as being the potential targets of 59 miRNAs that were differentially expressed in the opposite direction, in the same tissue samples. Although the number of predicted target DEGs was not higher than baseline expectations generated by examination of stably expressed miRNAs, Gene Ontology (GO) analysis showed that these 629 DEGS were enriched for ion transport, pH regulation, calcium handling, endocytotic, and apoptotic activities. Seven differentially expressed miRNAs (miR-21, miR-31, miR-488, miR-153, miR-135b, miR-135a and miR298) in secretory- and/or maturation-stage enamel organs were confirmed by in situ hybridization. Further, we used luciferase reporter assays to provide evidence that two of these differentially expressed miRNAs, miR-153 and miR-31, are potential regulators for their predicated target mRNAs, Lamp1 (miR-153) and Tfrc (miR-31). In conclusion, these data indicate that miRNAs exhibit a dynamic expression pattern during the transition from secretory-stage to maturation-stage tooth enamel formation. Although they represent only one of numerous mechanisms influencing gene activities, miRNAs specific to the maturation stage could be involved in regulating several key processes of enamel maturation by influencing mRNA stability and translation.
Bashiri, Asher; Heo, Hye J.; Ben-Avraham, Danny; Mazor, Moshe; Budagov, Temuri; Einstein, Francine H.; Atzmon, Gil
2014-01-01
Maternal obesity is a significant risk factor for development of both maternal and fetal metabolic complications. Increase in visceral fat and insulin resistance is a metabolic hallmark of pregnancy, yet little is known how obesity alters adipose cellular function and how this may contribute to pregnancy morbidities. We sought to identify alterations in genome-wide transcription expression in both visceral (omental) and abdominal subcutaneous fat deposits in pregnancy complicated by obesity. Visceral and abdominal subcutaneous fat deposits were collected from normal weight and obese pregnant women (n=4/group) at time of scheduled uncomplicated cesarean section. A genome-wide expression array (Affymetrix Human Exon 1.0 st platform), validated by quantitative real-time PCR, was utilized to establish the gene transcript expression profile in both visceral and abdominal subcutaneous fat in normal weight and obese pregnant women. Global alteration in gene expression was identified in pregnancy complicated by obesity. These regions of variations lead to identification of indolethylamine N-methyltransferase (INMT), tissue factor pathway inhibitor-2 (TFPI-2), and ephrin type-B receptor 6 (EPHB6), not previously associated with fat metabolism during pregnancy. In addition, subcutaneous fat of obese pregnant women demonstrated increased coding protein transcripts associated with apoptosis compared to lean counterparts. Global alteration of gene expression in adipose tissue may contribute to adverse pregnancy outcomes associated with obesity. PMID:24696292
Khosa, Jiffinvir S.; Lee, Robyn; Bräuning, Sophia; Lord, Janice; Pither-Joyce, Meeghan; McCallum, John; Macknight, Richard C.
2016-01-01
Researchers working on model plants have derived great benefit from developing genomic and genetic resources using ‘reference’ genotypes. Onion has a large and highly heterozygous genome making the sharing of germplasm and analysis of sequencing data complicated. To simplify the discovery and analysis of genes underlying important onion traits, we are promoting the use of the homozygous double haploid line ‘CUDH2107’ by the onion research community. In the present investigation, we performed transcriptome sequencing on vegetative and reproductive tissues of CUDH2107 to develop a multi-organ reference transcriptome catalogue. A total of 396 million 100 base pair paired reads was assembled using the Trinity pipeline, resulting in 271,665 transcript contigs. This dataset was analysed for gene ontology and transcripts were classified on the basis of putative biological processes, molecular function and cellular localization. Significant differences were observed in transcript expression profiles between different tissues. To demonstrate the utility of our CUDH2107 transcriptome catalogue for understanding the genetic and molecular basis of various traits, we identified orthologues of rice genes involved in male fertility and flower development. These genes provide an excellent starting point for studying the molecular regulation, and the engineering of reproductive traits. PMID:27861615
Francisco, Joel Celio; Dai, Qian; Luo, Zhuojuan; Wang, Yan; Chong, Roxanne Hui-Heng; Tan, Yee Joo; Xie, Wei; Lee, Guan-Huei; Lin, Chengqi
2017-10-01
Chronic hepatitis B virus (HBV) infection can lead to liver cirrhosis and hepatocellular carcinoma. HBV reactivation during or after chemotherapy is a potentially fatal complication for cancer patients with chronic HBV infection. Transcription of HBV is a critical intermediate step of the HBV life cycle. However, factors controlling HBV transcription remain largely unknown. Here, we found that different P-TEFb complexes are involved in the transcription of the HBV viral genome. Both BRD4 and the super elongation complex (SEC) bind to the HBV genome. The treatment of bromodomain inhibitor JQ1 stimulates HBV transcription and increases the occupancy of BRD4 on the HBV genome, suggesting the bromodomain-independent recruitment of BRD4 to the HBV genome. JQ1 also leads to the increased binding of SEC to the HBV genome, and SEC is required for JQ1-induced HBV transcription. These findings reveal a novel mechanism by which the HBV genome hijacks the host P-TEFb-containing complexes to promote its own transcription. Our findings also point out an important clinical implication, that is, the potential risk of HBV reactivation during therapy with a BRD4 inhibitor, such as JQ1 or its analogues, which are a potential treatment for acute myeloid leukemia. Copyright © 2017 American Society for Microbiology.
Bathige, S D N K; Umasuthan, Navaneethaiyer; Priyathilaka, Thanthrige Thiunuwan; Thulasitha, William Shanthakumar; Jayasinghe, J D H E; Wan, Qiang; Nam, Bo-Hye; Lee, Jehee
2017-08-30
Signal transducer and activator of transcription 2 (STAT2) is a key element that transduces signals from the cell membrane to the nucleus via the type I interferon-signaling pathway. Although the structural and functional aspects of STAT proteins are well studied in mammals, information on teleostean STATs is very limited. In this study, a STAT paralog, which is highly homologous to the STAT2 members, was identified from a commercially important fish species called rock bream and designated as RbSTAT2. The RbSTAT2 gene was characterized at complementary DNA (cDNA) and genomic sequence levels, and was found to possess structural features common with its mammalian counterparts. The complete cDNA sequence was distributed into 24 exons in the genomic sequence. The promoter proximal region was analyzed and found to contain potential transcription factor binding sites to regulate the transcription of RbSTAT2. Phylogenetic studies and comparative genomic structure organization revealed the distinguishable evolution for fish and other vertebrate STAT2 orthologs. Transcriptional quantification was performed by SYBR Green quantitative real-time PCR (qPCR) and the ubiquitous expression of RbSTAT2 transcripts was observed in all tissues analyzed from healthy fish, with a remarkably high expression in blood cells. Significantly (P<0.05) altered transcription of RbSTAT2 was detected after immune challenge experiments with viral (rock bream irido virus; RBIV), bacterial (Edwardsiella tarda and Streptococcus iniae), and immune stimulants (poly I:C and LPS). Antiviral potential was further confirmed by WST-1 assay, by measuring the viability of rock bream heart cells treated with RBIV. In addition, results of an in vitro challenge experiment signified the influence of rock bream interleukin-10 (RbIL-10) on transcription of RbSTAT2. Subcellular localization studies by transfection of pEGFP-N1/RbSTAT2 into rock bream heart cells revealed that the RbSTAT2 was usually located in the cytoplasm and translocated near to the nucleus upon poly I:C administration. Altogether, these findings suggest that RbSTAT2 is involved in various biologically crucial mechanisms, and provides immune protection to the rock bream. Copyright © 2017 Elsevier B.V. All rights reserved.
Comprehensive Transcriptome Analysis of Response to Nickel Stress in White Birch (Betula papyrifera)
Theriault, Gabriel; Michael, Paul; Nkongolo, Kabwe
2016-01-01
White birch (Betula papyrifera) is a dominant tree species of the Boreal Forest. Recent studies have shown that it is fairly resistant to heavy metal contamination, specifically to nickel. Knowledge of regulation of genes associated with metal resistance in higher plants is very sketchy. Availability and annotation of the dwarf birch (B. nana) enables the use of high throughout sequencing approaches to understanding responses to environmental challenges in other Betula species such as B. papyrifera. The main objectives of this study are to 1) develop and characterize the B. papyrifera transcriptome, 2) assess gene expression dynamics of B. papyrifera in response to nickel stress, and 3) describe gene function based on ontology. Nickel resistant and susceptible genotypes were selected and used for transcriptome analysis. A total of 208,058 trinity genes were identified and were assembled to 275,545 total trinity transcripts. The transcripts were mapped to protein sequences and based on best match; we annotated the B. papyrifera genes and assigned gene ontology. In total, 215,700 transcripts were annotated and were compared to the published B. nana genome. Overall, a genomic match for 61% transcripts with the reference genome was found. Expression profiles were generated and 62,587 genes were found to be significantly differentially expressed among the nickel resistant, susceptible, and untreated libraries. The main nickel resistance mechanism in B. papyrifera is a downregulation of genes associated with translation (in ribosome), binding, and transporter activities. Five candidate genes associated to nickel resistance were identified. They include Glutathione S–transferase, thioredoxin family protein, putative transmembrane protein and two Nramp transporters. These genes could be useful for genetic engineering of birch trees. PMID:27082755
Tadra-Sfeir, Michelle Z; Faoro, Helisson; Camilios-Neto, Doumit; Brusamarello-Santos, Liziane; Balsanelli, Eduardo; Weiss, Vinicius; Baura, Valter A; Wassem, Roseli; Cruz, Leonardo M; De Oliveira Pedrosa, Fábio; Souza, Emanuel M; Monteiro, Rose A
2015-01-01
Herbaspirillum seropedicae is a diazotrophic bacterium which associates endophytically with economically important gramineae. Flavonoids such as naringenin have been shown to have an effect on the interaction between H. seropedicae and its host plants. We used a high-throughput sequencing based method (RNA-Seq) to access the influence of naringenin on the whole transcriptome profile of H. seropedicae. Three hundred and four genes were downregulated and seventy seven were upregulated by naringenin. Data analysis revealed that genes related to bacterial flagella biosynthesis, chemotaxis and biosynthesis of peptidoglycan were repressed by naringenin. Moreover, genes involved in aromatic metabolism and multidrug transport efllux were actived.
Tahara, Kenichi; Takizawa, Makiko; Yamane, Arito; Osaki, Yohei; Ishizaki, Takuma; Mitsui, Takeki; Yokohama, Akihiko; Saitoh, Takayuki; Tsukamoto, Norifumi; Matsumoto, Morio; Murakami, Hirokazu; Nojima, Yoshihisa; Handa, Hiroshi
2017-08-01
B-cell lymphoma 6 (BCL6) attenuates DNA damage response (DDR) through gene repression and facilitates tolerance to genomic instability during immunoglobulin affinity maturation in germinal center (GC) B cells. Although BCL6 expression is repressed through normal differentiation of GC B cells into plasma cells, a recent study showed the ectopic expression of BCL6 in primary multiple myeloma (MM) cells. However, the functional roles of BCL6 in MM cells are largely unknown. Here, we report that overexpression of BCL6 in a MM cell line, KMS12PE, induced transcriptional repression of ataxia telangiectasia mutated (ATM), a DDR signaling kinase, which was associated with a reduction in γH2AX formation after DNA damage. In contrast, transcription of known targets of BCL6 in GC B cells was not affected, suggesting a cell type-specific function of BCL6. To further investigate the effects of BCL6 overexpression on the MM cell line, we undertook mRNA sequence analysis and found an upregulation in the genomic mutator activation-induced cytidine deaminase (AID) with alteration in the gene expression profile, which is suggestive of de-differentiation from plasma cells. Moreover, interleukin-6 exposure to KMS12PE led to upregulation of BCL6 and AID, downregulation of ATM, and attenuation of DDR, which were consistent with the effects of BCL6 overexpression in this MM cell line. Taken together, these results indicated that overexpression of BCL6 alters gene expression profile and confers decreased DDR in MM cells. This phenotypic change could be reproduced by interleukin-6 stimulation, suggesting an important role of external stimuli in inducing genomic instability, which is a hallmark of MM cells. © 2017 The Authors. Cancer Science published by John Wiley & Sons Australia, Ltd on behalf of Japanese Cancer Association.
Advances in Genetical Genomics of Plants
Joosen, R.V.L.; Ligterink, W.; Hilhorst, H.W.M.; Keurentjes, J.J.B.
2009-01-01
Natural variation provides a valuable resource to study the genetic regulation of quantitative traits. In quantitative trait locus (QTL) analyses this variation, captured in segregating mapping populations, is used to identify the genomic regions affecting these traits. The identification of the causal genes underlying QTLs is a major challenge for which the detection of gene expression differences is of major importance. By combining genetics with large scale expression profiling (i.e. genetical genomics), resulting in expression QTLs (eQTLs), great progress can be made in connecting phenotypic variation to genotypic diversity. In this review we discuss examples from human, mouse, Drosophila, yeast and plant research to illustrate the advances in genetical genomics, with a focus on understanding the regulatory mechanisms underlying natural variation. With their tolerance to inbreeding, short generation time and ease to generate large families, plants are ideal subjects to test new concepts in genetics. The comprehensive resources which are available for Arabidopsis make it a favorite model plant but genetical genomics also found its way to important crop species like rice, barley and wheat. We discuss eQTL profiling with respect to cis and trans regulation and show how combined studies with other ‘omics’ technologies, such as metabolomics and proteomics may further augment current information on transcriptional, translational and metabolomic signaling pathways and enable reconstruction of detailed regulatory networks. The fast developments in the ‘omics’ area will offer great potential for genetical genomics to elucidate the genotype-phenotype relationships for both fundamental and applied research. PMID:20514216
SiNoPsis: Single Nucleotide Polymorphisms selection and promoter profiling.
Boloc, Daniel; Rodríguez, Natalia; Gassó, Patricia; Abril, Josep F; Bernardo, Miquel; Lafuente, Amalia; Mas, Sergi
2017-09-14
The selection of a Single Nucleotide Polymorphism (SNP) using bibliographic methods can be a very time-consuming task. Moreover, a SNP selected in this way may not be easily visualized in its genomic context by a standard user hoping to correlate it with other valuable information. Here we propose a web form built on top of Circos that can assist SNP-centred screening, based on their location in the genome and the regulatory modules they can disrupt. Its use may allow researchers to prioritize SNPs in genotyping and disease studies. SiNoPsis is bundled as a web portal. It focuses on the different structures involved in the genomic expression of a gene, especially those found in the core promoter upstream region. These structures include transcription factor binding sites (for promoter and enhancer signals), histones, and promoter flanking regions. Additionally, the tool provides eQTL and linkage disequilibrium (LD) properties for a given SNP query, yielding further clues about other indirectly associated SNPs. Possible disruptions of the aforementioned structures affecting gene transcription are reported using multiple resource databases. SiNoPsis has a simple user-friendly interface, which allows single queries by gene symbol, genomic coordinates, Ensembl gene identifiers, RefSeq transcript identifiers and SNPs. It is the only portal providing useful SNP selection based on regulatory modules and LD with functional variants in both textual and graphic modes (by properly defining the arguments and parameters needed to run Circos). SiNoPsis is freely available at https://compgen.bio.ub.edu/SiNoPsis /. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Martínez-Núñez, Mario Alberto; Poot-Hernandez, Augusto Cesar; Rodríguez-Vázquez, Katya; Perez-Rueda, Ernesto
2013-01-01
In this work, the content of enzymes and DNA-binding transcription factors (TFs) in 794 non-redundant prokaryotic genomes was evaluated. The identification of enzymes was based on annotations deposited in the KEGG database as well as in databases of functional domains (COG and PFAM) and structural domains (Superfamily). For identifications of the TFs, hidden Markov profiles were constructed based on well-known transcriptional regulatory families. From these analyses, we obtained diverse and interesting results, such as the negative rate of incremental changes in the number of detected enzymes with respect to the genome size. On the contrary, for TFs the rate incremented as the complexity of genome increased. This inverse related performance shapes the diversity of metabolic and regulatory networks and impacts the availability of enzymes and TFs. Furthermore, the intersection of the derivatives between enzymes and TFs was identified at 9,659 genes, after this point, the regulatory complexity grows faster than metabolic complexity. In addition, TFs have a low number of duplications, in contrast to the apparent high number of duplications associated with enzymes. Despite the greater number of duplicated enzymes versus TFs, the increment by which duplicates appear is higher in TFs. A lower proportion of enzymes among archaeal genomes (22%) than in the bacterial ones (27%) was also found. This low proportion might be compensated by the interconnection between the metabolic pathways in Archaea. A similar proportion was also found for the archaeal TFs, for which the formation of regulatory complexes has been proposed. Finally, an enrichment of multifunctional enzymes in Bacteria, as a mechanism of ecological adaptation, was detected.
Martínez-Núñez, Mario Alberto; Poot-Hernandez, Augusto Cesar; Rodríguez-Vázquez, Katya; Perez-Rueda, Ernesto
2013-01-01
In this work, the content of enzymes and DNA-binding transcription factors (TFs) in 794 non-redundant prokaryotic genomes was evaluated. The identification of enzymes was based on annotations deposited in the KEGG database as well as in databases of functional domains (COG and PFAM) and structural domains (Superfamily). For identifications of the TFs, hidden Markov profiles were constructed based on well-known transcriptional regulatory families. From these analyses, we obtained diverse and interesting results, such as the negative rate of incremental changes in the number of detected enzymes with respect to the genome size. On the contrary, for TFs the rate incremented as the complexity of genome increased. This inverse related performance shapes the diversity of metabolic and regulatory networks and impacts the availability of enzymes and TFs. Furthermore, the intersection of the derivatives between enzymes and TFs was identified at 9,659 genes, after this point, the regulatory complexity grows faster than metabolic complexity. In addition, TFs have a low number of duplications, in contrast to the apparent high number of duplications associated with enzymes. Despite the greater number of duplicated enzymes versus TFs, the increment by which duplicates appear is higher in TFs. A lower proportion of enzymes among archaeal genomes (22%) than in the bacterial ones (27%) was also found. This low proportion might be compensated by the interconnection between the metabolic pathways in Archaea. A similar proportion was also found for the archaeal TFs, for which the formation of regulatory complexes has been proposed. Finally, an enrichment of multifunctional enzymes in Bacteria, as a mechanism of ecological adaptation, was detected. PMID:23922780
Transposable elements contribute to activation of maize genes in response to abiotic stress.
Makarevitch, Irina; Waters, Amanda J; West, Patrick T; Stitzer, Michelle; Hirsch, Candice N; Ross-Ibarra, Jeffrey; Springer, Nathan M
2015-01-01
Transposable elements (TEs) account for a large portion of the genome in many eukaryotic species. Despite their reputation as "junk" DNA or genomic parasites deleterious for the host, TEs have complex interactions with host genes and the potential to contribute to regulatory variation in gene expression. It has been hypothesized that TEs and genes they insert near may be transcriptionally activated in response to stress conditions. The maize genome, with many different types of TEs interspersed with genes, provides an ideal system to study the genome-wide influence of TEs on gene regulation. To analyze the magnitude of the TE effect on gene expression response to environmental changes, we profiled gene and TE transcript levels in maize seedlings exposed to a number of abiotic stresses. Many genes exhibit up- or down-regulation in response to these stress conditions. The analysis of TE families inserted within upstream regions of up-regulated genes revealed that between four and nine different TE families are associated with up-regulated gene expression in each of these stress conditions, affecting up to 20% of the genes up-regulated in response to abiotic stress, and as many as 33% of genes that are only expressed in response to stress. Expression of many of these same TE families also responds to the same stress conditions. The analysis of the stress-induced transcripts and proximity of the transposon to the gene suggests that these TEs may provide local enhancer activities that stimulate stress-responsive gene expression. Our data on allelic variation for insertions of several of these TEs show strong correlation between the presence of TE insertions and stress-responsive up-regulation of gene expression. Our findings suggest that TEs provide an important source of allelic regulatory variation in gene response to abiotic stress in maize.
2014-01-01
Background Nematode-trapping fungi are a unique group of organisms that can capture nematodes using sophisticated trapping structures. The genome of Drechslerella stenobrocha, a constricting-ring-forming fungus, has been sequenced and reported, and provided new insights into the evolutionary origins of nematode predation in fungi, the trapping mechanisms, and the dual lifestyles of saprophagy and predation. Results The genome of the fungus Drechslerella stenobrocha, which mechanically traps nematodes using a constricting ring, was sequenced. The genome was 29.02 Mb in size and was found rare instances of transposons and repeat induced point mutations, than that of Arthrobotrys oligospora. The functional proteins involved in nematode-infection, such as chitinases, subtilisins, and adhesive proteins, underwent a significant expansion in the A. oligospora genome, while there were fewer lectin genes that mediate fungus-nematode recognition in the D. stenobrocha genome. The carbohydrate-degrading enzyme catalogs in both species were similar to those of efficient cellulolytic fungi, suggesting a saprophytic origin of nematode-trapping fungi. In D. stenobrocha, the down-regulation of saprophytic enzyme genes and the up-regulation of infection-related genes during the capture of nematodes indicated a transition between dual life strategies of saprophagy and predation. The transcriptional profiles also indicated that trap formation was related to the protein kinase C (PKC) signal pathway and regulated by Zn(2)–C6 type transcription factors. Conclusions The genome of D. stenobrocha provides support for the hypothesis that nematode trapping fungi evolved from saprophytic fungi in a high carbon and low nitrogen environment. It reveals the transition between saprophagy and predation of these fungi and also proves new insights into the mechanisms of mechanical trapping. PMID:24507587
Chao, Tianle; Wang, Guizhi; Wang, Jianmin; Liu, Zhaohua; Ji, Zhibin; Hou, Lei; Zhang, Chunlan
2016-01-01
High-throughput mRNA sequencing enables the discovery of new transcripts and additional parts of incompletely annotated transcripts. Compared with the human and cow genomes, the reference annotation level of the sheep genome is still low. An investigation of new transcripts in sheep skeletal muscle will improve our understanding of muscle development. Therefore, applying high-throughput sequencing, two cDNA libraries from the biceps brachii of small-tailed Han sheep and Dorper sheep were constructed, and whole-transcriptome analysis was performed to determine the unknown transcript catalogue of this tissue. In this study, 40,129 transcripts were finally mapped to the sheep genome. Among them, 3,467 transcripts were determined to be unannotated in the current reference sheep genome and were defined as new transcripts. Based on protein-coding capacity prediction and comparative analysis of sequence similarity, 246 transcripts were classified as portions of unannotated genes or incompletely annotated genes. Another 1,520 transcripts were predicted with high confidence to be long non-coding RNAs. Our analysis also revealed 334 new transcripts that displayed specific expression in ruminants and uncovered a number of new transcripts without intergenus homology but with specific expression in sheep skeletal muscle. The results confirmed a complex transcript pattern of coding and non-coding RNA in sheep skeletal muscle. This study provided important information concerning the sheep genome and transcriptome annotation, which could provide a basis for further study.
ChIP-seq and ChIP-exo profiling of Pol II, H2A.Z, and H3K4me3 in human K562 cells.
Mchaourab, Zenab F; Perreault, Andrea A; Venters, Bryan J
2018-03-06
The human K562 chronic myeloid leukemia cell line has long served as an experimental paradigm for functional genomic studies. To systematically and functionally annotate the human genome, the ENCODE consortium generated hundreds of functional genomic data sets, such as chromatin immunoprecipitation coupled to sequencing (ChIP-seq). While ChIP-seq analyses have provided tremendous insights into gene regulation, spatiotemporal insights were limited by a resolution of several hundred base pairs. ChIP-exonuclease (ChIP-exo) is a refined version of ChIP-seq that overcomes this limitation by providing higher precision mapping of protein-DNA interactions. To study the interplay of transcription initiation and chromatin, we profiled the genome-wide locations for RNA polymerase II (Pol II), the histone variant H2A.Z, and the histone modification H3K4me3 using ChIP-seq and ChIP-exo. In this Data Descriptor, we present detailed information on parallel experimental design, data generation, quality control analysis, and data validation. We discuss how these data lay the foundation for future analysis to understand the relationship between the occupancy of Pol II and nucleosome positions at near base pair resolution.
Gene expression profiles of fin regeneration in loach (Paramisgurnus dabryanu).
Li, Li; He, Jingya; Wang, Linlin; Chen, Weihua; Chang, Zhongjie
2017-11-01
Teleost fins can regenerate accurate position-matched structure and function after amputation. However, we still lack systematic transcriptional profiling and methodologies to understand the molecular basis of fin regeneration. After histological analysis, we established a suppression subtraction hybridization library containing 418 distinct sequences expressed differentially during the process of blastema formation and differentiation in caudal fin regeneration. Genome ontology and comparative analysis of differential distribution of our data and the reference zebrafish genome showed notable subcategories, including multi-organism processes, response to stimuli, extracellular matrix, antioxidant activity, and cell junction function. KEGG pathway analysis allowed the effective identification of relevant genes in those pathways involved in tissue morphogenesis and regeneration, including tight junction, cell adhesion molecules, mTOR and Jak-STAT signaling pathway. From relevant function subcategories and signaling pathways, 78 clones were examined for further Southern-blot hybridization. Then, 17 genes were chosen and characterized using semi-quantitative PCR. Then 4 candidate genes were identified, including F11r, Mmp9, Agr2 and one without a match to any database. After real-time quantitative PCR, the results showed obvious expression changes in different periods of caudal fin regeneration. We can assume that the 4 candidates, likely valuable genes associated with fin regeneration, deserve additional attention. Thus, our study demonstrated how to investigate the transcript profiles with an emphasis on bioinformatics intervention and how to identify potential genes related to fin regeneration processes. The results also provide a foundation or knowledge for further research into genes and molecular mechanisms of fin regeneration. Copyright © 2017 Elsevier B.V. All rights reserved.
Widespread antisense transcription of Populus genome under drought.
Yuan, Yinan; Chen, Su
2018-06-06
Antisense transcription is widespread in many genomes and plays important regulatory roles in gene expression. The objective of our study was to investigate the extent and functional relevance of antisense transcription in forest trees. We employed Populus, a model tree species, to probe the antisense transcriptional response of tree genome under drought, through stranded RNA-seq analysis. We detected nearly 48% of annotated Populus gene loci with antisense transcripts and 44% of them with co-transcription from both DNA strands. Global distribution of reads pattern across annotated gene regions uncovered that antisense transcription was enriched in untranslated regions while sense reads were predominantly mapped in coding exons. We further detected 1185 drought-responsive sense and antisense gene loci and identified a strong positive correlation between the expression of antisense and sense transcripts. Additionally, we assessed the antisense expression in introns and found a strong correlation between intronic expression and exonic expression, confirming antisense transcription of introns contributes to transcriptional activity of Populus genome under drought. Finally, we functionally characterized drought-responsive sense-antisense transcript pairs through gene ontology analysis and discovered that functional groups including transcription factors and histones were concordantly regulated at both sense and antisense transcriptional level. Overall, our study demonstrated the extensive occurrence of antisense transcripts of Populus genes under drought and provided insights into genome structure, regulation pattern and functional significance of drought-responsive antisense genes in forest trees. Datasets generated in this study serve as a foundation for future genetic analysis to improve our understanding of gene regulation by antisense transcription.
Parallel factor ChIP provides essential internal control for quantitative differential ChIP-seq.
Guertin, Michael J; Cullen, Amy E; Markowetz, Florian; Holding, Andrew N
2018-04-17
A key challenge in quantitative ChIP combined with high-throughput sequencing (ChIP-seq) is the normalization of data in the presence of genome-wide changes in occupancy. Analysis-based normalization methods were developed for transcriptomic data and these are dependent on the underlying assumption that total transcription does not change between conditions. For genome-wide changes in transcription factor (TF) binding, these assumptions do not hold true. The challenges in normalization are confounded by experimental variability during sample preparation, processing and recovery. We present a novel normalization strategy utilizing an internal standard of unchanged peaks for reference. Our method can be readily applied to monitor genome-wide changes by ChIP-seq that are otherwise lost or misrepresented through analytical normalization. We compare our approach to normalization by total read depth and two alternative methods that utilize external experimental controls to study TF binding. We successfully resolve the key challenges in quantitative ChIP-seq analysis and demonstrate its application by monitoring the loss of Estrogen Receptor-alpha (ER) binding upon fulvestrant treatment, ER binding in response to estrodiol, ER mediated change in H4K12 acetylation and profiling ER binding in patient-derived xenographs. This is supported by an adaptable pipeline to normalize and quantify differential TF binding genome-wide and generate metrics for differential binding at individual sites.
Zhang, Qi; Zeng, Xin; Younkin, Sam; Kawli, Trupti; Snyder, Michael P; Keleş, Sündüz
2016-02-24
Chromatin immunoprecipitation followed by sequencing (ChIP-seq) experiments revolutionized genome-wide profiling of transcription factors and histone modifications. Although maturing sequencing technologies allow these experiments to be carried out with short (36-50 bps), long (75-100 bps), single-end, or paired-end reads, the impact of these read parameters on the downstream data analysis are not well understood. In this paper, we evaluate the effects of different read parameters on genome sequence alignment, coverage of different classes of genomic features, peak identification, and allele-specific binding detection. We generated 101 bps paired-end ChIP-seq data for many transcription factors from human GM12878 and MCF7 cell lines. Systematic evaluations using in silico variations of these data as well as fully simulated data, revealed complex interplay between the sequencing parameters and analysis tools, and indicated clear advantages of paired-end designs in several aspects such as alignment accuracy, peak resolution, and most notably, allele-specific binding detection. Our work elucidates the effect of design on the downstream analysis and provides insights to investigators in deciding sequencing parameters in ChIP-seq experiments. We present the first systematic evaluation of the impact of ChIP-seq designs on allele-specific binding detection and highlights the power of pair-end designs in such studies.
Identification and characterization of a subtelomeric satellite DNA in Callitrichini monkeys.
Araújo, Naiara Pereira; de Lima, Leonardo Gomes; Dias, Guilherme Borges; Kuhn, Gustavo Campos Silva; de Melo, Alan Lane; Yonenaga-Yassuda, Yatiyo; Stanyon, Roscoe; Svartman, Marta
2017-08-01
Repetitive DNAs are abundant fast-evolving components of eukaryotic genomes, which often possess important structural and functional roles. Despite their ubiquity, repetitive DNAs are poorly studied when compared with the genic fraction of genomes. Here, we took advantage of the availability of the sequenced genome of the common marmoset Callithrix jacchus to assess its satellite DNAs (satDNAs) and their distribution in Callitrichini. After clustering analysis of all reads and comparisons by similarity, we identified a satDNA composed by 171 bp motifs, named MarmoSAT, which composes 1.09% of the C. jacchus genome. Fluorescent in situ hybridization on chromosomes of species from the genera Callithrix, Mico and Callimico showed that MarmoSAT had a subtelomeric location. In addition to the common monomeric, we found that MarmoSAT was also organized in higher-order repeats of 338 bp in Callimico goeldii. Our phylogenetic analyses showed that MarmoSAT repeats from C. jacchus lack chromosome-specific features, suggesting exchange events among subterminal regions of non-homologous chromosomes. MarmoSAT is transcribed in several tissues of C. jacchus, with the highest transcription levels in spleen, thymus and heart. The transcription profile and subtelomeric location suggest that MarmoSAT may be involved in the regulation of telomerase and modulation of telomeric chromatin. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
[Transcription activator-like effectors(TALEs)based genome engineering].
Zhao, Mei-Wei; Duan, Cheng-Li; Liu, Jiang
2013-10-01
Systematic reverse-engineering of functional genome architecture requires precise modifications of gene sequences and transcription levels. The development and application of transcription activator-like effectors(TALEs) has created a wealth of genome engineering possibilities. TALEs are a class of naturally occurring DNA-binding proteins found in the plant pathogen Xanthomonas species. The DNA-binding domain of each TALE typically consists of tandem 34-amino acid repeat modules rearranged according to a simple cipher to target new DNA sequences. Customized TALEs can be used for a wide variety of genome engineering applications, including transcriptional modulation and genome editing. Such "genome engineering" has now been established in human cells and a number of model organisms, thus opening the door to better understanding gene function in model organisms, improving traits in crop plants and treating human genetic disorders.
Sorting Five Human Tumor Types Reveals Specific Biomarkers and Background Classification Genes.
Roche, Kimberly E; Weinstein, Marvin; Dunwoodie, Leland J; Poehlman, William L; Feltus, Frank A
2018-05-25
We applied two state-of-the-art, knowledge independent data-mining methods - Dynamic Quantum Clustering (DQC) and t-Distributed Stochastic Neighbor Embedding (t-SNE) - to data from The Cancer Genome Atlas (TCGA). We showed that the RNA expression patterns for a mixture of 2,016 samples from five tumor types can sort the tumors into groups enriched for relevant annotations including tumor type, gender, tumor stage, and ethnicity. DQC feature selection analysis discovered 48 core biomarker transcripts that clustered tumors by tumor type. When these transcripts were removed, the geometry of tumor relationships changed, but it was still possible to classify the tumors using the RNA expression profiles of the remaining transcripts. We continued to remove the top biomarkers for several iterations and performed cluster analysis. Even though the most informative transcripts were removed from the cluster analysis, the sorting ability of remaining transcripts remained strong after each iteration. Further, in some iterations we detected a repeating pattern of biological function that wasn't detectable with the core biomarker transcripts present. This suggests the existence of a "background classification" potential in which the pattern of gene expression after continued removal of "biomarker" transcripts could still classify tumors in agreement with the tumor type.
Wu, Yi-Hsuan; Taggart, Janet; Song, Pamela Xiyao; MacDiarmid, Colin; Eide, David J.
2016-01-01
The Msc2 and Zrg17 proteins of Saccharomyces cerevisiae form a complex to transport zinc into the endoplasmic reticulum. ZRG17 is transcriptionally induced in zinc-limited cells by the Zap1 transcription factor. In this report, we show that MSC2 mRNA also increases (~1.5 fold) in zinc-limited cells. The MSC2 gene has two in-frame ATG codons at its 5’ end, ATG1 and ATG2; ATG2 is the predicted initiation codon. When the MSC2 promoter was fused at ATG2 to the lacZ gene, we found that unlike the chromosomal gene this reporter showed a 4-fold decrease in lacZ mRNA in zinc-limited cells. Surprisingly, β-galactosidase activity generated by this fusion gene increased ~7 fold during zinc deficiency suggesting the influence of post-transcriptional factors. Transcription of MSC2ATG2-lacZ was found to start upstream of ATG1 in zinc-replete cells. In zinc-limited cells, transcription initiation shifted to sites just upstream of ATG2. From the results of mutational and polysome profile analyses, we propose the following explanation for these effects. In zinc-replete cells, MSC2ATG2-lacZ mRNA with long 5’ UTRs fold into secondary structures that inhibit translation. In zinc-limited cells, transcripts with shorter unstructured 5’ UTRs are generated that are more efficiently translated. Surprisingly, chromosomal MSC2 did not show start site shifts in response to zinc status and only shorter 5’ UTRs were observed. However, the shifts that occur in the MSC2ATG2-lacZ construct led us to identify significant transcription start site changes affecting the expression of ~3% of all genes. Therefore, zinc status can profoundly alter transcription initiation across the yeast genome. PMID:27657924
Downstream targets of HOXB4 in a cell line model of primitive hematopoietic progenitor cells.
Lee, Han M; Zhang, Hui; Schulz, Vincent; Tuck, David P; Forget, Bernard G
2010-08-05
Enforced expression of the homeobox transcription factor HOXB4 has been shown to enhance hematopoietic stem cell self-renewal and expansion ex vivo and in vivo. To investigate the downstream targets of HOXB4 in hematopoietic progenitor cells, HOXB4 was constitutively overexpressed in the primitive hematopoietic progenitor cell line EML. Two genome-wide analytical techniques were used: RNA expression profiling using microarrays and chromatin immunoprecipitation (ChIP)-chip. RNA expression profiling revealed that 465 gene transcripts were differentially expressed in KLS (c-Kit(+), Lin(-), Sca-1(+))-EML cells that overexpressed HOXB4 (KLS-EML-HOXB4) compared with control KLS-EML cells that were transduced with vector alone. In particular, erythroid-specific gene transcripts were observed to be highly down-regulated in KLS-EML-HOXB4 cells. ChIP-chip analysis revealed that the promoter region for 1910 genes, such as CD34, Sox4, and B220, were occupied by HOXB4 in KLS-EML-HOXB4 cells. Side-by-side comparison of the ChIP-chip and RNA expression profiling datasets provided correlative information and identified Gp49a and Laptm4b as candidate "stemness-related" genes. Both genes were highly ranked in both dataset lists and have been previously shown to be preferentially expressed in hematopoietic stem cells and down-regulated in mature hematopoietic cells, thus making them attractive candidates for future functional studies in hematopoietic cells.
Godahewa, G I; Perera, N C N; Lee, Sukkyoung; Kim, Myoung-Jin; Lee, Jehee
2017-09-05
Cathepsin Z (CTSZ) is lysosomal cysteine protease of the papain superfamily. It participates in the host immune defense via phagocytosis, signal transduction, cell-cell communication, proliferation, and migration of immune cells such as monocytes, macrophages, and dendritic cells. Hence, CTSZ is also acknowledged as an acute-phase protein in host immunity. In this study, we sought to identify the CTSZ homolog from disk abalone (AbCTSZ) and characterize it at the molecular, genomic, and transcriptional levels. AbCTSZ encodes a protein with 318 amino acids and a molecular mass of 36kDa. The structure of AbCTSZ reveals amino acid sequences that are characteristic of the signal sequence, pro-peptide, peptidase-C1 papain family cysteine protease domain, mini-loop, HIP motif, N-linked glycosylation sites, active sites, and conserved Cys residues. A pairwise comparison revealed that AbCTSZ shared the highest amino acid homology with its molluscan counterpart from Crassostrea gigas. A multiple alignment analysis revealed the conservation of functionally crucial elements of AbCTSZ, and a phylogenetic study further confirmed a proximal evolutionary relationship with its invertebrate counterparts. Further, an analysis of AbCTSZ genomic structure revealed seven exons separated by six introns, which differs from that of its vertebrate counterparts. Quantitative real time PCR (qPCR) detected the transcripts of AbCTSZ in early developmental stages and in eight different tissues. Higher levels of AbCTSZ transcripts were found in trochophore, gill, and hemocytes, highlighting its importance in the early development and immunity of disk abalone. In addition, we found that viable bacteria (Vibrio parahaemolyticus and Listeria monocytogenes) and bacterial lipopolysaccharides significantly modulated AbCTSZ transcription. Collectively, these lines of evidences suggest that AbCTSZ plays an indispensable role in the innate immunity of disk abalone. Copyright © 2017. Published by Elsevier B.V.
DuanMu, Huizi; Wang, Yang; Bai, Xi; Cheng, Shufei; Deyholos, Michael K; Wong, Gane Ka-Shu; Li, Dan; Zhu, Dan; Li, Ran; Yu, Yang; Cao, Lei; Chen, Chao; Zhu, Yanming
2015-11-01
Soil alkalinity is an important environmental problem limiting agricultural productivity. Wild soybean (Glycine soja) shows strong alkaline stress tolerance, so it is an ideal plant candidate for studying the molecular mechanisms of alkaline tolerance and identifying alkaline stress-responsive genes. However, limited information is available about G. soja responses to alkaline stress on a genomic scale. Therefore, in the present study, we used RNA sequencing to compare transcript profiles of G. soja root responses to sodium bicarbonate (NaHCO3) at six time points, and a total of 68,138,478 pairs of clean reads were obtained using the Illumina GAIIX. Expression patterns of 46,404 G. soja genes were profiled in all six samples based on RNA-seq data using Cufflinks software. Then, t12 transcription factors from MYB, WRKY, NAC, bZIP, C2H2, HB, and TIFY families and 12 oxidation reduction related genes were chosen and verified to be induced in response to alkaline stress by using quantitative real-time polymerase chain reaction (qRT-PCR). The GO functional annotation analysis showed that besides "transcriptional regulation" and "oxidation reduction," these genes were involved in a variety of processes, such as "binding" and "response to stress." This is the first comprehensive transcriptome profiling analysis of wild soybean root under alkaline stress by RNA sequencing. Our results highlight changes in the gene expression patterns and identify a set of genes induced by NaHCO3 stress. These findings provide a base for the global analyses of G. soja alkaline stress tolerance mechanisms.
Gene expression profile in mesenchymal stem cells derived from dental tissues and bone marrow
Kim, Su-Hwan; Kim, Young-Sung; Lee, Su-Yeon; Kim, Kyoung-Hwa; Lee, Yong-Moo; Kim, Won-Kyung
2011-01-01
Purpose The aim of this study is to compare the gene expression profile in mesenchymal stem cells derived from dental tissues and bone marrow for characterization of dental stem cells. Methods We employed GeneChip analysis to the expression levels of approximately 32,321 kinds of transcripts in 5 samples of bone-marrow-derived mesenchymal stem cells (BMSCs) (n=1), periodontal ligament stem cells (PDLSCs) (n=2), and dental pulp stem cells (DPSCs) (n=2). Each cell was sorted by a FACS Vantage Sorter using immunocytochemical staining of the early mesenchymal stem cell surface marker STRO-1 before the microarray analysis. Results We identified 379 up-regulated and 133 down-regulated transcripts in BMSCs, 68 up-regulated and 64 down-regulated transcripts in PDLSCs, and 218 up-regulated and 231 down-regulated transcripts in DPSCs. In addition, anatomical structure development and anatomical structure morphogenesis gene ontology (GO) terms were over-represented in all three different mesenchymal stem cells and GO terms related to blood vessels, and neurons were over-represented only in DPSCs. Conclusions This study demonstrated the genome-wide gene expression patterns of STRO-1+ mesenchymal stem cells derived from dental tissues and bone marrow. The differences among the expression profiles of BMSCs, PDLSCs, and DPSCs were shown, and 999 candidate genes were found to be definitely up- or down-regulated. In addition, GOstat analyses of regulated gene products provided over-represented GO classes. These data provide a first step for discovering molecules key to the characteristics of dental stem cells. PMID:21954424
We and others have shown that transition and maintenance of biological states is controlled by master regulator proteins, which can be inferred by interrogating tissue-specific regulatory models (interactomes) with transcriptional signatures, using the VIPER algorithm. Yet, some tissues may lack molecular profiles necessary for interactome inference (orphan tissues), or, as for single cells isolated from heterogeneous samples, their tissue context may be undetermined.
Grade, Marian; Ghadimi, B Michael; Varma, Sudhir; Simon, Richard; Wangsa, Danny; Barenboim-Stapleton, Linda; Liersch, Torsten; Becker, Heinz; Ried, Thomas; Difilippantonio, Michael J
2006-01-01
To identify genetic alterations underlying rectal carcinogenesis, we used global gene expression profiling of a series of 17 locally advanced rectal adenocarcinomas and 20 normal rectal mucosa biopsies on oligonucleotide arrays. A total of 351 genes were differentially expressed (P < 1.0e-7) between normal rectal mucosa and rectal carcinomas, 77 genes had a >5-fold difference, and 85 genes always had at least a 2-fold change in all of the matched samples. Twelve genes satisfied all three of these criteria. Altered expression of genes such as PTGS2 (COX-2), WNT1, TGFB1, VEGF, and MYC was confirmed, whereas our data for other genes, like PPARD and LEF1, were inconsistent with previous reports. In addition, we found deregulated expression of many genes whose involvement in rectal carcinogenesis has not been reported. By mapping the genomic imbalances in the tumors using comparative genomic hybridization, we could show that DNA copy number gains of recurrently aneuploid chromosome arms 7p, 8q, 13q, 18q, 20p, and 20q correlated significantly with their average chromosome arm expression profile. Taken together, our results show that both the high-level, significant transcriptional deregulation of specific genes and general modification of the average transcriptional activity of genes residing on aneuploid chromosomes coexist in rectal adenocarcinomas.
Grade, Marian; Ghadimi, B. Michael; Varma, Sudhir; Simon, Richard; Wangsa, Danny; Barenboim-Stapleton, Linda; Liersch, Torsten; Becker, Heinz; Ried, Thomas; Difilippantonio, Michael J.
2016-01-01
To identify genetic alterations underlying rectal carcinogenesis, we used global gene expression profiling of a series of 17 locally advanced rectal adenocarcinomas and 20 normal rectal mucosa biopsies on oligonucleotide arrays. A total of 351 genes were differentially expressed (P < 1.0e–7) between normal rectal mucosa and rectal carcinomas, 77 genes had a >5-fold difference, and 85 genes always had at least a 2-fold change in all of the matched samples. Twelve genes satisfied all three of these criteria. Altered expression of genes such as PTGS2 (COX-2), WNT1, TGFB1, VEGF, and MYC was confirmed, whereas our data for other genes, like PPARD and LEF1, were inconsistent with previous reports. In addition, we found deregulated expression of many genes whose involvement in rectal carcinogenesis has not been reported. By mapping the genomic imbalances in the tumors using comparative genomic hybridization, we could show that DNA copy number gains of recurrently aneuploid chromosome arms 7p, 8q, 13q, 18q, 20p, and 20q correlated significantly with their average chromosome arm expression profile. Taken together, our results show that both the high-level, significant transcriptional deregulation of specific genes and general modification of the average transcriptional activity of genes residing on aneuploid chromosomes coexist in rectal adenocarcinomas. PMID:16397240
Lacruz, Rodrigo S; Smith, Charles E; Bringas, Pablo; Chen, Yi-Bu; Smith, Susan M; Snead, Malcolm L; Kurtz, Ira; Hacia, Joseph G; Hubbard, Michael J; Paine, Michael L
2012-05-01
The gene repertoire regulating vertebrate biomineralization is poorly understood. Dental enamel, the most highly mineralized tissue in mammals, differs from other calcifying systems in that the formative cells (ameloblasts) lack remodeling activity and largely degrade and resorb the initial extracellular matrix. Enamel mineralization requires that ameloblasts undergo a profound functional switch from matrix-secreting to maturational (calcium transport, protein resorption) roles as mineralization progresses. During the maturation stage, extracellular pH decreases markedly, placing high demands on ameloblasts to regulate acidic environments present around the growing hydroxyapatite crystals. To identify the genetic events driving enamel mineralization, we conducted genome-wide transcript profiling of the developing enamel organ from rat incisors and highlight over 300 genes differentially expressed during maturation. Using multiple bioinformatics analyses, we identified groups of maturation-associated genes whose functions are linked to key mineralization processes including pH regulation, calcium handling, and matrix turnover. Subsequent qPCR and Western blot analyses revealed that a number of solute carrier (SLC) gene family members were up-regulated during maturation, including the novel protein Slc24a4 involved in calcium handling as well as other proteins of similar function (Stim1). By providing the first global overview of the cellular machinery required for enamel maturation, this study provide a strong foundation for improving basic understanding of biomineralization and its practical applications in healthcare. Copyright © 2011 Wiley Periodicals, Inc.
Liu, Shi-Huo; Li, Hong-Fei; Yang, Yang; Yang, Rui-Lin; Yang, Wen-Jia; Jiang, Hong-Bo; Dou, Wei; Smagghe, Guy; Wang, Jin-Jun
2018-05-01
Chitinases (Chts) and chitin deacetylases (CDAs) are important enzymes required for chitin metabolism in insects. In this study, 12 Cht-related genes (including seven Cht genes and five imaginal disc growth factor genes) and 6 CDA genes (encoding seven proteins) were identified in Bactrocera dorsalis using genome-wide searching and transcript profiling. Based on the conserved sequences and phylogenetic relationships, 12 Cht-related proteins were clustered into eight groups (group I-V and VII-IX). Further domain architecture analysis showed that all contained at least one chitinase catalytic domain, however, only four (BdCht5, BdCht7, BdCht8 and BdCht10) possessed chitin-binding domains. The subsequent phylogenetic analysis revealed that seven CDAs were clustered into five groups (group I-V), and all had one chitin deacetylase catalytic domain. However, only six exhibited chitin-binding domains. Finally, the development- and tissue-specific expression profiling showed that transcript levels of the 12 Cht-related genes and 6 CDA genes varied considerably among eggs, larvae, pupae and adults, as well as among different tissues of larvae and adults. Our findings illustrate the structural differences and expression patterns of Cht and CDA genes in B. dorsalis, and provide important information for the development of new pest control strategies based on these vital enzymes. Copyright © 2018. Published by Elsevier Inc.
Wei, Wei; Chai, Zhuangzhuang; Xie, Yinge; Gao, Kuan; Cui, Mengyuan; Jiang, Ying
2017-01-01
Mitogen-activated protein kinases (MAPKs) play essential roles in mediating biotic and abiotic stress responses in plants. However, the MAPK gene family in strawberry has not been systematically characterized. Here, we performed a genome-wide survey and identified 12 MAPK genes in the Fragaria vesca genome. Protein domain analysis indicated that all FvMAPKs have typical protein kinase domains. Sequence alignments and phylogenetic analysis classified the FvMAPK genes into four different groups. Conserved motif and exon-intron organization supported the evolutionary relationships inferred from the phylogenetic analysis. Analysis of the stress-related cis-regulatory element in the promoters and subcellular localization predictions of FvMAPKs were also performed. Gene transcript profile analysis showed that the majority of the FvMAPK genes were ubiquitously transcribed in strawberry leaves after Podosphaera aphanis inoculation and after treatment with cold, heat, drought, salt and the exogenous hormones abscisic acid, ethephon, methyl jasmonate, and salicylic acid. RT-qPCR showed that six selected FvMAPK genes comprehensively responded to various stimuli. Additionally, interaction networks revealed that the crucial signaling transduction controlled by FvMAPKs may be involved in the biotic and abiotic stress responses. Our results may provide useful information for future research on the function of the MAPK gene family and the genetic improvement of strawberry resistance to environmental stresses. PMID:28562633
Narnoliya, Lokesh K; Kaushal, Girija; Singh, Sudhir P; Sangwan, Rajender S
2017-01-13
Rose-scented geranium (Pelargonium sp.) is a perennial herb that produces a high value essential oil of fragrant significance due to the characteristic compositional blend of rose-oxide and acyclic monoterpenoids in foliage. Recently, the plant has also been shown to produce tartaric acid in leaf tissues. Rose-scented geranium represents top-tier cash crop in terms of economic returns and significance of the plant and plant products. However, there has hardly been any study on its metabolism and functional genomics, nor any genomic expression dataset resource is available in public domain. Therefore, to begin the gains in molecular understanding of specialized metabolic pathways of the plant, de novo sequencing of rose-scented geranium leaf transcriptome, transcript assembly, annotation, expression profiling as well as their validation were carried out. De novo transcriptome analysis resulted a total of 78,943 unique contigs (average length: 623 bp, and N50 length: 752 bp) from 15.44 million high quality raw reads. In silico functional annotation led to the identification of several putative genes representing terpene, ascorbic acid and tartaric acid biosynthetic pathways, hormone metabolism, and transcription factors. Additionally, a total of 6,040 simple sequence repeat (SSR) motifs were identified in 6.8% of the expressed transcripts. The highest frequency of SSR was of tri-nucleotides (50%). Further, transcriptome assembly was validated for randomly selected putative genes by standard PCR-based approach. In silico expression profile of assembled contigs were validated by real-time PCR analysis of selected transcripts. Being the first report on transcriptome analysis of rose-scented geranium the data sets and the leads and directions reflected in this investigation will serve as a foundation for pursuing and understanding molecular aspects of its biology, and specialized metabolic pathways, metabolic engineering, genetic diversity as well as molecular breeding.
Transcriptional Architecture of Synaptic Communication Delineates GABAergic Neuron Identity.
Paul, Anirban; Crow, Megan; Raudales, Ricardo; He, Miao; Gillis, Jesse; Huang, Z Josh
2017-10-19
Understanding the organizational logic of neural circuits requires deciphering the biological basis of neuronal diversity and identity, but there is no consensus on how neuron types should be defined. We analyzed single-cell transcriptomes of a set of anatomically and physiologically characterized cortical GABAergic neurons and conducted a computational genomic screen for transcriptional profiles that distinguish them from one another. We discovered that cardinal GABAergic neuron types are delineated by a transcriptional architecture that encodes their synaptic communication patterns. This architecture comprises 6 categories of ∼40 gene families, including cell-adhesion molecules, transmitter-modulator receptors, ion channels, signaling proteins, neuropeptides and vesicular release components, and transcription factors. Combinatorial expression of select members across families shapes a multi-layered molecular scaffold along the cell membrane that may customize synaptic connectivity patterns and input-output signaling properties. This molecular genetic framework of neuronal identity integrates cell phenotypes along multiple axes and provides a foundation for discovering and classifying neuron types. Copyright © 2017 Elsevier Inc. All rights reserved.
Singh, Sudhir P; Jeet, Raja; Kumar, Jitendra; Shukla, Vishnu; Srivastava, Rakesh; Mantri, Shrikant S; Tuli, Rakesh
2014-01-01
Wheat is one of the most important cereal crops in the world. To identify the candidate genes for mineral accumulation, it is important to examine differential transcriptome between wheat genotypes, with contrasting levels of minerals in grains. A transcriptional comparison of developing grains was carried out between two wheat genotypes- Triticum aestivum Cv. WL711 (low grain mineral), and T. aestivum L. IITR26 (high grain mineral), using Affymetrix GeneChip Wheat Genome Array. The study identified a total of 580 probe sets as differentially expressed (with log2 fold change of ≥2 at p≤0.01) between the two genotypes, during grain filling. Transcripts with significant differences in induction or repression between the two genotypes included genes related to metal homeostasis, metal tolerance, lignin and flavonoid biosynthesis, amino acid and protein transport, vacuolar-sorting receptor, aquaporins, and stress responses. Meta-analysis revealed spatial and temporal signatures of a majority of the differentially regulated transcripts.
Singh, Sudhir P.; Jeet, Raja; Kumar, Jitendra; Shukla, Vishnu; Srivastava, Rakesh; Mantri, Shrikant S.; Tuli, Rakesh
2014-01-01
Wheat is one of the most important cereal crops in the world. To identify the candidate genes for mineral accumulation, it is important to examine differential transcriptome between wheat genotypes, with contrasting levels of minerals in grains. A transcriptional comparison of developing grains was carried out between two wheat genotypes- Triticum aestivum Cv. WL711 (low grain mineral), and T. aestivum L. IITR26 (high grain mineral), using Affymetrix GeneChip Wheat Genome Array. The study identified a total of 580 probe sets as differentially expressed (with log2 fold change of ≥2 at p≤0.01) between the two genotypes, during grain filling. Transcripts with significant differences in induction or repression between the two genotypes included genes related to metal homeostasis, metal tolerance, lignin and flavonoid biosynthesis, amino acid and protein transport, vacuolar-sorting receptor, aquaporins, and stress responses. Meta-analysis revealed spatial and temporal signatures of a majority of the differentially regulated transcripts. PMID:25364903
Transcriptional Profile of Brain Injury in Hypothermic Circulatory Arrest and Cardiopulmonary Bypass
Allen, Jeremiah G.; Weiss, Eric S.; Wilson, Mary Ann; Arnaoutakis, George J.; Blue, Mary E.; Talbot, C. Conover; Jie, Chunfa; Lange, Mary S.; Troncoso, Juan C.; Johnston, Michael V.; Baumgartner, William A.
2011-01-01
Background Little is known about the molecular mechanisms of neurologic complications after hypothermic circulatory arrest (HCA) with cardiopulmonary bypass (CPB). Canine genome sequencing allows profiling of genomic changes after HCA and CPB alone. We hypothesize that gene regulation will increase with increased severity of injury. Methods Dogs underwent 2-hour HCA at 18°C (n = 10), 1-hour HCA (n = 8), or 2-hour CPB at 32°C alone (n = 8). In each group, half were sacrificed at 8 hours and half at 24 hours after treatment. After neurologic scoring, brains were harvested for genomic analysis. Hippocampal RNA isolates were analyzed using canine oligonucleotide expression arrays containing 42,028 probes. Results Consistent with prior work, dogs that underwent 2-hour HCA experienced severe neurologic injury. One hour of HCA caused intermediate clinical damage. Cardiopulmonary bypass alone yielded normal clinical scores. Cardiopulmonary bypass, 1-hour HCA, and 2-hour HCA groups historically demonstrated increasing degrees of histopathologic damage (previously published). Exploratory analysis revealed differences in significantly regulated genes (false discovery rate < 10%, absolute fold change ≥ 1.2), with increases in differential gene expression with injury severity. At 8 hours and 24 hours after insult, 2-hour HCA dogs had 502 and 1,057 genes regulated, respectively; 1-hour HCA dogs had 179 and 56 genes regulated; and CPB alone dogs had 5 and 0 genes regulated. Conclusions Our genomic profile of canine brains after HCA and CPB revealed 1-hour and 2-hour HCA induced markedly increased gene regulation, in contrast to the minimal effect of CPB alone. This adds to the body of neurologic literature supporting the safety of CPB alone and the minimal effect of CPB on a normal brain, while illuminating genomic results of both. PMID:20494057
Molecular pathways: transcription factories and chromosomal translocations.
Osborne, Cameron S
2014-01-15
The mammalian nucleus is a highly complex structure that carries out a diverse range of functions such as DNA replication, cell division, RNA processing, and nuclear export/import. Many of these activities occur at discrete subcompartments that intersect with specific regions of the genome. Over the past few decades, evidence has accumulated to suggest that RNA transcription also occurs in specialized sites, called transcription factories, that may influence how the genome is organized. There may be certain efficiency benefits to cluster transcriptional activity in this way. However, the clustering of genes at transcription factories may have consequences for genome stability, and increase the susceptibility to recurrent chromosomal translocations that lead to cancer. The relationships between genome organization, transcription, and chromosomal translocation formation will have important implications in understanding the causes of therapy-related cancers. ©2013 AACR.
DNA microarray analysis is plagued by a lack of data reproducibility and by limits to the detectability of transcripts by hybridization. To mitigate these limitations, we employed transcriptional coupling within the S. typhimurium genome. This genome has 2664 transcriptionally co...
Microplate-based platform for combined chromatin and DNA methylation immunoprecipitation assays
2011-01-01
Background The processes that compose expression of a given gene are far more complex than previously thought presenting unprecedented conceptual and mechanistic challenges that require development of new tools. Chromatin structure, which is regulated by DNA methylation and histone modification, is at the center of gene regulation. Immunoprecipitations of chromatin (ChIP) and methylated DNA (MeDIP) represent a major achievement in this area that allow researchers to probe chromatin modifications as well as specific protein-DNA interactions in vivo and to estimate the density of proteins at specific sites genome-wide. Although a critical component of chromatin structure, DNA methylation has often been studied independently of other chromatin events and transcription. Results To allow simultaneous measurements of DNA methylation with other genomic processes, we developed and validated a simple and easy-to-use high throughput microplate-based platform for analysis of DNA methylation. Compared to the traditional beads-based MeDIP the microplate MeDIP was more sensitive and had lower non-specific binding. We integrated the MeDIP method with a microplate ChIP assay which allows measurements of both DNA methylation and histone marks at the same time, Matrix ChIP-MeDIP platform. We illustrated several applications of this platform to relate DNA methylation, with chromatin and transcription events at selected genes in cultured cells, human cancer and in a model of diabetic kidney disease. Conclusion The high throughput capacity of Matrix ChIP-MeDIP to profile tens and potentially hundreds of different genomic events at the same time as DNA methylation represents a powerful platform to explore complex genomic mechanism at selected genes in cultured cells and in whole tissues. In this regard, Matrix ChIP-MeDIP should be useful to complement genome-wide studies where the rich chromatin and transcription database resources provide fruitful foundation to pursue mechanistic, functional and diagnostic information at genes of interest in health and disease. PMID:22098709
Lapierre, Pascal; Mir, Mushtaq; Chase, Michael R.; Pyle, Margaret M.; Gawande, Richa; Ahmad, Rushdy; Sarracino, David A.; Ioerger, Thomas R.; Fortune, Sarah M.; Derbyshire, Keith M.; Wade, Joseph T.; Gray, Todd A.
2015-01-01
RNA-seq technologies have provided significant insight into the transcription networks of mycobacteria. However, such studies provide no definitive information on the translational landscape. Here, we use a combination of high-throughput transcriptome and proteome-profiling approaches to more rigorously understand protein expression in two mycobacterial species. RNA-seq and ribosome profiling in Mycobacterium smegmatis, and transcription start site (TSS) mapping and N-terminal peptide mass spectrometry in Mycobacterium tuberculosis, provide complementary, empirical datasets to examine the congruence of transcription and translation in the Mycobacterium genus. We find that nearly one-quarter of mycobacterial transcripts are leaderless, lacking a 5’ untranslated region (UTR) and Shine-Dalgarno ribosome-binding site. Our data indicate that leaderless translation is a major feature of mycobacterial genomes and is comparably robust to leadered initiation. Using translational reporters to systematically probe the cis-sequence requirements of leaderless translation initiation in mycobacteria, we find that an ATG or GTG at the mRNA 5’ end is both necessary and sufficient. This criterion, together with our ribosome occupancy data, suggests that mycobacteria encode hundreds of small, unannotated proteins at the 5’ ends of transcripts. The conservation of small proteins in both mycobacterial species tested suggests that some play important roles in mycobacterial physiology. Our translational-reporter system further indicates that mycobacterial leadered translation initiation requires a Shine Dalgarno site in the 5’ UTR and that ATG, GTG, TTG, and ATT codons can robustly initiate translation. Our combined approaches provide the first comprehensive view of mycobacterial gene structures and their non-canonical mechanisms of protein expression. PMID:26536359
Shell, Scarlet S; Wang, Jing; Lapierre, Pascal; Mir, Mushtaq; Chase, Michael R; Pyle, Margaret M; Gawande, Richa; Ahmad, Rushdy; Sarracino, David A; Ioerger, Thomas R; Fortune, Sarah M; Derbyshire, Keith M; Wade, Joseph T; Gray, Todd A
2015-11-01
RNA-seq technologies have provided significant insight into the transcription networks of mycobacteria. However, such studies provide no definitive information on the translational landscape. Here, we use a combination of high-throughput transcriptome and proteome-profiling approaches to more rigorously understand protein expression in two mycobacterial species. RNA-seq and ribosome profiling in Mycobacterium smegmatis, and transcription start site (TSS) mapping and N-terminal peptide mass spectrometry in Mycobacterium tuberculosis, provide complementary, empirical datasets to examine the congruence of transcription and translation in the Mycobacterium genus. We find that nearly one-quarter of mycobacterial transcripts are leaderless, lacking a 5' untranslated region (UTR) and Shine-Dalgarno ribosome-binding site. Our data indicate that leaderless translation is a major feature of mycobacterial genomes and is comparably robust to leadered initiation. Using translational reporters to systematically probe the cis-sequence requirements of leaderless translation initiation in mycobacteria, we find that an ATG or GTG at the mRNA 5' end is both necessary and sufficient. This criterion, together with our ribosome occupancy data, suggests that mycobacteria encode hundreds of small, unannotated proteins at the 5' ends of transcripts. The conservation of small proteins in both mycobacterial species tested suggests that some play important roles in mycobacterial physiology. Our translational-reporter system further indicates that mycobacterial leadered translation initiation requires a Shine Dalgarno site in the 5' UTR and that ATG, GTG, TTG, and ATT codons can robustly initiate translation. Our combined approaches provide the first comprehensive view of mycobacterial gene structures and their non-canonical mechanisms of protein expression.
MethHC: a database of DNA methylation and gene expression in human cancer.
Huang, Wei-Yun; Hsu, Sheng-Da; Huang, Hsi-Yuan; Sun, Yi-Ming; Chou, Chih-Hung; Weng, Shun-Long; Huang, Hsien-Da
2015-01-01
We present MethHC (http://MethHC.mbc.nctu.edu.tw), a database comprising a systematic integration of a large collection of DNA methylation data and mRNA/microRNA expression profiles in human cancer. DNA methylation is an important epigenetic regulator of gene transcription, and genes with high levels of DNA methylation in their promoter regions are transcriptionally silent. Increasing numbers of DNA methylation and mRNA/microRNA expression profiles are being published in different public repositories. These data can help researchers to identify epigenetic patterns that are important for carcinogenesis. MethHC integrates data such as DNA methylation, mRNA expression, DNA methylation of microRNA gene and microRNA expression to identify correlations between DNA methylation and mRNA/microRNA expression from TCGA (The Cancer Genome Atlas), which includes 18 human cancers in more than 6000 samples, 6548 microarrays and 12 567 RNA sequencing data. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Hicks, Chindo; Kumar, Ranjit; Pannuti, Antonio; Miele, Lucio
2012-01-01
Variable response and resistance to tamoxifen treatment in breast cancer patients remains a major clinical problem. To determine whether genes and biological pathways containing SNPs associated with risk for breast cancer are dysregulated in response to tamoxifen treatment, we performed analysis combining information from 43 genome-wide association studies with gene expression data from 298 ER(+) breast cancer patients treated with tamoxifen and 125 ER(+) controls. We identified 95 genes which distinguished tamoxifen treated patients from controls. Additionally, we identified 54 genes which stratified tamoxifen treated patients into two distinct groups. We identified biological pathways containing SNPs associated with risk for breast cancer, which were dysregulated in response to tamoxifen treatment. Key pathways identified included the apoptosis, P53, NFkB, DNA repair and cell cycle pathways. Combining GWAS with transcription profiling provides a unified approach for associating GWAS findings with response to drug treatment and identification of potential drug targets.
Transcription as a Threat to Genome Integrity.
Gaillard, Hélène; Aguilera, Andrés
2016-06-02
Genomes undergo different types of sporadic alterations, including DNA damage, point mutations, and genome rearrangements, that constitute the basis for evolution. However, these changes may occur at high levels as a result of cell pathology and trigger genome instability, a hallmark of cancer and a number of genetic diseases. In the last two decades, evidence has accumulated that transcription constitutes an important natural source of DNA metabolic errors that can compromise the integrity of the genome. Transcription can create the conditions for high levels of mutations and recombination by its ability to open the DNA structure and remodel chromatin, making it more accessible to DNA insulting agents, and by its ability to become a barrier to DNA replication. Here we review the molecular basis of such events from a mechanistic perspective with particular emphasis on the role of transcription as a genome instability determinant.
Genome-wide mapping of infection-induced SINE RNAs reveals a role in selective mRNA export.
Karijolich, John; Zhao, Yang; Alla, Ravi; Glaunsinger, Britt
2017-06-02
Short interspersed nuclear elements (SINEs) are retrotransposons evolutionarily derived from endogenous RNA Polymerase III RNAs. Though SINE elements have undergone exaptation into gene regulatory elements, how transcribed SINE RNA impacts transcriptional and post-transcriptional regulation is largely unknown. This is partly due to a lack of information regarding which of the loci have transcriptional potential. Here, we present an approach (short interspersed nuclear element sequencing, SINE-seq), which selectively profiles RNA Polymerase III-derived SINE RNA, thereby identifying transcriptionally active SINE loci. Applying SINE-seq to monitor murine B2 SINE expression during a gammaherpesvirus infection revealed transcription from 28 270 SINE loci, with ∼50% of active SINE elements residing within annotated RNA Polymerase II loci. Furthermore, B2 RNA can form intermolecular RNA-RNA interactions with complementary mRNAs, leading to nuclear retention of the targeted mRNA via a mechanism involving p54nrb. These findings illuminate a pathway for the selective regulation of mRNA export during stress via retrotransposon activation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genome-wide mapping of infection-induced SINE RNAs reveals a role in selective mRNA export
Zhao, Yang; Alla, Ravi
2017-01-01
Abstract Short interspersed nuclear elements (SINEs) are retrotransposons evolutionarily derived from endogenous RNA Polymerase III RNAs. Though SINE elements have undergone exaptation into gene regulatory elements, how transcribed SINE RNA impacts transcriptional and post-transcriptional regulation is largely unknown. This is partly due to a lack of information regarding which of the loci have transcriptional potential. Here, we present an approach (short interspersed nuclear element sequencing, SINE-seq), which selectively profiles RNA Polymerase III-derived SINE RNA, thereby identifying transcriptionally active SINE loci. Applying SINE-seq to monitor murine B2 SINE expression during a gammaherpesvirus infection revealed transcription from 28 270 SINE loci, with ∼50% of active SINE elements residing within annotated RNA Polymerase II loci. Furthermore, B2 RNA can form intermolecular RNA–RNA interactions with complementary mRNAs, leading to nuclear retention of the targeted mRNA via a mechanism involving p54nrb. These findings illuminate a pathway for the selective regulation of mRNA export during stress via retrotransposon activation. PMID:28334904
Comprehensive Analysis of Transcription Dynamics from Brain Samples Following Behavioral Experience
Turm, Hagit; Mukherjee, Diptendu; Haritan, Doron; Tahor, Maayan; Citri, Ami
2014-01-01
The encoding of experiences in the brain and the consolidation of long-term memories depend on gene transcription. Identifying the function of specific genes in encoding experience is one of the main objectives of molecular neuroscience. Furthermore, the functional association of defined genes with specific behaviors has implications for understanding the basis of neuropsychiatric disorders. Induction of robust transcription programs has been observed in the brains of mice following various behavioral manipulations. While some genetic elements are utilized recurrently following different behavioral manipulations and in different brain nuclei, transcriptional programs are overall unique to the inducing stimuli and the structure in which they are studied1,2. In this publication, a protocol is described for robust and comprehensive transcriptional profiling from brain nuclei of mice in response to behavioral manipulation. The protocol is demonstrated in the context of analysis of gene expression dynamics in the nucleus accumbens following acute cocaine experience. Subsequent to a defined in vivo experience, the target neural tissue is dissected; followed by RNA purification, reverse transcription and utilization of microfluidic arrays for comprehensive qPCR analysis of multiple target genes. This protocol is geared towards comprehensive analysis (addressing 50-500 genes) of limiting quantities of starting material, such as small brain samples or even single cells. The protocol is most advantageous for parallel analysis of multiple samples (e.g. single cells, dynamic analysis following pharmaceutical, viral or behavioral perturbations). However, the protocol could also serve for the characterization and quality assurance of samples prior to whole-genome studies by microarrays or RNAseq, as well as validation of data obtained from whole-genome studies. PMID:25225819
Caldwell, Julie M.; Collins, Margaret H.; Stucke, Emily M.; Putnam, Philip E.; Franciosi, James P.; Kushner, Jonathan P.; Abonia, J. Pablo; Rothenberg, Marc E.
2014-01-01
Background The definition of eosinophilic gastritis (EG) is currently limited to histological EG based on the tissue eosinophil count. Objective We aimed to provide additional fundamental information about the molecular, histopathological, and clinical characteristics of EG. Methods Genome-wide transcript profiles and histological features of gastric biopsies as well as blood eosinophil numbers were analyzed in EG and control patients (n = 15 each). Results The peak gastric antrum eosinophil count was 282.7 ± 163.9 eosinophils/400X high-power field (HPF) in EG and 11.0 ± 8.5 eosinophils/HPF in control patients (P = 6.1 × 10−7). EG patients (87%) had co-existing eosinophilic inflammation in multiple gastrointestinal segments; the esophagus represented the most common secondary site. Elevated peripheral blood eosinophil numbers (EG 1.09 ± 0.88 × 103 [K]/μl vs. control 0.09 ± 0.08 K/μl, P = .0027) positively correlated with peak gastric eosinophil counts (Pearson r2 = .8102, P < .0001). MIB-1+ (proliferating), CD117+ (mast cells), and FOXP3+ cells (regulatory and/or activated T cells) were increased in EG. Transcript profiling revealed changes in 8% of the genome in EG gastric tissue. Only 7% of this EG transcriptome overlapped with the eosinophilic esophagitis (EoE) transcriptome. Significantly increased IL4, IL5, IL13, IL17, CCL26 and mast cell-specific transcripts and decreased IL33 were observed. Conclusion EG is a systemic disorder involving profound blood and gastrointestinal tract eosinophilia, Th2 immunity, and a conserved gastric transcriptome markedly distinct from the EoE transcriptome. The data herein define germane cellular and molecular pathways of EG and provide a basis for improving diagnosis and treatment. PMID:25234644
A systems analysis of the chemosensitivity of breast cancer cells to the polyamine analogue PG-11047
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kuo, Wen-Lin; Das, Debopriya; Ziyad, Safiyyah
2009-11-14
Polyamines regulate important cellular functions and polyamine dysregulation frequently occurs in cancer. The objective of this study was to use a systems approach to study the relative effects of PG-11047, a polyamine analogue, across breast cancer cells derived from different patients and to identify genetic markers associated with differential cytotoxicity. A panel of 48 breast cell lines that mirror many transcriptional and genomic features present in primary human breast tumours were used to study the antiproliferative activity of PG-11047. Sensitive cell lines were further examined for cell cycle distribution and apoptotic response. Cell line responses, quantified by the GI50 (dosemore » required for 50% relative growth inhibition) were correlated with the omic profiles of the cell lines to identify markers that predict response and cellular functions associated with drug sensitivity. The concentrations of PG-11047 needed to inhibit growth of members of the panel of breast cell lines varied over a wide range, with basal-like cell lines being inhibited at lower concentrations than the luminal cell lines. Sensitive cell lines showed a significant decrease in S phase fraction at doses that produced little apoptosis. Correlation of the GI50 values with the omic profiles of the cell lines identified genomic, transcriptional and proteomic variables associated with response. A 13-gene transcriptional marker set was developed as a predictor of response to PG-11047 that warrants clinical evaluation. Analyses of the pathways, networks and genes associated with response to PG-11047 suggest that response may be influenced by interferon signaling and differential inhibition of aspects of motility and epithelial to mesenchymal transition.« less
Tu, Ying; Xu, Dan; Feng, Jiaqi; He, Li
2017-01-01
Sensitive skin (SS) is a condition of subjective cutaneous hyper-reactivity. The role of long non-coding RNAs (lncRNAs) in subjects with SS is unclear. Therefore, the aim of the present study was to provide a comprehensive profile of the mRNAs and lncRNAs in subjects with SS. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis presented the characteristics of associated protein-coding genes. In addition, a co-expression network of lncRNA and mRNA was constructed to identify potential underlying regulation targets; the results were verified by quantitative real-time PCR (qRT-PCR) and RNA-seq analyses in patients with SS and normal samples. Compared with the normal skin group, 266 novel lncRNAs and 6750 annotated lncRNAs were identified in the SS group. A total of 71 lncRNA transcripts and 2615 mRNA transcripts were differentially expressed (P < 0.05). The heat signature of the SS samples could be distinguished from the normal skin samples, whereas the majority of the genes that were present in enriched pathways were those that participated in focal adhesion, PI3K-Akt signaling, and cancer-related pathways. Five transcripts were selected for qRT-PCR analysis and the results were consistent with RNA-seq. The results suggested that LNC_000265 may play a role in the epidermal barrier structure of patient with SS. The data suggest novel genes and pathways that may be involved in the pathogenesis of SS and highlight potential targets that could be used for individualized treatment applications. PMID:29383128
Joseph, Shai R; Pálfy, Máté; Hilbert, Lennart; Kumar, Mukesh; Karschau, Jens; Zaburdaev, Vasily; Shevchenko, Andrej; Vastenhouw, Nadine L
2017-01-01
Upon fertilization, the genome of animal embryos remains transcriptionally inactive until the maternal-to-zygotic transition. At this time, the embryo takes control of its development and transcription begins. How the onset of zygotic transcription is regulated remains unclear. Here, we show that a dynamic competition for DNA binding between nucleosome-forming histones and transcription factors regulates zebrafish genome activation. Taking a quantitative approach, we found that the concentration of non-DNA-bound core histones sets the time for the onset of transcription. The reduction in nuclear histone concentration that coincides with genome activation does not affect nucleosome density on DNA, but allows transcription factors to compete successfully for DNA binding. In agreement with this, transcription factor binding is sensitive to histone levels and the concentration of transcription factors also affects the time of transcription. Our results demonstrate that the relative levels of histones and transcription factors regulate the onset of transcription in the embryo. DOI: http://dx.doi.org/10.7554/eLife.23326.001 PMID:28425915
Joseph, Shai R; Pálfy, Máté; Hilbert, Lennart; Kumar, Mukesh; Karschau, Jens; Zaburdaev, Vasily; Shevchenko, Andrej; Vastenhouw, Nadine L
2017-04-20
Upon fertilization, the genome of animal embryos remains transcriptionally inactive until the maternal-to-zygotic transition. At this time, the embryo takes control of its development and transcription begins. How the onset of zygotic transcription is regulated remains unclear. Here, we show that a dynamic competition for DNA binding between nucleosome-forming histones and transcription factors regulates zebrafish genome activation. Taking a quantitative approach, we found that the concentration of non-DNA-bound core histones sets the time for the onset of transcription. The reduction in nuclear histone concentration that coincides with genome activation does not affect nucleosome density on DNA, but allows transcription factors to compete successfully for DNA binding. In agreement with this, transcription factor binding is sensitive to histone levels and the concentration of transcription factors also affects the time of transcription. Our results demonstrate that the relative levels of histones and transcription factors regulate the onset of transcription in the embryo.
2011-01-01
Background Grapes (Vitis vinifera L.) are economically the most important fruit crop worldwide. However, the complexity of molecular and biochemical events that lead to the onset of ripening of nonclimacteric fruits is not fully understood which is further complicated in grapes due to seasonal and cultivar specific variation. The Portuguese wine variety Trincadeira gives rise to high quality wines but presents extremely irregular berry ripening among seasons probably due to high susceptibility to abiotic and biotic stresses. Results Ripening of Trincadeira grapes was studied taking into account the transcriptional and metabolic profilings complemented with biochemical data. The mRNA expression profiles of four time points spanning developmental stages from pea size green berries, through véraison and mature berries (EL 32, EL 34, EL 35 and EL 36) and in two seasons (2007 and 2008) were compared using the Affymetrix GrapeGen® genome array containing 23096 probesets corresponding to 18726 unique sequences. Over 50% of these probesets were significantly differentially expressed (1.5 fold) between at least two developmental stages. A common set of modulated transcripts corresponding to 5877 unigenes indicates the activation of common pathways between years despite the irregular development of Trincadeira grapes. These unigenes were assigned to the functional categories of "metabolism", "development", "cellular process", "diverse/miscellanenous functions", "regulation overview", "response to stimulus, stress", "signaling", "transport overview", "xenoprotein, transposable element" and "unknown". Quantitative RT-PCR validated microarrays results being carried out for eight selected genes and five developmental stages (EL 32, EL 34, EL 35, EL 36 and EL 38). Metabolic profiling using 1H NMR spectroscopy associated to two-dimensional techniques showed the importance of metabolites related to oxidative stress response, amino acid and sugar metabolism as well as secondary metabolism. These results were integrated with transcriptional profiling obtained using genome array to provide new information regarding the network of events leading to grape ripening. Conclusions Altogether the data obtained provides the most extensive survey obtained so far for gene expression and metabolites accumulated during grape ripening. Moreover, it highlighted information obtained in a poorly known variety exhibiting particular characteristics that may be cultivar specific or dependent upon climatic conditions. Several genes were identified that had not been previously reported in the context of grape ripening namely genes involved in carbohydrate and amino acid metabolisms as well as in growth regulators; metabolism, epigenetic factors and signaling pathways. Some of these genes were annotated as receptors, transcription factors, and kinases and constitute good candidates for functional analysis in order to establish a model for ripening control of a non-climacteric fruit. PMID:22047180
Genome-wide profiling of DNA-binding proteins using barcode-based multiplex Solexa sequencing.
Raghav, Sunil Kumar; Deplancke, Bart
2012-01-01
Chromatin immunoprecipitation (ChIP) is a commonly used technique to detect the in vivo binding of proteins to DNA. ChIP is now routinely paired to microarray analysis (ChIP-chip) or next-generation sequencing (ChIP-Seq) to profile the DNA occupancy of proteins of interest on a genome-wide level. Because ChIP-chip introduces several biases, most notably due to the use of a fixed number of probes, ChIP-Seq has quickly become the method of choice as, depending on the sequencing depth, it is more sensitive, quantitative, and provides a greater binding site location resolution. With the ever increasing number of reads that can be generated per sequencing run, it has now become possible to analyze several samples simultaneously while maintaining sufficient sequence coverage, thus significantly reducing the cost per ChIP-Seq experiment. In this chapter, we provide a step-by-step guide on how to perform multiplexed ChIP-Seq analyses. As a proof-of-concept, we focus on the genome-wide profiling of RNA Polymerase II as measuring its DNA occupancy at different stages of any biological process can provide insights into the gene regulatory mechanisms involved. However, the protocol can also be used to perform multiplexed ChIP-Seq analyses of other DNA-binding proteins such as chromatin modifiers and transcription factors.
Systematic Analysis of the Transcriptional Switch Inducing Migration of Border Cells
Borghese, Lodovica; Fletcher, Georgina; Mathieu, Juliette; Atzberger, Ann; Eades, William C.; Cagan, Ross L.; Rørth, Pernille
2010-01-01
Summary Cell migration within a natural context is tightly controlled, often by specific transcription factors. However, the switch from stationary to migratory behavior is poorly understood. Border cells perform a spatially and temporally controlled invasive migration during Drosophila oogenesis. Slbo, a C/EBP family transcriptional activator, is required for them to become migratory. We purified wild-type and slbo mutant border cells as well as nonmigratory follicle cells and performed comparative whole-genome expression profiling, followed by functional tests of the contributions of identified targets to migration. About 300 genes were significantly upregulated in border cells, many dependent on Slbo. Among these, the microtubule regulator Stathmin was strongly upregulated and was required for normal migration. Actin cytoskeleton regulators were also induced, including, surprisingly, a large cluster of “muscle-specific” genes. We conclude that Slbo induces multiple cytoskeletal effectors, and that each contributes to the behavioral changes in border cells. PMID:16580994
Gat-Viks, Irit; Chevrier, Nicolas; Wilentzik, Roni; Eisenhaure, Thomas; Raychowdhury, Raktima; Steuerman, Yael; Shalek, Alex; Hacohen, Nir; Amit, Ido; Regev, Aviv
2013-01-01
Individual genetic variation affects gene expression in response to stimuli, often by influencing complex molecular circuits. Here we combine genomic and intermediate-scale transcriptional profiling with computational methods to identify variants that affect the responsiveness of genes to stimuli (responsiveness QTLs; reQTLs) and to position these variants in molecular circuit diagrams. We apply this approach to study variation in transcriptional responsiveness to pathogen components in dendritic cells from recombinant inbred mouse strains. We identify reQTLs that correlate with particular stimuli and position them in known pathways. For example, in response to a virus-like stimulus, a trans-acting variant acts as an activator of the antiviral response; using RNAi, we identify Rgs16 as the likely causal gene. Our approach charts an experimental and analytic path to decipher the mechanisms underlying genetic variation in circuits that control responses to stimuli. PMID:23503680
Genetic, molecular and physiological basis of variation in Drosophila gut immunocompetence.
Bou Sleiman, Maroun S; Osman, Dani; Massouras, Andreas; Hoffmann, Ary A; Lemaitre, Bruno; Deplancke, Bart
2015-07-27
Gut immunocompetence involves immune, stress and regenerative processes. To investigate the determinants underlying inter-individual variation in gut immunocompetence, we perform enteric infection of 140 Drosophila lines with the entomopathogenic bacterium Pseudomonas entomophila and observe extensive variation in survival. Using genome-wide association analysis, we identify several novel immune modulators. Transcriptional profiling further shows that the intestinal molecular state differs between resistant and susceptible lines, already before infection, with one transcriptional module involving genes linked to reactive oxygen species (ROS) metabolism contributing to this difference. This genetic and molecular variation is physiologically manifested in lower ROS activity, lower susceptibility to ROS-inducing agent, faster pathogen clearance and higher stem cell activity in resistant versus susceptible lines. This study provides novel insights into the determinants underlying population-level variability in gut immunocompetence, revealing how relatively minor, but systematic genetic and transcriptional variation can mediate overt physiological differences that determine enteric infection susceptibility.
Gat-Viks, Irit; Chevrier, Nicolas; Wilentzik, Roni; Eisenhaure, Thomas; Raychowdhury, Raktima; Steuerman, Yael; Shalek, Alex K; Hacohen, Nir; Amit, Ido; Regev, Aviv
2013-04-01
Individual genetic variation affects gene responsiveness to stimuli, often by influencing complex molecular circuits. Here we combine genomic and intermediate-scale transcriptional profiling with computational methods to identify variants that affect the responsiveness of genes to stimuli (responsiveness quantitative trait loci or reQTLs) and to position these variants in molecular circuit diagrams. We apply this approach to study variation in transcriptional responsiveness to pathogen components in dendritic cells from recombinant inbred mouse strains. We identify reQTLs that correlate with particular stimuli and position them in known pathways. For example, in response to a virus-like stimulus, a trans-acting variant responds as an activator of the antiviral response; using RNA interference, we identify Rgs16 as the likely causal gene. Our approach charts an experimental and analytic path to decipher the mechanisms underlying genetic variation in circuits that control responses to stimuli.
Li, Zhiqian; Zhang, Chen; Guo, Yurui; Niu, Weili; Wang, Yuejin; Xu, Yan
2017-09-21
The HD-Zip family has a diversity of functions during plant development. In this study, we identify 33 HD-Zip transcription factors in grape and detect their expressions in ovules and somatic embryos, as well as in various vegetative organs. A genome-wide survey for HD-Zip transcription factors in Vitis was conducted based on the 12 X grape genome (V. vinifera L.). A total of 33 members were identified and classified into four subfamilies (I-IV) based on phylogeny analysis with Arabidopsis, rice and maize. VvHDZs in the same subfamily have similar protein motifs and intron/exon structures. An evaluation of duplication events suggests several HD-Zip genes arose before the divergence of the grape and Arabidopsis lineages. The 33 members of HD-Zip were differentially expressed in ovules of the stenospermic grape, Thompson Seedless and of the seeded grape, Pinot noir. Most have higher expressions during ovule abortion in Thompson Seedless. In addition, transcripts of the HD-Zip family were also detected in somatic embryogenesis of Thompson Seedless and in different vegetative organs of Thompson Seedless at varying levels. Additionally, VvHDZ28 is located in the nucleus and had transcriptional activity consistent with the typical features of the HD-Zip family. Our results provide a foundation for future grape HD-Zip gene function research. The identification and expression profiles of the HD-Zip transcription factors in grape, reveal their diverse roles during ovule abortion and organ development. Our results lay a foundation for functional analysis of grape HDZ genes.
Bruce, A. Gregory; Barcy, Serge; DiMaio, Terri; Gan, Emilia; Garrigues, H. Jacques; Lagunoff, Michael; Rose, Timothy M.
2017-01-01
The transcriptome of the Kaposi’s sarcoma-associated herpesvirus (KSHV/HHV8) after primary latent infection of human blood (BEC), lymphatic (LEC) and immortalized (TIME) endothelial cells was analyzed using RNAseq, and compared to long-term latency in BCBL-1 lymphoma cells. Naturally expressed transcripts were obtained without artificial induction, and a comprehensive annotation of the KSHV genome was determined. A set of unique coding sequence (UCDS) features and a process to resolve overlapping transcripts were developed to accurately quantitate transcript levels from specific promoters. Similar patterns of KSHV expression were detected in BCBL-1 cells undergoing long-term latent infections and in primary latent infections of both BEC and LEC cultures. High expression levels of poly-adenylated nuclear (PAN) RNA and spliced and unspliced transcripts encoding the K12 Kaposin B/C complex and associated microRNA region were detected, with an elevated expression of a large set of lytic genes in all latently infected cultures. Quantitation of non-overlapping regions of transcripts across the complete KSHV genome enabled for the first time accurate evaluation of the KSHV transcriptome associated with viral latency in different cell types. Hierarchical clustering applied to a gene correlation matrix identified modules of co-regulated genes with similar correlation profiles, which corresponded with biological and functional similarities of the encoded gene products. Gene modules were differentially upregulated during latency in specific cell types indicating a role for cellular factors associated with differentiated and/or proliferative states of the host cell to influence viral gene expression. PMID:28335496
RNAi Functions in Adaptive Reprogramming of the Genome | Center for Cancer Research
The regulation of transcribing DNA into RNA, including the production, processing, and degradation of RNA transcripts, affects the expression and the regulation of the genome in ways that are just beginning to be unraveled. A surprising discovery in recent years is that the vast majority of the genome is transcribed to yield an abundance of RNA transcripts. Many transcripts
Optimization of cDNA-AFLP experiments using genomic sequence data.
Kivioja, Teemu; Arvas, Mikko; Saloheimo, Markku; Penttilä, Merja; Ukkonen, Esko
2005-06-01
cDNA amplified fragment length polymorphism (cDNA-AFLP) is one of the few genome-wide level expression profiling methods capable of finding genes that have not yet been cloned or even predicted from sequence but have interesting expression patterns under the studied conditions. In cDNA-AFLP, a complex cDNA mixture is divided into small subsets using restriction enzymes and selective PCR. A large cDNA-AFLP experiment can require a substantial amount of resources, such as hundreds of PCR amplifications and gel electrophoresis runs, followed by manual cutting of a large number of bands from the gels. Our aim was to test whether this workload can be reduced by rational design of the experiment. We used the available genomic sequence information to optimize cDNA-AFLP experiments beforehand so that as many transcripts as possible could be profiled with a given amount of resources. Optimization of the selection of both restriction enzymes and selective primers for cDNA-AFLP experiments has not been performed previously. The in silico tests performed suggest that substantial amounts of resources can be saved by the optimization of cDNA-AFLP experiments.
2014-01-01
Background Imprinted genes have been extensively documented in eutherian mammals and found to exhibit significant interspecific variation in the suites of genes that are imprinted and in their regulation between tissues and developmental stages. Much less is known about imprinted loci in metatherian (marsupial) mammals, wherein studies have been limited to a small number of genes previously known to be imprinted in eutherians. We describe the first ab initio search for imprinted marsupial genes, in fibroblasts from the opossum, Monodelphis domestica, based on a genome-wide ChIP-seq strategy to identify promoters that are simultaneously marked by mutually exclusive, transcriptionally opposing histone modifications. Results We identified a novel imprinted gene (Meis1) and two additional monoallelically expressed genes, one of which (Cstb) showed allele-specific, but non-imprinted expression. Imprinted vs. allele-specific expression could not be resolved for the third monoallelically expressed gene (Rpl17). Transcriptionally opposing histone modifications H3K4me3, H3K9Ac, and H3K9me3 were found at the promoters of all three genes, but differential DNA methylation was not detected at CpG islands at any of these promoters. Conclusions In generating the first genome-wide histone modification profiles for a marsupial, we identified the first gene that is imprinted in a marsupial but not in eutherian mammals. This outcome demonstrates the practicality of an ab initio discovery strategy and implicates histone modification, but not differential DNA methylation, as a conserved mechanism for marking imprinted genes in all therian mammals. Our findings suggest that marsupials use multiple epigenetic mechanisms for imprinting and support the concept that lineage-specific selective forces can produce sets of imprinted genes that differ between metatherian and eutherian lines. PMID:24484454
Ayadi, M; Hanana, M; Kharrat, N; Merchaoui, H; Marzoug, R Ben; Lauvergeat, V; Rebaï, A; Mzid, R
2016-10-01
WRKY transcription factors belong to a large family of plant transcriptional regulators whose members have been reported to be involved in a wide range of biological roles including plant development, adaptation to environmental constraints and response to several diseases. However, little or poor information is available about WRKY's in Citrus. The recent release of completely assembled genomes sequences of Citrus sinensis and Citrus clementina and the availability of ESTs sequences from other citrus species allowed us to perform a genome survey for Citrus WRKY proteins. In the present study, we identified 100 WRKY members from C. sinensis (51), C. clementina (48) and Citrus unshiu (1), and analyzed their chromosomal distribution, gene structure, gene duplication, syntenic relation and phylogenetic analysis. A phylogenetic tree of 100 Citrus WRKY sequences with their orthologs from Arabidopsis has distinguished seven groups. The CsWRKY genes were distributed across all ten sweet orange chromosomes. A comprehensive approach and an integrative analysis of Citrus WRKY gene expression revealed variable profiles of expression within tissues and stress conditions indicating functional diversification. Thus, candidate Citrus WRKY genes have been proposed as potentially involved in fruit acidification, essential oil biosynthesis and abiotic/biotic stress tolerance. Our results provided essential prerequisites for further WRKY genes cloning and functional analysis with an aim of citrus crop improvement.
Salt-Responsive Transcriptome Profiling of Suaeda glauca via RNA Sequencing
Jin, Hangxia; Dong, Dekun; Yang, Qinghua; Zhu, Danhua
2016-01-01
Background Suaeda glauca, a succulent halophyte of the Chenopodiaceae family, is widely distributed in coastal areas of China. Suaeda glauca is highly resistant to salt and alkali stresses. In the present study, the salt-responsive transcriptome of Suaeda glauca was analyzed to identify genes involved in salt tolerance and study halophilic mechanisms in this halophyte. Results Illumina HiSeq 2500 was used to sequence cDNA libraries from salt-treated and control samples with three replicates each treatment. De novo assembly of the six transcriptomes identified 75,445 unigenes. A total of 23,901 (31.68%) unigenes were annotated. Compared with transcriptomes from the three salt-treated and three salt-free samples, 231 differentially expressed genes (DEGs) were detected (including 130 up-regulated genes and 101 down-regulated genes), and 195 unigenes were functionally annotated. Based on the Gene Ontology (GO), Clusters of Orthologous Groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) classifications of the DEGs, more attention should be paid to transcripts associated with signal transduction, transporters, the cell wall and growth, defense metabolism and transcription factors involved in salt tolerance. Conclusions This report provides a genome-wide transcriptional analysis of a halophyte, Suaeda glauca, under salt stress. Further studies of the genetic basis of salt tolerance in halophytes are warranted. PMID:26930632
Muleke, Everlyne M’mbone; Jabir, Bashir Mohammed; Xie, Yang; Zhu, Xianwen; Cheng, Wanwan
2017-01-01
NAC (NAM, no apical meristem; ATAF, Arabidopsis transcription activation factor and CUC, cup-shaped cotyledon) proteins are among the largest transcription factor (TF) families playing fundamental biological processes, including cell expansion and differentiation, and hormone signaling in response to biotic and abiotic stresses. In this study, 172 RsNACs comprising 17 membrane-bound members were identified from the whole radish genome. In total, 98 RsNAC genes were non-uniformly distributed across the nine radish chromosomes. In silico analysis revealed that expression patterns of several NAC genes were tissue-specific such as a preferential expression in roots and leaves. In addition, 21 representative NAC genes were selected to investigate their responses to heavy metals (HMs), salt, heat, drought and abscisic acid (ABA) stresses using real-time polymerase chain reaction (RT-qPCR). As a result, differential expressions among these genes were identified where RsNAC023 and RsNAC080 genes responded positively to all stresses except ABA, while RsNAC145 responded more actively to salt, heat and drought stresses compared with other genes. The results provides more valuable information and robust candidate genes for future functional analysis for improving abiotic stress tolerances in radish. PMID:29259849
A genome-wide longitudinal transcriptome analysis of the aging model Podospora anserina.
Philipp, Oliver; Hamann, Andrea; Servos, Jörg; Werner, Alexandra; Koch, Ina; Osiewacz, Heinz D
2013-01-01
Aging of biological systems is controlled by various processes which have a potential impact on gene expression. Here we report a genome-wide transcriptome analysis of the fungal aging model Podospora anserina. Total RNA of three individuals of defined age were pooled and analyzed by SuperSAGE (serial analysis of gene expression). A bioinformatics analysis identified different molecular pathways to be affected during aging. While the abundance of transcripts linked to ribosomes and to the proteasome quality control system were found to decrease during aging, those associated with autophagy increase, suggesting that autophagy may act as a compensatory quality control pathway. Transcript profiles associated with the energy metabolism including mitochondrial functions were identified to fluctuate during aging. Comparison of wild-type transcripts, which are continuously down-regulated during aging, with those down-regulated in the long-lived, copper-uptake mutant grisea, validated the relevance of age-related changes in cellular copper metabolism. Overall, we (i) present a unique age-related data set of a longitudinal study of the experimental aging model P. anserina which represents a reference resource for future investigations in a variety of organisms, (ii) suggest autophagy to be a key quality control pathway that becomes active once other pathways fail, and (iii) present testable predictions for subsequent experimental investigations.
RNA-Rocket: an RNA-Seq analysis resource for infectious disease research
Warren, Andrew S.; Aurrecoechea, Cristina; Brunk, Brian; Desai, Prerak; Emrich, Scott; Giraldo-Calderón, Gloria I.; Harb, Omar; Hix, Deborah; Lawson, Daniel; Machi, Dustin; Mao, Chunhong; McClelland, Michael; Nordberg, Eric; Shukla, Maulik; Vosshall, Leslie B.; Wattam, Alice R.; Will, Rebecca; Yoo, Hyun Seung; Sobral, Bruno
2015-01-01
Motivation: RNA-Seq is a method for profiling transcription using high-throughput sequencing and is an important component of many research projects that wish to study transcript isoforms, condition specific expression and transcriptional structure. The methods, tools and technologies used to perform RNA-Seq analysis continue to change, creating a bioinformatics challenge for researchers who wish to exploit these data. Resources that bring together genomic data, analysis tools, educational material and computational infrastructure can minimize the overhead required of life science researchers. Results: RNA-Rocket is a free service that provides access to RNA-Seq and ChIP-Seq analysis tools for studying infectious diseases. The site makes available thousands of pre-indexed genomes, their annotations and the ability to stream results to the bioinformatics resources VectorBase, EuPathDB and PATRIC. The site also provides a combination of experimental data and metadata, examples of pre-computed analysis, step-by-step guides and a user interface designed to enable both novice and experienced users of RNA-Seq data. Availability and implementation: RNA-Rocket is available at rnaseq.pathogenportal.org. Source code for this project can be found at github.com/cidvbi/PathogenPortal. Contact: anwarren@vt.edu Supplementary information: Supplementary materials are available at Bioinformatics online. PMID:25573919
RNA-Rocket: an RNA-Seq analysis resource for infectious disease research.
Warren, Andrew S; Aurrecoechea, Cristina; Brunk, Brian; Desai, Prerak; Emrich, Scott; Giraldo-Calderón, Gloria I; Harb, Omar; Hix, Deborah; Lawson, Daniel; Machi, Dustin; Mao, Chunhong; McClelland, Michael; Nordberg, Eric; Shukla, Maulik; Vosshall, Leslie B; Wattam, Alice R; Will, Rebecca; Yoo, Hyun Seung; Sobral, Bruno
2015-05-01
RNA-Seq is a method for profiling transcription using high-throughput sequencing and is an important component of many research projects that wish to study transcript isoforms, condition specific expression and transcriptional structure. The methods, tools and technologies used to perform RNA-Seq analysis continue to change, creating a bioinformatics challenge for researchers who wish to exploit these data. Resources that bring together genomic data, analysis tools, educational material and computational infrastructure can minimize the overhead required of life science researchers. RNA-Rocket is a free service that provides access to RNA-Seq and ChIP-Seq analysis tools for studying infectious diseases. The site makes available thousands of pre-indexed genomes, their annotations and the ability to stream results to the bioinformatics resources VectorBase, EuPathDB and PATRIC. The site also provides a combination of experimental data and metadata, examples of pre-computed analysis, step-by-step guides and a user interface designed to enable both novice and experienced users of RNA-Seq data. RNA-Rocket is available at rnaseq.pathogenportal.org. Source code for this project can be found at github.com/cidvbi/PathogenPortal. anwarren@vt.edu Supplementary materials are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
New Era of Studying RNA Secondary Structure and Its Influence on Gene Regulation in Plants.
Yang, Xiaofei; Yang, Minglei; Deng, Hongjing; Ding, Yiliang
2018-01-01
The dynamic structure of RNA plays a central role in post-transcriptional regulation of gene expression such as RNA maturation, degradation, and translation. With the rise of next-generation sequencing, the study of RNA structure has been transformed from in vitro low-throughput RNA structure probing methods to in vivo high-throughput RNA structure profiling. The development of these methods enables incremental studies on the function of RNA structure to be performed, revealing new insights of novel regulatory mechanisms of RNA structure in plants. Genome-wide scale RNA structure profiling allows us to investigate general RNA structural features over 10s of 1000s of mRNAs and to compare RNA structuromes between plant species. Here, we provide a comprehensive and up-to-date overview of: (i) RNA structure probing methods; (ii) the biological functions of RNA structure; (iii) genome-wide RNA structural features corresponding to their regulatory mechanisms; and (iv) RNA structurome evolution in plants.
Transcriptome changes and cAMP oscillations in an archaeal cell cycle.
Baumann, Anke; Lange, Christian; Soppa, Jörg
2007-06-11
The cell cycle of all organisms includes mass increase by a factor of two, replication of the genetic material, segregation of the genome to different parts of the cell, and cell division into two daughter cells. It is tightly regulated and typically includes cell cycle-specific oscillations of the levels of transcripts, proteins, protein modifications, and signaling molecules. Until now cell cycle-specific transcriptome changes have been described for four eukaryotic species ranging from yeast to human, but only for two prokaryotic species. Similarly, oscillations of small signaling molecules have been identified in very few eukaryotic species, but not in any prokaryote. A synchronization procedure for the archaeon Halobacterium salinarum was optimized, so that nearly 100% of all cells divide in a time interval that is 1/4th of the generation time of exponentially growing cells. The method was used to characterize cell cycle-dependent transcriptome changes using a genome-wide DNA microarray. The transcript levels of 87 genes were found to be cell cycle-regulated, corresponding to 3% of all genes. They could be clustered into seven groups with different transcript level profiles. Cluster-specific sequence motifs were detected around the start of the genes that are predicted to be involved in cell cycle-specific transcriptional regulation. Notably, many cell cycle genes that have oscillating transcript levels in eukaryotes are not regulated on the transcriptional level in H. salinarum. Synchronized cultures were also used to identify putative small signaling molecules. H. salinarum was found to contain a basal cAMP concentration of 200 microM, considerably higher than that of yeast. The cAMP concentration is shortly induced directly prior to and after cell division, and thus cAMP probably is an important signal for cell cycle progression. The analysis of cell cycle-specific transcriptome changes of H. salinarum allowed to identify a strategy of transcript level regulation that is different from all previously characterized species. The transcript levels of only 3% of all genes are regulated, a fraction that is considerably lower than has been reported for four eukaryotic species (6%-28%) and for the bacterium C. crescentus (19%). It was shown that cAMP is present in significant concentrations in an archaeon, and the phylogenetic profile of the adenylate cyclase indicates that this signaling molecule is widely distributed in archaea. The occurrence of cell cycle-dependent oscillations of the cAMP concentration in an archaeon and in several eukaryotic species indicates that cAMP level changes might be a phylogenetically old signal for cell cycle progression.
TEA: the epigenome platform for Arabidopsis methylome study.
Su, Sheng-Yao; Chen, Shu-Hwa; Lu, I-Hsuan; Chiang, Yih-Shien; Wang, Yu-Bin; Chen, Pao-Yang; Lin, Chung-Yen
2016-12-22
Bisulfite sequencing (BS-seq) has become a standard technology to profile genome-wide DNA methylation at single-base resolution. It allows researchers to conduct genome-wise cytosine methylation analyses on issues about genomic imprinting, transcriptional regulation, cellular development and differentiation. One single data from a BS-Seq experiment is resolved into many features according to the sequence contexts, making methylome data analysis and data visualization a complex task. We developed a streamlined platform, TEA, for analyzing and visualizing data from whole-genome BS-Seq (WGBS) experiments conducted in the model plant Arabidopsis thaliana. To capture the essence of the genome methylation level and to meet the efficiency for running online, we introduce a straightforward method for measuring genome methylation in each sequence context by gene. The method is scripted in Java to process BS-Seq mapping results. Through a simple data uploading process, the TEA server deploys a web-based platform for deep analysis by linking data to an updated Arabidopsis annotation database and toolkits. TEA is an intuitive and efficient online platform for analyzing the Arabidopsis genomic DNA methylation landscape. It provides several ways to help users exploit WGBS data. TEA is freely accessible for academic users at: http://tea.iis.sinica.edu.tw .
Microbial metatranscriptomics in a permanent marine oxygen minimum zone.
Stewart, Frank J; Ulloa, Osvaldo; DeLong, Edward F
2012-01-01
Simultaneous characterization of taxonomic composition, metabolic gene content and gene expression in marine oxygen minimum zones (OMZs) has potential to broaden perspectives on the microbial and biogeochemical dynamics in these environments. Here, we present a metatranscriptomic survey of microbial community metabolism in the Eastern Tropical South Pacific OMZ off northern Chile. Community RNA was sampled in late austral autumn from four depths (50, 85, 110, 200 m) extending across the oxycline and into the upper OMZ. Shotgun pyrosequencing of cDNA yielded 180,000 to 550,000 transcript sequences per depth. Based on functional gene representation, transcriptome samples clustered apart from corresponding metagenome samples from the same depth, highlighting the discrepancies between metabolic potential and actual transcription. BLAST-based characterizations of non-ribosomal RNA sequences revealed a dominance of genes involved with both oxidative (nitrification) and reductive (anammox, denitrification) components of the marine nitrogen cycle. Using annotations of protein-coding genes as proxies for taxonomic affiliation, we observed depth-specific changes in gene expression by key functional taxonomic groups. Notably, transcripts most closely matching the genome of the ammonia-oxidizing archaeon Nitrosopumilus maritimus dominated the transcriptome in the upper three depths, representing one in five protein-coding transcripts at 85 m. In contrast, transcripts matching the anammox bacterium Kuenenia stuttgartiensis dominated at the core of the OMZ (200 m; 1 in 12 protein-coding transcripts). The distribution of N. maritimus-like transcripts paralleled that of transcripts matching ammonia monooxygenase genes, which, despite being represented by both bacterial and archaeal sequences in the community DNA, were dominated (> 99%) by archaeal sequences in the RNA, suggesting a substantial role for archaeal nitrification in the upper OMZ. These data, as well as those describing other key OMZ metabolic processes (e.g. sulfur oxidation), highlight gene-specific expression patterns in the context of the entire community transcriptome, as well as identify key functional groups for taxon-specific genomic profiling. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.
Gonsalves, Sarah E.; Moses, Alan M.; Razak, Zak; Robert, Francois; Westwood, J. Timothy
2011-01-01
During heat shock (HS) and other stresses, HS gene transcription in eukaryotes is up-regulated by the transcription factor heat shock factor (HSF). While the identities of the major HS genes have been known for more than 30 years, it has been suspected that HSF binds to numerous other genes and potentially regulates their transcription. In this study, we have used a chromatin immunoprecipitation and microarray (ChIP-chip) approach to identify 434 regions in the Drosophila genome that are bound by HSF. We have also performed a transcript analysis of heat shocked Kc167 cells and third instar larvae and compared them to HSF binding sites. The heat-induced transcription profiles were quite different between cells and larvae and surprisingly only about 10% of the genes associated with HSF binding sites show changed transcription. There were also genes that showed changes in transcript levels that did not appear to correlate with HSF binding sites. Analysis of the locations of the HSF binding sites revealed that 57% were contained within genes with approximately 2/3rds of these sites being in introns. We also found that the insulator protein, BEAF, has enriched binding prior to HS to promoters of genes that are bound by HSF upon HS but that are not transcriptionally induced during HS. When the genes associated with HSF binding sites in promoters were analyzed for gene ontology terms, categories such as stress response and transferase activity were enriched whereas analysis of genes having HSF binding sites in introns identified those categories plus ones related to developmental processes and reproduction. These results suggest that Drosophila HSF may be regulating many genes besides the known HS genes and that some of these genes may be regulated during non-stress conditions. PMID:21264254
Gonsalves, Sarah E; Moses, Alan M; Razak, Zak; Robert, Francois; Westwood, J Timothy
2011-01-14
During heat shock (HS) and other stresses, HS gene transcription in eukaryotes is up-regulated by the transcription factor heat shock factor (HSF). While the identities of the major HS genes have been known for more than 30 years, it has been suspected that HSF binds to numerous other genes and potentially regulates their transcription. In this study, we have used a chromatin immunoprecipitation and microarray (ChIP-chip) approach to identify 434 regions in the Drosophila genome that are bound by HSF. We have also performed a transcript analysis of heat shocked Kc167 cells and third instar larvae and compared them to HSF binding sites. The heat-induced transcription profiles were quite different between cells and larvae and surprisingly only about 10% of the genes associated with HSF binding sites show changed transcription. There were also genes that showed changes in transcript levels that did not appear to correlate with HSF binding sites. Analysis of the locations of the HSF binding sites revealed that 57% were contained within genes with approximately 2/3rds of these sites being in introns. We also found that the insulator protein, BEAF, has enriched binding prior to HS to promoters of genes that are bound by HSF upon HS but that are not transcriptionally induced during HS. When the genes associated with HSF binding sites in promoters were analyzed for gene ontology terms, categories such as stress response and transferase activity were enriched whereas analysis of genes having HSF binding sites in introns identified those categories plus ones related to developmental processes and reproduction. These results suggest that Drosophila HSF may be regulating many genes besides the known HS genes and that some of these genes may be regulated during non-stress conditions.
Anderson, Letícia; Gomes, Monete Rajão; daSilva, Lucas Ferreira; Pereira, Adriana da Silva Andrade; Mourão, Marina M.; Romier, Christophe; Pierce, Raymond
2017-01-01
Background Schistosomiasis is a parasitic disease infecting hundreds of millions of people worldwide. Treatment depends on a single drug, praziquantel, which kills the Schistosoma spp. parasite only at the adult stage. HDAC inhibitors (HDACi) such as Trichostatin A (TSA) induce parasite mortality in vitro (schistosomula and adult worms), however the downstream effects of histone hyperacetylation on the parasite are not known. Methodology/Principal findings TSA treatment of adult worms in vitro increased histone acetylation at H3K9ac and H3K14ac, which are transcription activation marks, not affecting the unrelated transcription repression mark H3K27me3. We investigated the effect of TSA HDACi on schistosomula gene expression at three different time points, finding a marked genome-wide change in the transcriptome profile. Gene transcription activity was correlated with changes on the chromatin acetylation mark at gene promoter regions. Moreover, combining expression data with ChIP-Seq public data for schistosomula, we found that differentially expressed genes having the H3K4me3 mark at their promoter region in general showed transcription activation upon HDACi treatment, compared with those without the mark, which showed transcription down-regulation. Affected genes are enriched for DNA replication processes, most of them being up-regulated. Twenty out of 22 genes encoding proteins involved in reducing reactive oxygen species accumulation were down-regulated. Dozens of genes encoding proteins with histone reader motifs were changed, including SmEED from the PRC2 complex. We targeted SmEZH2 methyltransferase PRC2 component with a new EZH2 inhibitor (GSK343) and showed a synergistic effect with TSA, significantly increasing schistosomula mortality. Conclusions/Significance Genome-wide gene expression analyses have identified important pathways and cellular functions that were affected and may explain the schistosomicidal effect of TSA HDACi. The change in expression of dozens of histone reader genes involved in regulation of the epigenetic program in S. mansoni can be used as a starting point to look for possible novel schistosomicidal targets. PMID:28406899
Thomas, Rachael; Borst, Luke; Rotroff, Daniel; Motsinger-Reif, Alison; Lindblad-Toh, Kerstin; Modiano, Jaime F.; Breen, Matthew
2017-01-01
Canine hemangiosarcoma is a highly aggressive vascular neoplasm associated with extensive clinical and anatomical heterogeneity and a grave prognosis. Comprehensive molecular characterization of hemangiosarcoma may identify novel therapeutic targets and advanced clinical management strategies, but there are no published reports of tumor-associated genome instability and disrupted gene dosage in this cancer. We performed genome-wide microarray-based somatic DNA copy number profiling of 75 primary intra-abdominal hemangiosarcomas from five popular dog breeds that are highly predisposed to this disease. The cohort exhibited limited global genomic instability, compared to other canine sarcomas studied to date, and DNA copy number aberrations (CNAs) were predominantly of low amplitude. Recurrent imbalances of several key cancer-associated genes were evident; however the global penetrance of any single CNA was low and no distinct hallmark aberrations were evident. Copy number gains of dog chromosomes 13, 24 and 31, and loss of chromosome 16, were the most recurrent CNAs involving large chromosome regions, but their relative distribution within and between cases suggests they most likely represent passenger aberrations. CNAs involving CDKN2A, VEGFA and the SKI oncogene were identified as potential driver aberrations of hemangiosarcoma development, highlighting potential targets for therapeutic modulation. CNA profiles were broadly conserved between the five breeds, although subregional variation was evident, including a near two-fold lower incidence of VEGFA gain in Golden Retrievers versus other breeds (22% versus 40%). These observations support prior transcriptional studies suggesting that the clinical heterogeneity of this cancer may reflect the existence of multiple, molecularly-distinct subtypes of canine hemangiosarcoma. PMID:24599718
Thomas, Rachael; Borst, Luke; Rotroff, Daniel; Motsinger-Reif, Alison; Lindblad-Toh, Kerstin; Modiano, Jaime F; Breen, Matthew
2014-09-01
Canine hemangiosarcoma is a highly aggressive vascular neoplasm associated with extensive clinical and anatomical heterogeneity and a grave prognosis. Comprehensive molecular characterization of hemangiosarcoma may identify novel therapeutic targets and advanced clinical management strategies, but there are no published reports of tumor-associated genome instability and disrupted gene dosage in this cancer. We performed genome-wide microarray-based somatic DNA copy number profiling of 75 primary intra-abdominal hemangiosarcomas from five popular dog breeds that are highly predisposed to this disease. The cohort exhibited limited global genomic instability, compared to other canine sarcomas studied to date, and DNA copy number aberrations (CNAs) were predominantly of low amplitude. Recurrent imbalances of several key cancer-associated genes were evident; however, the global penetrance of any single CNA was low and no distinct hallmark aberrations were evident. Copy number gains of dog chromosomes 13, 24, and 31, and loss of chromosome 16, were the most recurrent CNAs involving large chromosome regions, but their relative distribution within and between cases suggests they most likely represent passenger aberrations. CNAs involving CDKN2A, VEGFA, and the SKI oncogene were identified as potential driver aberrations of hemangiosarcoma development, highlighting potential targets for therapeutic modulation. CNA profiles were broadly conserved between the five breeds, although subregional variation was evident, including a near twofold lower incidence of VEGFA gain in Golden Retrievers versus other breeds (22 versus 40 %). These observations support prior transcriptional studies suggesting that the clinical heterogeneity of this cancer may reflect the existence of multiple, molecularly distinct subtypes of canine hemangiosarcoma.
DiRE: identifying distant regulatory elements of co-expressed genes
Gotea, Valer; Ovcharenko, Ivan
2008-01-01
Regulation of gene expression in eukaryotic genomes is established through a complex cooperative activity of proximal promoters and distant regulatory elements (REs) such as enhancers, repressors and silencers. We have developed a web server named DiRE, based on the Enhancer Identification (EI) method, for predicting distant regulatory elements in higher eukaryotic genomes, namely for determining their chromosomal location and functional characteristics. The server uses gene co-expression data, comparative genomics and profiles of transcription factor binding sites (TFBSs) to determine TFBS-association signatures that can be used for discriminating specific regulatory functions. DiRE's unique feature is its ability to detect REs outside of proximal promoter regions, as it takes advantage of the full gene locus to conduct the search. DiRE can predict common REs for any set of input genes for which the user has prior knowledge of co-expression, co-function or other biologically meaningful grouping. The server predicts function-specific REs consisting of clusters of specifically-associated TFBSs and it also scores the association of individual transcription factors (TFs) with the biological function shared by the group of input genes. Its integration with the Array2BIO server allows users to start their analysis with raw microarray expression data. The DiRE web server is freely available at http://dire.dcode.org. PMID:18487623
Delineation of Two Clinically and Molecularly Distinct Subgroups of Posterior Fossa Ependymoma
Witt, Hendrik; Mack, Stephen C.; Ryzhova, Marina; Bender, Sebastian; Sill, Martin; Isserlin, Ruth; Benner, Axel; Hielscher, Thomas; Milde, Till; Remke, Marc; Jones, David T.W.; Northcott, Paul A.; Garzia, Livia; Bertrand, Kelsey C.; Wittmann, Andrea; Yao, Yuan; Roberts, Stephen S.; Massimi, Luca; Van Meter, Tim; Weiss, William A.; Gupta, Nalin; Grajkowska, Wiesia; Lach, Boleslaw; Cho, Yoon-Jae; von Deimling, Andreas; Kulozik, Andreas E.; Witt, Olaf; Bader, Gary D.; Hawkins, Cynthia E.; Tabori, Uri; Guha, Abhijit; Rutka, James T.; Lichter, Peter; Korshunov, Andrey
2014-01-01
Summary Despite the histological similarity of ependymomas from throughout the neuroaxis, the disease likely comprises multiple independent entities, each with a distinct molecular pathogenesis. Transcriptional profiling of two large independent cohorts of ependymoma reveals the existence of two demographically, transcriptionally, genetically, and clinically distinct groups of posterior fossa (PF) ependymomas. Group A patients are younger, have laterally located tumors with a balanced genome, and are much more likely to exhibit recurrence, metastasis at recurrence, and death compared with Group B patients. Identification and optimization of immunohistochemical (IHC) markers for PF ependymoma subgroups allowed validation of our findings on a third independent cohort, using a human ependymoma tissue microarray, and provides a tool for prospective prognostication and stratification of PF ependymoma patients. PMID:21840481
Regulation of Androgen Receptor-Mediated Transcription by RPB5 Binding Protein URI/RMP ▿
Mita, Paolo; Savas, Jeffrey N.; Djouder, Nabil; Yates, John R.; Ha, Susan; Ruoff, Rachel; Schafler, Eric D.; Nwachukwu, Jerome C.; Tanese, Naoko; Cowan, Nicholas J.; Zavadil, Jiri; Garabedian, Michael J.; Logan, Susan K.
2011-01-01
Androgen receptor (AR)-mediated transcription is modulated by interaction with coregulatory proteins. We demonstrate that the unconventional prefoldin RPB5 interactor (URI) is a new regulator of AR transcription and is critical for antagonist (bicalutamide) action. URI is phosphorylated upon androgen treatment, suggesting communication between the URI and AR signaling pathways. Whereas depletion of URI enhances AR-mediated gene transcription, overexpression of URI suppresses AR transcriptional activation and anchorage-independent prostate cancer cell growth. Repression of AR-mediated transcription is achieved, in part, by URI binding and regulation of androgen receptor trapped clone 27 (Art-27), a previously characterized AR corepressor. Consistent with this idea, genome-wide expression profiling in prostate cancer cells upon depletion of URI or Art-27 reveals substantially overlapping patterns of gene expression. Further, depletion of URI increases the expression of the AR target gene NKX-3.1, decreases the recruitment of Art-27, and increases AR occupancy at the NKX-3.1 promoter. While Art-27 can bind AR directly, URI is bound to chromatin prior to hormone-dependent recruitment of AR, suggesting a role for URI in modulating AR recruitment to target genes. PMID:21730289
De la Cruz, Miguel A; Ares, Miguel A; von Bargen, Kristine; Panunzi, Leonardo G; Martínez-Cruz, Jessica; Valdez-Salazar, Hilda A; Jiménez-Galicia, César; Torres, Javier
2017-01-01
Helicobacter pylori is a Gram-negative bacterium that colonizes the human gastric mucosa and causes peptic ulcers and gastric carcinoma. H. pylori strain 26695 has a small genome (1.67 Mb), which codes for few known transcriptional regulators that control bacterial metabolism and virulence. We analyzed by qRT-PCR the expression of 16 transcriptional regulators in H. pylori 26695, including the three sigma factors under different environmental conditions. When bacteria were exposed to acidic pH, urea, nickel, or iron, the sigma factors were differentially expressed with a particularly strong induction of fliA . The regulatory genes hrcA, hup , and crdR were highly induced in the presence of urea, nickel, and iron. In terms of biofilm formation fliA, flgR, hp1021, fur, nikR , and crdR were induced in sessile bacteria. Transcriptional expression levels of rpoD, flgR, hspR, hp1043 , and cheY were increased in contact with AGS epithelial cells. Kanamycin, chloramphenicol, and tetracycline increased or decreased expression of regulatory genes, showing that these antibiotics affect the transcription of H. pylori . Our data indicate that environmental cues which may be present in the human stomach modulate H. pylori transcription.
Expression Profile of Long Noncoding RNAs in Human Earlobe Keloids: A Microarray Analysis
Guo, Liang; Xu, Kai; Yan, Hongbo; Feng, Haifeng
2016-01-01
Background. Long noncoding RNAs (lncRNAs) play key roles in a wide range of biological processes and their deregulation results in human disease, including keloids. Earlobe keloid is a type of pathological skin scar, and the molecular pathogenesis of this disease remains largely unknown. Methods. In this study, microarray analysis was used to determine the expression profiles of lncRNAs and mRNAs between 3 pairs of earlobe keloid and normal specimens. Gene Ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were performed to identify the main functions of the differentially expressed genes and earlobe keloid-related pathways. Results. A total of 2068 lncRNAs and 1511 mRNAs were differentially expressed between earlobe keloid and normal tissues. Among them, 1290 lncRNAs and 1092 mRNAs were upregulated, and 778 lncRNAs and 419 mRNAs were downregulated. Pathway analysis revealed that 24 pathways were correlated to the upregulated transcripts, while 11 pathways were associated with the downregulated transcripts. Conclusion. We characterized the expression profiles of lncRNA and mRNA in earlobe keloids and suggest that lncRNAs may serve as diagnostic biomarkers for the therapy of earlobe keloid. PMID:28101509
TRACTOR_DB: a database of regulatory networks in gamma-proteobacterial genomes
González, Abel D.; Espinosa, Vladimir; Vasconcelos, Ana T.; Pérez-Rueda, Ernesto; Collado-Vides, Julio
2005-01-01
Experimental data on the Escherichia coli transcriptional regulatory system has been used in the past years to predict new regulatory elements (promoters, transcription factors (TFs), TFs' binding sites and operons) within its genome. As more genomes of gamma-proteobacteria are being sequenced, the prediction of these elements in a growing number of organisms has become more feasible, as a step towards the study of how different bacteria respond to environmental changes at the level of transcriptional regulation. In this work, we present TRACTOR_DB (TRAnscription FaCTORs' predicted binding sites in prokaryotic genomes), a relational database that contains computational predictions of new members of 74 regulons in 17 gamma-proteobacterial genomes. For these predictions we used a comparative genomics approach regarding which several proof-of-principle articles for large regulons have been published. TRACTOR_DB may be currently accessed at http://www.bioinfo.cu/Tractor_DB, http://www.tractor.lncc.br/ or at http://www.cifn.unam.mx/Computational_Genomics/tractorDB. Contact Email id is tractor@cifn.unam.mx. PMID:15608293
Hao, Ming; Li, Aili; Shi, Tongwei; Luo, Jiangtao; Zhang, Lianquan; Zhang, Xuechuan; Ning, Shunzong; Yuan, Zhongwei; Zeng, Deying; Kong, Xingchen; Li, Xiaolong; Zheng, Hongkun; Lan, Xiujin; Zhang, Huaigang; Zheng, Youliang; Mao, Long; Liu, Dengcai
2017-02-10
The formation of an allopolyploid is a two step process, comprising an initial wide hybridization event, which is later followed by a whole genome doubling. Both processes can affect the transcription of homoeologues. Here, RNA-Seq was used to obtain the genome-wide leaf transcriptome of two independent Triticum turgidum × Aegilops tauschii allotriploids (F1), along with their spontaneous allohexaploids (S1) and their parental lines. The resulting sequence data were then used to characterize variation in homoeologue transcript abundance. The hybridization event strongly down-regulated D-subgenome homoeologues, but this effect was in many cases reversed by whole genome doubling. The suppression of D-subgenome homoeologue transcription resulted in a marked frequency of parental transcription level dominance, especially with respect to genes encoding proteins involved in photosynthesis. Singletons (genes where no homoeologues were present) were frequently transcribed at both the allotriploid and allohexaploid plants. The implication is that whole genome doubling helps to overcome the phenotypic weakness of the allotriploid, restoring a more favourable gene dosage in genes experiencing transcription level dominance in hexaploid wheat.
Comparative Analysis of Transcription Factors Families across Fungal Tree of Life
DOE Office of Scientific and Technical Information (OSTI.GOV)
Salamov, Asaf; Grigoriev, Igor
2015-03-19
Transcription factors (TFs) are proteins that regulate the transcription of genes, by binding to specific DNA sequences. Based on literature (Shelest, 2008; Weirauch and Hughes,2011) collected and manually curated list of DBD Pfam domains (in total 62 DBD domains) We looked for distribution of TFs in 395 fungal genomes plus additionally in plant genomes (Phytozome), prokaryotes(IMG), some animals/metazoans and protists genomes
Tadra-Sfeir, Michelle Z.; Faoro, Helisson; Camilios-Neto, Doumit; Brusamarello-Santos, Liziane; Balsanelli, Eduardo; Weiss, Vinicius; Baura, Valter A.; Wassem, Roseli; Cruz, Leonardo M.; De Oliveira Pedrosa, Fábio; Souza, Emanuel M.; Monteiro, Rose A.
2015-01-01
Herbaspirillum seropedicae is a diazotrophic bacterium which associates endophytically with economically important gramineae. Flavonoids such as naringenin have been shown to have an effect on the interaction between H. seropedicae and its host plants. We used a high-throughput sequencing based method (RNA-Seq) to access the influence of naringenin on the whole transcriptome profile of H. seropedicae. Three hundred and four genes were downregulated and seventy seven were upregulated by naringenin. Data analysis revealed that genes related to bacterial flagella biosynthesis, chemotaxis and biosynthesis of peptidoglycan were repressed by naringenin. Moreover, genes involved in aromatic metabolism and multidrug transport efllux were actived. PMID:26052319
The 3D genome in transcriptional regulation and pluripotency.
Gorkin, David U; Leung, Danny; Ren, Bing
2014-06-05
It can be convenient to think of the genome as simply a string of nucleotides, the linear order of which encodes an organism's genetic blueprint. However, the genome does not exist as a linear entity within cells where this blueprint is actually utilized. Inside the nucleus, the genome is organized in three-dimensional (3D) space, and lineage-specific transcriptional programs that direct stem cell fate are implemented in this native 3D context. Here, we review principles of 3D genome organization in mammalian cells. We focus on the emerging relationship between genome organization and lineage-specific transcriptional regulation, which we argue are inextricably linked. Copyright © 2014 Elsevier Inc. All rights reserved.
2017-10-01
tasks associated with this milestone. RESULTS SUMMARY (WOLF): 5 FOR SPECIFIC AIM 1: The reversal effect of exercise training on muscle genomic... with exercise training ; gene expressions of Fgl2 in blood clotting cascade and Col1a1 in inflammatory response pathway decreased in response to...exercise training as well. In summary, miRNAs and transcript gene profiles were affected in burn and hindlimb unloading, those changes are associated with
Epigenetic Regulation of ZBTB18 Promotes Glioblastoma Progression. | Office of Cancer Genomics
Glioblastoma (GBM) comprises distinct subtypes characterized by their molecular profile. Mesenchymal identity in GBM has been associated with a comparatively unfavorable prognosis, primarily due to inherent resistance of these tumors to current therapies. The identification of molecular determinants of mesenchymal transformation could potentially allow for the discovery of new therapeutic targets. Zinc Finger and BTB Domain Containing 18 (ZBTB18/ZNF238/RP58) is a zinc finger transcriptional repressor with a crucial role in brain development and neuronal differentiation.
Functional regression method for whole genome eQTL epistasis analysis with sequencing data.
Xu, Kelin; Jin, Li; Xiong, Momiao
2017-05-18
Epistasis plays an essential rule in understanding the regulation mechanisms and is an essential component of the genetic architecture of the gene expressions. However, interaction analysis of gene expressions remains fundamentally unexplored due to great computational challenges and data availability. Due to variation in splicing, transcription start sites, polyadenylation sites, post-transcriptional RNA editing across the entire gene, and transcription rates of the cells, RNA-seq measurements generate large expression variability and collectively create the observed position level read count curves. A single number for measuring gene expression which is widely used for microarray measured gene expression analysis is highly unlikely to sufficiently account for large expression variation across the gene. Simultaneously analyzing epistatic architecture using the RNA-seq and whole genome sequencing (WGS) data poses enormous challenges. We develop a nonlinear functional regression model (FRGM) with functional responses where the position-level read counts within a gene are taken as a function of genomic position, and functional predictors where genotype profiles are viewed as a function of genomic position, for epistasis analysis with RNA-seq data. Instead of testing the interaction of all possible pair-wises SNPs, the FRGM takes a gene as a basic unit for epistasis analysis, which tests for the interaction of all possible pairs of genes and use all the information that can be accessed to collectively test interaction between all possible pairs of SNPs within two genome regions. By large-scale simulations, we demonstrate that the proposed FRGM for epistasis analysis can achieve the correct type 1 error and has higher power to detect the interactions between genes than the existing methods. The proposed methods are applied to the RNA-seq and WGS data from the 1000 Genome Project. The numbers of pairs of significantly interacting genes after Bonferroni correction identified using FRGM, RPKM and DESeq were 16,2361, 260 and 51, respectively, from the 350 European samples. The proposed FRGM for epistasis analysis of RNA-seq can capture isoform and position-level information and will have a broad application. Both simulations and real data analysis highlight the potential for the FRGM to be a good choice of the epistatic analysis with sequencing data.
Poly A- transcripts expressed in HeLa cells.
Wu, Qingfa; Kim, Yeong C; Lu, Jian; Xuan, Zhenyu; Chen, Jun; Zheng, Yonglan; Zhou, Tom; Zhang, Michael Q; Wu, Chung-I; Wang, San Ming
2008-07-30
Transcripts expressed in eukaryotes are classified as poly A+ transcripts or poly A- transcripts based on the presence or absence of the 3' poly A tail. Most transcripts identified so far are poly A+ transcripts, whereas the poly A- transcripts remain largely unknown. We developed the TRD (Total RNA Detection) system for transcript identification. The system detects the transcripts through the following steps: 1) depleting the abundant ribosomal and small-size transcripts; 2) synthesizing cDNA without regard to the status of the 3' poly A tail; 3) applying the 454 sequencing technology for massive 3' EST collection from the cDNA; and 4) determining the genome origins of the detected transcripts by mapping the sequences to the human genome reference sequences. Using this system, we characterized the cytoplasmic transcripts from HeLa cells. Of the 13,467 distinct 3' ESTs analyzed, 24% are poly A-, 36% are poly A+, and 40% are bimorphic with poly A+ features but without the 3' poly A tail. Most of the poly A- 3' ESTs do not match known transcript sequences; they have a similar distribution pattern in the genome as the poly A+ and bimorphic 3' ESTs, and their mapped intergenic regions are evolutionarily conserved. Experiments confirmed the authenticity of the detected poly A- transcripts. Our study provides the first large-scale sequence evidence for the presence of poly A- transcripts in eukaryotes. The abundance of the poly A- transcripts highlights the need for comprehensive identification of these transcripts for decoding the transcriptome, annotating the genome and studying biological relevance of the poly A- transcripts.
Genetics of climate change adaptation.
Franks, Steven J; Hoffmann, Ary A
2012-01-01
The rapid rate of current global climate change is having strong effects on many species and, at least in some cases, is driving evolution, particularly when changes in conditions alter patterns of selection. Climate change thus provides an opportunity for the study of the genetic basis of adaptation. Such studies include a variety of observational and experimental approaches, such as sampling across clines, artificial evolution experiments, and resurrection studies. These approaches can be combined with a number of techniques in genetics and genomics, including association and mapping analyses, genome scans, and transcription profiling. Recent research has revealed a number of candidate genes potentially involved in climate change adaptation and has also illustrated that genetic regulatory networks and epigenetic effects may be particularly relevant for evolution driven by climate change. Although genetic and genomic data are rapidly accumulating, we still have much to learn about the genetic architecture of climate change adaptation.
Morales, Mónica; Martínez-Paz, Pedro; Martín, Raquel; Planelló, Rosario; Urien, Josune; Martínez-Guitarte, José Luis; Morcillo, Gloria
2014-12-01
Pentachlorophenol (PCP) has been extensively used worldwide as a pesticide and biocide and is frequently detected in the aquatic environment. In the present work, the toxicity of PCP was investigated in Chironomus riparius aquatic larvae. The effects following short- and long-term exposures were evaluated at the molecular level by analyzing changes in the transcriptional profile of different endocrine genes, as well as in genes involved in the stress response and detoxification. Interestingly, although no differences were found after 12- and 24-h treatments, at 96-h exposures PCP was able to induce significant increases in transcripts from the ecdysone receptor gene (EcR), the early ecdysone-inducible E74 gene, the estrogen-related receptor gene (ERR), the Hsp70 gene and the CYP4G gene. In contrast, the Hsp27 gene appeared to be downregulated, while the ultraspiracle gene (usp) (insect ortholog of the retinoid X receptor) was not altered in any of the conditions assayed. Moreover, Glutathione-S-Transferase (GST) activity was not affected. The results obtained show the ability of PCP to modulate transcription of different biomarker genes from important cellular metabolic activities, which could be useful in genomic approaches to monitoring. In particular, the significant upregulation of hormonal genes represents the first evidence at the genomic level of the potential endocrine disruptive effects of PCP on aquatic invertebrates. Copyright © 2014 Elsevier B.V. All rights reserved.
Differential Sox10 Genomic Occupancy in Myelinating Glia
Lopez-Anido, Camila; Sun, Guannan; Koenning, Matthias; Srinivasan, Rajini; Hung, Holly A.; Emery, Ben; Keles, Sunduz; Svaren, John
2015-01-01
Myelin is formed by specialized myelinating glia: oligodendrocytes and Schwann cells in the central and peripheral nervous systems, respectively. While there are distinct developmental aspects and regulatory pathways in these two cell types, myelination in both systems requires the transcriptional activator Sox10. Sox10 interacts with cell type-specific transcription factors at some loci to induce myelin gene expression, but it is largely unknown how Sox10 transcriptional networks globally compare between oligodendrocytes and Schwann cells. We used in vivo ChIP-Seq analysis of spinal cord and peripheral nerve (sciatic nerve) to identify unique and shared Sox10 binding sites and assess their correlation with active enhancers and transcriptional profiles in oligodendrocytes and Schwann cells. Sox10 binding sites overlap with active enhancers and critical cell type-specific regulators of myelination, such as Olig2 and Myrf in oligodendrocytes, and Egr2/Krox20 in Schwann cells. Sox10 sites also associate with genes critical for myelination in both oligodendrocytes and Schwann cells, and are found within super-enhancers previously defined in brain. In Schwann cells, Sox10 sites contain binding motifs of putative partners in the Sp/Klf, Tead, and nuclear receptor protein families. Specifically, siRNA analysis of nuclear receptors Nr2f1 and Nr2f2 revealed downregulation of myelin genes Mbp and Ndrg1 in primary Schwann cells. Our analysis highlights different mechanisms that establish cell type-specific genomic occupancy of Sox10, which reflects the unique characteristics of oligodendrocyte and Schwann cell differentiation. PMID:25974668
Each cell counts: Hematopoiesis and immunity research in the era of single cell genomics.
Jaitin, Diego Adhemar; Keren-Shaul, Hadas; Elefant, Naama; Amit, Ido
2015-02-01
Hematopoiesis and immunity are mediated through complex interactions between multiple cell types and states. This complexity is currently addressed following a reductionist approach of characterizing cell types by a small number of cell surface molecular features and gross functions. While the introduction of global transcriptional profiling technologies enabled a more comprehensive view, heterogeneity within sampled populations remained unaddressed, obscuring the true picture of hematopoiesis and immune system function. A critical mass of technological advances in molecular biology and genomics has enabled genome-wide measurements of single cells - the fundamental unit of immunity. These new advances are expected to boost detection of less frequent cell types and fuzzy intermediate cell states, greatly expanding the resolution of current available classifications. This new era of single-cell genomics in immunology research holds great promise for further understanding of the mechanisms and circuits regulating hematopoiesis and immunity in both health and disease. In the near future, the accuracy of single-cell genomics will ultimately enable precise diagnostics and treatment of multiple hematopoietic and immune related diseases. Copyright © 2015 Elsevier Ltd. All rights reserved.
Snipen, Lars; Nes, Ingolf F.; Brede, Dag A.
2010-01-01
Urinary tract infection (UTI) is the most common infection caused by enterococci, and Enterococcus faecalis accounts for the majority of enterococcal infections. Although a number of virulence related traits have been established, no comprehensive genomic or transcriptomic studies have been conducted to investigate how to distinguish pathogenic from non-pathogenic E. faecalis in their ability to cause UTI. In order to identify potential genetic traits or gene regulatory features that distinguish pathogenic from non-pathogenic E. faecalis with respect to UTI, we have performed comparative genomic analysis, and investigated growth capacity and transcriptome profiling in human urine in vitro. Six strains of different origins were cultivated and all grew readily in human urine. The three strains chosen for transcriptional analysis showed an overall similar response with respect to energy and nitrogen metabolism, stress mechanism, cell envelope modifications, and trace metal acquisition. Our results suggest that citrate and aspartate are significant for growth of E. faecalis in human urine, and manganese appear to be a limiting factor. The majority of virulence factors were either not differentially regulated or down-regulated. Notably, a significant up-regulation of genes involved in biofilm formation was observed. Strains from different origins have similar capacity to grow in human urine. The overall similar transcriptional responses between the two pathogenic and the probiotic strain suggest that the pathogenic potential of a certain E. faecalis strain may to a great extent be determined by presence of fitness and virulence factors, rather than the level of expression of such traits. PMID:20824220
Wei, Yingying; Wu, George; Ji, Hongkai
2013-05-01
Mapping genome-wide binding sites of all transcription factors (TFs) in all biological contexts is a critical step toward understanding gene regulation. The state-of-the-art technologies for mapping transcription factor binding sites (TFBSs) couple chromatin immunoprecipitation (ChIP) with high-throughput sequencing (ChIP-seq) or tiling array hybridization (ChIP-chip). These technologies have limitations: they are low-throughput with respect to surveying many TFs. Recent advances in genome-wide chromatin profiling, including development of technologies such as DNase-seq, FAIRE-seq and ChIP-seq for histone modifications, make it possible to predict in vivo TFBSs by analyzing chromatin features at computationally determined DNA motif sites. This promising new approach may allow researchers to monitor the genome-wide binding sites of many TFs simultaneously. In this article, we discuss various experimental design and data analysis issues that arise when applying this approach. Through a systematic analysis of the data from the Encyclopedia Of DNA Elements (ENCODE) project, we compare the predictive power of individual and combinations of chromatin marks using supervised and unsupervised learning methods, and evaluate the value of integrating information from public ChIP and gene expression data. We also highlight the challenges and opportunities for developing novel analytical methods, such as resolving the one-motif-multiple-TF ambiguity and distinguishing functional and non-functional TF binding targets from the predicted binding sites. The online version of this article (doi:10.1007/s12561-012-9066-5) contains supplementary material, which is available to authorized users.
Muthamilarasan, Mehanathan; Bonthala, Venkata Suresh; Mishra, Awdhesh Kumar; Khandelwal, Rohit; Khan, Yusuf; Roy, Riti; Prasad, Manoj
2014-09-01
C2H2 type of zinc finger transcription factors (TFs) play crucial roles in plant stress response and hormone signal transduction. Hence considering its importance, genome-wide investigation and characterization of C2H2 zinc finger proteins were performed in Arabidopsis, rice and poplar but no such study was conducted in foxtail millet which is a C4 Panicoid model crop well known for its abiotic stress tolerance. The present study identified 124 C2H2-type zinc finger TFs in foxtail millet (SiC2H2) and physically mapped them onto the genome. The gene duplication analysis revealed that SiC2H2s primarily expanded in the genome through tandem duplication. The phylogenetic tree classified these TFs into five groups (I-V). Further, miRNAs targeting SiC2H2 transcripts in foxtail millet were identified. Heat map demonstrated differential and tissue-specific expression patterns of these SiC2H2 genes. Comparative physical mapping between foxtail millet SiC2H2 genes and its orthologs of sorghum, maize and rice revealed the evolutionary relationships of C2H2 type of zinc finger TFs. The duplication and divergence data provided novel insight into the evolutionary aspects of these TFs in foxtail millet and related grass species. Expression profiling of candidate SiC2H2 genes in response to salinity, dehydration and cold stress showed differential expression pattern of these genes at different time points of stresses.
Transcriptional profiling reveals elevated Sox2 in DNA polymerase ß null mouse embryonic fibroblasts
Li, Jianfeng; Luthra, Soumya; Wang, Xiao-Hong; Chandran, Uma R; Sobol, Robert W
2012-01-01
There are over 150 human proteins that have been categorized as bona fide DNA repair proteins. These DNA repair proteins maintain the integrity of the genome, reducing the onset of cancer, disease and aging phenotypes. Variations in expression and/or function would therefore impact genome integrity as well as the cellular response to genotoxins. Global gene expression analysis is an effective approach to uncover defects in DNA repair gene expression and to discover cellular and/or organismal effects brought about by external stimuli such as environmental genotoxicants, chemotherapeutic regimens, viral infections as well as developmental and age-related stimuli. Given the significance of genome stability in cell survival and response to stimuli, we have hypothesized that cells may undergo transcriptional re-programming to accommodate defects in basal DNA repair capacity to promote survival. As a test of this hypothesis, we have compared the transcriptome in three DNA polymerase ß knockout (Polß-KO) mouse embryonic fibroblasts (MEFs) and the corresponding wild-type (WT) littermate control cell lines. Each Polß-KO cell line was found to have a range of genes up-regulated, when compared to its WT littermate control cell line. Interestingly, six (6) genes were commonly up regulated in all three Polß-KO cell lines, including Sox2, one of several genes associated with the induction of pluripotent stem cells. Herein, we present these findings and suggest that loss of DNA repair and the induction of cellular transcriptional re-programming may, in part, contribute to tumor formation and the cellular response to external stimuli. PMID:23226616
RegPrecise 3.0--a resource for genome-scale exploration of transcriptional regulation in bacteria.
Novichkov, Pavel S; Kazakov, Alexey E; Ravcheev, Dmitry A; Leyn, Semen A; Kovaleva, Galina Y; Sutormin, Roman A; Kazanov, Marat D; Riehl, William; Arkin, Adam P; Dubchak, Inna; Rodionov, Dmitry A
2013-11-01
Genome-scale prediction of gene regulation and reconstruction of transcriptional regulatory networks in prokaryotes is one of the critical tasks of modern genomics. Bacteria from different taxonomic groups, whose lifestyles and natural environments are substantially different, possess highly diverged transcriptional regulatory networks. The comparative genomics approaches are useful for in silico reconstruction of bacterial regulons and networks operated by both transcription factors (TFs) and RNA regulatory elements (riboswitches). RegPrecise (http://regprecise.lbl.gov) is a web resource for collection, visualization and analysis of transcriptional regulons reconstructed by comparative genomics. We significantly expanded a reference collection of manually curated regulons we introduced earlier. RegPrecise 3.0 provides access to inferred regulatory interactions organized by phylogenetic, structural and functional properties. Taxonomy-specific collections include 781 TF regulogs inferred in more than 160 genomes representing 14 taxonomic groups of Bacteria. TF-specific collections include regulogs for a selected subset of 40 TFs reconstructed across more than 30 taxonomic lineages. Novel collections of regulons operated by RNA regulatory elements (riboswitches) include near 400 regulogs inferred in 24 bacterial lineages. RegPrecise 3.0 provides four classifications of the reference regulons implemented as controlled vocabularies: 55 TF protein families; 43 RNA motif families; ~150 biological processes or metabolic pathways; and ~200 effectors or environmental signals. Genome-wide visualization of regulatory networks and metabolic pathways covered by the reference regulons are available for all studied genomes. A separate section of RegPrecise 3.0 contains draft regulatory networks in 640 genomes obtained by an conservative propagation of the reference regulons to closely related genomes. RegPrecise 3.0 gives access to the transcriptional regulons reconstructed in bacterial genomes. Analytical capabilities include exploration of: regulon content, structure and function; TF binding site motifs; conservation and variations in genome-wide regulatory networks across all taxonomic groups of Bacteria. RegPrecise 3.0 was selected as a core resource on transcriptional regulation of the Department of Energy Systems Biology Knowledgebase, an emerging software and data environment designed to enable researchers to collaboratively generate, test and share new hypotheses about gene and protein functions, perform large-scale analyses, and model interactions in microbes, plants, and their communities.
Applications of nanotechnology, next generation sequencing and microarrays in biomedical research.
Elingaramil, Sauli; Li, Xiaolong; He, Nongyue
2013-07-01
Next-generation sequencing technologies, microarrays and advances in bio nanotechnology have had an enormous impact on research within a short time frame. This impact appears certain to increase further as many biomedical institutions are now acquiring these prevailing new technologies. Beyond conventional sampling of genome content, wide-ranging applications are rapidly evolving for next-generation sequencing, microarrays and nanotechnology. To date, these technologies have been applied in a variety of contexts, including whole-genome sequencing, targeted re sequencing and discovery of transcription factor binding sites, noncoding RNA expression profiling and molecular diagnostics. This paper thus discusses current applications of nanotechnology, next-generation sequencing technologies and microarrays in biomedical research and highlights the transforming potential these technologies offer.