Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui; ...
2014-10-02
Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui
Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less
Long non-coding RNAs and mRNAs profiling during spleen development in pig.
Che, Tiandong; Li, Diyan; Jin, Long; Fu, Yuhua; Liu, Yingkai; Liu, Pengliang; Wang, Yixin; Tang, Qianzi; Ma, Jideng; Wang, Xun; Jiang, Anan; Li, Xuewei; Li, Mingzhou
2018-01-01
Genome-wide transcriptomic studies in humans and mice have become extensive and mature. However, a comprehensive and systematic understanding of protein-coding genes and long non-coding RNAs (lncRNAs) expressed during pig spleen development has not been achieved. LncRNAs are known to participate in regulatory networks for an array of biological processes. Here, we constructed 18 RNA libraries from developing fetal pig spleen (55 days before birth), postnatal pig spleens (0, 30, 180 days and 2 years after birth), and the samples from the 2-year-old Wild Boar. A total of 15,040 lncRNA transcripts were identified among these samples. We found that the temporal expression pattern of lncRNAs was more restricted than observed for protein-coding genes. Time-series analysis showed two large modules for protein-coding genes and lncRNAs. The up-regulated module was enriched for genes related to immune and inflammatory function, while the down-regulated module was enriched for cell proliferation processes such as cell division and DNA replication. Co-expression networks indicated the functional relatedness between protein-coding genes and lncRNAs, which were enriched for similar functions over the series of time points examined. We identified numerous differentially expressed protein-coding genes and lncRNAs in all five developmental stages. Notably, ceruloplasmin precursor (CP), a protein-coding gene participating in antioxidant and iron transport processes, was differentially expressed in all stages. This study provides the first catalog of the developing pig spleen, and contributes to a fuller understanding of the molecular mechanisms underpinning mammalian spleen development.
Genes uniquely expressed in human growth plate chondrocytes uncover a distinct regulatory network.
Li, Bing; Balasubramanian, Karthika; Krakow, Deborah; Cohn, Daniel H
2017-12-20
Chondrogenesis is the earliest stage of skeletal development and is a highly dynamic process, integrating the activities and functions of transcription factors, cell signaling molecules and extracellular matrix proteins. The molecular mechanisms underlying chondrogenesis have been extensively studied and multiple key regulators of this process have been identified. However, a genome-wide overview of the gene regulatory network in chondrogenesis has not been achieved. In this study, employing RNA sequencing, we identified 332 protein coding genes and 34 long non-coding RNA (lncRNA) genes that are highly selectively expressed in human fetal growth plate chondrocytes. Among the protein coding genes, 32 genes were associated with 62 distinct human skeletal disorders and 153 genes were associated with skeletal defects in knockout mice, confirming their essential roles in skeletal formation. These gene products formed a comprehensive physical interaction network and participated in multiple cellular processes regulating skeletal development. The data also revealed 34 transcription factors and 11,334 distal enhancers that were uniquely active in chondrocytes, functioning as transcriptional regulators for the cartilage-selective genes. Our findings revealed a complex gene regulatory network controlling skeletal development whereby transcription factors, enhancers and lncRNAs participate in chondrogenesis by transcriptional regulation of key genes. Additionally, the cartilage-selective genes represent candidate genes for unsolved human skeletal disorders.
Methylation of miRNA genes and oncogenesis.
Loginov, V I; Rykov, S V; Fridman, M V; Braga, E A
2015-02-01
Interaction between microRNA (miRNA) and messenger RNA of target genes at the posttranscriptional level provides fine-tuned dynamic regulation of cell signaling pathways. Each miRNA can be involved in regulating hundreds of protein-coding genes, and, conversely, a number of different miRNAs usually target a structural gene. Epigenetic gene inactivation associated with methylation of promoter CpG-islands is common to both protein-coding genes and miRNA genes. Here, data on functions of miRNAs in development of tumor-cell phenotype are reviewed. Genomic organization of promoter CpG-islands of the miRNA genes located in inter- and intragenic areas is discussed. The literature and our own results on frequency of CpG-island methylation in miRNA genes from tumors are summarized, and data regarding a link between such modification and changed activity of miRNA genes and, consequently, protein-coding target genes are presented. Moreover, the impact of miRNA gene methylation on key oncogenetic processes as well as affected signaling pathways is discussed.
DOE R&D Accomplishments Database
Liang, X.
1998-06-10
The genome of Methanococcus jannaschii has been sequenced completely and has been found to contain approximately 1,770 predicted protein-coding regions. When these coding regions are expressed and how their expression is regulated, however, remain open questions. In this work, mass spectrometry was combined with two-dimensional gel electrophoresis to identify which proteins the genes produce under different growth conditions, and thus investigate the regulation of genes responsible for functions characteristic of this thermophilic representative of the methanogenic Archaea.
Cheng, Chao; Ung, Matthew; Grant, Gavin D.; Whitfield, Michael L.
2013-01-01
Cell cycle is a complex and highly supervised process that must proceed with regulatory precision to achieve successful cellular division. Despite the wide application, microarray time course experiments have several limitations in identifying cell cycle genes. We thus propose a computational model to predict human cell cycle genes based on transcription factor (TF) binding and regulatory motif information in their promoters. We utilize ENCODE ChIP-seq data and motif information as predictors to discriminate cell cycle against non-cell cycle genes. Our results show that both the trans- TF features and the cis- motif features are predictive of cell cycle genes, and a combination of the two types of features can further improve prediction accuracy. We apply our model to a complete list of GENCODE promoters to predict novel cell cycle driving promoters for both protein-coding genes and non-coding RNAs such as lincRNAs. We find that a similar percentage of lincRNAs are cell cycle regulated as protein-coding genes, suggesting the importance of non-coding RNAs in cell cycle division. The model we propose here provides not only a practical tool for identifying novel cell cycle genes with high accuracy, but also new insights on cell cycle regulation by TFs and cis-regulatory elements. PMID:23874175
Zhang, Qingbin; Chen, Li; Cui, Shiman; Li, Yan; Zhao, Qi; Cao, Wei; Lai, Shixiang; Yin, Sanjun; Zuo, Zhixiang; Ren, Jian
2017-10-25
Although long noncoding RNAs (lncRNAs) have been emerging as critical regulators in various tissues and biological processes, little is known about their expression and regulation during the osteogenic differentiation of periodontal ligament stem cells (PDLSCs) in inflammatory microenvironment. In this study, we have identified 63 lncRNAs that are not annotated in previous database. These novel lncRNAs were not randomly located in the genome but preferentially located near protein-coding genes related to particular functions and diseases, such as stem cell maintenance and differentiation, development disorders and inflammatory diseases. Moreover, we have identified 650 differentially expressed lncRNAs among different subsets of PDLSCs. Pathway enrichment analysis for neighboring protein-coding genes of these differentially expressed lncRNAs revealed stem cell differentiation related functions. Many of these differentially expressed lncRNAs function as competing endogenous RNAs that regulate protein-coding transcripts through competing shared miRNAs.
Origin and evolution of the long non-coding genes in the X-inactivation center.
Romito, Antonio; Rougeulle, Claire
2011-11-01
Random X chromosome inactivation (XCI), the eutherian mechanism of X-linked gene dosage compensation, is controlled by a cis-acting locus termed the X-inactivation center (Xic). One of the striking features that characterize the Xic landscape is the abundance of loci transcribing non-coding RNAs (ncRNAs), including Xist, the master regulator of the inactivation process. Recent comparative genomic analyses have depicted the evolutionary scenario behind the origin of the X-inactivation center, revealing that this locus evolved from a region harboring protein-coding genes. During mammalian radiation, this ancestral protein-coding region was disrupted in the marsupial group, whilst it provided in eutherian lineage the starting material for the non-translated RNAs of the X-inactivation center. The emergence of non-coding genes occurred by a dual mechanism involving loss of protein-coding function of the pre-existing genes and integration of different classes of mobile elements, some of which modeled the structure and sequence of the non-coding genes in a species-specific manner. The rising genes started to produce transcripts that acquired function in regulating the epigenetic status of the X chromosome, as shown for Xist, its antisense Tsix, Jpx, and recently suggested for Ftx. Thus, the appearance of the Xic, which occurred after the divergence between eutherians and marsupials, was the basis for the evolution of random X inactivation as a strategy to achieve dosage compensation. Copyright © 2011. Published by Elsevier Masson SAS.
Dynamic gene expression response to altered gravity in human T cells.
Thiel, Cora S; Hauschild, Swantje; Huge, Andreas; Tauber, Svantje; Lauber, Beatrice A; Polzer, Jennifer; Paulsen, Katrin; Lier, Hartwin; Engelmann, Frank; Schmitz, Burkhard; Schütte, Andreas; Layer, Liliana E; Ullrich, Oliver
2017-07-12
We investigated the dynamics of immediate and initial gene expression response to different gravitational environments in human Jurkat T lymphocytic cells and compared expression profiles to identify potential gravity-regulated genes and adaptation processes. We used the Affymetrix GeneChip® Human Transcriptome Array 2.0 containing 44,699 protein coding genes and 22,829 non-protein coding genes and performed the experiments during a parabolic flight and a suborbital ballistic rocket mission to cross-validate gravity-regulated gene expression through independent research platforms and different sets of control experiments to exclude other factors than alteration of gravity. We found that gene expression in human T cells rapidly responded to altered gravity in the time frame of 20 s and 5 min. The initial response to microgravity involved mostly regulatory RNAs. We identified three gravity-regulated genes which could be cross-validated in both completely independent experiment missions: ATP6V1A/D, a vacuolar H + -ATPase (V-ATPase) responsible for acidification during bone resorption, IGHD3-3/IGHD3-10, diversity genes of the immunoglobulin heavy-chain locus participating in V(D)J recombination, and LINC00837, a long intergenic non-protein coding RNA. Due to the extensive and rapid alteration of gene expression associated with regulatory RNAs, we conclude that human cells are equipped with a robust and efficient adaptation potential when challenged with altered gravitational environments.
[Regulation of heat shock gene expression in response to stress].
Garbuz, D G
2017-01-01
Heat shock (HS) genes, or stress genes, code for a number of proteins that collectively form the most ancient and universal stress defense system. The system determines the cell capability of adaptation to various adverse factors and performs a variety of auxiliary functions in normal physiological conditions. Common stress factors, such as higher temperatures, hypoxia, heavy metals, and others, suppress transcription and translation for the majority of genes, while HS genes are upregulated. Transcription of HS genes is controlled by transcription factors of the HS factor (HSF) family. Certain HSFs are activated on exposure to higher temperatures or other adverse factors to ensure stress-induced HS gene expression, while other HSFs are specifically activated at particular developmental stages. The regulation of the main mammalian stress-inducible factor HSF1 and Drosophila melanogaster HSF includes many components, such as a variety of early warning signals indicative of abnormal cell activity (e.g., increases in intracellular ceramide, cytosolic calcium ions, or partly denatured proteins); protein kinases, which phosphorylate HSFs at various Ser residues; acetyltransferases; and regulatory proteins, such as SUMO and HSBP1. Transcription factors other than HSFs are also involved in activating HS gene transcription; the set includes D. melanogaster GAF, mammalian Sp1 and NF-Y, and other factors. Transcription of several stress genes coding for molecular chaperones of the glucose-regulated protein (GRP) family is predominantly regulated by another stress-detecting system, which is known as the unfolded protein response (UPR) system and is activated in response to massive protein misfolding in the endoplasmic reticulum and mitochondrial matrix. A translational fine tuning of HS protein expression occurs via changing the phosphorylation status of several proteins involved in translation initiation. In addition, specific signal sequences in the 5'-UTRs of some HS protein mRNAs ensure their preferential translation in stress.
Liu, Yangyang; Han, Xiao; Yuan, Junting; Geng, Tuoyu; Chen, Shihao; Hu, Xuming; Cui, Isabelle H; Cui, Hengmi
2017-04-07
The type II bacterial CRISPR/Cas9 system is a simple, convenient, and powerful tool for targeted gene editing. Here, we describe a CRISPR/Cas9-based approach for inserting a poly(A) transcriptional terminator into both alleles of a targeted gene to silence protein-coding and non-protein-coding genes, which often play key roles in gene regulation but are difficult to silence via insertion or deletion of short DNA fragments. The integration of 225 bp of bovine growth hormone poly(A) signals into either the first intron or the first exon or behind the promoter of target genes caused efficient termination of expression of PPP1R12C , NSUN2 (protein-coding genes), and MALAT1 (non-protein-coding gene). Both NeoR and PuroR were used as markers in the selection of clonal cell lines with biallelic integration of a poly(A) signal. Genotyping analysis indicated that the cell lines displayed the desired biallelic silencing after a brief selection period. These combined results indicate that this CRISPR/Cas9-based approach offers an easy, convenient, and efficient novel technique for gene silencing in cell lines, especially for those in which gene integration is difficult because of a low efficiency of homology-directed repair. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Basu, Swaraj; Larsson, Erik
2018-05-31
Antisense transcripts and other long non-coding RNAs are pervasive in mammalian cells, and some of these molecules have been proposed to regulate proximal protein-coding genes in cis For example, non-coding transcription can contribute to inactivation of tumor suppressor genes in cancer, and antisense transcripts have been implicated in the epigenetic inactivation of imprinted genes. However, our knowledge is still limited and more such regulatory interactions likely await discovery. Here, we make use of available gene expression data from a large compendium of human tumors to generate hypotheses regarding non-coding-to-coding cis -regulatory relationships with emphasis on negative associations, as these are less likely to arise for reasons other than cis -regulation. We document a large number of possible regulatory interactions, including 193 coding/non-coding pairs that show expression patterns compatible with negative cis -regulation. Importantly, by this approach we capture several known cases, and many of the involved coding genes have known roles in cancer. Our study provides a large catalog of putative non-coding/coding cis -regulatory pairs that may serve as a basis for further experimental validation and characterization. Copyright © 2018 Basu and Larsson.
Zhou, Ke-Ren; Liu, Shun; Sun, Wen-Ju; Zheng, Ling-Ling; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu
2017-01-04
The abnormal transcriptional regulation of non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) is contributed to various biological processes and linked with human diseases, but the underlying mechanisms remain elusive. In this study, we developed ChIPBase v2.0 (http://rna.sysu.edu.cn/chipbase/) to explore the transcriptional regulatory networks of ncRNAs and PCGs. ChIPBase v2.0 has been expanded with ∼10 200 curated ChIP-seq datasets, which represent about 20 times expansion when comparing to the previous released version. We identified thousands of binding motif matrices and their binding sites from ChIP-seq data of DNA-binding proteins and predicted millions of transcriptional regulatory relationships between transcription factors (TFs) and genes. We constructed 'Regulator' module to predict hundreds of TFs and histone modifications that were involved in or affected transcription of ncRNAs and PCGs. Moreover, we built a web-based tool, Co-Expression, to explore the co-expression patterns between DNA-binding proteins and various types of genes by integrating the gene expression profiles of ∼10 000 tumor samples and ∼9100 normal tissues and cell lines. ChIPBase also provides a ChIP-Function tool and a genome browser to predict functions of diverse genes and visualize various ChIP-seq data. This study will greatly expand our understanding of the transcriptional regulations of ncRNAs and PCGs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Cipriano, Andrea; Ballarino, Monica
2018-01-01
The completion of the human genome sequence together with advances in sequencing technologies have shifted the paradigm of the genome, as composed of discrete and hereditable coding entities, and have shown the abundance of functional noncoding DNA. This part of the genome, previously dismissed as “junk” DNA, increases proportionally with organismal complexity and contributes to gene regulation beyond the boundaries of known protein-coding genes. Different classes of functionally relevant nonprotein-coding RNAs are transcribed from noncoding DNA sequences. Among them are the long noncoding RNAs (lncRNAs), which are thought to participate in the basal regulation of protein-coding genes at both transcriptional and post-transcriptional levels. Although knowledge of this field is still limited, the ability of lncRNAs to localize in different cellular compartments, to fold into specific secondary structures and to interact with different molecules (RNA or proteins) endows them with multiple regulatory mechanisms. It is becoming evident that lncRNAs may play a crucial role in most biological processes such as the control of development, differentiation and cell growth. This review places the evolution of the concept of the gene in its historical context, from Darwin's hypothetical mechanism of heredity to the post-genomic era. We discuss how the original idea of protein-coding genes as unique determinants of phenotypic traits has been reconsidered in light of the existence of noncoding RNAs. We summarize the technological developments which have been made in the genome-wide identification and study of lncRNAs and emphasize the methodologies that have aided our understanding of the complexity of lncRNA-protein interactions in recent years. PMID:29560353
Nutt, S L; Morrison, A M; Dörfler, P; Rolink, A; Busslinger, M
1998-01-01
The Pax-5 gene codes for the transcription factor BSAP which is essential for the progression of adult B lymphopoiesis beyond an early progenitor (pre-BI) cell stage. Although several genes have been proposed to be regulated by BSAP, CD19 is to date the only target gene which has been genetically confirmed to depend on this transcription factor for its expression. We have now taken advantage of cultured pre-BI cells of wild-type and Pax-5 mutant bone marrow to screen a large panel of B lymphoid genes for additional BSAP target genes. Four differentially expressed genes were shown to be under the direct control of BSAP, as their expression was rapidly regulated in Pax-5-deficient pre-BI cells by a hormone-inducible BSAP-estrogen receptor fusion protein. The genes coding for the B-cell receptor component Ig-alpha (mb-1) and the transcription factors N-myc and LEF-1 are positively regulated by BSAP, while the gene coding for the cell surface protein PD-1 is efficiently repressed. Distinct regulatory mechanisms of BSAP were revealed by reconstituting Pax-5-deficient pre-BI cells with full-length BSAP or a truncated form containing only the paired domain. IL-7 signalling was able to efficiently induce the N-myc gene only in the presence of full-length BSAP, while complete restoration of CD19 synthesis was critically dependent on the BSAP protein concentration. In contrast, the expression of the mb-1 and LEF-1 genes was already reconstituted by the paired domain polypeptide lacking any transactivation function, suggesting that the DNA-binding domain of BSAP is sufficient to recruit other transcription factors to the regulatory regions of these two genes. In conclusion, these loss- and gain-of-function experiments demonstrate that BSAP regulates four newly identified target genes as a transcriptional activator, repressor or docking protein depending on the specific regulatory sequence context. PMID:9545244
2014-01-01
Background Nrd1 and Nab3 are essential sequence-specific yeast RNA binding proteins that function as a heterodimer in the processing and degradation of diverse classes of RNAs. These proteins also regulate several mRNA coding genes; however, it remains unclear exactly what percentage of the mRNA component of the transcriptome these proteins control. To address this question, we used the pyCRAC software package developed in our laboratory to analyze CRAC and PAR-CLIP data for Nrd1-Nab3-RNA interactions. Results We generated high-resolution maps of Nrd1-Nab3-RNA interactions, from which we have uncovered hundreds of new Nrd1-Nab3 mRNA targets, representing between 20 and 30% of protein-coding transcripts. Although Nrd1 and Nab3 showed a preference for binding near 5′ ends of relatively short transcripts, they bound transcripts throughout coding sequences and 3′ UTRs. Moreover, our data for Nrd1-Nab3 binding to 3′ UTRs was consistent with a role for these proteins in the termination of transcription. Our data also support a tight integration of Nrd1-Nab3 with the nutrient response pathway. Finally, we provide experimental evidence for some of our predictions, using northern blot and RT-PCR assays. Conclusions Collectively, our data support the notion that Nrd1 and Nab3 function is tightly integrated with the nutrient response and indicate a role for these proteins in the regulation of many mRNA coding genes. Further, we provide evidence to support the hypothesis that Nrd1-Nab3 represents a failsafe termination mechanism in instances of readthrough transcription. PMID:24393166
Decoding sORF translation - from small proteins to gene regulation.
Cabrera-Quio, Luis Enrique; Herberg, Sarah; Pauli, Andrea
2016-11-01
Translation is best known as the fundamental mechanism by which the ribosome converts a sequence of nucleotides into a string of amino acids. Extensive research over many years has elucidated the key principles of translation, and the majority of translated regions were thought to be known. The recent discovery of wide-spread translation outside of annotated protein-coding open reading frames (ORFs) came therefore as a surprise, raising the intriguing possibility that these newly discovered translated regions might have unrecognized protein-coding or gene-regulatory functions. Here, we highlight recent findings that provide evidence that some of these newly discovered translated short ORFs (sORFs) encode functional, previously missed small proteins, while others have regulatory roles. Based on known examples we will also speculate about putative additional roles and the potentially much wider impact that these translated regions might have on cellular homeostasis and gene regulation.
Darbani, Behrooz; Noeparvar, Shahin; Borg, Søren
2016-01-01
RNA circularization made by head-to-tail back-splicing events is involved in the regulation of gene expression from transcriptional to post-translational levels. By exploiting RNA-Seq data and down-stream analysis, we shed light on the importance of circular RNAs in plants. The results introduce circular RNAs as novel interactors in the regulation of gene expression in plants and imply the comprehensiveness of this regulatory pathway by identifying circular RNAs for a diverse set of genes. These genes are involved in several aspects of cellular metabolism as hormonal signaling, intracellular protein sorting, carbohydrate metabolism and cell-wall biogenesis, respiration, amino acid biosynthesis, transcription and translation, and protein ubiquitination. Additionally, these parental loci of circular RNAs, from both nuclear and mitochondrial genomes, encode for different transcript classes including protein coding transcripts, microRNA, rRNA, and long non-coding/microprotein coding RNAs. The results shed light on the mitochondrial exonic circular RNAs and imply the importance of circular RNAs for regulation of mitochondrial genes. Importantly, we introduce circular RNAs in barley and elucidate their cellular-level alterations across tissues and in response to micronutrients iron and zinc. In further support of circular RNAs' functional roles in plants, we report several cases where fluctuations of circRNAs do not correlate with the levels of their parental-loci encoded linear transcripts. PMID:27375638
Behind the curtain of non-coding RNAs; long non-coding RNAs regulating hepatocarcinogenesis
El Khodiry, Aya; Afify, Menna; El Tayebi, Hend M
2018-01-01
Hepatocellular carcinoma (HCC) is one of the most common and aggressive cancers worldwide. HCC is the fifth common malignancy in the world and the second leading cause of cancer death in Asia. Long non-coding RNAs (lncRNAs) are RNAs with a length greater than 200 nucleotides that do not encode proteins. lncRNAs can regulate gene expression and protein synthesis in several ways by interacting with DNA, RNA and proteins in a sequence specific manner. They could regulate cellular and developmental processes through either gene inhibition or gene activation. Many studies have shown that dysregulation of lncRNAs is related to many human diseases such as cardiovascular diseases, genetic disorders, neurological diseases, immune mediated disorders and cancers. However, the study of lncRNAs is challenging as they are poorly conserved between species, their expression levels aren’t as high as that of mRNAs and have great interpatient variations. The study of lncRNAs expression in cancers have been a breakthrough as it unveils potential biomarkers and drug targets for cancer therapy and helps understand the mechanism of pathogenesis. This review discusses many long non-coding RNAs and their contribution in HCC, their role in development, metastasis, and prognosis of HCC and how to regulate and target these lncRNAs as a therapeutic tool in HCC treatment in the future. PMID:29434445
Boyd, David A.; Thevenot, Tracy; Gumbmann, Markus; Honeyman, Allen L.; Hamilton, Ian R.
2000-01-01
Transposon mutagenesis and marker rescue were used to isolate and identify an 8.5-kb contiguous region containing six open reading frames constituting the operon for the sorbitol P-enolpyruvate phosphotransferase transport system (PTS) of Streptococcus mutans LT11. The first gene, srlD, codes for sorbitol-6-phosphate dehydrogenase, followed downstream by srlR, coding for a transcriptional regulator; srlM, coding for a putative activator; and the srlA, srlE, and srlB genes, coding for the EIIC, EIIBC, and EIIA components of the sorbitol PTS, respectively. Among all sorbitol PTS operons characterized to date, the srlD gene is found after the genes coding for the EII components; thus, the location of the gene in S. mutans is unique. The SrlR protein is similar to several transcriptional regulators found in Bacillus spp. that contain PTS regulator domains (J. Stülke, M. Arnaud, G. Rapoport, and I. Martin-Verstraete, Mol. Microbiol. 28:865–874, 1998), and its gene overlaps the srlM gene by 1 bp. The arrangement of these two regulatory genes is unique, having not been reported for other bacteria. PMID:10639465
DOE Office of Scientific and Technical Information (OSTI.GOV)
Smialowska, Agata, E-mail: smialowskaa@gmail.com; School of Life Sciences, Södertörn Högskola, Huddinge 141-89; Djupedal, Ingela
Highlights: • Protein coding genes accumulate anti-sense sRNAs in fission yeast S. pombe. • RNAi represses protein-coding genes in S. pombe. • RNAi-mediated gene repression is post-transcriptional. - Abstract: RNA interference (RNAi) is a gene silencing mechanism conserved from fungi to mammals. Small interfering RNAs are products and mediators of the RNAi pathway and act as specificity factors in recruiting effector complexes. The Schizosaccharomyces pombe genome encodes one of each of the core RNAi proteins, Dicer, Argonaute and RNA-dependent RNA polymerase (dcr1, ago1, rdp1). Even though the function of RNAi in heterochromatin assembly in S. pombe is established, its rolemore » in controlling gene expression is elusive. Here, we report the identification of small RNAs mapped anti-sense to protein coding genes in fission yeast. We demonstrate that these genes are up-regulated at the protein level in RNAi mutants, while their mRNA levels are not significantly changed. We show that the repression by RNAi is not a result of heterochromatin formation. Thus, we conclude that RNAi is involved in post-transcriptional gene silencing in S. pombe.« less
The Human Cell Surfaceome of Breast Tumors
da Cunha, Júlia Pinheiro Chagas; Galante, Pedro Alexandre Favoretto; de Souza, Jorge Estefano Santana; Pieprzyk, Martin; Carraro, Dirce Maria; Old, Lloyd J.; Camargo, Anamaria Aranha; de Souza, Sandro José
2013-01-01
Introduction. Cell surface proteins are ideal targets for cancer therapy and diagnosis. We have identified a set of more than 3700 genes that code for transmembrane proteins believed to be at human cell surface. Methods. We used a high-throuput qPCR system for the analysis of 573 cell surface protein-coding genes in 12 primary breast tumors, 8 breast cell lines, and 21 normal human tissues including breast. To better understand the role of these genes in breast tumors, we used a series of bioinformatics strategies to integrates different type, of the datasets, such as KEGG, protein-protein interaction databases, ONCOMINE, and data from, literature. Results. We found that at least 77 genes are overexpressed in breast primary tumors while at least 2 of them have also a restricted expression pattern in normal tissues. We found common signaling pathways that may be regulated in breast tumors through the overexpression of these cell surface protein-coding genes. Furthermore, a comparison was made between the genes found in this report and other genes associated with features clinically relevant for breast tumorigenesis. Conclusions. The expression profiling generated in this study, together with an integrative bioinformatics analysis, allowed us to identify putative targets for breast tumors. PMID:24195083
Vicente, Juan J; Galardi-Castilla, María; Escalante, Ricardo; Sastre, Leandro
2008-01-03
The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87-89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13 nucleotides long open reading frame and a second exon comprising the remaining of the putative coding region. The expression of these genes is induced at10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.
Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi.
Subramanian, Sankar; Huynen, Leon; Millar, Craig D; Lambert, David M
2010-12-15
Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli) and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold. Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes. The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, Yongyan; Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A and F University, Yangling 712100, Shaanxi; Ai, Zhiying
2013-10-15
Embryonic stem cells (ESCs) can proliferate indefinitely in vitro and differentiate into cells of all three germ layers. These unique properties make them exceptionally valuable for drug discovery and regenerative medicine. However, the practical application of ESCs is limited because it is difficult to derive and culture ESCs. It has been demonstrated that CHIR99021 (CHIR) promotes self-renewal and enhances the derivation efficiency of mouse (m)ESCs. However, the downstream targets of CHIR are not fully understood. In this study, we identified CHIR-regulated genes in mESCs using microarray analysis. Our microarray data demonstrated that CHIR not only influenced the Wnt/β-catenin pathway bymore » stabilizing β-catenin, but also modulated several other pluripotency-related signaling pathways such as TGF-β, Notch and MAPK signaling pathways. More detailed analysis demonstrated that CHIR inhibited Nodal signaling, while activating bone morphogenetic protein signaling in mESCs. In addition, we found that pluripotency-maintaining transcription factors were up-regulated by CHIR, while several developmental-related genes were down-regulated. Furthermore, we found that CHIR altered the expression of epigenetic regulatory genes and long intergenic non-coding RNAs. Quantitative real-time PCR results were consistent with microarray data, suggesting that CHIR alters the expression pattern of protein-encoding genes (especially transcription factors), epigenetic regulatory genes and non-coding RNAs to establish a relatively stable pluripotency-maintaining network. - Highlights: • Combined use of CHIR with LIF promotes self-renewal of J1 mESCs. • CHIR-regulated genes are involved in multiple pathways. • CHIR inhibits Nodal signaling and promotes Bmp4 expression to activate BMP signaling. • Expression of epigenetic regulatory genes and lincRNAs is altered by CHIR.« less
Duellman, Tyler; Warren, Christopher; Yang, Jay
2014-01-01
Microribonucleic acids (miRNAs) work with exquisite specificity and are able to distinguish a target from a non-target based on a single nucleotide mismatch in the core nucleotide domain. We questioned whether miRNA regulation of gene expression could occur in a single nucleotide polymorphism (SNP)-specific manner, manifesting as a post-transcriptional control of expression of genetic polymorphisms. In our recent study of the functional consequences of matrix metalloproteinase (MMP)-9 SNPs, we discovered that expression of a coding exon SNP in the pro-domain of the protein resulted in a profound decrease in the secreted protein. This missense SNP results in the N38S amino acid change and a loss of an N-glycosylation site. A systematic study demonstrated that the loss of secreted protein was due not to the loss of an N-glycosylation site, but rather an SNP-specific targeting by miR-671-3p and miR-657. Bioinformatics analysis identified 41 SNP-specific miRNA targeting MMP-9 SNPs, mostly in the coding exon and an extension of the analysis to chromosome 20, where the MMP-9 gene is located, suggesting that SNP-specific miRNAs targeting the coding exon are prevalent. This selective post-transcriptional regulation of a target messenger RNA harboring genetic polymorphisms by miRNAs offers an SNP-dependent post-transcriptional regulatory mechanism, allowing for polymorphic-specific differential gene regulation. PMID:24627221
Seligmann, Hervé
2013-03-01
Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Dobrenel, Thomas; Mancera-Martínez, Eder; Forzani, Céline; Azzopardi, Marianne; Davanture, Marlène; Moreau, Manon; Schepetilnikov, Mikhail; Chicher, Johana; Langella, Olivier; Zivy, Michel; Robaglia, Christophe; Ryabova, Lyubov A.; Hanson, Johannes; Meyer, Christian
2016-01-01
Protein translation is an energy consuming process that has to be fine-tuned at both the cell and organism levels to match the availability of resources. The target of rapamycin kinase (TOR) is a key regulator of a large range of biological processes in response to environmental cues. In this study, we have investigated the effects of TOR inactivation on the expression and regulation of Arabidopsis ribosomal proteins at different levels of analysis, namely from transcriptomic to phosphoproteomic. TOR inactivation resulted in a coordinated down-regulation of the transcription and translation of nuclear-encoded mRNAs coding for plastidic ribosomal proteins, which could explain the chlorotic phenotype of the TOR silenced plants. We have identified in the 5′ untranslated regions (UTRs) of this set of genes a conserved sequence related to the 5′ terminal oligopyrimidine motif, which is known to confer translational regulation by the TOR kinase in other eukaryotes. Furthermore, the phosphoproteomic analysis of the ribosomal fraction following TOR inactivation revealed a lower phosphorylation of the conserved Ser240 residue in the C-terminal region of the 40S ribosomal protein S6 (RPS6). These results were confirmed by Western blot analysis using an antibody that specifically recognizes phosphorylated Ser240 in RPS6. Finally, this antibody was used to follow TOR activity in plants. Our results thus uncover a multi-level regulation of plant ribosomal genes and proteins by the TOR kinase. PMID:27877176
Liu, Wenjing; Ma, Rui; Yuan, Yuan
2017-01-01
Noncoding RNAs play critical roles in regulating protein-coding genes and comprise two major classes: long noncoding RNAs (lncRNAs) and microRNAs (miRNAs). LncRNAs regulate gene expression at transcriptional, post-transcriptional, and epigenetic levels via multiple action modes. LncRNAs can also function as endogenous competitive RNAs for miRNAs and indirectly regulate gene expression post-transcriptionally. By binding to the 3'-untranslated regions (3'-UTR) of target genes, miRNAs post-transcriptionally regulate gene expression. Herein, we conducted a review of post-transcriptional regulation by lncRNAs and miRNAs of genes associated with biological behaviors of gastric cancer. PMID:29187891
Neuhaus, Klaus; Landstorfer, Richard; Fellner, Lea; Simon, Svenja; Schafferhans, Andrea; Goldberg, Tatyana; Marx, Harald; Ozoline, Olga N; Rost, Burkhard; Kuster, Bernhard; Keim, Daniel A; Scherer, Siegfried
2016-02-24
Genomes of E. coli, including that of the human pathogen Escherichia coli O157:H7 (EHEC) EDL933, still harbor undetected protein-coding genes which, apparently, have escaped annotation due to their small size and non-essential function. To find such genes, global gene expression of EHEC EDL933 was examined, using strand-specific RNAseq (transcriptome), ribosomal footprinting (translatome) and mass spectrometry (proteome). Using the above methods, 72 short, non-annotated protein-coding genes were detected. All of these showed signals in the ribosomal footprinting assay indicating mRNA translation. Seven were verified by mass spectrometry. Fifty-seven genes are annotated in other enterobacteriaceae, mainly as hypothetical genes; the remaining 15 genes constitute novel discoveries. In addition, protein structure and function were predicted computationally and compared between EHEC-encoded proteins and 100-times randomly shuffled proteins. Based on this comparison, 61 of the 72 novel proteins exhibit predicted structural and functional features similar to those of annotated proteins. Many of the novel genes show differential transcription when grown under eleven diverse growth conditions suggesting environmental regulation. Three genes were found to confer a phenotype in previous studies, e.g., decreased cattle colonization. These findings demonstrate that ribosomal footprinting can be used to detect novel protein coding genes, contributing to the growing body of evidence that hypothetical genes are not annotation artifacts and opening an additional way to study their functionality. All 72 genes are taxonomically restricted and, therefore, appear to have evolved relatively recently de novo.
Dimitrieva, Slavica; Anisimova, Maria
2014-01-01
In protein-coding genes, synonymous mutations are often thought not to affect fitness and therefore are not subject to natural selection. Yet increasingly, cases of non-neutral evolution at certain synonymous sites were reported over the last decade. To evaluate the extent and the nature of site-specific selection on synonymous codons, we computed the site-to-site synonymous rate variation (SRV) and identified gene properties that make SRV more likely in a large database of protein-coding gene families and protein domains. To our knowledge, this is the first study that explores the determinants and patterns of the SRV in real data. We show that the SRV is widespread in the evolution of protein-coding sequences, putting in doubt the validity of the synonymous rate as a standard neutral proxy. While protein domains rarely undergo adaptive evolution, the SRV appears to play important role in optimizing the domain function at the level of DNA. In contrast, protein families are more likely to evolve by positive selection, but are less likely to exhibit SRV. Stronger SRV was detected in genes with stronger codon bias and tRNA reusage, those coding for proteins with larger number of interactions or forming larger number of structures, located in intracellular components and those involved in typically conserved complex processes and functions. Genes with extreme SRV show higher expression levels in nearly all tissues. This indicates that codon bias in a gene, which often correlates with gene expression, may often be a site-specific phenomenon regulating the speed of translation along the sequence, consistent with the co-translational folding hypothesis. Strikingly, genes with SRV were strongly overrepresented for metabolic pathways and those associated with several genetic diseases, particularly cancers and diabetes.
Schwab, Stefan; Ramos, Humberto J; Souza, Emanuel M; Pedrosa, Fábio O; Yates, Marshall G; Chubatsu, Leda S; Rigo, Liu U
2007-05-01
Random mutagenesis using transposons with promoterless reporter genes has been widely used to examine differential gene expression patterns in bacteria. Using this approach, we have identified 26 genes of the endophytic nitrogen-fixing bacterium Herbaspirillum seropedicae regulated in response to ammonium content in the growth medium. These include nine genes involved in the transport of nitrogen compounds, such as the high-affinity ammonium transporter AmtB, and uptake systems for alternative nitrogen sources; nine genes coding for proteins responsible for restoring intracellular ammonium levels through enzymatic reactions, such as nitrogenase, amidase, and arginase; and a third group includes metabolic switch genes, coding for sensor kinases or transcription regulation factors, whose role in metabolism was previously unknown. Also, four genes identified were of unknown function. This paper describes their involvement in response to ammonium limitation. The results provide a preliminary profile of the metabolic response of Herbaspirillum seropedicae to ammonium stress.
Connections Underlying Translation and mRNA Stability.
Radhakrishnan, Aditya; Green, Rachel
2016-09-11
Gene expression and regulation in organisms minimally depends on transcription by RNA polymerase and on the stability of the RNA product (for both coding and non-coding RNAs). For coding RNAs, gene expression is further influenced by the amount of translation by the ribosome and by the stability of the protein product. The stabilities of these two classes of RNA, non-coding and coding, vary considerably: tRNAs and rRNAs tend to be long lived while mRNAs tend to be more short lived. Even among mRNAs, however, there is a considerable range in stability (ranging from seconds to hours in bacteria and up to days in metazoans), suggesting a significant role for stability in the regulation of gene expression. Here, we review recent experiments from bacteria, yeast and metazoans indicating that the stability of most mRNAs is broadly impacted by the actions of ribosomes that translate them. Ribosomal recognition of defective mRNAs triggers "mRNA surveillance" pathways that target the mRNA for degradation [Shoemaker and Green (2012) ]. More generally, even the stability of perfectly functional mRNAs appears to be dictated by overall rates of translation by the ribosome [Herrick et al. (1990), Presnyak et al. (2015) ]. Given that mRNAs are synthesized for the purpose of being translated into proteins, it is reassuring that such intimate connections between mRNA and the ribosome can drive biological regulation. In closing, we consider the likelihood that these connections between protein synthesis and mRNA stability are widespread or whether other modes of regulation dominate the mRNA stability landscape in higher organisms. Copyright © 2016. Published by Elsevier Ltd.
Nicolas, Francisco Esteban; Moxon, Simon; de Haro, Juan P.; Calo, Silvia; Grigoriev, Igor V.; Torres-Martínez, Santiago; Moulton, Vincent; Ruiz-Vázquez, Rosa M.; Dalmay, Tamas
2010-01-01
Endogenous short RNAs (esRNAs) play diverse roles in eukaryotes and usually are produced from double-stranded RNA (dsRNA) by Dicer. esRNAs are grouped into different classes based on biogenesis and function but not all classes are present in all three eukaryotic kingdoms. The esRNA register of fungi is poorly described compared to other eukaryotes and it is not clear what esRNA classes are present in this kingdom and whether they regulate the expression of protein coding genes. However, evidence that some dicer mutant fungi display altered phenotypes suggests that esRNAs play an important role in fungi. Here, we show that the basal fungus Mucor circinelloides produces new classes of esRNAs that map to exons and regulate the expression of many protein coding genes. The largest class of these exonic-siRNAs (ex-siRNAs) are generated by RNA-dependent RNA Polymerase 1 (RdRP1) and dicer-like 2 (DCL2) and target the mRNAs of protein coding genes from which they were produced. Our results expand the range of esRNAs in eukaryotes and reveal a new role for esRNAs in fungi. PMID:20427422
Maier, Uwe-G; Zauner, Stefan; Woehle, Christian; Bolte, Kathrin; Hempel, Franziska; Allen, John F.; Martin, William F.
2013-01-01
Plastid and mitochondrial genomes have undergone parallel evolution to encode the same functional set of genes. These encode conserved protein components of the electron transport chain in their respective bioenergetic membranes and genes for the ribosomes that express them. This highly convergent aspect of organelle genome evolution is partly explained by the redox regulation hypothesis, which predicts a separate plastid or mitochondrial location for genes encoding bioenergetic membrane proteins of either photosynthesis or respiration. Here we show that convergence in organelle genome evolution is far stronger than previously recognized, because the same set of genes for ribosomal proteins is independently retained by both plastid and mitochondrial genomes. A hitherto unrecognized selective pressure retains genes for the same ribosomal proteins in both organelles. On the Escherichia coli ribosome assembly map, the retained proteins are implicated in 30S and 50S ribosomal subunit assembly and initial rRNA binding. We suggest that ribosomal assembly imposes functional constraints that govern the retention of ribosomal protein coding genes in organelles. These constraints are subordinate to redox regulation for electron transport chain components, which anchor the ribosome to the organelle genome in the first place. As organelle genomes undergo reduction, the rRNAs also become smaller. Below size thresholds of approximately 1,300 nucleotides (16S rRNA) and 2,100 nucleotides (26S rRNA), all ribosomal protein coding genes are lost from organelles, while electron transport chain components remain organelle encoded as long as the organelles use redox chemistry to generate a proton motive force. PMID:24259312
Long Non-Coding RNAs Regulating Immunity in Insects
Satyavathi, Valluri; Ghosh, Rupam; Subramanian, Srividya
2017-01-01
Recent advances in modern technology have led to the understanding that not all genetic information is coded into protein and that the genomes of each and every organism including insects produce non-coding RNAs that can control different biological processes. Among RNAs identified in the last decade, long non-coding RNAs (lncRNAs) represent a repertoire of a hidden layer of internal signals that can regulate gene expression in physiological, pathological, and immunological processes. Evidence shows the importance of lncRNAs in the regulation of host–pathogen interactions. In this review, an attempt has been made to view the role of lncRNAs regulating immune responses in insects. PMID:29657286
Zhang, Yan-Qiong; Chen, Dong-Liang; Tian, Hai-Feng; Zhang, Bao-Hong; Wen, Jian-Fan
2009-10-01
Using a combined computational program, we identified 50 potential microRNAs (miRNAs) in Giardia lamblia, one of the most primitive unicellular eukaryotes. These miRNAs are unique to G. lamblia and no homologues have been found in other organisms; miRNAs, currently known in other species, were not found in G. lamblia. This suggests that miRNA biogenesis and miRNA-mediated gene regulation pathway may evolve independently, especially in evolutionarily distant lineages. A majority (43) of the predicted miRNAs are located at one single locus; however, some miRNAs have two or more copies in the genome. Among the 58 miRNA genes, 28 are located in the intergenic regions whereas 30 are present in the anti-sense strands of the protein-coding sequences. Five predicted miRNAs are expressed in G. lamblia trophozoite cells evidenced by expressed sequence tags or RT-PCR. Thirty-seven identified miRNAs may target 50 protein-coding genes, including seven variant-specific surface proteins (VSPs). Our findings provide a clue that miRNA-mediated gene regulation may exist in the early stage of eukaryotic evolution, suggesting that it is an important regulation system ubiquitous in eukaryotes.
Moreno, Renata; Fonseca, Pilar; Rojo, Fernando
2010-08-06
In Pseudomonas putida, the expression of the pWW0 plasmid genes for the toluene/xylene assimilation pathway (the TOL pathway) is subject to complex regulation in response to environmental and physiological signals. This includes strong inhibition via catabolite repression, elicited by the carbon sources that the cells prefer to hydrocarbons. The Crc protein, a global regulator that controls carbon flow in pseudomonads, has an important role in this inhibition. Crc is a translational repressor that regulates the TOL genes, but how it does this has remained unknown. This study reports that Crc binds to sites located at the translation initiation regions of the mRNAs coding for XylR and XylS, two specific transcription activators of the TOL genes. Unexpectedly, eight additional Crc binding sites were found overlapping the translation initiation sites of genes coding for several enzymes of the pathway, all encoded within two polycistronic mRNAs. Evidence is provided supporting the idea that these sites are functional. This implies that Crc can differentially modulate the expression of particular genes within polycistronic mRNAs. It is proposed that Crc controls TOL genes in two ways. First, Crc inhibits the translation of the XylR and XylS regulators, thereby reducing the transcription of all TOL pathway genes. Second, Crc inhibits the translation of specific structural genes of the pathway, acting mainly on proteins involved in the first steps of toluene assimilation. This ensures a rapid inhibitory response that reduces the expression of the toluene/xylene degradation proteins when preferred carbon sources become available.
Seligmann, Hervé
2013-05-07
GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
Jenkins, Adam M; Waterhouse, Robert M; Muskavitch, Marc A T
2015-04-23
Long non-coding RNAs (lncRNAs) have been defined as mRNA-like transcripts longer than 200 nucleotides that lack significant protein-coding potential, and many of them constitute scaffolds for ribonucleoprotein complexes with critical roles in epigenetic regulation. Various lncRNAs have been implicated in the modulation of chromatin structure, transcriptional and post-transcriptional gene regulation, and regulation of genomic stability in mammals, Caenorhabditis elegans, and Drosophila melanogaster. The purpose of this study is to identify the lncRNA landscape in the malaria vector An. gambiae and assess the evolutionary conservation of lncRNAs and their secondary structures across the Anopheles genus. Using deep RNA sequencing of multiple Anopheles gambiae life stages, we have identified 2,949 lncRNAs and more than 300 previously unannotated putative protein-coding genes. The lncRNAs exhibit differential expression profiles across life stages and adult genders. We find that across the genus Anopheles, lncRNAs display much lower sequence conservation than protein-coding genes. Additionally, we find that lncRNA secondary structure is highly conserved within the Gambiae complex, but diverges rapidly across the rest of the genus Anopheles. This study offers one of the first lncRNA secondary structure analyses in vector insects. Our description of lncRNAs in An. gambiae offers the most comprehensive genome-wide insights to date into lncRNAs in this vector mosquito, and defines a set of potential targets for the development of vector-based interventions that may further curb the human malaria burden in disease-endemic countries.
Ning, S B; Wang, L; Song, Y C
2000-01-01
Peroxidase plays a key role in plant disease resistance, cold stress and some developmental processes, and cold-regulated protein functions necessarily in reaction of plants on cold or heat stress. Recent studies showed that these processes in plant cells were involved in programmed cell death (PCD). Using a biotin-labelled in situ hybridization (ISH) technique, we physically mapped the genes px and cld coding peroxidase and cold-regulated protein respectively onto maize chromosomes. Both DAB and fluorescence detection systems gave the identical results, the probe uaz235 corresponding to gene px was localized onto the long arm of chromosome 2 (2L) and 7L, and csu19 corresponding to gene cld was hybridized onto 4L and 5L. The percentage distances (from the hybridization sites to centromeres) of uaz235 in 2L and 7L were 45.4 +/- 1.3 and 67.4 +/- 3.7 respectively, and those of csu19 in 4L and 5L were 68.6 +/- 2.6 and 58.2 +/- 1.6 respectively. The physical positions of px in 2L and cld in 4L coincide with those in their genetic map pattern. The results also show that both of these genes have duplicated sites in maize genome.
Decoding the function of nuclear long non-coding RNAs.
Chen, Ling-Ling; Carmichael, Gordon G
2010-06-01
Long non-coding RNAs (lncRNAs) are mRNA-like, non-protein-coding RNAs that are pervasively transcribed throughout eukaryotic genomes. Rather than silently accumulating in the nucleus, many of these are now known or suspected to play important roles in nuclear architecture or in the regulation of gene expression. In this review, we highlight some recent progress in how lncRNAs regulate these important nuclear processes at the molecular level. Copyright 2010 Elsevier Ltd. All rights reserved.
Ambigapathy, Ganesh; Zheng, Zhaoqing; Li, Wei; Keifer, Joyce
2013-01-01
Brain-derived neurotrophic factor (BDNF) has a diverse functional role and complex pattern of gene expression. Alternative splicing of mRNA transcripts leads to further diversity of mRNAs and protein isoforms. Here, we describe the regulation of BDNF mRNA transcripts in an in vitro model of eyeblink classical conditioning and a unique transcript that forms a functionally distinct truncated BDNF protein isoform. Nine different mRNA transcripts from the BDNF gene of the pond turtle Trachemys scripta elegans (tBDNF) are selectively regulated during classical conditioning: exon I mRNA transcripts show no change, exon II transcripts are downregulated, while exon III transcripts are upregulated. One unique transcript that codes from exon II, tBDNF2a, contains a 40 base pair deletion in the protein coding exon that generates a truncated tBDNF protein. The truncated transcript and protein are expressed in the naïve untrained state and are fully repressed during conditioning when full-length mature tBDNF is expressed, thereby having an alternate pattern of expression in conditioning. Truncated BDNF is not restricted to turtles as a truncated mRNA splice variant has been described for the human BDNF gene. Further studies are required to determine the ubiquity of truncated BDNF alternative splice variants across species and the mechanisms of regulation and function of this newly recognized BDNF protein.
Ambigapathy, Ganesh; Zheng, Zhaoqing; Li, Wei; Keifer, Joyce
2013-01-01
Brain-derived neurotrophic factor (BDNF) has a diverse functional role and complex pattern of gene expression. Alternative splicing of mRNA transcripts leads to further diversity of mRNAs and protein isoforms. Here, we describe the regulation of BDNF mRNA transcripts in an in vitro model of eyeblink classical conditioning and a unique transcript that forms a functionally distinct truncated BDNF protein isoform. Nine different mRNA transcripts from the BDNF gene of the pond turtle Trachemys scripta elegans (tBDNF) are selectively regulated during classical conditioning: exon I mRNA transcripts show no change, exon II transcripts are downregulated, while exon III transcripts are upregulated. One unique transcript that codes from exon II, tBDNF2a, contains a 40 base pair deletion in the protein coding exon that generates a truncated tBDNF protein. The truncated transcript and protein are expressed in the naïve untrained state and are fully repressed during conditioning when full-length mature tBDNF is expressed, thereby having an alternate pattern of expression in conditioning. Truncated BDNF is not restricted to turtles as a truncated mRNA splice variant has been described for the human BDNF gene. Further studies are required to determine the ubiquity of truncated BDNF alternative splice variants across species and the mechanisms of regulation and function of this newly recognized BDNF protein. PMID:23825634
MicroRNAs as New Characters in the Plot between Epigenetics and Prostate Cancer.
Paone, Alessio; Galli, Roberta; Fabbri, Muller
2011-01-01
Prostate cancer (PCA) still represents a leading cause of death. An increasing number of studies have documented that microRNAs (miRNAs), a subgroup of non-coding RNAs with gene regulatory functions, are differentially expressed in PCA respect to the normal tissue counterpart, suggesting their involvement in prostate carcinogenesis and dissemination. Interestingly, it has been shown that miRNAs undergo the same regulatory mechanisms than any other protein coding gene, including epigenetic regulation. In turn, miRNAs can also affect the expression of oncogenes and tumor suppressor genes by targeting effectors of the epigenetic machinery, therefore indirectly affecting the epigenetic controls on these genes. Among the genes that undergo this complex regulation, there is the androgen receptor (AR), a key therapeutic target for PCA. This review will focus on the role of epigenetically regulated and epigenetically regulating miRNAs in PCA and on the fine regulation of AR expression, as mediated by this miRNA-epigenetics interaction.
Sugai, Akihiro; Sato, Hiroki; Yoneda, Misako; Kai, Chieko
2017-08-01
The regulation of transcription during Nipah virus (NiV) replication is poorly understood. Using a bicistronic minigenome system, we investigated the involvement of non-coding regions (NCRs) in the transcriptional re-initiation efficiency of NiV RNA polymerase. Reporter assays revealed that attenuation of NiV gene expression was not constant at each gene junction, and that the attenuating property was controlled by the 3' NCR. However, this regulation was independent of the gene-end, gene-start and intergenic regions. Northern blot analysis indicated that regulation of viral gene expression by the phosphoprotein (P) and large protein (L) 3' NCRs occurred at the transcription level. We identified uridine-rich tracts within the L 3' NCR that are similar to gene-end signals. These gene-end-like sequences were recognized as weak transcription termination signals by the viral RNA polymerase, thereby reducing downstream gene transcription. Thus, we suggest that NiV has a unique mechanism of transcriptional regulation. Copyright © 2017 Elsevier Inc. All rights reserved.
Protein-DNA binding dynamics predict transcriptional response to nutrients in archaea.
Todor, Horia; Sharma, Kriti; Pittman, Adrianne M C; Schmid, Amy K
2013-10-01
Organisms across all three domains of life use gene regulatory networks (GRNs) to integrate varied stimuli into coherent transcriptional responses to environmental pressures. However, inferring GRN topology and regulatory causality remains a central challenge in systems biology. Previous work characterized TrmB as a global metabolic transcription factor in archaeal extremophiles. However, it remains unclear how TrmB dynamically regulates its ∼100 metabolic enzyme-coding gene targets. Using a dynamic perturbation approach, we elucidate the topology of the TrmB metabolic GRN in the model archaeon Halobacterium salinarum. Clustering of dynamic gene expression patterns reveals that TrmB functions alone to regulate central metabolic enzyme-coding genes but cooperates with various regulators to control peripheral metabolic pathways. Using a dynamical model, we predict gene expression patterns for some TrmB-dependent promoters and infer secondary regulators for others. Our data suggest feed-forward gene regulatory topology for cobalamin biosynthesis. In contrast, purine biosynthesis appears to require TrmB-independent regulators. We conclude that TrmB is an important component for mediating metabolic modularity, integrating nutrient status and regulating gene expression dynamics alone and in concert with secondary regulators.
AP1 Keeps Chromatin Poised for Action | Center for Cancer Research
The human genome harbors gene-encoding DNA, the blueprint for building proteins that regulate cellular function. Embedded across the genome, in non-coding regions, are DNA elements to which regulatory factors bind. The interaction of regulatory factors with DNA at these sites modifies gene expression to modulate cell activity. In cells, DNA exists in a complex with proteins
Non-coding RNAs—Novel targets in neurotoxicity
Tal, Tamara L.; Tanguay, Robert L.
2012-01-01
Over the past ten years non-coding RNAs (ncRNAs) have emerged as pivotal players in fundamental physiological and cellular processes and have been increasingly implicated in cancer, immune disorders, and cardiovascular, neurodegenerative, and metabolic diseases. MicroRNAs (miRNAs) represent a class of ncRNA molecules that function as negative regulators of post-transcriptional gene expression. miRNAs are predicted to regulate 60% of all human protein-coding genes and as such, play key roles in cellular and developmental processes, human health, and disease. Relative to counterparts that lack bindings sites for miRNAs, genes encoding proteins that are post-transcriptionally regulated by miRNAs are twice as likely to be sensitive to environmental chemical exposure. Not surprisingly, miRNAs have been recognized as targets or effectors of nervous system, developmental, hepatic, and carcinogenic toxicants, and have been identified as putative regulators of phase I xenobiotic-metabolizing enzymes. In this review, we give an overview of the types of ncRNAs and highlight their roles in neurodevelopment, neurological disease, activity-dependent signaling, and drug metabolism. We then delve into specific examples that illustrate their importance as mediators, effectors, or adaptive agents of neurotoxicants or neuroactive pharmaceutical compounds. Finally, we identify a number of outstanding questions regarding ncRNAs and neurotoxicity. PMID:22394481
Greif, Gonzalo; Rodriguez, Matias; Alvarez-Valin, Fernando
2017-01-01
American trypanosomiasis is a chronic and endemic disease which affects millions of people. Trypanosoma cruzi, its causative agent, has a life cycle that involves complex morphological and functional transitions, as well as a variety of environmental conditions. This requires a tight regulation of gene expression, which is achieved mainly by post-transcriptional regulation. In this work we conducted an RNAseq analysis of the three major life cycle stages of T. cruzi: amastigotes, epimastigotes and trypomastigotes. This analysis allowed us to delineate specific transcriptomic profiling for each stage, and also to identify those biological processes of major relevance in each state. Stage specific expression profiling evidenced the plasticity of T. cruzi to adapt quickly to different conditions, with particular focus on membrane remodeling and metabolic shifts along the life cycle. Epimastigotes, which replicate in the gut of insect vectors, showed higher expression of genes related to energy metabolism, mainly Krebs cycle, respiratory chain and oxidative phosphorylation related genes, and anabolism related genes associated to nucleotide and steroid biosynthesis; also, a general down-regulation of surface glycoprotein coding genes was seen at this stage. Trypomastigotes, living extracellularly in the bloodstream of mammals, express a plethora of surface proteins and signaling genes involved in invasion and evasion of immune response. Amastigotes mostly express membrane transporters and genes involved in regulation of cell cycle, and also express a specific subset of surface glycoprotein coding genes. In addition, these results allowed us to improve the annotation of the Dm28c genome, identifying new ORFs and set the stage for construction of networks of co-expression, which can give clues about coded proteins of unknown functions. PMID:28286708
Activity-Dependent Human Brain Coding/Noncoding Gene Regulatory Networks
Lipovich, Leonard; Dachet, Fabien; Cai, Juan; Bagla, Shruti; Balan, Karina; Jia, Hui; Loeb, Jeffrey A.
2012-01-01
While most gene transcription yields RNA transcripts that code for proteins, a sizable proportion of the genome generates RNA transcripts that do not code for proteins, but may have important regulatory functions. The brain-derived neurotrophic factor (BDNF) gene, a key regulator of neuronal activity, is overlapped by a primate-specific, antisense long noncoding RNA (lncRNA) called BDNFOS. We demonstrate reciprocal patterns of BDNF and BDNFOS transcription in highly active regions of human neocortex removed as a treatment for intractable seizures. A genome-wide analysis of activity-dependent coding and noncoding human transcription using a custom lncRNA microarray identified 1288 differentially expressed lncRNAs, of which 26 had expression profiles that matched activity-dependent coding genes and an additional 8 were adjacent to or overlapping with differentially expressed protein-coding genes. The functions of most of these protein-coding partner genes, such as ARC, include long-term potentiation, synaptic activity, and memory. The nuclear lncRNAs NEAT1, MALAT1, and RPPH1, composing an RNAse P-dependent lncRNA-maturation pathway, were also upregulated. As a means to replicate human neuronal activity, repeated depolarization of SY5Y cells resulted in sustained CREB activation and produced an inverse pattern of BDNF-BDNFOS co-expression that was not achieved with a single depolarization. RNAi-mediated knockdown of BDNFOS in human SY5Y cells increased BDNF expression, suggesting that BDNFOS directly downregulates BDNF. Temporal expression patterns of other lncRNA-messenger RNA pairs validated the effect of chronic neuronal activity on the transcriptome and implied various lncRNA regulatory mechanisms. lncRNAs, some of which are unique to primates, thus appear to have potentially important regulatory roles in activity-dependent human brain plasticity. PMID:22960213
Reggiani, Claudio; Coppens, Sandra; Sekhara, Tayeb; Dimov, Ivan; Pichon, Bruno; Lufin, Nicolas; Addor, Marie-Claude; Belligni, Elga Fabia; Digilio, Maria Cristina; Faletra, Flavio; Ferrero, Giovanni Battista; Gerard, Marion; Isidor, Bertrand; Joss, Shelagh; Niel-Bütschi, Florence; Perrone, Maria Dolores; Petit, Florence; Renieri, Alessandra; Romana, Serge; Topa, Alexandra; Vermeesch, Joris Robert; Lenaerts, Tom; Casimir, Georges; Abramowicz, Marc; Bontempi, Gianluca; Vilain, Catheline; Deconinck, Nicolas; Smits, Guillaume
2017-07-19
Tissue-specific integrative omics has the potential to reveal new genic elements important for developmental disorders. Two pediatric patients with global developmental delay and intellectual disability phenotype underwent array-CGH genetic testing, both showing a partial deletion of the DLG2 gene. From independent human and murine omics datasets, we combined copy number variations, histone modifications, developmental tissue-specific regulation, and protein data to explore the molecular mechanism at play. Integrating genomics, transcriptomics, and epigenomics data, we describe two novel DLG2 promoters and coding first exons expressed in human fetal brain. Their murine conservation and protein-level evidence allowed us to produce new DLG2 gene models for human and mouse. These new genic elements are deleted in 90% of 29 patients (public and in-house) showing partial deletion of the DLG2 gene. The patients' clinical characteristics expand the neurodevelopmental phenotypic spectrum linked to DLG2 gene disruption to cognitive and behavioral categories. While protein-coding genes are regarded as well known, our work shows that integration of multiple omics datasets can unveil novel coding elements. From a clinical perspective, our work demonstrates that two new DLG2 promoters and exons are crucial for the neurodevelopmental phenotypes associated with this gene. In addition, our work brings evidence for the lack of cross-annotation in human versus mouse reference genomes and nucleotide versus protein databases.
Decoding the non-coding RNAs in Alzheimer's disease.
Schonrock, Nicole; Götz, Jürgen
2012-11-01
Non-coding RNAs (ncRNAs) are integral components of biological networks with fundamental roles in regulating gene expression. They can integrate sequence information from the DNA code, epigenetic regulation and functions of multimeric protein complexes to potentially determine the epigenetic status and transcriptional network in any given cell. Humans potentially contain more ncRNAs than any other species, especially in the brain, where they may well play a significant role in human development and cognitive ability. This review discusses their emerging role in Alzheimer's disease (AD), a human pathological condition characterized by the progressive impairment of cognitive functions. We discuss the complexity of the ncRNA world and how this is reflected in the regulation of the amyloid precursor protein and Tau, two proteins with central functions in AD. By understanding this intricate regulatory network, there is hope for a better understanding of disease mechanisms and ultimately developing diagnostic and therapeutic tools.
Gaddelapati, Sharath Chandra; Kalsi, Megha; Roy, Amit; Palli, Subba Reddy
2018-08-01
The Colorado potato beetle (CPB), Leptinotarsa decemlineata developed resistance to imidacloprid after exposure to this insecticide for multiple generations. Our previous studies showed that xenobiotic transcription factor, cap 'n' collar isoform C (CncC) regulates the expression of multiple cytochrome P450 genes, which play essential roles in resistance to plant allelochemicals and insecticides. In this study, we sought to obtain a comprehensive picture of the genes regulated by CncC in imidacloprid-resistant CPB. We performed sequencing of RNA isolated from imidacloprid-resistant CPB treated with dsRNA targeting CncC or gene coding for green fluorescent protein (control). Comparative transcriptome analysis showed that CncC regulated the expression of 1798 genes, out of which 1499 genes were downregulated in CncC knockdown beetles. Interestingly, expression of 79% of imidacloprid induced P450 genes requires CncC. We performed quantitative real-time PCR to verify the reduction in the expression of 20 genes including those coding for detoxification enzymes (P450s, glutathione S-transferases, and esterases) and ABC transporters. The genes coding for ABC transporters are induced in insecticide resistant CPB and require CncC for their expression. Knockdown of genes coding for ABC transporters simultaneously or individually caused an increase in imidacloprid-induced mortality in resistant beetles confirming their contribution to insecticide resistance. These studies identified CncC as a transcription factor involved in regulation of genes responsible for imidacloprid resistance. Small molecule inhibitors of CncC or suppression of CncC by RNAi could provide effective synergists for pest control or management of insecticide resistance. Copyright © 2018 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Gong, Liang; Wu, Yu; Jian, Qijie; Yin, Chunxiao; Li, Taotao; Gupta, Vijai Kumar; Duan, Xuewu; Jiang, Yueming
2018-01-01
Vibrio qinghaiensis sp.-Q67 (Vqin-Q67) is a freshwater luminescent bacterium that continuously emits blue-green light (485 nm). The bacterium has been widely used for detecting toxic contaminants. Here, we report the complete genome sequence of Vqin-Q67, obtained using third-generation PacBio sequencing technology. Continuous long reads were attained from three PacBio sequencing runs and reads >500 bp with a quality value of >0.75 were merged together into a single dataset. This resultant highly-contiguous de novo assembly has no genome gaps, and comprises two chromosomes with substantial genetic information, including protein-coding genes, non-coding RNA, transposon and gene islands. Our dataset can be useful as a comparative genome for evolution and speciation studies, as well as for the analysis of protein-coding gene families, the pathogenicity of different Vibrio species in fish, the evolution of non-coding RNA and transposon, and the regulation of gene expression in relation to the bioluminescence of Vqin-Q67.
Maier, Lisa-Katharina; Benz, Juliane; Fischer, Susan; Alstetter, Martina; Jaschinski, Katharina; Hilker, Rolf; Becker, Anke; Allers, Thorsten; Soppa, Jörg; Marchfelder, Anita
2015-10-01
Members of the Sm protein family are important for the cellular RNA metabolism in all three domains of life. The family includes archaeal and eukaryotic Lsm proteins, eukaryotic Sm proteins and archaeal and bacterial Hfq proteins. While several studies concerning the bacterial and eukaryotic family members have been published, little is known about the archaeal Lsm proteins. Although structures for several archaeal Lsm proteins have been solved already more than ten years ago, we still do not know much about their biological function, however one can confidently propose that the archaeal Lsm proteins will also be involved in RNA metabolism. Therefore, we investigated this protein in the halophilic archaeon Haloferax volcanii. The Haloferax genome encodes a single Lsm protein, the lsm gene overlaps and is co-transcribed with the gene for the ribosomal L37.eR protein. Here, we show that the reading frame of the lsm gene contains a promoter which regulates expression of the overlapping rpl37R gene. This rpl37R specific promoter ensures high expression of the rpl37R gene in exponential growth phase. To investigate the biological function of the Lsm protein we generated a lsm deletion mutant that had the coding sequence for the Sm1 motif removed but still contained the internal promoter for the downstream rpl37R gene. The transcriptome of this deletion mutant was compared to the wild type transcriptome, revealing that several genes are down-regulated and many genes are up-regulated in the deletion strain. Northern blot analyses confirmed down-regulation of two genes. In addition, the deletion strain showed a gain of function in swarming, in congruence with the up-regulation of transcripts encoding proteins required for motility. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Nucleic acids encoding plant glutamine phenylpyruvate transaminase (GPT) and uses thereof
Unkefer, Pat J.; Anderson, Penelope S.; Knight, Thomas J.
2016-03-29
Glutamine phenylpyruvate transaminase (GPT) proteins, nucleic acid molecules encoding GPT proteins, and uses thereof are disclosed. Provided herein are various GPT proteins and GPT gene coding sequences isolated from a number of plant species. As disclosed herein, GPT proteins share remarkable structural similarity within plant species, and are active in catalyzing the synthesis of 2-hydroxy-5-oxoproline (2-oxoglutaramate), a powerful signal metabolite which regulates the function of a large number of genes involved in the photosynthesis apparatus, carbon fixation and nitrogen metabolism.
Zhao, Zheng; Bai, Jing; Wu, Aiwei; Wang, Yuan; Zhang, Jinwen; Wang, Zishan; Li, Yongsheng; Xu, Juan; Li, Xia
2015-01-01
Long non-coding RNAs (lncRNAs) are emerging as key regulators of diverse biological processes and diseases. However, the combinatorial effects of these molecules in a specific biological function are poorly understood. Identifying co-expressed protein-coding genes of lncRNAs would provide ample insight into lncRNA functions. To facilitate such an effort, we have developed Co-LncRNA, which is a web-based computational tool that allows users to identify GO annotations and KEGG pathways that may be affected by co-expressed protein-coding genes of a single or multiple lncRNAs. LncRNA co-expressed protein-coding genes were first identified in publicly available human RNA-Seq datasets, including 241 datasets across 6560 total individuals representing 28 tissue types/cell lines. Then, the lncRNA combinatorial effects in a given GO annotations or KEGG pathways are taken into account by the simultaneous analysis of multiple lncRNAs in user-selected individual or multiple datasets, which is realized by enrichment analysis. In addition, this software provides a graphical overview of pathways that are modulated by lncRNAs, as well as a specific tool to display the relevant networks between lncRNAs and their co-expressed protein-coding genes. Co-LncRNA also supports users in uploading their own lncRNA and protein-coding gene expression profiles to investigate the lncRNA combinatorial effects. It will be continuously updated with more human RNA-Seq datasets on an annual basis. Taken together, Co-LncRNA provides a web-based application for investigating lncRNA combinatorial effects, which could shed light on their biological roles and could be a valuable resource for this community. Database URL: http://www.bio-bigdata.com/Co-LncRNA/ PMID:26363020
cncRNAs: Bi-functional RNAs with protein coding and non-coding functions
Kumari, Pooja; Sampath, Karuna
2015-01-01
For many decades, the major function of mRNA was thought to be to provide protein-coding information embedded in the genome. The advent of high-throughput sequencing has led to the discovery of pervasive transcription of eukaryotic genomes and opened the world of RNA-mediated gene regulation. Many regulatory RNAs have been found to be incapable of protein coding and are hence termed as non-coding RNAs (ncRNAs). However, studies in recent years have shown that several previously annotated non-coding RNAs have the potential to encode proteins, and conversely, some coding RNAs have regulatory functions independent of the protein they encode. Such bi-functional RNAs, with both protein coding and non-coding functions, which we term as ‘cncRNAs’, have emerged as new players in cellular systems. Here, we describe the functions of some cncRNAs identified from bacteria to humans. Because the functions of many RNAs across genomes remains unclear, we propose that RNAs be classified as coding, non-coding or both only after careful analysis of their functions. PMID:26498036
The Mediator complex: a central integrator of transcription
Allen, Benjamin L.; Taatjes, Dylan J.
2016-01-01
The RNA polymerase II (pol II) enzyme transcribes all protein-coding and most non-coding RNA genes and is globally regulated by Mediator, a large, conformationally flexible protein complex with variable subunit composition (for example, a four-subunit CDK8 module can reversibly associate). These biochemical characteristics are fundamentally important for Mediator's ability to control various processes important for transcription, including organization of chromatin architecture and regulation of pol II pre-initiation, initiation, re-initiation, pausing, and elongation. Although Mediator exists in all eukaryotes, a variety of Mediator functions appear to be specific to metazoans, indicative of more diverse regulatory requirements. PMID:25693131
Wen, Dong-Yue; Lin, Peng; Pang, Yu-Yan; Chen, Gang; He, Yun; Dang, Yi-Wu; Yang, Hong
2018-05-05
BACKGROUND Long non-coding RNAs (lncRNAs) have a role in physiological and pathological processes, including cancer. The aim of this study was to investigate the expression of the long intergenic non-protein coding RNA 665 (LINC00665) gene and the cell cycle in hepatocellular carcinoma (HCC) using database analysis including The Cancer Genome Atlas (TCGA), the Gene Expression Omnibus (GEO), and quantitative real-time polymerase chain reaction (qPCR). MATERIAL AND METHODS Expression levels of LINC00665 were compared between human tissue samples of HCC and adjacent normal liver, clinicopathological correlations were made using TCGA and the GEO, and qPCR was performed to validate the findings. Other public databases were searched for other genes associated with LINC00665 expression, including The Atlas of Noncoding RNAs in Cancer (TANRIC), the Multi Experiment Matrix (MEM), Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and protein-protein interaction (PPI) networks. RESULTS Overexpression of LINC00665 in patients with HCC was significantly associated with gender, tumor grade, stage, and tumor cell type. Overexpression of LINC00665 in patients with HCC was significantly associated with overall survival (OS) (HR=1.47795%; CI: 1.046-2.086). Bioinformatics analysis identified 469 related genes and further analysis supported a hypothesis that LINC00665 regulates pathways in the cell cycle to facilitate the development and progression of HCC through ten identified core genes: CDK1, BUB1B, BUB1, PLK1, CCNB2, CCNB1, CDC20, ESPL1, MAD2L1, and CCNA2. CONCLUSIONS Overexpression of the lncRNA, LINC00665 may be involved in the regulation of cell cycle pathways in HCC through ten identified hub genes.
Fan, Zenghua; Zhao, Meng; Joshi, Parth D.; Li, Ping; Zhang, Yan; Guo, Weimin; Xu, Yichi; Wang, Haifang; Zhao, Zhihu
2017-01-01
Abstract Circadian rhythm exerts its influence on animal physiology and behavior by regulating gene expression at various levels. Here we systematically explored circadian long non-coding RNAs (lncRNAs) in mouse liver and examined their circadian regulation. We found that a significant proportion of circadian lncRNAs are expressed at enhancer regions, mostly bound by two key circadian transcription factors, BMAL1 and REV-ERBα. These circadian lncRNAs showed similar circadian phases with their nearby genes. The extent of their nuclear localization is higher than protein coding genes but less than enhancer RNAs. The association between enhancer and circadian lncRNAs is also observed in tissues other than liver. Comparative analysis between mouse and rat circadian liver transcriptomes showed that circadian transcription at lncRNA loci tends to be conserved despite of low sequence conservation of lncRNAs. One such circadian lncRNA termed lnc-Crot led us to identify a super-enhancer region interacting with a cluster of genes involved in circadian regulation of metabolism through long-range interactions. Further experiments showed that lnc-Crot locus has enhancer function independent of lnc-Crot's transcription. Our results suggest that the enhancer-associated circadian lncRNAs mark the genomic loci modulating long-range circadian gene regulation and shed new lights on the evolutionary origin of lncRNAs. PMID:28335007
Song, Yuepeng; Tian, Min; Ci, Dong; Zhang, Deqiang
2015-04-01
Previous studies showed sex-specific DNA methylation and expression of candidate genes in bisexual flowers of andromonoecious poplar, but the regulatory relationship between methylation and microRNAs (miRNAs) remains unclear. To investigate whether the methylation of miRNA genes regulates gene expression in bisexual flower development, the methylome, microRNA, and transcriptome were examined in female and male flowers of andromonoecious poplar. 27 636 methylated coding genes and 113 methylated miRNA genes were identified. In the coding genes, 64.5% of the methylated reads mapped to the gene body region; by contrast, 60.7% of methylated reads in miRNA genes mainly mapped in the 5' and 3' flanking regions. CHH methylation showed the highest methylation levels and CHG showed the lowest methylation levels. Correlation analysis showed a significant, negative, strand-specific correlation of methylation and miRNA gene expression (r=0.79, P <0.05). The methylated miRNA genes included eight long miRNAs (lmiRNAs) of 24 nucleotides and 11 miRNAs related to flower development. miRNA172b might play an important role in the regulation of bisexual flower development-related gene expression in andromonoecious poplar, via modification of methylation. Gynomonoecious, female, and male poplars were used to validate the methylation patterns of the miRNA172b gene, implying that hyper-methylation in andromonoecious and gynomonoecious poplar might function as an important regulator in bisexual flower development. Our data provide a useful resource for the study of flower development in poplar and improve our understanding of the effect of epigenetic regulation on genes other than protein-coding genes. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Song, Yuepeng; Tian, Min; Ci, Dong; Zhang, Deqiang
2015-01-01
Previous studies showed sex-specific DNA methylation and expression of candidate genes in bisexual flowers of andromonoecious poplar, but the regulatory relationship between methylation and microRNAs (miRNAs) remains unclear. To investigate whether the methylation of miRNA genes regulates gene expression in bisexual flower development, the methylome, microRNA, and transcriptome were examined in female and male flowers of andromonoecious poplar. 27 636 methylated coding genes and 113 methylated miRNA genes were identified. In the coding genes, 64.5% of the methylated reads mapped to the gene body region; by contrast, 60.7% of methylated reads in miRNA genes mainly mapped in the 5′ and 3′ flanking regions. CHH methylation showed the highest methylation levels and CHG showed the lowest methylation levels. Correlation analysis showed a significant, negative, strand-specific correlation of methylation and miRNA gene expression (r=0.79, P <0.05). The methylated miRNA genes included eight long miRNAs (lmiRNAs) of 24 nucleotides and 11 miRNAs related to flower development. miRNA172b might play an important role in the regulation of bisexual flower development-related gene expression in andromonoecious poplar, via modification of methylation. Gynomonoecious, female, and male poplars were used to validate the methylation patterns of the miRNA172b gene, implying that hyper-methylation in andromonoecious and gynomonoecious poplar might function as an important regulator in bisexual flower development. Our data provide a useful resource for the study of flower development in poplar and improve our understanding of the effect of epigenetic regulation on genes other than protein-coding genes. PMID:25617468
USDA-ARS?s Scientific Manuscript database
Retrograde signalling is a selective process defined by cues generated in chloroplast/mitochondria which traverse membranes and end up regulating nuclear gene expression and protein synthesis. The coding and encoding of organellar message(s) that alter nuclear gene expression and/or cellular metabo...
Transcriptional profiling of predator-induced phenotypic plasticity in Daphnia pulex.
Rozenberg, Andrey; Parida, Mrutyunjaya; Leese, Florian; Weiss, Linda C; Tollrian, Ralph; Manak, J Robert
2015-01-01
Predator-induced defences are a prominent example of phenotypic plasticity found from single-celled organisms to vertebrates. The water flea Daphnia pulex is a very convenient ecological genomic model for studying predator-induced defences as it exhibits substantial morphological changes under predation risk. Most importantly, however, genetically identical clones can be transcriptionally profiled under both control and predation risk conditions and be compared due to the availability of the sequenced reference genome. Earlier gene expression analyses of candidate genes as well as a tiled genomic microarray expression experiment have provided insights into some genes involved in predator-induced phenotypic plasticity. Here we performed the first RNA-Seq analysis to identify genes that were differentially expressed in defended vs. undefended D. pulex specimens in order to explore the genetic mechanisms underlying predator-induced defences at a qualitatively novel level. We report 230 differentially expressed genes (158 up- and 72 down-regulated) identified in at least two of three different assembly approaches. Several of the differentially regulated genes belong to families of paralogous genes. The most prominent classes amongst the up-regulated genes include cuticle genes, zinc-metalloproteinases and vitellogenin genes. Furthermore, several genes from this group code for proteins recruited in chromatin-reorganization or regulation of the cell cycle (cyclins). Down-regulated gene classes include C-type lectins, proteins involved in lipogenesis, and other families, some of which encode proteins with no known molecular function. The RNA-Seq transcriptome data presented in this study provide important insights into gene regulatory patterns underlying predator-induced defences. In particular, we characterized different effector genes and gene families found to be regulated in Daphnia in response to the presence of an invertebrate predator. These effector genes are mostly in agreement with expectations based on observed phenotypic changes including morphological alterations, i.e., expression of proteins involved in formation of protective structures and in cuticle strengthening, as well as proteins required for resource re-allocation. Our findings identify key genetic pathways associated with anti-predator defences.
Image-guided genomic analysis of tissue response to laser-induced thermal stress
NASA Astrophysics Data System (ADS)
Mackanos, Mark A.; Helms, Mike; Kalish, Flora; Contag, Christopher H.
2011-05-01
The cytoprotective response to thermal injury is characterized by transcriptional activation of ``heat shock proteins'' (hsp) and proinflammatory proteins. Expression of these proteins may predict cellular survival. Microarray analyses were performed to identify spatially distinct gene expression patterns responding to thermal injury. Laser injury zones were identified by expression of a transgene reporter comprised of the 70 kD hsp gene and the firefly luciferase coding sequence. Zones included the laser spot, the surrounding region where hsp70-luc expression was increased, and a region adjacent to the surrounding region. A total of 145 genes were up-regulated in the laser irradiated region, while 69 were up-regulated in the adjacent region. At 7 hours the chemokine Cxcl3 was the highest expressed gene in the laser spot (24 fold) and adjacent region (32 fold). Chemokines were the most common up-regulated genes identified. Microarray gene expression was successfully validated using qRT- polymerase chain reaction for selected genes of interest. The early response genes are likely involved in cytoprotection and initiation of the healing response. Their regulatory elements will benefit creating the next generation reporter mice and controlling expression of therapeutic proteins. The identified genes serve as drug development targets that may prevent acute tissue damage and accelerate healing.
Li, C-Q; Huang, G-W; Wu, Z-Y; Xu, Y-J; Li, X-C; Xue, Y-J; Zhu, Y; Zhao, J-M; Li, M; Zhang, J; Wu, J-Y; Lei, F; Wang, Q-Y; Li, S; Zheng, C-P; Ai, B; Tang, Z-D; Feng, C-C; Liao, L-D; Wang, S-H; Shen, J-H; Liu, Y-J; Bai, X-F; He, J-Z; Cao, H-H; Wu, B-L; Wang, M-R; Lin, D-C; Koeffler, H P; Wang, L-D; Li, X; Li, E-M; Xu, L-Y
2017-02-13
Long non-coding RNAs (lncRNAs) have a critical role in cancer initiation and progression, and thus may mediate oncogenic or tumor suppressing effects, as well as be a new class of cancer therapeutic targets. We performed high-throughput sequencing of RNA (RNA-seq) to investigate the expression level of lncRNAs and protein-coding genes in 30 esophageal samples, comprised of 15 esophageal squamous cell carcinoma (ESCC) samples and their 15 paired non-tumor tissues. We further developed an integrative bioinformatics method, denoted URW-LPE, to identify key functional lncRNAs that regulate expression of downstream protein-coding genes in ESCC. A number of known onco-lncRNA and many putative novel ones were effectively identified by URW-LPE. Importantly, we identified lncRNA625 as a novel regulator of ESCC cell proliferation, invasion and migration. ESCC patients with high lncRNA625 expression had significantly shorter survival time than those with low expression. LncRNA625 also showed specific prognostic value for patients with metastatic ESCC. Finally, we identified E1A-binding protein p300 (EP300) as a downstream executor of lncRNA625-induced transcriptional responses. These findings establish a catalog of novel cancer-associated functional lncRNAs, which will promote our understanding of lncRNA-mediated regulation in this malignancy.
The RB-related gene Rb2/p130 in neuroblastoma differentiation and in B-myb promoter down-regulation.
Raschellà, G; Tanno, B; Bonetto, F; Negroni, A; Claudio, P P; Baldi, A; Amendola, R; Calabretta, B; Giordano, A; Paggi, M G
1998-05-01
The retinoblastoma family of nuclear factors is composed of RB, the prototype of the tumour suppressor genes and of the strictly related genes p107 and Rb2/p130. The three genes code for proteins, namely pRb, p107 and pRb2/p130, that share similar structures and functions. These proteins are expressed, often simultaneously, in many cell types and are involved in the regulation of proliferation and differentiation. We determined the expression and the phosphorylation of the RB family gene products during the DMSO-induced differentiation of the N1E-115 murine neuroblastoma cells. In this system, pRb2/p130 was strongly up-regulated during mid-late differentiation stages, while, on the contrary, pRb and p107 resulted markedly decreased at late stages. Differentiating N1E-115 cells also showed a progressive decrease in B-myb levels, a proliferation-related protein whose constitutive expression inhibits neuronal differentiation. Transfection of each of the RB family genes in these cells was able, at different degrees, to induce neuronal differentiation, to inhibit [3H]thymidine incorporation and to down-regulate the activity of the B-myb promoter.
The Yersinia pestis gcvB gene encodes two small regulatory RNA molecules
McArthur, Sarah D; Pulvermacher, Sarah C; Stauffer, George V
2006-01-01
Background In recent years it has become clear that small non-coding RNAs function as regulatory elements in bacterial virulence and bacterial stress responses. We tested for the presence of the small non-coding GcvB RNAs in Y. pestis as possible regulators of gene expression in this organism. Results In this study, we report that the Yersinia pestis KIM6 gcvB gene encodes two small RNAs. Transcription of gcvB is activated by the GcvA protein and repressed by the GcvR protein. The gcvB-encoded RNAs are required for repression of the Y. pestis dppA gene, encoding the periplasmic-binding protein component of the dipeptide transport system, showing that the GcvB RNAs have regulatory activity. A deletion of the gcvB gene from the Y. pestis KIM6 chromosome results in a decrease in the generation time of the organism as well as a change in colony morphology. Conclusion The results of this study indicate that the Y. pestis gcvB gene encodes two small non-coding regulatory RNAs that repress dppA expression. A gcvB deletion is pleiotropic, suggesting that the sRNAs are likely involved in controlling genes in addition to dppA. PMID:16768793
Cheng, Yating; Jutooru, Indira; Chadalapaka, Gayathri; Corton, J Christopher; Safe, Stephen
2015-05-10
HOTTIP is a long non-coding RNA (lncRNA) transcribed from the 5' tip of the HOXA locus and is associated with the polycomb repressor complex 2 (PRC2) and WD repeat containing protein 5 (WDR5)/mixed lineage leukemia 1 (MLL1) chromatin modifying complexes. HOTTIP is expressed in pancreatic cancer cell lines and knockdown of HOTTIP by RNA interference (siHOTTIP) in Panc1 pancreatic cancer cells decreased proliferation, induced apoptosis and decreased migration. In Panc1 cells transfected with siHOTTIP, there was a decrease in expression of 757 genes and increased expression of 514 genes, and a limited gene analysis indicated that HOTTIP regulation of genes is complex. For example, Aurora kinase A, an important regulator of cell growth, is coregulated by MLL and not WDR5 and, in contrast to previous studies in liver cancer cells, HOTTIP does not regulate HOXA13 but plays a role in regulation of several other HOX genes including HOXA10, HOXB2, HOXA11, HOXA9 and HOXA1. Although HOTTIP and the HOX-associated lncRNA HOTAIR have similar pro-oncogenic functions, they regulate strikingly different sets of genes in Panc1 cells and in pancreatic tumors.
McLysaght, Aoife; Guerzoni, Daniele
2015-09-26
The origin of novel protein-coding genes de novo was once considered so improbable as to be impossible. In less than a decade, and especially in the last five years, this view has been overturned by extensive evidence from diverse eukaryotic lineages. There is now evidence that this mechanism has contributed a significant number of genes to genomes of organisms as diverse as Saccharomyces, Drosophila, Plasmodium, Arabidopisis and human. From simple beginnings, these genes have in some instances acquired complex structure, regulated expression and important functional roles. New genes are often thought of as dispensable late additions; however, some recent de novo genes in human can play a role in disease. Rather than an extremely rare occurrence, it is now evident that there is a relatively constant trickle of proto-genes released into the testing ground of natural selection. It is currently unknown whether de novo genes arise primarily through an 'RNA-first' or 'ORF-first' pathway. Either way, evolutionary tinkering with this pool of genetic potential may have been a significant player in the origins of lineage-specific traits and adaptations. © 2015 The Authors.
Zorc, Minja; Kunej, Tanja
2016-05-01
MicroRNAs (miRNAs) are a class of non-coding RNAs involved in posttranscriptional regulation of target genes. Regulation requires complementarity between target mRNA and the mature miRNA seed region, responsible for their recognition and binding. It has been estimated that each miRNA targets approximately 200 genes, and genetic variability of miRNA genes has been reported to affect phenotypic variability and disease susceptibility in humans, livestock species, and model organisms. Polymorphisms in miRNA genes could therefore represent biomarkers for phenotypic traits in livestock animals. In our previous study, we collected polymorphisms within miRNA genes in chicken. In the present study, we identified miRNA-related genomic overlaps to prioritize genomic regions of interest for further functional studies and biomarker discovery. Overlapping genomic regions in chicken were analyzed using the following bioinformatics tools and databases: miRNA SNiPer, Ensembl, miRBase, NCBI Blast, and QTLdb. Out of 740 known pre-miRNA genes, 263 (35.5 %) contain polymorphisms; among them, 35 contain more than three polymorphisms The most polymorphic miRNA genes in chicken are gga-miR-6662, containing 23 single nucleotide polymorphisms (SNPs) within the pre-miRNA region, including five consecutive SNPs, and gga-miR-6688, containing ten polymorphisms including three consecutive polymorphisms. Several miRNA-related genomic hotspots have been revealed in chicken genome; polymorphic miRNA genes are located within protein-coding and/or non-coding transcription units and quantitative trait loci (QTL) associated with production traits. The present study includes the first description of an exonic miRNA in a chicken genome, an overlap between the miRNA gene and the exon of the protein-coding gene (gga-miR-6578/HADHB), and the first report of a missense polymorphism located within a mature miRNA seed region. Identified miRNA-related genomic hotspots in chicken can serve researchers as a starting point for further functional studies and association studies with poultry production and health traits and the basis for systematic screening of exonic miRNAs and missense/miRNA seed polymorphisms in other genomes.
Djordjevic, Michael A; Chen, Han Cai; Natera, Siria; Van Noorden, Giel; Menzel, Christian; Taylor, Scott; Renard, Clotilde; Geiger, Otto; Weiller, Georg F
2003-06-01
A proteomic examination of Sinorhizobium meliloti strain 1021 was undertaken using a combination of 2-D gel electrophoresis, peptide mass fingerprinting, and bioinformatics. Our goal was to identify (i) putative symbiosis- or nutrient-stress-specific proteins, (ii) the biochemical pathways active under different conditions, (iii) potential new genes, and (iv) the extent of posttranslational modifications of S. meliloti proteins. In total, we identified the protein products of 810 genes (13.1% of the genome's coding capacity). The 810 genes generated 1,180 gene products, with chromosomal genes accounting for 78% of the gene products identified (18.8% of the chromosome's coding capacity). The activity of 53 metabolic pathways was inferred from bioinformatic analysis of proteins with assigned Enzyme Commission numbers. Of the remaining proteins that did not encode enzymes, ABC-type transporters composed 12.7% and regulatory proteins 3.4% of the total. Proteins with up to seven transmembrane domains were identified in membrane preparations. A total of 27 putative nodule-specific proteins and 35 nutrient-stress-specific proteins were identified and used as a basis to define genes and describe processes occurring in S. meliloti cells in nodules and under stress. Several nodule proteins from the plant host were present in the nodule bacteria preparations. We also identified seven potentially novel proteins not predicted from the DNA sequence. Post-translational modifications such as N-terminal processing could be inferred from the data. The posttranslational addition of UMP to the key regulator of nitrogen metabolism, PII, was demonstrated. This work demonstrates the utility of combining mass spectrometry with protein arraying or separation techniques to identify candidate genes involved in important biological processes and niche occupations that may be intransigent to other methods of gene expression profiling.
Regulation of neural macroRNAs by the transcriptional repressor REST
Johnson, Rory; Teh, Christina Hui-Leng; Jia, Hui; Vanisri, Ravi Raj; Pandey, Tridansh; Lu, Zhong-Hao; Buckley, Noel J.; Stanton, Lawrence W.; Lipovich, Leonard
2009-01-01
The essential transcriptional repressor REST (repressor element 1-silencing transcription factor) plays central roles in development and human disease by regulating a large cohort of neural genes. These have conventionally fallen into the class of known, protein-coding genes; recently, however, several noncoding microRNA genes were identified as REST targets. Given the widespread transcription of messenger RNA-like, noncoding RNAs (“macroRNAs”), some of which are functional and implicated in disease in mammalian genomes, we sought to determine whether this class of noncoding RNAs can also be regulated by REST. By applying a new, unbiased target gene annotation pipeline to computationally discovered REST binding sites, we find that 23% of mammalian REST genomic binding sites are within 10 kb of a macroRNA gene. These putative target genes were overlooked by previous studies. Focusing on a set of 18 candidate macroRNA targets from mouse, we experimentally demonstrate that two are regulated by REST in neural stem cells. Flanking protein-coding genes are, at most, weakly repressed, suggesting specific targeting of the macroRNAs by REST. Similar to the majority of known REST target genes, both of these macroRNAs are induced during nervous system development and have neurally restricted expression profiles in adult mouse. We observe a similar phenomenon in human: the DiGeorge syndrome-associated noncoding RNA, DGCR5, is repressed by REST through a proximal upstream binding site. Therefore neural macroRNAs represent an additional component of the REST regulatory network. These macroRNAs are new candidates for understanding the role of REST in neuronal development, neurodegeneration, and cancer. PMID:19050060
Regulation of neural macroRNAs by the transcriptional repressor REST.
Johnson, Rory; Teh, Christina Hui-Leng; Jia, Hui; Vanisri, Ravi Raj; Pandey, Tridansh; Lu, Zhong-Hao; Buckley, Noel J; Stanton, Lawrence W; Lipovich, Leonard
2009-01-01
The essential transcriptional repressor REST (repressor element 1-silencing transcription factor) plays central roles in development and human disease by regulating a large cohort of neural genes. These have conventionally fallen into the class of known, protein-coding genes; recently, however, several noncoding microRNA genes were identified as REST targets. Given the widespread transcription of messenger RNA-like, noncoding RNAs ("macroRNAs"), some of which are functional and implicated in disease in mammalian genomes, we sought to determine whether this class of noncoding RNAs can also be regulated by REST. By applying a new, unbiased target gene annotation pipeline to computationally discovered REST binding sites, we find that 23% of mammalian REST genomic binding sites are within 10 kb of a macroRNA gene. These putative target genes were overlooked by previous studies. Focusing on a set of 18 candidate macroRNA targets from mouse, we experimentally demonstrate that two are regulated by REST in neural stem cells. Flanking protein-coding genes are, at most, weakly repressed, suggesting specific targeting of the macroRNAs by REST. Similar to the majority of known REST target genes, both of these macroRNAs are induced during nervous system development and have neurally restricted expression profiles in adult mouse. We observe a similar phenomenon in human: the DiGeorge syndrome-associated noncoding RNA, DGCR5, is repressed by REST through a proximal upstream binding site. Therefore neural macroRNAs represent an additional component of the REST regulatory network. These macroRNAs are new candidates for understanding the role of REST in neuronal development, neurodegeneration, and cancer.
GeneBuilder: interactive in silico prediction of gene structure.
Milanesi, L; D'Angelo, D; Rogozin, I B
1999-01-01
Prediction of gene structure in newly sequenced DNA becomes very important in large genome sequencing projects. This problem is complicated due to the exon-intron structure of eukaryotic genes and because gene expression is regulated by many different short nucleotide domains. In order to be able to analyse the full gene structure in different organisms, it is necessary to combine information about potential functional signals (promoter region, splice sites, start and stop codons, 3' untranslated region) together with the statistical properties of coding sequences (coding potential), information about homologous proteins, ESTs and repeated elements. We have developed the GeneBuilder system which is based on prediction of functional signals and coding regions by different approaches in combination with similarity searches in proteins and EST databases. The potential gene structure models are obtained by using a dynamic programming method. The program permits the use of several parameters for gene structure prediction and refinement. During gene model construction, selecting different exon homology levels with a protein sequence selected from a list of homologous proteins can improve the accuracy of the gene structure prediction. In the case of low homology, GeneBuilder is still able to predict the gene structure. The GeneBuilder system has been tested by using the standard set (Burset and Guigo, Genomics, 34, 353-367, 1996) and the performances are: 0.89 sensitivity and 0.91 specificity at the nucleotide level. The total correlation coefficient is 0.88. The GeneBuilder system is implemented as a part of the WebGene a the URL: http://www.itba.mi. cnr.it/webgene and TRADAT (TRAncription Database and Analysis Tools) launcher URL: http://www.itba.mi.cnr.it/tradat.
The Evolution and Expression Pattern of Human Overlapping lncRNA and Protein-coding Gene Pairs.
Ning, Qianqian; Li, Yixue; Wang, Zhen; Zhou, Songwen; Sun, Hong; Yu, Guangjun
2017-03-27
Long non-coding RNA overlapping with protein-coding gene (lncRNA-coding pair) is a special type of overlapping genes. Protein-coding overlapping genes have been well studied and increasing attention has been paid to lncRNAs. By studying lncRNA-coding pairs in human genome, we showed that lncRNA-coding pairs were more likely to be generated by overprinting and retaining genes in lncRNA-coding pairs were given higher priority than non-overlapping genes. Besides, the preference of overlapping configurations preserved during evolution was based on the origin of lncRNA-coding pairs. Further investigations showed that lncRNAs promoting the splicing of their embedded protein-coding partners was a unilateral interaction, but the existence of overlapping partners improving the gene expression was bidirectional and the effect was decreased with the increased evolutionary age of genes. Additionally, the expression of lncRNA-coding pairs showed an overall positive correlation and the expression correlation was associated with their overlapping configurations, local genomic environment and evolutionary age of genes. Comparison of the expression correlation of lncRNA-coding pairs between normal and cancer samples found that the lineage-specific pairs including old protein-coding genes may play an important role in tumorigenesis. This work presents a systematically comprehensive understanding of the evolution and the expression pattern of human lncRNA-coding pairs.
Tang, Guo-Qing; Maxwell, E. Stuart
2008-01-01
The amphibian Xenopus provides a model organism for investigating microRNA expression during vertebrate embryogenesis and development. Searching available Xenopus genome databases using known human pre-miRNAs as query sequences, more than 300 genes encoding 142 Xenopus tropicalis miRNAs were identified. Analysis of Xenopus tropicalis miRNA genes revealed a predominate positioning within introns of protein-coding and nonprotein-coding RNA Pol II-transcribed genes. MiRNA genes were also located in pre-mRNA exons and positioned intergenically between known protein-coding genes. Many miRNA species were found in multiple locations and in more than one genomic context. MiRNA genes were also clustered throughout the genome, indicating the potential for the cotranscription and coordinate expression of miRNAs located in a given cluster. Northern blot analysis confirmed the expression of many identified miRNAs in both X. tropicalis and X. laevis. Comparison of X. tropicalis and X. laevis blots revealed comparable expression profiles, although several miRNAs exhibited species-specific expression in different tissues. More detailed analysis revealed that for some miRNAs, the tissue-specific expression profile of the pri-miRNA precursor was distinctly different from that of the mature miRNA profile. Differential miRNA precursor processing in both the nucleus and cytoplasm was implicated in the observed tissue-specific differences. These observations indicated that post-transcriptional processing plays an important role in regulating miRNA expression in the amphibian Xenopus. PMID:18032731
Bernick, David L.; Dennis, Patrick P.; Lui, Lauren M.; Lowe, Todd M.
2012-01-01
A great diversity of small, non-coding RNA (ncRNA) molecules with roles in gene regulation and RNA processing have been intensely studied in eukaryotic and bacterial model organisms, yet our knowledge of possible parallel roles for small RNAs (sRNA) in archaea is limited. We employed RNA-seq to identify novel sRNA across multiple species of the hyperthermophilic genus Pyrobaculum, known for unusual RNA gene characteristics. By comparing transcriptional data collected in parallel among four species, we were able to identify conserved RNA genes fitting into known and novel families. Among our findings, we highlight three novel cis-antisense sRNAs encoded opposite to key regulatory (ferric uptake regulator), metabolic (triose-phosphate isomerase), and core transcriptional apparatus genes (transcription factor B). We also found a large increase in the number of conserved C/D box sRNA genes over what had been previously recognized; many of these genes are encoded antisense to protein coding genes. The conserved opposition to orthologous genes across the Pyrobaculum genus suggests similarities to other cis-antisense regulatory systems. Furthermore, the genus-specific nature of these sRNAs indicates they are relatively recent, stable adaptations. PMID:22783241
Chan, Wen-Ling; Yang, Wen-Kuang; Huang, Hsien-Da; Chang, Jan-Gowth
2013-01-01
RNA interference (RNAi) is a gene silencing process within living cells, which is controlled by the RNA-induced silencing complex with a sequence-specific manner. In flies and mice, the pseudogene transcripts can be processed into short interfering RNAs (siRNAs) that regulate protein-coding genes through the RNAi pathway. Following these findings, we construct an innovative and comprehensive database to elucidate siRNA-mediated mechanism in human transcribed pseudogenes (TPGs). To investigate TPG producing siRNAs that regulate protein-coding genes, we mapped the TPGs to small RNAs (sRNAs) that were supported by publicly deep sequencing data from various sRNA libraries and constructed the TPG-derived siRNA-target interactions. In addition, we also presented that TPGs can act as a target for miRNAs that actually regulate the parental gene. To enable the systematic compilation and updating of these results and additional information, we have developed a database, pseudoMap, capturing various types of information, including sequence data, TPG and cognate annotation, deep sequencing data, RNA-folding structure, gene expression profiles, miRNA annotation and target prediction. As our knowledge, pseudoMap is the first database to demonstrate two mechanisms of human TPGs: encoding siRNAs and decoying miRNAs that target the parental gene. pseudoMap is freely accessible at http://pseudomap.mbc.nctu.edu.tw/. Database URL: http://pseudomap.mbc.nctu.edu.tw/
Quantifying the Effect of DNA Packaging on Gene Expression Level
NASA Astrophysics Data System (ADS)
Kim, Harold
2010-10-01
Gene expression, the process by which the genetic code comes alive in the form of proteins, is one of the most important biological processes in living cells, and begins when transcription factors bind to specific DNA sequences in the promoter region upstream of a gene. The relationship between gene expression output and transcription factor input which is termed the gene regulation function is specific to each promoter, and predicting this gene regulation function from the locations of transcription factor binding sites is one of the challenges in biology. In eukaryotic organisms (for example, animals, plants, fungi etc), DNA is highly compacted into nucleosomes, 147-bp segments of DNA tightly wrapped around histone protein core, and therefore, the accessibility of transcription factor binding sites depends on their locations with respect to nucleosomes - sites inside nucleosomes are less accessible than those outside nucleosomes. To understand how transcription factor binding sites contribute to gene expression in a quantitative manner, we obtain gene regulation functions of promoters with various configurations of transcription factor binding sites by using fluorescent protein reporters to measure transcription factor input and gene expression output in single yeast cells. In this talk, I will show that the affinity of a transcription factor binding site inside and outside the nucleosome controls different aspects of the gene regulation function, and explain this finding based on a mass-action kinetic model that includes competition between nucleosomes and transcription factors.
Untangling the Web: The Diverse Functions of the PIWI/piRNA Pathway
Mani, Sneha Ramesh; Juliano, Celina E.
2014-01-01
SUMMARY Small RNAs impact several cellular processes through gene regulation. Argonaute proteins bind small RNAs to form effector complexes that control transcriptional and post-transcriptional gene expression. PIWI proteins belong to the Argonaute protein family, and bind PIWI-interacting RNAs (piRNAs). They are highly abundant in the germline, but are also expressed in some somatic tissues. The PIWI/piRNA pathway has a role in transposon repression in Drosophila, which occurs both by epigenetic regulation and post-transcriptional degradation of transposon mRNAs. These functions are conserved, but clear differences in the extent and mechanism of transposon repression exist between species. Mutations in piwi genes lead to the upregulation of transposon mRNAs. It is hypothesized that this increased transposon mobilization leads to genomic instability and thus sterility, although no causal link has been established between transposon upregulation and genome instability. An alternative scenario could be that piwi mutations directly affect genomic instability, and thus lead to increased transposon expression. We propose that the PIWI/piRNA pathway controls genome stability in several ways: suppression of transposons, direct regulation of chromatin architecture and regulation of genes that control important biological processes related to genome stability. The PIWI/piRNA pathway also regulates at least some, if not many, protein-coding genes, which further lends support to the idea that piwi genes may have broader functions beyond transposon repression. An intriguing possibility is that the PIWI/piRNA pathway is using transposon sequences to coordinate the expression of large groups of genes to regulate cellular function. PMID:23712694
Genome-Wide Discovery of Long Non-Coding RNAs in Rainbow Trout.
Al-Tobasei, Rafet; Paneru, Bam; Salem, Mohamed
2016-01-01
The ENCODE project revealed that ~70% of the human genome is transcribed. While only 1-2% of the RNAs encode for proteins, the rest are non-coding RNAs. Long non-coding RNAs (lncRNAs) form a diverse class of non-coding RNAs that are longer than 200 nt. Emerging evidence indicates that lncRNAs play critical roles in various cellular processes including regulation of gene expression. LncRNAs show low levels of gene expression and sequence conservation, which make their computational identification in genomes difficult. In this study, more than two billion Illumina sequence reads were mapped to the genome reference using the TopHat and Cufflinks software. Transcripts shorter than 200 nt, with more than 83-100 amino acids ORF, or with significant homologies to the NCBI nr-protein database were removed. In addition, a computational pipeline was used to filter the remaining transcripts based on a protein-coding-score test. Depending on the filtering stringency conditions, between 31,195 and 54,503 lncRNAs were identified, with only 421 matching known lncRNAs in other species. A digital gene expression atlas revealed 2,935 tissue-specific and 3,269 ubiquitously-expressed lncRNAs. This study annotates the lncRNA rainbow trout genome and provides a valuable resource for functional genomics research in salmonids.
Saha, Anusree; Das, Shubhajit; Moin, Mazahar; Dutta, Mouboni; Bakshi, Achala; Madhav, M. S.; Kirti, P. B.
2017-01-01
Ribosomal proteins (RPs) are indispensable in ribosome biogenesis and protein synthesis, and play a crucial role in diverse developmental processes. Our previous studies on Ribosomal Protein Large subunit (RPL) genes provided insights into their stress responsive roles in rice. In the present study, we have explored the developmental and stress regulated expression patterns of Ribosomal Protein Small (RPS) subunit genes for their differential expression in a spatiotemporal and stress dependent manner. We have also performed an in silico analysis of gene structure, cis-elements in upstream regulatory regions, protein properties and phylogeny. Expression studies of the 34 RPS genes in 13 different tissues of rice covering major growth and developmental stages revealed that their expression was substantially elevated, mostly in shoots and leaves indicating their possible involvement in the development of vegetative organs. The majority of the RPS genes have manifested significant expression under all abiotic stress treatments with ABA, PEG, NaCl, and H2O2. Infection with important rice pathogens, Xanthomonas oryzae pv. oryzae (Xoo) and Rhizoctonia solani also induced the up-regulation of several of the RPS genes. RPS4, 13a, 18a, and 4a have shown higher transcript levels under all the abiotic stresses, whereas, RPS4 is up-regulated in both the biotic stress treatments. The information obtained from the present investigation would be useful in appreciating the possible stress-regulatory attributes of the genes coding for rice ribosomal small subunit proteins apart from their functions as house-keeping proteins. A detailed functional analysis of independent genes is required to study their roles in stress tolerance and generating stress- tolerant crops. PMID:28966624
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, J.H.; Walsh, C.J.
1988-06-01
The nuclear run-on technique was used to measure the rate of transcription of flagellar genes during the differentiation of Naegleria gruberi amebae into flagellates. Synthesis of mRNAs for the axonemal proteins ..cap alpha..- and BETA-tubulin and flagellar calmodulin, as well as a coordinately regulated poly(A)/sup +/ RNA that codes for an unidentified protein, showed transient increases averaging 22-fold. The rate of synthesis of two poly(A)/sup +/ RNAs common to ameobae and flagellates was low until the transcription of the flagellar genes began to decline, at which time synthesis of the RNAs found in ameobae increased 3- to 10-fold. The observedmore » changes in the rate of transcription can account quantitatively for the 20-fold increase in flagellar mRNA concentration during the differentiation. The data for the flagellar calmodulin gene demonstrate transcriptional regulation for a nontubulin axonemal protein. The data also demonstrate at least two programs of transcriptional regulation during the differentiation and raise the intriguing possibility that some significant fraction of the nearly 200 different proteins of the flagellar axoneme is transcriptionally regulated during the 1 h it takes N. gruberi amebae to form visible flagella.« less
Zhao, Zheng; Bai, Jing; Wu, Aiwei; Wang, Yuan; Zhang, Jinwen; Wang, Zishan; Li, Yongsheng; Xu, Juan; Li, Xia
2015-01-01
Long non-coding RNAs (lncRNAs) are emerging as key regulators of diverse biological processes and diseases. However, the combinatorial effects of these molecules in a specific biological function are poorly understood. Identifying co-expressed protein-coding genes of lncRNAs would provide ample insight into lncRNA functions. To facilitate such an effort, we have developed Co-LncRNA, which is a web-based computational tool that allows users to identify GO annotations and KEGG pathways that may be affected by co-expressed protein-coding genes of a single or multiple lncRNAs. LncRNA co-expressed protein-coding genes were first identified in publicly available human RNA-Seq datasets, including 241 datasets across 6560 total individuals representing 28 tissue types/cell lines. Then, the lncRNA combinatorial effects in a given GO annotations or KEGG pathways are taken into account by the simultaneous analysis of multiple lncRNAs in user-selected individual or multiple datasets, which is realized by enrichment analysis. In addition, this software provides a graphical overview of pathways that are modulated by lncRNAs, as well as a specific tool to display the relevant networks between lncRNAs and their co-expressed protein-coding genes. Co-LncRNA also supports users in uploading their own lncRNA and protein-coding gene expression profiles to investigate the lncRNA combinatorial effects. It will be continuously updated with more human RNA-Seq datasets on an annual basis. Taken together, Co-LncRNA provides a web-based application for investigating lncRNA combinatorial effects, which could shed light on their biological roles and could be a valuable resource for this community. Database URL: http://www.bio-bigdata.com/Co-LncRNA/. © The Author(s) 2015. Published by Oxford University Press.
Chen, Frank; Spano, Anthony; Goodman, Benjamin E.; Blasier, Kiev R.; Sabat, Agnes; Jeffery, Erin; Norris, Andrew; Shabanowitz, Jeffrey; Hunt, Donald F.; Lebedev, Nikolai
2010-01-01
The gene transfer agent of Rhodobacter capsulatus (GTA) is a unique phage-like particle that exchanges genetic information between members of this same species of bacterium. Besides being an excellent tool for genetic mapping, the GTA has a number of advantages for biotechnological and nanoengineering purposes. To facilitate the GTA purification and identify the proteins involved in GTA expression, assembly and regulation, in the present work we construct and transform into R. capsulatus Y262 a gene coding for a C-terminally His-tagged capsid protein. The constructed protein was expressed in the cells, assembled into chimeric GTA particles inside the cells and excreted from the cells into surrounding medium. Transmission electron micrographs of phosphotungstate-stained, NiNTA-purified chimeric GTA confirm that its structure is similar to normal GTA particles, with many particles composed both of a head and a tail. The mass spectrometric proteomic analysis of polypeptides present in the GTA recovered outside the cells shows that GTA is composed of at least 9 proteins represented in the GTA gene cluster including proteins coded for by Orf’s 3, 5, 6–9, 11, 13, and 15. PMID:19105630
Chen, Frank; Spano, Anthony; Goodman, Benjamin E; Blasier, Kiev R; Sabat, Agnes; Jeffery, Erin; Norris, Andrew; Shabanowitz, Jeffrey; Hunt, Donald F; Lebedev, Nikolai
2009-02-01
The gene transfer agent of Rhodobacter capsulatus (GTA) is a unique phage-like particle that exchanges genetic information between members of this same species of bacterium. Besides being an excellent tool for genetic mapping, the GTA has a number of advantages for biotechnological and nanoengineering purposes. To facilitate the GTA purification and identify the proteins involved in GTA expression, assembly and regulation, in the present work we construct and transform into R. capsulatus Y262 a gene coding for a C-terminally His-tagged capsid protein. The constructed protein was expressed in the cells, assembled into chimeric GTA particles inside the cells and excreted from the cells into surrounding medium. Transmission electron micrographs of phosphotungstate-stained, NiNTA-purified chimeric GTA confirm that its structure is similar to normal GTA particles, with many particles composed both of a head and a tail. The mass spectrometric proteomic analysis of polypeptides present in the GTA recovered outside the cells shows that GTA is composed of at least 9 proteins represented in the GTA gene cluster including proteins coded for by Orf's 3, 5, 6-9, 11, 13, and 15.
Yang, W; Du, W W; Li, X; Yee, A J; Yang, B B
2016-07-28
It has recently been shown that the upregulation of a pseudogene specific to a protein-coding gene could function as a sponge to bind multiple potential targeting microRNAs (miRNAs), resulting in increased gene expression. Similarly, it was recently demonstrated that circular RNAs can function as sponges for miRNAs, and could upregulate expression of mRNAs containing an identical sequence. Furthermore, some mRNAs are now known to not only translate protein, but also function to sponge miRNA binding, facilitating gene expression. Collectively, these appear to be effective mechanisms to ensure gene expression and protein activity. Here we show that expression of a member of the forkhead family of transcription factors, Foxo3, is regulated by the Foxo3 pseudogene (Foxo3P), and Foxo3 circular RNA, both of which bind to eight miRNAs. We found that the ectopic expression of the Foxo3P, Foxo3 circular RNA and Foxo3 mRNA could all suppress tumor growth and cancer cell proliferation and survival. Our results showed that at least three mechanisms are used to ensure protein translation of Foxo3, which reflects an essential role of Foxo3 and its corresponding non-coding RNAs.
Liu, Kuimei; Dong, Yanmei; Wang, Fangzhong; Jiang, Baojie; Wang, Mingyu; Fang, Xu
2016-01-01
Homologs of the velvet protein family are encoded by the ve1, vel2, and vel3 genes in Trichoderma reesei. To test their regulatory functions, the velvet protein-coding genes were disrupted, generating Δve1, Δvel2, and Δvel3 strains. The phenotypic features of these strains were examined to identify their functions in morphogenesis, sporulation, and cellulase expression. The three velvet-deficient strains produced more hyphal branches, indicating that velvet family proteins participate in the morphogenesis in T. reesei. Deletion of ve1 and vel3 did not affect biomass accumulation, while deletion of vel2 led to a significantly hampered growth when cellulose was used as the sole carbon source in the medium. The deletion of either ve1 or vel2 led to the sharp decrease of sporulation as well as a global downregulation of cellulase-coding genes. In contrast, although the expression of cellulase-coding genes of the ∆vel3 strain was downregulated in the dark, their expression in light condition was unaffected. Sporulation was hampered in the ∆vel3 strain. These results suggest that Ve1 and Vel2 play major roles, whereas Vel3 plays a minor role in sporulation, morphogenesis, and cellulase expression.
A human haploid gene trap collection to study lncRNAs with unusual RNA biology.
Kornienko, Aleksandra E; Vlatkovic, Irena; Neesen, Jürgen; Barlow, Denise P; Pauler, Florian M
2016-01-01
Many thousand long non-coding (lnc) RNAs are mapped in the human genome. Time consuming studies using reverse genetic approaches by post-transcriptional knock-down or genetic modification of the locus demonstrated diverse biological functions for a few of these transcripts. The Human Gene Trap Mutant Collection in haploid KBM7 cells is a ready-to-use tool for studying protein-coding gene function. As lncRNAs show remarkable differences in RNA biology compared to protein-coding genes, it is unclear if this gene trap collection is useful for functional analysis of lncRNAs. Here we use the uncharacterized LOC100288798 lncRNA as a model to answer this question. Using public RNA-seq data we show that LOC100288798 is ubiquitously expressed, but inefficiently spliced. The minor spliced LOC100288798 isoforms are exported to the cytoplasm, whereas the major unspliced isoform is nuclear localized. This shows that LOC100288798 RNA biology differs markedly from typical mRNAs. De novo assembly from RNA-seq data suggests that LOC100288798 extends 289kb beyond its annotated 3' end and overlaps the downstream SLC38A4 gene. Three cell lines with independent gene trap insertions in LOC100288798 were available from the KBM7 gene trap collection. RT-qPCR and RNA-seq confirmed successful lncRNA truncation and its extended length. Expression analysis from RNA-seq data shows significant deregulation of 41 protein-coding genes upon LOC100288798 truncation. Our data shows that gene trap collections in human haploid cell lines are useful tools to study lncRNAs, and identifies the previously uncharacterized LOC100288798 as a potential gene regulator.
Rattay, Stephanie; Trilling, Mirko; Megger, Dominik A; Sitek, Barbara; Meyer, Helmut E; Hengel, Hartmut; Le-Trilling, Vu Thuy Khanh
2015-08-01
Transcription of mouse cytomegalovirus (MCMV) immediate early ie1 and ie3 is controlled by the major immediate early promoter/enhancer (MIEP) and requires differential splicing. Based on complete loss of genome replication of an MCMV mutant carrying a deletion of the ie3-specific exon 5, the multifunctional IE3 protein (611 amino acids; pIE611) is considered essential for viral replication. Our analysis of ie3 transcription resulted in the identification of novel ie3 isoforms derived from alternatively spliced ie3 transcripts. Construction of an IE3-hemagglutinin (IE3-HA) virus by insertion of an in-frame HA epitope sequence allowed detection of the IE3 isoforms in infected cells, verifying that the newly identified transcripts code for proteins. This prompted the construction of an MCMV mutant lacking ie611 but retaining the coding capacity for the newly identified isoforms ie453 and ie310. Using Δie611 MCMV, we demonstrated the dispensability of the canonical ie3 gene product pIE611 for viral replication. To determine the role of pIE611 for viral gene expression during MCMV infection in an unbiased global approach, we used label-free quantitative mass spectrometry to delineate pIE611-dependent changes of the MCMV proteome. Interestingly, further analysis revealed transcriptional as well as posttranscriptional regulation of MCMV gene products by pIE611. Cytomegaloviruses are pathogenic betaherpesviruses persisting in a lifelong latency from which reactivation can occur under conditions of immunosuppression, immunoimmaturity, or inflammation. The switch from latency to reactivation requires expression of immediate early genes. Therefore, understanding of immediate early gene regulation might add insights into viral pathogenesis. The mouse cytomegalovirus (MCMV) immediate early 3 protein (611 amino acids; pIE611) is considered essential for viral replication. The identification of novel protein isoforms derived from alternatively spliced ie3 transcripts prompted the construction of an MCMV mutant lacking ie611 but retaining the coding capacity for the newly identified isoforms ie453 and ie310. Using Δie611 MCMV, we demonstrated the dispensability of the canonical ie3 gene product pIE611 for viral replication and delineated pIE611-dependent changes of the MCMV proteome. Our findings have fundamental implications for the interpretation of earlier studies on pIE3 functions and highlight the complex orchestration of MCMV gene regulation. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
MicroRNA regulation of F-box proteins and its role in cancer.
Wu, Zhao-Hui; Pfeffer, Lawrence M
2016-02-01
MicroRNAs (miRNAs) are small endogenous non-coding RNAs, which play critical roles in cancer development by suppressing gene expression at the post-transcriptional level. In general, oncogenic miRNAs are upregulated in cancer, while miRNAs that act as tumor suppressors are downregulated, leading to decreased expression of tumor suppressors and upregulated oncogene expression, respectively. F-box proteins function as the substrate-recognition components of the SKP1-CUL1-F-box (SCF)-ubiquitin ligase complex for the degradation of their protein targets by the ubiquitin-proteasome system. Therefore F-box proteins and miRNAs both negatively regulate target gene expression post-transcriptionally. Since each miRNA is capable of fine-tuning the expression of multiple target genes, multiple F-box proteins may be suppressed by the same miRNA. Meanwhile, one F-box proteins could be regulated by several miRNAs in different cancer types. In this review, we will focus on miRNA-mediated downregulation of various F-box proteins, the resulting stabilization of F-box protein substrates and the impact of these processes on human malignancies. We provide insight into how the miRNA: F-box protein axis may regulate cancer progression and metastasis. We also consider the broader role of F-box proteins in the regulation of pathways that are independent of the ubiquitin ligase complex and how that impacts on oncogenesis. The area of miRNAs and the F-box proteins that they regulate in cancer is an emerging field and will inform new strategies in cancer treatment. Copyright © 2015 Elsevier Ltd. All rights reserved.
RNA-Binding Proteins in Trichomonas vaginalis: Atypical Multifunctional Proteins.
Figueroa-Angulo, Elisa E; Calla-Choque, Jaeson S; Mancilla-Olea, Maria Inocente; Arroyo, Rossana
2015-11-26
Iron homeostasis is highly regulated in vertebrates through a regulatory system mediated by RNA-protein interactions between the iron regulatory proteins (IRPs) that interact with an iron responsive element (IRE) located in certain mRNAs, dubbed the IRE-IRP regulatory system. Trichomonas vaginalis, the causal agent of trichomoniasis, presents high iron dependency to regulate its growth, metabolism, and virulence properties. Although T. vaginalis lacks IRPs or proteins with aconitase activity, possesses gene expression mechanisms of iron regulation at the transcriptional and posttranscriptional levels. However, only one gene with iron regulation at the transcriptional level has been described. Recently, our research group described an iron posttranscriptional regulatory mechanism in the T. vaginalis tvcp4 and tvcp12 cysteine proteinase mRNAs. The tvcp4 and tvcp12 mRNAs have a stem-loop structure in the 5'-coding region or in the 3'-UTR, respectively that interacts with T. vaginalis multifunctional proteins HSP70, α-Actinin, and Actin under iron starvation condition, causing translation inhibition or mRNA stabilization similar to the previously characterized IRE-IRP system in eukaryotes. Herein, we summarize recent progress and shed some light on atypical RNA-binding proteins that may participate in the iron posttranscriptional regulation in T. vaginalis.
A subset of conserved mammalian long non-coding RNAs are fossils of ancestral protein-coding genes.
Hezroni, Hadas; Ben-Tov Perry, Rotem; Meir, Zohar; Housman, Gali; Lubelsky, Yoav; Ulitsky, Igor
2017-08-30
Only a small portion of human long non-coding RNAs (lncRNAs) appear to be conserved outside of mammals, but the events underlying the birth of new lncRNAs in mammals remain largely unknown. One potential source is remnants of protein-coding genes that transitioned into lncRNAs. We systematically compare lncRNA and protein-coding loci across vertebrates, and estimate that up to 5% of conserved mammalian lncRNAs are derived from lost protein-coding genes. These lncRNAs have specific characteristics, such as broader expression domains, that set them apart from other lncRNAs. Fourteen lncRNAs have sequence similarity with the loci of the contemporary homologs of the lost protein-coding genes. We propose that selection acting on enhancer sequences is mostly responsible for retention of these regions. As an example of an RNA element from a protein-coding ancestor that was retained in the lncRNA, we describe in detail a short translated ORF in the JPX lncRNA that was derived from an upstream ORF in a protein-coding gene and retains some of its functionality. We estimate that ~ 55 annotated conserved human lncRNAs are derived from parts of ancestral protein-coding genes, and loss of coding potential is thus a non-negligible source of new lncRNAs. Some lncRNAs inherited regulatory elements influencing transcription and translation from their protein-coding ancestors and those elements can influence the expression breadth and functionality of these lncRNAs.
Global Regulatory Functions of the Staphylococcus aureus Endoribonuclease III in Gene Expression
Lioliou, Efthimia; Sharma, Cynthia M.; Caldelari, Isabelle; Helfer, Anne-Catherine; Fechter, Pierre; Vandenesch, François; Vogel, Jörg; Romby, Pascale
2012-01-01
RNA turnover plays an important role in both virulence and adaptation to stress in the Gram-positive human pathogen Staphylococcus aureus. However, the molecular players and mechanisms involved in these processes are poorly understood. Here, we explored the functions of S. aureus endoribonuclease III (RNase III), a member of the ubiquitous family of double-strand-specific endoribonucleases. To define genomic transcripts that are bound and processed by RNase III, we performed deep sequencing on cDNA libraries generated from RNAs that were co-immunoprecipitated with wild-type RNase III or two different cleavage-defective mutant variants in vivo. Several newly identified RNase III targets were validated by independent experimental methods. We identified various classes of structured RNAs as RNase III substrates and demonstrated that this enzyme is involved in the maturation of rRNAs and tRNAs, regulates the turnover of mRNAs and non-coding RNAs, and autoregulates its synthesis by cleaving within the coding region of its own mRNA. Moreover, we identified a positive effect of RNase III on protein synthesis based on novel mechanisms. RNase III–mediated cleavage in the 5′ untranslated region (5′UTR) enhanced the stability and translation of cspA mRNA, which encodes the major cold-shock protein. Furthermore, RNase III cleaved overlapping 5′UTRs of divergently transcribed genes to generate leaderless mRNAs, which constitutes a novel way to co-regulate neighboring genes. In agreement with recent findings, low abundance antisense RNAs covering 44% of the annotated genes were captured by co-immunoprecipitation with RNase III mutant proteins. Thus, in addition to gene regulation, RNase III is associated with RNA quality control of pervasive transcription. Overall, this study illustrates the complexity of post-transcriptional regulation mediated by RNase III. PMID:22761586
Computational Identification and Functional Predictions of Long Noncoding RNA in Zea mays
Boerner, Susan; McGinnis, Karen M.
2012-01-01
Background Computational analysis of cDNA sequences from multiple organisms suggests that a large portion of transcribed DNA does not code for a functional protein. In mammals, noncoding transcription is abundant, and often results in functional RNA molecules that do not appear to encode proteins. Many long noncoding RNAs (lncRNAs) appear to have epigenetic regulatory function in humans, including HOTAIR and XIST. While epigenetic gene regulation is clearly an essential mechanism in plants, relatively little is known about the presence or function of lncRNAs in plants. Methodology/Principal Findings To explore the connection between lncRNA and epigenetic regulation of gene expression in plants, a computational pipeline using the programming language Python has been developed and applied to maize full length cDNA sequences to identify, classify, and localize potential lncRNAs. The pipeline was used in parallel with an SVM tool for identifying ncRNAs to identify the maximal number of ncRNAs in the dataset. Although the available library of sequences was small and potentially biased toward protein coding transcripts, 15% of the sequences were predicted to be noncoding. Approximately 60% of these sequences appear to act as precursors for small RNA molecules and may function to regulate gene expression via a small RNA dependent mechanism. ncRNAs were predicted to originate from both genic and intergenic loci. Of the lncRNAs that originated from genic loci, ∼20% were antisense to the host gene loci. Conclusions/Significance Consistent with similar studies in other organisms, noncoding transcription appears to be widespread in the maize genome. Computational predictions indicate that maize lncRNAs may function to regulate expression of other genes through multiple RNA mediated mechanisms. PMID:22916204
Non-coding RNAs in lung cancer
Ricciuti, Biagio; Mecca, Carmen; Crinò, Lucio; Baglivo, Sara; Cenci, Matteo; Metro, Giulio
2014-01-01
The discovery that protein-coding genes represent less than 2% of all human genome, and the evidence that more than 90% of it is actively transcribed, changed the classical point of view of the central dogma of molecular biology, which was always based on the assumption that RNA functions mainly as an intermediate bridge between DNA sequences and protein synthesis machinery. Accumulating data indicates that non-coding RNAs are involved in different physiological processes, providing for the maintenance of cellular homeostasis. They are important regulators of gene expression, cellular differentiation, proliferation, migration, apoptosis, and stem cell maintenance. Alterations and disruptions of their expression or activity have increasingly been associated with pathological changes of cancer cells, this evidence and the prospect of using these molecules as diagnostic markers and therapeutic targets, make currently non-coding RNAs among the most relevant molecules in cancer research. In this paper we will provide an overview of non-coding RNA function and disruption in lung cancer biology, also focusing on their potential as diagnostic, prognostic and predictive biomarkers. PMID:25593996
Thuan, Nguyen Huy; Dhakal, Dipesh; Pokhrel, Anaya Raj; Chu, Luan Luong; Van Pham, Thi Thuy; Shrestha, Anil; Sohng, Jae Kyung
2018-05-01
Streptomyces peucetius ATCC 27952 produces two major anthracyclines, doxorubicin (DXR) and daunorubicin (DNR), which are potent chemotherapeutic agents for the treatment of several cancers. In order to gain detailed insight on genetics and biochemistry of the strain, the complete genome was determined and analyzed. The result showed that its complete sequence contains 7187 protein coding genes in a total of 8,023,114 bp, whereas 87% of the genome contributed to the protein coding region. The genomic sequence included 18 rRNA, 66 tRNAs, and 3 non-coding RNAs. In silico studies predicted ~ 68 biosynthetic gene clusters (BCGs) encoding diverse classes of secondary metabolites, including non-ribosomal polyketide synthase (NRPS), polyketide synthase (PKS I, II, and III), terpenes, and others. Detailed analysis of the genome sequence revealed versatile biocatalytic enzymes such as cytochrome P450 (CYP), electron transfer systems (ETS) genes, methyltransferase (MT), glycosyltransferase (GT). In addition, numerous functional genes (transporter gene, SOD, etc.) and regulatory genes (afsR-sp, metK-sp, etc.) involved in the regulation of secondary metabolites were found. This minireview summarizes the genome-based genome mining (GM) of diverse BCGs and genome exploration (GE) of versatile biocatalytic enzymes, and other enzymes involved in maintenance and regulation of metabolism of S. peucetius. The detailed analysis of genome sequence provides critically important knowledge useful in the bioengineering of the strain or harboring catalytically efficient enzymes for biotechnological applications.
Loreni, F; Ruberti, I; Bozzoni, I; Pierandrei-Amaldi, P; Amaldi, F
1985-01-01
Ribosomal protein L1 is encoded by two genes in Xenopus laevis. The comparison of two cDNA sequences shows that the two L1 gene copies (L1a and L1b) have diverged in many silent sites and very few substitution sites; moreover a small duplication occurred at the very end of the coding region of the L1b gene which thus codes for a product five amino acids longer than that coded by L1a. Quantitatively the divergence between the two L1 genes confirms that a whole genome duplication took place in Xenopus laevis approximately 30 million years ago. A genomic fragment containing one of the two L1 gene copies (L1a), with its nine introns and flanking regions, has been completely sequenced. The 5' end of this gene has been mapped within a 20-pyridimine stretch as already found for other vertebrate ribosomal protein genes. Four of the nine introns have a 60-nucleotide sequence with 80% homology; within this region some boxes, one of which is 16 nucleotides long, are 100% homologous among the four introns. This feature of L1a gene introns is interesting since we have previously shown that the activity of this gene is regulated at a post-transcriptional level and it involves the block of the normal splicing of some intron sequences. Images Fig. 3. Fig. 5. PMID:3841512
Long non-coding RNAs involved in autophagy regulation
Yang, Lixian; Wang, Hanying; Shen, Qi; Feng, Lifeng; Jin, Hongchuan
2017-01-01
Autophagy degrades non-functioning or damaged proteins and organelles to maintain cellular homeostasis in a physiological or pathological context. Autophagy can be protective or detrimental, depending on its activation status and other conditions. Therefore, autophagy has a crucial role in a myriad of pathophysiological processes. From the perspective of autophagy-related (ATG) genes, the molecular dissection of autophagy process and the regulation of its level have been largely unraveled. However, the discovery of long non-coding RNAs (lncRNAs) provides a new paradigm of gene regulation in almost all important biological processes, including autophagy. In this review, we highlight recent advances in autophagy-associated lncRNAs and their specific autophagic targets, as well as their relevance to human diseases such as cancer, cardiovascular disease, diabetes and cerebral ischemic stroke. PMID:28981093
Luque-Almagro, V M; Escribano, M P; Manso, I; Sáez, L P; Cabello, P; Moreno-Vivián, C; Roldán, M D
2015-11-20
Pseudomonas pseudoalcaligenes CECT5344 is an alkaliphilic bacterium that can use cyanide as nitrogen source for growth, becoming a suitable candidate to be applied in biological treatment of cyanide-containing wastewaters. The assessment of the whole genome sequence of the strain CECT5344 has allowed the generation of DNA microarrays to analyze the response to different nitrogen sources. The mRNA of P. pseudoalcaligenes CECT5344 cells grown under nitrogen limiting conditions showed considerable changes when compared against the transcripts from cells grown with ammonium; up-regulated genes were, among others, the glnK gene encoding the nitrogen regulatory protein PII, the two-component ntrBC system involved in global nitrogen regulation, and the ammonium transporter-encoding amtB gene. The protein coding transcripts of P. pseudoalcaligenes CECT5344 cells grown with sodium cyanide or an industrial jewelry wastewater that contains high concentration of cyanide and metals like iron, copper and zinc, were also compared against the transcripts of cells grown with ammonium as nitrogen source. This analysis revealed the induction by cyanide and the cyanide-rich wastewater of four nitrilase-encoding genes, including the nitC gene that is essential for cyanide assimilation, the cyanase cynS gene involved in cyanate assimilation, the cioAB genes required for the cyanide-insensitive respiration, and the ahpC gene coding for an alkyl-hydroperoxide reductase that could be related with iron homeostasis and oxidative stress. The nitC and cynS genes were also induced in cells grown under nitrogen starvation conditions. In cells grown with the jewelry wastewater, a malate quinone:oxidoreductase mqoB gene and several genes coding for metal extrusion systems were specifically induced. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Gene and genon concept: coding versus regulation
2007-01-01
We analyse here the definition of the gene in order to distinguish, on the basis of modern insight in molecular biology, what the gene is coding for, namely a specific polypeptide, and how its expression is realized and controlled. Before the coding role of the DNA was discovered, a gene was identified with a specific phenotypic trait, from Mendel through Morgan up to Benzer. Subsequently, however, molecular biologists ventured to define a gene at the level of the DNA sequence in terms of coding. As is becoming ever more evident, the relations between information stored at DNA level and functional products are very intricate, and the regulatory aspects are as important and essential as the information coding for products. This approach led, thus, to a conceptual hybrid that confused coding, regulation and functional aspects. In this essay, we develop a definition of the gene that once again starts from the functional aspect. A cellular function can be represented by a polypeptide or an RNA. In the case of the polypeptide, its biochemical identity is determined by the mRNA prior to translation, and that is where we locate the gene. The steps from specific, but possibly separated sequence fragments at DNA level to that final mRNA then can be analysed in terms of regulation. For that purpose, we coin the new term “genon”. In that manner, we can clearly separate product and regulative information while keeping the fundamental relation between coding and function without the need to introduce a conceptual hybrid. In mRNA, the program regulating the expression of a gene is superimposed onto and added to the coding sequence in cis - we call it the genon. The complementary external control of a given mRNA by trans-acting factors is incorporated in its transgenon. A consequence of this definition is that, in eukaryotes, the gene is, in most cases, not yet present at DNA level. Rather, it is assembled by RNA processing, including differential splicing, from various pieces, as steered by the genon. It emerges finally as an uninterrupted nucleic acid sequence at mRNA level just prior to translation, in faithful correspondence with the amino acid sequence to be produced as a polypeptide. After translation, the genon has fulfilled its role and expires. The distinction between the protein coding information as materialised in the final polypeptide and the processing information represented by the genon allows us to set up a new information theoretic scheme. The standard sequence information determined by the genetic code expresses the relation between coding sequence and product. Backward analysis asks from which coding region in the DNA a given polypeptide originates. The (more interesting) forward analysis asks in how many polypeptides of how many different types a given DNA segment is expressed. This concerns the control of the expression process for which we have introduced the genon concept. Thus, the information theoretic analysis can capture the complementary aspects of coding and regulation, of gene and genon. PMID:18087760
The Mediator complex and transcription regulation
Poss, Zachary C.; Ebmeier, Christopher C.
2013-01-01
The Mediator complex is a multi-subunit assembly that appears to be required for regulating expression of most RNA polymerase II (pol II) transcripts, which include protein-coding and most non-coding RNA genes. Mediator and pol II function within the pre-initiation complex (PIC), which consists of Mediator, pol II, TFIIA, TFIIB, TFIID, TFIIE, TFIIF and TFIIH and is approximately 4.0 MDa in size. Mediator serves as a central scaffold within the PIC and helps regulate pol II activity in ways that remain poorly understood. Mediator is also generally targeted by sequence-specific, DNA-binding transcription factors (TFs) that work to control gene expression programs in response to developmental or environmental cues. At a basic level, Mediator functions by relaying signals from TFs directly to the pol II enzyme, thereby facilitating TF-dependent regulation of gene expression. Thus, Mediator is essential for converting biological inputs (communicated by TFs) to physiological responses (via changes in gene expression). In this review, we summarize an expansive body of research on the Mediator complex, with an emphasis on yeast and mammalian complexes. We focus on the basics that underlie Mediator function, such as its structure and subunit composition, and describe its broad regulatory influence on gene expression, ranging from chromatin architecture to transcription initiation and elongation, to mRNA processing. We also describe factors that influence Mediator structure and activity, including TFs, non-coding RNAs and the CDK8 module. PMID:24088064
Sounds of silence: synonymous nucleotides as a key to biological regulation and complexity
Shabalina, Svetlana A.; Spiridonov, Nikolay A.; Kashina, Anna
2013-01-01
Messenger RNA is a key component of an intricate regulatory network of its own. It accommodates numerous nucleotide signals that overlap protein coding sequences and are responsible for multiple levels of regulation and generation of biological complexity. A wealth of structural and regulatory information, which mRNA carries in addition to the encoded amino acid sequence, raises the question of how these signals and overlapping codes are delineated along non-synonymous and synonymous positions in protein coding regions, especially in eukaryotes. Silent or synonymous codon positions, which do not determine amino acid sequences of the encoded proteins, define mRNA secondary structure and stability and affect the rate of translation, folding and post-translational modifications of nascent polypeptides. The RNA level selection is acting on synonymous sites in both prokaryotes and eukaryotes and is more common than previously thought. Selection pressure on the coding gene regions follows three-nucleotide periodic pattern of nucleotide base-pairing in mRNA, which is imposed by the genetic code. Synonymous positions of the coding regions have a higher level of hybridization potential relative to non-synonymous positions, and are multifunctional in their regulatory and structural roles. Recent experimental evidence and analysis of mRNA structure and interspecies conservation suggest that there is an evolutionary tradeoff between selective pressure acting at the RNA and protein levels. Here we provide a comprehensive overview of the studies that define the role of silent positions in regulating RNA structure and processing that exert downstream effects on proteins and their functions. PMID:23293005
DOE Office of Scientific and Technical Information (OSTI.GOV)
Depto, A.S.; Stenberg, R.M.
1989-03-01
To better understand the regulation of late gene expression in human cytomegalovirus (CMV)-infected cells, the authors examined expression of the gene that codes for the 65-kilodalton lower-matrix phosphoprotein (pp65). Analysis of RNA isolated at 72 h from cells infected with CMV Towne or ts66, a DNA-negative temperature-sensitive mutant, supported the fact that pp65 is expressed at low levels prior to viral DNA replication but maximally expressed after the initiation of viral DNA replication. To investigate promoter activation in a transient expression assay, the pp65 promoter was cloned into the indicator plasmid containing the gene for chloramphenicol acetyltransferase (CAT). Transfection ofmore » the promoter-CAT construct and subsequent superinfection with CMV resulted in activation of the promoter at early times after infection. Cotransfection with plasmids capable of expressing immediate-early (IE) proteins demonstrated that the promoter was activated by IE proteins and that both IE regions 1 and 2 were necessary. These studies suggest that interactions between IE proteins and this octamer sequence may be important for the regulation and expression of this CMV gene.« less
The long non-coding RNA HOTTIP enhances pancreatic cancer cell proliferation, survival and migration
Cheng, Yating; Jutooru, Indira; Chadalapaka, Gayathri; Corton, J. Christopher; Safe, Stephen
2015-01-01
HOTTIP is a long non-coding RNA (lncRNA) transcribed from the 5′ tip of the HOXA locus and is associated with the polycomb repressor complex 2 (PRC2) and WD repeat containing protein 5 (WDR5)/mixed lineage leukemia 1 (MLL1) chromatin modifying complexes. HOTTIP is expressed in pancreatic cancer cell lines and knockdown of HOTTIP by RNA interference (siHOTTIP) in Panc1 pancreatic cancer cells decreased proliferation, induced apoptosis and decreased migration. In Panc1 cells transfected with siHOTTIP, there was a decrease in expression of 757 genes and increased expression of 514 genes, and a limited gene analysis indicated that HOTTIP regulation of genes is complex. For example, Aurora kinase A, an important regulator of cell growth, is coregulated by MLL and not WDR5 and, in contrast to previous studies in liver cancer cells, HOTTIP does not regulate HOXA13 but plays a role in regulation of several other HOX genes including HOXA10, HOXB2, HOXA11, HOXA9 and HOXA1. Although HOTTIP and the HOX-associated lncRNA HOTAIR have similar pro-oncogenic functions, they regulate strikingly different sets of genes in Panc1 cells and in pancreatic tumors. PMID:25912306
Kirsten, Holger; Al-Hasani, Hoor; Holdt, Lesca; Gross, Arnd; Beutner, Frank; Krohn, Knut; Horn, Katrin; Ahnert, Peter; Burkhardt, Ralph; Reiche, Kristin; Hackermüller, Jörg; Löffler, Markus; Teupser, Daniel; Thiery, Joachim; Scholz, Markus
2015-01-01
Genetics of gene expression (eQTLs or expression QTLs) has proved an indispensable tool for understanding biological pathways and pathomechanisms of trait-associated SNPs. However, power of most genome-wide eQTL studies is still limited. We performed a large eQTL study in peripheral blood mononuclear cells of 2112 individuals increasing the power to detect trans-effects genome-wide. Going beyond univariate SNP-transcript associations, we analyse relations of eQTLs to biological pathways, polygenetic effects of expression regulation, trans-clusters and enrichment of co-localized functional elements. We found eQTLs for about 85% of analysed genes, and 18% of genes were trans-regulated. Local eSNPs were enriched up to a distance of 5 Mb to the transcript challenging typically implemented ranges of cis-regulations. Pathway enrichment within regulated genes of GWAS-related eSNPs supported functional relevance of identified eQTLs. We demonstrate that nearest genes of GWAS-SNPs might frequently be misleading functional candidates. We identified novel trans-clusters of potential functional relevance for GWAS-SNPs of several phenotypes including obesity-related traits, HDL-cholesterol levels and haematological phenotypes. We used chromatin immunoprecipitation data for demonstrating biological effects. Yet, we show for strongly heritable transcripts that still little trans-chromosomal heritability is explained by all identified trans-eSNPs; however, our data suggest that most cis-heritability of these transcripts seems explained. Dissection of co-localized functional elements indicated a prominent role of SNPs in loci of pseudogenes and non-coding RNAs for the regulation of coding genes. In summary, our study substantially increases the catalogue of human eQTLs and improves our understanding of the complex genetic regulation of gene expression, pathways and disease-related processes. PMID:26019233
Morris, H; Schlesinger, M J; Bracha, M; Yagil, E
1974-08-01
Induction of alkaline phosphatase in wild-type Escherichia coli K-12 leads to the appearance of three new proteins in addition to alkaline phosphatase in the periplasmic space of the bacteria. These proteins are detected in autoradiograms of sodium dodecyl sulfate-acrylamide gel electropherograms of extracts from cells labeled with [(35)S]methionine. Studies with constitutive mutants defective in the three genes phoS, phoT, and phoR that have been shown to regulate alkaline phosphatase synthesis indicate that the three periplasmic proteins are coregulated with alkaline phosphatase. A mutant that has a deletion in the alkaline phosphatase structural gene phoA produces the three proteins, but a newly discovered mutant phoB that has a defect in the expression of alkaline phosphatase fails to produce the three proteins. phoB mutants are shown here to be unable to make detectable amounts of alkaline phosphatase polypeptides, as measured by immunoprecipitins or acrylamide gel electropherograms. On the basis of these results we suggest a new model for the regulation of alkaline phosphatase biosynthesis. In this model, a ternary complex composed of phoB(+) and phoR(+) gene products and an internal metabolite functions as a positive control element to regulate the transcription of several cistrons coding for periplasmic proteins.
Yoon, Sung Ho; Turkarslan, Serdar; Reiss, David J.; Pan, Min; Burn, June A.; Costa, Kyle C.; Lie, Thomas J.; Slagel, Joseph; Moritz, Robert L.; Hackett, Murray; Leigh, John A.; Baliga, Nitin S.
2013-01-01
Methanogens catalyze the critical methane-producing step (called methanogenesis) in the anaerobic decomposition of organic matter. Here, we present the first predictive model of global gene regulation of methanogenesis in a hydrogenotrophic methanogen, Methanococcus maripaludis. We generated a comprehensive list of genes (protein-coding and noncoding) for M. maripaludis through integrated analysis of the transcriptome structure and a newly constructed Peptide Atlas. The environment and gene-regulatory influence network (EGRIN) model of the strain was constructed from a compendium of transcriptome data that was collected over 58 different steady-state and time-course experiments that were performed in chemostats or batch cultures under a spectrum of environmental perturbations that modulated methanogenesis. Analyses of the EGRIN model have revealed novel components of methanogenesis that included at least three additional protein-coding genes of previously unknown function as well as one noncoding RNA. We discovered that at least five regulatory mechanisms act in a combinatorial scheme to intercoordinate key steps of methanogenesis with different processes such as motility, ATP biosynthesis, and carbon assimilation. Through a combination of genetic and environmental perturbation experiments we have validated the EGRIN-predicted role of two novel transcription factors in the regulation of phosphate-dependent repression of formate dehydrogenase—a key enzyme in the methanogenesis pathway. The EGRIN model demonstrates regulatory affiliations within methanogenesis as well as between methanogenesis and other cellular functions. PMID:24089473
USDA-ARS?s Scientific Manuscript database
Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a mor...
Comparative architecture of silks, fibrous proteins and their encoding genes in insects and spiders.
Craig, Catherine L; Riekel, Christian
2002-12-01
The known silk fibroins and fibrous glues are thought to be encoded by members of the same gene family. All silk fibroins sequenced to date contain regions of long-range order (crystalline regions) and/or short-range order (non-crystalline regions). All of the sequenced fibroin silks (Flag or silk from flagelliform gland in spiders; Fhc or heavy chain fibroin silks produced by Lepidoptera larvae) are made up of hierarchically organized, repetitive arrays of amino acids. Fhc fibroin genes are characterized by a similar molecular genetic architecture of two exons and one intron, but the organization and size of these units differs. The Flag, Ser (sericin gene) and BR (Balbiani ring genes; both fibrous proteins) genes are made up of multiple exons and introns. Sequences coding for crystalline and non-crystalline protein domains are integrated in the repetitive regions of Fhc and MA exons, but not in the protein glues Ser1 and BR-1. Genetic 'hot-spots' promote recombination errors in Fhc, MA, and Flag. Codon bias, structural constraint, point mutations, and shortened coding arrays may be alternative means of stabilizing precursor mRNA transcripts. Differential regulation of gene expression and selective splicing of the mRNA transcript may allow rapid adaptation of silk functional properties to different physical environments.
Post-transcriptional trafficking and regulation of neuronal gene expression.
Goldie, Belinda J; Cairns, Murray J
2012-02-01
Intracellular messenger RNA (mRNA) traffic and translation must be highly regulated, both temporally and spatially, within eukaryotic cells to support the complex functional partitioning. This capacity is essential in neurons because it provides a mechanism for rapid input-restricted activity-dependent protein synthesis in individual dendritic spines. While this feature is thought to be important for synaptic plasticity, the structures and mechanisms that support this capability are largely unknown. Certainly specialized RNA binding proteins and binding elements in the 3' untranslated region (UTR) of translationally regulated mRNA are important, but the subtlety and complexity of this system suggests that an intermediate "specificity" component is also involved. Small non-coding microRNA (miRNA) are essential for CNS development and may fulfill this role by acting as the guide strand for mediating complex patterns of post-transcriptional regulation. In this review we examine post-synaptic gene regulation, mRNA trafficking and the emerging role of post-transcriptional gene silencing in synaptic plasticity.
A Heme-responsive Regulator Controls Synthesis of Staphyloferrin B in Staphylococcus aureus*♦
Laakso, Holly A.; Marolda, Cristina L.; Pinter, Tyler B.; Stillman, Martin J.; Heinrichs, David E.
2016-01-01
Staphylococcus aureus possesses a multitude of mechanisms by which it can obtain iron during growth under iron starvation conditions. It expresses an effective heme acquisition system (the iron-regulated surface determinant system), it produces two carboxylate-type siderophores staphyloferrin A and staphyloferrin B (SB), and it expresses transporters for many other siderophores that it does not synthesize. The ferric uptake regulator protein regulates expression of genes encoding all of these systems. Mechanisms of fine-tuning expression of iron-regulated genes, beyond simple iron regulation via ferric uptake regulator, have not been uncovered in this organism. Here, we identify the ninth gene of the sbn operon, sbnI, as encoding a ParB/Spo0J-like protein that is required for expression of genes in the sbn operon from sbnD onward. Expression of sbnD–I is drastically decreased in an sbnI mutant, and the mutant does not synthesize detectable SB during early phases of growth. Thus, SB-mediated iron acquisition is impaired in an sbnI mutant strain. We show that the protein forms dimers and tetramers in solution and binds to DNA within the sbnC coding region. Moreover, we show that SbnI binds heme and that heme-bound SbnI does not bind DNA. Finally, we show that providing exogenous heme to S. aureus growing in an iron-free medium results in delayed synthesis of SB. This is the first study in S. aureus that identifies a DNA-binding regulatory protein that senses heme to control gene expression for siderophore synthesis. PMID:26534960
Xiong, Yan; Yue, Feng; Jia, Zhihao; Gao, Yun; Jin, Wen; Hu, Keping; Zhang, Yong; Zhu, Dahai; Yang, Gongshe; Kuang, Shihuan
2018-04-01
The thermogenic activities of brown and beige adipocytes can be exploited to reduce energy surplus and counteract obesity. Recent RNA sequencing studies have uncovered a number of long noncoding RNAs (lncRNAs) uniquely expressed in white and brown adipose tissues (WAT and BAT), but whether and how these lncRNAs function in adipogenesis remain largely unknown. Here, we report the identification of a novel brown adipocyte-enriched LncRNA (AK079912), and its nuclear localization, function and regulation. The expression of AK079912 increases during brown preadipocyte differentiation and in response to cold-stimulated browning of white adipocytes. Knockdown of AK079912 inhibits brown preadipocyte differentiation, manifested by reductions in lipid accumulation and down-regulation of adipogenic and BAT-specific genes. Conversely, ectopic expression of AK079912 in white preadipocytes up-regulates the expression of genes involved in thermogenesis. Mechanistically, inhibition of AK079912 reduces mitochondrial copy number and protein levels of mitochondria electron transport chain (ETC) complexes, whereas AK079912 overexpression increases the levels of ETC proteins. Lastly, reporter and pharmacological assays identify Pparγ as an upstream regulator of AK079912. These results provide new insights into the function of non-coding RNAs in brown adipogenesis and regulating browning of white adipocytes. Copyright © 2018 Elsevier B.V. All rights reserved.
Kazakoff, Stephen H.; Imelfort, Michael; Edwards, David; Koehorst, Jasper; Biswas, Bandana; Batley, Jacqueline; Scott, Paul T.; Gresshoff, Peter M.
2012-01-01
Pongamia pinnata (syn. Millettia pinnata) is a novel, fast-growing arboreal legume that bears prolific quantities of oil-rich seeds suitable for the production of biodiesel and aviation biofuel. Here, we have used Illumina® ‘Second Generation DNA Sequencing (2GS)’ and a new short-read de novo assembler, SaSSY, to assemble and annotate the Pongamia chloroplast (152,968 bp; cpDNA) and mitochondrial (425,718 bp; mtDNA) genomes. We also show that SaSSY can be used to accurately assemble 2GS data, by re-assembling the Lotus japonicus cpDNA and in the process assemble its mtDNA (380,861 bp). The Pongamia cpDNA contains 77 unique protein-coding genes and is almost 60% gene-dense. It contains a 50 kb inversion common to other legumes, as well as a novel 6.5 kb inversion that is responsible for the non-disruptive, re-orientation of five protein-coding genes. Additionally, two copies of an inverted repeat firmly place the species outside the subclade of the Fabaceae lacking the inverted repeat. The Pongamia and L. japonicus mtDNA contain just 33 and 31 unique protein-coding genes, respectively, and like other angiosperm mtDNA, have expanded intergenic and multiple repeat regions. Through comparative analysis with Vigna radiata we measured the average synonymous and non-synonymous divergence of all three legume mitochondrial (1.59% and 2.40%, respectively) and chloroplast (8.37% and 8.99%, respectively) protein-coding genes. Finally, we explored the relatedness of Pongamia within the Fabaceae and showed the utility of the organellar genome sequences by mapping transcriptomic data to identify up- and down-regulated stress-responsive gene candidates and confirm in silico predicted RNA editing sites. PMID:23272141
Kazakoff, Stephen H; Imelfort, Michael; Edwards, David; Koehorst, Jasper; Biswas, Bandana; Batley, Jacqueline; Scott, Paul T; Gresshoff, Peter M
2012-01-01
Pongamia pinnata (syn. Millettia pinnata) is a novel, fast-growing arboreal legume that bears prolific quantities of oil-rich seeds suitable for the production of biodiesel and aviation biofuel. Here, we have used Illumina® 'Second Generation DNA Sequencing (2GS)' and a new short-read de novo assembler, SaSSY, to assemble and annotate the Pongamia chloroplast (152,968 bp; cpDNA) and mitochondrial (425,718 bp; mtDNA) genomes. We also show that SaSSY can be used to accurately assemble 2GS data, by re-assembling the Lotus japonicus cpDNA and in the process assemble its mtDNA (380,861 bp). The Pongamia cpDNA contains 77 unique protein-coding genes and is almost 60% gene-dense. It contains a 50 kb inversion common to other legumes, as well as a novel 6.5 kb inversion that is responsible for the non-disruptive, re-orientation of five protein-coding genes. Additionally, two copies of an inverted repeat firmly place the species outside the subclade of the Fabaceae lacking the inverted repeat. The Pongamia and L. japonicus mtDNA contain just 33 and 31 unique protein-coding genes, respectively, and like other angiosperm mtDNA, have expanded intergenic and multiple repeat regions. Through comparative analysis with Vigna radiata we measured the average synonymous and non-synonymous divergence of all three legume mitochondrial (1.59% and 2.40%, respectively) and chloroplast (8.37% and 8.99%, respectively) protein-coding genes. Finally, we explored the relatedness of Pongamia within the Fabaceae and showed the utility of the organellar genome sequences by mapping transcriptomic data to identify up- and down-regulated stress-responsive gene candidates and confirm in silico predicted RNA editing sites.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dolan, Kyle T.; Duguid, Erica M.; He, Chuan
2011-11-17
SlyA is a master virulence regulator that controls the transcription of numerous genes in Salmonella enterica. We present here crystal structures of SlyA by itself and bound to a high-affinity DNA operator sequence in the slyA gene. SlyA interacts with DNA through direct recognition of a guanine base by Arg-65, as well as interactions between conserved Arg-86 and the minor groove and a large network of non-base-specific contacts with the sugar phosphate backbone. Our structures, together with an unpublished structure of SlyA bound to the small molecule effector salicylate (Protein Data Bank code 3DEU), reveal that, unlike many other MarRmore » family proteins, SlyA dissociates from DNA without large conformational changes when bound to this effector. We propose that SlyA and other MarR global regulators rely more on indirect readout of DNA sequence to exert control over many genes, in contrast to proteins (such as OhrR) that recognize a single operator.« less
Fedoreyeva, L I; Dilovarova, T A; Ashapkin, V V; Martirosyan, Yu Ts; Khavinson, V Kh; Kharchenko, P N; Vanyushin, B F
2017-04-01
Exogenous short biologically active peptides epitalon (Ala-Glu-Asp-Gly), bronchogen (Ala-Glu-Asp-Leu), and vilon (Lys-Glu) at concentrations 10 -7 -10 -9 M significantly influence growth, development, and differentiation of tobacco (Nicotiana tabacum) callus cultures. Epitalon and bronchogen, in particular, both increase growth of calluses and stimulate formation and growth of leaves in plant regenerants. Because the regulatory activity of the short peptides appears at low peptide concentrations, their action to some extent is like that of the activity of phytohormones, and it seems to have signaling character and epigenetic nature. The investigated peptides modulate in tobacco cells the expression of genes including genes responsible for tissue formation and cell differentiation. These peptides differently modulate expression of CLE family genes coding for known endogenous regulatory peptides, the KNOX1 genes (transcription factor genes) and GRF (growth regulatory factor) genes coding for respective DNA-binding proteins such as topoisomerases, nucleases, and others. Thus, at the level of transcription, plants have a system of short peptide regulation of formation of long-known peptide regulators of growth and development. The peptides studied here may be related to a new generation of plant growth regulators. They can be used in the experimental botany, plant molecular biology, biotechnology, and practical agronomy.
Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts.
Sanford, Jeremy R; Wang, Xin; Mort, Matthew; Vanduyn, Natalia; Cooper, David N; Mooney, Sean D; Edenberg, Howard J; Liu, Yunlong
2009-03-01
Metazoan genes are encrypted with at least two superimposed codes: the genetic code to specify the primary structure of proteins and the splicing code to expand their proteomic output via alternative splicing. Here, we define the specificity of a central regulator of pre-mRNA splicing, the conserved, essential splicing factor SFRS1. Cross-linking immunoprecipitation and high-throughput sequencing (CLIP-seq) identified 23,632 binding sites for SFRS1 in the transcriptome of cultured human embryonic kidney cells. SFRS1 was found to engage many different classes of functionally distinct transcripts including mRNA, miRNA, snoRNAs, ncRNAs, and conserved intergenic transcripts of unknown function. The majority of these diverse transcripts share a purine-rich consensus motif corresponding to the canonical SFRS1 binding site. The consensus site was not only enriched in exons cross-linked to SFRS1 in vivo, but was also enriched in close proximity to splice sites. mRNAs encoding RNA processing factors were significantly overrepresented, suggesting that SFRS1 may broadly influence the post-transcriptional control of gene expression in vivo. Finally, a search for the SFRS1 consensus motif within the Human Gene Mutation Database identified 181 mutations in 82 different genes that disrupt predicted SFRS1 binding sites. This comprehensive analysis substantially expands the known roles of human SR proteins in the regulation of a diverse array of RNA transcripts.
Corbi, N; Libri, V; Fanciulli, M; Tinsley, J M; Davies, K E; Passananti, C
2000-06-01
Up-regulation of utrophin gene expression is recognized as a plausible therapeutic approach in the treatment of Duchenne muscular dystrophy (DMD). We have designed and engineered new zinc finger-based transcription factors capable of binding and activating transcription from the promoter of the dystrophin-related gene, utrophin. Using the recognition 'code' that proposes specific rules between zinc finger primary structure and potential DNA binding sites, we engineered a new gene named 'Jazz' that encodes for a three-zinc finger peptide. Jazz belongs to the Cys2-His2 zinc finger type and was engineered to target the nine base pair DNA sequence: 5'-GCT-GCT-GCG-3', present in the promoter region of both the human and mouse utrophin gene. The entire zinc finger alpha-helix region, containing the amino acid positions that are crucial for DNA binding, was specifically chosen on the basis of the contacts more frequently represented in the available list of the 'code'. Here we demonstrate that Jazz protein binds specifically to the double-stranded DNA target, with a dissociation constant of about 32 nM. Band shift and super-shift experiments confirmed the high affinity and specificity of Jazz protein for its DNA target. Moreover, we show that chimeric proteins, named Gal4-Jazz and Sp1-Jazz, are able to drive the transcription of a test gene from the human utrophin promoter.
AP1 Keeps Chromatin Poised for Action | Center for Cancer Research
The human genome harbors gene-encoding DNA, the blueprint for building proteins that regulate cellular function. Embedded across the genome, in non-coding regions, are DNA elements to which regulatory factors bind. The interaction of regulatory factors with DNA at these sites modifies gene expression to modulate cell activity. In cells, DNA exists in a complex with proteins called chromatin that compacts the DNA in the nucleus, strongly restricting access to DNA sequences. As a result, regulatory factors only interact with a small subset of their potential binding elements in a given cell to regulate genes. How factors recognize and select sites in chromatin across the genome is not well understood -- but several discoveries in CCR’s Laboratory of Receptor Biology and Gene Expression (LRBGE) have shed light on the mechanisms that direct factors to DNA.
Rather, Irshad Ahmad; Awasthi, Praveen; Mahajan, Vidushi; Bedi, Yashbir S; Vishwakarma, Ram A; Gandhi, Sumit G
2015-03-01
Pathogenesis-related (PR) proteins are involved in biotic and abiotic stress responses of plants and are grouped into 17 families (PR-1 to PR-17). PR-5 family includes proteins related to thaumatin and osmotin, with several members possessing antimicrobial properties. In this study, a PR-5 gene showing a high degree of homology with osmotin-like protein was isolated from sweet basil (Ocimum basilicum L.). A complete open reading frame consisting of 675 nucleotides, coding for a precursor protein, was obtained by PCR amplification. Based on sequence comparisons with tobacco osmotin and other osmotin-like proteins (OLPs), this protein was named ObOLP. The predicted mature protein is 225 amino acids in length and contains 16 cysteine residues that may potentially form eight disulfide bonds, a signature common to most PR-5 proteins. Among the various abiotic stress treatments tested, including high salt, mechanical wounding and exogenous phytohormone/elicitor treatments; methyl jasmonate (MeJA) and mechanical wounding significantly induced the expression of ObOLP gene. The coding sequence of ObOLP was cloned and expressed in a bacterial host resulting in a 25kDa recombinant-HIS tagged protein, displaying antifungal activity. The ObOLP protein sequence appears to contain an N-terminal signal peptide with signatures of secretory pathway. Further, our experimental data shows that ObOLP expression is regulated transcriptionally and in silico analysis suggests that it may be post-transcriptionally and post-translationally regulated through microRNAs and post-translational protein modifications, respectively. This study appears to be the first report of isolation and characterization of osmotin-like protein gene from O. basilicum. Copyright © 2014 Elsevier B.V. All rights reserved.
The Long Noncoding RNA Landscape of the Mouse Eye.
Chen, Weiwei; Yang, Shuai; Zhou, Zhonglou; Zhao, Xiaoting; Zhong, Jiayun; Reinach, Peter S; Yan, Dongsheng
2017-12-01
Long noncoding RNAs (lncRNAs) are important regulators of diverse biological functions. However, an extensive in-depth analysis of their expression profile and function in mammalian eyes is still lacking. Here we describe comprehensive landscapes of stage-dependent and tissue-specific lncRNA expression in the mouse eye. Affymetrix transcriptome array profiled lncRNA signatures from six different ocular tissue subsets (i.e., cornea, lens, retina, RPE, choroid, and sclera) in newborn and 8-week-old mice. Quantitative RT-PCR analysis validated array findings. Cis analyses and Gene Ontology (GO) annotation of protein-coding genes adjacent to signature lncRNA loci clarified potential lncRNA roles in maintaining tissue identity and regulating eye maturation during the aforementioned phase. In newborn and 8-week-old mice, we identified 47,332 protein-coding and noncoding gene transcripts. LncRNAs comprise 19,313 of these transcripts annotated in public data banks. During this maturation phase of these six different tissue subsets, more than 1000 lncRNAs expression levels underwent ≥2-fold changes. qRT-PCR analysis confirmed part of the gene microarray analysis results. K-means clustering identified 910 lncRNAs in the P0 groups and 686 lncRNAs in the postnatal 8-week-old groups, suggesting distinct tissue-specific lncRNA clusters. GO analysis of protein-coding genes proximal to lncRNA signatures resolved close correlations with their tissue-specific functional maturation between P0 and 8 weeks of age in the 6 tissue subsets. Characterizating maturational changes in lncRNA expression patterns as well as tissue-specific lncRNA signatures in six ocular tissues suggest important contributions made by lncRNA to the control of developmental processes in the mouse eye.
Bacteriophage 5' untranslated regions for control of plastid transgene expression.
Yang, Huijun; Gray, Benjamin N; Ahner, Beth A; Hanson, Maureen R
2013-02-01
Expression of foreign proteins from transgenes incorporated into plastid genomes requires regulatory sequences that can be recognized by the plastid transcription and translation machinery. Translation signals harbored by the 5' untranslated region (UTR) of plastid transcripts can profoundly affect the level of accumulation of proteins expressed from chimeric transgenes. Both endogenous 5' UTRs and the bacteriophage T7 gene 10 (T7g10) 5' UTR have been found to be effective in combination with particular coding regions to mediate high-level expression of foreign proteins. We investigated whether two other bacteriophage 5' UTRs could be utilized in plastid transgenes by fusing them to the aadA (aminoglycoside-3'-adenyltransferase) coding region that is commonly used as a selectable marker in plastid transformation. Transplastomic plants containing either the T7g1.3 or T4g23 5' UTRs fused to Myc-epitope-tagged aadA were successfully obtained, demonstrating the ability of these 5' UTRs to regulate gene expression in plastids. Placing the Thermobifida fusca cel6A gene under the control of the T7g1.3 or T4g23 5' UTRs, along with a tetC downstream box, resulted in poor expression of the cellulase in contrast with high-level accumulation while using the T7g10 5' UTR. However, transplastomic plants with the bacteriophage 5' UTRs controlling the aadA coding region exhibited fewer undesired recombinant species than plants containing the same marker gene regulated by the Nicotiana tabacum psbA 5' UTR. Furthermore, expression of the T7g1.3 and T4g23 5' UTR::aadA fusions downstream of the cel6A gene provided sufficient spectinomycin resistance to allow selection of homoplasmic transgenic plants and had no effect on Cel6A accumulation.
Genes Involved in Anaerobic Metabolism of Phenol in the Bacterium Thauera aromatica
Breinig, Sabine; Schiltz, Emile; Fuchs, Georg
2000-01-01
Genes involved in the anaerobic metabolism of phenol in the denitrifying bacterium Thauera aromatica have been studied. The first two committed steps in this metabolism appear to be phosphorylation of phenol to phenylphosphate by an unknown phosphoryl donor (“phenylphosphate synthase”) and subsequent carboxylation of phenylphosphate to 4-hydroxybenzoate under release of phosphate (“phenylphosphate carboxylase”). Both enzyme activities are strictly phenol induced. Two-dimensional gel electrophoresis allowed identification of several phenol-induced proteins. Based on N-terminal and internal amino acid sequences of such proteins, degenerate oligonucleotides were designed to identify the corresponding genes. A chromosomal DNA segment of about 14 kbp was sequenced which contained 10 genes transcribed in the same direction. These are organized in two adjacent gene clusters and include the genes coding for five identified phenol-induced proteins. Comparison with sequences in the databases revealed the following similarities: the gene products of two open reading frames (ORFs) are each similar to either the central part and N-terminal part of phosphoenolpyruvate synthases. We propose that these ORFs are components of the phenylphosphate synthase system. Three ORFs showed similarity to the ubiD gene product, 3-octaprenyl-4-hydroxybenzoate carboxy lyase; UbiD catalyzes the decarboxylation of a 4-hydroxybenzoate analogue in ubiquinone biosynthesis. Another ORF was similar to the ubiX gene product, an isoenzyme of UbiD. We propose that (some of) these four proteins are involved in the carboxylation of phenylphosphate. A 700-bp PCR product derived from one of these ORFs cross-hybridized with DNA from different Thauera and Azoarcus strains, even from those which have not been reported to grow with phenol. One ORF showed similarity to the mutT gene product, and three ORFs showed no strong similarities to sequences in the databases. Upstream of the first gene cluster, an ORF which is transcribed in the opposite direction codes for a protein highly similar to the DmpR regulatory protein of Pseudomonas putida. DmpR controls transcription of the genes of aerobic phenol metabolism, suggesting a similar regulation of anaerobic phenol metabolism by the putative regulator. PMID:11004186
Rubel, Elisa Terumi; Raittz, Roberto Tadeu; Coimbra, Nilson Antonio da Rocha; Gehlen, Michelly Alves Coutinho; Pedrosa, Fábio de Oliveira
2016-12-15
Azopirillum brasilense is a plant-growth promoting nitrogen-fixing bacteria that is used as bio-fertilizer in agriculture. Since nitrogen fixation has a high-energy demand, the reduction of N 2 to NH 4 + by nitrogenase occurs only under limiting conditions of NH 4 + and O 2 . Moreover, the synthesis and activity of nitrogenase is highly regulated to prevent energy waste. In A. brasilense nitrogenase activity is regulated by the products of draG and draT. The product of the draB gene, located downstream in the draTGB operon, may be involved in the regulation of nitrogenase activity by an, as yet, unknown mechanism. A deep in silico analysis of the product of draB was undertaken aiming at suggesting its possible function and involvement with DraT and DraG in the regulation of nitrogenase activity in A. brasilense. In this work, we present a new artificial intelligence strategy for protein classification, named ProClaT. The features used by the pattern recognition model were derived from the primary structure of the DraB homologous proteins, calculated by a ProClaT internal algorithm. ProClaT was applied to this case study and the results revealed that the A. brasilense draB gene codes for a protein highly similar to the nitrogenase associated NifO protein of Azotobacter vinelandii. This tool allowed the reclassification of DraB/NifO homologous proteins, hypothetical, conserved hypothetical and those annotated as putative arsenate reductase, ArsC, as NifO-like. An analysis of co-occurrence of draB, draT, draG and of other nif genes was performed, suggesting the involvement of draB (nifO) in nitrogen fixation, however, without the definition of a specific function.
RNA- and protein-mediated control of Listeria monocytogenes virulence gene expression
Lebreton, Alice; Cossart, Pascale
2017-01-01
ABSTRACT The model opportunistic pathogen Listeria monocytogenes has been the object of extensive research, aiming at understanding its ability to colonize diverse environmental niches and animal hosts. Bacterial transcriptomes in various conditions reflect this efficient adaptability. We review here our current knowledge of the mechanisms allowing L. monocytogenes to respond to environmental changes and trigger pathogenicity, with a special focus on RNA-mediated control of gene expression. We highlight how these studies have brought novel concepts in prokaryotic gene regulation, such as the ‘excludon’ where the 5′-UTR of a messenger also acts as an antisense regulator of an operon transcribed in opposite orientation, or the notion that riboswitches can regulate non-coding RNAs to integrate complex metabolic stimuli into regulatory networks. Overall, the Listeria model exemplifies that fine RNA tuners act together with master regulatory proteins to orchestrate appropriate transcriptional programmes. PMID:27217337
Tau mRNA 3'UTR-to-CDS ratio is increased in Alzheimer disease.
García-Escudero, Vega; Gargini, Ricardo; Martín-Maestro, Patricia; García, Esther; García-Escudero, Ramón; Avila, Jesús
2017-08-10
Neurons frequently show an imbalance in expression of the 3' untranslated region (3'UTR) relative to the coding DNA sequence (CDS) region of mature messenger RNAs (mRNA). The ratio varies among different cells or parts of the brain. The Map2 protein levels per cell depend on the 3'UTR-to-CDS ratio rather than the total mRNA amount, which suggests powerful regulation of protein expression by 3'UTR sequences. Here we found that MAPT (the microtubule-associated protein tau gene) 3'UTR levels are particularly high with respect to other genes; indeed, the 3'UTR-to-CDS ratio of MAPT is balanced in healthy brain in mouse and human. The tau protein accumulates in Alzheimer diseased brain. We nonetheless observed that the levels of RNA encoding MAPT/tau were diminished in these patients' brains. To explain this apparently contradictory result, we studied MAPT mRNA stoichiometry in coding and non-coding regions, and found that the 3'UTR-to-CDS ratio was higher in the hippocampus of Alzheimer disease patients, with higher tau protein but lower total mRNA levels. Our data indicate that changes in the 3'UTR-to-CDS ratio have a regulatory role in the disease. Future research should thus consider not only mRNA levels, but also the ratios between coding and non-coding regions. Copyright © 2017 Elsevier B.V. All rights reserved.
Flather, Dylan; Semler, Bert L.
2015-01-01
The compartmentalization of DNA replication and gene transcription in the nucleus and protein production in the cytoplasm is a defining feature of eukaryotic cells. The nucleus functions to maintain the integrity of the nuclear genome of the cell and to control gene expression based on intracellular and environmental signals received through the cytoplasm. The spatial separation of the major processes that lead to the expression of protein-coding genes establishes the necessity of a transport network to allow biomolecules to translocate between these two regions of the cell. The nucleocytoplasmic transport network is therefore essential for regulating normal cellular functioning. The Picornaviridae virus family is one of many viral families that disrupt the nucleocytoplasmic trafficking of cells to promote viral replication. Picornaviruses contain positive-sense, single-stranded RNA genomes and replicate in the cytoplasm of infected cells. As a result of the limited coding capacity of these viruses, cellular proteins are required by these intracellular parasites for both translation and genomic RNA replication. Being of messenger RNA polarity, a picornavirus genome can immediately be translated upon entering the cell cytoplasm. However, the replication of viral RNA requires the activity of RNA-binding proteins, many of which function in host gene expression, and are consequently localized to the nucleus. As a result, picornaviruses disrupt nucleocytoplasmic trafficking to exploit protein functions normally localized to a different cellular compartment from which they translate their genome to facilitate efficient replication. Furthermore, picornavirus proteins are also known to enter the nucleus of infected cells to limit host-cell transcription and down-regulate innate antiviral responses. The interactions of picornavirus proteins and host-cell nuclei are extensive, required for a productive infection, and are the focus of this review. PMID:26150805
Dai, Ziyu; Lasure, Linda L.; Magnuson, Jon K.
2008-11-11
The present invention encompasses isolated gene regulatory elements and gene transcription terminators that are differentially expressed in a native fungus exhibiting a first morphology relative to the native fungus exhibiting a second morphology. The invention also encompasses a method of utilizing a fungus for protein or chemical production. A transformed fungus is produced by transforming a fungus with a recombinant polynucleotide molecule. The recombinant polynucleotide molecule contains an isolated polynucleotide sequence linked operably to another molecule comprising a coding region of a gene of interest. The gene regulatory element and gene transcription terminator may temporally and spatially regulate expression of particular genes for optimum production of compounds of interest in a transgenic fungus.
Dai, Ziyu; Lasure, Linda L.; Magnuson, Jon K.
2008-11-11
The present invention encompasses isolated gene regulatory elements and gene transcription terminators that are differentially expressed in a native fungus exhibiting a first morphology relative to the native fungus exhibiting a second morphology. The invention also encompasses a method of utilizing a fungus for protein or chemical production. A transformed fungus is produced by transforming a fungus with a recombinant polynucleotide molecule. The recombinant polynucleotide molecule contains an isolated polynucleotide sequence linked operably to another molecule comprising a coding region of a gene of interest. The gene regulatory element and gene transcription terminator may temporally and spatially regulate expression of particular genes for optimum production of compounds of interest in a transgenic fungus.
Dai, Ziyu; Lasure, Linda L; Magnuson, Jon K
2014-05-27
The present invention encompasses isolated gene regulatory elements and gene transcription terminators that are differentially expressed in a native fungus exhibiting a first morphology relative to the native fungus exhibiting a second morphology. The invention also encompasses a method of utilizing a fungus for protein or chemical production. A transformed fungus is produced by transforming a fungus with a recombinant polynucleotide molecule. The recombinant polynucleotide molecule contains an isolated polynucleotide sequence linked operably to another molecule comprising a coding region of a gene of interest. The gene regulatory element and gene transcription terminator may temporally and spatially regulate expression of particular genes for optimum production of compounds of interest in a transgenic fungus.
Almelli, Talleh; Nuel, Grégory; Bischoff, Emmanuel; Aubouy, Agnès; Elati, Mohamed; Wang, Christian William; Dillies, Marie-Agnès; Coppée, Jean-Yves; Ayissi, Georges Nko; Basco, Leonardo Kishi; Rogier, Christophe; Ndam, Nicaise Tuikue; Deloron, Philippe; Tahar, Rachida
2014-01-01
The mechanisms underlying the heterogeneity of clinical malaria remain largely unknown. We hypothesized that differential gene expression contributes to phenotypic variation of parasites which results in a specific interaction with the host, leading to different clinical features of malaria. In this study, we analyzed the transcriptomes of isolates obtained from asymptomatic carriers and patients with uncomplicated or cerebral malaria. We also investigated the transcriptomes of 3D7 clone and 3D7-Lib that expresses severe malaria associated-variant surface antigen. Our findings revealed a specific up-regulation of genes involved in pathogenesis, adhesion to host cell, and erythrocyte aggregation in parasites from patients with cerebral malaria and 3D7-Lib, compared to parasites from asymptomatic carriers and 3D7, respectively. However, we did not find any significant difference between the transcriptomes of parasites from cerebral malaria and uncomplicated malaria, suggesting similar transcriptomic pattern in these two parasite populations. The difference between isolates from asymptomatic children and cerebral malaria concerned genes coding for exported proteins, Maurer's cleft proteins, transcriptional factor proteins, proteins implicated in protein transport, as well as Plasmodium conserved and hypothetical proteins. Interestingly, UPs A1, A2, A3 and UPs B1 of var genes were predominantly found in cerebral malaria-associated isolates and those containing architectural domains of DC4, DC5, DC13 and their neighboring rif genes in 3D7-lib. Therefore, more investigations are needed to analyze the effective role of these genes during malaria infection to provide with new knowledge on malaria pathology. In addition, concomitant regulation of genes within the chromosomal neighborhood suggests a common mechanism of gene regulation in P. falciparum. PMID:25479608
Figueroa-Angulo, Elisa E.; Calla-Choque, Jaeson S.; Mancilla-Olea, Maria Inocente; Arroyo, Rossana
2015-01-01
Iron homeostasis is highly regulated in vertebrates through a regulatory system mediated by RNA-protein interactions between the iron regulatory proteins (IRPs) that interact with an iron responsive element (IRE) located in certain mRNAs, dubbed the IRE-IRP regulatory system. Trichomonas vaginalis, the causal agent of trichomoniasis, presents high iron dependency to regulate its growth, metabolism, and virulence properties. Although T. vaginalis lacks IRPs or proteins with aconitase activity, possesses gene expression mechanisms of iron regulation at the transcriptional and posttranscriptional levels. However, only one gene with iron regulation at the transcriptional level has been described. Recently, our research group described an iron posttranscriptional regulatory mechanism in the T. vaginalis tvcp4 and tvcp12 cysteine proteinase mRNAs. The tvcp4 and tvcp12 mRNAs have a stem-loop structure in the 5'-coding region or in the 3'-UTR, respectively that interacts with T. vaginalis multifunctional proteins HSP70, α-Actinin, and Actin under iron starvation condition, causing translation inhibition or mRNA stabilization similar to the previously characterized IRE-IRP system in eukaryotes. Herein, we summarize recent progress and shed some light on atypical RNA-binding proteins that may participate in the iron posttranscriptional regulation in T. vaginalis. PMID:26703754
High cholesterol diet increases osteoporosis risk via inhibiting bone formation in rats
You, Li; Sheng, Zheng-yan; Tang, Chuan-ling; Chen, Lin; Pan, Ling; Chen, Jin-yu
2011-01-01
Aim: To investigate the effects of high cholesterol diet on the development of osteoporosis and the underlying mechanisms in rats. Methods: Female Sprague-Dawley rats were randomly separated into 3 groups: (1) the high cholesterol fed rats were fed a high cholesterol diet containing 77% normal diet food, 3% cholesterol and 20% lard for 3 months; (2) ovariectomised (OVX) rats were bilaterally ovariectomised and fed a standard diet; and (3) the control rats were fed the standard diet. Bone mineral density (BMD) of the rats was measured using dual-energy X-ray absorptiometry. Serum levels of oestradiol (E2), osteocalcin (BGP) and carboxy-terminal collagen crosslinks (CTX) were measured using ELISA. Gene expression profile was determined with microarray. Mouse osteoblast cells (MC3T3-E1) were used for in vitro study. Proliferation, differentiation and oxidative stress of the osteoblasts were investigated using MTT, qRT-PCR and biochemical methods. Results: In high cholesterol fed rats, the femur BMD and serum BGP level were significantly reduced, while the CTX level was significantly increased. DNA microarray analysis showed that 2290 genes were down-regulated and 992 genes were up-regulated in this group of rats. Of these genes, 1626 were also down-regulated and 1466 were up-regulated in OVX rats. In total, 370 genes were up-regulated in both groups, and 976 genes were down-regulated. Some of the down-regulated genes were found to code for proteins involved in the transforming growth factor beta (TGF-β)/bone morphogenic protein (BMP) and Wnt signaling pathways. The up-regulated genes were found to code for IL-6 and Ager with bone-resorption functions. Treatment of MC3T3-E1 cells with cholesterol (12.5-50 μg/mL) inhibited the cell proliferation and differentiation in vitro in a concentration-dependent manner. The treatment also concentration-dependently reduced the expression of BMP2 and Cbfa1, and increased the oxidative injury in MC3T3-E1 cells. Conclusion: The results suggest a close correlation between hypercholesterolaemia and osteoporosis. High cholesterol diet increases the risk of osteoporosis, possible via inhibiting the differentiation and proliferation of osteoblasts. PMID:22036861
Ogawa, Yuko; Tsujimoto, Masafumi; Yanoshita, Ryohei
2016-01-01
Exosomes are small extracellular vesicles containing microRNAs and mRNAs that are produced by various types of cells. We previously used ultrafiltration and size-exclusion chromatography to isolate two types of human salivary exosomes (exosomes I, II) that are different in size and proteomes. We showed that salivary exosomes contain large repertoires of small RNAs. However, precise information regarding long RNAs in salivary exosomes has not been fully determined. In this study, we investigated the compositions of protein-coding RNAs (pcRNAs) and long non-protein-coding RNAs (lncRNAs) of exosome I, exosome II and whole saliva (WS) by next-generation sequencing technology. Although 11% of all RNAs were commonly detected among the three samples, the compositions of reads mapping to known RNAs were similar. The most abundant pcRNA is ribosomal RNA protein, and pcRNAs of some salivary proteins such as S100 calcium-binding protein A8 (protein S100-A8) were present in salivary exosomes. Interestingly, lncRNAs of pseudogenes (presumably, processed pseudogenes) were abundant in exosome I, exosome II and WS. Translationally controlled tumor protein gene, which plays an important role in cell proliferation, cell death and immune responses, was highly expressed as pcRNA and pseudogenes in salivary exosomes. Our results show that salivary exosomes contain various types of RNAs such as pseudogenes and small RNAs, and may mediate intercellular communication by transferring these RNAs to target cells as gene expression regulators.
Genomic and Epigenomic Insights into Nutrition and Brain Disorders
Dauncey, Margaret Joy
2013-01-01
Considerable evidence links many neuropsychiatric, neurodevelopmental and neurodegenerative disorders with multiple complex interactions between genetics and environmental factors such as nutrition. Mental health problems, autism, eating disorders, Alzheimer’s disease, schizophrenia, Parkinson’s disease and brain tumours are related to individual variability in numerous protein-coding and non-coding regions of the genome. However, genotype does not necessarily determine neurological phenotype because the epigenome modulates gene expression in response to endogenous and exogenous regulators, throughout the life-cycle. Studies using both genome-wide analysis of multiple genes and comprehensive analysis of specific genes are providing new insights into genetic and epigenetic mechanisms underlying nutrition and neuroscience. This review provides a critical evaluation of the following related areas: (1) recent advances in genomic and epigenomic technologies, and their relevance to brain disorders; (2) the emerging role of non-coding RNAs as key regulators of transcription, epigenetic processes and gene silencing; (3) novel approaches to nutrition, epigenetics and neuroscience; (4) gene-environment interactions, especially in the serotonergic system, as a paradigm of the multiple signalling pathways affected in neuropsychiatric and neurological disorders. Current and future advances in these four areas should contribute significantly to the prevention, amelioration and treatment of multiple devastating brain disorders. PMID:23503168
Identification of functional elements and regulatory circuits by Drosophila modENCODE
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roy, Sushmita; Ernst, Jason; Kharchenko, Peter V.
2010-12-22
To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- andmore » tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation. Several years after the complete genetic sequencing of many species, it is still unclear how to translate genomic information into a functional map of cellular and developmental programs. The Encyclopedia of DNA Elements (ENCODE) (1) and model organism ENCODE (modENCODE) (2) projects use diverse genomic assays to comprehensively annotate the Homo sapiens (human), Drosophila melanogaster (fruit fly), and Caenorhabditis elegans (worm) genomes, through systematic generation and computational integration of functional genomic data sets. Previous genomic studies in flies have made seminal contributions to our understanding of basic biological mechanisms and genome functions, facilitated by genetic, experimental, computational, and manual annotation of the euchromatic and heterochromatic genome (3), small genome size, short life cycle, and a deep knowledge of development, gene function, and chromosome biology. The functions of {approx}40% of the protein and nonprotein-coding genes [FlyBase 5.12 (4)] have been determined from cDNA collections (5, 6), manual curation of gene models (7), gene mutations and comprehensive genome-wide RNA interference screens (8-10), and comparative genomic analyses (11, 12). The Drosophila modENCODE project has generated more than 700 data sets that profile transcripts, histone modifications and physical nucleosome properties, general and specific transcription factors (TFs), and replication programs in cell lines, isolated tissues, and whole organisms across several developmental stages (Fig. 1). Here, we computationally integrate these data sets and report (i) improved and additional genome annotations, including full-length proteincoding genes and peptides as short as 21 amino acids; (ii) noncoding transcripts, including 132 candidate structural RNAs and 1608 nonstructural transcripts; (iii) additional Argonaute (Ago)-associated small RNA genes and pathways, including new microRNAs (miRNAs) encoded within protein-coding exons and endogenous small interfering RNAs (siRNAs) from 3-inch untranslated regions; (iv) chromatin 'states' defined by combinatorial patterns of 18 chromatin marks that are associated with distinct functions and properties; (v) regions of high TF occupancy and replication activity with likely epigenetic regulation; (vi)mixed TF and miRNA regulatory networks with hierarchical structure and enriched feed-forward loops; (vii) coexpression- and co-regulation-based functional annotations for nearly 3000 genes; (viii) stage- and tissue-specific regulators; and (ix) predictive models of gene expression levels and regulator function.« less
ten Lohuis, Michael R.; Miller, David J.
1998-01-01
In the dinoflagellate Amphidinium carterae, photoadaptation involves changes in the transcription of genes encoding both of the major classes of light-harvesting proteins, the peridinin chlorophyll a proteins (PCPs) and the major a/c-containing intrinsic light-harvesting proteins (LHCs). PCP and LHC transcript levels were increased up to 86- and 6-fold higher, respectively, under low-light conditions relative to cells grown at high illumination. These increases in transcript abundance were accompanied by decreases in the extent of methylation of CpG and CpNpG motifs within or near PCP- and LHC-coding regions. Cytosine methylation levels in A. carterae are therefore nonstatic and may vary with environmental conditions in a manner suggestive of involvement in the regulation of gene expression. However, chemically induced undermethylation was insufficient in activating transcription, because treatment with two methylation inhibitors had no effect on PCP mRNA or protein levels. Regulation of gene activity through changes in DNA methylation has traditionally been assumed to be restricted to higher eukaryotes (deuterostomes and green plants); however, the atypically large genomes of dinoflagellates may have generated the requirement for systems of this type in a relatively “primitive” organism. Dinoflagellates may therefore provide a unique perspective on the evolution of eukaryotic DNA-methylation systems. PMID:9576788
A resource of vectors and ES cells for targeted deletion of microRNAs in mice
Prosser, Haydn M.; Koike-Yusa, Hiroko; Cooper, James D.; Law, Frances C.; Bradley, Allan
2011-01-01
The 21-23 nucleotide single-stranded RNAs classified as microRNAs (miRNA) perform fundamental roles in a wide range of cellular and developmental processes. miRNAs regulate protein expression through sequence-specific base pairing with target messenger RNAs (mRNA) reducing both their stability and the process of protein translation1, 2. At least 30% of protein coding genes appear to be conserved targets for miRNAs1. In contrast to the protein coding genes3, 4, no public resource of miRNA mouse mutant alleles exists. We have generated a library of highly germ-line transmissible C57BL/6N mouse mutant embryonic stem (ES) cells with targeted deletions for the majority of miRNA genes currently annotated within the miRBase registry5. These alleles have been designed to be highly adaptable research tools that can be efficiently altered to create reporter, conditional and other allelic variants. This ES cell resource can be searched electronically and is available from ES cell repositories for distribution to the scientific community6. PMID:21822254
A combinatorial code for pattern formation in Drosophila oogenesis.
Yakoby, Nir; Bristow, Christopher A; Gong, Danielle; Schafer, Xenia; Lembong, Jessica; Zartman, Jeremiah J; Halfon, Marc S; Schüpbach, Trudi; Shvartsman, Stanislav Y
2008-11-01
Two-dimensional patterning of the follicular epithelium in Drosophila oogenesis is required for the formation of three-dimensional eggshell structures. Our analysis of a large number of published gene expression patterns in the follicle cells suggests that they follow a simple combinatorial code based on six spatial building blocks and the operations of union, difference, intersection, and addition. The building blocks are related to the distribution of inductive signals, provided by the highly conserved epidermal growth factor receptor and bone morphogenetic protein signaling pathways. We demonstrate the validity of the code by testing it against a set of patterns obtained in a large-scale transcriptional profiling experiment. Using the proposed code, we distinguish 36 distinct patterns for 81 genes expressed in the follicular epithelium and characterize their joint dynamics over four stages of oogenesis. The proposed combinatorial framework allows systematic analysis of the diversity and dynamics of two-dimensional transcriptional patterns and guides future studies of gene regulation.
De Cegli, Rossella; Iacobacci, Simona; Flore, Gemma; Gambardella, Gennaro; Mao, Lei; Cutillo, Luisa; Lauria, Mario; Klose, Joachim; Illingworth, Elizabeth; Banfi, Sandro; di Bernardo, Diego
2013-01-01
Gene expression profiles can be used to infer previously unknown transcriptional regulatory interaction among thousands of genes, via systems biology 'reverse engineering' approaches. We 'reverse engineered' an embryonic stem (ES)-specific transcriptional network from 171 gene expression profiles, measured in ES cells, to identify master regulators of gene expression ('hubs'). We discovered that E130012A19Rik (E13), highly expressed in mouse ES cells as compared with differentiated cells, was a central 'hub' of the network. We demonstrated that E13 is a protein-coding gene implicated in regulating the commitment towards the different neuronal subtypes and glia cells. The overexpression and knock-down of E13 in ES cell lines, undergoing differentiation into neurons and glia cells, caused a strong up-regulation of the glutamatergic neurons marker Vglut2 and a strong down-regulation of the GABAergic neurons marker GAD65 and of the radial glia marker Blbp. We confirmed E13 expression in the cerebral cortex of adult mice and during development. By immuno-based affinity purification, we characterized protein partners of E13, involved in the Polycomb complex. Our results suggest a role of E13 in regulating the division between glutamatergic projection neurons and GABAergic interneurons and glia cells possibly by epigenetic-mediated transcriptional regulation.
Posttranscriptional regulation of the immediate-early gene EGR1 by light in the mouse retina.
Simon, Perikles; Schott, Klaus; Williams, Robert W; Schaeffel, Frank
2004-12-01
Synaptic plasticity is modulated by differential regulation of transcription factors such as EGR1 which binds to DNA via a zinc finger binding domain. Inactivation of EGR1 has implicated this gene as a key regulator of memory formation and learning. However, it remains puzzling how synaptic input can lead to an up-regulation of the EGR-1 protein within only a few minutes. Here, we show by immunohistochemical staining that the EGR-1 protein is localized in synapses throughout the mouse retina. We demonstrate for the first time that two variants of Egr-1 mRNA are produced in the retina by alternative polyadenylation, with the longer version having an additional 293 base pairs at the end of the 3'UTR. Remarkably, the use of the alternative polyadenylation site is controlled by light. The additional 3'UTR sequence of the longer variant displays an even higher level of phylogenetic conservation than the coding region of this highly conserved gene. Additionally, it harbours a cytoplasmic polyadenylation element which is known to respond to NMDA receptor activation. The longer version of the Egr-1 mRNA could therefore rapidly respond to excitatory stimuli such as light or glutamate release whereas the short variant, which is predominantly expressed and contains the full coding sequence, lacks the regulatory elements for cytoplasmic polyadenylation in its 3'UTR.
Among biomacromolecules, RNA is the most versatile, and it plays indispensable roles in almost all aspects of biology. For example, in addition to serving as mRNAs coding for proteins, RNAs regulate gene expression, such as controlling where, when, and how efficiently a gene gets expressed, participate in RNA processing, encode the genetic information of some viruses, serve as
Proudhon, D; Wei, J; Briat, J; Theil, E C
1996-03-01
Ferritin, a protein widespread in nature, concentrates iron approximately 10(11)-10(12)-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.
Richardson, Casey R.; Luo, Qing-Jun; Gontcharova, Viktoria; Jiang, Ying-Wen; Samanta, Manoj; Youn, Eunseog; Rock, Christopher D.
2010-01-01
Background MicroRNAs (miRNAs) and trans-acting small-interfering RNAs (tasi-RNAs) are small (20–22 nt long) RNAs (smRNAs) generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expression by mechanisms related to RNA interference. The high sequence homology of plant miRNAs to their targets has been the mainstay of miRNA prediction algorithms, which are limited in their predictive power for other kingdoms because miRNA complementarity is less conserved yet transitive processes (production of antisense smRNAs) are active in eukaryotes. We hypothesize that antisense transcription and associated smRNAs are biomarkers which can be computationally modeled for gene discovery. Principal Findings We explored rice (Oryza sativa) sense and antisense gene expression in publicly available whole genome tiling array transcriptome data and sequenced smRNA libraries (as well as C. elegans) and found evidence of transitivity of MIRNA genes similar to that found in Arabidopsis. Statistical analysis of antisense transcript abundances, presence of antisense ESTs, and association with smRNAs suggests several hundred Arabidopsis ‘orphan’ hypothetical genes are non-coding RNAs. Consistent with this hypothesis, we found novel Arabidopsis homologues of some MIRNA genes on the antisense strand of previously annotated protein-coding genes. A Support Vector Machine (SVM) was applied using thermodynamic energy of binding plus novel expression features of sense/antisense transcription topology and siRNA abundances to build a prediction model of miRNA targets. The SVM when trained on targets could predict the “ancient” (deeply conserved) class of validated Arabidopsis MIRNA genes with an accuracy of 84%, and 76% for “new” rapidly-evolving MIRNA genes. Conclusions Antisense and smRNA expression features and computational methods may identify novel MIRNA genes and other non-coding RNAs in plants and potentially other kingdoms, which can provide insight into antisense transcription, miRNA evolution, and post-transcriptional gene regulation. PMID:20520764
Calvanese, Vincenzo; Mallya, Meera; Campbell, R Duncan; Aguado, Begoña
2008-01-01
Background Regulation of the expression of particular genes can rely on mechanisms that are different from classical transcriptional and translational control. The LY6G5B and LY6G6D genes encode LY-6 domain proteins, whose expression seems to be regulated in an original fashion, consisting of an intron retention event which generates, through an early premature stop codon, a non-coding transcript, preventing expression in most cell lines and tissues. Results The MHC LY-6 non-coding transcripts have shown to be stable and very abundant in the cell, and not subject to Nonsense Mediated Decay (NMD). This retention event appears not to be solely dependent on intron features, because in the case of LY6G5B, when the intron is inserted in the artificial context of a luciferase expression plasmid, it is fully spliced but strongly stabilises the resulting luciferase transcript. In addition, by quantitative PCR we found that the retained and spliced forms are differentially expressed in tissues indicating an active regulation of the non-coding transcript. EST database analysis revealed that these genes have an alternative expression pathway with the formation of Transcription Induced Chimeras (TIC). This data was confirmed by RT-PCR, revealing the presence of different transcripts that would encode the chimeric proteins CSNKβ-LY6G5B and G6F-LY6G6D, in which the LY-6 domain would join to a kinase domain and an Ig-like domain, respectively. Conclusion In conclusion, the LY6G5B and LY6G6D intron-retained transcripts are not subjected to NMD and are more abundant than the properly spliced forms. In addition, these genes form chimeric transcripts with their neighbouring same orientation 5' genes. Of interest is the fact that the 5' genes (CSNKβ or G6F) undergo differential splicing only in the context of the chimera (CSNKβ-LY6G5B or G6F-LY6G6C) and not on their own. PMID:18817541
Histone-derived piRNA biogenesis depends on the ping-pong partners Piwi5 and Ago3 in Aedes aegypti
Girardi, Erika; Miesen, Pascal; Pennings, Bas; Frangeul, Lionel; Saleh, Maria-Carla
2017-01-01
Abstract The piRNA pathway is of key importance in controlling transposable elements in most animal species. In the vector mosquito Aedes aegypti, the presence of eight PIWI proteins and the accumulation of viral piRNAs upon arbovirus infection suggest additional functions of the piRNA pathway beyond genome defense. To better understand the regulatory potential of this pathway, we analyzed in detail host-derived piRNAs in A. aegypti Aag2 cells. We show that a large repertoire of protein-coding genes and non-retroviral integrated RNA virus elements are processed into genic piRNAs by different combinations of PIWI proteins. Among these, we identify a class of genes that produces piRNAs from coding sequences in an Ago3- and Piwi5-dependent fashion. We demonstrate that the replication-dependent histone gene family is a genic source of ping-pong dependent piRNAs and that histone-derived piRNAs are dynamically expressed throughout the cell cycle, suggesting a role for the piRNA pathway in the regulation of histone gene expression. Moreover, our results establish the Aag2 cell line as an accessible experimental model to study gene-derived piRNAs. PMID:28115625
DOE Office of Scientific and Technical Information (OSTI.GOV)
Omasits, U.; Quebatte, Maxime; Stekhoven, Daniel J.
2013-11-01
Prokaryotes, due to their moderate complexity, are particularly amenable to the comprehensive identification of the protein repertoire expressed under different conditions. We applied a generic strategy to identify a complete expressed prokaryotic proteome, which is based on the analysis of RNA and proteins extracted from matched samples. Saturated transcriptome profiling by RNA-seq provided an endpoint estimate of the protein-coding genes expressed under two conditions which mimic the interaction of Bartonella henselae with its mammalian host. Directed shotgun proteomics experiments were carried out on four subcellular fractions. By specifically targeting proteins which are short, basic, low abundant, and membrane localized, wemore » could eliminate their initial underrepresentation compared to the estimated endpoint. A total of 1250 proteins were identified with an estimated false discovery rate below 1%. This represents 85% of all distinct annotated proteins and ~90% of the expressed protein-coding genes. Genes that were detected at the transcript but not protein level, were found to be highly enriched in several genomic islands. Furthermore, genes that lacked an ortholog and a functional annotation were not detected at the protein level; these may represent examples of overprediction in genome annotations. A dramatic membrane proteome reorganization was observed, including differential regulation of autotransporters, adhesins, and hemin binding proteins. Particularly noteworthy was the complete membrane proteome coverage, which included expression of all members of the VirB/D4 type IV secretion system, a key virulence factor.« less
Omasits, Ulrich; Quebatte, Maxime; Stekhoven, Daniel J.; Fortes, Claudia; Roschitzki, Bernd; Robinson, Mark D.; Dehio, Christoph; Ahrens, Christian H.
2013-01-01
Prokaryotes, due to their moderate complexity, are particularly amenable to the comprehensive identification of the protein repertoire expressed under different conditions. We applied a generic strategy to identify a complete expressed prokaryotic proteome, which is based on the analysis of RNA and proteins extracted from matched samples. Saturated transcriptome profiling by RNA-seq provided an endpoint estimate of the protein-coding genes expressed under two conditions which mimic the interaction of Bartonella henselae with its mammalian host. Directed shotgun proteomics experiments were carried out on four subcellular fractions. By specifically targeting proteins which are short, basic, low abundant, and membrane localized, we could eliminate their initial underrepresentation compared to the estimated endpoint. A total of 1250 proteins were identified with an estimated false discovery rate below 1%. This represents 85% of all distinct annotated proteins and ∼90% of the expressed protein-coding genes. Genes that were detected at the transcript but not protein level, were found to be highly enriched in several genomic islands. Furthermore, genes that lacked an ortholog and a functional annotation were not detected at the protein level; these may represent examples of overprediction in genome annotations. A dramatic membrane proteome reorganization was observed, including differential regulation of autotransporters, adhesins, and hemin binding proteins. Particularly noteworthy was the complete membrane proteome coverage, which included expression of all members of the VirB/D4 type IV secretion system, a key virulence factor. PMID:23878158
Long Noncoding RNAs in the Yeast S. cerevisiae.
Niederer, Rachel O; Hass, Evan P; Zappulla, David C
2017-01-01
Long noncoding RNAs have recently been discovered to comprise a sizeable fraction of the RNA World. The scope of their functions, physical organization, and disease relevance remain in the early stages of characterization. Although many thousands of lncRNA transcripts recently have been found to emanate from the expansive DNA between protein-coding genes in animals, there are also hundreds that have been found in simple eukaryotes. Furthermore, lncRNAs have been found in the bacterial and archaeal branches of the tree of life, suggesting they are ubiquitous. In this chapter, we focus primarily on what has been learned so far about lncRNAs from the greatly studied single-celled eukaryote, the yeast Saccharomyces cerevisiae. Most lncRNAs examined in yeast have been implicated in transcriptional regulation of protein-coding genes-often in response to forms of stress-whereas a select few have been ascribed yet other functions. Of those known to be involved in transcriptional regulation of protein-coding genes, the vast majority function in cis. There are also some yeast lncRNAs identified that are not directly involved in regulation of transcription. Examples of these include the telomerase RNA and telomere-encoded transcripts. In addition to its role as a template-encoding telomeric DNA synthesis, telomerase RNA has been shown to function as a flexible scaffold for protein subunits of the RNP holoenzyme. The flexible scaffold model provides a specific mechanistic paradigm that is likely to apply to many other lncRNAs that assemble and orchestrate large RNP complexes, even in humans. Looking to the future, it is clear that considerable fundamental knowledge remains to be obtained about the architecture and functions of lncRNAs. Using genetically tractable unicellular model organisms should facilitate lncRNA characterization. The acquired basic knowledge will ultimately translate to better understanding of the growing list of lncRNAs linked to human maladies.
Bes, M T; Hernández, J A; Peleato, M L; Fillat, M F
2001-01-15
A gene coding for a Fur (ferric uptake regulation) protein from the cyanobacterium Anabaena PCC 7119 has been cloned and overexpressed in Escherichia coli. DNA sequence analysis confirmed the presence of a 151-amino-acid open reading frame that showed homology with the Fur proteins reported for the unicellular cyanobacteria Synechococcus 7942 and Synechocystis PCC 6803. Two putative Fur-binding sites were detected in the promoter regions of the fur gene from Anabaena. Partially purified recombinant Fur binds to the flavodoxin promoter as well as its own promoter. This suggests that the Fur gene is autoregulated in Anabaena.
The expanding regulatory universe of p53 in gastrointestinal cancer.
Fesler, Andrew; Zhang, Ning; Ju, Jingfang
2016-01-01
Tumor suppresser gene TP53 is one of the most frequently deleted or mutated genes in gastrointestinal cancers. As a transcription factor, p53 regulates a number of important protein coding genes to control cell cycle, cell death, DNA damage/repair, stemness, differentiation and other key cellular functions. In addition, p53 is also able to activate the expression of a number of small non-coding microRNAs (miRNAs) through direct binding to the promoter region of these miRNAs. Many miRNAs have been identified to be potential tumor suppressors by regulating key effecter target mRNAs. Our understanding of the regulatory network of p53 has recently expanded to include long non-coding RNAs (lncRNAs). Like miRNA, lncRNAs have been found to play important roles in cancer biology. With our increased understanding of the important functions of these non-coding RNAs and their relationship with p53, we are gaining exciting new insights into the biology and function of cells in response to various growth environment changes. In this review we summarize the current understanding of the ever expanding involvement of non-coding RNAs in the p53 regulatory network and its implications for our understanding of gastrointestinal cancer.
Ribosome reinitiation at leader peptides increases translation of bacterial proteins.
Korolev, Semen A; Zverkov, Oleg A; Seliverstov, Alexandr V; Lyubetsky, Vassily A
2016-04-16
Short leader genes usually do not encode stable proteins, although their importance in expression control of bacterial genomes is widely accepted. Such genes are often involved in the control of attenuation regulation. However, the abundance of leader genes suggests that their role in bacteria is not limited to regulation. Specifically, we hypothesize that leader genes increase the expression of protein-coding (structural) genes via ribosome reinitiation at the leader peptide in the case of a short distance between the stop codon of the leader gene and the start codon of the structural gene. For instance, in Actinobacteria, the frequency of leader genes at a distance of 10-11 bp is about 70 % higher than the mean frequency within the 1 to 65 bp range; and it gradually decreases as the range grows longer. A pronounced peak of this frequency-distance relationship is also observed in Proteobacteria, Bacteroidetes, Spirochaetales, Acidobacteria, the Deinococcus-Thermus group, and Planctomycetes. In contrast, this peak falls to the distance of 15-16 bp and is not very pronounced in Firmicutes; and no such peak is observed in cyanobacteria and tenericutes. Generally, this peak is typical for many bacteria. Some leader genes located close to a structural gene probably play a regulatory role as well.
Costa, Caroline B; Monteiro, Karina M; Teichmann, Aline; da Silva, Edileuza D; Lorenzatto, Karina R; Cancela, Martín; Paes, Jéssica A; Benitz, André de N D; Castillo, Estela; Margis, Rogério; Zaha, Arnaldo; Ferreira, Henrique B
2015-08-01
The histone chaperone SET/TAF-Iβ is implicated in processes of chromatin remodelling and gene expression regulation. It has been associated with the control of developmental processes, but little is known about its function in helminth parasites. In Mesocestoides corti, a partial cDNA sequence related to SET/TAF-Iβ was isolated in a screening for genes differentially expressed in larvae (tetrathyridia) and adult worms. Here, the full-length coding sequence of the M. corti SET/TAF-Iβ gene was analysed and the encoded protein (McSET/TAF) was compared with orthologous sequences, showing that McSET/TAF can be regarded as a SET/TAF-Iβ family member, with a typical nucleosome-assembly protein (NAP) domain and an acidic tail. The expression patterns of the McSET/TAF gene and protein were investigated during the strobilation process by RT-qPCR, using a set of five reference genes, and by immunoblot and immunofluorescence, using monospecific polyclonal antibodies. A gradual increase in McSET/TAF transcripts and McSET/TAF protein was observed upon development induction by trypsin, demonstrating McSET/TAF differential expression during strobilation. These results provided the first evidence for the involvement of a protein from the NAP family of epigenetic effectors in the regulation of cestode development.
Omeire, Destiny; Abdin, Shaunte; Brooks, Daniel M; Miranda, Hector C
2015-04-01
The Germain's Peacock-Pheasant Polyplectron germaini (Aves, Galliformes, Phasianidae) is classified as Near Threatened on the IUCN Red List. The complete mitochondrial genome of P. germaini is 16,699 bp, consisting of 13 protein-coding genes, 2 rRNA, 22 tRNA genes and 1 control region. All of the 13 protein-coding genes have ATG as start codon. Eight of the 13 protein-coding genes have TAA as stop codon.
Garcia de la Serrana, Daniel; Devlin, Robert H; Johnston, Ian A
2015-07-31
Coho salmon (Oncorhynchus kisutch) transgenic for growth hormone (Gh) express Gh in multiple tissues which results in increased appetite and continuous high growth with satiation feeding. Restricting Gh-transgenics to the same lower ration (TR) as wild-type fish (WT) results in similar growth, but with the recruitment of fewer, larger diameter, muscle skeletal fibres to reach a given body size. In order to better understand the genetic mechanisms behind these different patterns of muscle growth and to investigate how the decoupling of Gh and nutritional signals affects gene regulation we used RNA-seq to compare the fast skeletal muscle transcriptome in TR and WT coho salmon. Illumina sequencing of individually barcoded libraries from 6 WT and 6 TR coho salmon yielded 704,550,985 paired end reads which were used to construct 323,115 contigs containing 19,093 unique genes of which >10,000 contained >90 % of the coding sequence. Transcripts coding for 31 genes required for myoblast fusion were identified with 22 significantly downregulated in TR relative to WT fish, including 10 (vaspa, cdh15, graf1, crk, crkl, dock1, trio, plekho1a, cdc42a and dock5) associated with signaling through the cell surface protein cadherin. Nineteen out of 44 (43 %) translation initiation factors and 14 of 47 (30 %) protein chaperones were upregulated in TR relative to WT fish. TR coho salmon showed increased growth hormone transcripts and gene expression associated with protein synthesis and folding than WT fish even though net rates of protein accretion were similar. The uncoupling of Gh and amino acid signals likely results in additional costs of transcription associated with protein turnover in TR fish. The predicted reduction in the ionic costs of homeostasis in TR fish associated with increased fibre size were shown to involve multiple pathways regulating myotube fusion, particularly cadherin signaling.
Biology of childhood germ cell tumours, focussing on the significance of microRNAs.
Murray, M J; Nicholson, J C; Coleman, N
2015-01-01
Genomic and protein-coding transcriptomic data have suggested that germ cell tumours (GCTs) of childhood are biologically distinct from those of adulthood. Global messenger RNA profiles segregate malignant GCTs primarily by histology, but then also by age, with numerous transcripts showing age-related differential expression. Such differences are likely to account for the heterogeneous clinico-pathological behaviour of paediatric and adult malignant GCTs. In contrast, as global microRNA signatures of human tumours reflect their developmental lineage, we hypothesized that microRNA profiles would identify common biological abnormalities in all malignant GCTs owing to their presumed shared origin from primordial germ cells. MicroRNAs are short, non-protein-coding RNAs that regulate gene expression via translational repression and/or mRNA degradation. We showed that all malignant GCTs over-express the miR-371-373 and miR-302/367 clusters, regardless of patient age, histological subtype or anatomical tumour site. Furthermore, bioinformatic approaches and subsequent Gene Ontology analysis revealed that these two over-expressed microRNAs clusters co-ordinately down-regulated genes involved in biologically significant pathways in malignant GCTs. The translational potential of this finding has been demonstrated with the detection of elevated serum levels of miR-371-373 and miR-302/367 microRNAs at the time of malignant GCT diagnosis, with levels falling after treatment. The tumour-suppressor let-7 microRNA family has also been shown to be universally down-regulated in malignant GCTs, because of abundant expression of the regulatory gene LIN28. Low let-7 levels resulted in up-regulation of oncogenes including MYCN, AURKB and LIN28 itself, the latter through a direct feedback mechanism. Targeting LIN28, or restoring let-7 levels, both led to effective inhibition of this pathway. In summary, paediatric malignant GCTs show biological differences from their adult counterparts at a genomic and protein-coding transcriptome level, whereas they both display very similar microRNA expression profiles. These similarities and differences may be exploited for diagnostic and/or therapeutic purposes. © 2014 The Authors. Andrology published by John Wiley & Sons Ltd on behalf of American Society of Andrology.
NASA Technical Reports Server (NTRS)
Kano, Mihoko; Kitano, Takako; Ikemoto, Madoka; Hirasaka, Katsuya; Asanoma, Yuki; Ogawa, Takayuki; Takeda, Shinichi; Nonaka, Ikuya; Adams, Gregory R.; Baldwin, Kenneth M.;
2003-01-01
We obtained the skeletal muscle of rats exposed to weightless conditions during a 16-day-spaceflight (STS-90). By using a differential display technique, we identified 6 up-regulated and 3 down-regulated genes in the gastrocnemius muscle of the spaceflight rats, as compared to the ground control. The up-regulated genes included those coding Casitas B-lineage lymphoma-b, insulin growth factor binding protein-1, titin and mitochondrial gene 16 S rRNA and two novel genes (function unknown). The down-regulated genes included those encoding RNA polymerase II elongation factor-like protein, NADH dehydrogenase and one novel gene (function unknown). In the present study, we isolated and characterized one of two novel muscle genes that were remarkably up-regulated by spaceflight. The deduced amino acid sequence of the spaceflight-induced gene (sfig) comprises 86 amino acid residues and is well conserved from Drosophila to Homo sapiens. A putative leucine-zipper structure located at the N-terminal region of sfig suggests that this gene may encode a transcription factor. The up-regulated expression of this gene, confirmed by Northern blot analysis, was observed not only in the muscles of spaceflight rats but also in the muscles of tail-suspended rats, especially in the early stage of tail-suspension when gastrocnemius muscle atrophy initiated. The gene was predominantly expressed in the kidney, liver, small intestine and heart. When rat myoblastic L6 cells were grown to 100% confluence in the cell culture system, the expression of sfig was detected regardless of the cell differentiation state. These results suggest that spaceflight has many genetic effects on rat skeletal muscle.
Kirsten, Holger; Al-Hasani, Hoor; Holdt, Lesca; Gross, Arnd; Beutner, Frank; Krohn, Knut; Horn, Katrin; Ahnert, Peter; Burkhardt, Ralph; Reiche, Kristin; Hackermüller, Jörg; Löffler, Markus; Teupser, Daniel; Thiery, Joachim; Scholz, Markus
2015-08-15
Genetics of gene expression (eQTLs or expression QTLs) has proved an indispensable tool for understanding biological pathways and pathomechanisms of trait-associated SNPs. However, power of most genome-wide eQTL studies is still limited. We performed a large eQTL study in peripheral blood mononuclear cells of 2112 individuals increasing the power to detect trans-effects genome-wide. Going beyond univariate SNP-transcript associations, we analyse relations of eQTLs to biological pathways, polygenetic effects of expression regulation, trans-clusters and enrichment of co-localized functional elements. We found eQTLs for about 85% of analysed genes, and 18% of genes were trans-regulated. Local eSNPs were enriched up to a distance of 5 Mb to the transcript challenging typically implemented ranges of cis-regulations. Pathway enrichment within regulated genes of GWAS-related eSNPs supported functional relevance of identified eQTLs. We demonstrate that nearest genes of GWAS-SNPs might frequently be misleading functional candidates. We identified novel trans-clusters of potential functional relevance for GWAS-SNPs of several phenotypes including obesity-related traits, HDL-cholesterol levels and haematological phenotypes. We used chromatin immunoprecipitation data for demonstrating biological effects. Yet, we show for strongly heritable transcripts that still little trans-chromosomal heritability is explained by all identified trans-eSNPs; however, our data suggest that most cis-heritability of these transcripts seems explained. Dissection of co-localized functional elements indicated a prominent role of SNPs in loci of pseudogenes and non-coding RNAs for the regulation of coding genes. In summary, our study substantially increases the catalogue of human eQTLs and improves our understanding of the complex genetic regulation of gene expression, pathways and disease-related processes. © The Author 2015. Published by Oxford University Press.
Sinha, Pallavi; Pazhamala, Lekha T.; Singh, Vikas K.; Saxena, Rachit K.; Krishnamurthy, L.; Azam, Sarwar; Khan, Aamir W.; Varshney, Rajeev K.
2016-01-01
Pigeonpea is a resilient crop, which is relatively more drought tolerant than many other legume crops. To understand the molecular mechanisms of this unique feature of pigeonpea, 51 genes were selected using the Hidden Markov Models (HMM) those codes for proteins having close similarity to universal stress protein domain. Validation of these genes was conducted on three pigeonpea genotypes (ICPL 151, ICPL 8755, and ICPL 227) having different levels of drought tolerance. Gene expression analysis using qRT-PCR revealed 6, 8, and 18 genes to be ≥2-fold differentially expressed in ICPL 151, ICPL 8755, and ICPL 227, respectively. A total of 10 differentially expressed genes showed ≥2-fold up-regulation in the more drought tolerant genotype, which encoded four different classes of proteins. These include plant U-box protein (four genes), universal stress protein A-like protein (four genes), cation/H(+) antiporter protein (one gene) and an uncharacterized protein (one gene). Genes C.cajan_29830 and C.cajan_33874 belonging to uspA, were found significantly expressed in all the three genotypes with ≥2-fold expression variations. Expression profiling of these two genes on the four other legume crops revealed their specific role in pigeonpea. Therefore, these genes seem to be promising candidates for conferring drought tolerance specifically to pigeonpea. PMID:26779199
Mu, Chuang; Wang, Ruijia; Li, Tianqi; Li, Yuqiang; Tian, Meilin; Jiao, Wenqian; Huang, Xiaoting; Zhang, Lingling; Hu, Xiaoli; Wang, Shi; Bao, Zhenmin
2016-08-01
Long non-coding RNA (lncRNA) structurally resembles mRNA but cannot be translated into protein. Although the systematic identification and characterization of lncRNAs have been increasingly reported in model species, information concerning non-model species is still lacking. Here, we report the first systematic identification and characterization of lncRNAs in two sea cucumber species: (1) Apostichopus japonicus during lipopolysaccharide (LPS) challenge and in heathy tissues and (2) Holothuria glaberrima during radial organ complex regeneration, using RNA-seq datasets and bioinformatics analysis. We identified A. japonicus and H. glaberrima lncRNAs that were differentially expressed during LPS challenge and radial organ complex regeneration, respectively. Notably, the predicted lncRNA-microRNA-gene trinities revealed that, in addition to targeting protein-coding transcripts, miRNAs might also target lncRNAs, thereby participating in a potential novel layer of regulatory interactions among non-coding RNA classes in echinoderms. Furthermore, the constructed coding-non-coding network implied the potential involvement of lncRNA-gene interactions during the regulation of several important genes (e.g., Toll-like receptor 1 [TLR1] and transglutaminase-1 [TGM1]) in response to LPS challenge and radial organ complex regeneration in sea cucumbers. Overall, this pioneer systematic identification, annotation, and characterization of lncRNAs in echinoderm pave the way for similar studies and future genetic, genomic, and evolutionary research in non-model species.
Dissecting the chromatin interactome of microRNA genes.
Chen, Dijun; Fu, Liang-Yu; Zhang, Zhao; Li, Guoliang; Zhang, Hang; Jiang, Li; Harrison, Andrew P; Shanahan, Hugh P; Klukas, Christian; Zhang, Hong-Yu; Ruan, Yijun; Chen, Ling-Ling; Chen, Ming
2014-03-01
Our knowledge of the role of higher-order chromatin structures in transcription of microRNA genes (MIRs) is evolving rapidly. Here we investigate the effect of 3D architecture of chromatin on the transcriptional regulation of MIRs. We demonstrate that MIRs have transcriptional features that are similar to protein-coding genes. RNA polymerase II-associated ChIA-PET data reveal that many groups of MIRs and protein-coding genes are organized into functionally compartmentalized chromatin communities and undergo coordinated expression when their genomic loci are spatially colocated. We observe that MIRs display widespread communication in those transcriptionally active communities. Moreover, miRNA-target interactions are significantly enriched among communities with functional homogeneity while depleted from the same community from which they originated, suggesting MIRs coordinating function-related pathways at posttranscriptional level. Further investigation demonstrates the existence of spatial MIR-MIR chromatin interacting networks. We show that groups of spatially coordinated MIRs are frequently from the same family and involved in the same disease category. The spatial interaction network possesses both common and cell-specific subnetwork modules that result from the spatial organization of chromatin within different cell types. Together, our study unveils an entirely unexplored layer of MIR regulation throughout the human genome that links the spatial coordination of MIRs to their co-expression and function.
Huang, Xiaomei; Zhou, Xi; Hu, Qing; Sun, Binyu; Deng, Mingming; Qi, Xiaolong; Lü, Muhan
2018-01-28
Esophageal cancer is a malignant digestive tract cancer with high mortality. Although studies have found that esophageal cancer is involved in a complex and important gene regulation network, the pathogenesis remains unclear. The recently described long non-coding RNAs (lncRNAs) are one effective part of the gene regulation network. However, in past decades, lncRNAs were thought to be "transcript noise" or "pseudogenes" and were thus ignored. Early studies indicated that lncRNAs play pivotal roles during evolution. However, in recent years, increasing research has revealed that many lncRNAs are associated with tumorigenesis. In particular, lncRNAs may act as important elements for epigenetic regulation, transcription, post-transcriptional regulation and post-translational modification of proteins. Additionally, they may be novel biomarkers for tumors and therapeutic targets in cancer. Here, we summarize the functions of lncRNAs in esophageal cancer, with an emphasis on lncRNA-mediated regulatory mechanisms that affect the biological characteristics of esophageal cancer. Copyright © 2017 Elsevier B.V. All rights reserved.
Basak, Jolly; Nithin, Chandran
2015-01-01
Non-coding RNAs (ncRNAs) have emerged as versatile master regulator of biological functions in recent years. MicroRNAs (miRNAs) are small endogenous ncRNAs of 18-24 nucleotides in length that originates from long self-complementary precursors. Besides their direct involvement in developmental processes, plant miRNAs play key roles in gene regulatory networks and varied biological processes. Alternatively, long ncRNAs (lncRNAs) are a large and diverse class of transcribed ncRNAs whose length exceed that of 200 nucleotides. Plant lncRNAs are transcribed by different RNA polymerases, showing diverse structural features. Plant lncRNAs also are important regulators of gene expression in diverse biological processes. There has been a breakthrough in the technology of genome editing, the CRISPR-Cas9 (clustered regulatory interspaced short palindromic repeats/CRISPR-associated protein 9) technology, in the last decade. CRISPR loci are transcribed into ncRNA and eventually form a functional complex with Cas9 and further guide the complex to cleave complementary invading DNA. The CRISPR-Cas technology has been successfully applied in model plants such as Arabidopsis and tobacco and important crops like wheat, maize, and rice. However, all these studies are focused on protein coding genes. Information about targeting non-coding genes is scarce. Hitherto, the CRISPR-Cas technology has been exclusively used in vertebrate systems to engineer miRNA/lncRNAs, but it is still relatively unexplored in plants. While briefing miRNAs, lncRNAs and applications of the CRISPR-Cas technology in human and animals, this review essentially elaborates several strategies to overcome the challenges of applying the CRISPR-Cas technology in editing ncRNAs in plants and the future perspective of this field.
Non-coding functions of alternative pre-mRNA splicing in development
Mockenhaupt, Stefan; Makeyev, Eugene V.
2015-01-01
A majority of messenger RNA precursors (pre-mRNAs) in the higher eukaryotes undergo alternative splicing to generate more than one mature product. By targeting the open reading frame region this process increases diversity of protein isoforms beyond the nominal coding capacity of the genome. However, alternative splicing also frequently controls output levels and spatiotemporal features of cellular and organismal gene expression programs. Here we discuss how these non-coding functions of alternative splicing contribute to development through regulation of mRNA stability, translational efficiency and cellular localization. PMID:26493705
Role of Temperature Stress on Chloroplast Biogenesis and Protein Import in Pea1[OA
Dutta, Siddhartha; Mohanty, Sasmita; Tripathy, Baishnab C.
2009-01-01
Modulation of photosynthesis and chloroplast biogenesis, by low and high temperatures, was studied in 12-d-old pea (Pisum sativum) plants grown at 25°C and subsequently exposed to 7°C or 40°C up to 48 h. The decline in variable chlorophyll a fluorescence/maximum chlorophyll a fluorescence and estimated electron transport rate in temperature-stressed plants was substantially restored when they were transferred to room temperature. The ATP-driven import of precursor of small subunit of Rubisco (pRSS) into plastids was down-regulated by 67% and 49% in heat-stressed and chill-stressed plants, respectively. Reduction in binding of the pRSS to the chloroplast envelope membranes in heat-stressed plants could be due to the down-regulation of Toc159 gene/protein expression. In addition to impaired binding, reduced protein import into chloroplast in heat-stressed plants was likely due to decreased gene/protein expression of certain components of the TOC complex (Toc75), the TIC complex (Tic20, Tic32, Tic55, and Tic62), stromal Hsp93, and stromal processing peptidase. In chill-stressed plants, the gene/protein expression of most of the components of protein import apparatus other than Tic110 and Tic40 were not affected, suggesting the central role of Tic110 and Tic40 in inhibition of protein import at low temperature. Heating of intact chloroplasts at 35°C for 10 min inhibited protein import, implying a low thermal stability of the protein import apparatus. Results demonstrate that in addition to decreased gene and protein expression, down-regulation of photosynthesis in temperature-stressed plants is caused by reduced posttranslational import of plastidic proteins required for the replacement of impaired proteins coded by nuclear genome. PMID:19403728
de Ramón-Carbonell, Marta; Sánchez-Torres, Paloma
2017-12-01
The Slt2 mitogen-activated protein (MAP) kinase homologue of Penicillium digitatum, the most relevant pathogen-producing citrus green mould decay during postharvest, was identified and explored. The P. digitatum Slt2-MAPK coding gene (PdSlt2) was functionally characterized by homologous gene elimination and transcriptomic evaluation. The absence of PdSlt2 gene resulted in significantly reduced virulence during citrus infection. The ΔPdSlt2 mutants were also defective in asexual reproduction, showing impairment of sporulation during citrus infection. Gene expression analysis revealed that PdSlt2 was highly induced during citrus fruit infection at early stages (1 dpi). Moreover, PdSlt2 deletion altered gene expression profiles. The relative gene expression (RGE) of fungicide resistance- and fungal virulence-related genes showed that PdSlt2 acts as negative regulator of several transporter encoding genes (ABC and MFS transporters) and a positive regulator of two sterol demethylases. This study indicates that PdSlt2 MAPK is functionally preserved in P. digitatum and highlights the relevant role of the PdSlt2 MAP kinase-mediated signalling pathway in regulating diverse genes crucial for infection and asexual reproduction. Copyright © 2017 British Mycological Society. Published by Elsevier Ltd. All rights reserved.
A global view of the nonprotein-coding transcriptome in Plasmodium falciparum
Raabe, Carsten A.; Sanchez, Cecilia P.; Randau, Gerrit; Robeck, Thomas; Skryabin, Boris V.; Chinni, Suresh V.; Kube, Michael; Reinhardt, Richard; Ng, Guey Hooi; Manickam, Ravichandran; Kuryshev, Vladimir Y.; Lanzer, Michael; Brosius, Juergen; Tang, Thean Hock; Rozhdestvensky, Timofey S.
2010-01-01
Nonprotein-coding RNAs (npcRNAs) represent an important class of regulatory molecules that act in many cellular pathways. Here, we describe the experimental identification and validation of the small npcRNA transcriptome of the human malaria parasite Plasmodium falciparum. We identified 630 novel npcRNA candidates. Based on sequence and structural motifs, 43 of them belong to the C/D and H/ACA-box subclasses of small nucleolar RNAs (snoRNAs) and small Cajal body-specific RNAs (scaRNAs). We further observed the exonization of a functional H/ACA snoRNA gene, which might contribute to the regulation of ribosomal protein L7a gene expression. Some of the small npcRNA candidates are from telomeric and subtelomeric repetitive regions, suggesting their potential involvement in maintaining telomeric integrity and subtelomeric gene silencing. We also detected 328 cis-encoded antisense npcRNAs (asRNAs) complementary to P. falciparum protein-coding genes of a wide range of biochemical pathways, including determinants of virulence and pathology. All cis-encoded asRNA genes tested exhibit lifecycle-specific expression profiles. For all but one of the respective sense–antisense pairs, we deduced concordant patterns of expression. Our findings have important implications for a better understanding of gene regulatory mechanisms in P. falciparum, revealing an extended and sophisticated npcRNA network that may control the expression of housekeeping genes and virulence factors. PMID:19864253
A global view of the nonprotein-coding transcriptome in Plasmodium falciparum.
Raabe, Carsten A; Sanchez, Cecilia P; Randau, Gerrit; Robeck, Thomas; Skryabin, Boris V; Chinni, Suresh V; Kube, Michael; Reinhardt, Richard; Ng, Guey Hooi; Manickam, Ravichandran; Kuryshev, Vladimir Y; Lanzer, Michael; Brosius, Juergen; Tang, Thean Hock; Rozhdestvensky, Timofey S
2010-01-01
Nonprotein-coding RNAs (npcRNAs) represent an important class of regulatory molecules that act in many cellular pathways. Here, we describe the experimental identification and validation of the small npcRNA transcriptome of the human malaria parasite Plasmodium falciparum. We identified 630 novel npcRNA candidates. Based on sequence and structural motifs, 43 of them belong to the C/D and H/ACA-box subclasses of small nucleolar RNAs (snoRNAs) and small Cajal body-specific RNAs (scaRNAs). We further observed the exonization of a functional H/ACA snoRNA gene, which might contribute to the regulation of ribosomal protein L7a gene expression. Some of the small npcRNA candidates are from telomeric and subtelomeric repetitive regions, suggesting their potential involvement in maintaining telomeric integrity and subtelomeric gene silencing. We also detected 328 cis-encoded antisense npcRNAs (asRNAs) complementary to P. falciparum protein-coding genes of a wide range of biochemical pathways, including determinants of virulence and pathology. All cis-encoded asRNA genes tested exhibit lifecycle-specific expression profiles. For all but one of the respective sense-antisense pairs, we deduced concordant patterns of expression. Our findings have important implications for a better understanding of gene regulatory mechanisms in P. falciparum, revealing an extended and sophisticated npcRNA network that may control the expression of housekeeping genes and virulence factors.
Zhou, Daling; Du, Qingzhang; Chen, Jinhui; Wang, Qingshi; Zhang, Deqiang
2017-10-01
Long non-coding RNAs (lncRNAs) function in various biological processes. However, their roles in secondary growth of plants remain poorly understood. Here, 15,691 lncRNAs were identified from vascular cambium, developing xylem, and mature xylem of Populus tomentosa with high and low biomass using RNA-seq, including 1,994 lncRNAs that were differentially expressed (DE) among the six libraries. 3,569 cis-regulated and 3,297 trans-regulated protein-coding genes were predicted as potential target genes (PTGs) of the DE lncRNAs to participate in biological regulation. Then, 476 and 28 lncRNAs were identified as putative targets and endogenous target mimics (eTMs) of Populus known microRNAs (miRNAs), respectively. Genome re-sequencing of 435 individuals from a natural population of P. tomentosa found 34,015 single nucleotide polymorphisms (SNPs) within 178 lncRNA loci and 522 PTGs. Single-SNP associations analysis detected 2,993 associations with 10 growth and wood-property traits under additive and dominance model. Epistasis analysis identified 17,656 epistatic SNP pairs, providing evidence for potential regulatory interactions between lncRNAs and their PTGs. Furthermore, a reconstructed epistatic network, representing interactions of 8 lncRNAs and 15 PTGs, might enrich regulation roles of genes in the phenylpropanoid pathway. These findings may enhance our understanding of non-coding genes in plants. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U.; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N.; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O.
2014-01-01
Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes. PMID:25264628
Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O
2014-01-01
Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes.
Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C
2009-02-01
Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.
Guo, Yanqin; Jin, Long; Wang, Fengjiao; He, Mengnan; Liu, Rui; Li, Mingzhou; Shuai, Surong
2014-01-01
Skeletal and cardiac muscle have important roles in glucose uptake and utilization. However, changes in expression of protein coding genes and miRNAs that participate in glucose metabolism during development are not fully understood. In this study, we investigated the expression of genes related to glucose metabolism during muscle development. We found an age-dependent increase in gene expression in cardiac muscle, with enrichment in heart development- and energy-related metabolic processes. A subset of genes that were up-regulated until 30 or 180 days postnatally, and then down-regulated in psoas major muscle was significantly enriched in mitochondrial oxidative-related processes, while genes that up-regulated in longissimus doris muscle was significantly enriched in glycolysis-related processes. Meanwhile, expression of energy-related microRNAs decreased with increasing age. In addition, we investigated the correlation between microRNAs and mRNAs in three muscle types across different stages of development and found many potential microRNA-mRNA pairs involved in regulating glucose metabolism.
Chávez, Santiago; Eastman, Guillermo; Smircich, Pablo; Becco, Lorena Lourdes; Oliveira-Rizzo, Carolina; Fort, Rafael; Potenza, Mariana; Garat, Beatriz; Sotelo-Silveira, José Roberto
2017-01-01
Trypanosoma cruzi is the protozoan parasite causing American trypanosomiasis or Chagas disease, a neglected parasitosis with important human health impact in Latin America. The efficacy of current therapy is limited, and its toxicity is high. Since parasite proliferation is a fundamental target for rational drug design, we sought to progress into its understanding by applying a genome-wide approach. Treating a TcI linage strain with hydroxyurea, we isolated epimastigotes in late G1, S and G2/M cell cycle stages at 70% purity. The sequencing of each phase identified 305 stage-specific transcripts (1.5-fold change, p≤0.01), coding for conserved cell cycle regulated proteins and numerous proteins whose cell cycle dependence has not been recognized before. Comparisons with the parasite T. brucei and the human host reveal important differences. The meta-analysis of T. cruzi transcriptomic and ribonomic data indicates that cell cycle regulated mRNAs are subject to sub-cellular compartmentalization. Compositional and structural biases of these genes- including CAI, GC content, UTR length, and polycistron position- may contribute to their regulation. To discover nucleotide motifs responsible for the co-regulation of cell cycle regulated genes, we looked for overrepresented motifs at their UTRs and found a variant of the cell cycle sequence motif at the 3' UTR of most of the S and G2 stage genes. We additionally identified hairpin structures at the 5' UTRs of a high proportion of the transcripts, suggesting that periodic gene expression might also rely on translation initiation in T. cruzi. In summary, we report a comprehensive list of T. cruzi cell cycle regulated genes, including many previously unstudied proteins, we show evidence favoring a multi-step control of their expression, and we identify mRNA motifs that may mediate their regulation. Our results provide novel information of the T. cruzi proliferative proteins and the integrated levels of their gene expression control. PMID:29182646
Modulation of gene expression via overlapping binding sites exerted by ZNF143, Notch1 and THAP11
Ngondo-Mbongo, Richard Patryk; Myslinski, Evelyne; Aster, Jon C.; Carbon, Philippe
2013-01-01
ZNF143 is a zinc-finger protein involved in the transcriptional regulation of both coding and non-coding genes from polymerase II and III promoters. Our study deciphers the genome-wide regulatory role of ZNF143 in relation with the two previously unrelated transcription factors Notch1/ICN1 and thanatos-associated protein 11 (THAP11) in several human and murine cells. We show that two distinct motifs, SBS1 and SBS2, are associated to ZNF143-binding events in promoters of >3000 genes. Without co-occupation, these sites are also bound by Notch1/ICN1 in T-lymphoblastic leukaemia cells as well as by THAP11, a factor involved in self-renewal of embryonic stem cells. We present evidence that ICN1 binding overlaps with ZNF143 binding events at the SBS1 and SBS2 motifs, whereas the overlap occurs only at SBS2 for THAP11. We demonstrate that the three factors modulate expression of common target genes through the mutually exclusive occupation of overlapping binding sites. The model we propose predicts that the binding competition between the three factors controls biological processes such as rapid cell growth of both neoplastic and stem cells. Overall, our study establishes a novel relationship between ZNF143, THAP11 and ICN1 and reveals important insights into ZNF143-mediated gene regulation. PMID:23408857
Evolution of coding and non-coding genes in HOX clusters of a marsupial.
Yu, Hongshi; Lindsay, James; Feng, Zhi-Ping; Frankenberg, Stephen; Hu, Yanqiu; Carone, Dawn; Shaw, Geoff; Pask, Andrew J; O'Neill, Rachel; Papenfuss, Anthony T; Renfree, Marilyn B
2012-06-18
The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.
Evolution of coding and non-coding genes in HOX clusters of a marsupial
2012-01-01
Background The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672
De Novo Origin of Human Protein-Coding Genes
Wu, Dong-Dong; Irwin, David M.; Zhang, Ya-Ping
2011-01-01
The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA–seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes. PMID:22102831
Burdon, Kathryn P; McKay, James D; Sale, Michèle M; Russell-Eggitt, Isabelle M; Mackey, David A; Wirth, M Gabriela; Elder, James E; Nicoll, Alan; Clarke, Michael P; FitzGerald, Liesel M; Stankovich, James M; Shaw, Marie A; Sharma, Shiwani; Gajovic, Srecko; Gruss, Peter; Ross, Shelley; Thomas, Paul; Voss, Anne K; Thomas, Tim; Gécz, Jozef; Craig, Jamie E
2003-11-01
Nance-Horan syndrome (NHS) is an X-linked disorder characterized by congenital cataracts, dental anomalies, dysmorphic features, and, in some cases, mental retardation. NHS has been mapped to a 1.3-Mb interval on Xp22.13. We have confirmed the same localization in the original, extended Australian family with NHS and have identified protein-truncating mutations in a novel gene, which we have called "NHS," in five families. The NHS gene encompasses approximately 650 kb of genomic DNA, coding for a 1,630-amino acid putative nuclear protein. NHS orthologs were found in other vertebrates, but no sequence similarity to known genes was identified. The murine developmental expression profile of the NHS gene was studied using in situ hybridization and a mouse line containing a lacZ reporter-gene insertion in the Nhs locus. We found a complex pattern of temporally and spatially regulated expression, which, together with the pleiotropic features of NHS, suggests that this gene has key functions in the regulation of eye, tooth, brain, and craniofacial development.
Burdon, Kathryn P.; McKay, James D.; Sale, Michèle M.; Russell-Eggitt, Isabelle M.; Mackey, David A.; Wirth, M. Gabriela; Elder, James E.; Nicoll, Alan; Clarke, Michael P.; FitzGerald, Liesel M.; Stankovich, James M.; Shaw, Marie A.; Sharma, Shiwani; Gajovic, Srecko; Gruss, Peter; Ross, Shelley; Thomas, Paul; Voss, Anne K.; Thomas, Tim; Gécz, Jozef; Craig, Jamie E.
2003-01-01
Nance-Horan syndrome (NHS) is an X-linked disorder characterized by congenital cataracts, dental anomalies, dysmorphic features, and, in some cases, mental retardation. NHS has been mapped to a 1.3-Mb interval on Xp22.13. We have confirmed the same localization in the original, extended Australian family with NHS and have identified protein-truncating mutations in a novel gene, which we have called “NHS,” in five families. The NHS gene encompasses ∼650 kb of genomic DNA, coding for a 1,630–amino acid putative nuclear protein. NHS orthologs were found in other vertebrates, but no sequence similarity to known genes was identified. The murine developmental expression profile of the NHS gene was studied using in situ hybridization and a mouse line containing a lacZ reporter-gene insertion in the Nhs locus. We found a complex pattern of temporally and spatially regulated expression, which, together with the pleiotropic features of NHS, suggests that this gene has key functions in the regulation of eye, tooth, brain, and craniofacial development. PMID:14564667
De Cegli, Rossella; Iacobacci, Simona; Flore, Gemma; Gambardella, Gennaro; Mao, Lei; Cutillo, Luisa; Lauria, Mario; Klose, Joachim; Illingworth, Elizabeth; Banfi, Sandro; di Bernardo, Diego
2013-01-01
Gene expression profiles can be used to infer previously unknown transcriptional regulatory interaction among thousands of genes, via systems biology ‘reverse engineering’ approaches. We ‘reverse engineered’ an embryonic stem (ES)-specific transcriptional network from 171 gene expression profiles, measured in ES cells, to identify master regulators of gene expression (‘hubs’). We discovered that E130012A19Rik (E13), highly expressed in mouse ES cells as compared with differentiated cells, was a central ‘hub’ of the network. We demonstrated that E13 is a protein-coding gene implicated in regulating the commitment towards the different neuronal subtypes and glia cells. The overexpression and knock-down of E13 in ES cell lines, undergoing differentiation into neurons and glia cells, caused a strong up-regulation of the glutamatergic neurons marker Vglut2 and a strong down-regulation of the GABAergic neurons marker GAD65 and of the radial glia marker Blbp. We confirmed E13 expression in the cerebral cortex of adult mice and during development. By immuno-based affinity purification, we characterized protein partners of E13, involved in the Polycomb complex. Our results suggest a role of E13 in regulating the division between glutamatergic projection neurons and GABAergic interneurons and glia cells possibly by epigenetic-mediated transcriptional regulation. PMID:23180766
Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi
2017-12-02
The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.
Potential proteins targeted by let-7f-5p in HeLa cells.
Wang, Yu; Chen, Xiujuan; Zhang, Yi; Song, Jiandong
2017-07-24
MicroRNAs are a class of small, endogenous, non-coding RNAs mediating posttranscriptional gene silencing. The current authors hypothesized that let-7f-5p is likely involved in cell invasion and proliferation by regulating the expression of target genes. The current study combined let-7f-5p with iTRAQ to assess its effect on gene expression in HeLa cells. Results indicated that 164 proteins were expressed at different levels in HeLa cells overexpressing let-7f-5p and negative controls and that 172 proteins were expressed at different levels in let-7f-5p-silenced HeLa cells and negative controls. Results indicated that let-7f-5p may suppress insulin-like growth factor 2 mRNA binding protein 1 (IGF2BP1) in HeLa cells.
Sun, Jiajie; Gao, Yuan; Liu, Dong; Ma, Wei; Xue, Jing; Zhang, Chunlei; Lan, Xianyong; Lei, Chuzhao; Chen, Hong
2012-06-01
The insulin-induced gene 1 (INSIG1) gene encodes a protein that blocks proteolytic activation of sterol regulatory element binding proteins, which are transcription factors that activate genes that regulate cholesterol, fatty acid, and glucose metabolism. However, similar research for the bovine INSIG1 gene is lacking. Therefore, in this study, polymorphisms of the bovine INSIG1 gene were detected in 643 individuals from four cattle breeds by DNA pooling, forced PCR-RFLP, PCR-SSCP, and DNA sequencing methods. Only 10 novel SNPs were identified, which included four mutations in the coding region and the others in the introns. In Nanyang individuals, seven common haplotypes were identified based on four coding region SNPs. The haplotype GACT, with a frequency of 75.4%, was the most prevalent haplotypes and SNPs formed two linkage disequilibrium blocks with strong multi-allelic D' (D' = 1). Additionally, association analysis between mutations of the bovine INSIG1 gene and growth traits in Nanyang cattle at 6, 12, 18, and 24 months old was performed, and the results indicated that the polymorphisms were not significantly associated with body mass.
Roth, Melissa S; Cokus, Shawn J; Gallaher, Sean D; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A; Merchant, Sabeeha S; Pellegrini, Matteo; Niyogi, Krishna K
2017-05-23
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis , because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase ( BKT ), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.
Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; ...
2017-05-08
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. Here, to advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ~58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniformmore » gene density over chromosomes, low repetitive sequence content (~6%), and a high fraction of protein-coding sequence (~39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (~73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. Finally, the high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production.« less
Roth, Melissa S.; Cokus, Shawn J.; Gallaher, Sean D.; Walter, Andreas; Lopez, David; Erickson, Erika; Endelman, Benjamin; Westcott, Daniel; Larabell, Carolyn A.; Merchant, Sabeeha S.; Pellegrini, Matteo
2017-01-01
Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga Chromochloris zofingiensis, because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a C. zofingiensis chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions. The assembly, derived from a combination of short- and long-read sequencing in conjunction with optical mapping, revealed a compact genome of ∼58 Mbp distributed over 19 chromosomes containing 15,274 predicted protein-coding genes. The genome has uniform gene density over chromosomes, low repetitive sequence content (∼6%), and a high fraction of protein-coding sequence (∼39%) with relatively long coding exons and few coding introns. Functional annotation of gene models identified orthologous families for the majority (∼73%) of genes. Synteny analysis uncovered localized but scrambled blocks of genes in putative orthologous relationships with other green algae. Two genes encoding beta-ketolase (BKT), the key enzyme synthesizing astaxanthin, were found in the genome, and both were up-regulated by high light. Isolation and molecular analysis of astaxanthin-deficient mutants showed that BKT1 is required for the production of astaxanthin. Moreover, the transcriptome under high light exposure revealed candidate genes that could be involved in critical yet missing steps of astaxanthin biosynthesis, including ABC transporters, cytochrome P450 enzymes, and an acyltransferase. The high-quality genome and transcriptome provide insight into the green algal lineage and carotenoid production. PMID:28484037
Lehnert, Sigrid A; Reverter, Antonio; Byrne, Keren A; Wang, Yonghong; Nattrass, Greg S; Hudson, Nicholas J; Greenwood, Paul L
2007-01-01
Background The muscle fiber number and fiber composition of muscle is largely determined during prenatal development. In order to discover genes that are involved in determining adult muscle phenotypes, we studied the gene expression profile of developing fetal bovine longissimus muscle from animals with two different genetic backgrounds using a bovine cDNA microarray. Fetal longissimus muscle was sampled at 4 stages of myogenesis and muscle maturation: primary myogenesis (d 60), secondary myogenesis (d 135), as well as beginning (d 195) and final stages (birth) of functional differentiation of muscle fibers. All fetuses and newborns (total n = 24) were from Hereford dams and crossed with either Wagyu (high intramuscular fat) or Piedmontese (GDF8 mutant) sires, genotypes that vary markedly in muscle and compositional characteristics later in postnatal life. Results We obtained expression profiles of three individuals for each time point and genotype to allow comparisons across time and between sire breeds. Quantitative reverse transcription-PCR analysis of RNA from developing longissimus muscle was able to validate the differential expression patterns observed for a selection of differentially expressed genes, with one exception. We detected large-scale changes in temporal gene expression between the four developmental stages in genes coding for extracellular matrix and for muscle fiber structural and metabolic proteins. FSTL1 and IGFBP5 were two genes implicated in growth and differentiation that showed developmentally regulated expression levels in fetal muscle. An abundantly expressed gene with no functional annotation was found to be developmentally regulated in the same manner as muscle structural proteins. We also observed differences in gene expression profiles between the two different sire breeds. Wagyu-sired calves showed higher expression of fatty acid binding protein 5 (FABP5) RNA at birth. The developing longissimus muscle of fetuses carrying the Piedmontese mutation shows an emphasis on glycolytic muscle biochemistry and a large-scale up-regulation of the translational machinery at birth. We also document evidence for timing differences in differentiation events between the two breeds. Conclusion Taken together, these findings provide a detailed description of molecular events accompanying skeletal muscle differentiation in the bovine, as well as gene expression differences that may underpin the phenotype differences between the two breeds. In addition, this study has highlighted a non-coding RNA, which is abundantly expressed and developmentally regulated in bovine fetal muscle. PMID:17697390
A Dual Origin of the Xist Gene from a Protein-Coding Gene and a Set of Transposable Elements
Elisaphenko, Eugeny A.; Kolesnikov, Nikolay N.; Shevchenko, Alexander I.; Rogozin, Igor B.; Nesterova, Tatyana B.; Brockdorff, Neil; Zakian, Suren M.
2008-01-01
X-chromosome inactivation, which occurs in female eutherian mammals is controlled by a complex X-linked locus termed the X-inactivation center (XIC). Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons including those with simple tandem repeats detectable in their structure have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA. PMID:18575625
Splicing regulation and dysregulation of cholinergic genes expressed at the neuromuscular junction.
Ohno, Kinji; Rahman, Mohammad Alinoor; Nazim, Mohammad; Nasrin, Farhana; Lin, Yingni; Takeda, Jun-Ichi; Masuda, Akio
2017-08-01
We humans have evolved by acquiring diversity of alternative RNA metabolisms including alternative means of splicing and transcribing non-coding genes, and not by acquiring new coding genes. Tissue-specific and developmental stage-specific alternative RNA splicing is achieved by tightly regulated spatiotemporal regulation of expressions and activations of RNA-binding proteins that recognize their cognate splicing cis-elements on nascent RNA transcripts. Genes expressed at the neuromuscular junction are also alternatively spliced. In addition, germline mutations provoke aberrant splicing by compromising binding of RNA-binding proteins, and cause congenital myasthenic syndromes (CMS). We present physiological splicing mechanisms of genes for agrin (AGRN), acetylcholinesterase (ACHE), MuSK (MUSK), acetylcholine receptor (AChR) α1 subunit (CHRNA1), and collagen Q (COLQ) in human, and their aberration in diseases. Splicing isoforms of AChE T , AChE H , and AChE R are generated by hnRNP H/F. Skipping of MUSK exon 10 makes a Wnt-insensitive MuSK isoform, which is unique to human. Skipping of exon 10 is achieved by coordinated binding of hnRNP C, YB-1, and hnRNP L to exon 10. Exon P3A of CHRNA1 is alternatively included to generate a non-functional AChR α1 subunit in human. Molecular dissection of splicing mutations in patients with CMS reveals that exon P3A is alternatively skipped by hnRNP H, polypyrimidine tract-binding protein 1, and hnRNP L. Similarly, analysis of an exonic mutation in COLQ exon 16 in a CMS patient discloses that constitutive splicing of exon 16 requires binding of serine arginine-rich splicing factor 1. Intronic and exonic splicing mutations in CMS enable us to dissect molecular mechanisms underlying alternative and constitutive splicing of genes expressed at the neuromuscular junction. This is an article for the special issue XVth International Symposium on Cholinergic Mechanisms. © 2017 International Society for Neurochemistry.
Chen, Ying; Dai, Hongzheng; Chen, Sidi; Zhang, Luoying; Long, Manyuan
2011-04-26
Sphinx is a lineage-specific non-coding RNA gene involved in regulating courtship behavior in Drosophila melanogaster. The 5' flanking region of the gene is conserved across Drosophila species, with the proximal 300 bp being conserved out to D. virilis and a further 600 bp region being conserved amongst the melanogaster subgroup (D. melanogaster, D. simulans, D. sechellia, D. yakuba, and D. erecta). Using a green fluorescence protein transformation system, we demonstrated that a 253 bp region of the highly conserved segment was sufficient to drive sphinx expression in male accessory gland. GFP signals were also observed in brain, wing hairs and leg bristles. An additional ∼800 bp upstream region was able to enhance expression specifically in proboscis, suggesting the existence of enhancer elements. Using anti-GFP staining, we identified putative sphinx expression signal in the brain antennal lobe and inner antennocerebral tract, suggesting that sphinx might be involved in olfactory neuron mediated regulation of male courtship behavior. Whole genome expression profiling of the sphinx knockout mutation identified significant up-regulated gene categories related to accessory gland protein function and odor perception, suggesting sphinx might be a negative regulator of its target genes.
Chen, Sidi; Zhang, Luoying; Long, Manyuan
2011-01-01
Sphinx is a lineage-specific non-coding RNA gene involved in regulating courtship behavior in Drosophila melanogaster. The 5′ flanking region of the gene is conserved across Drosophila species, with the proximal 300 bp being conserved out to D. virilis and a further 600 bp region being conserved amongst the melanogaster subgroup (D. melanogaster, D. simulans, D. sechellia, D. yakuba, and D. erecta). Using a green fluorescence protein transformation system, we demonstrated that a 253 bp region of the highly conserved segment was sufficient to drive sphinx expression in male accessory gland. GFP signals were also observed in brain, wing hairs and leg bristles. An additional ∼800 bp upstream region was able to enhance expression specifically in proboscis, suggesting the existence of enhancer elements. Using anti-GFP staining, we identified putative sphinx expression signal in the brain antennal lobe and inner antennocerebral tract, suggesting that sphinx might be involved in olfactory neuron mediated regulation of male courtship behavior. Whole genome expression profiling of the sphinx knockout mutation identified significant up-regulated gene categories related to accessory gland protein function and odor perception, suggesting sphinx might be a negative regulator of its target genes. PMID:21541324
Højland, Dorte H.; Jensen, Karl-Martin Vagn; Kristensen, Michael
2014-01-01
Background The housefly, Musca domestica, has developed resistance to most insecticides applied for its control. Expression of genes coding for detoxification enzymes play a role in the response of the housefly when encountered by a xenobiotic. The highest level of constitutive gene expression of nine P450 genes was previously found in a newly-collected susceptible field population in comparison to three insecticide-resistant laboratory strains and a laboratory reference strain. Results We compared gene expression of five P450s by qPCR as well as global gene expression by RNAseq in the newly-acquired field population (845b) in generation F1, F13 and F29 to test how gene expression changes following laboratory adaption. Four (CYP6A1, CYP6A36, CYP6D3, CYP6G4) of five investigated P450 genes adapted to breeding by decreasing expression. CYP6D1 showed higher female expression in F29 than in F1. For males, about half of the genes accessed in the global gene expression were up-regulated in F13 and F29 in comparison with the F1 population. In females, 60% of the genes were up-regulated in F13 in comparison with F1, while 33% were up-regulated in F29. Forty potential P450 genes were identified. In most cases, P450 gene expression was decreased in F13 flies in comparison with F1. Gene expression then increased from F13 to F29 in males and decreased further in females. Conclusion The global gene expression changes massively during adaptation to laboratory breeding. In general, global expression decreased as a result of laboratory adaption in males, while female expression was not unidirectional. Expression of P450 genes was in general down-regulated as a result of laboratory adaption. Expression of hexamerin, coding for a storage protein was increased, while gene expression of genes coding for amylases decreased. This suggests a major impact of the surrounding environment on gene response to xenobiotics and genetic composition of housefly strains. PMID:24489682
NASA Technical Reports Server (NTRS)
Weitzel, A. J.; Wyatt, S. E.; Parsons-Wingerter, P.
2016-01-01
Venation patterning in leaves is a major determinant of photosynthesis efficiency because of its dependency on vascular transport of photo-assimilates, water, and minerals. Arabidopsis thaliana grown in microgravity show delayed growth and leaf maturation. Gene expression data from the roots, hypocotyl, and leaves of A. thaliana grown during spaceflight vs. ground control analyzed by Affymetrix microarray are available through NASA's GeneLab (GLDS-7). We analyzed the data for differential expression of genes in leaves resulting from the effects of spaceflight on vascular patterning. Two genes were found by preliminary analysis to be up-regulated during spaceflight that may be related to vascular formation. The genes are responsible for coding an ARGOS (Auxin-Regulated Gene Involved in Organ Size)-like protein (potentially affecting cell elongation in the leaves), and an F-box/kelch-repeat protein (possibly contributing to protoxylem specification). Further analysis that will focus on raw data quality assessment and a moderated t-test may further confirm up-regulation of the two genes and/or identify other gene candidates. Plants defective in these genes will then be assessed for phenotype by the mapping and quantification of leaf vascular patterning by NASA's VESsel GENeration (VESGEN) software to model specific vascular differences of plants grown in spaceflight.
Bonato, Paloma; Alves, Lysangela R; Osaki, Juliana H; Rigo, Liu U; Pedrosa, Fabio O; Souza, Emanuel M; Zhang, Nan; Schumacher, Jörg; Buck, Martin; Wassem, Roseli; Chubatsu, Leda S
2016-11-01
Herbaspirillum seropedicae is a diazotrophic β-Proteobacterium found endophytically associated with gramineae (Poaceae or graminaceous plants) such as rice, sorghum and sugar cane. In this work we show that nitrate-dependent growth in this organism is regulated by the master nitrogen regulatory two-component system NtrB-NtrC, and by NtrY-NtrX, which functions to specifically regulate nitrate metabolism. NtrY is a histidine kinase sensor protein predicted to be associated with the membrane and NtrX is the response regulator partner. The ntrYntrX genes are widely distributed in Proteobacteria. In α-Proteobacteria they are frequently located downstream from ntrBC, whereas in β-Proteobacteria these genes are located downstream from genes encoding an RNA methyltransferase and a proline-rich protein with unknown function. The NtrX protein of α-Proteobacteria has an AAA+ domain, absent in those from β-Proteobacteria. An ntrY mutant of H. seropedicae showed the wild-type nitrogen fixation phenotype, but the nitrate-dependent growth was abolished. Gene fusion assays indicated that NtrY is involved in the expression of genes coding for the assimilatory nitrate reductase as well as the nitrate-responsive two-component system NarX-NarL (narK and narX promoters, respectively). The purified NtrX protein was capable of binding the narK and narX promoters, and the binding site at the narX promoter for the NtrX protein was determined by DNA footprinting. In silico analyses revealed similar sequences in other promoter regions of H. seropedicae that are related to nitrate assimilation, supporting the role of the NtrY-NtrX system in regulating nitrate metabolism in H. seropedicae. © 2016 Federation of European Biochemical Societies.
Gln3p and Nil1p regulation of invertase activity and SUC2 expression in Saccharomyces cerevisiae.
Oliveira, Edna Maria Morais; Mansure, José João; Bon, Elba Pinto da Silva
2005-04-01
In Saccharomyces cerevisiae, sensing and signalling pathways regulate gene expression in response to quality of carbon and nitrogen sources. One such system, the target of rapamycin (Tor) proteins, senses nutrients and uses the GATA activators Gln3p and Nil1p to regulate translation in response to low-quality carbon and nitrogen. The signal transduction, triggered in response to nitrogen nutrition that is sensed by the Tor proteins, operates via a regulatory pathway involving the cytoplasmic factor Ure2p. When carbon and nitrogen are abundant, the phosphorylated Ure2p anchors the also phosphorylated Gln3p and Nil1p in the cytoplasm. Upon a shift from high- to low-quality nitrogen or treatment with rapamycin all three proteins are dephosphorylated, causing Gln3p and Nil1p to enter the nucleus and promote transcription. The genes that code for yeast periplasmic enzymes with nutritional roles would be obvious targets for regulation by the sensing and signalling pathways that respond to quality of carbon and nitrogen sources. Indeed, previous results from our laboratory had shown that the GATA factors Gln3p, Nil1p, Dal80p, Nil2p and also the protein Ure2 regulate the expression of asparaginase II, coded by ASP3. We also had observed that the activity levels of the also periplasmic invertase, coded by SUC2, were 6-fold lower in ure2 mutant cells in comparison to wild-type cells collected at stationary phase. These results suggested similarities between the signalling pathways regulating the expression of ASP3 and SUC2. In the present work we showed that invertase levels displayed by the single nil1 and gln3 and by the double gln3nil1 mutant cells, cultivated in a sucrose-ammonium medium and collected at the exponential phase, were 6-, 10- and 60-fold higher, respectively, in comparison to their wild-type counterparts. RT-PCR data of SUC2 expression in the double-mutant cells indicated a 10-fold increase in the mRNA(SUC2) levels.
Samson, Marie-Laure
2008-01-01
Background The Drosophila gene embryonic lethal abnormal visual system (elav) is the prototype of a gene family present in all metazoans. Its members encode structurally conserved neuronal proteins with three RNA Recognition Motifs (RRM) but they paradoxically act at diverse levels of post-transcriptional regulation. In an attempt to understand the history of this family, we searched for orthologs in eleven completely sequenced genomes, including those of humans, D. melanogaster and C. elegans, for which cDNAs are available. Results We analyzed 23 orthologs/paralogs of elav, and found evidence of gain/loss of gene copy number. For one set of genes, including elav itself, the coding sequences are free of introns and their products most resemble ELAV. The remaining genes show remarkable conservation of their exon organization, and their products most resemble FNE and RBP9, proteins encoded by the two elav paralogs of Drosophila. Remarkably, three of the conserved exon junctions are both close to structural elements, involved respectively in protein-RNA interactions and in the regulation of sub-cellular localization, and in the vicinity of diverse sequence variations. Conclusion The data indicate that the essential elav gene of Drosophila is newly emerged, restricted to dipterans and of retrotransposed origin. We propose that the conserved exon junctions constitute potential sites for sequence/function modifications, and that RRM binding proteins, whose function relies upon plastic RNA-protein interactions, may have played an important role in brain evolution. PMID:18715504
Regulation of IAP (Inhibitor of Apoptosis) Gene Expression by the p53 Tumor Suppressor Protein
2005-05-01
adenovirus, gene therapy, polymorphism, 31 16. PRICE CODE 17. SECURITY CLASSIFICATION 18. SECURITY CLASSIFICATION 19. SECURITY CLASSIFICATION 20...averaged results of three inde- pendent experiments, with standard error. Right panel: Level of p53 in infected cells using the antibody Ab-6 (Calbiochem...with highly purified mitochondria as described in (2). The arrow marks oligomerized BAK. The right _ -. panel depicts the purity of BMH CrosIinked Mito
Gong, Chenguang; Li, Zhizhong; Ramanujan, Krishnan; Clay, Ieuan; Zhang, Yunyu; Lemire-Brachat, Sophie; Glass, David J
2015-07-27
Increasing evidence suggests that long non-coding RNAs (LncRNAs) represent a new class of regulators of stem cells. However, the roles of LncRNAs in stem cell maintenance and myogenesis remain largely unexamined. For this study, hundreds of intergenic LncRNAs were identified that are expressed in myoblasts and regulated during differentiation. One of these LncRNAs, termed LncMyoD, is encoded next to the Myod gene and is directly activated by MyoD during myoblast differentiation. Knockdown of LncMyoD strongly inhibits terminal muscle differentiation, largely due to a failure to exit the cell cycle. LncMyoD directly binds to IGF2-mRNA-binding protein 2 (IMP2) and negatively regulates IMP2-mediated translation of proliferation genes such as N-Ras and c-Myc. While the RNA sequence of LncMyoD is not well conserved between human and mouse, its locus, gene structure, and function are preserved. The MyoD-LncMyoD-IMP2 pathway elucidates a mechanism as to how MyoD blocks proliferation to create a permissive state for differentiation. Copyright © 2015 Elsevier Inc. All rights reserved.
The Big Entity of New RNA World: Long Non-Coding RNAs in Microvascular Complications of Diabetes.
Raut, Satish K; Khullar, Madhu
2018-01-01
A major part of the genome is known to be transcribed into non-protein coding RNAs (ncRNAs), such as microRNA and long non-coding RNA (lncRNA). The importance of ncRNAs is being increasingly recognized in physiological and pathological processes. lncRNAs are a novel class of ncRNAs that do not code for proteins and are important regulators of gene expression. In the past, these molecules were thought to be transcriptional "noise" with low levels of evolutionary conservation. However, recent studies provide strong evidence indicating that lncRNAs are (i) regulated during various cellular processes, (ii) exhibit cell type-specific expression, (iii) localize to specific organelles, and (iv) associated with human diseases. Emerging evidence indicates an aberrant expression of lncRNAs in diabetes and diabetes-related microvascular complications. In the present review, we discuss the current state of knowledge of lncRNAs, their genesis from genome, and the mechanism of action of individual lncRNAs in the pathogenesis of microvascular complications of diabetes and therapeutic approaches.
Liu, Feiling; Guo, Dianhao; Yuan, Zhuting; Chen, Chen; Xiao, Huamei
2017-11-20
Long non-coding RNA (lncRNA) is a class of noncoding RNA >200 bp in length that has essential roles in regulating a variety of biological processes. Here, we constructed a computational pipeline to identify lncRNA genes in the diamondback moth (Plutella xylostella), a major insect pest of cruciferous vegetables. In total, 3,324 lncRNAs corresponding to 2,475 loci were identified from 13 RNA-Seq datasets, including samples from parasitized, insecticide-resistant strains and different developmental stages. The identified P. xylostella lncRNAs had shorter transcripts and fewer exons than protein-coding genes. Seven out of nine randomly selected lncRNAs were validated by strand-specific RT-PCR. In total, 54-172 lncRNAs were specifically expressed in the insecticide resistant strains, among which one lncRNA was located adjacent to the sodium channel gene. In addition, 63-135 lncRNAs were specifically expressed in different developmental stages, among which three lncRNAs overlapped or were located adjacent to the metamorphosis-associated genes. These lncRNAs were either strongly or weakly co-expressed with their overlapping or neighboring mRNA genes. In summary, we identified thousands of lncRNAs and presented evidence that lncRNAs might have key roles in conferring insecticide resistance and regulating the metamorphosis development in P. xylostella.
The Nrf2-antioxidant response element pathway: a target for regulating energy metabolism
USDA-ARS?s Scientific Manuscript database
The nuclear factor E2-related factor 2 (Nrf2) is a transcription factor that responds to oxidative stress by binding to the antioxidant response element (ARE) in the promoter of genes coding for antioxidant enzymes like NAD(P)H:quinone oxidoreductase 1 (NQO1) and proteins for glutathione synthesis. ...
Modulation of Gene Expression in Contextual Fear Conditioning in the Rat
Macchi, Monica; Ciampini, Cristina; Bernardi, Rodolfo; Baldi, Elisabetta; Bucherelli, Corrado; Brunelli, Marcello; Scuri, Rossana
2013-01-01
In contextual fear conditioning (CFC) a single training leads to long-term memory of context-aversive electrical foot-shocks association. Mid-temporal regions of the brain of trained and naive rats were obtained 2 days after conditioning and screened by two-directional suppression subtractive hybridization. A pool of differentially expressed genes was identified and some of them were randomly selected and confirmed with qRT-PCR assay. These transcripts showed high homology for rat gene sequences coding for proteins involved in different cellular processes. The expression of the selected transcripts was also tested in rats which had freely explored the experimental apparatus (exploration) and in rats to which the same number of aversive shocks had been administered in the same apparatus, but temporally compressed so as to make the association between painful stimuli and the apparatus difficult (shock-only). Some genes resulted differentially expressed only in the rats subjected to CFC, others only in exploration or shock-only rats, whereas the gene coding for translocase of outer mitochondrial membrane 20 protein and nardilysin were differentially expressed in both CFC and exploration rats. For example, the expression of stathmin 1 whose transcripts resulted up regulated was also tested to evaluate the transduction and protein localization after conditioning. PMID:24278235
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhaxybayeva, Olga; Swithers, Kristen S; Foght, Julia
2012-01-01
Here we describe the genome of Mesotoga prima MesG1.Ag4.2, the first genome of a mesophilic Thermotogales bacterium. Mesotoga prima was isolated from a polychlorinated biphenyl (PCB)-dechlorinating enrichment culture from Baltimore Harbor sediments. Its 2.97 Mb genome is considerably larger than any previously sequenced Thermotogales genomes, which range between 1.86 and 2.30 Mb. This larger size is due to both higher numbers of protein-coding genes and larger intergenic regions. In particular, the M. prima genome contains more genes for proteins involved in regulatory functions, for instance those involved in regulation of transcription. Together with its closest relative, Kosmotoga olearia, it alsomore » encodes different types of proteins involved in environmental and cell-cell interactions as compared with other Thermotogales bacteria. Amino acid composition analysis of M. prima proteins implies that this lineage has inhabited low-temperature environments for a long time. A large fraction of the M. prima genome has been acquired by lateral gene transfer (LGT): a DarkHorse analysis suggests that 766 (32%) of predicted protein-coding genes have been involved in LGT after Mesotoga diverged from the other Thermotogales lineages. A notable example of a lineage-specific LGT event is a reductive dehalogenase gene - a key enzyme in dehalorespiration, indicating M. prima may have a more active role in PCB dechlorination than was previously assumed.« less
Kakizaki, Fumihiko; Sonoshita, Masahiro; Miyoshi, Hiroyuki; Itatani, Yoshiro; Ito, Shinji; Kawada, Kenji; Sakai, Yoshiharu; Taketo, M Mark
2016-11-01
We recently found that the product of the AES gene functions as a metastasis suppressor of colorectal cancer (CRC) in both humans and mice. Expression of amino-terminal enhancer of split (AES) protein is significantly decreased in liver metastatic lesions compared with primary colon tumors. To investigate its downregulation mechanism in metastases, we searched for transcriptional regulators of AES in human CRC and found that its expression is reduced mainly by transcriptional dysregulation and, in some cases, by additional haploidization of its coding gene. The AES promoter-enhancer is in a typical CpG island, and contains a Yin-Yang transcription factor recognition sequence (YY element). In human epithelial cells of normal colon and primary tumors, transcription factor YY2, a member of the YY family, binds directly to the YY element, and stimulates expression of AES. In a transplantation mouse model of liver metastases, however, expression of Yy2 (and therefore of Aes) is downregulated. In human CRC metastases to the liver, the levels of AES protein are correlated with those of YY2. In addition, we noticed copy-number reduction for the AES coding gene in chromosome 19p13.3 in 12% (5/42) of human CRC cell lines. We excluded other mechanisms such as point or indel mutations in the coding or regulatory regions of the AES gene, CpG methylation in the AES promoter enhancer, expression of microRNAs, and chromatin histone modifications. These results indicate that Aes may belong to a novel family of metastasis suppressors with a CpG-island promoter enhancer, and it is regulated transcriptionally. © 2016 The Authors. Cancer Science published by John Wiley & Sons Australia, Ltd on behalf of Japanese Cancer Association.
Prevalence of transcription promoters within archaeal operons and coding sequences.
Koide, Tie; Reiss, David J; Bare, J Christopher; Pang, Wyming Lee; Facciotti, Marc T; Schmid, Amy K; Pan, Min; Marzolf, Bruz; Van, Phu T; Lo, Fang-Yin; Pratap, Abhishek; Deutsch, Eric W; Peterson, Amelia; Martin, Dan; Baliga, Nitin S
2009-01-01
Despite the knowledge of complex prokaryotic-transcription mechanisms, generalized rules, such as the simplified organization of genes into operons with well-defined promoters and terminators, have had a significant role in systems analysis of regulatory logic in both bacteria and archaea. Here, we have investigated the prevalence of alternate regulatory mechanisms through genome-wide characterization of transcript structures of approximately 64% of all genes, including putative non-coding RNAs in Halobacterium salinarum NRC-1. Our integrative analysis of transcriptome dynamics and protein-DNA interaction data sets showed widespread environment-dependent modulation of operon architectures, transcription initiation and termination inside coding sequences, and extensive overlap in 3' ends of transcripts for many convergently transcribed genes. A significant fraction of these alternate transcriptional events correlate to binding locations of 11 transcription factors and regulators (TFs) inside operons and annotated genes-events usually considered spurious or non-functional. Using experimental validation, we illustrate the prevalence of overlapping genomic signals in archaeal transcription, casting doubt on the general perception of rigid boundaries between coding sequences and regulatory elements.
The point of no return: The poly(A)-associated elongation checkpoint.
Tellier, Michael; Ferrer-Vicens, Ivan; Murphy, Shona
2016-01-01
Cyclin-dependent kinases play critical roles in transcription by RNA polymerase II (pol II) and processing of the transcripts. For example, CDK9 regulates transcription of protein-coding genes, splicing, and 3' end formation of the transcripts. Accordingly, CDK9 inhibitors have a drastic effect on the production of mRNA in human cells. Recent analyses indicate that CDK9 regulates transcription at the early-elongation checkpoint of the vast majority of pol II-transcribed genes. Our recent discovery of an additional CDK9-regulated elongation checkpoint close to poly(A) sites adds a new layer to the control of transcription by this critical cellular kinase. This novel poly(A)-associated checkpoint has the potential to powerfully regulate gene expression just before a functional polyadenylated mRNA is produced: the point of no return. However, many questions remain to be answered before the role of this checkpoint becomes clear. Here we speculate on the possible biological significance of this novel mechanism of gene regulation and the players that may be involved.
MicroRNAs: regulators of gene expression and cell differentiation
Shivdasani, Ramesh A.
2006-01-01
The existence and roles of a class of abundant regulatory RNA molecules have recently come into sharp focus. Micro-RNAs (miRNAs) are small (approximately 22 bases), non–protein-coding RNAs that recognize target sequences of imperfect complementarity in cognate mRNAs and either destabilize them or inhibit protein translation. Although mechanisms of miRNA biogenesis have been elucidated in some detail, there is limited appreciation of their biological functions. Reported examples typically focus on miRNA regulation of a single tissue-restricted transcript, often one encoding a transcription factor, that controls a specific aspect of development, cell differentiation, or physiology. However, computational algorithms predict up to hundreds of putative targets for individual miRNAs, single transcripts may be regulated by multiple miRNAs, and miRNAs may either eliminate target gene expression or serve to finetune transcript and protein levels. Theoretical considerations and early experimental results hence suggest diverse roles for miRNAs as a class. One appealing possibility, that miRNAs eliminate low-level expression of unwanted genes and hence refine unilineage gene expression, may be especially amenable to evaluation in models of hematopoiesis. This review summarizes current understanding of miRNA mechanisms, outlines some of the important outstanding questions, and describes studies that attempt to define miRNA functions in hematopoiesis. PMID:16882713
Prediction of plant lncRNA by ensemble machine learning classifiers.
Simopoulos, Caitlin M A; Weretilnyk, Elizabeth A; Golding, G Brian
2018-05-02
In plants, long non-protein coding RNAs are believed to have essential roles in development and stress responses. However, relative to advances on discerning biological roles for long non-protein coding RNAs in animal systems, this RNA class in plants is largely understudied. With comparatively few validated plant long non-coding RNAs, research on this potentially critical class of RNA is hindered by a lack of appropriate prediction tools and databases. Supervised learning models trained on data sets of mostly non-validated, non-coding transcripts have been previously used to identify this enigmatic RNA class with applications largely focused on animal systems. Our approach uses a training set comprised only of empirically validated long non-protein coding RNAs from plant, animal, and viral sources to predict and rank candidate long non-protein coding gene products for future functional validation. Individual stochastic gradient boosting and random forest classifiers trained on only empirically validated long non-protein coding RNAs were constructed. In order to use the strengths of multiple classifiers, we combined multiple models into a single stacking meta-learner. This ensemble approach benefits from the diversity of several learners to effectively identify putative plant long non-coding RNAs from transcript sequence features. When the predicted genes identified by the ensemble classifier were compared to those listed in GreeNC, an established plant long non-coding RNA database, overlap for predicted genes from Arabidopsis thaliana, Oryza sativa and Eutrema salsugineum ranged from 51 to 83% with the highest agreement in Eutrema salsugineum. Most of the highest ranking predictions from Arabidopsis thaliana were annotated as potential natural antisense genes, pseudogenes, transposable elements, or simply computationally predicted hypothetical protein. Due to the nature of this tool, the model can be updated as new long non-protein coding transcripts are identified and functionally verified. This ensemble classifier is an accurate tool that can be used to rank long non-protein coding RNA predictions for use in conjunction with gene expression studies. Selection of plant transcripts with a high potential for regulatory roles as long non-protein coding RNAs will advance research in the elucidation of long non-protein coding RNA function.
The PhoP-Dependent ncRNA Mcr7 Modulates the TAT Secretion System in Mycobacterium tuberculosis
Benjak, Andrej; Uplekar, Swapna; Rougemont, Jacques; Guilhot, Christophe; Malaga, Wladimir; Martín, Carlos; Cole, Stewart T.
2014-01-01
The PhoPR two-component system is essential for virulence in Mycobacterium tuberculosis where it controls expression of approximately 2% of the genes, including those for the ESX-1 secretion apparatus, a major virulence determinant. Mutations in phoP lead to compromised production of pathogen-specific cell wall components and attenuation both ex vivo and in vivo. Using antibodies against the native protein in ChIP-seq experiments (chromatin immunoprecipitation followed by high-throughput sequencing) we demonstrated that PhoP binds to at least 35 loci on the M. tuberculosis genome. The PhoP regulon comprises several transcriptional regulators as well as genes for polyketide synthases and PE/PPE proteins. Integration of ChIP-seq results with high-resolution transcriptomic analysis (RNA-seq) revealed that PhoP controls 30 genes directly, whilst regulatory cascades are responsible for signal amplification and downstream effects through proteins like EspR, which controls Esx1 function, via regulation of the espACD operon. The most prominent site of PhoP regulation was located in the intergenic region between rv2395 and PE_PGRS41, where the mcr7 gene codes for a small non-coding RNA (ncRNA). Northern blot experiments confirmed the absence of Mcr7 in an M. tuberculosis phoP mutant as well as low-level expression of the ncRNA in M. tuberculosis complex members other than M. tuberculosis. By means of genetic and proteomic analyses we demonstrated that Mcr7 modulates translation of the tatC mRNA thereby impacting the activity of the Twin Arginine Translocation (Tat) protein secretion apparatus. As a result, secretion of the immunodominant Ag85 complex and the beta-lactamase BlaC is affected, among others. Mcr7, the first ncRNA of M. tuberculosis whose function has been established, therefore represents a missing link between the PhoPR two-component system and the downstream functions necessary for successful infection of the host. PMID:24874799
Specific DNA binding of the two chicken Deformed family homeodomain proteins, Chox-1.4 and Chox-a.
Sasaki, H; Yokoyama, E; Kuroiwa, A
1990-01-01
The cDNA clones encoding two chicken Deformed (Dfd) family homeobox containing genes Chox-1.4 and Chox-a were isolated. Comparison of their amino acid sequences with another chicken Dfd family homeodomain protein and with those of mouse homologues revealed that strong homologies are located in the amino terminal regions and around the homeodomains. Although homologies in other regions were relatively low, some short conserved sequences were also identified. E. coli-made full length proteins were purified and used for the production of specific antibodies and for DNA binding studies. The binding profiles of these proteins to the 5'-leader and 5'-upstream sequences of Chox-1.4 and Chox-a coding regions were analyzed by immunoprecipitation and DNase I footprint assays. These two Chox proteins bound to the same sites in the 5'-flanking sequences of their coding regions with various affinities and their binding affinities to each site were nearly the same. The consensus sequences of the high and low affinity binding sites were TAATGA(C/G) and CTAATTTT, respectively. A clustered binding site was identified in the 5'-upstream of the Chox-a gene, suggesting that this clustered binding site works as a cis-regulatory element for auto- and/or cross-regulation of Chox-a gene expression. Images PMID:1970866
Fransz, Paul F; de Jong, J Hans
2002-12-01
Recent studies in yeast, animals and plants have provided major breakthroughs in unraveling the molecular mechanism of higher-order gene regulation. In conjunction with the DNA code, proteins that are involved in chromatin remodeling, histone modification and epigenetic imprinting form a large network of interactions that control the nuclear programming of cell identity. New insight into how chromatin conformations are regulated in plants sheds light on the relationships between chromosome function, cell differentiation and developmental patterns.
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae).
Pan, Hong-Chun; Fang, Hong-Yan; Li, Shi-Wei; Liu, Jun-Hong; Wang, Ying; Wang, An-Tai
2014-12-01
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae) is composed of two linear DNA molecules. The mitochondrial DNA (mtDNA) molecule 1 is 8010 bp long and contains six protein-coding genes, large subunit rRNA, methionine and tryptophan tRNAs, two pseudogenes consisting respectively of a partial copy of COI, and terminal sequences at two ends of the linear mtDNA, while the mtDNA molecule 2 is 7576 bp long and contains seven protein-coding genes, small subunit rRNA, methionine tRNA, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mtDNA. COI gene begins with GTG as start codon, whereas other 12 protein-coding genes start with a typical ATG initiation codon. In addition, all protein-coding genes are terminated with TAA as stop codon.
Jiang, Shu-Ye; Sevugan, Mayalagu; Ramachandran, Srinivasan
2018-05-09
Valine-glutamine (VQ) motif containing proteins play important roles in abiotic and biotic stress responses in plants. However, little is known about the origin and evolution as well as comprehensive expression regulation of the VQ gene family. In this study, we systematically surveyed this gene family in 50 plant genomes from algae, moss, gymnosperm and angiosperm and explored their presence in other species from animals, bacteria, fungi and viruses. No VQs were detected in all tested algae genomes and all genomes from moss, gymnosperm and angiosperm encode varying numbers of VQs. Interestingly, some of fungi, lower animals and bacteria also encode single to a few VQs. Thus, they are not plant-specific and should be regarded as an ancient family. Their family expansion was mainly due to segmental duplication followed by tandem duplication and mobile elements. Limited contribution of gene conversion was detected to the family evolution. Generally, VQs were very much conserved in their motif coding region and were under purifying selection. However, positive selection was also observed during species divergence. Many VQs were up- or down-regulated by various abiotic / biotic stresses and phytohormones in rice and Arabidopsis. They were also co-expressed with some of other stress-related genes. All of the expression data suggest a comprehensive expression regulation of the VQ gene family. We provide new insights into gene expansion, divergence, evolution and their expression regulation of this VQ family. VQs were detectable not only in plants but also in some of fungi, lower animals and bacteria, suggesting the evolutionary conservation and the ancient origin. Overall, VQs are non-plant-specific and play roles in abiotic / biotic responses or other biological processes through comprehensive expression regulation.
Perspectives on the mechanism of transcriptional regulation by long non-coding RNAs.
Roberts, Thomas C; Morris, Kevin V; Weinberg, Marc S
2014-01-01
Long non-coding RNAs (lncRNAs) are increasingly being recognized as epigenetic regulators of gene transcription. The diversity and complexity of lncRNA genes means that they exert their regulatory effects by a variety of mechanisms. Although there is still much to be learned about the mechanism of lncRNA function, general principles are starting to emerge. In particular, the application of high throughput (deep) sequencing methodologies has greatly advanced our understanding of lncRNA gene function. lncRNAs function as adaptors that link specific chromatin loci with chromatin-remodeling complexes and transcription factors. lncRNAs can act in cis or trans to guide epigenetic-modifier complexes to distinct genomic sites, or act as scaffolds which recruit multiple proteins simultaneously, thereby coordinating their activities. In this review we discuss the genomic organization of lncRNAs, the importance of RNA secondary structure to lncRNA functionality, the multitude of ways in which they interact with the genome, and what evolutionary conservation tells us about their function.
NASA Astrophysics Data System (ADS)
Mohanan, Varsha C.; Chandarana, Pinal M.; Chattoo, Bharat. B.; Patkar, Rajesh N.; Manjrekar, Johannes
2017-05-01
Two-component signal transduction (TCST) pathways play crucial roles in many cellular functions such as stress responses, biofilm formation and sporulation. The histidine phosphotransferase (HPt), which is an intermediate phosphotransfer protein in a two-component system, transfers a phosphate group to a phosphorylatable aspartate residue in the target protein(s), and up-regulates stress-activated MAP kinase cascades. Most fungal genomes carry a single copy of the gene coding for HPt, which are potential antifungal targets. However, unlike the histidine kinases (HK) or the downstream response regulators (RR) in two-component system, the HPts have not been well studied in phytopathogenic fungi. In this study, we investigated the role of HPt in the model rice-blast fungal pathogen Magnaporthe oryzae. We found that in M. oryzae an additional isoform of the HPT gene YPD1 was expressed specifically in response to light. Further, the expression of light-regulated genes such as those encoding envoy and blue-light-harvesting protein, and PAS domain containing HKs was significantly reduced upon down-regulation of YPD1 in M. oryzae. Importantly, down-regulation of YPD1 led to a significant decrease in the ability to penetrate the host cuticle and in light-dependent conidiation in M. oryzae. Thus, our results indicate that Ypd1 plays an important role in asexual development and host invasion, and suggest that YPD1 isoforms likely have distinct roles to play in the rice-blast pathogen M. oryzae.
Schwientek, Patrick; Neshat, Armin; Kalinowski, Jörn; Klein, Andreas; Rückert, Christian; Schneiker-Bekel, Susanne; Wendler, Sergej; Stoye, Jens; Pühler, Alfred
2014-11-20
Actinoplanes sp. SE50/110 is the producer of the alpha-glucosidase inhibitor acarbose, which is an economically relevant and potent drug in the treatment of type-2 diabetes mellitus. In this study, we present the detection of transcription start sites on this genome by sequencing enriched 5'-ends of primary transcripts. Altogether, 1427 putative transcription start sites were initially identified. With help of the annotated genome sequence, 661 transcription start sites were found to belong to the leader region of protein-coding genes with the surprising result that roughly 20% of these genes rank among the class of leaderless transcripts. Next, conserved promoter motifs were identified for protein-coding genes with and without leader sequences. The mapped transcription start sites were finally used to improve the annotation of the Actinoplanes sp. SE50/110 genome sequence. Concerning protein-coding genes, 41 translation start sites were corrected and 9 novel protein-coding genes could be identified. In addition to this, 122 previously undetermined non-coding RNA (ncRNA) genes of Actinoplanes sp. SE50/110 were defined. Focusing on antisense transcription start sites located within coding genes or their leader sequences, it was discovered that 96 of those ncRNA genes belong to the class of antisense RNA (asRNA) genes. The remaining 26 ncRNA genes were found outside of known protein-coding genes. Four chosen examples of prominent ncRNA genes, namely the transfer messenger RNA gene ssrA, the ribonuclease P class A RNA gene rnpB, the cobalamin riboswitch RNA gene cobRS, and the selenocysteine-specific tRNA gene selC, are presented in more detail. This study demonstrates that sequencing of enriched 5'-ends of primary transcripts and the identification of transcription start sites are valuable tools for advanced genome annotation of Actinoplanes sp. SE50/110 and most probably also for other bacteria. Copyright © 2014 Elsevier B.V. All rights reserved.
Network perturbation by recurrent regulatory variants in cancer
Cho, Ara; Lee, Insuk; Choi, Jung Kyoon
2017-01-01
Cancer driving genes have been identified as recurrently affected by variants that alter protein-coding sequences. However, a majority of cancer variants arise in noncoding regions, and some of them are thought to play a critical role through transcriptional perturbation. Here we identified putative transcriptional driver genes based on combinatorial variant recurrence in cis-regulatory regions. The identified genes showed high connectivity in the cancer type-specific transcription regulatory network, with high outdegree and many downstream genes, highlighting their causative role during tumorigenesis. In the protein interactome, the identified transcriptional drivers were not as highly connected as coding driver genes but appeared to form a network module centered on the coding drivers. The coding and regulatory variants associated via these interactions between the coding and transcriptional drivers showed exclusive and complementary occurrence patterns across tumor samples. Transcriptional cancer drivers may act through an extensive perturbation of the regulatory network and by altering protein network modules through interactions with coding driver genes. PMID:28333928
Cioffi, Anna Valentina; Ferrara, Diana; Cubellis, Maria Vittoria; Aniello, Francesco; Corrado, Marcella; Liguori, Francesca; Amoroso, Alessandro; Fucci, Laura; Branno, Margherita
2002-08-01
Analysis of the genome structure of the Paracentrotus lividus (sea urchin) DNA methyltransferase (DNA MTase) gene showed the presence of an open reading frame, named METEX, in intron 7 of the gene. METEX expression is developmentally regulated, showing no correlation with DNA MTase expression. In fact, DNA MTase transcripts are present at high concentrations in the early developmental stages, while METEX is expressed at late stages of development. Two METEX cDNA clones (Met1 and Met2) that are different in the 3' end have been isolated in a cDNA library screening. The putative translated protein from Met2 cDNA clone showed similarity with Escherichia coli endonuclease III on the basis of sequence and predictive three-dimensional structure. The protein, overexpressed in E. coli and purified, had functional properties similar to the endonuclease specific for apurinic/apyrimidinic (AP) sites on the basis of the lyase activity. Therefore the open reading frame, present in intron 7 of the P. lividus DNA MTase gene, codes for a functional AP endonuclease designated SuAP1.
Evidence for Phex haploinsufficiency in murine X-linked hypophosphatemia.
Wang, L; Du, L; Ecarot, B
1999-04-01
Mutations in the PHEX gene (phosphate-regulating gene with homology to endopeptidases on the X-chromosome) are responsible for X-linked hypophosphatemia (HYP). We previously reported the full-length coding sequence of murine Phex cDNA and provided evidence of Phex expression in bone and tooth. Here, we report the cloning of the entire 3.5-kb 3'UTR of the Phex gene, yielding a total of 6248 bp for the Phex transcript. Southern blot and RT-PCR analyses revealed that the 3' end of the coding sequence and the 3'UTR of the Phex gene, spanning exons 16 to 22, are deleted in Hyp, the mouse model for HYP. Northern blot analysis of bone revealed lack of expression of stable Phex mRNA from the mutant allele and expression of Phex transcripts from the wild-type allele in Hyp heterozygous females. Expression of the Phex protein in heterozygotes was confirmed by Western analysis with antibodies raised against a COOH-terminal peptide of the mouse Phex protein. Taken together, these results indicate that the dominant pattern of Hyp inheritance in mice is due to Phex haploinsufficiency.
Ruan, Ruoxin; Chung, Kuang-Ren; Li, Hongye
2017-12-01
Sterol regulatory element binding proteins (SREBPs) are required for sterol homeostasis in eukaryotes. Activation of SREBPs is regulated by the Dsc E3 ligase complex in Schizosaccharomyces pombe and Aspergillus spp. Previous studies indicated that an SREBP-coding gene PdsreA is required for fungicide resistance and ergosterol biosynthesis in the citrus postharvest pathogen Penicillium digitatum. In this study, five genes, designated PddscA, PddscB, PddscC, PddscD, and PddscE encoding the Dsc E3 ligase complex were characterized to be required for fungicide resistance, ergosterol biosynthesis and CoCl 2 tolerance in P. digitatum. Each of the dsc genes was inactivated by target gene disruption and the resulted phenotypes were analyzed and compared. Genetic analysis reveals that, of five Dsc complex components, PddscB is the core subunit gene in P. digitatum. Although the resultant dsc mutants were able to infect citrus fruit and induce maceration lesions as the wild-type, the mutants rarely produced aerial mycelia on affected citrus fruit peels. P. digitatum Dsc proteins regulated not only the expression of genes involved in ergosterol biosynthesis but also that of PdsreA. Yeast two-hybrid assays revealed a direct interaction between the PdSreA protein and the Dsc proteins. Ectopic expression of the PdSreA N-terminus restored fungicide resistance in the dsc mutants. Our results provide important evidence to understand the mechanisms underlying SREBP activation and regulation of ergosterol biosynthesis in plant pathogenic fungi. Copyright © 2017 Elsevier GmbH. All rights reserved.
Non-coding functions of alternative pre-mRNA splicing in development.
Mockenhaupt, Stefan; Makeyev, Eugene V
2015-12-01
A majority of messenger RNA precursors (pre-mRNAs) in the higher eukaryotes undergo alternative splicing to generate more than one mature product. By targeting the open reading frame region this process increases diversity of protein isoforms beyond the nominal coding capacity of the genome. However, alternative splicing also frequently controls output levels and spatiotemporal features of cellular and organismal gene expression programs. Here we discuss how these non-coding functions of alternative splicing contribute to development through regulation of mRNA stability, translational efficiency and cellular localization. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.
Transcriptomic Responses to Salinity Stress in the Pacific Oyster Crassostrea gigas
Zhao, Xuelin; Yu, Hong; Kong, Lingfeng; Li, Qi
2012-01-01
Background Low salinity is one of the main factors limiting the distribution and survival of marine species. As a euryhaline species, the Pacific oyster Crassostrea gigas is considered to be tolerant to relative low salinity. The genes that regulate C. gigas responses to osmotic stress were monitored using the next-generation sequencing of whole transcriptome with samples taken from gills. By RNAseq technology, transcript catalogs of up- and down-regulated genes were generated from the oysters exposed to low and optimal salinity seawater. Methodology/Principal Findings Through Illumina sequencing, we reported 1665 up-regulated transcripts and 1815 down-regulated transcripts. A total of 45771 protein-coding contigs were identified from two groups based on sequence similarities with known proteins. As determined by GO annotation and KEGG pathway mapping, functional annotation of the genes recovered diverse biological functions and processes. The genes that changed expression significantly were highly represented in cellular process and regulation of biological process, intracellular and cell, binding and protein binding according to GO annotation. The results highlighted genes related to osmoregulation, signaling and interactions of osmotic stress response, anti-apoptotic reactions as well as immune response, cell adhesion and communication, cytoskeleton and cell cycle. Conclusions/Significance Through more than 1.5 million sequence reads and the expression data of the two libraries, the study provided some useful insights into signal transduction pathways in oysters and offered a number of candidate genes as potential markers of tolerance to hypoosmotic stress for oysters. In addition, the characterization of C. gigas transcriptome will not only provide a better understanding of the molecular mechanisms about the response to osmotic stress of the oysters, but also facilitate research into biological processes to find underlying physiological adaptations to hypoosmotic shock for marine invertebrates. PMID:23029449
Iparraguirre, Leire; Muñoz-Culla, Maider; Prada-Luengo, Iñigo; Castillo-Triviño, Tamara; Olascoaga, Javier; Otaegui, David
2017-09-15
Multiple sclerosis is an autoimmune disease, with higher prevalence in women, in whom the immune system is dysregulated. This dysregulation has been shown to correlate with changes in transcriptome expression as well as in gene-expression regulators, such as non-coding RNAs (e.g. microRNAs). Indeed, some of these have been suggested as biomarkers for multiple sclerosis even though few biomarkers have reached the clinical practice. Recently, a novel family of non-coding RNAs, circular RNAs, has emerged as a new player in the complex network of gene-expression regulation. MicroRNA regulation function through a 'sponge system' and a RNA splicing regulation function have been proposed for the circular RNAs. This regulating role together with their high stability in biofluids makes them seemingly good candidates as biomarkers. Given the dysregulation of both protein-coding and non-coding transcriptome that have been reported in multiple sclerosis patients, we hypothesised that circular RNA expression may also be altered. Therefore, we carried out expression profiling of 13.617 circular RNAs in peripheral blood leucocytes from multiple sclerosis patients and healthy controls finding 406 differentially expressed (P-value < 0.05, Fold change > 1.5) and demonstrate after validation that, circ_0005402 and circ_0035560 are underexpressed in multiple sclerosis patients and could be used as biomarkers of the disease. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Zou, Cheng; Li, Jingxuan; Luo, Wenzhe; Li, Long; Hu, An; Fu, Yuhua; Hou, Ye; Li, Changchun
2017-08-18
Long intergenic non-coding RNAs (lincRNAs) play essential roles in numerous biological processes and are widely studied. The skeletal muscle is an important tissue that plays an essential role in individual movement ability. However, lincRNAs in pig skeletal muscles are largely undiscovered and their biological functions remain elusive. In this study, we assembled transcriptomes using RNA-seq data published in previous studies of our laboratory group and identified 323 lincRNAs in porcine leg muscle. We found that these lincRNAs have shorter transcript length, fewer exons and lower expression level than protein-coding genes. Gene ontology and pathway analyses indicated that many potential target genes (PTGs) of lincRNAs were involved in skeletal-muscle-related processes, such as muscle contraction and muscle system process. Combined our previous studies, we found a potential regulatory mechanism in which the promoter methylation of lincRNAs can negatively regulate lincRNA expression and then positively regulate PTG expression, which can finally result in abnormal phenotypes of cloned piglets through a certain unknown pathway. This work detailed a number of lincRNAs and their target genes involved in skeletal muscle growth and development and can facilitate future studies on their roles in skeletal muscle growth and development.
Liang, Haihai; Zhao, Xiaoguang; Wang, Chengyu; Sun, Jian; Chen, Yingzhun; Wang, Guoyuan; Fang, Lei; Yang, Rui; Yu, Mengxue; Gu, Yunyan; Shan, Hongli
2018-06-21
A deeper mechanistic understanding of epithelial-to-mesenchymal transition (EMT) regulation is needed to improve current anti-metastasis strategies in ovarian cancer (OvCa). This study was designed to investigate the role of lncRNAs in EMT regulation during process of invasion-metastasis in serous OvCa to improve current anti-metastasis strategies for OvCa. We systematically analyzes high-throughput gene expression profiles of both lncRNAs and protein-coding genes in OvCa samples with integrated epithelial (iE) subtype and integrated mesenchymal (iM) subtype labels. Mouse models, cytobiology, molecular biology assays and clinical samples were performed to elucidate the function and underlying mechanisms of lncRNA PTAF-mediated promotion of EMT and invasion-metastasis in serous OvCa. We constructed a lncRNA-mediated competing endogenous RNA (ceRNA) regulatory network that affects the expression of many EMT-related protein-coding genes in mesenchymal OvCa. Using a combination of in vitro and in vivo studies, we provided evidence that the lncRNA PTAF-miR-25-SNAI2 axis controlled EMT in OvCa. Our results revealed that up-regulated PTAF induced elevated SNAI2 expression by competitively binding to miR-25, which in turn promoted OvCa cell EMT and invasion. Moreover, we found that silencing of PTAF inhibited tumor progression and metastasis in an orthotopic mouse model of OvCa. We then observed a significant correlation between PTAF expression and EMT markers in OvCa patients. The lncRNA PTAF, a mediator of TGF-β signaling, can predispose OvCa patients to metastases and may serve as a potential target for anti-metastatic therapies for mesenchymal OvCa patients.
Kramer, Marianne C.; Liang, Dongming; Tatomer, Deirdre C.; Gold, Beth; March, Zachary M.; Cherry, Sara; Wilusz, Jeremy E.
2015-01-01
Thousands of eukaryotic protein-coding genes are noncanonically spliced to produce circular RNAs. Bioinformatics has indicated that long introns generally flank exons that circularize in Drosophila, but the underlying mechanisms by which these circular RNAs are generated are largely unknown. Here, using extensive mutagenesis of expression plasmids and RNAi screening, we reveal that circularization of the Drosophila laccase2 gene is regulated by both intronic repeats and trans-acting splicing factors. Analogous to what has been observed in humans and mice, base-pairing between highly complementary transposable elements facilitates backsplicing. Long flanking repeats (∼400 nucleotides [nt]) promote circularization cotranscriptionally, whereas pre-mRNAs containing minimal repeats (<40 nt) generate circular RNAs predominately after 3′ end processing. Unlike the previously characterized Muscleblind (Mbl) circular RNA, which requires the Mbl protein for its biogenesis, we found that Laccase2 circular RNA levels are not controlled by Mbl or the Laccase2 gene product but rather by multiple hnRNP (heterogeneous nuclear ribonucleoprotein) and SR (serine–arginine) proteins acting in a combinatorial manner. hnRNP and SR proteins also regulate the expression of other Drosophila circular RNAs, including Plexin A (PlexA), suggesting a common strategy for regulating backsplicing. Furthermore, the laccase2 flanking introns support efficient circularization of diverse exons in Drosophila and human cells, providing a new tool for exploring the functional consequences of circular RNA expression across eukaryotes. PMID:26450910
Alcántara, Cristina; Sarmiento-Rubiano, Luz Adriana; Monedero, Vicente; Deutscher, Josef; Pérez-Martínez, Gaspar; Yebra, María J.
2008-01-01
Sequence analysis of the five genes (gutRMCBA) downstream from the previously described sorbitol-6-phosphate dehydrogenase-encoding Lactobacillus casei gutF gene revealed that they constitute a sorbitol (glucitol) utilization operon. The gutRM genes encode putative regulators, while the gutCBA genes encode the EIIC, EIIBC, and EIIA proteins of a phosphoenolpyruvate-dependent sorbitol phosphotransferase system (PTSGut). The gut operon is transcribed as a polycistronic gutFRMCBA messenger, the expression of which is induced by sorbitol and repressed by glucose. gutR encodes a transcriptional regulator with two PTS-regulated domains, a galactitol-specific EIIB-like domain (EIIBGat domain) and a mannitol/fructose-specific EIIA-like domain (EIIAMtl domain). Its inactivation abolished gut operon transcription and sorbitol uptake, indicating that it acts as a transcriptional activator. In contrast, cells carrying a gutB mutation expressed the gut operon constitutively, but they failed to transport sorbitol, indicating that EIIBCGut negatively regulates GutR. A footprint analysis showed that GutR binds to a 35-bp sequence upstream from the gut promoter. A sequence comparison with the presumed promoter region of gut operons from various firmicutes revealed a GutR consensus motif that includes an inverted repeat. The regulation mechanism of the L. casei gut operon is therefore likely to be operative in other firmicutes. Finally, gutM codes for a conserved protein of unknown function present in all sequenced gut operons. A gutM mutant, the first constructed in a firmicute, showed drastically reduced gut operon expression and sorbitol uptake, indicating a regulatory role also for GutM. PMID:18676710
Kantyka, Tomasz; Rawlings, Neil D.; Potempa, Jan
2010-01-01
In metazoan organisms protein inhibitors of peptidases are important factors essential for regulation of proteolytic activity. In vertebrates genes encoding peptidase inhibitors constitute up to 1% of genes reflecting a need for tight and specific control of proteolysis especially in extracellular body fluids. In stark contrast unicellular organisms, both prokaryotic and eukaryotic consistently contain only few, if any, genes coding for putative peptidase inhibitors. This may seem perplexing in the light of the fact that these organisms produce large numbers of proteases of different catalytic classes with the genes constituting up to 6% of the total gene count with the average being about 3%. Apparently, however, a unicellular life-style is fully compatible with other mechanisms of regulation of proteolysis and does not require protein inhibitors to control their intracellular and extracellular proteolytic activity. So in prokaryotes occurrence of genes encoding different types of peptidase inhibitors is infrequent and often scattered among phylogenetically distinct orders or even phyla of microbiota. Genes encoding proteins homologous to alpha-2-macroglobulin (family I39), serine carboxypeptidase Y inhibitor (family I51), alpha-1-peptidase inhibitor (family I4) and ecotin (family I11) are the most frequently represented in Bacteria. Although several of these gene products were shown to possess inhibitory activity, with an exception of ecotin and staphostatins, the biological function of microbial inhibitors is unclear. In this review we present distribution of protein inhibitors from different families among prokaryotes, describe their mode of action and hypothesize on their role in microbial physiology and interactions with hosts and environment. PMID:20558234
Mediator phosphorylation prevents stress response transcription during non-stress conditions.
Miller, Christian; Matic, Ivan; Maier, Kerstin C; Schwalb, Björn; Roether, Susanne; Strässer, Katja; Tresch, Achim; Mann, Matthias; Cramer, Patrick
2012-12-28
The multiprotein complex Mediator is a coactivator of RNA polymerase (Pol) II transcription that is required for the regulated expression of protein-coding genes. Mediator serves as an end point of signaling pathways and regulates Pol II transcription, but the mechanisms it uses are not well understood. Here, we used mass spectrometry and dynamic transcriptome analysis to investigate a functional role of Mediator phosphorylation in gene expression. Affinity purification and mass spectrometry revealed that Mediator from the yeast Saccharomyces cerevisiae is phosphorylated at multiple sites of 17 of its 25 subunits. Mediator phosphorylation levels change upon an external stimulus set by exposure of cells to high salt concentrations. Phosphorylated sites in the Mediator tail subunit Med15 are required for suppression of stress-induced changes in gene expression under non-stress conditions. Thus dynamic and differential Mediator phosphorylation contributes to gene regulation in eukaryotic cells.
Wu, Shengru; Liu, Yanli; Guo, Wei; Cheng, Xi; Ren, Xiaochun; Chen, Si; Li, Xueyuan; Duan, Yongle; Sun, Qingzhu; Yang, Xiaojun
2018-06-27
The liver is mainly hematopoietic in the embryo, and converts into a major metabolic organ in the adult. Therefore, it is intensively remodeled after birth to adapt and perform adult functions. Long non-coding RNAs (lncRNAs) are involved in organ development and cell differentiation, likely they have potential roles in regulating postnatal liver development. Herein, in order to understand the roles of lncRNAs in postnatal liver maturation, we analyzed the lncRNAs and mRNAs expression profiles in immature and mature livers from one-day-old and adult (40 weeks of age) breeder roosters by Ribo-Zero RNA-Sequencing. Around 21,939 protein-coding genes and 2220 predicted lncRNAs were expressed in livers of breeder roosters. Compared to protein-coding genes, the identified chicken lncRNAs shared fewer exons, shorter transcript length, and significantly lower expression levels. Notably, in comparison between the livers of newborn and adult breeder roosters, a total of 1570 mRNAs and 214 lncRNAs were differentially expressed with the criteria of log 2 fold change > 1 or < - 1 and P values < 0.05, which were validated by qPCR using randomly selected five mRNAs and five lncRNAs. Further GO and KEGG analyses have revealed that the differentially expressed mRNAs were involved in the hepatic metabolic and immune functional changes, as well as some biological processes and pathways including cell proliferation, apoptotic and cell cycle that are implicated in the development of liver. We also investigated the cis- and trans- regulatory effects of differentially expressed lncRNAs on its target genes. GO and KEGG analyses indicated that these lncRNAs had their neighbor protein coding genes and trans-regulated genes associated with adapting of adult hepatic functions, as well as some pathways involved in liver development, such as cell cycle pathway, Notch signaling pathway, Hedgehog signaling pathway, and Wnt signaling pathway. This study provides a catalog of mRNAs and lncRNAs related to postnatal liver maturation of chicken, and will contribute to a fuller understanding of biological processes or signaling pathways involved in significant functional transition during postnatal liver development that differentially expressed genes and lncRNAs could take part in.
Lavenu, A; Pistoi, S; Pournin, S; Babinet, C; Morello, D
1995-01-01
In vivo, the steady-state level of c-myc mRNA is mainly controlled by posttranscriptional mechanisms. Using a panel of transgenic mice in which various versions of the human c-myc proto-oncogene were under the control of major histocompatibility complex H-2Kb class I regulatory sequences, we have shown that the 5' and the 3' noncoding sequences are dispensable for obtaining a regulated expression of the transgene in adult quiescent tissues, at the start of liver regeneration, and after inhibition of protein synthesis. These results indicated that the coding sequences were sufficient to ensure a regulated c-myc expression. In the present study, we have pursued this analysis with transgenes containing one or the other of the two c-myc coding exons either alone or in association with the c-myc 3' untranslated region. We demonstrate that each of the exons contains determinants which control c-myc mRNA expression. Moreover, we show that in the liver, c-myc exon 2 sequences are able to down-regulate an otherwise stable H-2K mRNA when embedded within it and to induce its transient accumulation after cycloheximide treatment and soon after liver ablation. Finally, the use of transgenes with different coding capacities has allowed us to postulate that the primary mRNA sequence itself and not c-Myc peptides is an important component of c-myc posttranscriptional regulation. PMID:7623834
Sebaihia, Mohammed; Preston, Andrew; Maskell, Duncan J.; Kuzmiak, Holly; Connell, Terry D.; King, Natalie D.; Orndorff, Paul E.; Miyamoto, David M.; Thomson, Nicholas R.; Harris, David; Goble, Arlette; Lord, Angela; Murphy, Lee; Quail, Michael A.; Rutter, Simon; Squares, Robert; Squares, Steven; Woodward, John; Parkhill, Julian; Temple, Louise M.
2006-01-01
Bordetella avium is a pathogen of poultry and is phylogenetically distinct from Bordetella bronchiseptica, Bordetella pertussis, and Bordetella parapertussis, which are other species in the Bordetella genus that infect mammals. In order to understand the evolutionary relatedness of Bordetella species and further the understanding of pathogenesis, we obtained the complete genome sequence of B. avium strain 197N, a pathogenic strain that has been extensively studied. With 3,732,255 base pairs of DNA and 3,417 predicted coding sequences, it has the smallest genome and gene complement of the sequenced bordetellae. In this study, the presence or absence of previously reported virulence factors from B. avium was confirmed, and the genetic bases for growth characteristics were elucidated. Over 1,100 genes present in B. avium but not in B. bronchiseptica were identified, and most were predicted to encode surface or secreted proteins that are likely to define an organism adapted to the avian rather than the mammalian respiratory tracts. These include genes coding for the synthesis of a polysaccharide capsule, hemagglutinins, a type I secretion system adjacent to two very large genes for secreted proteins, and unique genes for both lipopolysaccharide and fimbrial biogenesis. Three apparently complete prophages are also present. The BvgAS virulence regulatory system appears to have polymorphisms at a poly(C) tract that is involved in phase variation in other bordetellae. A number of putative iron-regulated outer membrane proteins were predicted from the sequence, and this regulation was confirmed experimentally for five of these. PMID:16885469
Center for Regenerative Biology and Medicine at Mount Desert Island Biological Laboratory
2012-06-01
Code Axolotl microRNAs Zebrafish Polypterus 16. SECURITY CLASSIFICATION OF: 17. LIMITATION OF...controlled in both Polypterus and axolotl samples. These comparisons revealed a total of 2779 shared genes that are significantly upregulated during...UPREGULATED DOWNREGULATED Figure 1: Venn diagram of UniProt protein sequence IDs among Axolotl and Polypterus contigs that were up-regulated
Decoding the Long Noncoding RNA During Cardiac Maturation: A Roadmap for Functional Discovery.
Touma, Marlin; Kang, Xuedong; Zhao, Yan; Cass, Ashley A; Gao, Fuying; Biniwale, Reshma; Coppola, Giovanni; Xiao, Xinshu; Reemtsen, Brian; Wang, Yibin
2016-10-01
Cardiac maturation during perinatal transition of heart is critical for functional adaptation to hemodynamic load and nutrient environment. Perturbation in this process has major implications in congenital heart defects. Transcriptome programming during perinatal stages is an important information but incomplete in current literature, particularly, the expression profiles of the long noncoding RNAs (lncRNAs) are not fully elucidated. From comprehensive analysis of transcriptomes derived from neonatal mouse heart left and right ventricles, a total of 45 167 unique transcripts were identified, including 21 916 known and 2033 novel lncRNAs. Among these lncRNAs, 196 exhibited significant dynamic regulation along maturation process. By implementing parallel weighted gene co-expression network analysis of mRNA and lncRNA data sets, several lncRNA modules coordinately expressed in a developmental manner similar to protein coding genes, while few lncRNAs revealed chamber-specific patterns. Out of 2262 lncRNAs located within 50 kb of protein coding genes, 5% significantly correlate with the expression of their neighboring genes. The impact of Ppp1r1b-lncRNA on the corresponding partner gene Tcap was validated in cultured myoblasts. This concordant regulation was also conserved in human infantile hearts. Furthermore, the Ppp1r1b-lncRNA/Tcap expression ratio was identified as a molecular signature that differentiated congenital heart defect phenotypes. The study provides the first high-resolution landscape on neonatal cardiac lncRNAs and reveals their potential interaction with mRNA transcriptome during cardiac maturation. Ppp1r1b-lncRNA was identified as a regulator of Tcap expression, with dynamic interaction in postnatal cardiac development and congenital heart defects. © 2016 American Heart Association, Inc.
Xiong, Changyan; Li, Xuejiao; Liu, Juanli; Zhao, Xin; Xu, Shungao; Huang, Xinxiang
2018-01-01
Antisense RNAs from complementary strands of protein coding genes regulate the expression of genes involved in many cellular processes. Using deep sequencing analysis of the Salmonella enterica serovar Typhi ( S. Typhi) transcriptome, a novel antisense RNA encoded on the strand complementary to the rpoH gene was revealed. In this study, the molecular features of this antisense RNA were assessed using northern blotting and rapid amplification of cDNA ends. The 3,508 nt sequence of RNA was identified as the antisense RNA of the rpoH gene and was named ArpH. ArpH was found to attenuate the invasion of HeLa cells by S. Typhi by regulating the expression of SPI-1 genes. In an rpoH mutant strain, the invasive capacity of S. Typhi was increased, whereas overexpression of ArpH positively regulates rpoH mRNA levels. Results of this study suggest that the cis -encoded antisense RNA ArpH is likely to affect the invasive capacity of S. Typhi by regulating the expression of rpoH .
Schnable, James C; Pedersen, Brent S; Subramaniam, Sabarinath; Freeling, Michael
2011-01-01
Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein-protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein-protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose-sensitive protein-DNA interactions between the regulatory regions of CNS-rich genes - nicknamed bigfoot genes - and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy.
Argonaute: The executor of small RNA function.
Azlan, Azali; Dzaki, Najat; Azzam, Ghows
2016-08-20
The discovery of small non-coding RNAs - microRNA (miRNA), short interfering RNA (siRNA) and PIWI-interacting RNA (piRNA) - represents one of the most exciting frontiers in biology specifically on the mechanism of gene regulation. In order to execute their functions, these small RNAs require physical interactions with their protein partners, the Argonaute (AGO) family proteins. Over the years, numerous studies have made tremendous progress on understanding the roles of AGO in gene silencing in various organisms. In this review, we summarize recent progress of AGO-mediated gene silencing and other cellular processes in which AGO proteins have been implicated with a particular focus on progress made in flies, humans and other model organisms as compliment. Copyright © 2016 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. Published by Elsevier Ltd. All rights reserved.
Hücker, Sarah M.; Ardern, Zachary; Goldberg, Tatyana; Schafferhans, Andrea; Bernhofer, Michael; Vestergaard, Gisle; Nelson, Chase W.; Schloter, Michael; Rost, Burkhard; Scherer, Siegfried
2017-01-01
In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Transcriptome sequencing (RNAseq) signals outside of annotated genes have usually been interpreted to indicate either ncRNA or pervasive transcription. Therefore, in addition to the transcriptome, the translatome (RIBOseq) of the enteric pathogen Escherichia coli O157:H7 strain Sakai was determined at two optimal growth conditions and a severe stress condition combining low temperature and high osmotic pressure. All intergenic open reading frames potentially encoding a protein of ≥ 30 amino acids were investigated with regard to coverage by transcription and translation signals and their translatability expressed by the ribosomal coverage value. This led to discovery of 465 unique, putative novel genes not yet annotated in this E. coli strain, which are evenly distributed over both DNA strands of the genome. For 255 of the novel genes, annotated homologs in other bacteria were found, and a machine-learning algorithm, trained on small protein-coding E. coli genes, predicted that 89% of these translated open reading frames represent bona fide genes. The remaining 210 putative novel genes without annotated homologs were compared to the 255 novel genes with homologs and to 250 short annotated genes of this E. coli strain. All three groups turned out to be similar with respect to their translatability distribution, fractions of differentially regulated genes, secondary structure composition, and the distribution of evolutionary constraint, suggesting that both novel groups represent legitimate genes. However, the machine-learning algorithm only recognized a small fraction of the 210 genes without annotated homologs. It is possible that these genes represent a novel group of genes, which have unusual features dissimilar to the genes of the machine-learning algorithm training set. PMID:28902868
Liu, Cui; Yu, Yanbao; Liu, Feng; Wei, Xin; Wrobel, John A.; Gunawardena, Harsha P.; Zhou, Li; Jin, Jian; Chen, Xian
2015-01-01
Immune cells develop endotoxin tolerance (ET) after prolonged stimulation. ET increases the level of a repression mark H3K9me2 in the transcriptional-silent chromatin specifically associated with pro-inflammatory genes. However, it is not clear what proteins are functionally involved in this process. Here we show that a novel chromatin activity based chemoproteomic (ChaC) approach can dissect the functional chromatin protein complexes that regulate ET-associated inflammation. Using UNC0638 that binds the enzymatically active H3K9-specific methyltransferase G9a/GLP, ChaC reveals that G9a is constitutively active at a G9a-dependent mega-dalton repressome in primary endotoxin-tolerant macrophages. G9a/GLP broadly impacts the ET-specific reprogramming of the histone code landscape, chromatin remodeling, and the activities of select transcription factors. We discover that the G9a-dependent epigenetic environment promotes the transcriptional repression activity of c-Myc for gene-specific co-regulation of chronic inflammation. ChaC may be also applicable to dissect other functional protein complexes in the context of phenotypic chromatin architectures. PMID:25502336
SMN control of RNP assembly: from post-transcriptional gene regulation to motor neuron disease
Li, Darrick K.; Tisdale, Sarah; Lotti, Francesco; Pellizzoni, Livio
2014-01-01
At the post-transcriptional level, expression of protein-coding genes is controlled by a series of RNA regulatory events including nuclear processing of primary transcripts, transport of mature mRNAs to specific cellular compartments, translation and ultimately, turnover. These processes are orchestrated through the dynamic association of mRNAs with RNA binding proteins and ribonucleoprotein (RNP) complexes. Accurate formation of RNPs in vivo is fundamentally important to cellular development and function, and its impairment often leads to human disease. The survival motor neuron (SMN) protein is key to this biological paradigm: SMN is essential for the biogenesis of various RNPs that function in mRNA processing, and genetic mutations leading to SMN deficiency cause the neurodegenerative disease spinal muscular atrophy. Here we review the expanding role of SMN in the regulation of gene expression through its multiple functions in RNP assembly. We discuss advances in our understanding of SMN activity as a chaperone of RNPs and how disruption of SMN-dependent RNA pathways can cause motor neuron disease. PMID:24769255
Cheah, Y K; Cheng, R W; Yeap, S K; Khoo, C H; See, H S
2014-03-17
The identification of new biomarkers for early detection of highly recurrent head and neck cancer is urgently needed. MicroRNAs (miRNAs) are small and non-coding RNAs that regulate cancer-related gene expression, such as tumor protein 53 (TP53) gene expression. This study was carried out to analyze TP53 gene expression using real-time PCR and to determine changes in intracellular p53 level by flow cytometry after downregulation of miRNA-181a miRNA inhibitor in the FaDu cell line. TP53 gene expression showed a 3-fold increment and the p53 protein level was also increased in the miRNA-181a-treated cells. In conclusion, miRNA-181a binds to the TP53 gene and inhibits its expression, decreasing the synthesis of p53.
Adaptive evolution of the matrix extracellular phosphoglycoprotein in mammals
2011-01-01
Background Matrix extracellular phosphoglycoprotein (MEPE) belongs to a family of small integrin-binding ligand N-linked glycoproteins (SIBLINGs) that play a key role in skeleton development, particularly in mineralization, phosphate regulation and osteogenesis. MEPE associated disorders cause various physiological effects, such as loss of bone mass, tumors and disruption of renal function (hypophosphatemia). The study of this developmental gene from an evolutionary perspective could provide valuable insights on the adaptive diversification of morphological phenotypes in vertebrates. Results Here we studied the adaptive evolution of the MEPE gene in 26 Eutherian mammals and three birds. The comparative genomic analyses revealed a high degree of evolutionary conservation of some coding and non-coding regions of the MEPE gene across mammals indicating a possible regulatory or functional role likely related with mineralization and/or phosphate regulation. However, the majority of the coding region had a fast evolutionary rate, particularly within the largest exon (1467 bp). Rodentia and Scandentia had distinct substitution rates with an increased accumulation of both synonymous and non-synonymous mutations compared with other mammalian lineages. Characteristics of the gene (e.g. biochemical, evolutionary rate, and intronic conservation) differed greatly among lineages of the eight mammalian orders. We identified 20 sites with significant positive selection signatures (codon and protein level) outside the main regulatory motifs (dentonin and ASARM) suggestive of an adaptive role. Conversely, we find three sites under selection in the signal peptide and one in the ASARM motif that were supported by at least one selection model. The MEPE protein tends to accumulate amino acids promoting disorder and potential phosphorylation targets. Conclusion MEPE shows a high number of selection signatures, revealing the crucial role of positive selection in the evolution of this SIBLING member. The selection signatures were found mainly outside the functional motifs, reinforcing the idea that other regions outside the dentonin and the ASARM might be crucial for the function of the protein and future studies should be undertaken to understand its importance. PMID:22103247
Fu, Lijuan; Shi, Zhimin; Luo, Guanzheng; Tu, Weihong; Wang, XiuJie; Fang, Zhide; Li, XiaoChing
2014-10-01
Mutations in the human FOXP2 gene cause speech and language impairments. The FOXP2 protein is a transcription factor that regulates the expression of many downstream genes, which may have important roles in nervous system development and function. An adequate amount of functional FOXP2 protein is thought to be critical for the proper development of the neural circuitry underlying speech and language. However, how FOXP2 gene expression is regulated is not clearly understood. The FOXP2 mRNA has an approximately 4-kb-long 3' untranslated region (3' UTR), twice as long as its protein coding region, indicating that FOXP2 can be regulated by microRNAs (miRNAs). We identified multiple miRNAs that regulate the expression of the human FOXP2 gene using sequence analysis and in vitro cell systems. Focusing on let-7a, miR-9, and miR-129-5p, three brain-enriched miRNAs, we show that these miRNAs regulate human FOXP2 expression in a dosage-dependent manner and target specific sequences in the FOXP2 3' UTR. We further show that these three miRNAs are expressed in the cerebellum of the human fetal brain, where FOXP2 is known to be expressed. Our results reveal novel regulatory functions of the human FOXP2 3' UTR sequence and regulatory interactions between multiple miRNAs and the human FOXP2 gene. The expression of let-7a, miR-9, and miR-129-5p in the human fetal cerebellum is consistent with their roles in regulating FOXP2 expression during early cerebellum development. These results suggest that various genetic and environmental factors may contribute to speech and language development and related neural developmental disorders via the miRNA-FOXP2 regulatory network.
Shaw, Joseph R; Colbourne, John K; Davey, Jennifer C; Glaholt, Stephen P; Hampton, Thomas H; Chen, Celia Y; Folt, Carol L; Hamilton, Joshua W
2007-12-21
Genomic research tools such as microarrays are proving to be important resources to study the complex regulation of genes that respond to environmental perturbations. A first generation cDNA microarray was developed for the environmental indicator species Daphnia pulex, to identify genes whose regulation is modulated following exposure to the metal stressor cadmium. Our experiments revealed interesting changes in gene transcription that suggest their biological roles and their potentially toxicological features in responding to this important environmental contaminant. Our microarray identified genes reported in the literature to be regulated in response to cadmium exposure, suggested functional attributes for genes that share no sequence similarity to proteins in the public databases, and pointed to genes that are likely members of expanded gene families in the Daphnia genome. Genes identified on the microarray also were associated with cadmium induced phenotypes and population-level outcomes that we experimentally determined. A subset of genes regulated in response to cadmium exposure was independently validated using quantitative-realtime (Q-RT)-PCR. These microarray studies led to the discovery of three genes coding for the metal detoxication protein metallothionein (MT). The gene structures and predicted translated sequences of D. pulex MTs clearly place them in this gene family. Yet, they share little homology with previously characterized MTs. The genomic information obtained from this study represents an important first step in characterizing microarray patterns that may be diagnostic to specific environmental contaminants and give insights into their toxicological mechanisms, while also providing a practical tool for evolutionary, ecological, and toxicological functional gene discovery studies. Advances in Daphnia genomics will enable the further development of this species as a model organism for the environmental sciences.
Shaw, Joseph R; Colbourne, John K; Davey, Jennifer C; Glaholt, Stephen P; Hampton, Thomas H; Chen, Celia Y; Folt, Carol L; Hamilton, Joshua W
2007-01-01
Background Genomic research tools such as microarrays are proving to be important resources to study the complex regulation of genes that respond to environmental perturbations. A first generation cDNA microarray was developed for the environmental indicator species Daphnia pulex, to identify genes whose regulation is modulated following exposure to the metal stressor cadmium. Our experiments revealed interesting changes in gene transcription that suggest their biological roles and their potentially toxicological features in responding to this important environmental contaminant. Results Our microarray identified genes reported in the literature to be regulated in response to cadmium exposure, suggested functional attributes for genes that share no sequence similarity to proteins in the public databases, and pointed to genes that are likely members of expanded gene families in the Daphnia genome. Genes identified on the microarray also were associated with cadmium induced phenotypes and population-level outcomes that we experimentally determined. A subset of genes regulated in response to cadmium exposure was independently validated using quantitative-realtime (Q-RT)-PCR. These microarray studies led to the discovery of three genes coding for the metal detoxication protein metallothionein (MT). The gene structures and predicted translated sequences of D. pulex MTs clearly place them in this gene family. Yet, they share little homology with previously characterized MTs. Conclusion The genomic information obtained from this study represents an important first step in characterizing microarray patterns that may be diagnostic to specific environmental contaminants and give insights into their toxicological mechanisms, while also providing a practical tool for evolutionary, ecological, and toxicological functional gene discovery studies. Advances in Daphnia genomics will enable the further development of this species as a model organism for the environmental sciences. PMID:18154678
Ramesh, S V
2013-09-01
Of late non-coding RNAs (ncRNAs)-mediated gene silencing is an influential tool deliberately deployed to negatively regulate the expression of targeted genes. In addition to the widely employed small interfering RNA (siRNA)-mediated gene silencing approach, other variants like artificial miRNA (amiRNA), miRNA mimics, and artificial transacting siRNAs (tasiRNAs) are being explored and successfully deployed in developing non-coding RNA-based genetically modified plants. The ncRNA-based gene manipulations are typified with mobile nature of silencing signals, interference from viral genome-derived suppressor proteins, and an obligation for meticulous computational analysis to prevaricate any inadvertent effects. In a broad sense, risk assessment inquiries for genetically modified plants based on the expression of ncRNAs are competently addressed by the environmental risk assessment (ERA) models, currently in vogue, designed for the first generation transgenic plants which are based on the expression of heterologous proteins. Nevertheless, transgenic plants functioning on the foundation of ncRNAs warrant due attention with respect to their unique attributes like off-target or non-target gene silencing effects, small RNAs (sRNAs) persistence, food and feed safety assessments, problems in detection and tracking of sRNAs in food, impact of ncRNAs in plant protection measures, effect of mutations etc. The role of recent developments in sequencing techniques like next generation sequencing (NGS) and the ERA paradigm of the different countries in vogue are also discussed in the context of ncRNA-based gene manipulations.
Synaptic, transcriptional and chromatin genes disrupted in autism.
De Rubeis, Silvia; He, Xin; Goldberg, Arthur P; Poultney, Christopher S; Samocha, Kaitlin; Cicek, A Erucment; Kou, Yan; Liu, Li; Fromer, Menachem; Walker, Susan; Singh, Tarinder; Klei, Lambertus; Kosmicki, Jack; Shih-Chen, Fu; Aleksic, Branko; Biscaldi, Monica; Bolton, Patrick F; Brownfeld, Jessica M; Cai, Jinlu; Campbell, Nicholas G; Carracedo, Angel; Chahrour, Maria H; Chiocchetti, Andreas G; Coon, Hilary; Crawford, Emily L; Curran, Sarah R; Dawson, Geraldine; Duketis, Eftichia; Fernandez, Bridget A; Gallagher, Louise; Geller, Evan; Guter, Stephen J; Hill, R Sean; Ionita-Laza, Juliana; Jimenz Gonzalez, Patricia; Kilpinen, Helena; Klauck, Sabine M; Kolevzon, Alexander; Lee, Irene; Lei, Irene; Lei, Jing; Lehtimäki, Terho; Lin, Chiao-Feng; Ma'ayan, Avi; Marshall, Christian R; McInnes, Alison L; Neale, Benjamin; Owen, Michael J; Ozaki, Noriio; Parellada, Mara; Parr, Jeremy R; Purcell, Shaun; Puura, Kaija; Rajagopalan, Deepthi; Rehnström, Karola; Reichenberg, Abraham; Sabo, Aniko; Sachse, Michael; Sanders, Stephan J; Schafer, Chad; Schulte-Rüther, Martin; Skuse, David; Stevens, Christine; Szatmari, Peter; Tammimies, Kristiina; Valladares, Otto; Voran, Annette; Li-San, Wang; Weiss, Lauren A; Willsey, A Jeremy; Yu, Timothy W; Yuen, Ryan K C; Cook, Edwin H; Freitag, Christine M; Gill, Michael; Hultman, Christina M; Lehner, Thomas; Palotie, Aaarno; Schellenberg, Gerard D; Sklar, Pamela; State, Matthew W; Sutcliffe, James S; Walsh, Christiopher A; Scherer, Stephen W; Zwick, Michael E; Barett, Jeffrey C; Cutler, David J; Roeder, Kathryn; Devlin, Bernie; Daly, Mark J; Buxbaum, Joseph D
2014-11-13
The genetic architecture of autism spectrum disorder involves the interplay of common and rare variants and their impact on hundreds of genes. Using exome sequencing, here we show that analysis of rare coding variation in 3,871 autism cases and 9,937 ancestry-matched or parental controls implicates 22 autosomal genes at a false discovery rate (FDR) < 0.05, plus a set of 107 autosomal genes strongly enriched for those likely to affect risk (FDR < 0.30). These 107 genes, which show unusual evolutionary constraint against mutations, incur de novo loss-of-function mutations in over 5% of autistic subjects. Many of the genes implicated encode proteins for synaptic formation, transcriptional regulation and chromatin-remodelling pathways. These include voltage-gated ion channels regulating the propagation of action potentials, pacemaking and excitability-transcription coupling, as well as histone-modifying enzymes and chromatin remodellers-most prominently those that mediate post-translational lysine methylation/demethylation modifications of histones.
2014-01-01
Background The genome is pervasively transcribed but most transcripts do not code for proteins, constituting non-protein-coding RNAs. Despite increasing numbers of functional reports of individual long non-coding RNAs (lncRNAs), assessing the extent of functionality among the non-coding transcriptional output of mammalian cells remains intricate. In the protein-coding world, transcripts differentially expressed in the context of processes essential for the survival of multicellular organisms have been instrumental in the discovery of functionally relevant proteins and their deregulation is frequently associated with diseases. We therefore systematically identified lncRNAs expressed differentially in response to oncologically relevant processes and cell-cycle, p53 and STAT3 pathways, using tiling arrays. Results We found that up to 80% of the pathway-triggered transcriptional responses are non-coding. Among these we identified very large macroRNAs with pathway-specific expression patterns and demonstrated that these are likely continuous transcripts. MacroRNAs contain elements conserved in mammals and sauropsids, which in part exhibit conserved RNA secondary structure. Comparing evolutionary rates of a macroRNA to adjacent protein-coding genes suggests a local action of the transcript. Finally, in different grades of astrocytoma, a tumor disease unrelated to the initially used cell lines, macroRNAs are differentially expressed. Conclusions It has been shown previously that the majority of expressed non-ribosomal transcripts are non-coding. We now conclude that differential expression triggered by signaling pathways gives rise to a similar abundance of non-coding content. It is thus unlikely that the prevalence of non-coding transcripts in the cell is a trivial consequence of leaky or random transcription events. PMID:24594072
NASA Technical Reports Server (NTRS)
Patil, Shameekumar; Takezawa, D.; Poovaiah, B. W.
1995-01-01
Calcium, a universal second messenger, regulates diverse cellular processes in eukaryotes. Ca-2(+) and Ca-2(+)/calmodulin-regulated protein phosphorylation play a pivotal role in amplifying and diversifying the action of Ca-2(+)- mediated signals. A chimeric Ca-2(+)/calmodulin-dependent protein kinase (CCaMK) gene with a visinin-like Ca-2(+)- binding domain was cloned and characterized from lily. The cDNA clone contains an open reading frame coding for a protein of 520 amino acids. The predicted structure of CCaMK contains a catalytic domain followed by two regulatory domains, a calmodulin-binding domain and a visinin-like Ca-2(+)-binding domain. The amino-terminal region of CCaMK contains all 11 conserved subdomains characteristic of serine/threonine protein kinases. The calmodulin-binding region of CCaMK has high homology (79%) to alpha subunit of mammalian Ca-2(+)/calmodulin-dependent protein kinase. The calmodulin-binding region is fused to a neural visinin-like domain that contains three Ca-2(+)-binding EF-hand motifs and a biotin-binding site. The Escherichia coli-expressed protein (approx. 56 kDa) binds calmodulin in a Ca-2(+)-dependent manner. Furthermore, Ca-45-binding assays revealed that CCaMK directly binds Ca-2(+). The CCaMK gene is preferentially expressed in developing anthers. Southern blot analysis revealed that CCaMK is encoded by a single gene. The structural features of the gene suggest that it has multiple regulatory controls and could play a unique role in Ca-2(+) signaling in plants.
Cellular miR-2909 RNomics governs the genes that ensure immune checkpoint regulation.
Kaul, Deepak; Malik, Deepti; Wani, Sameena
2018-06-20
Cross-talk between coding RNAs and regulatory non-coding microRNAs, within human genome, has provided compelling evidence for the existence of flexible checkpoint control of T-Cell activation. The present study attempts to demonstrate that the interplay between miR-2909 and its effector KLF4 gene has the inherent capacity to regulate genes coding for CTLA4, CD28, CD40, CD134, PDL1, CD80, CD86, IL-6 and IL-10 within normal human peripheral blood mononuclear cells (PBMCs). Based upon these findings, we propose a pathway that links miR-2909 RNomics with the genes coding for immune checkpoint regulators required for the maintenance of immune homeostasis.
Effects of MicroRNA-23a on Differentiation and Gene Expression Profiles in 3T3-L1 Adipocytes
Huang, Yong; Huang, Jinxiu; Qi, Renli; Wang, Qi; Wu, Yongjiang; Wang, Jing
2016-01-01
MicroRNAs (miRNAs) are small non-coding RNA molecules that regulate growth, development, and programmed death of cells. A newly-published study has shown that miRNA-23a could regulate 3T3-L1 adipocyte differentiation. Here, we identified miRNA-23a as a negative regulator of 3T3-L1 adipocyte differentiation again. Over-expression of miRNA-23a inhibited differentiation and decreased lipogenesis as well as down-regulated mRNA and protein expression of both peroxisome proliferator-activated receptor (PPAR) γ and fatty acid binding protein (FABP) 4, whereas knock down of miRNA-23a showed the opposite effects on differentiation as well as increasing the number of apoptotic cells. Additionally, digital gene expression profiling sequencing (DGE-Seq) was used to assay changes in gene expression profiles following alterations in the level of miR-23a. In total, over-expression or knock down of miRNA-23a significantly changed the expression of 313 and 425 genes, respectively. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses indicated that these genes were mainly involved in the stress response, immune system, metabolism, cell cycle, among other pathways. Additionally, the signal transducer and activator of transcription 1 (Stat1) was shown to be a target of miRNA-23a by computational and dual-luciferase reporter assays that indicated Janus Kinase (Jak)-Stat signal pathway was implicated in regulating adipogenesis mediated by miRNA-23a in adipocytes. PMID:27783036
Singh, Manish K; Tiwari, Pramod K
2016-08-01
Hsp27, a highly conserved small molecular weight heat shock protein, is widely known to be developmentally regulated and heat inducible. Its role in thermotolerance is also implicated. This study is a sequel of our earlier studies to understand the molecular organization of heat shock genes/proteins and their role in development and thermal adaptation in a sheep pest, Lucilia cuprina (blowfly), which exhibits unusually high adaptability to a variety of environmental stresses, including heat and chemicals. In this report our aim was to understand the evolutionary relationship of Lucilia hsp27 gene/protein with those of other species and its role in thermal adaptation. We sequence characterized the Lchsp27 gene (coding region) and analyzed its expression in various larval and adult tissues under normal as well as heat shock conditions. The nucleotide sequence analysis of 678 bps long-coding region of Lchsp27 exhibited closest evolutionary proximity with Drosophila (90.09%), which belongs to the same order, Diptera. Heat shock caused significant enhancement in the expression of Lchsp27 gene in all the larval and adult tissues examined, however, in a tissue specific manner. Significantly, in Malpighian tubules, while the heat-induced level of hsp27 transcript (mRNA) appeared increased as compared to control, the protein level remained unaltered and nuclear localized. We infer that Lchsp27 may have significant role in the maintenance of cellular homeostasis, particularly, during summer months, when the fly remains exposed to high heat in its natural habitat. © 2015 Institute of Zoology, Chinese Academy of Sciences.
Schyth, Brian Dall; Bela-ong, Dennis Berbulla; Jalali, Seyed Amir Hossein; Kristensen, Lasse Bøgelund Juel; Einer-Jensen, Katja; Pedersen, Finn Skou; Lorenzen, Niels
2015-01-01
MicroRNAs (miRNAs) are ~22 base pair-long non-coding RNAs which regulate gene expression in the cytoplasm of eukaryotic cells by binding to specific target regions in mRNAs to mediate transcriptional blocking or mRNA cleavage. Through their fundamental roles in cellular pathways, gene regulation mediated by miRNAs has been shown to be involved in almost all biological phenomena, including development, metabolism, cell cycle, tumor formation, and host-pathogen interactions. To address the latter in a primitive vertebrate host, we here used an array platform to analyze the miRNA response in rainbow trout (Oncorhynchus mykiss) following inoculation with the virulent fish rhabdovirus Viral hemorrhagic septicaemia virus. Two clustered miRNAs, miR-462 and miR-731 (herein referred to as miR-462 cluster), described only in teleost fishes, were found to be strongly upregulated, indicating their involvement in fish-virus interactions. We searched for homologues of the two teleost miRNAs in other vertebrate species and investigated whether findings related to ours have been reported for these homologues. Gene synteny analysis along with gene sequence conservation suggested that the teleost fish miR-462 and miR-731 had evolved from the ancestral miR-191 and miR-425 (herein called miR-191 cluster), respectively. Whereas the miR-462 cluster locus is found between two protein-coding genes (intergenic) in teleost fish genomes, the miR-191 cluster locus is found within an intron of a protein-coding gene (intragenic) in the human genome. Interferon (IFN)-inducible and immune-related promoter elements found upstream of the teleost miR-462 cluster locus suggested roles in immune responses to viral pathogens in fish, while in humans, the miR-191 cluster functionally associated with cell cycle regulation. Stimulation of fish cell cultures with the IFN inducer poly I:C accordingly upregulated the expression of miR-462 and miR-731, while no stimulatory effect on miR-191 and miR-425 expression was observed in human cell lines. Despite high sequence conservation, evolution has thus resulted in different regulation and presumably also different functional roles of these orthologous miRNA clusters in different vertebrate lineages. PMID:26207374
Maize GO annotation—methods, evaluation, and review (maize-GAMER)
USDA-ARS?s Scientific Manuscript database
We created a new high-coverage, robust, and reproducible functional annotation of maize protein-coding genes based on Gene Ontology (GO) term assignments. Whereas the existing Phytozome and Gramene maize GO annotation sets only cover 41% and 56% of maize protein-coding genes, respectively, this stu...
mRNA stability in mammalian cells.
Ross, J
1995-01-01
This review concerns how cytoplasmic mRNA half-lives are regulated and how mRNA decay rates influence gene expression. mRNA stability influences gene expression in virtually all organisms, from bacteria to mammals, and the abundance of a particular mRNA can fluctuate manyfold following a change in the mRNA half-life, without any change in transcription. The processes that regulate mRNA half-lives can, in turn, affect how cells grow, differentiate, and respond to their environment. Three major questions are addressed. Which sequences in mRNAs determine their half-lives? Which enzymes degrade mRNAs? Which (trans-acting) factors regulate mRNA stability, and how do they function? The following specific topics are discussed: techniques for measuring eukaryotic mRNA stability and for calculating decay constants, mRNA decay pathways, mRNases, proteins that bind to sequences shared among many mRNAs [like poly(A)- and AU-rich-binding proteins] and proteins that bind to specific mRNAs (like the c-myc coding-region determinant-binding protein), how environmental factors like hormones and growth factors affect mRNA stability, and how translation and mRNA stability are linked. Some perspectives and predictions for future research directions are summarized at the end. PMID:7565413
Yıldırım, Kubilay; Kaya, Zeki
2017-06-01
Drought is the major environmental problem limiting the productivity and survival of plant species. Here, previously identified three black poplar genotypes having contrasting response to drought were subjected to gradual soil water depletion in a pot trial to identify their physiological, morphological and antioxidation related adaptations. We also performed a microarray based transcriptome analyses on the leaves of genotypes by using Affymetrix poplar Genome Array containing 56,000 transcripts. Phenotypic analyses of each genotype confirmed their differential adaptations to drought that could be classified as drought escape, avoidance and tolerance. Comparative transcriptomic analysis indicated highly divergent gene expression patterns among the genotypes in response to drought and post drought re-watering (PDR). We identified 10641, 3824 and 9411 transcripts exclusively regulated in drought escape, avoidance and tolerant genotypes, respectively. The key genes involved in metabolic pathways, such as carbohydrate metabolism, photosynthesis, lipid metabolism, generation of precursor metabolites/energy, protein folding, redox homeostasis, secondary metabolic process and cell wall component biogenesis, were affected by drought stresses in the leaves of these genotypes. Transcript isoforms showed increased expression specificity in the genes coding for bark storage proteins and small heat shock proteins in drought tolerant genotype. On the other hand, drought-avoiding genotype specifically induced the transcripts annotated to the genes functional in secondary metabolite production that linked to enhanced leaf water content and growth performance under drought stress. Transcriptome profiling of drought escape genotype indicated specific regulation of the genes functional in programmed cell death and leaf senescence. Specific upregulation of GTP cyclohydrolase II and transcription factors (WRKY and ERFs) in only this genotype were associated to ROS dependent signalling pathways and gene regulation network responsible in induction of many degrading enzymes acting on cell wall carbohydrates, fatty acids and proteins under drought stress. Our findings provide new insights into the transcriptome dynamics and components of regulatory network associated with drought adaptation strategies. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Genome-Wide Identification and Characterization of WRKY Gene Family in Peanut.
Song, Hui; Wang, Pengfei; Lin, Jer-Young; Zhao, Chuanzhi; Bi, Yuping; Wang, Xingjun
2016-01-01
WRKY, an important transcription factor family, is widely distributed in the plant kingdom. Many reports focused on analysis of phylogenetic relationship and biological function of WRKY protein at the whole genome level in different plant species. However, little is known about WRKY proteins in the genome of Arachis species and their response to salicylic acid (SA) and jasmonic acid (JA) treatment. In this study, we identified 77 and 75 WRKY proteins from the two wild ancestral diploid genomes of cultivated tetraploid peanut, Arachis duranensis and Arachis ipaënsis, using bioinformatics approaches. Most peanut WRKY coding genes were located on A. duranensis chromosome A6 and A. ipaënsis chromosome B3, while the least number of WRKY genes was found in chromosome 9. The WRKY orthologous gene pairs in A. duranensis and A. ipaënsis chromosomes were highly syntenic. Our analysis indicated that segmental duplication events played a major role in AdWRKY and AiWRKY genes, and strong purifying selection was observed in gene duplication pairs. Furthermore, we translate the knowledge gained from the genome-wide analysis result of wild ancestral peanut to cultivated peanut to reveal that gene activities of specific cultivated peanut WRKY gene were changed due to SA and JA treatment. Peanut WRKY7, 8 and 13 genes were down-regulated, whereas WRKY1 and 12 genes were up-regulated with SA and JA treatment. These results could provide valuable information for peanut improvement.
Genome-Wide Identification and Characterization of WRKY Gene Family in Peanut
Song, Hui; Wang, Pengfei; Lin, Jer-Young; Zhao, Chuanzhi; Bi, Yuping; Wang, Xingjun
2016-01-01
WRKY, an important transcription factor family, is widely distributed in the plant kingdom. Many reports focused on analysis of phylogenetic relationship and biological function of WRKY protein at the whole genome level in different plant species. However, little is known about WRKY proteins in the genome of Arachis species and their response to salicylic acid (SA) and jasmonic acid (JA) treatment. In this study, we identified 77 and 75 WRKY proteins from the two wild ancestral diploid genomes of cultivated tetraploid peanut, Arachis duranensis and Arachis ipaënsis, using bioinformatics approaches. Most peanut WRKY coding genes were located on A. duranensis chromosome A6 and A. ipaënsis chromosome B3, while the least number of WRKY genes was found in chromosome 9. The WRKY orthologous gene pairs in A. duranensis and A. ipaënsis chromosomes were highly syntenic. Our analysis indicated that segmental duplication events played a major role in AdWRKY and AiWRKY genes, and strong purifying selection was observed in gene duplication pairs. Furthermore, we translate the knowledge gained from the genome-wide analysis result of wild ancestral peanut to cultivated peanut to reveal that gene activities of specific cultivated peanut WRKY gene were changed due to SA and JA treatment. Peanut WRKY7, 8 and 13 genes were down-regulated, whereas WRKY1 and 12 genes were up-regulated with SA and JA treatment. These results could provide valuable information for peanut improvement. PMID:27200012
Codon influence on protein expression in E. coli correlates with mRNA levels
Boël, Grégory; Wong, Kam-Ho; Su, Min; Luff, Jon; Valecha, Mayank; Everett, John K.; Acton, Thomas B.; Xiao, Rong; Montelione, Gaetano T.; Aalberts, Daniel P.; Hunt, John F.
2016-01-01
Degeneracy in the genetic code, which enables a single protein to be encoded by a multitude of synonymous gene sequences, has an important role in regulating protein expression, but substantial uncertainty exists concerning the details of this phenomenon. Here we analyze the sequence features influencing protein expression levels in 6,348 experiments using bacteriophage T7 polymerase to synthesize messenger RNA in Escherichia coli. Logistic regression yields a new codon-influence metric that correlates only weakly with genomic codon-usage frequency, but strongly with global physiological protein concentrations and also mRNA concentrations and lifetimes in vivo. Overall, the codon content influences protein expression more strongly than mRNA-folding parameters, although the latter dominate in the initial ~16 codons. Genes redesigned based on our analyses are transcribed with unaltered efficiency but translated with higher efficiency in vitro. The less efficiently translated native sequences show greatly reduced mRNA levels in vivo. Our results suggest that codon content modulates a kinetic competition between protein elongation and mRNA degradation that is a central feature of the physiology and also possibly the regulation of translation in E. coli. PMID:26760206
Microprocessor mediates transcriptional termination in long noncoding microRNA genes
Dhir, Ashish; Dhir, Somdutta; Proudfoot, Nick J.; Jopling, Catherine L.
2015-01-01
MicroRNA (miRNA) play a major role in the post-transcriptional regulation of gene expression. Mammalian miRNA biogenesis begins with co-transcriptional cleavage of RNA polymerase II (Pol II) transcripts by the Microprocessor complex. While most miRNA are located within introns of protein coding genes, a substantial minority of miRNA originate from long non coding (lnc) RNA where transcript processing is largely uncharacterized. We show, by detailed characterization of liver-specific lnc-pri-miR-122 and genome-wide analysis in human cell lines, that most lnc-pri-miRNA do not use the canonical cleavage and polyadenylation (CPA) pathway, but instead use Microprocessor cleavage to terminate transcription. This Microprocessor inactivation leads to extensive transcriptional readthrough of lnc-pri-miRNA and transcriptional interference with downstream genes. Consequently we define a novel RNase III-mediated, polyadenylation-independent mechanism of Pol II transcription termination in mammalian cells. PMID:25730776
Bagley, Joshua A.; Yan, Zhiqiang; Zhang, Wei; Wildonger, Jill
2014-01-01
A complex array of genetic factors regulates neuronal dendrite morphology. Epigenetic regulation of gene expression represents a plausible mechanism to control pathways responsible for specific dendritic arbor shapes. By studying the Drosophila dendritic arborization (da) neurons, we discovered a role of the double-bromodomain and extraterminal (BET) family proteins in regulating dendrite arbor complexity. A loss-of-function mutation in the single Drosophila BET protein encoded by female sterile 1 homeotic [fs(1)h] causes loss of fine, terminal dendritic branches. Moreover, fs(1)h is necessary for the induction of branching caused by a previously identified transcription factor, Cut (Ct), which regulates subtype-specific dendrite morphology. Finally, disrupting fs(1)h function impairs the mechanosensory response of class III da sensory neurons without compromising the expression of the ion channel NompC, which mediates the mechanosensitive response. Thus, our results identify a novel role for BET family proteins in regulating dendrite morphology and a possible separation of developmental pathways specifying neural cell morphology and ion channel expression. Since the BET proteins are known to bind acetylated histone tails, these results also suggest a role of epigenetic histone modifications and the “histone code,” in regulating dendrite morphology. PMID:25184680
New PAH gene promoter KLF1 and 3'-region C/EBPalpha motifs influence transcription in vitro.
Klaassen, Kristel; Stankovic, Biljana; Kotur, Nikola; Djordjevic, Maja; Zukic, Branka; Nikcevic, Gordana; Ugrin, Milena; Spasovski, Vesna; Srzentic, Sanja; Pavlovic, Sonja; Stojiljkovic, Maja
2017-02-01
Phenylketonuria (PKU) is a metabolic disease caused by mutations in the phenylalanine hydroxylase (PAH) gene. Although the PAH genotype remains the main determinant of PKU phenotype severity, genotype-phenotype inconsistencies have been reported. In this study, we focused on unanalysed sequences in non-coding PAH gene regions to assess their possible influence on the PKU phenotype. We transiently transfected HepG2 cells with various chloramphenicol acetyl transferase (CAT) reporter constructs which included PAH gene non-coding regions. Selected non-coding regions were indicated by in silico prediction to contain transcription factor binding sites. Furthermore, electrophoretic mobility shift assay (EMSA) and supershift assays were performed to identify which transcriptional factors were engaged in the interaction. We found novel KLF1 motif in the PAH promoter, which decreases CAT activity by 50 % in comparison to basal transcription in vitro. The cytosine at the c.-170 promoter position creates an additional binding site for the protein complex involving KLF1 transcription factor. Moreover, we assessed for the first time the role of a multivariant variable number tandem repeat (VNTR) region located in the 3'-region of the PAH gene. We found that the VNTR3, VNTR7 and VNTR8 constructs had approximately 60 % of CAT activity. The regulation is mediated by the C/EBPalpha transcription factor, present in protein complex binding to VNTR3. Our study highlighted two novel promoter KLF1 and 3'-region C/EBPalpha motifs in the PAH gene which decrease transcription in vitro and, thus, could be considered as PAH expression modifiers. New transcription motifs in non-coding regions will contribute to better understanding of the PKU phenotype complexity and may become important for the optimisation of PKU treatment.
Dubey, Bhawna; Meganathan, P R; Haque, Ikramul
2012-07-01
This paper reports the complete mitochondrial genome sequence of an endangered Indian snake, Python molurus molurus (Indian Rock Python). A typical snake mitochondrial (mt) genome of 17258 bp length comprising of 37 genes including the 13 protein coding genes, 22 tRNA genes, and 2 ribosomal RNA genes along with duplicate control regions is described herein. The P. molurus molurus mt. genome is relatively similar to other snake mt. genomes with respect to gene arrangement, composition, tRNA structures and skews of AT/GC bases. The nucleotide composition of the genome shows that there are more A-C % than T-G% on the positive strand as revealed by positive AT and CG skews. Comparison of individual protein coding genes, with other snake genomes suggests that ATP8 and NADH3 genes have high divergence rates. Codon usage analysis reveals a preference of NNC codons over NNG codons in the mt. genome of P. molurus. Also, the synonymous and non-synonymous substitution rates (ka/ks) suggest that most of the protein coding genes are under purifying selection pressure. The phylogenetic analyses involving the concatenated 13 protein coding genes of P. molurus molurus conformed to the previously established snake phylogeny.
Ren, Gang; Eskandari, Parisa; Wang, Siqian; Smas, Cynthia M
2016-01-15
The gene for Small Adipocyte Factor 1, Smaf1 (also known as adipogenin, ADIG), encodes a ∼600 base transcript that is highly upregulated during 3T3-L1 in vitro adipogenesis and markedly enriched in adipose tissues. Based on the lack of an obvious open reading frame in the Smaf1 transcript, it is not known if the Smaf1 gene is protein coding or non-coding RNA. Using a peptide from a putative open reading frame of Smaf1 as antigen, we generated antibodies for western analysis. Our studies prove that Smaf1 encodes an adipose-enriched protein which in western blot analysis migrates at ∼10 kDa. Rapid induction of Smaf1 protein occurs during in vitro adipogenesis and its expression in 3T3-L1 adipocytes is positively regulated by insulin and glucose. Moreover, siRNA studies reveal that expression of Smaf1 in adipocytes is wholly dependent on PPARγ. On the other hand, use of siRNA for Smaf1 to nearly abolish its protein expression in adipocytes revealed that Smaf1 does not have a major role in adipocyte triglyceride accumulation, lipolysis or insulin-stimulated pAkt induction. However, immunolocalization studies using HA-tagged Smaf1 reveal enrichment at adipocyte lipid droplets. Together our findings show that Smaf1 is a novel small protein endogenous to adipocytes and that Smaf1 expression is closely tied to PPARγ-mediated signals and the adipocyte phenotype. Copyright © 2015 Elsevier Inc. All rights reserved.
Venturi, V; Wolfs, K; Leong, J; Weisbeek, P J
1994-10-17
Pseudobactin 358 is the yellow-green fluorescent siderophore [microbial iron(III) transport agent] produced by Pseudomonas putida WCS358 under iron-limiting conditions. The genes encoding pseudobactin 358 biosynthesis are iron-regulated at the level of transcription. In this study, the molecular characterization is reported of a cosmid clone of WCS358 DNA that can stimulate, in an iron-dependent manner, the activity of a WCS358 siderophore gene promoter in the heterologous Pseudomonas strain A225. The functional region in the clone was identified by subcloning, transposon mutagenesis and DNA sequencing as the groESL operon of strain WCS358. This increase in promoter activity was not observed when the groESL genes of strain WCS358 were integrated via a transposon vector into the genome of Pseudomonas A225, indicating that multiple copies of the operon are necessary for the increase in siderophore gene promoter activity. Amplification of the Escherichia coli and WCS358 groESL genes also increased iron-regulated promoter activity in the parent strain WCS358. The groESL operon codes for the chaperone proteins GroES and GroEL, which are responsible for mediating the folding and assembly of many proteins.
Junk DNA and the long non-coding RNA twist in cancer genetics
Ling, Hui; Vincent, Kimberly; Pichler, Martin; Fodde, Riccardo; Berindan-Neagoe, Ioana; Slack, Frank J.; Calin, George A
2015-01-01
The central dogma of molecular biology states that the flow of genetic information moves from DNA to RNA to protein. However, in the last decade this dogma has been challenged by new findings on non-coding RNAs (ncRNAs) such as microRNAs (miRNAs). More recently, long non-coding RNAs (lncRNAs) have attracted much attention due to their large number and biological significance. Many lncRNAs have been identified as mapping to regulatory elements including gene promoters and enhancers, ultraconserved regions, and intergenic regions of protein-coding genes. Yet, the biological function and molecular mechanisms of lncRNA in human diseases in general and cancer in particular remain largely unknown. Data from the literature suggest that lncRNA, often via interaction with proteins, functions in specific genomic loci or use their own transcription loci for regulatory activity. In this review, we summarize recent findings supporting the importance of DNA loci in lncRNA function, and the underlying molecular mechanisms via cis or trans regulation, and discuss their implications in cancer. In addition, we use the 8q24 genomic locus, a region containing interactive SNPs, DNA regulatory elements and lncRNAs, as an example to illustrate how single nucleotide polymorphism (SNP) located within lncRNAs may be functionally associated with the individual’s susceptibility to cancer. PMID:25619839
Fourquin, Chloé; del Cerro, Carolina; Victoria, Filipe C.; Vialette-Guiraud, Aurélie; de Oliveira, Antonio C.; Ferrándiz, Cristina
2013-01-01
Angiosperms are the most diverse and numerous group of plants, and it is generally accepted that this evolutionary success owes in part to the diversity found in fruits, key for protecting the developing seeds and ensuring seed dispersal. Although studies on the molecular basis of morphological innovations are few, they all illustrate the central role played by transcription factors acting as developmental regulators. Here, we show that a small change in the protein sequence of a MADS-box transcription factor correlates with the origin of a highly modified fruit morphology and the change in seed dispersal strategies that occurred in Medicago, a genus belonging to the large legume family. This protein sequence modification alters the functional properties of the protein, affecting the affinities for other protein partners involved in high-order complexes. Our work illustrates that variation in coding regions can generate evolutionary novelties not based on gene duplication/subfunctionalization but by interactions in complex networks, contributing also to the current debate on the relative importance of changes in regulatory or coding regions of master regulators in generating morphological novelties. PMID:23640757
Blazie, Stephen M.; Geissel, Heather C.; Wilky, Henry; Joshi, Rajan; Newbern, Jason; Mangone, Marco
2017-01-01
mRNA expression dynamics promote and maintain the identity of somatic tissues in living organisms; however, their impact in post-transcriptional gene regulation in these processes is not fully understood. Here, we applied the PAT-Seq approach to systematically isolate, sequence, and map tissue-specific mRNA from five highly studied Caenorhabditis elegans somatic tissues: GABAergic and NMDA neurons, arcade and intestinal valve cells, seam cells, and hypodermal tissues, and studied their mRNA expression dynamics. The integration of these datasets with previously profiled transcriptomes of intestine, pharynx, and body muscle tissues, precisely assigns tissue-specific expression dynamics for 60% of all annotated C. elegans protein-coding genes, providing an important resource for the scientific community. The mapping of 15,956 unique high-quality tissue-specific polyA sites in all eight somatic tissues reveals extensive tissue-specific 3′untranslated region (3′UTR) isoform switching through alternative polyadenylation (APA) . Almost all ubiquitously transcribed genes use APA and harbor miRNA targets in their 3′UTRs, which are commonly lost in a tissue-specific manner, suggesting widespread usage of post-transcriptional gene regulation modulated through APA to fine tune tissue-specific protein expression. Within this pool, the human disease gene C. elegans orthologs rack-1 and tct-1 use APA to switch to shorter 3′UTR isoforms in order to evade miRNA regulation in the body muscle tissue, resulting in increased protein expression needed for proper body muscle function. Our results highlight a major positive regulatory role for APA, allowing genes to counteract miRNA regulation on a tissue-specific basis. PMID:28348061
Blazie, Stephen M; Geissel, Heather C; Wilky, Henry; Joshi, Rajan; Newbern, Jason; Mangone, Marco
2017-06-01
mRNA expression dynamics promote and maintain the identity of somatic tissues in living organisms; however, their impact in post-transcriptional gene regulation in these processes is not fully understood. Here, we applied the PAT-Seq approach to systematically isolate, sequence, and map tissue-specific mRNA from five highly studied Caenorhabditis elegans somatic tissues: GABAergic and NMDA neurons, arcade and intestinal valve cells, seam cells, and hypodermal tissues, and studied their mRNA expression dynamics. The integration of these datasets with previously profiled transcriptomes of intestine, pharynx, and body muscle tissues, precisely assigns tissue-specific expression dynamics for 60% of all annotated C. elegans protein-coding genes, providing an important resource for the scientific community. The mapping of 15,956 unique high-quality tissue-specific polyA sites in all eight somatic tissues reveals extensive tissue-specific 3'untranslated region (3'UTR) isoform switching through alternative polyadenylation (APA) . Almost all ubiquitously transcribed genes use APA and harbor miRNA targets in their 3'UTRs, which are commonly lost in a tissue-specific manner, suggesting widespread usage of post-transcriptional gene regulation modulated through APA to fine tune tissue-specific protein expression. Within this pool, the human disease gene C. elegans orthologs rack-1 and tct-1 use APA to switch to shorter 3'UTR isoforms in order to evade miRNA regulation in the body muscle tissue, resulting in increased protein expression needed for proper body muscle function. Our results highlight a major positive regulatory role for APA, allowing genes to counteract miRNA regulation on a tissue-specific basis. Copyright © 2017 Blazie et al.
Choudhry, H; Albukhari, A; Morotti, M; Haider, S; Moralli, D; Smythies, J; Schödel, J; Green, C M; Camps, C; Buffa, F; Ratcliffe, P; Ragoussis, J; Harris, A L; Mole, D R
2015-01-01
Activation of cellular transcriptional responses, mediated by hypoxia-inducible factor (HIF), is common in many types of cancer, and generally confers a poor prognosis. Known to induce many hundreds of protein-coding genes, HIF has also recently been shown to be a key regulator of the non-coding transcriptional response. Here, we show that NEAT1 long non-coding RNA (lncRNA) is a direct transcriptional target of HIF in many breast cancer cell lines and in solid tumors. Unlike previously described lncRNAs, NEAT1 is regulated principally by HIF-2 rather than by HIF-1. NEAT1 is a nuclear lncRNA that is an essential structural component of paraspeckles and the hypoxic induction of NEAT1 induces paraspeckle formation in a manner that is dependent upon both NEAT1 and on HIF-2. Paraspeckles are multifunction nuclear structures that sequester transcriptionally active proteins as well as RNA transcripts that have been subjected to adenosine-to-inosine (A-to-I) editing. We show that the nuclear retention of one such transcript, F11R (also known as junctional adhesion molecule 1, JAM1), in hypoxia is dependent upon the hypoxic increase in NEAT1, thereby conferring a novel mechanism of HIF-dependent gene regulation. Induction of NEAT1 in hypoxia also leads to accelerated cellular proliferation, improved clonogenic survival and reduced apoptosis, all of which are hallmarks of increased tumorigenesis. Furthermore, in patients with breast cancer, high tumor NEAT1 expression correlates with poor survival. Taken together, these results indicate a new role for HIF transcriptional pathways in the regulation of nuclear structure and that this contributes to the pro-tumorigenic hypoxia-phenotype in breast cancer. PMID:25417700
Higashi, Koichi; Tobe, Toru; Kanai, Akinori; Uyar, Ebru; Ishikawa, Shu; Suzuki, Yutaka; Ogasawara, Naotake; Kurokawa, Ken; Oshima, Taku
2016-01-01
Bacteria can acquire new traits through horizontal gene transfer. Inappropriate expression of transferred genes, however, can disrupt the physiology of the host bacteria. To reduce this risk, Escherichia coli expresses the nucleoid-associated protein, H-NS, which preferentially binds to horizontally transferred genes to control their expression. Once expression is optimized, the horizontally transferred genes may actually contribute to E. coli survival in new habitats. Therefore, we investigated whether and how H-NS contributes to this optimization process. A comparison of H-NS binding profiles on common chromosomal segments of three E. coli strains belonging to different phylogenetic groups indicated that the positions of H-NS-bound regions have been conserved in E. coli strains. The sequences of the H-NS-bound regions appear to have diverged more so than H-NS-unbound regions only when H-NS-bound regions are located upstream or in coding regions of genes. Because these regions generally contain regulatory elements for gene expression, sequence divergence in these regions may be associated with alteration of gene expression. Indeed, nucleotide substitutions in H-NS-bound regions of the ybdO promoter and coding regions have diversified the potential for H-NS-independent negative regulation among E. coli strains. The ybdO expression in these strains was still negatively regulated by H-NS, which reduced the effect of H-NS-independent regulation under normal growth conditions. Hence, we propose that, during E. coli evolution, the conservation of H-NS binding sites resulted in the diversification of the regulation of horizontally transferred genes, which may have facilitated E. coli adaptation to new ecological niches. PMID:26789284
Higashi, Koichi; Tobe, Toru; Kanai, Akinori; Uyar, Ebru; Ishikawa, Shu; Suzuki, Yutaka; Ogasawara, Naotake; Kurokawa, Ken; Oshima, Taku
2016-01-01
Bacteria can acquire new traits through horizontal gene transfer. Inappropriate expression of transferred genes, however, can disrupt the physiology of the host bacteria. To reduce this risk, Escherichia coli expresses the nucleoid-associated protein, H-NS, which preferentially binds to horizontally transferred genes to control their expression. Once expression is optimized, the horizontally transferred genes may actually contribute to E. coli survival in new habitats. Therefore, we investigated whether and how H-NS contributes to this optimization process. A comparison of H-NS binding profiles on common chromosomal segments of three E. coli strains belonging to different phylogenetic groups indicated that the positions of H-NS-bound regions have been conserved in E. coli strains. The sequences of the H-NS-bound regions appear to have diverged more so than H-NS-unbound regions only when H-NS-bound regions are located upstream or in coding regions of genes. Because these regions generally contain regulatory elements for gene expression, sequence divergence in these regions may be associated with alteration of gene expression. Indeed, nucleotide substitutions in H-NS-bound regions of the ybdO promoter and coding regions have diversified the potential for H-NS-independent negative regulation among E. coli strains. The ybdO expression in these strains was still negatively regulated by H-NS, which reduced the effect of H-NS-independent regulation under normal growth conditions. Hence, we propose that, during E. coli evolution, the conservation of H-NS binding sites resulted in the diversification of the regulation of horizontally transferred genes, which may have facilitated E. coli adaptation to new ecological niches.
Bao, Duran; Ganbaatar, Oyunchuluun; Cui, Xiuqi; Yu, Ruonan; Bao, Wenhua; Falk, Bryce W; Wuriyanghan, Hada
2018-04-01
Plants protect themselves from virus infections by several different defence mechanisms. RNA interference (RNAi) is one prominent antiviral mechanism, which requires the participation of AGO (Argonaute) and Dicer/DCL (Dicer-like) proteins. Effector-triggered immunity (ETI) is an antiviral mechanism mediated by resistance (R) genes, most of which encode nucleotide-binding site-leucine-rich repeat (NBS-LRR) family proteins. MicroRNAs (miRNAs) play important regulatory roles in plants, including the regulation of host defences. Soybean mosaic virus (SMV) is the most common virus in soybean and, in this work, we identified dozens of SMV-responsive miRNAs by microarray analysis in an SMV-susceptible soybean line. Amongst the up-regulated miRNAs, miR168a, miR403a, miR162b and miR1515a predictively regulate the expression of AGO1, AGO2, DCL1 and DCL2, respectively, and miR1507a, miR1507c and miR482a putatively regulate the expression of several NBS-LRR family disease resistance genes. The regulation of target gene expression by these seven miRNAs was validated by both transient expression assays and RNA ligase-mediated rapid amplification of cDNA ends (RLM-RACE) experiments. Transcript levels for AGO1, DCL1, DCL2 and five NBS-LRR family genes were repressed at different time points after SMV infection, whereas the corresponding miRNA levels were up-regulated at these same time points. Furthermore, inhibition of miR1507a, miR1507c, miR482a, miR168a and miR1515a by short tandem target mimic (STTM) technology compromised SMV infection efficiency in soybean. Our results imply that SMV can counteract soybean defence responses by the down-regulation of several RNAi pathway genes and NBS-LRR family resistance genes via the induction of the accumulation of their corresponding miRNA levels. © 2017 BSPP AND JOHN WILEY & SONS LTD.
2018-01-01
ABSTRACT Bacterial genomes sometimes contain genes that code for homologues of global regulators, the function of which is unclear. In members of the family Enterobacteriaceae, cells express the global regulator H-NS and its paralogue StpA. In Escherichia coli, out of providing a molecular backup for H-NS, the role of StpA is poorly characterized. The enteroaggregative E. coli strain 042 carries, in addition to the hns and stpA genes, a third gene encoding an hns paralogue (hns2). We present in this paper information about its biological function. Transcriptomic analysis has shown that the H-NS2 protein targets a subset of the genes targeted by H-NS. Genes targeted by H-NS2 correspond mainly with horizontally transferred (HGT) genes and are also targeted by the Hha protein, a fine-tuner of H-NS activity. Compared with H-NS, H-NS2 expression levels are lower. In addition, H-NS2 expression exhibits specific features: it is sensitive to the growth temperature and to the nature of the culture medium. This novel H-NS paralogue is widespread within the Enterobacteriaceae. IMPORTANCE Global regulators such as H-NS play key relevant roles enabling bacterial cells to adapt to a changing environment. H-NS modulates both core and horizontally transferred (HGT) genes, but the mechanism by which H-NS can differentially regulate these genes remains to be elucidated. There are several instances of bacterial cells carrying genes that encode homologues of the global regulators. The question is what the roles of these proteins are. We noticed that the enteroaggregative E. coli strain 042 carries a new hitherto uncharacterized copy of the hns gene. We decided to investigate why this pathogenic E. coli strain requires an extra H-NS paralogue, termed H-NS2. In our work, we show that H-NS2 displays specific expression and regulatory properties. H-NS2 targets a subset of H-NS-specific genes and may help to differentially modulate core and HGT genes by the H-NS cellular pool. PMID:29577085
Prieto, A; Bernabeu, M; Aznar, S; Ruiz-Cruz, S; Bravo, A; Queiroz, M H; Juárez, A
2018-01-01
Bacterial genomes sometimes contain genes that code for homologues of global regulators, the function of which is unclear. In members of the family Enterobacteriaceae , cells express the global regulator H-NS and its paralogue StpA. In Escherichia coli , out of providing a molecular backup for H-NS, the role of StpA is poorly characterized. The enteroaggregative E. coli strain 042 carries, in addition to the hns and stpA genes, a third gene encoding an hns paralogue ( hns2 ). We present in this paper information about its biological function. Transcriptomic analysis has shown that the H-NS2 protein targets a subset of the genes targeted by H-NS. Genes targeted by H-NS2 correspond mainly with horizontally transferred (HGT) genes and are also targeted by the Hha protein, a fine-tuner of H-NS activity. Compared with H-NS, H-NS2 expression levels are lower. In addition, H-NS2 expression exhibits specific features: it is sensitive to the growth temperature and to the nature of the culture medium. This novel H-NS paralogue is widespread within the Enterobacteriaceae . IMPORTANCE Global regulators such as H-NS play key relevant roles enabling bacterial cells to adapt to a changing environment. H-NS modulates both core and horizontally transferred (HGT) genes, but the mechanism by which H-NS can differentially regulate these genes remains to be elucidated. There are several instances of bacterial cells carrying genes that encode homologues of the global regulators. The question is what the roles of these proteins are. We noticed that the enteroaggregative E. coli strain 042 carries a new hitherto uncharacterized copy of the hns gene. We decided to investigate why this pathogenic E. coli strain requires an extra H-NS paralogue, termed H-NS2. In our work, we show that H-NS2 displays specific expression and regulatory properties. H-NS2 targets a subset of H-NS-specific genes and may help to differentially modulate core and HGT genes by the H-NS cellular pool.
AP-2α and AP-2β cooperatively orchestrate homeobox gene expression during branchial arch patterning.
Van Otterloo, Eric; Li, Hong; Jones, Kenneth L; Williams, Trevor
2018-01-25
The evolution of a hinged moveable jaw with variable morphology is considered a major factor behind the successful expansion of the vertebrates. DLX homeobox transcription factors are crucial for establishing the positional code that patterns the mandible, maxilla and intervening hinge domain, but how the genes encoding these proteins are regulated remains unclear. Herein, we demonstrate that the concerted action of the AP-2α and AP-2β transcription factors within the mouse neural crest is essential for jaw patterning. In the absence of these two proteins, the hinge domain is lost and there are alterations in the size and patterning of the jaws correlating with dysregulation of homeobox gene expression, with reduced levels of Emx, Msx and Dlx paralogs accompanied by an expansion of Six1 expression. Moreover, detailed analysis of morphological features and gene expression changes indicate significant overlap with various compound Dlx gene mutants. Together, these findings reveal that the AP-2 genes have a major function in mammalian neural crest development, influencing patterning of the craniofacial skeleton via the DLX code, an effect that has implications for vertebrate facial evolution, as well as for human craniofacial disorders. © 2018. Published by The Company of Biologists Ltd.
Pianigiani, Giulia; Licastro, Danilo; Fortugno, Paola; Castiglia, Daniele; Petrovic, Ivana; Pagani, Franco
2018-06-12
MicroRNAs are found throughout the genome and are processed by the microprocessor complex (MPC) from longer precursors. Some precursor miRNAs overlap intron:exon junctions. These Splice site Overlapping microRNAs (SO-miRNAs) are mostly located in coding genes. It has been intimated, in the rarer examples of SO-miRNAs in non-coding RNAs, that the competition between the spliceosome and the MPC modulates alternative splicing. However, the effect of this overlap on coding transcripts is unknown. Unexpectedly, we show that neither Drosha silencing nor SF3b1 silencing changed the inclusion ratio of SO-miRNA exons. Two SO-miRNAs, located in genes that code for basal membrane proteins, are known to inhibit proliferation in primary keratinocytes. These SO-miRNAs were upregulated during differentiation and the host mRNAs were downregulated, but again there was no change in inclusion ratio of the SO-miRNA exons. Interestingly, Drosha silencing increased nascent RNA density, on chromatin, downstream of SO-miRNA exons. Overall our data suggest a novel mechanism for regulating gene expression in which MPC-dependent cleavage of SO-miRNA exons could cause premature transcriptional termination of coding genes rather than affecting alternative splicing. Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Ikemura, Kenji; Iwamoto, Takuya; Okuda, Masahiro
2014-08-01
Drug transporters, drug-metabolizing enzymes, and tight junctions in the small intestine function as an absorption barrier and sometimes as a facilitator of orally administered drugs. The expression of these proteins often fluctuates and thereby causes individual pharmacokinetic variability. MicroRNAs (miRNAs), which are small non-coding RNAs, have recently emerged as a new class of gene regulator. MiRNAs post-transcriptionally regulate gene expression by binding to target mRNA to suppress its translation or regulate its degradation. They have been shown to be key regulators of proteins associated with pharmacokinetics. Moreover, the role of miRNAs on the expression of some proteins expressed in the small intestine has recently been clarified. In this review, we summarize current knowledge regarding the role of miRNAs in the regulation of drug transporters, drug-metabolizing enzymes, and tight junctions as well as its implication for intestinal barrier function. MiRNAs play vital roles in the differentiation, architecture, and barrier function of intestinal epithelial cells, and directly and/or indirectly regulate the expression and function of proteins associated with drug absorption in intestinal epithelial cells. Moreover, the variation of miRNA expression caused by pathological and physiological conditions as well as genetic factors should affect the expression of these proteins. Therefore, miRNAs could be significant factors affecting inter- and intra-individual variations in the pharmacokinetics and intestinal absorption of drugs. Overall, miRNAs could be promising targets for personalized pharmacotherapy or other attractive therapies through intestinal absorption of drugs. Copyright © 2014 Elsevier Inc. All rights reserved.
Lessard, Laurent; Liu, Michelle; Marzese, Diego M.; Wang, Hongwei; Chong, Kelly; Kawas, Neal; Donovan, Nicholas C; Kiyohara, Eiji; Hsu, Sandy; Nelson, Nellie; Izraely, Sivan; Sagi-Assif, Orit; Witz, Isaac P; Ma, Xiao-Jun; Luo, Yuling; Hoon, Dave SB
2015-01-01
In recent years, considerable advances have been made in the characterization of protein-coding alterations involved in the pathogenesis of melanoma. However, despite their growing implication in cancer, little is known about the role of long non-coding RNAs in melanoma progression. We hypothesized that copy number alterations of intergenic non-protein coding domains could help identify long intergenic non-coding RNAs (lincRNAs) associated with metastatic cutaneous melanoma. Among several candidates, our approach uncovered the chromosome 6p22.3 CASC15 lincRNA locus as a frequently gained genomic segment in metastatic melanoma tumors and cell lines. The locus was actively transcribed in metastatic melanoma cells, and up-regulation of CASC15 expression was associated with metastatic progression to brain metastasis in a mouse xenograft model. In clinical specimens, CASC15 levels increased during melanoma progression and were independent predictors of disease recurrence in a cohort of 141 patients with AJCC stage III lymph node metastasis. Moreover, siRNA knockdown experiments revealed that CASC15 regulates melanoma cell phenotype switching between proliferative and invasive states. Accordingly, CASC15 levels correlated with known gene signatures corresponding to melanoma proliferative and invasive phenotypes. These findings support a key role for CASC15 in metastatic melanoma. PMID:26016895
microRNA Therapeutics in Cancer - An Emerging Concept.
Shah, Maitri Y; Ferrajoli, Alessandra; Sood, Anil K; Lopez-Berestein, Gabriel; Calin, George A
2016-10-01
MicroRNAs (miRNAs) are an evolutionarily conserved class of small, regulatory non-coding RNAs that negatively regulate protein coding gene and other non-coding transcripts expression. miRNAs have been established as master regulators of cellular processes, and they play a vital role in tumor initiation, progression and metastasis. Further, widespread deregulation of microRNAs have been reported in several cancers, with several microRNAs playing oncogenic and tumor suppressive roles. Based on these, miRNAs have emerged as promising therapeutic tools for cancer management. In this review, we have focused on the roles of miRNAs in tumorigenesis, the miRNA-based therapeutic strategies currently being evaluated for use in cancer, and the advantages and current challenges to their use in the clinic. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
The point of no return: The poly(A)-associated elongation checkpoint
Tellier, Michael; Ferrer-Vicens, Ivan; Murphy, Shona
2016-01-01
abstract Cyclin-dependent kinases play critical roles in transcription by RNA polymerase II (pol II) and processing of the transcripts. For example, CDK9 regulates transcription of protein-coding genes, splicing, and 3′ end formation of the transcripts. Accordingly, CDK9 inhibitors have a drastic effect on the production of mRNA in human cells. Recent analyses indicate that CDK9 regulates transcription at the early-elongation checkpoint of the vast majority of pol II-transcribed genes. Our recent discovery of an additional CDK9-regulated elongation checkpoint close to poly(A) sites adds a new layer to the control of transcription by this critical cellular kinase. This novel poly(A)-associated checkpoint has the potential to powerfully regulate gene expression just before a functional polyadenylated mRNA is produced: the point of no return. However, many questions remain to be answered before the role of this checkpoint becomes clear. Here we speculate on the possible biological significance of this novel mechanism of gene regulation and the players that may be involved. PMID:26853452
Ngwa, Che J.; Kiesow, Meike J.; Papst, Olga; Orchard, Lindsey M.; Filarsky, Michael; Rosinski, Alina N.; Voss, Till S.; Llinás, Manuel; Pradel, Gabriele
2017-01-01
Transmission of the malaria parasite Plasmodium falciparum from the human to the mosquito is mediated by the intraerythrocytic gametocytes, which, once taken up during a blood meal, become activated to initiate sexual reproduction. Because gametocytes are the only parasite stages able to establish an infection in the mosquito, they are crucial for spreading the tropical disease. During gametocyte maturation, different repertoires of genes are switched on and off in a well-coordinated sequence, pointing to regulatory mechanisms of gene expression. While epigenetic gene control has been studied during erythrocytic schizogony of P. falciparum, little is known about this process during human-to-mosquito transmission of the parasite. To unveil the potential role of histone acetylation during gene expression in gametocytes, we carried out a microarray-based transcriptome analysis on gametocytes treated with the histone deacetylase inhibitor trichostatin A (TSA). TSA-treatment impaired gametocyte maturation and lead to histone hyper-acetylation in these stages. Comparative transcriptomics identified 294 transcripts, which were more than 2-fold up-regulated during gametocytogenesis following TSA-treatment. In activated gametocytes, which were less sensitive to TSA, the transcript levels of 48 genes were increased. TSA-treatment further led to repression of ~145 genes in immature and mature gametocytes and 7 genes in activated gametocytes. Up-regulated genes are mainly associated with functions in invasion, cytoadherence, and protein export, while down-regulated genes could particularly be assigned to transcription and translation. Chromatin immunoprecipitation demonstrated a link between gene activation and histone acetylation for selected genes. Among the genes up-regulated in TSA-treated mature gametocytes was a gene encoding the ring finger (RING)-domain protein PfRNF1, a putative E3 ligase of the ubiquitin-mediated signaling pathway. Immunochemistry demonstrated PfRNF1 expression mainly in the sexual stages of P. falciparum with peak expression in stage II gametocytes, where the protein localized to the nucleus and cytoplasm. Pfrnf1 promoter and coding regions associated with acetylated histones, and TSA-treatment resulted in increased PfRNF1 levels. Our combined data point to an essential role of histone acetylation for gene regulation in gametocytes, which can be exploited for malaria transmission-blocking interventions. PMID:28791254
Identification of Lmo1 as part of a Hox-dependent regulatory network for hindbrain patterning.
Matis, Christelle; Oury, Franck; Remacle, Sophie; Lampe, Xavier; Gofflot, Françoise; Picard, Jacques J; Rijli, Filippo M; Rezsohazy, René
2007-09-01
The embryonic functions of Hox proteins have been extensively investigated in several animal phyla. These transcription factors act as selectors of developmental programmes, to govern the morphogenesis of multiple structures and organs. However, despite the variety of morphogenetic processes Hox proteins are involved in, only a limited set of their target genes has been identified so far. To find additional targets, we used a strategy based upon the simultaneous overexpression of Hoxa2 and its cofactors Pbx1 and Prep in a cellular model. Among genes whose expression was upregulated, we identified LMO1, which codes for an intertwining LIM-only factor involved in protein-DNA oligomeric complexes. By analysing its expression in Hox knockout mice, we show that Lmo1 is differentially regulated by Hoxa2 and Hoxb2, in specific columns of hindbrain neuronal progenitors. These results suggest that Lmo1 takes part in a Hox paralogue 2-dependent network regulating anteroposterior and dorsoventral hindbrain patterning. (c) 2007 Wiley-Liss, Inc.
Ishiwata, Ryosuke R; Morioka, Masaki S; Ogishima, Soichi; Tanaka, Hiroshi
2009-02-15
BioCichlid is a 3D visualization system of time-course microarray data on molecular networks, aiming at interpretation of gene expression data by transcriptional relationships based on the central dogma with physical and genetic interactions. BioCichlid visualizes both physical (protein) and genetic (regulatory) network layers, and provides animation of time-course gene expression data on the genetic network layer. Transcriptional regulations are represented to bridge the physical network (transcription factors) and genetic network (regulated genes) layers, thus integrating promoter analysis into the pathway mapping. BioCichlid enhances the interpretation of microarray data and allows for revealing the underlying mechanisms causing differential gene expressions. BioCichlid is freely available and can be accessed at http://newton.tmd.ac.jp/. Source codes for both biocichlid server and client are also available.
Kramer, Marianne C; Liang, Dongming; Tatomer, Deirdre C; Gold, Beth; March, Zachary M; Cherry, Sara; Wilusz, Jeremy E
2015-10-15
Thousands of eukaryotic protein-coding genes are noncanonically spliced to produce circular RNAs. Bioinformatics has indicated that long introns generally flank exons that circularize in Drosophila, but the underlying mechanisms by which these circular RNAs are generated are largely unknown. Here, using extensive mutagenesis of expression plasmids and RNAi screening, we reveal that circularization of the Drosophila laccase2 gene is regulated by both intronic repeats and trans-acting splicing factors. Analogous to what has been observed in humans and mice, base-pairing between highly complementary transposable elements facilitates backsplicing. Long flanking repeats (∼ 400 nucleotides [nt]) promote circularization cotranscriptionally, whereas pre-mRNAs containing minimal repeats (<40 nt) generate circular RNAs predominately after 3' end processing. Unlike the previously characterized Muscleblind (Mbl) circular RNA, which requires the Mbl protein for its biogenesis, we found that Laccase2 circular RNA levels are not controlled by Mbl or the Laccase2 gene product but rather by multiple hnRNP (heterogeneous nuclear ribonucleoprotein) and SR (serine-arginine) proteins acting in a combinatorial manner. hnRNP and SR proteins also regulate the expression of other Drosophila circular RNAs, including Plexin A (PlexA), suggesting a common strategy for regulating backsplicing. Furthermore, the laccase2 flanking introns support efficient circularization of diverse exons in Drosophila and human cells, providing a new tool for exploring the functional consequences of circular RNA expression across eukaryotes. © 2015 Kramer et al.; Published by Cold Spring Harbor Laboratory Press.
Giarola, Valentino; Krey, Stephanie; von den Driesch, Barbara; Bartels, Dorothea
2016-04-01
Craterostigma plantagineum tolerates extreme desiccation. Leaves of this plant shrink and extensively fold during dehydration and expand again during rehydration, preserving their structural integrity. Genes were analysed that may participate in the reversible folding mechanism. Analysis of transcripts abundantly expressed in desiccated leaves identified a gene putatively coding for an apoplastic glycine-rich protein (CpGRP1). We studied the expression, regulation and subcellular localization of CpGRP1 and its ability to interact with a cell wall-associated protein kinase (CpWAK1) to understand the role of CpGRP1 in the cell wall during dehydration. The CpGRP1 protein accumulates in the apoplast of desiccated leaves. Analysis of the promoter revealed that the gene expression is mainly regulated at the transcriptional level, is independent of abscisic acid (ABA) and involves a drought-responsive cis-element (DRE). CpGRP1 interacts with CpWAK1 which is down-regulated in response to dehydration. Our data suggest a role of the CpGRP1-CpWAK1 complex in dehydration-induced morphological changes in the cell wall during dehydration in C. plantagineum. Cell wall pectins and dehydration-induced pectin modifications are predicted to be involved in the activity of the CpGRP1-CpWAK1 complex. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
The impact of rare variation on gene expression across tissues.
Li, Xin; Kim, Yungil; Tsang, Emily K; Davis, Joe R; Damani, Farhan N; Chiang, Colby; Hess, Gaelen T; Zappala, Zachary; Strober, Benjamin J; Scott, Alexandra J; Li, Amy; Ganna, Andrea; Bassik, Michael C; Merker, Jason D; Hall, Ira M; Battle, Alexis; Montgomery, Stephen B
2017-10-11
Rare genetic variants are abundant in humans and are expected to contribute to individual disease risk. While genetic association studies have successfully identified common genetic variants associated with susceptibility, these studies are not practical for identifying rare variants. Efforts to distinguish pathogenic variants from benign rare variants have leveraged the genetic code to identify deleterious protein-coding alleles, but no analogous code exists for non-coding variants. Therefore, ascertaining which rare variants have phenotypic effects remains a major challenge. Rare non-coding variants have been associated with extreme gene expression in studies using single tissues, but their effects across tissues are unknown. Here we identify gene expression outliers, or individuals showing extreme expression levels for a particular gene, across 44 human tissues by using combined analyses of whole genomes and multi-tissue RNA-sequencing data from the Genotype-Tissue Expression (GTEx) project v6p release. We find that 58% of underexpression and 28% of overexpression outliers have nearby conserved rare variants compared to 8% of non-outliers. Additionally, we developed RIVER (RNA-informed variant effect on regulation), a Bayesian statistical model that incorporates expression data to predict a regulatory effect for rare variants with higher accuracy than models using genomic annotations alone. Overall, we demonstrate that rare variants contribute to large gene expression changes across tissues and provide an integrative method for interpretation of rare variants in individual genomes.
Lin, Runmao; He, Liye; He, Jiayu; Qin, Peigang; Wang, Yanran; Deng, Qiming; Yang, Xiaoting; Li, Shuangcheng; Wang, Shiquan; Wang, Wenming; Liu, Huainian; Li, Ping; Zheng, Aiping
2016-07-03
MicroRNAs (miRNAs) are ∼22 nucleotide non-coding RNAs that regulate gene expression by targeting mRNAs for degradation or inhibiting protein translation. To investigate whether miRNAs regulate the pathogenesis in necrotrophic fungus Rhizoctonia solani AG1 IA, which causes significant yield loss in main economically important crops, and to determine the regulatory mechanism occurring during pathogenesis, we constructed hyphal small RNA libraries from six different infection periods of the rice leaf. Through sequencing and analysis, 177 miRNA-like small RNAs (milRNAs) were identified, including 15 candidate pathogenic novel milRNAs predicted by functional annotations of their target mRNAs and expression patterns of milRNAs and mRNAs during infection. Reverse transcription-quantitative polymerase chain reaction results for randomly selected milRNAs demonstrated that our novel comprehensive predictions had a high level of accuracy. In our predicted pathogenic protein-protein interaction network of R. solani, we added the related regulatory milRNAs of these core coding genes into the network, and could understand the relationships among these regulatory factors more clearly at the systems level. Furthermore, the putative pathogenic Rhi-milR-16, which negatively regulates target gene expression, was experimentally validated to have regulatory functions by a dual-luciferase reporter assay. Additionally, 23 candidate rice miRNAs that may involve in plant immunity against R. solani were discovered. This first study on novel pathogenic milRNAs of R. solani AG1 IA and the recognition of target genes involved in pathogenicity, as well as rice miRNAs, participated in defence against R. solani could provide new insights into revealing the pathogenic mechanisms of the severe rice sheath blight disease. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Chen, Geng; Yin, Kangping; Shi, Leming; Fang, Yuanzhang; Qi, Ya; Li, Peng; Luo, Jian; He, Bing; Liu, Mingyao; Shi, Tieliu
2011-01-01
In their expression process, different genes can generate diverse functional products, including various protein-coding or noncoding RNAs. Here, we investigated the protein-coding capacities and the expression levels of their isoforms for human known genes, the conservation and disease association of long noncoding RNAs (ncRNAs) with two transcriptome sequencing datasets from human brain tissues and 10 mixed cell lines. Comparative analysis revealed that about two-thirds of the genes expressed between brain and cell lines are the same, but less than one-third of their isoforms are identical. Besides those genes specially expressed in brain and cell lines, about 66% of genes expressed in common encoded different isoforms. Moreover, most genes dominantly expressed one isoform and some genes only generated protein-coding (or noncoding) RNAs in one sample but not in another. We found 282 human genes could encode both protein-coding and noncoding RNAs through alternative splicing in the two samples. We also identified more than 1,000 long ncRNAs, and most of those long ncRNAs contain conserved elements across either 46 vertebrates or 33 placental mammals or 10 primates. Further analysis showed that some long ncRNAs differentially expressed in human breast cancer or lung cancer, several of those differentially expressed long ncRNAs were validated by RT-PCR. In addition, those validated differentially expressed long ncRNAs were found significantly correlated with certain breast cancer or lung cancer related genes, indicating the important biological relevance between long ncRNAs and human cancers. Our findings reveal that the differences of gene expression profile between samples mainly result from the expressed gene isoforms, and highlight the importance of studying genes at the isoform level for completely illustrating the intricate transcriptome.
An expanding universe of the non-coding genome in cancer biology.
Xue, Bin; He, Lin
2014-06-01
Neoplastic transformation is caused by accumulation of genetic and epigenetic alterations that ultimately convert normal cells into tumor cells with uncontrolled proliferation and survival, unlimited replicative potential and invasive growth [Hanahan,D. et al. (2011) Hallmarks of cancer: the next generation. Cell, 144, 646-674]. Although the majority of the cancer studies have focused on the functions of protein-coding genes, emerging evidence has started to reveal the importance of the vast non-coding genome, which constitutes more than 98% of the human genome. A number of non-coding RNAs (ncRNAs) derived from the 'dark matter' of the human genome exhibit cancer-specific differential expression and/or genomic alterations, and it is increasingly clear that ncRNAs, including small ncRNAs and long ncRNAs (lncRNAs), play an important role in cancer development by regulating protein-coding gene expression through diverse mechanisms. In addition to ncRNAs, nearly half of the mammalian genomes consist of transposable elements, particularly retrotransposons. Once depicted as selfish genomic parasites that propagate at the expense of host fitness, retrotransposon elements could also confer regulatory complexity to the host genomes during development and disease. Reactivation of retrotransposons in cancer, while capable of causing insertional mutagenesis and genome rearrangements to promote oncogenesis, could also alter host gene expression networks to favor tumor development. Taken together, the functional significance of non-coding genome in tumorigenesis has been previously underestimated, and diverse transcripts derived from the non-coding genome could act as integral functional components of the oncogene and tumor suppressor network. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Discovery of rare protein-coding genes in model methylotroph Methylobacterium extorquens AM1.
Kumar, Dhirendra; Mondal, Anupam Kumar; Yadav, Amit Kumar; Dash, Debasis
2014-12-01
Proteogenomics involves the use of MS to refine annotation of protein-coding genes and discover genes in a genome. We carried out comprehensive proteogenomic analysis of Methylobacterium extorquens AM1 (ME-AM1) from publicly available proteomics data with a motive to improve annotation for methylotrophs; organisms capable of surviving in reduced carbon compounds such as methanol. Besides identifying 2482(50%) proteins, 29 new genes were discovered and 66 annotated gene models were revised in ME-AM1 genome. One such novel gene is identified with 75 peptides, lacks homolog in other methylobacteria but has glycosyl transferase and lipopolysaccharide biosynthesis protein domains, indicating its potential role in outer membrane synthesis. Many novel genes are present only in ME-AM1 among methylobacteria. Distant homologs of these genes in unrelated taxonomic classes and low GC-content of few genes suggest lateral gene transfer as a potential mode of their origin. Annotations of methylotrophy related genes were also improved by the discovery of a short gene in methylotrophy gene island and redefining a gene important for pyrroquinoline quinone synthesis, essential for methylotrophy. The combined use of proteogenomics and rigorous bioinformatics analysis greatly enhanced the annotation of protein-coding genes in model methylotroph ME-AM1 genome. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
An, Shi-Qi; Febrer, Melanie; McCarthy, Yvonne; Tang, Dong-Jie; Clissold, Leah; Kaithakottil, Gemy; Swarbreck, David; Tang, Ji-Liang; Rogers, Jane; Dow, J Maxwell; Ryan, Robert P
2013-01-01
The bacterium Xanthomonas campestris is an economically important pathogen of many crop species and a model for the study of bacterial phytopathogenesis. In X. campestris, a regulatory system mediated by the signal molecule DSF controls virulence to plants. The synthesis and recognition of the DSF signal depends upon different Rpf proteins. DSF signal generation requires RpfF whereas signal perception and transduction depends upon a system comprising the sensor RpfC and regulator RpfG. Here we have addressed the action and role of Rpf/DSF signalling in phytopathogenesis by high-resolution transcriptional analysis coupled to functional genomics. We detected transcripts for many genes that were unidentified by previous computational analysis of the genome sequence. Novel transcribed regions included intergenic transcripts predicted as coding or non-coding as well as those that were antisense to coding sequences. In total, mutation of rpfF, rpfG and rpfC led to alteration in transcript levels (more than fourfold) of approximately 480 genes. The regulatory influence of RpfF and RpfC demonstrated considerable overlap. Contrary to expectation, the regulatory influence of RpfC and RpfG had limited overlap, indicating complexities of the Rpf signalling system. Importantly, functional analysis revealed over 160 new virulence factors within the group of Rpf-regulated genes. PMID:23617851
Lapébie, Pascal; Ruggiero, Antonella; Barreau, Carine; Chevalier, Sandra; Chang, Patrick; Dru, Philippe; Houliston, Evelyn; Momose, Tsuyoshi
2014-01-01
We have used Digital Gene Expression analysis to identify, without bilaterian bias, regulators of cnidarian embryonic patterning. Transcriptome comparison between un-manipulated Clytia early gastrula embryos and ones in which the key polarity regulator Wnt3 was inhibited using morpholino antisense oligonucleotides (Wnt3-MO) identified a set of significantly over and under-expressed transcripts. These code for candidate Wnt signaling modulators, orthologs of other transcription factors, secreted and transmembrane proteins known as developmental regulators in bilaterian models or previously uncharacterized, and also many cnidarian-restricted proteins. Comparisons between embryos injected with morpholinos targeting Wnt3 and its receptor Fz1 defined four transcript classes showing remarkable correlation with spatiotemporal expression profiles. Class 1 and 3 transcripts tended to show sustained expression at “oral” and “aboral” poles respectively of the developing planula larva, class 2 transcripts in cells ingressing into the endodermal region during gastrulation, while class 4 gene expression was repressed at the early gastrula stage. The preferential effect of Fz1-MO on expression of class 2 and 4 transcripts can be attributed to Planar Cell Polarity (PCP) disruption, since it was closely matched by morpholino knockdown of the specific PCP protein Strabismus. We conclude that endoderm and post gastrula-specific gene expression is particularly sensitive to PCP disruption while Wnt-/β-catenin signaling dominates gene regulation along the oral-aboral axis. Phenotype analysis using morpholinos targeting a subset of transcripts indicated developmental roles consistent with expression profiles for both conserved and cnidarian-restricted genes. Overall our unbiased screen allowed systematic identification of regionally expressed genes and provided functional support for a shared eumetazoan developmental regulatory gene set with both predicted and previously unexplored members, but also demonstrated that fundamental developmental processes including axial patterning and endoderm formation in cnidarians can involve newly evolved (or highly diverged) genes. PMID:25233086
Lapébie, Pascal; Ruggiero, Antonella; Barreau, Carine; Chevalier, Sandra; Chang, Patrick; Dru, Philippe; Houliston, Evelyn; Momose, Tsuyoshi
2014-09-01
We have used Digital Gene Expression analysis to identify, without bilaterian bias, regulators of cnidarian embryonic patterning. Transcriptome comparison between un-manipulated Clytia early gastrula embryos and ones in which the key polarity regulator Wnt3 was inhibited using morpholino antisense oligonucleotides (Wnt3-MO) identified a set of significantly over and under-expressed transcripts. These code for candidate Wnt signaling modulators, orthologs of other transcription factors, secreted and transmembrane proteins known as developmental regulators in bilaterian models or previously uncharacterized, and also many cnidarian-restricted proteins. Comparisons between embryos injected with morpholinos targeting Wnt3 and its receptor Fz1 defined four transcript classes showing remarkable correlation with spatiotemporal expression profiles. Class 1 and 3 transcripts tended to show sustained expression at "oral" and "aboral" poles respectively of the developing planula larva, class 2 transcripts in cells ingressing into the endodermal region during gastrulation, while class 4 gene expression was repressed at the early gastrula stage. The preferential effect of Fz1-MO on expression of class 2 and 4 transcripts can be attributed to Planar Cell Polarity (PCP) disruption, since it was closely matched by morpholino knockdown of the specific PCP protein Strabismus. We conclude that endoderm and post gastrula-specific gene expression is particularly sensitive to PCP disruption while Wnt-/β-catenin signaling dominates gene regulation along the oral-aboral axis. Phenotype analysis using morpholinos targeting a subset of transcripts indicated developmental roles consistent with expression profiles for both conserved and cnidarian-restricted genes. Overall our unbiased screen allowed systematic identification of regionally expressed genes and provided functional support for a shared eumetazoan developmental regulatory gene set with both predicted and previously unexplored members, but also demonstrated that fundamental developmental processes including axial patterning and endoderm formation in cnidarians can involve newly evolved (or highly diverged) genes.
Silencing of X-Linked MicroRNAs by Meiotic Sex Chromosome Inactivation
Royo, Hélène; Seitz, Hervé; ElInati, Elias; Peters, Antoine H. F. M.; Stadler, Michael B.; Turner, James M. A.
2015-01-01
During the pachytene stage of meiosis in male mammals, the X and Y chromosomes are transcriptionally silenced by Meiotic Sex Chromosome Inactivation (MSCI). MSCI is conserved in therian mammals and is essential for normal male fertility. Transcriptomics approaches have demonstrated that in mice, most or all protein-coding genes on the X chromosome are subject to MSCI. However, it is unclear whether X-linked non-coding RNAs behave in a similar manner. The X chromosome is enriched in microRNA (miRNA) genes, with many exhibiting testis-biased expression. Importantly, high expression levels of X-linked miRNAs (X-miRNAs) have been reported in pachytene spermatocytes, indicating that these genes may escape MSCI, and perhaps play a role in the XY-silencing process. Here we use RNA FISH to examine X-miRNA expression in the male germ line. We find that, like protein-coding X-genes, X-miRNAs are expressed prior to prophase I and are thereafter silenced during pachynema. X-miRNA silencing does not occur in mouse models with defective MSCI. Furthermore, X-miRNAs are expressed at pachynema when present as autosomally integrated transgenes. Thus, we conclude that silencing of X-miRNAs during pachynema in wild type males is MSCI-dependent. Importantly, misexpression of X-miRNAs during pachynema causes spermatogenic defects. We propose that MSCI represents a chromosomal mechanism by which X-miRNAs, and other potential X-encoded repressors, can be silenced, thereby regulating genes with critical late spermatogenic functions. PMID:26509798
A Case-by-Case Evolutionary Analysis of Four Imprinted Retrogenes
McCole, Ruth B; Loughran, Noeleen B; Chahal, Mandeep; Fernandes, Luis P; Roberts, Roland G; Fraternali, Franca; O'Connell, Mary J; Oakey, Rebecca J
2011-01-01
Retroposition is a widespread phenomenon resulting in the generation of new genes that are initially related to a parent gene via very high coding sequence similarity. We examine the evolutionary fate of four retrogenes generated by such an event; mouse Inpp5f_v2, Mcts2, Nap1l5, and U2af1-rs1. These genes are all subject to the epigenetic phenomenon of parental imprinting. We first provide new data on the age of these retrogene insertions. Using codon-based models of sequence evolution, we show these retrogenes have diverse evolutionary trajectories, including divergence from the parent coding sequence under positive selection pressure, purifying selection pressure maintaining parent-retrogene similarity, and neutral evolution. Examination of the expression pattern of retrogenes shows an atypical, broad pattern across multiple tissues. Protein 3D structure modeling reveals that a positively selected residue in U2af1-rs1, not shared by its parent, may influence protein conformation. Our case-by-case analysis of the evolution of four imprinted retrogenes reveals that this interesting class of imprinted genes, while similar in regulation and sequence characteristics, follow very varied evolutionary paths. PMID:21166792
Utrophin Up-Regulation by an Artificial Transcription Factor in Transgenic Mice
Mattei, Elisabetta; Corbi, Nicoletta; Di Certo, Maria Grazia; Strimpakos, Georgios; Severini, Cinzia; Onori, Annalisa; Desantis, Agata; Libri, Valentina; Buontempo, Serena; Floridi, Aristide; Fanciulli, Maurizio; Baban, Dilair; Davies, Kay E.; Passananti, Claudio
2007-01-01
Duchenne Muscular Dystrophy (DMD) is a severe muscle degenerative disease, due to absence of dystrophin. There is currently no effective treatment for DMD. Our aim is to up-regulate the expression level of the dystrophin related gene utrophin in DMD, complementing in this way the lack of dystrophin functions. To this end we designed and engineered several synthetic zinc finger based transcription factors. In particular, we have previously shown that the artificial three zinc finger protein named Jazz, fused with the appropriate effector domain, is able to drive the transcription of a test gene from the utrophin promoter “A”. Here we report on the characterization of Vp16-Jazz-transgenic mice that specifically over-express the utrophin gene at the muscular level. A Chromatin Immunoprecipitation assay (ChIP) demonstrated the effective access/binding of the Jazz protein to active chromatin in mouse muscle and Vp16-Jazz was shown to be able to up-regulate endogenous utrophin gene expression by immunohistochemistry, western blot analyses and real-time PCR. To our knowledge, this is the first example of a transgenic mouse expressing an artificial gene coding for a zinc finger based transcription factor. The achievement of Vp16-Jazz transgenic mice validates the strategy of transcriptional targeting of endogenous genes and could represent an exclusive animal model for use in drug discovery and therapeutics. PMID:17712422
Long non-coding RNA and Polycomb: an intricate partnership in cancer biology.
Achour, Cyrinne; Aguilo, Francesca
2018-06-01
High-throughput analyses have revealed that the vast majority of the transcriptome does not code for proteins. These non-translated transcripts, when larger than 200 nucleotides, are termed long non-coding RNAs (lncRNAs), and play fundamental roles in diverse cellular processes. LncRNAs are subject to dynamic chemical modification, adding another layer of complexity to our understanding of the potential roles that lncRNAs play in health and disease. Many lncRNAs regulate transcriptional programs by influencing the epigenetic state through direct interactions with chromatin-modifying proteins. Among these proteins, Polycomb repressive complexes 1 and 2 (PRC1 and PRC2) have been shown to be recruited by lncRNAs to silence target genes. Aberrant expression, deficiency or mutation of both lncRNA and Polycomb have been associated with numerous human diseases, including cancer. In this review, we have highlighted recent findings regarding the concerted mechanism of action of Polycomb group proteins (PcG), acting together with some classically defined lncRNAs including X-inactive specific transcript ( XIST ), antisense non-coding RNA in the INK4 locus ( ANRIL ), metastasis associated lung adenocarcinoma transcript 1 ( MALAT1 ), and HOX transcript antisense RNA ( HOTAIR ).
Fan, SiGang; Hu, ChaoQun; Wen, Jing; Zhang, LvPing
2011-05-01
The complete mitochondrial DNA sequence contains useful information for phylogenetic analyses of metazoa. In this study, the complete mitochondrial DNA sequence of sea cucumber Stichopus horrens (Holothuroidea: Stichopodidae: Stichopus) is presented. The complete sequence was determined using normal and long PCRs. The mitochondrial genome of Stichopus horrens is a circular molecule 16257 bps long, composed of 13 protein-coding genes, two ribosomal RNA genes and 22 transfer RNA genes. Most of these genes are coded on the heavy strand except for one protein-coding gene (nad6) and five tRNA genes (tRNA ( Ser(UCN) ), tRNA ( Gln ), tRNA ( Ala ), tRNA ( Val ), tRNA ( Asp )) which are coded on the light strand. The composition of the heavy strand is 30.8% A, 23.7% C, 16.2% G, and 29.3% T bases (AT skew=0.025; GC skew=-0.188). A non-coding region of 675 bp was identified as a putative control region because of its location and AT richness. The intergenic spacers range from 1 to 50 bp in size, totaling 227 bp. A total of 25 overlapping nucleotides, ranging from 1 to 10 bp in size, exist among 11 genes. All 13 protein-coding genes are initiated with an ATG. The TAA codon is used as the stop codon in all the protein coding genes except nad3 and nad4 that use TAG as their termination codon. The most frequently used amino acids are Leu (16.29%), Ser (10.34%) and Phe (8.37%). All of the tRNA genes have the potential to fold into typical cloverleaf secondary structures. We also compared the order of the genes in the mitochondrial DNA from the five holothurians that are now available and found a novel gene arrangement in the mitochondrial DNA of Stichopus horrens.
APADB: a database for alternative polyadenylation and microRNA regulation events
Müller, Sören; Rycak, Lukas; Afonso-Grunz, Fabian; Winter, Peter; Zawada, Adam M.; Damrath, Ewa; Scheider, Jessica; Schmäh, Juliane; Koch, Ina; Kahl, Günter; Rotter, Björn
2014-01-01
Alternative polyadenylation (APA) is a widespread mechanism that contributes to the sophisticated dynamics of gene regulation. Approximately 50% of all protein-coding human genes harbor multiple polyadenylation (PA) sites; their selective and combinatorial use gives rise to transcript variants with differing length of their 3′ untranslated region (3′UTR). Shortened variants escape UTR-mediated regulation by microRNAs (miRNAs), especially in cancer, where global 3′UTR shortening accelerates disease progression, dedifferentiation and proliferation. Here we present APADB, a database of vertebrate PA sites determined by 3′ end sequencing, using massive analysis of complementary DNA ends. APADB provides (A)PA sites for coding and non-coding transcripts of human, mouse and chicken genes. For human and mouse, several tissue types, including different cancer specimens, are available. APADB records the loss of predicted miRNA binding sites and visualizes next-generation sequencing reads that support each PA site in a genome browser. The database tables can either be browsed according to organism and tissue or alternatively searched for a gene of interest. APADB is the largest database of APA in human, chicken and mouse. The stored information provides experimental evidence for thousands of PA sites and APA events. APADB combines 3′ end sequencing data with prediction algorithms of miRNA binding sites, allowing to further improve prediction algorithms. Current databases lack correct information about 3′UTR lengths, especially for chicken, and APADB provides necessary information to close this gap. Database URL: http://tools.genxpro.net/apadb/ PMID:25052703
RNA-Seq Based Transcriptional Map of Bovine Respiratory Disease Pathogen “Histophilus somni 2336”
Kumar, Ranjit; Lawrence, Mark L.; Watt, James; Cooksey, Amanda M.; Burgess, Shane C.; Nanduri, Bindu
2012-01-01
Genome structural annotation, i.e., identification and demarcation of the boundaries for all the functional elements in a genome (e.g., genes, non-coding RNAs, proteins and regulatory elements), is a prerequisite for systems level analysis. Current genome annotation programs do not identify all of the functional elements of the genome, especially small non-coding RNAs (sRNAs). Whole genome transcriptome analysis is a complementary method to identify “novel” genes, small RNAs, regulatory regions, and operon structures, thus improving the structural annotation in bacteria. In particular, the identification of non-coding RNAs has revealed their widespread occurrence and functional importance in gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Histophilus somni, one of the causative agents of Bovine Respiratory Disease (BRD) as well as bovine infertility, abortion, septicemia, arthritis, myocarditis, and thrombotic meningoencephalitis. In this study, we report a single nucleotide resolution transcriptome map of H. somni strain 2336 using RNA-Seq method. The RNA-Seq based transcriptome map identified 94 sRNAs in the H. somni genome of which 82 sRNAs were never predicted or reported in earlier studies. We also identified 38 novel potential protein coding open reading frames that were absent in the current genome annotation. The transcriptome map allowed the identification of 278 operon (total 730 genes) structures in the genome. When compared with the genome sequence of a non-virulent strain 129Pt, a disproportionate number of sRNAs (∼30%) were located in genomic region unique to strain 2336 (∼18% of the total genome). This observation suggests that a number of the newly identified sRNAs in strain 2336 may be involved in strain-specific adaptations. PMID:22276113
RNA-seq based transcriptional map of bovine respiratory disease pathogen "Histophilus somni 2336".
Kumar, Ranjit; Lawrence, Mark L; Watt, James; Cooksey, Amanda M; Burgess, Shane C; Nanduri, Bindu
2012-01-01
Genome structural annotation, i.e., identification and demarcation of the boundaries for all the functional elements in a genome (e.g., genes, non-coding RNAs, proteins and regulatory elements), is a prerequisite for systems level analysis. Current genome annotation programs do not identify all of the functional elements of the genome, especially small non-coding RNAs (sRNAs). Whole genome transcriptome analysis is a complementary method to identify "novel" genes, small RNAs, regulatory regions, and operon structures, thus improving the structural annotation in bacteria. In particular, the identification of non-coding RNAs has revealed their widespread occurrence and functional importance in gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Histophilus somni, one of the causative agents of Bovine Respiratory Disease (BRD) as well as bovine infertility, abortion, septicemia, arthritis, myocarditis, and thrombotic meningoencephalitis. In this study, we report a single nucleotide resolution transcriptome map of H. somni strain 2336 using RNA-Seq method.The RNA-Seq based transcriptome map identified 94 sRNAs in the H. somni genome of which 82 sRNAs were never predicted or reported in earlier studies. We also identified 38 novel potential protein coding open reading frames that were absent in the current genome annotation. The transcriptome map allowed the identification of 278 operon (total 730 genes) structures in the genome. When compared with the genome sequence of a non-virulent strain 129Pt, a disproportionate number of sRNAs (∼30%) were located in genomic region unique to strain 2336 (∼18% of the total genome). This observation suggests that a number of the newly identified sRNAs in strain 2336 may be involved in strain-specific adaptations.
Monitoring Autophagy in the Model Green Microalga Chlamydomonas reinhardtii.
Pérez-Pérez, María Esther; Couso, Inmaculada; Heredia-Martínez, Luis G; Crespo, José L
2017-10-22
Autophagy is an intracellular catabolic system that delivers cytoplasmic constituents and organelles in the vacuole. This degradative process is mediated by a group of proteins coded by autophagy-related ( ATG ) genes that are widely conserved from yeasts to plants and mammals. Homologs of ATG genes have been also identified in algal genomes including the unicellular model green alga Chlamydomonas reinhardtii . The development of specific tools to monitor autophagy in Chlamydomonas has expanded our current knowledge about the regulation and function of this process in algae. Recent findings indicated that autophagy is regulated by redox signals and the TOR network in Chlamydomonas and revealed that this process may play in important role in the control of lipid metabolism and ribosomal protein turnover in this alga. Here, we will describe the different techniques and approaches that have been reported to study autophagy and autophagic flux in Chlamydomonas.
Reinhardt, Josephine A.; Wanjiru, Betty M.; Brant, Alicia T.; Saelao, Perot; Begun, David J.; Jones, Corbin D.
2013-01-01
How non-coding DNA gives rise to new protein-coding genes (de novo genes) is not well understood. Recent work has revealed the origins and functions of a few de novo genes, but common principles governing the evolution or biological roles of these genes are unknown. To better define these principles, we performed a parallel analysis of the evolution and function of six putatively protein-coding de novo genes described in Drosophila melanogaster. Reconstruction of the transcriptional history of de novo genes shows that two de novo genes emerged from novel long non-coding RNAs that arose at least 5 MY prior to evolution of an open reading frame. In contrast, four other de novo genes evolved a translated open reading frame and transcription within the same evolutionary interval suggesting that nascent open reading frames (proto-ORFs), while not required, can contribute to the emergence of a new de novo gene. However, none of the genes arose from proto-ORFs that existed long before expression evolved. Sequence and structural evolution of de novo genes was rapid compared to nearby genes and the structural complexity of de novo genes steadily increases over evolutionary time. Despite the fact that these genes are transcribed at a higher level in males than females, and are most strongly expressed in testes, RNAi experiments show that most of these genes are essential in both sexes during metamorphosis. This lethality suggests that protein coding de novo genes in Drosophila quickly become functionally important. PMID:24146629
Maia, Rafaela M; Valente, Valeria; Cunha, Marco A V; Sousa, Josane F; Araujo, Daniela D; Silva, Wilson A; Zago, Marco A; Dias-Neto, Emmanuel; Souza, Sandro J; Simpson, Andrew J G; Monesi, Nadia; Ramos, Ricardo G P; Espreafico, Enilza M; Paçó-Larson, Maria L
2007-07-24
The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that accounts for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of the Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that 1/3 of this is estimated as missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unnanotated regions of Drosophila transcriptome. Bioinformatic analysis of 1,303 Drosophila ORESTES clusters identified 68 sequences derived from unannotated regions in the current Drosophila genome version (4.3). Of these, a set of 38 was analysed by polyA+ northern blot hybridization, validating 17 (50%) new exons of low abundance transcripts. For one of these ESTs, we obtained the cDNA encompassing the complete coding sequence of a new serine protease, named SP212. The SP212 gene is part of a serine protease gene cluster located in the chromosome region 88A12-B1. This cluster includes the predicted genes CG9631, CG9649 and CG31326, which were previously identified as up-regulated after immune challenges in genomic-scale microarray analysis. In agreement with the proposal that this locus is co-regulated in response to microorganisms infection, we show here that SP212 is also up-regulated upon injury. Using the ORESTES methodology we identified 17 novel exons from low abundance Drosophila transcripts, and through a PCR approach the complete CDS of one of these transcripts was defined. Our results show that the computational identification and manual inspection are not sufficient to annotate a genome in the absence of experimentally derived data.
Maia, Rafaela M; Valente, Valeria; Cunha, Marco AV; Sousa, Josane F; Araujo, Daniela D; Silva, Wilson A; Zago, Marco A; Dias-Neto, Emmanuel; Souza, Sandro J; Simpson, Andrew JG; Monesi, Nadia; Ramos, Ricardo GP; Espreafico, Enilza M; Paçó-Larson, Maria L
2007-01-01
Background The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that accounts for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of the Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that 1/3 of this is estimated as missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unnanotated regions of Drosophila transcriptome. Results Bioinformatic analysis of 1,303 Drosophila ORESTES clusters identified 68 sequences derived from unannotated regions in the current Drosophila genome version (4.3). Of these, a set of 38 was analysed by polyA+ northern blot hybridization, validating 17 (50%) new exons of low abundance transcripts. For one of these ESTs, we obtained the cDNA encompassing the complete coding sequence of a new serine protease, named SP212. The SP212 gene is part of a serine protease gene cluster located in the chromosome region 88A12-B1. This cluster includes the predicted genes CG9631, CG9649 and CG31326, which were previously identified as up-regulated after immune challenges in genomic-scale microarray analysis. In agreement with the proposal that this locus is co-regulated in response to microorganisms infection, we show here that SP212 is also up-regulated upon injury. Conclusion Using the ORESTES methodology we identified 17 novel exons from low abundance Drosophila transcripts, and through a PCR approach the complete CDS of one of these transcripts was defined. Our results show that the computational identification and manual inspection are not sufficient to annotate a genome in the absence of experimentally derived data. PMID:17650329
Jaag, Hannah Miriam; Kawchuk, Lawrence; Rohde, Wolfgang; Fischer, Rainer; Emans, Neil; Prüfer, Dirk
2003-01-01
Potato leafroll polerovirus (PLRV) genomic RNA acts as a polycistronic mRNA for the production of proteins P0, P1, and P2 translated from the 5′-proximal half of the genome. Within the P1 coding region we identified a 5-kDa replication-associated protein 1 (Rap1) essential for viral multiplication. An internal ribosome entry site (IRES) with unusual structure and location was identified that regulates Rap1 translation. Core structural elements for internal ribosome entry include a conserved AUG codon and a downstream GGAGAGAGAGG motif with inverted symmetry. Reporter gene expression in potato protoplasts confirmed the internal ribosome entry function. Unlike known IRES motifs, the PLRV IRES is located completely within the coding region of Rap1 at the center of the PLRV genome. PMID:12835413
Ganeshan, Seedhabadee; Sharma, Pallavi; Young, Lester; Kumar, Ashwani; Fowler, D Brian; Chibbar, Ravindra N
2011-03-01
Low-temperature (LT) tolerance in winter wheat (Triticum aestivum L.) is an economically important but complex trait. Four selected wheat genotypes, a winter hardy cultivar, Norstar, a tender spring cultivar, Manitou and two near-isogenic lines with Vrn-A1 (spring Norstar) and vrn-A1 (winter Manitou) alleles of Manitou and Norstar were cold-acclimated at 6°C and crown and leaf tissues were collected at 0, 2, 14, 21, 35, 42, 56 and 70 days of cold acclimation. cDNA-AFLP profiling was used to determine temporal expression profiles of transcripts during cold-acclimation in crown and leaf tissues, separately to determine if LT regulatory circuitries in crown and leaf tissues could be delineated using this approach. Screening 64 primer combinations identified 4,074 and 2,757 differentially expressed transcript-derived fragments (TDFs) out of which 38 and 16% were up-regulated as compared to 3 and 6% that were down-regulated in crown and leaf tissues, respectively. DNA sequencing of TDFs revealed sequences common to both tissues including genes coding for DEAD-box RNA helicase, choline-phosphate cytidylyltransferase and delta-1-pyrroline carboxylate synthetase. TDF specific to crown tissues included genes coding for phospahtidylinositol kinase, auxin response factor protein and brassinosteroid insensitive 1-associated receptor kinase. In leaf, genes such as methylene tetrahydrofolate reductase, NADH-cytochrome b5 reductase and malate dehydrogenase were identified. However, 30 and 14% of the DNA sequences from the crown and leaf tissues, respectively, were hypothetical or unknown proteins. Cluster analysis of up-, down-regulated and unique TDFs, DNA sequence and real-time PCR validation, infer that mechanisms operating in crown and leaf tissue in response to LT are differently regulated and warrant further studies.
Raju, Hemalatha B; Tsinoremas, Nicholas F; Capobianco, Enrico
2016-01-01
Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly identified in protein-protein interaction networks, other connectivity paths were identified between proteins already investigated in studies on disorders, such as Parkinson, Down syndrome, Huntington disease, and Alzheimer. Our findings suggest the importance of reusing gene expression data by meta-analysis approaches.
Disruption of long-distance highly conserved noncoding elements in neurocristopathies.
Amiel, Jeanne; Benko, Sabina; Gordon, Christopher T; Lyonnet, Stanislas
2010-12-01
One of the key discoveries of vertebrate genome sequencing projects has been the identification of highly conserved noncoding elements (CNEs). Some characteristics of CNEs include their high frequency in mammalian genomes, their potential regulatory role in gene expression, and their enrichment in gene deserts nearby master developmental genes. The abnormal development of neural crest cells (NCCs) leads to a broad spectrum of congenital malformation(s), termed neurocristopathies, and/or tumor predisposition. Here we review recent findings that disruptions of CNEs, within or at long distance from the coding sequences of key genes involved in NCC development, result in neurocristopathies via the alteration of tissue- or stage-specific long-distance regulation of gene expression. While most studies on human genetic disorders have focused on protein-coding sequences, these examples suggest that investigation of genomic alterations of CNEs will provide a broader understanding of the molecular etiology of both rare and common human congenital malformations. © 2010 New York Academy of Sciences.
Global analysis of the Burkholderia thailandensis quorum sensing-controlled regulon.
Majerczyk, Charlotte; Brittnacher, Mitchell; Jacobs, Michael; Armour, Christopher D; Radey, Mathew; Schneider, Emily; Phattarasokul, Somsak; Bunt, Richard; Greenberg, E Peter
2014-04-01
Burkholderia thailandensis contains three acyl-homoserine lactone quorum sensing circuits and has two additional LuxR homologs. To identify B. thailandensis quorum sensing-controlled genes, we carried out transcriptome sequencing (RNA-seq) analyses of quorum sensing mutants and their parent. The analyses were grounded in the fact that we identified genes coding for factors shown previously to be regulated by quorum sensing among a larger set of quorum-controlled genes. We also found that genes coding for contact-dependent inhibition were induced by quorum sensing and confirmed that specific quorum sensing mutants had a contact-dependent inhibition defect. Additional quorum-controlled genes included those for the production of numerous secondary metabolites, an uncharacterized exopolysaccharide, and a predicted chitin-binding protein. This study provides insights into the roles of the three quorum sensing circuits in the saprophytic lifestyle of B. thailandensis, and it provides a foundation on which to build an understanding of the roles of quorum sensing in the biology of B. thailandensis and the closely related pathogenic Burkholderia pseudomallei and Burkholderia mallei.
The metazoan Mediator co-activator complex as an integrative hub for transcriptional regulation.
Malik, Sohail; Roeder, Robert G
2010-11-01
The Mediator is an evolutionarily conserved, multiprotein complex that is a key regulator of protein-coding genes. In metazoan cells, multiple pathways that are responsible for homeostasis, cell growth and differentiation converge on the Mediator through transcriptional activators and repressors that target one or more of the almost 30 subunits of this complex. Besides interacting directly with RNA polymerase II, Mediator has multiple functions and can interact with and coordinate the action of numerous other co-activators and co-repressors, including those acting at the level of chromatin. These interactions ultimately allow the Mediator to deliver outputs that range from maximal activation of genes to modulation of basal transcription to long-term epigenetic silencing.
Complexity of the Alternative Splicing Landscape in Plants[C][W][OPEN
Reddy, Anireddy S.N.; Marquez, Yamile; Kalyna, Maria; Barta, Andrea
2013-01-01
Alternative splicing (AS) of precursor mRNAs (pre-mRNAs) from multiexon genes allows organisms to increase their coding potential and regulate gene expression through multiple mechanisms. Recent transcriptome-wide analysis of AS using RNA sequencing has revealed that AS is highly pervasive in plants. Pre-mRNAs from over 60% of intron-containing genes undergo AS to produce a vast repertoire of mRNA isoforms. The functions of most splice variants are unknown. However, emerging evidence indicates that splice variants increase the functional diversity of proteins. Furthermore, AS is coupled to transcript stability and translation through nonsense-mediated decay and microRNA-mediated gene regulation. Widespread changes in AS in response to developmental cues and stresses suggest a role for regulated splicing in plant development and stress responses. Here, we review recent progress in uncovering the extent and complexity of the AS landscape in plants, its regulation, and the roles of AS in gene regulation. The prevalence of AS in plants has raised many new questions that require additional studies. New tools based on recent technological advances are allowing genome-wide analysis of RNA elements in transcripts and of chromatin modifications that regulate AS. Application of these tools in plants will provide significant new insights into AS regulation and crosstalk between AS and other layers of gene regulation. PMID:24179125
miRNA as a New Regulatory Mechanism of Estrogen Vascular Action.
Pérez-Cremades, Daniel; Mompeón, Ana; Vidal-Gómez, Xavier; Hermenegildo, Carlos; Novella, Susana
2018-02-06
The beneficial effects of estrogen on the cardiovascular system have been reported extensively. In fact, the incidence of cardiovascular diseases in women is lower than in age-matched men during their fertile stage of life, a benefit that disappears after menopause. These sex-related differences point to sexual hormones, mainly estrogen, as possible cardiovascular protective factors. The regulation of vascular function by estrogen is mainly related to the maintenance of normal endothelial function and is mediated by both direct and indirect gene transcription through the activity of specific estrogen receptors. Some of these mechanisms are known, but many remain to be elucidated. In recent years, microRNAs have been established as non-coding RNAs that regulate the expression of a high percentage of protein-coding genes in mammals and are related to the correct function of human physiology. Moreover, within the cardiovascular system, miRNAs have been related to physiological and pathological conditions. In this review, we address what is known about the role of estrogen-regulated miRNAs and their emerging involvement in vascular biology.
Goalpha regulates volatile anesthetic action in Caenorhabditis elegans.
van Swinderen, B; Metz, L B; Shebester, L D; Mendel, J E; Sternberg, P W; Crowder, C M
2001-01-01
To identify genes controlling volatile anesthetic (VA) action, we have screened through existing Caenorhabditis elegans mutants and found that strains with a reduction in Go signaling are VA resistant. Loss-of-function mutants of the gene goa-1, which codes for the alpha-subunit of Go, have EC(50)s for the VA isoflurane of 1.7- to 2.4-fold that of wild type. Strains overexpressing egl-10, which codes for an RGS protein negatively regulating goa-1, are also isoflurane resistant. However, sensitivity to halothane, a structurally distinct VA, is differentially affected by Go pathway mutants. The RGS overexpressing strains, a goa-1 missense mutant found to carry a novel mutation near the GTP-binding domain, and eat-16(rf) mutants, which suppress goa-1(gf) mutations, are all halothane resistant; goa-1(null) mutants have wild-type sensitivities. Double mutant strains carrying mutations in both goa-1 and unc-64, which codes for a neuronal syntaxin previously found to regulate VA sensitivity, show that the syntaxin mutant phenotypes depend in part on goa-1 expression. Pharmacological assays using the cholinesterase inhibitor aldicarb suggest that VAs and GOA-1 similarly downregulate cholinergic neurotransmitter release in C. elegans. Thus, the mechanism of action of VAs in C. elegans is regulated by Goalpha, and presynaptic Goalpha-effectors are candidate VA molecular targets. PMID:11404329
Stenz, Ludwig; Escoffier, Jessica; Rahban, Rita; Nef, Serge; Paoloni-Giacobino, Ariane
2017-01-01
The endocrine disruptor bis(2-ethylhexyl) phthalate (DEHP) has been shown to exert adverse effects on the male animal reproductive system. However, its mode of action is unclear and a systematic analysis of its molecular targets is needed. In the present study, we investigated the effects of prenatal exposure to 300 mg/kg/day DEHP during a critical period for gonads differentiation to testes on male mice offspring reproductive parameters, including the genome-wide RNA expression and associated promoter methylation status in the sperm of the first filial generation. It was observed that adult male offspring displayed symptoms similar to the human testicular dysgenesis syndrome. A combination of sperm transcriptome and methylome data analysis allowed to detect a long-lasting DEHP-induced and robust promoter methylation-associated silencing of almost the entire cluster of the seminal vesicle secretory proteins and antigen genes, which are known to play a fundamental role in sperm physiology. It also resulted in the detection of a DEHP-induced promoter demethylation associated with an up-regulation of three genes apparently not relevant for sperm physiology and partially related to the immune system. As previously reported, DEHP induced an increase in mir-615 microRNA expression and a genome-wide decrease in microRNA promoter methylation. A functional analysis revealed DEHP-induced enrichments in down-regulated gene transcripts coding for peroxisome proliferator-activated receptors and tumor necrosis factor signaling pathways, and in up-regulated gene transcripts coding for calcium binding and numerous myosin proteins. All these enriched pathways and networks have been described to be associated in some way with the reproductive system. This study identifies a large new array of genes dysregulated by DEHP that may play a role in the complex system controlling the development of the male reproductive system.
Tissue- and Time-Specific Expression of Otherwise Identical tRNA Genes
Adir, Idan; Dahan, Orna; Broday, Limor; Pilpel, Yitzhak; Rechavi, Oded
2016-01-01
Codon usage bias affects protein translation because tRNAs that recognize synonymous codons differ in their abundance. Although the current dogma states that tRNA expression is exclusively regulated by intrinsic control elements (A- and B-box sequences), we revealed, using a reporter that monitors the levels of individual tRNA genes in Caenorhabditis elegans, that eight tryptophan tRNA genes, 100% identical in sequence, are expressed in different tissues and change their expression dynamically. Furthermore, the expression levels of the sup-7 tRNA gene at day 6 were found to predict the animal’s lifespan. We discovered that the expression of tRNAs that reside within introns of protein-coding genes is affected by the host gene’s promoter. Pairing between specific Pol II genes and the tRNAs that are contained in their introns is most likely adaptive, since a genome-wide analysis revealed that the presence of specific intronic tRNAs within specific orthologous genes is conserved across Caenorhabditis species. PMID:27560950
Rozenchan, Patricia Bortman; Carraro, Dirce Maria; Brentani, Helena; de Carvalho Mota, Louise Danielle; Bastos, Elen Pereira; e Ferreira, Elisa Napolitano; Torres, Cesar H; Katayama, Maria Lúcia Hirata; Roela, Rosimeire Aparecida; Lyra, Eduardo C; Soares, Fernando Augusto; Folgueira, Maria Aparecida Azevedo Koike; Góes, João Carlos Guedes Sampaio; Brentani, Maria Mitzi
2009-12-15
The importance of epithelial-stroma interaction in normal breast development and tumor progression has been recognized. To identify genes that were regulated by these reciprocal interactions, we cocultured a nonmalignant (MCF10A) and a breast cancer derived (MDA-MB231) basal cell lines, with fibroblasts isolated from breast benign-disease adjacent tissues (NAF) or with carcinoma-associated fibroblasts (CAF), in a transwell system. Gene expression profiles of each coculture pair were compared with the correspondent monocultures, using a customized microarray. Contrariwise to large alterations in epithelial cells genomic profiles, fibroblasts were less affected. In MDA-MB231 highly represented genes downregulated by CAF derived factors coded for proteins important for the specificity of vectorial transport between ER and golgi, possibly affecting cell polarity whereas the response of MCF10A comprised an induction of genes coding for stress responsive proteins, representing a prosurvival effect. While NAF downregulated genes encoding proteins associated to glycolipid and fatty acid biosynthesis in MDA-MB231, potentially affecting membrane biogenesis, in MCF10A, genes critical for growth control and adhesion were altered. NAFs responded to coculture with MDA-MB231 by a decrease in the expression of genes induced by TGFbeta1 and associated to motility. However, there was little change in NAFs gene expression profile influenced by MCF10A. CAFs responded to the presence of both epithelial cells inducing genes implicated in cell proliferation. Our data indicate that interactions between breast fibroblasts and basal epithelial cells resulted in alterations in the genomic profiles of both cell types which may help to clarify some aspects of this heterotypic signaling. Copyright (c) 2009 UICC.
McTavish, H; LaQuier, F; Arciero, D; Logan, M; Mundfrom, G; Fuchs, J A; Hooper, A B
1993-04-01
The genome of Nitrosomonas europaea contains at least three copies each of the genes coding for hydroxylamine oxidoreductase (HAO) and cytochrome c554. A copy of an HAO gene is always located within 2.7 kb of a copy of a cytochrome c554 gene. Cytochrome P-460, a protein that shares very unusual spectral features with HAO, was found to be encoded by a gene separate from the HAO genes.
Marsolais, Frédéric
2012-01-01
The lack of phaseolin and phytohaemagglutinin in common bean (dry bean, Phaseolus vulgaris) is associated with an increase in total cysteine and methionine concentrations by 70% and 10%, respectively, mainly at the expense of an abundant non-protein amino acid, S-methyl-cysteine. Transcripts were profiled between two genetically related lines differing for this trait at four stages of seed development using a high density microarray designed for common bean. Transcripts of multiple sulphur-rich proteins were elevated, several previously identified by proteomics, including legumin, basic 7S globulin, albumin-2, defensin, albumin-1, the Bowman–Birk type proteinase inhibitor, the double-headed trypsin inhibitor, and the Kunitz trypsin inhibitor. A co-ordinated regulation of transcripts coding for sulphate transporters, sulphate assimilatory enzymes, serine acetyltransferases, cystathionine β-lyase, homocysteine S-methyltransferase and methionine gamma-lyase was associated with changes in cysteine and methionine concentrations. Differential gene expression of sulphur-rich proteins preceded that of sulphur metabolic enzymes, suggesting a regulation by demand from the protein sink. Up-regulation of SERAT1;1 and -1;2 expression revealed an activation of cytosolic O-acetylserine biosynthesis. Down-regulation of SERAT2;1 suggested that cysteine and S-methyl-cysteine biosynthesis may be spatially separated in different subcellular compartments. Analysis of free amino acid profiles indicated that enhanced cysteine biosynthesis was correlated with a depletion of O-acetylserine. These results contribute to our understanding of the regulation of sulphur metabolism in developing seed in response to a change in the composition of endogenous proteins. PMID:23066144
Liao, Dengqun; Pajak, Agnieszka; Karcz, Steven R; Chapman, B Patrick; Sharpe, Andrew G; Austin, Ryan S; Datla, Raju; Dhaubhadel, Sangeeta; Marsolais, Frédéric
2012-10-01
The lack of phaseolin and phytohaemagglutinin in common bean (dry bean, Phaseolus vulgaris) is associated with an increase in total cysteine and methionine concentrations by 70% and 10%, respectively, mainly at the expense of an abundant non-protein amino acid, S-methyl-cysteine. Transcripts were profiled between two genetically related lines differing for this trait at four stages of seed development using a high density microarray designed for common bean. Transcripts of multiple sulphur-rich proteins were elevated, several previously identified by proteomics, including legumin, basic 7S globulin, albumin-2, defensin, albumin-1, the Bowman-Birk type proteinase inhibitor, the double-headed trypsin inhibitor, and the Kunitz trypsin inhibitor. A co-ordinated regulation of transcripts coding for sulphate transporters, sulphate assimilatory enzymes, serine acetyltransferases, cystathionine β-lyase, homocysteine S-methyltransferase and methionine gamma-lyase was associated with changes in cysteine and methionine concentrations. Differential gene expression of sulphur-rich proteins preceded that of sulphur metabolic enzymes, suggesting a regulation by demand from the protein sink. Up-regulation of SERAT1;1 and -1;2 expression revealed an activation of cytosolic O-acetylserine biosynthesis. Down-regulation of SERAT2;1 suggested that cysteine and S-methyl-cysteine biosynthesis may be spatially separated in different subcellular compartments. Analysis of free amino acid profiles indicated that enhanced cysteine biosynthesis was correlated with a depletion of O-acetylserine. These results contribute to our understanding of the regulation of sulphur metabolism in developing seed in response to a change in the composition of endogenous proteins.
Divergent transcription is associated with promoters of transcriptional regulators
2013-01-01
Background Divergent transcription is a wide-spread phenomenon in mammals. For instance, short bidirectional transcripts are a hallmark of active promoters, while longer transcripts can be detected antisense from active genes in conditions where the RNA degradation machinery is inhibited. Moreover, many described long non-coding RNAs (lncRNAs) are transcribed antisense from coding gene promoters. However, the general significance of divergent lncRNA/mRNA gene pair transcription is still poorly understood. Here, we used strand-specific RNA-seq with high sequencing depth to thoroughly identify antisense transcripts from coding gene promoters in primary mouse tissues. Results We found that a substantial fraction of coding-gene promoters sustain divergent transcription of long non-coding RNA (lncRNA)/mRNA gene pairs. Strikingly, upstream antisense transcription is significantly associated with genes related to transcriptional regulation and development. Their promoters share several characteristics with those of transcriptional developmental genes, including very large CpG islands, high degree of conservation and epigenetic regulation in ES cells. In-depth analysis revealed a unique GC skew profile at these promoter regions, while the associated coding genes were found to have large first exons, two genomic features that might enforce bidirectional transcription. Finally, genes associated with antisense transcription harbor specific H3K79me2 epigenetic marking and RNA polymerase II enrichment profiles linked to an intensified rate of early transcriptional elongation. Conclusions We concluded that promoters of a class of transcription regulators are characterized by a specialized transcriptional control mechanism, which is directly coupled to relaxed bidirectional transcription. PMID:24365181
McGuire, Austen B; Rafi, Syed K; Manzardo, Ann M; Butler, Merlin G
2016-05-05
Mammalian chromosomes are comprised of complex chromatin architecture with the specific assembly and configuration of each chromosome influencing gene expression and function in yet undefined ways by varying degrees of heterochromatinization that result in Giemsa (G) negative euchromatic (light) bands and G-positive heterochromatic (dark) bands. We carried out morphometric measurements of high-resolution chromosome ideograms for the first time to characterize the total euchromatic and heterochromatic chromosome band length, distribution and localization of 20,145 known protein-coding genes, 790 recognized autism spectrum disorder (ASD) genes and 365 obesity genes. The individual lengths of G-negative euchromatin and G-positive heterochromatin chromosome bands were measured in millimeters and recorded from scaled and stacked digital images of 850-band high-resolution ideograms supplied by the International Society of Chromosome Nomenclature (ISCN) 2013. Our overall measurements followed established banding patterns based on chromosome size. G-negative euchromatic band regions contained 60% of protein-coding genes while the remaining 40% were distributed across the four heterochromatic dark band sub-types. ASD genes were disproportionately overrepresented in the darker heterochromatic sub-bands, while the obesity gene distribution pattern did not significantly differ from protein-coding genes. Our study supports recent trends implicating genes located in heterochromatin regions playing a role in biological processes including neurodevelopment and function, specifically genes associated with ASD.
Fujisawa, Takatomo; Narikawa, Rei; Okamoto, Shinobu; Ehira, Shigeki; Yoshimura, Hidehisa; Suzuki, Iwane; Masuda, Tatsuru; Mochimaru, Mari; Takaichi, Shinichi; Awai, Koichiro; Sekine, Mitsuo; Horikawa, Hiroshi; Yashiro, Isao; Omata, Seiha; Takarada, Hiromi; Katano, Yoko; Kosugi, Hiroki; Tanikawa, Satoshi; Ohmori, Kazuko; Sato, Naoki; Ikeuchi, Masahiko; Fujita, Nobuyuki; Ohmori, Masayuki
2010-01-01
A filamentous non-N2-fixing cyanobacterium, Arthrospira (Spirulina) platensis, is an important organism for industrial applications and as a food supply. Almost the complete genome of A. platensis NIES-39 was determined in this study. The genome structure of A. platensis is estimated to be a single, circular chromosome of 6.8 Mb, based on optical mapping. Annotation of this 6.7 Mb sequence yielded 6630 protein-coding genes as well as two sets of rRNA genes and 40 tRNA genes. Of the protein-coding genes, 78% are similar to those of other organisms; the remaining 22% are currently unknown. A total 612 kb of the genome comprise group II introns, insertion sequences and some repetitive elements. Group I introns are located in a protein-coding region. Abundant restriction-modification systems were determined. Unique features in the gene composition were noted, particularly in a large number of genes for adenylate cyclase and haemolysin-like Ca2+-binding proteins and in chemotaxis proteins. Filament-specific genes were highlighted by comparative genomic analysis. PMID:20203057
DOE Office of Scientific and Technical Information (OSTI.GOV)
Steinkasserer, A.; Koettnitz, K.; Hauber, J.
1995-02-10
The eukaryotic initiation factor 5A (eIF-5A) has been identified as an essential cofactor for the HIV-1 transactivator protein Rev. Rev plays a key role in the complex regulation of HIV-1 gene expression and thereby in the generation of infectious virus particles. Expression of eIF-5A is vital for Rev function, and inhibition of this interaction leads to a block of the viral replication cycle. In humans, four different eIF-5A genes have been identified. One codes for the eIF-5A protein and the other three are pseudogenes. Using a panel of somatic rodent-human cell hybrids in combination with fluorescence in situ hybridization analysis,more » we show that the four genes map to three different chromosomes. The coding eIF-5A gene (EIF5A) maps to 17p12-p13, and the three pseudogenes EIF5AP1, EIF5AP2, and EIF5AP3 map to 10q23.3, 17q25, and 19q13.2, respectively. This is the first localization report for a eukaryotic cofactor for a regulatory HIV-1 protein. 16 refs., 1 fig.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eipers, P.G.
1992-01-01
The gene for the human p58[sup clk[minus]1] protein kinase, a cell division control-related gene, has been mapped by somatic cell hybrid analyses, in situ localization with the chromosomal gene, and nested polymerase chain reaction amplification of microdissected chromosomes. These studies indicate that the expressed p58[sup clk[minus]1] chromosomal gene maps to 1p36, while a highly related p58[sup clk[minus]1] sequence of unknown nature maps to chromosome 15. Assignment of a p34[sup cdc2]-related gene to 1p36 region, including neuroblastoma, ductal carcinoma of the breast, malignant melanoma, Merkel cell carcinoma and endocrine neoplasia among others. Aberrant expression of this protein kinase negatively regulates normalmore » cellular growth. The p58[sup clk[minus]1] protein contains a central domain of 299 amino acids that is 46% identical to human p34[sup cdc2], the master mitotic protein kinase. This dissertation details the complete structure of the p58[sup clk[minus]1] chromosomal gene, including its putative promoter region, transcriptional start sites, exonic sequences, and intron/exon boundary sequences. The gene is 10 kb in size and contains 12 exons and 11 introns. Interestingly, the rather large 2.0 kb 3[prime] untranslated region is interrupted by an intron that separates a region containing numerous AUUUA destabilization motifs from the coding region. Furthermore, the expression of this gene in normal human tissues, as well as several human tumor cell samples and lines, is examined. The origin of multiple human transcripts from the same chromosomal gene, and the possible differential stability of these various transcripts, is discussed with regard to the transcriptional and post-transcriptional regulation of this gene. This is the first report of the chromosomal gene structure of a member of the p34[sup cdc2] supergene family.« less
miRNA-dependent gene silencing involving Ago2-mediated cleavage of a circular antisense RNA
Hansen, Thomas B; Wiklund, Erik D; Bramsen, Jesper B; Villadsen, Sune B; Statham, Aaron L; Clark, Susan J; Kjems, Jørgen
2011-01-01
MicroRNAs (miRNAs) are ∼22 nt non-coding RNAs that typically bind to the 3′ UTR of target mRNAs in the cytoplasm, resulting in mRNA destabilization and translational repression. Here, we report that miRNAs can also regulate gene expression by targeting non-coding antisense transcripts in human cells. Specifically, we show that miR-671 directs cleavage of a circular antisense transcript of the Cerebellar Degeneration-Related protein 1 (CDR1) locus in an Ago2-slicer-dependent manner. The resulting downregulation of circular antisense has a concomitant decrease in CDR1 mRNA levels, independently of heterochromatin formation. This study provides the first evidence for non-coding antisense transcripts as functional miRNA targets, and a novel regulatory mechanism involving a positive correlation between mRNA and antisense circular RNA levels. PMID:21964070
Neil, H; Lemaire, M; Wésolowski-Louvel, M
2004-03-01
In Kluyveromyces lactis, the casein kinase I (Rag8p) regulates the transcription of glycolytic genes and the expression of the low-affinity glucose transporter gene RAG1. This control involves the transcription factor Sck1p, a homologue of Sgc1p of Saccharomyces cerevisiae. SGC1 is known to interact genetically with ScGCR1 and ScGCR2, which code for regulators of glycolytic gene expression. Therefore, we studied the role of KlGCR1 and KlGCR2 genes in K. lactis. The Klgcr1 null mutant could not grow on glucose when respiration was blocked by antimycin A (Rag(- )phenotype). In contrast, the Klgcr2 null mutant could grow under the same conditions, although at a reduced rate. In both mutants, the transcription of glycolytic genes was affected, while that of ribosomal protein genes was not modified. Furthermore, the transcription of the glucose permease genes was also found to be affected in the two mutants, although dissimilarly. While RAG1 transcription decreased at high glucose concentrations, the expression of the high-affinity glucose permease gene HGT1 was unexpectedly impaired under gluconeogenic conditions, in the absence of glucose. Gel mobility shift assays performed with purified maltose-binding protein-KlGcr1p showed that KlGcr1p could interact directly with the promoters of the glycolytic genes, but not with the promoters of the glucose permease genes. Thus, the control exerted by KlGcr1p and KlGcr2p upon glucose transporter genes is probably indirect.
Ancona, Veronica; Lee, Jae Hoon; Zhao, Youfu
2016-01-01
The GacS/GacA two-component system (also called GrrS/GrrA) is a global regulatory system which is highly conserved among gamma-proteobacteria. This system positively regulates non-coding small regulatory RNA csrB, which in turn binds to the RNA-binding protein CsrA. However, how GacS/GacA-Csr system regulates virulence traits in E. amylovora remains unknown. Results from mutant characterization showed that the csrB mutant was hypermotile, produced higher amount of exopolysaccharide amylovoran, and had increased expression of type III secretion (T3SS) genes in vitro. In contrast, the csrA mutant exhibited complete opposite phenotypes, including non-motile, reduced amylovoran production and expression of T3SS genes. Furthermore, the csrA mutant did not induce hypersensitive response on tobacco or cause disease on immature pear fruits, indicating that CsrA is a positive regulator of virulence factors. These findings demonstrated that CsrA plays a critical role in E. amylovora virulence and suggested that negative regulation of virulence by GacS/GacA acts through csrB sRNA, which binds to CsrA and neutralizes its positive effect on T3SS gene expression, flagellar formation and amylovoran production. Future research will be focused on determining the molecular mechanism underlying the positive regulation of virulence traits by CsrA. PMID:27845410
Ancona, Veronica; Lee, Jae Hoon; Zhao, Youfu
2016-11-15
The GacS/GacA two-component system (also called GrrS/GrrA) is a global regulatory system which is highly conserved among gamma-proteobacteria. This system positively regulates non-coding small regulatory RNA csrB, which in turn binds to the RNA-binding protein CsrA. However, how GacS/GacA-Csr system regulates virulence traits in E. amylovora remains unknown. Results from mutant characterization showed that the csrB mutant was hypermotile, produced higher amount of exopolysaccharide amylovoran, and had increased expression of type III secretion (T3SS) genes in vitro. In contrast, the csrA mutant exhibited complete opposite phenotypes, including non-motile, reduced amylovoran production and expression of T3SS genes. Furthermore, the csrA mutant did not induce hypersensitive response on tobacco or cause disease on immature pear fruits, indicating that CsrA is a positive regulator of virulence factors. These findings demonstrated that CsrA plays a critical role in E. amylovora virulence and suggested that negative regulation of virulence by GacS/GacA acts through csrB sRNA, which binds to CsrA and neutralizes its positive effect on T3SS gene expression, flagellar formation and amylovoran production. Future research will be focused on determining the molecular mechanism underlying the positive regulation of virulence traits by CsrA.
Cullingford, Timothy E; Markou, Thomais; Fuller, Stephen J; Giraldo, Alejandro; Pikkarainen, Sampsa; Zoumpoulidou, Georgia; Alsafi, Ali; Ekere, Collins; Kemp, Timothy J; Dennis, Jayne L; Game, Laurence; Sugden, Peter H; Clerk, Angela
2008-01-01
Background Endothelin-1 stimulates Gq protein-coupled receptors to promote proliferation in dividing cells or hypertrophy in terminally differentiated cardiomyocytes. In cardiomyocytes, endothelin-1 rapidly (within minutes) stimulates protein kinase signaling, including extracellular-signal regulated kinases 1/2 (ERK1/2; though not ERK5), with phenotypic/physiological changes developing from approximately 12 h. Hypertrophy is associated with changes in mRNA/protein expression, presumably consequent to protein kinase signaling, but the connections between early, transient signaling events and developed hypertrophy are unknown. Results Using microarrays, we defined the early transcriptional responses of neonatal rat cardiomyocytes to endothelin-1 over 4 h, differentiating between immediate early gene (IEG) and second phase RNAs with cycloheximide. IEGs exhibited differential temporal and transient regulation, with expression of second phase RNAs within 1 h. Of transcripts upregulated at 30 minutes encoding established proteins, 28 were inhibited >50% by U0126 (which inhibits ERK1/2/5 signaling), with 9 inhibited 25-50%. Expression of only four transcripts was not inhibited. At 1 h, most RNAs (approximately 67%) were equally changed in total and polysomal RNA with approximately 17% of transcripts increased to a greater extent in polysomes. Thus, changes in expression of most protein-coding RNAs should be reflected in protein synthesis. However, approximately 16% of transcripts were essentially excluded from the polysomes, including some protein-coding mRNAs, presumably inefficiently translated. Conclusion The phasic, temporal regulation of early transcriptional responses induced by endothelin-1 in cardiomyocytes indicates that, even in terminally differentiated cells, signals are propagated beyond the primary signaling pathways through transcriptional networks leading to phenotypic changes (that is, hypertrophy). Furthermore, ERK1/2 signaling plays a major role in this response. PMID:18275597
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2015-01-01
Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.
Epigenetic Regulation of Transcription in Trypanosomatid Protozoa.
Martínez-Calvillo, Santiago; Romero-Meza, Gabriela; Vizuet-de-Rueda, Juan C; Florencio-Martínez, Luis E; Manning-Cela, Rebeca; Nepomuceno-Mejía, Tomás
2018-02-01
The Trypanosomatid family includes flagellated parasites that cause fatal human diseases. Remarkably, protein-coding genes in these organisms are positioned in long tandem arrays that are transcribed polycistronically. However, the knowledge about regulation of transcription initiation and termination in trypanosomatids is scarce. The importance of epigenetic regulation in these processes has become evident in the last years, as distinctive histone modifications and histone variants have been found in transcription initiation and termination regions. Moreover, multiple chromatin-related proteins have been identified and characterized in trypanosomatids, including histone-modifying enzymes, effector complexes, chromatin-remodelling enzymes and histone chaperones. Notably, base J, a modified thymine residue present in the nuclear DNA of trypanosomatids, has been implicated in transcriptional regulation. Here we review the current knowledge on epigenetic control of transcription by all three RNA polymerases in this group of early-diverged eukaryotes.
A discovery of novel microRNAs in the silkworm (Bombyx mori) genome.
Yu, Xiaomin; Zhou, Qing; Cai, Yimei; Luo, Qibin; Lin, Hongbin; Hu, Songnian; Yu, Jun
2009-12-01
MicroRNAs (miRNAs) are pivotal regulators involved in various physiological and pathological processes via their post-transcriptional regulation of gene expressions. We sequenced 14 libraries of small RNAs constructed from samples spanning the life cycle of silkworms, and discovered 50 novel miRNAs previously not known in animals and verified 43 of them using stem-loop RT-PCR. Our genome-wide analyses of 27 species-specific miRNAs suggest they arise from transposable elements, protein-coding genes duplication/transposition and random foldback sequences; which is consistent with the idea that novel animal miRNAs may evolve from incomplete self-complementary transcripts and become fixed in the process of co-adaptation with their targets. Computational prediction suggests that the silkworm-specific miRNAs may have a preference of regulating genes that are related to life-cycle-associated traits, and these genes can serve as potential targets for subsequent studies of the modulating networks in the development of Bombyx mori.
NASA Astrophysics Data System (ADS)
Babbick, Maren; Hampp, Rudiger
2005-08-01
Callus cultures of Arabidopsis thaliana (cv. Columbia) were used to screen for early changes in gene expression in response to altered gravitational fields. In a recent microarray study we found hyper- g dependent changes in gene expression which indicated the involvement of WRKY genes [Martzivanou M. and Hampp R., Physiol. Plant., 118, 221-231,2003]. WRKY genes code for a family of plant-specific regulators of gene expression. In this study we report on the exposure of Arabidopsis callus cultures to 8g for up to 30 min. Quantitative analysis by real time RT-PCR of the amount of transcripts of WRKYs 3, 6, 22, 46, 65 and 70 showed individual changes in expression. As far as their function is known, these WRKY proteins are mainly involved in stress responses. As most alterations in transcript amount occurred within 10 min of treatment, such genes can be used for the investigation of microgravity-related effects on gene expression under sounding rocket conditions (TEXUS, MAXUS).
Averina, O V; Nezametdinova, V Z; Alekseeva, M G; Danilenko, V N
2012-11-01
The stability of inheriting several genes in the Russian commercial strain Bifidobacterium longum subsp. longum B379M during cultivation and maintenance under laboratory conditions has been studied. The examined genes code for probiotic characteristics, such as utilization of several sugars (lacA2 gene, encoding beta-galactosidase; ara gene, encoding arabinosidase; and galA gene, encoding arabinogalactan endo-beta-galactosidase); synthesis of bacteriocins (lans gene, encoding lanthionine synthetase); and mobile gene tet(W), conferring resistance to the antibiotic tetracycline. The other gene families studied include the genes responsible for signal transduction and adaptation to stress conditions in the majority of bacteria (serine/threonine protein kinases and the toxin-antitoxin systems of MazEF and RelBE types) and transcription regulators (genes encoding WhiB family proteins). Genomic DNA was analyzed by PCR using specially selected primers. A loss of the genes galA and tet(W) has been shown. It is proposed to expand the requirements on probiotic strains, namely, to control retention of the key probiotic genes using molecular biological methods.
Analysis of the Genome of the Sexually Transmitted Insect Virus Helicoverpa zea Nudivirus 2
Burand, John P.; Kim, Woojin; Afonso, Claudio L.; Tulman, Edan R.; Kutish, Gerald F.; Lu, Zhiqiang; Rock, Daniel L.
2012-01-01
The sexually transmitted insect virus Helicoverpa zea nudivirus 2 (HzNV-2) was determined to have a circular double-stranded DNA genome of 231,621 bp coding for an estimated 113 open reading frames (ORFs). HzNV-2 is most closely related to the nudiviruses, a sister group of the insect baculoviruses. Several putative ORFs that share homology with the baculovirus core genes were identified in the viral genome. However, HzNV-2 lacks several key genetic features of baculoviruses including the late transcriptional regulation factor, LEF-1 and the palindromic hrs, which serve as origins of replication. The HzNV-2 genome was found to code for three ORFs that had significant sequence homology to cellular genes which are not generally found in viral genomes. These included a presumed juvenile hormone esterase gene, a gene coding for a putative zinc-dependent matrix metalloprotease, and a major facilitator superfamily protein gene; all of which are believed to play a role in the cellular proliferation and the tissue hypertrophy observed in the malformation of reproductive organs observed in HzNV-2 infected corn earworm moths, Helicoverpa zea. PMID:22355451
Analysis of the genome of the sexually transmitted insect virus Helicoverpa zea nudivirus 2.
Burand, John P; Kim, Woojin; Afonso, Claudio L; Tulman, Edan R; Kutish, Gerald F; Lu, Zhiqiang; Rock, Daniel L
2012-01-01
The sexually transmitted insect virus Helicoverpa zea nudivirus 2 (HzNV-2) was determined to have a circular double-stranded DNA genome of 231,621 bp coding for an estimated 113 open reading frames (ORFs). HzNV-2 is most closely related to the nudiviruses, a sister group of the insect baculoviruses. Several putative ORFs that share homology with the baculovirus core genes were identified in the viral genome. However, HzNV-2 lacks several key genetic features of baculoviruses including the late transcriptional regulation factor, LEF-1 and the palindromic hrs, which serve as origins of replication. The HzNV-2 genome was found to code for three ORFs that had significant sequence homology to cellular genes which are not generally found in viral genomes. These included a presumed juvenile hormone esterase gene, a gene coding for a putative zinc-dependent matrix metalloprotease, and a major facilitator superfamily protein gene; all of which are believed to play a role in the cellular proliferation and the tissue hypertrophy observed in the malformation of reproductive organs observed in HzNV-2 infected corn earworm moths, Helicoverpa zea.
Syed, Mustafa H; Karpinets, Tatiana V; Leuze, Michael R; Kora, Guruprasad H; Romine, Margaret R; Uberbacher, Edward C
2009-01-01
Shewanella oneidensis MR-1 is an important model organism for environmental research as it has an exceptional metabolic and respiratory versatility regulated by a complex regulatory network. We have developed a database to collect experimental and computational data relating to regulation of gene and protein expression, and, a visualization environment that enables integration of these data types. The regulatory information in the database includes predictions of DNA regulator binding sites, sigma factor binding sites, transcription units, operons, promoters, and RNA regulators including non-coding RNAs, riboswitches, and different types of terminators. Availability http://shewanella-knowledgebase.org:8080/Shewanella/gbrowserLanding.jsp PMID:20198195
Balfanz, Sabine; Strünker, Timo; Frings, Stephan; Baumann, Arnd
2005-04-01
In invertebrates, the biogenic-amine octopamine is an important physiological regulator. It controls and modulates neuronal development, circadian rhythm, locomotion, 'fight or flight' responses, as well as learning and memory. Octopamine mediates its effects by activation of different GTP-binding protein (G protein)-coupled receptor types, which induce either cAMP production or Ca(2+) release. Here we describe the functional characterization of two genes from Drosophila melanogaster that encode three octopamine receptors. The first gene (Dmoa1) codes for two polypeptides that are generated by alternative splicing. When heterologously expressed, both receptors cause oscillatory increases of the intracellular Ca(2+) concentration in response to applying nanomolar concentrations of octopamine. The second gene (Dmoa2) codes for a receptor that specifically activates adenylate cyclase and causes a rise of intracellular cAMP with an EC(50) of approximately 3 x 10(-8) m octopamine. Tyramine, the precursor of octopamine biosynthesis, activates all three receptors at > or = 100-fold higher concentrations, whereas dopamine and serotonin are non-effective. Developmental expression of Dmoa genes was assessed by RT-PCR. Overlapping but not identical expression patterns were observed for the individual transcripts. The genes characterized in this report encode unique receptors that display signature properties of native octopamine receptors.
Transcriptional profiling of murine osteoblast differentiation based on RNA-seq expression analyses.
Khayal, Layal Abo; Grünhagen, Johannes; Provazník, Ivo; Mundlos, Stefan; Kornak, Uwe; Robinson, Peter N; Ott, Claus-Eric
2018-04-11
Osteoblastic differentiation is a multistep process characterized by osteogenic induction of mesenchymal stem cells, which then differentiate into proliferative pre-osteoblasts that produce copious amounts of extracellular matrix, followed by stiffening of the extracellular matrix, and matrix mineralization by hydroxylapatite deposition. Although these processes have been well characterized biologically, a detailed transcriptional analysis of murine primary calvaria osteoblast differentiation based on RNA sequencing (RNA-seq) analyses has not previously been reported. Here, we used RNA-seq to obtain expression values of 29,148 genes at four time points as murine primary calvaria osteoblasts differentiate in vitro until onset of mineralization was clearly detectable by microscopic inspection. Expression of marker genes confirmed osteogenic differentiation. We explored differential expression of 1386 protein-coding genes using unsupervised clustering and GO analyses. 100 differentially expressed lncRNAs were investigated by co-expression with protein-coding genes that are localized within the same topologically associated domain. Additionally, we monitored expression of 237 genes that are silent or active at distinct time points and compared differential exon usage. Our data represent an in-depth profiling of murine primary calvaria osteoblast differentiation by RNA-seq and contribute to our understanding of genetic regulation of this key process in osteoblast biology. Copyright © 2018 Elsevier Inc. All rights reserved.
Solov'ev, V V; Kel', A E; Kolchanov, N A
1989-01-01
The factors, determining the presence of inverted and symmetrical repeats in genes coding for globular proteins, have been analysed. An interesting property of genetical code has been revealed in the analysis of symmetrical repeats: the pairs of symmetrical codons corresponded to pairs of amino acids with mostly similar physical-chemical parameters. This property may explain the presence of symmetrical repeats and palindromes only in genes coding for beta-structural proteins-polypeptides, where amino acids with similar physical-chemical properties occupy symmetrical positions. A stochastic model of evolution of polynucleotide sequences has been used for analysis of inverted repeats. The modelling demonstrated that only limiting of sequences (uneven frequencies of used codons) is enough for arising of nonrandom inverted repeats in genes.
Transcription factor GATA-1 regulates human HOXB2 gene expression in erythroid cells.
Vieille-Grosjean, I; Huber, P
1995-03-03
The human HOXB2 gene is a member of the vertebrate Hox gene family that contains genes coding for specific developmental stage DNA-binding proteins. Remarkably, within the hematopoietic compartment, genes of the HOXB complex are expressed specifically in erythromegakaryocytic cell lines and, for some of them, in hematopoietic progenitors. Here, we report the study of HOXB2 gene transcriptional regulation in hematopoietic cells, an initial step in understanding the lineage-specific expression of the whole HOXB complex in these cells. We have isolated the HOXB2 5'-flanking sequence and have characterized a promoter fragment extending 323 base pairs upstream from the transcriptional start site, which, in transfection experiments, was sufficient to direct the tissue-specific expression of HOXB2 in the erythroid cell line K562. In this fragment, we have identified a potential GATA-binding site that is essential to the promoter activity as demonstrated by point mutation experiments. Gel shift analysis revealed the formation of a specific complex in both erythroleukemic lines K562 and HEL that could be prevented by the addition of a specific antiserum raised against GATA-1 protein. These findings suggest a regulatory hierarchy in which GATA-1 is upstream of the HOXB2 gene in erythroid cells.
Trystuła, M; Żychowska, M; Wilk-Frańczuk, M; Kropotov, J D; Pąchalska, M
2017-02-16
The aim of this study was to evaluate dysregulation of gene expression associated with the cellular stress response in a patient with a post-"warning stroke" depressive disorder confirmed by the presence of a neurophysiological neuromarker through the use of quantitative EEG and event-related potentials. The patient was tested for seven genes associated with the stress reaction: HSPA1A, HSPB1, IL6, IL10, CRP, and HSF-1 along with NF-κB, compared to gene expression in health controls. A 54-year-old patient with a past history of schizophrenia (at the age of 20), and of transient ischemic attack (at the age of 53) and depressive disorder confirmed by functional, cognitive, emotional, and affectional diagnostics underwent additional testing for expression of the genes associated with stress response. The expression of genes coding for heat shock protein (HSPA1A, HSPB1), interleukins (IL6, IL10), and C-reactive protein was tested along with factors that regulate their expression. The results of the tests conducted on this patient were compared with 42 healthy control subjects. Diagnostic testing revealed upregulation in expression of these genes, presenting as increased expression of the target genes and of the regulatory genes. A post-"warning stroke" depressive disorder appears to be associated with overexpression of the genes coding for HSP and interleukins. Further research on larger groups of people may provide grounds for treatment modification.
Tsiagkas, Giannis; Nikolaou, Christoforos; Almirantis, Yannis
2014-12-01
CpG Islands (CGIs) are compositionally defined short genomic stretches, which have been studied in the human, mouse, chicken and later in several other genomes. Initially, they were assigned the role of transcriptional regulation of protein-coding genes, especially the house-keeping ones, while more recently there is found evidence that they are involved in several other functions as well, which might include regulation of the expression of RNA genes, DNA replication etc. Here, an investigation of their distributional characteristics in a variety of genomes is undertaken for both whole CGI populations as well as for CGI subsets that lie away from known genes (gene-unrelated or "orphan" CGIs). In both cases power-law-like linearity in double logarithmic scale is found. An evolutionary model, initially put forward for the explanation of a similar pattern found in gene populations is implemented. It includes segmental duplication events and eliminations of most of the duplicated CGIs, while a moderate rate of non-duplicated CGI eliminations is also applied in some cases. Simulations reproduce all the main features of the observed inter-CGI chromosomal size distributions. Our results on power-law-like linearity found in orphan CGI populations suggest that the observed distributional pattern is independent of the analogous pattern that protein coding segments were reported to follow. The power-law-like patterns in the genomic distributions of CGIs described herein are found to be compatible with several other features of the composition, abundance or functional role of CGIs reported in the current literature across several genomes, on the basis of the proposed evolutionary model. Copyright © 2014 Elsevier Ltd. All rights reserved.
Survival of Listeria monocytogenes in Soil Requires AgrA-Mediated Regulation
Vivant, Anne-Laure; Garmyn, Dominique; Gal, Laurent; Hartmann, Alain
2015-01-01
In a recent paper, we demonstrated that inactivation of the Agr system affects the patterns of survival of Listeria monocytogenes (A.-L. Vivant, D. Garmyn, L. Gal, and P. Piveteau, Front Cell Infect Microbiol 4:160, http://dx.doi.org/10.3389/fcimb.2014.00160). In this study, we investigated whether the Agr-mediated response is triggered during adaptation in soil, and we compared survival patterns in a set of 10 soils. The fate of the parental strain L. monocytogenes L9 (a rifampin-resistant mutant of L. monocytogenes EGD-e) and that of a ΔagrA deletion mutant were compared in a collection of 10 soil microcosms. The ΔagrA mutant displayed significantly reduced survival in these biotic soil microcosms, and differential transcriptome analyses showed large alterations of the transcriptome when AgrA was not functional, while the variations in the transcriptomes between the wild type and the ΔagrA deletion mutant were modest under abiotic conditions. Indeed, in biotic soil environments, 578 protein-coding genes and an extensive repertoire of noncoding RNAs (ncRNAs) were differentially transcribed. The transcription of genes coding for proteins involved in cell envelope and cellular processes, including the phosphotransferase system and ABC transporters, and proteins involved in resistance to antimicrobial peptides was affected. Under sterilized soil conditions, the differences were limited to 86 genes and 29 ncRNAs. These results suggest that the response regulator AgrA of the Agr communication system plays important roles during the saprophytic life of L. monocytogenes in soil. PMID:26002901
Survival of Listeria monocytogenes in Soil Requires AgrA-Mediated Regulation.
Vivant, Anne-Laure; Garmyn, Dominique; Gal, Laurent; Hartmann, Alain; Piveteau, Pascal
2015-08-01
In a recent paper, we demonstrated that inactivation of the Agr system affects the patterns of survival of Listeria monocytogenes (A.-L. Vivant, D. Garmyn, L. Gal, and P. Piveteau, Front Cell Infect Microbiol 4:160, http://dx.doi.org/10.3389/fcimb.2014.00160). In this study, we investigated whether the Agr-mediated response is triggered during adaptation in soil, and we compared survival patterns in a set of 10 soils. The fate of the parental strain L. monocytogenes L9 (a rifampin-resistant mutant of L. monocytogenes EGD-e) and that of a ΔagrA deletion mutant were compared in a collection of 10 soil microcosms. The ΔagrA mutant displayed significantly reduced survival in these biotic soil microcosms, and differential transcriptome analyses showed large alterations of the transcriptome when AgrA was not functional, while the variations in the transcriptomes between the wild type and the ΔagrA deletion mutant were modest under abiotic conditions. Indeed, in biotic soil environments, 578 protein-coding genes and an extensive repertoire of noncoding RNAs (ncRNAs) were differentially transcribed. The transcription of genes coding for proteins involved in cell envelope and cellular processes, including the phosphotransferase system and ABC transporters, and proteins involved in resistance to antimicrobial peptides was affected. Under sterilized soil conditions, the differences were limited to 86 genes and 29 ncRNAs. These results suggest that the response regulator AgrA of the Agr communication system plays important roles during the saprophytic life of L. monocytogenes in soil. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Using the NCBI Genome Databases to Compare the Genes for Human & Chimpanzee Beta Hemoglobin
ERIC Educational Resources Information Center
Offner, Susan
2010-01-01
The beta hemoglobin protein is identical in humans and chimpanzees. In this tutorial, students see that even though the proteins are identical, the genes that code for them are not. There are many more differences in the introns than in the exons, which indicates that coding regions of DNA are more highly conserved than non-coding regions.
Pandolfi, V; Jorge, E C; Melo, C M R; Albuquerque, A C S; Carrer, H
2010-07-06
The pathogenic fungus Fusarium graminearum is an ongoing threat to agriculture, causing losses in grain yield and quality in diverse crops. Substantial progress has been made in the identification of genes involved in the suppression of phytopathogens by antagonistic microorganisms; however, limited information regarding responses of plant pathogens to these biocontrol agents is available. Gene expression analysis was used to identify differentially expressed transcripts of the fungal plant pathogen F. graminearum under antagonistic effect of the bacterium Pantoea agglomerans. A macroarray was constructed, using 1014 transcripts from an F. graminearum cDNA library. Probes consisted of the cDNA of F. graminearum grown in the presence and in the absence of P. agglomerans. Twenty-nine genes were either up (19) or down (10) regulated during interaction with the antagonist bacterium. Genes encoding proteins associated with fungal defense and/or virulence or with nutritional and oxidative stress responses were induced. The repressed genes coded for a zinc finger protein associated with cell division, proteins containing cellular signaling domains, respiratory chain proteins, and chaperone-type proteins. These data give molecular and biochemical evidence of response of F. graminearum to an antagonist and could help develop effective biocontrol procedures for pathogenic plant fungi.
Milanesi, Luciano; Petrillo, Mauro; Sepe, Leandra; Boccia, Angelo; D'Agostino, Nunzio; Passamano, Myriam; Di Nardo, Salvatore; Tasco, Gianluca; Casadio, Rita; Paolella, Giovanni
2005-01-01
Background Protein kinases are a well defined family of proteins, characterized by the presence of a common kinase catalytic domain and playing a significant role in many important cellular processes, such as proliferation, maintenance of cell shape, apoptosys. In many members of the family, additional non-kinase domains contribute further specialization, resulting in subcellular localization, protein binding and regulation of activity, among others. About 500 genes encode members of the kinase family in the human genome, and although many of them represent well known genes, a larger number of genes code for proteins of more recent identification, or for unknown proteins identified as kinase only after computational studies. Results A systematic in silico study performed on the human genome, led to the identification of 5 genes, on chromosome 1, 11, 13, 15 and 16 respectively, and 1 pseudogene on chromosome X; some of these genes are reported as kinases from NCBI but are absent in other databases, such as KinBase. Comparative analysis of 483 gene regions and subsequent computational analysis, aimed at identifying unannotated exons, indicates that a large number of kinase may code for alternately spliced forms or be incorrectly annotated. An InterProScan automated analysis was perfomed to study domain distribution and combination in the various families. At the same time, other structural features were also added to the annotation process, including the putative presence of transmembrane alpha helices, and the cystein propensity to participate into a disulfide bridge. Conclusion The predicted human kinome was extended by identifiying both additional genes and potential splice variants, resulting in a varied panorama where functionality may be searched at the gene and protein level. Structural analysis of kinase proteins domains as defined in multiple sources together with transmembrane alpha helices and signal peptide prediction provides hints to function assignment. The results of the human kinome analysis are collected in the KinWeb database, available for browsing and searching over the internet, where all results from the comparative analysis and the gene structure annotation are made available, alongside the domain information. Kinases may be searched by domain combinations and the relative genes may be viewed in a graphic browser at various level of magnification up to gene organization on the full chromosome set. PMID:16351747
Niarchos, Athanasios; Siora, Anastasia; Konstantinou, Evangelia; Kalampoki, Vasiliki; Lagoumintzis, George; Poulas, Konstantinos
2017-01-01
During the last few decades, the recombinant protein expression finds more and more applications. The cloning of protein-coding genes into expression vectors is required to be directional for proper expression, and versatile in order to facilitate gene insertion in multiple different vectors for expression tests. In this study, the TA-GC cloning method is proposed, as a new, simple and efficient method for the directional cloning of protein-coding genes in expression vectors. The presented method features several advantages over existing methods, which tend to be relatively more labour intensive, inflexible or expensive. The proposed method relies on the complementarity between single A- and G-overhangs of the protein-coding gene, obtained after a short incubation with T4 DNA polymerase, and T and C overhangs of the novel vector pET-BccI, created after digestion with the restriction endonuclease BccI. The novel protein-expression vector pET-BccI also facilitates the screening of transformed colonies for recombinant transformants. Evaluation experiments of the proposed TA-GC cloning method showed that 81% of the transformed colonies contained recombinant pET-BccI plasmids, and 98% of the recombinant colonies expressed the desired protein. This demonstrates that TA-GC cloning could be a valuable method for cloning protein-coding genes in expression vectors.
Niarchos, Athanasios; Siora, Anastasia; Konstantinou, Evangelia; Kalampoki, Vasiliki; Poulas, Konstantinos
2017-01-01
During the last few decades, the recombinant protein expression finds more and more applications. The cloning of protein-coding genes into expression vectors is required to be directional for proper expression, and versatile in order to facilitate gene insertion in multiple different vectors for expression tests. In this study, the TA-GC cloning method is proposed, as a new, simple and efficient method for the directional cloning of protein-coding genes in expression vectors. The presented method features several advantages over existing methods, which tend to be relatively more labour intensive, inflexible or expensive. The proposed method relies on the complementarity between single A- and G-overhangs of the protein-coding gene, obtained after a short incubation with T4 DNA polymerase, and T and C overhangs of the novel vector pET-BccI, created after digestion with the restriction endonuclease BccI. The novel protein-expression vector pET-BccI also facilitates the screening of transformed colonies for recombinant transformants. Evaluation experiments of the proposed TA-GC cloning method showed that 81% of the transformed colonies contained recombinant pET-BccI plasmids, and 98% of the recombinant colonies expressed the desired protein. This demonstrates that TA-GC cloning could be a valuable method for cloning protein-coding genes in expression vectors. PMID:29091919
Romero, Roberto; Tarca, Adi L; Chaemsaithong, Piya; Miranda, Jezid; Chaiworapongsa, Tinnakorn; Jia, Hui; Hassan, Sonia S; Kalita, Cynthia A; Cai, Juan; Yeo, Lami; Lipovich, Leonard
2014-09-01
To identify differentially expressed long non-coding RNA (lncRNA) genes in human myometrium in women with spontaneous labor at term. Myometrium was obtained from women undergoing cesarean deliveries who were not in labor (n = 19) and women in spontaneous labor at term (n = 20). RNA was extracted and profiled using an Illumina® microarray platform. We have used computational approaches to bound the extent of long non-coding RNA representation on this platform, and to identify co-differentially expressed and correlated pairs of long non-coding RNA genes and protein-coding genes sharing the same genomic loci. We identified co-differential expression and correlation at two genomic loci that contain coding-lncRNA gene pairs: SOCS2-AK054607 and LMCD1-NR_024065 in women in spontaneous labor at term. This co-differential expression and correlation was validated by qRT-PCR, an experimental method completely independent of the microarray analysis. Intriguingly, one of the two lncRNA genes differentially expressed in term labor had a key genomic structure element, a splice site, that lacked evolutionary conservation beyond primates. We provide, for the first time, evidence for coordinated differential expression and correlation of cis-encoded antisense lncRNAs and protein-coding genes with known as well as novel roles in pregnancy in the myometrium of women in spontaneous labor at term.
Laitz, Alessandra Vasconcellos Nunes; Acencio, Marcio Luis; Budzinski, Ilara G F; Labate, Mônica T V; Lemke, Ney; Ribolla, Paulo Eduardo Martins; Maia, Ivan G
2015-01-01
Mitochondrial inner membrane uncoupling proteins (UCP) dissipate the proton electrochemical gradient established by the respiratory chain, thus affecting the yield of ATP synthesis. UCP overexpression in plants has been correlated with oxidative stress tolerance, improved photosynthetic efficiency and increased mitochondrial biogenesis. This study reports the main transcriptomic responses associated with the overexpression of an UCP (AtUCP1) in tobacco seedlings. Compared to wild-type (WT), AtUCP1 transgenic seedlings showed unaltered ATP levels and higher accumulation of serine. By using RNA-sequencing, a total of 816 differentially expressed genes between the investigated overexpressor lines and the untransformed WT control were identified. Among them, 239 were up-regulated and 577 were down-regulated. As a general response to AtUCP1 overexpression, noticeable changes in the expression of genes involved in energy metabolism and redox homeostasis were detected. A substantial set of differentially expressed genes code for products targeted to the chloroplast and mainly involved in photosynthesis. The overall results demonstrate that the alterations in mitochondrial function provoked by AtUCP1 overexpression require important transcriptomic adjustments to maintain cell homeostasis. Moreover, the occurrence of an important cross-talk between chloroplast and mitochondria, which culminates in the transcriptional regulation of several genes involved in different pathways, was evidenced.
Silva, Andrea X.; Jander, Georg; Samaniego, Horacio; Ramsey, John S; Figueroa, Christian C.
2012-01-01
Background Insecticide resistance is one of the best examples of rapid micro-evolution found in nature. Since the development of the first synthetic insecticide in 1939, humans have invested considerable effort to stay ahead of resistance phenotypes that repeatedly develop in insects. Aphids are a group of insects that have become global pests in agriculture and frequently exhibit insecticide resistance. The green peach aphid, Myzus persicae, has developed resistance to at least seventy different synthetic compounds, and different insecticide resistance mechanisms have been reported worldwide. Methodology/Principal Findings To further characterize this resistance, we analyzed genome-wide transcriptional responses in three genotypes of M. persicae, each exhibiting different resistance mechanisms, in response to an anti-cholinesterase insecticide. The sensitive genotype (exhibiting no resistance mechanism) responded to the insecticide by up-regulating 183 genes primarily ones related to energy metabolism, detoxifying enzymes, proteins of extracellular transport, peptidases and cuticular proteins. The second genotype (resistant through a kdr sodium channel mutation), up-regulated 17 genes coding for detoxifying enzymes, peptidase and cuticular proteins. Finally, a multiply resistant genotype (carrying kdr and a modified acetylcholinesterase), up-regulated only 7 genes, appears not to require induced insecticide detoxification, and instead down-regulated many genes. Conclusions/Significance This study suggests strongly that insecticide resistance in M. persicae is more complex that has been described, with the participation of a broad array of resistance mechanisms. The sensitive genotype exhibited the highest transcriptional plasticity, accounting for the wide range of potential adaptations to insecticides that this species can evolve. In contrast, the multiply resistant genotype exhibited a low transcriptional plasticity, even for the expression of genes encoding enzymes involved in insecticide detoxification. Our results emphasize the value of microarray studies to search for regulated genes in insects, but also highlights the many ways those different genotypes can assemble resistant phenotypes depending on the environmental pressure. PMID:22685538
Gene networks in the synthesis and deposition of protein polymers during grain development of wheat.
She, Maoyun; Ye, Xingguo; Yan, Yueming; Howit, C; Belgard, M; Ma, Wujun
2011-03-01
As the amino acid storing organelle, the protein bodies provide nutrients for embryo development, seed germination and early seedling growth through storage proteolysis in cereal plants, such as wheat and rice. In protein bodies, the monomeric and polymeric prolamins, i.e. gliadins and glutenins, form gluten and play a key role in determining dough functionality and end-product quality of wheat. The formation of intra- and intermolecular bonds, including disulphide and tyrosine bonds, in and between prolamins confers cohesivity, viscosity, elasticity and extensibility to wheat dough during mixing and processing. In this review, we summarize recent progress in wheat gluten research with a focus on the fundamental molecular biological aspects, including transcriptional regulation on genes coding for prolamin components, biosynthesis, deposition and secretion of protein polymers, formation of protein bodies, genetic control of seed storage proteins, the transportation of the protein bodies and key enzymes for determining the formation of disulphide bonds of prolamin polymers.
Peng, Rui; Zeng, Bo; Meng, Xiuxiang; Yue, Bisong; Zhang, Zhihe; Zou, Fangdong
2007-08-01
The complete mitochondrial genome sequence of the giant panda, Ailuropoda melanoleuca, was determined by the long and accurate polymerase chain reaction (LA-PCR) with conserved primers and primer walking sequence methods. The complete mitochondrial DNA is 16,805 nucleotides in length and contains two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one control region. The total length of the 13 protein-coding genes is longer than the American black bear, brown bear and polar bear by 3 amino acids at the end of ND5 gene. The codon usage also followed the typical vertebrate pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 5 (ND5) gene. The molecular phylogenetic analysis was performed on the sequences of 12 concatenated heavy-strand encoded protein-coding genes, and suggested that the giant panda is most closely related to bears.
Nedelcu, Aurora M.; Lee, Robert W.; Lemieux, Claude; Gray, Michael W.; Burger, Gertraud
2000-01-01
Two distinct mitochondrial genome types have been described among the green algal lineages investigated to date: a reduced–derived, Chlamydomonas-like type and an ancestral, Prototheca-like type. To determine if this unexpected dichotomy is real or is due to insufficient or biased sampling and to define trends in the evolution of the green algal mitochondrial genome, we sequenced and analyzed the mitochondrial DNA (mtDNA) of Scenedesmus obliquus. This genome is 42,919 bp in size and encodes 42 conserved genes (i.e., large and small subunit rRNA genes, 27 tRNA and 13 respiratory protein-coding genes), four additional free-standing open reading frames with no known homologs, and an intronic reading frame with endonuclease/maturase similarity. No 5S rRNA or ribosomal protein-coding genes have been identified in Scenedesmus mtDNA. The standard protein-coding genes feature a deviant genetic code characterized by the use of UAG (normally a stop codon) to specify leucine, and the unprecedented use of UCA (normally a serine codon) as a signal for termination of translation. The mitochondrial genome of Scenedesmus combines features of both green algal mitochondrial genome types: the presence of a more complex set of protein-coding and tRNA genes is shared with the ancestral type, whereas the lack of 5S rRNA and ribosomal protein-coding genes as well as the presence of fragmented and scrambled rRNA genes are shared with the reduced–derived type of mitochondrial genome organization. Furthermore, the gene content and the fragmentation pattern of the rRNA genes suggest that this genome represents an intermediate stage in the evolutionary process of mitochondrial genome streamlining in green algae. [The sequence data described in this paper have been submitted to the GenBank data library under accession no. AF204057.] PMID:10854413
Kirschner, Doris B; vom Baur, Elmar; Thibault, Christelle; Sanders, Steven L; Gangloff, Yann-Gaël; Davidson, Irwin; Weil, P Anthony; Tora, Làszlò
2002-05-01
The RNA polymerase II transcription factor TFIID, composed of the TATA-binding protein (TBP) and TBP-associated factors (TAF(II)s), nucleates preinitiation complex formation at protein-coding gene promoters. SAGA, a second TAF(II)-containing multiprotein complex, is involved in transcription regulation in Saccharomyces cerevisiae. One of the essential protein components common to SAGA and TFIID is yTAF(II)25. We define a minimal evolutionarily conserved 91-amino-acid region of TAF(II)25 containing a histone fold domain that is necessary and sufficient for growth in vivo. Different temperature-sensitive mutations of yTAF(II)25 or chimeras with the human homologue TAF(II)30 arrested cell growth at either the G(1) or G(2)/M cell cycle phase and displayed distinct phenotypic changes and gene expression patterns. Immunoprecipitation studies revealed that TAF(II)25 mutation-dependent gene expression and phenotypic changes correlated at least partially with the integrity of SAGA and TFIID. Genome-wide expression analysis revealed that the five TAF(II)25 temperature-sensitive mutant alleles individually affect the expression of between 18 and 33% of genes, whereas taken together they affect 64% of all class II genes. Thus, different yTAF(II)25 mutations induce distinct phenotypes and affect the regulation of different subsets of genes, demonstrating that no individual TAF(II) mutant allele reflects the full range of its normal functions.
Regulation of Global Transcription in Escherichia coli by Rsd and 6S RNA
Lal, Avantika; Krishna, Sandeep; Seshasayee, Aswin Sai Narain
2018-01-01
In Escherichia coli, the sigma factor σ70 directs RNA polymerase to transcribe growth-related genes, while σ38 directs transcription of stress response genes during stationary phase. Two molecules hypothesized to regulate RNA polymerase are the protein Rsd, which binds to σ70, and the non-coding 6S RNA which binds to the RNA polymerase-σ70 holoenzyme. Despite multiple studies, the functions of Rsd and 6S RNA remain controversial. Here we use RNA-Seq in five phases of growth to elucidate their function on a genome-wide scale. We show that Rsd and 6S RNA facilitate σ38 activity throughout bacterial growth, while 6S RNA also regulates widely different genes depending upon growth phase. We discover novel interactions between 6S RNA and Rsd and show widespread expression changes in a strain lacking both regulators. Finally, we present a mathematical model of transcription which highlights the crosstalk between Rsd and 6S RNA as a crucial factor in controlling sigma factor competition and global gene expression. PMID:29686109
Regulation of Global Transcription in Escherichia coli by Rsd and 6S RNA.
Lal, Avantika; Krishna, Sandeep; Seshasayee, Aswin Sai Narain
2018-05-31
In Escherichia coli , the sigma factor σ 70 directs RNA polymerase to transcribe growth-related genes, while σ 38 directs transcription of stress response genes during stationary phase. Two molecules hypothesized to regulate RNA polymerase are the protein Rsd, which binds to σ 70 , and the non-coding 6S RNA which binds to the RNA polymerase-σ 70 holoenzyme. Despite multiple studies, the functions of Rsd and 6S RNA remain controversial. Here we use RNA-Seq in five phases of growth to elucidate their function on a genome-wide scale. We show that Rsd and 6S RNA facilitate σ 38 activity throughout bacterial growth, while 6S RNA also regulates widely different genes depending upon growth phase. We discover novel interactions between 6S RNA and Rsd and show widespread expression changes in a strain lacking both regulators. Finally, we present a mathematical model of transcription which highlights the crosstalk between Rsd and 6S RNA as a crucial factor in controlling sigma factor competition and global gene expression. Copyright © 2018 Lal et al.
DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation
Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob
2014-01-01
As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252
MERP1: a mammalian ependymin-related protein gene differentially expressed in hematopoietic cells.
Gregorio-King, Claudia C; McLeod, Janet L; Collier, Fiona McL; Collier, Gregory R; Bolton, Karyn A; Van Der Meer, Gavin J; Apostolopoulos, Jim; Kirkland, Mark A
2002-03-20
We have utilized differential display polymerase chain reaction to investigate the gene expression of hematopoietic progenitor cells from adult bone marrow and umbilical cord blood. A differentially expressed gene was identified in CD34+ hematopoietic progenitor cells, with low expression in CD34- cells. We have obtained the full coding sequence of this gene which we designated human mammalian ependymin-related protein 1 (MERP1). Expression of MERP1 was found in a variety of normal human tissues, and is 4- and 10-fold higher in adult bone marrow and umbilical cord blood CD34+ cells, respectively, compared to CD34- cells. Additionally, MERP1 expression in a hematopoietic stem cell enriched population was down-regulated with proliferation and differentiation. Conceptual translation of the MERP1 open reading frame reveals significant homology to two families of glycoprotein calcium-dependant cell adhesion molecules: ependymins and protocadherins.
Benítez-Burraco, A
FOXP2 is the first gene linked to a hereditary variant of specific language impairment and seems to code for a transcriptional repressor that intervenes in the regulation of the development and the functioning of certain thalamic-cortical-striatal circuits. In the last three years, significant progress has been made in the determination of the structural and functional properties of the gene. These advances essentially have to do with the precise analysis of the most important structural motifs of the protein that it codes for and the main parameters that determine its interaction with DNA. They also concern the determination of the functional and behavioural properties in vivo of the main isoforms of the FOXP2 protein, the exact determination of the pattern of expression of new orthologues of the gene, and the identification of the different target genes for factor FOXP2. This new evidence suggests that protein FOXP2 protein has a high degree of versatility in vivo when it comes to binding to DNA; that its different isoforms are biologically functional; and that the FOXP2 gene is functional during embryonic development and during the adult phase. It also suggests that it is involved in the development and/or functioning of the thalamic-cortical-striatal circuits associated to motor planning, sequential behaviour and procedural learning (a significant saving in developmental terms of the regulatory mechanism in which the gene is involved), as well as the accuracy of the models of linguistic processing that consider language to be, to a large extent, the result of an interaction between certain cortical and subcortical structures.
Görner, Wolfram; Durchschlag, Erich; Martinez-Pastor, Maria Teresa; Estruch, Francisco; Ammerer, Gustav; Hamilton, Barbara; Ruis, Helmut; Schüller, Christoph
1998-01-01
Msn2p and the partially redundant factor Msn4p are key regulators of stress-responsive gene expression in Saccharomyces cerevisiae. They are required for the transcription of a number of genes coding for proteins with stress-protective functions. Both Msn2p and Msn4p are Cys2His2 zinc finger proteins and bind to the stress response element (STRE). In vivo footprinting studies show that the occupation of STREs is enhanced in stressed cells and dependent on the presence of Msn2p and Msn4p. Both factors accumulate in the nucleus under stress conditions, such as heat shock, osmotic stress, carbon-source starvation, and in the presence of ethanol or sorbate. Stress-induced nuclear localization was found to be rapid, reversible, and independent of protein synthesis. Nuclear localization of Msn2p and Msn4p was shown to be correlated inversely to cAMP levels and protein kinase A (PKA) activity. A region with significant homologies shared between Msn2p and Msn4p is sufficient to confer stress-regulated localization to a SV40–NLS–GFP fusion protein. Serine to alanine or aspartate substitutions in a conserved PKA consensus site abolished cAMP-driven nuclear export and cytoplasmic localization in unstressed cells. We propose stress and cAMP-regulated intracellular localization of Msn2p to be a key step in STRE-dependent transcription and in the general stress response. PMID:9472026
Structure and regulation of KGD1, the structural gene for yeast alpha-ketoglutarate dehydrogenase.
Repetto, B; Tzagoloff, A
1989-06-01
Nuclear respiratory-defective mutants of Saccharomyces cerevisiae have been screened for lesions in the mitochondrial alpha-ketoglutarate dehydrogenase complex. Strains assigned to complementation group G70 were ascertained to be deficient in enzyme activity due to mutations in the KGD1 gene coding for the alpha-ketoglutarate dehydrogenase component of the complex. The KGD1 gene has been cloned by transformation of a representative kgd1 mutant, C225/U1, with a recombinant plasmid library of wild-type yeast nuclear DNA. Transformants containing the gene on a multicopy plasmid had three- to four-times-higher alpha-ketoglutarate dehydrogenase activity than did wild-type S. cerevisiae. Substitution of the chromosomal copy of KGD1 with a disrupted allele (kgd1::URA3) induced a deficiency in alpha-ketoglutarate dehydrogenase. The sequence of the cloned region of DNA which complements kgd1 mutants was found to have an open reading frame of 3,042 nucleotides capable of coding for a protein of Mw 114,470. The encoded protein had 38% identical residues with the reported sequence of alpha-ketoglutarate dehydrogenase from Escherichia coli. Two lines of evidence indicated that transcription of KGD1 is catabolite repressed. Higher steady-state levels of KGD1 mRNA were detected in wild-type yeast grown on the nonrepressible sugar galactose than in yeast grown on high glucose. Regulation of KGD1 was also studied by fusing different 5'-flanking regions of KGD1 to the lacZ gene of E. coli and measuring the expression of beta-galactosidase in yeast. Transformants harboring a fusion of 693 nucleotides of the 5'-flanking sequence expressed 10 times more beta-galactosidase activity when grown under derepressed conditions. The response to the carbon source was reduced dramatically when the same lacZ fusion was present in a hap2 or hap3 mutant. The promoter element(s) responsible for the regulated expression of KGD1 has been mapped to the -354 to -143 region. This region contained several putative activation sites with sequences matching the core element proposed to be essential for binding of the HAP2 and HAP3 regulatory proteins.
Bagley, Joshua A; Yan, Zhiqiang; Zhang, Wei; Wildonger, Jill; Jan, Lily Yeh; Jan, Yuh Nung
2014-09-01
A complex array of genetic factors regulates neuronal dendrite morphology. Epigenetic regulation of gene expression represents a plausible mechanism to control pathways responsible for specific dendritic arbor shapes. By studying the Drosophila dendritic arborization (da) neurons, we discovered a role of the double-bromodomain and extraterminal (BET) family proteins in regulating dendrite arbor complexity. A loss-of-function mutation in the single Drosophila BET protein encoded by female sterile 1 homeotic [fs(1)h] causes loss of fine, terminal dendritic branches. Moreover, fs(1)h is necessary for the induction of branching caused by a previously identified transcription factor, Cut (Ct), which regulates subtype-specific dendrite morphology. Finally, disrupting fs(1)h function impairs the mechanosensory response of class III da sensory neurons without compromising the expression of the ion channel NompC, which mediates the mechanosensitive response. Thus, our results identify a novel role for BET family proteins in regulating dendrite morphology and a possible separation of developmental pathways specifying neural cell morphology and ion channel expression. Since the BET proteins are known to bind acetylated histone tails, these results also suggest a role of epigenetic histone modifications and the "histone code," in regulating dendrite morphology. © 2014 Bagley et al.; Published by Cold Spring Harbor Laboratory Press.
Functional Genomic Analysis of the let-7 Regulatory Network in Caenorhabditis elegans
Zisoulis, Dimitrios G.; Lovci, Michael T.; Melnik-Martinez, Katya V.; Yeo, Gene W.; Pasquinelli, Amy E.
2013-01-01
The let-7 microRNA (miRNA) regulates cellular differentiation across many animal species. Loss of let-7 activity causes abnormal development in Caenorhabditis elegans and unchecked cellular proliferation in human cells, which contributes to tumorigenesis. These defects are due to improper expression of protein-coding genes normally under let-7 regulation. While some direct targets of let-7 have been identified, the genome-wide effect of let-7 insufficiency in a developing animal has not been fully investigated. Here we report the results of molecular and genetic assays aimed at determining the global network of genes regulated by let-7 in C. elegans. By screening for mis-regulated genes that also contribute to let-7 mutant phenotypes, we derived a list of physiologically relevant potential targets of let-7 regulation. Twenty new suppressors of the rupturing vulva or extra seam cell division phenotypes characteristic of let-7 mutants emerged. Three of these genes, opt-2, prmt-1, and T27D12.1, were found to associate with Argonaute in a let-7–dependent manner and are likely novel direct targets of this miRNA. Overall, a complex network of genes with various activities is subject to let-7 regulation to coordinate developmental timing across tissues during worm development. PMID:23516374
In vivo expression and purification of aptamer-tagged small RNA regulators
Said, Nelly; Rieder, Renate; Hurwitz, Robert; Deckert, Jochen; Urlaub, Henning; Vogel, Jörg
2009-01-01
Small non-coding RNAs (sRNAs) are an emerging class of post-transcriptional regulators of bacterial gene expression. To study sRNAs and their potential protein interaction partners, it is desirable to purify sRNAs from cells in their native form. Here, we used RNA-based affinity chromatography to purify sRNAs following their expression as aptamer-tagged variants in vivo. To this end, we developed a family of plasmids to express sRNAs with any of three widely used aptamer sequences (MS2, boxB, eIF4A), and systematically tested how the aptamer tagging impacted on intracellular accumulation and target regulation of the Salmonella GcvB, InvR or RybB sRNAs. In addition, we successfully tagged the chromosomal rybB gene with MS2 to observe that RybB-MS2 is fully functional as an envelope stress-induced repressor of ompN mRNA following induction of sigmaE. We further demonstrate that the common sRNA-binding protein, Hfq, co-purifies with MS2-tagged sRNAs of Salmonella. The presented affinity purification strategy may facilitate the isolation of in vivo assembled sRNA–protein complexes in a wide range of bacteria. PMID:19726584
Raju, Hemalatha B.; Tsinoremas, Nicholas F.; Capobianco, Enrico
2016-01-01
Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain (NP) data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve (SN) injury and studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes and repurposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein-coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parental genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to NP. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN and 8 in DRG), antisense RNA (31 asRNA in SN and 12 in DRG), and pseudogenes (456 in SN and 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly identified in protein–protein interaction networks, other connectivity paths were identified between proteins already investigated in studies on disorders, such as Parkinson, Down syndrome, Huntington disease, and Alzheimer. Our findings suggest the importance of reusing gene expression data by meta-analysis approaches. PMID:27803687
Biomimetic Artificial Epigenetic Code for Targeted Acetylation of Histones.
Taniguchi, Junichi; Feng, Yihong; Pandian, Ganesh N; Hashiya, Fumitaka; Hidaka, Takuya; Hashiya, Kaori; Park, Soyoung; Bando, Toshikazu; Ito, Shinji; Sugiyama, Hiroshi
2018-06-13
While the central role of locus-specific acetylation of histone proteins in eukaryotic gene expression is well established, the availability of designer tools to regulate acetylation at particular nucleosome sites remains limited. Here, we develop a unique strategy to introduce acetylation by constructing a bifunctional molecule designated Bi-PIP. Bi-PIP has a P300/CBP-selective bromodomain inhibitor (Bi) as a P300/CBP recruiter and a pyrrole-imidazole polyamide (PIP) as a sequence-selective DNA binder. Biochemical assays verified that Bi-PIPs recruit P300 to the nucleosomes having their target DNA sequences and extensively accelerate acetylation. Bi-PIPs also activated transcription of genes that have corresponding cognate DNA sequences inside living cells. Our results demonstrate that Bi-PIPs could act as a synthetic programmable histone code of acetylation, which emulates the bromodomain-mediated natural propagation system of histone acetylation to activate gene expression in a sequence-selective manner.
2011-01-01
Background The green crab Carcinus maenas is known for its high acclimation potential to varying environmental abiotic conditions. A high ability for ion and acid-base regulation is mainly based on an efficient regulation apparatus located in gill epithelia. However, at present it is neither known which ion transport proteins play a key role in the acid-base compensation response nor how gill epithelia respond to elevated seawater pCO2 as predicted for the future. In order to promote our understanding of the responses of green crab acid-base regulatory epithelia to high pCO2, Baltic Sea green crabs were exposed to a pCO2 of 400 Pa. Gills were screened for differentially expressed gene transcripts using a 4,462-feature microarray and quantitative real-time PCR. Results Crabs responded mainly through fine scale adjustment of gene expression to elevated pCO2. However, 2% of all investigated transcripts were significantly regulated 1.3 to 2.2-fold upon one-week exposure to CO2 stress. Most of the genes known to code for proteins involved in osmo- and acid-base regulation, as well as cellular stress response, were were not impacted by elevated pCO2. However, after one week of exposure, significant changes were detected in a calcium-activated chloride channel, a hyperpolarization activated nucleotide-gated potassium channel, a tetraspanin, and an integrin. Furthermore, a putative syntaxin-binding protein, a protein of the transmembrane 9 superfamily, and a Cl-/HCO3- exchanger of the SLC 4 family were differentially regulated. These genes were also affected in a previously published hypoosmotic acclimation response study. Conclusions The moderate, but specific response of C. maenas gill gene expression indicates that (1) seawater acidification does not act as a strong stressor on the cellular level in gill epithelia; (2) the response to hypercapnia is to some degree comparable to a hypoosmotic acclimation response; (3) the specialization of each of the posterior gill arches might go beyond what has been demonstrated up to date; and (4) a re-configuration of gill epithelia might occur in response to hypercapnia. PMID:21978240
Allen, S P; Polazzi, J O; Gierse, J K; Easton, A M
1992-01-01
In Escherichia coli high-level production of some heterologous proteins (specifically, human prorenin, renin, and bovine insulin-like growth factor 2) resulted in the induction of two new E. coli heat shock proteins, both of which have molecular masses of 16 kDa and are tightly associated with inclusion bodies formed during heterologous protein production. We named these inclusion body-associated proteins IbpA and IbpB. The coding sequences for IbpA and IbpB were identified and isolated from the Kohara E. coli gene bank. The genes for these proteins (ibpA and ibpB) are located at 82.5 min on the chromosome. Nucleotide sequencing of the two genes revealed that they are transcribed in the same direction and are separated by 110 bp. Putative Shine-Dalgarno sequences are located upstream from the initiation codons of both genes. A putative heat shock promoter is located upstream from ibpA, and a putative transcription terminator is located downstream from ibpB. A temperature upshift experiment in which we used a wild-type E. coli strain and an isogenic rpoH mutant strain indicated that a sigma 32-containing RNA polymerase is involved in the regulation of expression of these genes. There is 57.5% identity between the genes at the nucleotide level and 52.2% identity at the amino acid level. A search of the protein data bases showed that both of these 16-kDa proteins exhibit low levels of homology to low-molecular-weight heat shock proteins from eukaryotic species. Images PMID:1356969
Rossello, Jessica; Lima, Analía; Gil, Magdalena; Rodríguez Duarte, Jorge; Correa, Agustín; Carvalho, Paulo C; Kierbel, Arlinet; Durán, Rosario
2017-08-31
The second messenger c-di-GMP regulates the switch between motile and sessile bacterial lifestyles. A general feature of c-di-GMP metabolism is the presence of a surprisingly large number of genes coding for diguanylate cyclases and phosphodiesterases, the enzymes responsible for its synthesis and degradation respectively. However, the physiological relevance of this apparent redundancy is not clear, emphasizing the need for investigating the functions of each of these enzymes. Here we focused on the phosphodiesterase PA2133 from Pseudomonas aeruginosa, an important opportunistic pathogen. We phenotypically characterized P. aeruginosa strain K overexpressing PA2133 or its inactive mutant. We showed that biofilm formation and motility are severely impaired by overexpression of PA2133. Our quantitative proteomic approach applied to the membrane and exoprotein fractions revealed that proteins involved in three processes were mostly affected: flagellar motility, type III secretion system and chemotaxis. While inhibition of biofilm formation can be ascribed to the phosphodiesterase activity of PA2133, down-regulation of flagellar, chemotaxis, and type III secretion system proteins is independent of this enzymatic activity. Based on these unexpected effects of PA2133, we propose to rename this gene product FcsR, for Flagellar, chemotaxis and type III secretion system Regulator.
Prasad, Pushplata; Varshney, Deepti; Adholeya, Alok
2015-11-25
The fungus Purpureocillium lilacinum is widely known as a biological control agent against plant parasitic nematodes. This research article consists of genomic annotation of the first draft of whole genome sequence of P. lilacinum. The study aims to decipher the putative genetic components of the fungus involved in nematode pathogenesis by performing comparative genomic analysis with nine closely related fungal species in Hypocreales. de novo genomic assembly was done and a total of 301 scaffolds were constructed for P. lilacinum genomic DNA. By employing structural genome prediction models, 13, 266 genes coding for proteins were predicted in the genome. Approximately 73% of the predicted genes were functionally annotated using Blastp, InterProScan and Gene Ontology. A 14.7% fraction of the predicted genes shared significant homology with genes in the Pathogen Host Interactions (PHI) database. The phylogenomic analysis carried out using maximum likelihood RAxML algorithm provided insight into the evolutionary relationship of P. lilacinum. In congruence with other closely related species in the Hypocreales namely, Metarhizium spp., Pochonia chlamydosporia, Cordyceps militaris, Trichoderma reesei and Fusarium spp., P. lilacinum has large gene sets coding for G-protein coupled receptors (GPCRs), proteases, glycoside hydrolases and carbohydrate esterases that are required for degradation of nematode-egg shell components. Screening of the genome by Antibiotics & Secondary Metabolite Analysis Shell (AntiSMASH) pipeline indicated that the genome potentially codes for a variety of secondary metabolites, possibly required for adaptation to heterogeneous lifestyles reported for P. lilacinum. Significant up-regulation of subtilisin-like serine protease genes in presence of nematode eggs in quantitative real-time analyses suggested potential role of serine proteases in nematode pathogenesis. The data offer a better understanding of Purpureocillium lilacinum genome and will enhance our understanding on the molecular mechanism involved in nematophagy.
Cross-species inference of long non-coding RNAs greatly expands the ruminant transcriptome.
Bush, Stephen J; Muriuki, Charity; McCulloch, Mary E B; Farquhar, Iseabail L; Clark, Emily L; Hume, David A
2018-04-24
mRNA-like long non-coding RNAs (lncRNAs) are a significant component of mammalian transcriptomes, although most are expressed only at low levels, with high tissue-specificity and/or at specific developmental stages. Thus, in many cases lncRNA detection by RNA-sequencing (RNA-seq) is compromised by stochastic sampling. To account for this and create a catalogue of ruminant lncRNAs, we compared de novo assembled lncRNAs derived from large RNA-seq datasets in transcriptional atlas projects for sheep and goats with previous lncRNAs assembled in cattle and human. We then combined the novel lncRNAs with the sheep transcriptional atlas to identify co-regulated sets of protein-coding and non-coding loci. Few lncRNAs could be reproducibly assembled from a single dataset, even with deep sequencing of the same tissues from multiple animals. Furthermore, there was little sequence overlap between lncRNAs that were assembled from pooled RNA-seq data. We combined positional conservation (synteny) with cross-species mapping of candidate lncRNAs to identify a consensus set of ruminant lncRNAs and then used the RNA-seq data to demonstrate detectable and reproducible expression in each species. In sheep, 20 to 30% of lncRNAs were located close to protein-coding genes with which they are strongly co-expressed, which is consistent with the evolutionary origin of some ncRNAs in enhancer sequences. Nevertheless, most of the lncRNAs are not co-expressed with neighbouring protein-coding genes. Alongside substantially expanding the ruminant lncRNA repertoire, the outcomes of our analysis demonstrate that stochastic sampling can be partly overcome by combining RNA-seq datasets from related species. This has practical implications for the future discovery of lncRNAs in other species.
Dominguez, Daniel; Tsai, Yi-Hsuan; Gomez, Nicholas; Jha, Deepak Kumar; Davis, Ian; Wang, Zefeng
2016-01-01
Progression through the cell cycle is largely dependent on waves of periodic gene expression, and the regulatory networks for these transcriptome dynamics have emerged as critical points of vulnerability in various aspects of tumor biology. Through RNA-sequencing of human cells during two continuous cell cycles (>2.3 billion paired reads), we identified over 1 000 mRNAs, non-coding RNAs and pseudogenes with periodic expression. Periodic transcripts are enriched in functions related to DNA metabolism, mitosis, and DNA damage response, indicating these genes likely represent putative cell cycle regulators. Using our set of periodic genes, we developed a new approach termed “mitotic trait” that can classify primary tumors and normal tissues by their transcriptome similarity to different cell cycle stages. By analyzing >4 000 tumor samples in The Cancer Genome Atlas (TCGA) and other expression data sets, we found that mitotic trait significantly correlates with genetic alterations, tumor subtype and, notably, patient survival. We further defined a core set of 67 genes with robust periodic expression in multiple cell types. Proteins encoded by these genes function as major hubs of protein-protein interaction and are mostly required for cell cycle progression. The core genes also have unique chromatin features including increased levels of CTCF/RAD21 binding and H3K36me3. Loss of these features in uterine and kidney cancers is associated with altered expression of the core 67 genes. Our study suggests new chromatin-associated mechanisms for periodic gene regulation and offers a predictor of cancer patient outcomes. PMID:27364684
Delcourt, Vivian; Lucier, Jean-François; Gagnon, Jules; Beaudoin, Maxime C; Vanderperre, Benoît; Breton, Marc-André; Motard, Julie; Jacques, Jean-François; Brunelle, Mylène; Gagnon-Arsenault, Isabelle; Fournier, Isabelle; Ouangraoua, Aida; Hunting, Darel J; Cohen, Alan A; Landry, Christian R; Scott, Michelle S
2017-01-01
Recent functional, proteomic and ribosome profiling studies in eukaryotes have concurrently demonstrated the translation of alternative open-reading frames (altORFs) in addition to annotated protein coding sequences (CDSs). We show that a large number of small proteins could in fact be coded by these altORFs. The putative alternative proteins translated from altORFs have orthologs in many species and contain functional domains. Evolutionary analyses indicate that altORFs often show more extreme conservation patterns than their CDSs. Thousands of alternative proteins are detected in proteomic datasets by reanalysis using a database containing predicted alternative proteins. This is illustrated with specific examples, including altMiD51, a 70 amino acid mitochondrial fission-promoting protein encoded in MiD51/Mief1/SMCR7L, a gene encoding an annotated protein promoting mitochondrial fission. Our results suggest that many genes are multicoding genes and code for a large protein and one or several small proteins. PMID:29083303
Li, Wencheng; Laishram, Rakesh S.; Hoque, Mainul; Ji, Zhe
2017-01-01
Abstract Polyadenylation of nascent RNA by poly(A) polymerase (PAP) is important for 3′ end maturation of almost all eukaryotic mRNAs. Most mammalian genes harbor multiple polyadenylation sites (PASs), leading to expression of alternative polyadenylation (APA) isoforms with distinct functions. How poly(A) polymerases may regulate PAS usage and hence gene expression is poorly understood. Here, we show that the nuclear canonical (PAPα and PAPγ) and non-canonical (Star-PAP) PAPs play diverse roles in PAS selection and gene expression. Deficiencies in the PAPs resulted in perturbations of gene expression, with Star-PAP impacting lowly expressed mRNAs and long-noncoding RNAs to the greatest extent. Importantly, different PASs of a gene are distinctly regulated by different PAPs, leading to widespread relative expression changes of APA isoforms. The location and surrounding sequence motifs of a PAS appear to differentiate its regulation by the PAPs. We show Star-PAP-specific PAS usage regulates the expression of the eukaryotic translation initiation factor EIF4A1, the tumor suppressor gene PTEN and the long non-coding RNA NEAT1. The Star-PAP-mediated APA of PTEN is essential for DNA damage-induced increase of PTEN protein levels. Together, our results reveal a PAS-guided and PAP-mediated paradigm for gene expression in response to cellular signaling cues. PMID:28911096
Cis-acting elements in its 3′ UTR mediate post-transcriptional regulation of KRAS
Kim, Minlee; Kogan, Nicole; Slack, Frank J.
2016-01-01
Multiple RNA-binding proteins and non-coding RNAs, such as microRNAs (miRNAs), are involved in post-transcriptional gene regulation through recognition motifs in the 3′ untranslated region (UTR) of their target genes. The KRAS gene encodes a key signaling protein, and its messenger RNA (mRNA) contains an exceptionally long 3′ UTR; this suggests that it may be subject to a highly complex set of regulatory processes. However, 3′ UTR-dependent regulation of KRAS expression has not been explored in detail. Using extensive deletion and mutational analyses combined with luciferase reporter assays, we have identified inhibitory and stabilizing cis-acting regions within the KRAS 3′ UTR that may interact with miRNAs and RNA-binding proteins, such as HuR. Particularly, we have identified an AU-rich 49-nt fragment in the KRAS 3′ UTR that is required for KRAS 3′ UTR reporter repression. This element contains a miR-185 complementary element, and we show that overexpression of miR-185 represses endogenous KRAS mRNA and protein in vitro. In addition, we have identified another 49-nt fragment that is required to promote KRAS 3′ UTR reporter expression. These findings indicate that multiple cis-regulatory motifs in the 3′ UTR of KRAS finely modulate its expression, and sequence alterations within a binding motif may disrupt the precise functions of trans-regulatory factors, potentially leading to aberrant KRAS expression. PMID:26930719
Basic Concepts in Molecular Biology Related to Genetics and Epigenetics.
Corella, Dolores; Ordovas, Jose M
2017-09-01
The observation that "one size does not fit all" for the prevention and treatment of cardiovascular disease, among other diseases, has driven the concept of precision medicine. The goal of precision medicine is to provide the best-targeted interventions tailored to an individual's genome. The human genome is composed of billions of sequence arrangements containing a code that controls how genes are expressed. This code depends on other nonstatic regulators that surround the DNA and constitute the epigenome. Moreover, environmental factors also play an important role in this complex regulation. This review provides a general perspective on the basic concepts of molecular biology related to genetics and epigenetics and a glossary of key terms. Several examples are given of polymorphisms and genetic risk scores related to cardiovascular risk. Likewise, an overview is presented of the main epigenetic regulators, including DNA methylation, methylcytosine-phosphate-guanine-binding proteins, histone modifications, other histone regulations, micro-RNA effects, and additional emerging regulators. One of the greatest challenges is to understand how environmental factors (diet, physical activity, smoking, etc.) could alter the epigenome, resulting in healthy or unhealthy cardiovascular phenotypes. We discuss some gene-environment interactions and provide a methodological overview. Copyright © 2017 Sociedad Española de Cardiología. Published by Elsevier España, S.L.U. All rights reserved.
Quantitative Proteomic Analysis of the Hfq-Regulon in Sinorhizobium meliloti 2011
Sobrero, Patricio; Schlüter, Jan-Philip; Lanner, Ulrike; Schlosser, Andreas; Becker, Anke; Valverde, Claudio
2012-01-01
Riboregulation stands for RNA-based control of gene expression. In bacteria, small non-coding RNAs (sRNAs) are a major class of riboregulatory elements, most of which act at the post-transcriptional level by base-pairing target mRNA genes. The RNA chaperone Hfq facilitates antisense interactions between target mRNAs and regulatory sRNAs, thus influencing mRNA stability and/or translation rate. In the α-proteobacterium Sinorhizobium meliloti strain 2011, the identification and detection of multiple sRNAs genes and the broadly pleitropic phenotype associated to the absence of a functional Hfq protein both support the existence of riboregulatory circuits controlling gene expression to ensure the fitness of this bacterium in both free living and symbiotic conditions. In order to identify target mRNAs subject to Hfq-dependent riboregulation, we have compared the proteome of an hfq mutant and the wild type S. meliloti by quantitative proteomics following protein labelling with 15N. Among 2139 univocally identified proteins, a total of 195 proteins showed a differential abundance between the Hfq mutant and the wild type strain; 65 proteins accumulated ≥2-fold whereas 130 were downregulated (≤0.5-fold) in the absence of Hfq. This profound proteomic impact implies a major role for Hfq on regulation of diverse physiological processes in S. meliloti, from transport of small molecules to homeostasis of iron and nitrogen. Changes in the cellular levels of proteins involved in transport of nucleotides, peptides and amino acids, and in iron homeostasis, were confirmed with phenotypic assays. These results represent the first quantitative proteomic analysis in S. meliloti. The comparative analysis of the hfq mutant proteome allowed identification of novel strongly Hfq-regulated genes in S. meliloti. PMID:23119037
Quantitative proteomic analysis of the Hfq-regulon in Sinorhizobium meliloti 2011.
Sobrero, Patricio; Schlüter, Jan-Philip; Lanner, Ulrike; Schlosser, Andreas; Becker, Anke; Valverde, Claudio
2012-01-01
Riboregulation stands for RNA-based control of gene expression. In bacteria, small non-coding RNAs (sRNAs) are a major class of riboregulatory elements, most of which act at the post-transcriptional level by base-pairing target mRNA genes. The RNA chaperone Hfq facilitates antisense interactions between target mRNAs and regulatory sRNAs, thus influencing mRNA stability and/or translation rate. In the α-proteobacterium Sinorhizobium meliloti strain 2011, the identification and detection of multiple sRNAs genes and the broadly pleitropic phenotype associated to the absence of a functional Hfq protein both support the existence of riboregulatory circuits controlling gene expression to ensure the fitness of this bacterium in both free living and symbiotic conditions. In order to identify target mRNAs subject to Hfq-dependent riboregulation, we have compared the proteome of an hfq mutant and the wild type S. meliloti by quantitative proteomics following protein labelling with (15)N. Among 2139 univocally identified proteins, a total of 195 proteins showed a differential abundance between the Hfq mutant and the wild type strain; 65 proteins accumulated ≥2-fold whereas 130 were downregulated (≤0.5-fold) in the absence of Hfq. This profound proteomic impact implies a major role for Hfq on regulation of diverse physiological processes in S. meliloti, from transport of small molecules to homeostasis of iron and nitrogen. Changes in the cellular levels of proteins involved in transport of nucleotides, peptides and amino acids, and in iron homeostasis, were confirmed with phenotypic assays. These results represent the first quantitative proteomic analysis in S. meliloti. The comparative analysis of the hfq mutant proteome allowed identification of novel strongly Hfq-regulated genes in S. meliloti.
Musante, Luciana; Bartsch, Oliver; Ropers, Hans-Hilger; Kalscheuer, Vera M
2004-05-12
Characterization of a balanced t(2;12)(q37;q24) translocation in a patient with suspicion of Noonan syndrome revealed that the chromosome 12 breakpoint lies in the vicinity of a novel human gene, thyroid hormone receptor-associated protein 2 (THRAP2). We therefore characterized this gene and its mouse counterpart in more detail. Human and mouse THRAP2/Thrap2 span a genomic region of about 310 and >170 kilobases (kb), and both contain 31 exons. Corresponding transcripts are approximately 9.5 kb long. Their open reading frames code for proteins of 2210 and 2203 amino acids, which are 93% identical. By northern blot analysis, human and mouse THRAP2/Thrap2 genes showed ubiquitous expression. Transcripts were most abundant in human skeletal muscle and in mouse heart. THRAP2 protein is 56% identical to human TRAP240, which belongs to the thyroid hormone receptor associated protein (TRAP) complex and is evolutionary conserved up to yeast. This complex is involved in transcriptional regulation and is believed to serve as adapting interface between regulatory proteins bound to specific DNA sequences and RNA polymerase II.
Dennis, P P
1977-01-01
The fraction of the total ribonucleic acid (RNA) synthesis rate that is messenger RNA (mRNA) for ribosomal protein (r-protein) and ribosomal RNA (rRNA) has been estimated in valS(Ts) rel+ stringent and valS(Ts) relA1 relaxed strains of Escherichia coli during a partial inhibition of valyl-transfer RNA aminoacylation. The partial inhibition was accomplished by shifting the strains from the permissive growth temperature of 29.5 degrees C to the semipermissive temperature of 35.5 degrees C. The RNA synthesized at the elevated temperature was pulse labeled with [3H]uracil. The fraction of the total incorpoarted 3H radioactivity in r-protein mRNA or in rRNA was estimated by specific hybridization to the transducing phages gammaspc1, which carries about 15 r-protein genes and lambdailv5, which carries an rRNA transcription unit. The results clearly demonstrate that the rel gene influences the fraction of the total RNA synthesis rate that is r protein mRNA and rRNA; in the rel+ strain they are significantly increased relative to control cultures. This indicates that the expression of the genes coding for the RNA and protein component of the ribosome are most likely regulated at the level of transcription. Furthermore, it appears that the distribution of functioning RNA polymerase between rRNA genes, r-protein genes, and other types of genes is influenced by the rel gene control system; presumably this influence is mediated through the unusual nucleotide guanosine tetraphosphate. PMID:320185
Romero, Roberto; Tarca, Adi; Chaemsaithong, Piya; Miranda, Jezid; Chaiworapongsa, Tinnakorn; Jia, Hui; Hassan, Sonia S.; Kalita, Cynthia A.; Cai, Juan; Yeo, Lami; Lipovich, Leonard
2014-01-01
Objective The mechanisms responsible for normal and abnormal parturition are poorly understood. Myometrial activation leading to regular uterine contractions is a key component of labor. Dysfunctional labor (arrest of dilatation and/or descent) is a leading indication for cesarean delivery. Compelling evidence suggests that most of these disorders are functional in nature, and not the result of cephalopelvic disproportion. The methodology and the datasets afforded by the post-genomic era provide novel opportunities to understand and target gene functions in these disorders. In 2012, the ENCODE Consortium elucidated the extraordinary abundance and functional complexity of long non-coding RNA genes in the human genome. The purpose of the study was to identify differentially expressed long non-coding RNA genes in human myometrium in women in spontaneous labor at term. Materials and Methods Myometrium was obtained from women undergoing cesarean deliveries who were not in labor (n=19) and women in spontaneous labor at term (n=20). RNA was extracted and profiled using an Illumina® microarray platform. The analysis of the protein coding genes from this study has been previously reported. Here, we have used computational approaches to bound the extent of long non-coding RNA representation on this platform, and to identify co-differentially expressed and correlated pairs of long non-coding RNA genes and protein-coding genes sharing the same genomic loci. Results Upon considering more than 18,498 distinct lncRNA genes compiled nonredundantly from public experimental data sources, and interrogating 2,634 that matched Illumina microarray probes, we identified co-differential expression and correlation at two genomic loci that contain coding-lncRNA gene pairs: SOCS2-AK054607 and LMCD1-NR_024065 in women in spontaneous labor at term. This co-differential expression and correlation was validated by qRT-PCR, an independent experimental method. Intriguingly, one of the two lncRNA genes differentially expressed in term labor had a key genomic structure element, a splice site that lacked evolutionary conservation beyond primates. Conclusions We provide for the first time evidence for coordinated differential expression and correlation of cis-encoded antisense lncRNAs and protein-coding genes with known, as well as novel roles in pregnancy in the myometrium of women in spontaneous labor at term. PMID:24168098
Long-Range Control of Gene Expression: Emerging Mechanisms and Disruption in Disease
Kleinjan, Dirk A.; van Heyningen, Veronica
2005-01-01
Transcriptional control is a major mechanism for regulating gene expression. The complex machinery required to effect this control is still emerging from functional and evolutionary analysis of genomic architecture. In addition to the promoter, many other regulatory elements are required for spatiotemporally and quantitatively correct gene expression. Enhancer and repressor elements may reside in introns or up- and downstream of the transcription unit. For some genes with highly complex expression patterns—often those that function as key developmental control genes—the cis-regulatory domain can extend long distances outside the transcription unit. Some of the earliest hints of this came from disease-associated chromosomal breaks positioned well outside the relevant gene. With the availability of wide-ranging genome sequence comparisons, strong conservation of many noncoding regions became obvious. Functional studies have shown many of these conserved sites to be transcriptional regulatory elements that sometimes reside inside unrelated neighboring genes. Such sequence-conserved elements generally harbor sites for tissue-specific DNA-binding proteins. Developmentally variable chromatin conformation can control protein access to these sites and can regulate transcription. Disruption of these finely tuned mechanisms can cause disease. Some regulatory element mutations will be associated with phenotypes distinct from any identified for coding-region mutations. PMID:15549674
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ansong, Charles; Tolic, Nikola; Purvine, Samuel O.
Complete and accurate genome annotation is crucial for comprehensive and systematic studies of biological systems. For example systems biology-oriented genome scale modeling efforts greatly benefit from accurate annotation of protein-coding genes to develop proper functioning models. However, determining protein-coding genes for most new genomes is almost completely performed by inference, using computational predictions with significant documented error rates (> 15%). Furthermore, gene prediction programs provide no information on biologically important post-translational processing events critical for protein function. With the ability to directly measure peptides arising from expressed proteins, mass spectrometry-based proteomics approaches can be used to augment and verify codingmore » regions of a genomic sequence and importantly detect post-translational processing events. In this study we utilized “shotgun” proteomics to guide accurate primary genome annotation of the bacterial pathogen Salmonella Typhimurium 14028 to facilitate a systems-level understanding of Salmonella biology. The data provides protein-level experimental confirmation for 44% of predicted protein-coding genes, suggests revisions to 48 genes assigned incorrect translational start sites, and uncovers 13 non-annotated genes missed by gene prediction programs. We also present a comprehensive analysis of post-translational processing events in Salmonella, revealing a wide range of complex chemical modifications (70 distinct modifications) and confirming more than 130 signal peptide and N-terminal methionine cleavage events in Salmonella. This study highlights several ways in which proteomics data applied during the primary stages of annotation can improve the quality of genome annotations, especially with regards to the annotation of mature protein products.« less
Wieczorek, D F; Smith, C W; Nadal-Ginard, B
1988-01-01
Tropomyosin (TM), a ubiquitous protein, is a component of the contractile apparatus of all cells. In nonmuscle cells, it is found in stress fibers, while in sarcomeric and nonsarcomeric muscle, it is a component of the thin filament. Several different TM isoforms specific for nonmuscle cells and different types of muscle cell have been described. As for other contractile proteins, it was assumed that smooth, striated, and nonmuscle isoforms were each encoded by different sets of genes. Through the use of S1 nuclease mapping, RNA blots, and 5' extension analyses, we showed that the rat alpha-TM gene, whose expression was until now considered to be restricted to muscle cells, generates many different tissue-specific isoforms. The promoter of the gene appears to be very similar to other housekeeping promoters in both its pattern of utilization, being active in most cell types, and its lack of any canonical sequence elements. The rat alpha-TM gene is split into at least 13 exons, 7 of which are alternatively spliced in a tissue-specific manner. This gene arrangement, which also includes two different 3' ends, generates a minimum of six different mRNAs each with the capacity to code for a different protein. These distinct TM isoforms are expressed specifically in nonmuscle and smooth and striated (cardiac and skeletal) muscle cells. The tissue-specific expression and developmental regulation of these isoforms is, therefore, produced by alternative mRNA processing. Moreover, structural and sequence comparisons among TM genes from different phyla suggest that alternative splicing is evolutionarily a very old event that played an important role in gene evolution and might have appeared concomitantly with or even before constitutive splicing. Images PMID:3352602
de Porcellinis, Alice J; Klähn, Stephan; Rosgaard, Lisa; Kirsch, Rebekka; Gutekunst, Kirstin; Georg, Jens; Hess, Wolfgang R; Sakuragi, Yumiko
2016-10-01
Carbohydrate metabolism is a tightly regulated process in photosynthetic organisms. In the cyanobacterium Synechocystis sp. PCC 6803, the photomixotrophic growth protein A (PmgA) is involved in the regulation of glucose and storage carbohydrate (i.e. glycogen) metabolism, while its biochemical activity and possible factors acting downstream of PmgA are unknown. Here, a genome-wide microarray analysis of a ΔpmgA strain identified the expression of 36 protein-coding genes and 42 non-coding transcripts as significantly altered. From these, the non-coding RNA Ncr0700 was identified as the transcript most strongly reduced in abundance. Ncr0700 is widely conserved among cyanobacteria. In Synechocystis its expression is inversely correlated with light intensity. Similarly to a ΔpmgA mutant, a Δncr0700 deletion strain showed an approximately 2-fold increase in glycogen content under photoautotrophic conditions and wild-type-like growth. Moreover, its growth was arrested by 38 h after a shift to photomixotrophic conditions. Ectopic expression of Ncr0700 in Δncr0700 and ΔpmgA restored the glycogen content and photomixotrophic growth to wild-type levels. These results indicate that Ncr0700 is required for photomixotrophic growth and the regulation of glycogen accumulation, and acts downstream of PmgA. Hence Ncr0700 is renamed here as PmgR1 for photomixotrophic growth RNA 1. © The Author 2016. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Profiling and bioinformatic analysis of circular RNA expression regulated by c-Myc.
Gou, Qiheng; Wu, Ke; Zhou, Jian-Kang; Xie, Yuxin; Liu, Lunxu; Peng, Yong
2017-09-22
The c-Myc transcription factor is involved in cell proliferation, cell cycle and apoptosis by activating or repressing transcription of multiple genes. Circular RNAs (circRNAs) are widely expressed non-coding RNAs participating in the regulation of gene expression. Using a high-throughput microarray assay, we showed that Myc regulates the expression of certain circRNAs. A total of 309 up- and 252 down-regulated circRNAs were identified. Among them, randomly selected 8 circRNAs were confirmed by real-time PCR. Subsequently, Myc-binding sites were found to generally exist in the promoter regions of differentially expressed circRNAs. Based on miRNA sponge mechanism, we constructed circRNAs/miRNAs network regulated by Myc, suggesting that circRNAs may widely regulate protein expression through miRNA sponge mechanism. Lastly, we took advantage of Gene Ontology and KEGG analyses to point out that Myc-regulated circRNAs could impact cell proliferation through affecting Ras signaling pathway and pathways in cancer. Our study for the first time demonstrated that Myc transcription factor regulates the expression of circRNAs, adding a novel component of the Myc tumorigenic program and opening a window to investigate the function of certain circRNAs in tumorigenesis.
Möller, André; Xie, Sheila Q.; Hosp, Fabian; Lang, Benjamin; Phatnani, Hemali P.; James, Sonya; Ramirez, Francisco; Collin, Gayle B.; Naggert, Jürgen K.; Babu, M. Madan; Greenleaf, Arno L.; Selbach, Matthias; Pombo, Ana
2012-01-01
RNA polymerase II (RNAPII) transcribes protein-coding genes in eukaryotes and interacts with factors involved in chromatin remodeling, transcriptional activation, elongation, and RNA processing. Here, we present the isolation of native RNAPII complexes using mild extraction conditions and immunoaffinity purification. RNAPII complexes were extracted from mitotic cells, where they exist dissociated from chromatin. The proteomic content of native complexes in total and size-fractionated extracts was determined using highly sensitive LC-MS/MS. Protein associations with RNAPII were validated by high-resolution immunolocalization experiments in both mitotic cells and in interphase nuclei. Functional assays of transcriptional activity were performed after siRNA-mediated knockdown. We identify >400 RNAPII associated proteins in mitosis, among these previously uncharacterized proteins for which we show roles in transcriptional elongation. We also identify, as novel functional RNAPII interactors, two proteins involved in human disease, ALMS1 and TFG, emphasizing the importance of gene regulation for normal development and physiology. PMID:22199231
Complete mitochondrial genome of the agarophyte red alga Gelidium vagum (Gelidiales).
Yang, Eun Chan; Kim, Kyeong Mi; Boo, Ga Hun; Lee, Jung-Hyun; Boo, Sung Min; Yoon, Hwan Su
2014-08-01
We describe the first complete mitochondrial genome of Gelidium vagum (Gelidiales) (24,901 bp, 30.4% GC content), an agar-producing red alga. The circular mitochondrial genome contains 43 genes, including 23 protein-coding, 18 tRNA and 2 rRNA genes. All the protein-coding genes have a typical ATG start codon. No introns were found. Two genes, secY and rps12, were overlapped by 41 bp.
Li, Ao; Zhao, Haizhou; Lai, Qingying; Huang, Zhihong; Yuan, Meijin
2015-01-01
ABSTRACT Many viruses utilize viral or cellular chromatin machinery for efficient infection. Baculoviruses encode a conserved protamine-like protein, P6.9. This protein plays essential roles in various viral physiological processes during infection. However, the mechanism by which P6.9 regulates transcription remains unknown. In this study, 7 phosphorylated species of P6.9 were resolved in Sf9 cells infected with the baculovirus type species Autographa californica multiple nucleopolyhedrovirus (AcMNPV). Mass spectrometry identified 22 phosphorylation and 10 methylation sites but no acetylation sites in P6.9. Immunofluorescence demonstrated that the P6.9 and virus-encoded serine/threonine kinase PK1 exhibited similar distribution patterns in infected cells, and coimmunoprecipitation confirmed the interaction between them. Upon pk1 deletion, nucleocapsid assembly and polyhedron formation were interrupted and the transcription of viral very late genes was downregulated. Interestingly, we found that the 3 most phosphorylated P6.9 species vanished from Sf9 cells transfected with the pk1 deletion mutant, suggesting that PK1 is involved in the hyperphosphorylation of P6.9. Mass spectrometry suggested that the phosphorylation of the 7 Ser/Thr and 5 Arg residues in P6.9 was PK1 dependent. Replacement of the 7 Ser/Thr residues with Ala resulted in a P6.9 phosphorylation pattern similar to that of the pk1 deletion mutant. Importantly, the decreases in the transcription level of viral very late genes and viral infectivity were consistent. Our findings reveal that P6.9 hyperphosphorylation is a precondition for the maximal hyperexpression of baculovirus very late genes and provide the first experimental insights into the function of the baculovirus protamine-like protein and the related protein kinase in epigenetics. IMPORTANCE Diverse posttranslational modifications (PTMs) of histones constitute a code that creates binding platforms that recruit transcription factors to regulate gene expression. Many viruses also utilize host- or virus-induced chromatin machinery to promote efficient infections. Baculoviruses encode a protamine-like protein, P6.9, which is required for a variety of processes in the infection cycle. Currently, P6.9's PTM sites and its regulating factors remain unknown. Here, we found that P6.9 could be categorized as unphosphorylated, hypophosphorylated, and hyperphosphorylated species and that a virus-encoded serine/threonine kinase, PK1, was essential for P6.9 hyperphosphorylation. Abundant PTM sites on P6.9 were identified, among which 7 Ser/Thr phosphorylated sites were PK1 dependent. Mutation of these Ser/Thr sites reduced very late viral gene transcription and viral infectivity, indicating that the PK1-mediated P6.9 hyperphosphorylation contributes to viral proliferation. These data suggest that a code exists in the sophisticated PTM of viral protamine-like proteins and participates in viral gene transcription. PMID:25972542
Carbon source-dependent expansion of the genetic code in bacteria
Prat, Laure; Heinemann, Ilka U.; Aerni, Hans R.; Rinehart, Jesse; O’Donoghue, Patrick; Söll, Dieter
2012-01-01
Despite the fact that the genetic code is known to vary between organisms in rare cases, it is believed that in the lifetime of a single cell the code is stable. We found Acetohalobium arabaticum cells grown on pyruvate genetically encode 20 amino acids, but in the presence of trimethylamine (TMA), A. arabaticum dynamically expands its genetic code to 21 amino acids including pyrrolysine (Pyl). A. arabaticum is the only known organism that modulates the size of its genetic code in response to its environment and energy source. The gene cassette pylTSBCD, required to biosynthesize and genetically encode UAG codons as Pyl, is present in the genomes of 24 anaerobic archaea and bacteria. Unlike archaeal Pyl-decoding organisms that constitutively encode Pyl, we observed that A. arabaticum controls Pyl encoding by down-regulating transcription of the entire Pyl operon under growth conditions lacking TMA, to the point where no detectable Pyl-tRNAPyl is made in vivo. Pyl-decoding archaea adapted to an expanded genetic code by minimizing TAG codon frequency to typically ∼5% of ORFs, whereas Pyl-decoding bacteria (∼20% of ORFs contain in-frame TAGs) regulate Pyl-tRNAPyl formation and translation of UAG by transcriptional deactivation of genes in the Pyl operon. We further demonstrate that Pyl encoding occurs in a bacterium that naturally encodes the Pyl operon, and identified Pyl residues by mass spectrometry in A. arabaticum proteins including two methylamine methyltransferases. PMID:23185002
Zhang, Haiyun; Sun, Dejun; Li, Defu; Zheng, Zeguang; Xu, Jingyi; Liang, Xue; Zhang, Chenting; Wang, Sheng; Wang, Jian; Lu, Wenju
2018-05-15
Long non-coding RNAs (lncRNAs) have critical regulatory roles in protein-coding gene expression. Aberrant expression profiles of lncRNAs have been observed in various human diseases. In this study, we investigated transcriptome profiles in lung tissues of chronic cigarette smoke (CS)-induced COPD mouse model. We found that 109 lncRNAs and 260 mRNAs were significantly differential expressed in lungs of chronic CS-induced COPD mouse model compared with control animals. GO and KEGG analyses indicated that differentially expressed lncRNAs associated protein-coding genes were mainly involved in protein processing of endoplasmic reticulum pathway, and taurine and hypotaurine metabolism pathway. The combination of high throughput data analysis and the results of qRT-PCR validation in lungs of chronic CS-induced COPD mouse model, 16HBE cells with CSE treatment and PBMC from patients with COPD revealed that NR_102714 and its associated protein-coding gene UCHL1 might be involved in the development of COPD both in mouse and human. In conclusion, our study demonstrated that aberrant expression profiles of lncRNAs and mRNAs existed in lungs of chronic CS-induced COPD mouse model. From animal models perspective, these results might provide further clues to investigate biological functions of lncRNAs and their potential target protein-coding genes in the pathogenesis of COPD.
Kanda, Kojun; Pflug, James M; Sproul, John S; Dasenko, Mark A; Maddison, David R
2015-01-01
In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced.
Dasenko, Mark A.
2015-01-01
In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced. PMID:26716693
New technologies accelerate the exploration of non-coding RNAs in horticultural plants
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Degao; Mewalal, Ritesh; Hu, Rongbin
Non-coding RNAs (ncRNAs), that is, RNAs not translated into proteins, are crucial regulators of a variety of biological processes in plants. While protein-encoding genes have been relatively well-annotated in sequenced genomes, accounting for a small portion of the genome space in plants, the universe of plant ncRNAs is rapidly expanding. Recent advances in experimental and computational technologies have generated a great momentum for discovery and functional characterization of ncRNAs. Here we summarize the classification and known biological functions of plant ncRNAs, review the application of next-generation sequencing (NGS) technology and ribosome profiling technology to ncRNA discovery in horticultural plants andmore » discuss the application of new technologies, especially the new genome-editing tool clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein 9 (Cas9) systems, to functional characterization of plant ncRNAs.« less
New technologies accelerate the exploration of non-coding RNAs in horticultural plants
Liu, Degao; Mewalal, Ritesh; Hu, Rongbin; Tuskan, Gerald A; Yang, Xiaohan
2017-01-01
Non-coding RNAs (ncRNAs), that is, RNAs not translated into proteins, are crucial regulators of a variety of biological processes in plants. While protein-encoding genes have been relatively well-annotated in sequenced genomes, accounting for a small portion of the genome space in plants, the universe of plant ncRNAs is rapidly expanding. Recent advances in experimental and computational technologies have generated a great momentum for discovery and functional characterization of ncRNAs. Here we summarize the classification and known biological functions of plant ncRNAs, review the application of next-generation sequencing (NGS) technology and ribosome profiling technology to ncRNA discovery in horticultural plants and discuss the application of new technologies, especially the new genome-editing tool clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein 9 (Cas9) systems, to functional characterization of plant ncRNAs. PMID:28698797
Dover, Nir; Barash, Jason R.; Burke, Julianne N.; ...
2014-05-22
Botulinum neurotoxin (BoNT) is the most poisonous substances known and its eight toxin types (A to H) are distinguished by the inability of polyclonal antibodies that neutralize one toxin type to neutralize any of the other seven toxin types. Infant botulism, an intestinal toxemia orphan disease, is the most common form of human botulism in the United States. It results from swallowed spores of Clostridium botulinum (or rarely, neurotoxigenic Clostridium butyricum or Clostridium baratii) that germinate and temporarily colonize the lumen of the large intestine, where, as vegetative cells, they produce botulinum toxin. Botulinum neurotoxin is encoded by the bontmore » gene that is part of a toxin gene cluster that includes several accessory genes. In this paper, we sequenced for the first time the complete botulinum neurotoxin gene cluster of nonproteolytic C. baratii type F7. Like the type E and the nonproteolytic type F6 botulinum toxin gene clusters, the C. baratii type F7 had an orfX toxin gene cluster that lacked the regulatory botR gene which is found in proteolytic C. botulinum strains and codes for an alternative σ factor. In the absence of botR, we identified a putative alternative regulatory gene located upstream of the C. baratii type F7 toxin gene cluster. This putative regulatory gene codes for a predicted σ factor that contains DNA-binding-domain homologues to the DNA-binding domains both of BotR and of other members of the TcdR-related group 5 of the σ 70 family that are involved in the regulation of toxin gene expression in clostridia. We showed that this TcdR-related protein in association with RNA polymerase core enzyme specifically binds to the C. baratii type F7 botulinum toxin gene cluster promoters. Finally, this TcdR-related protein may therefore be involved in regulating the expression of the genes of the botulinum toxin gene cluster in neurotoxigenic C. baratii.« less
Kim, Seungill; Kim, Myung-Shin; Kim, Yong-Min; Yeom, Seon-In; Cheong, Kyeongchae; Kim, Ki-Tae; Jeon, Jongbum; Kim, Sunggil; Kim, Do-Sun; Sohn, Seong-Han; Lee, Yong-Hwan; Choi, Doil
2015-01-01
The onion (Allium cepa L.) is one of the most widely cultivated and consumed vegetable crops in the world. Although a considerable amount of onion transcriptome data has been deposited into public databases, the sequences of the protein-coding genes are not accurate enough to be used, owing to non-coding sequences intermixed with the coding sequences. We generated a high-quality, annotated onion transcriptome from de novo sequence assembly and intensive structural annotation using the integrated structural gene annotation pipeline (ISGAP), which identified 54,165 protein-coding genes among 165,179 assembled transcripts totalling 203.0 Mb by eliminating the intron sequences. ISGAP performed reliable annotation, recognizing accurate gene structures based on reference proteins, and ab initio gene models of the assembled transcripts. Integrative functional annotation and gene-based SNP analysis revealed a whole biological repertoire of genes and transcriptomic variation in the onion. The method developed in this study provides a powerful tool for the construction of reference gene sets for organisms based solely on de novo transcriptome data. Furthermore, the reference genes and their variation described here for the onion represent essential tools for molecular breeding and gene cloning in Allium spp. PMID:25362073
Reschen, Michael E; Lin, Da; Chalisey, Anil; Soilleux, Elizabeth J; O'Callaghan, Christopher A
2016-07-01
Coronary artery disease (CAD) risk is associated with non-coding genetic variants at the phosphatase and actin regulating protein 1(PHACTR1) gene locus. The PHACTR1 gene encodes an actin-binding protein with phosphatase regulating activity. The mechanism whereby PHACTR1 influences CAD risk is unknown. We hypothesized that PHACTR1 would be expressed in human cell types relevant to CAD and regulated by atherogenic or genetic factors. Using immunohistochemistry, we demonstrate that PHACTR1 protein is expressed strongly in human atherosclerotic plaque macrophages, lipid-laden foam cells, adventitial lymphocytes and endothelial cells. Using a combination of genomic analysis and molecular techniques, we demonstrate that PHACTR1 is expressed as multiple previously uncharacterized transcripts in macrophages, foam cells, lymphocytes and endothelial cells. Immunoblotting confirmed a total absence of PHACTR1 in vascular smooth muscle cells. Real-time quantitative PCR showed that PHACTR1 is regulated by atherogenic and inflammatory stimuli. In aortic endothelial cells, oxLDL and TNF-alpha both upregulated an intermediate length transcript. A short transcript expressed only in immune cells was upregulated in macrophages by oxidized low-density lipoprotein, and oxidized phospholipids but suppressed by lipopolysaccharide or TNF-alpha. In primary human macrophages, we identified a novel expression quantitative trait locus (eQTL) specific for this short transcript, whereby the risk allele at CAD risk SNP rs9349379 is associated with reduced PHACTR1 expression, similar to the effect of an inflammatory stimulus. Our data demonstrate that PHACTR1 is a key atherosclerosis candidate gene since it is regulated by atherogenic stimuli in macrophages and endothelial cells and we identify an effect of the genetic risk variant on PHACTR1 expression in macrophages that is similar to that of an inflammatory stimulus. Copyright © 2016 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Asteri, Ioanna-Areti; Boutou, Effrossyni; Anastasiou, Rania; Pot, Bruno; Vorgias, Constantinos E.; Tsakalidou, Effie; Papadimitriou, Konstantinos
2011-01-01
gsiB, coding for glucose starvation-inducible protein B, is a characteristic member of the σΒ stress regulon of Bacillus subtilis and several other Gram-positive bacteria. Here we provide in silico evidence for the horizontal transfer of gsiB in lactic acid bacteria that are devoid of the σΒ factor. PMID:21421783
Unraveling the Molecular Basis of Temperature-Dependent Genetic Regulation in Penicillium marneffei
Yang, Ence; Wang, Gang; Woo, Patrick C. Y.; Lau, Susanna K. P.; Chow, Wang-Ngai; Chong, Ken T. K.; Tse, Herman; Kao, Richard Y. T.; Chan, Che-Man; Che, Xiaoyan; Yuen, Kwok-Yung
2013-01-01
Penicillium marneffei is an opportunistic fungal pathogen endemic in Southeast Asia, causing lethal systemic infections in immunocompromised patients. P. marneffei grows in a mycelial form at the ambient temperature of 25°C and transitions to a yeast form at 37°C. The ability to alternate between the mycelial and yeast forms at different temperatures, namely, thermal dimorphism, has long been considered critical for the pathogenicity of P. marneffei, yet the underlying genetic mechanisms remain elusive. Here we employed high-throughput sequencing to unravel global transcriptional profiles of P. marneffei PM1 grown at 25 and 37°C. Among ∼11,000 protein-coding genes, 1,447 were overexpressed and 1,414 were underexpressed at 37°C. Counterintuitively, heat-responsive genes, predicted in P. marneffei through sequence comparison, did not tend to be overexpressed at 37°C. These results suggest that P. marneffei may take a distinct strategy of genetic regulation at the elevated temperature; the current knowledge concerning fungal heat response, based on studies of model fungal organisms, may not be applicable to P. marneffei. Our results further showed that the tandem repeat sequences (TRSs) are overrepresented in coding regions of P. marneffei genes, and TRS-containing genes tend to be overexpressed at 37°C. Furthermore, genomic sequences and expression data were integrated to characterize gene clusters, multigene families, and species-specific genes of P. marneffei. In sum, we present an integrated analysis and a comprehensive resource toward a better understanding of temperature-dependent genetic regulation in P. marneffei. PMID:23851338
Dominant genetics using a yeast genomic library under the control of a strong inducible promoter.
Ramer, S W; Elledge, S J; Davis, R W
1992-12-01
In Saccharomyces cerevisiae, numerous genes have been identified by selection from high-copy-number libraries based on "multicopy suppression" or other phenotypic consequences of overexpression. Although fruitful, this approach suffers from two major drawbacks. First, high copy number alone may not permit high-level expression of tightly regulated genes. Conversely, other genes expressed in proportion to dosage cannot be identified if their products are toxic at elevated levels. This work reports construction of a genomic DNA expression library for S. cerevisiae that circumvents both limitations by fusing randomly sheared genomic DNA to the strong, inducible yeast GAL1 promoter, which can be regulated by carbon source. The library obtained contains 5 x 10(7) independent recombinants, representing a breakpoint at every base in the yeast genome. This library was used to examine aberrant gene expression in S. cerevisiae. A screen for dominant activators of yeast mating response identified eight genes that activate the pathway in the absence of exogenous mating pheromone, including one previously unidentified gene. One activator was a truncated STE11 gene lacking approximately 1000 base pairs of amino-terminal coding sequence. In two different clones, the same GAL1 promoter-proximal ATG is in-frame with the coding sequence of STE11, suggesting that internal initiation of translation there results in production of a biologically active, truncated STE11 protein. Thus this library allows isolation based on dominant phenotypes of genes that might have been difficult or impossible to isolate from high-copy-number libraries.
Moin, Mazahar; Bakshi, Achala; Saha, Anusree; Udaya Kumar, M; Reddy, Attipalli R; Rao, K V; Siddiq, E A; Kirti, P B
2016-11-01
We have generated 3900 enhancer-based activation-tagged plants, in addition to 1030 stable Dissociator-enhancer plants in a widely cultivated indica rice variety, BPT-5204. Of them, 3000 were screened for water-use efficiency (WUE) by analysing photosynthetic quantum efficiency and yield-related attributes under water-limiting conditions that identified 200 activation-tagged mutants, which were analysed for flanking sequences at the site of enhancer integration in the genome. We have further selected five plants with low Δ 13 C, high quantum efficiency and increased plant yield compared with wild type for a detailed investigation. Expression studies of 18 genes in these mutants revealed that in four plants one of the three to four tagged genes became activated, while two genes were concurrently up-regulated in the fifth plant. Two genes coding for proteins involved in 60S ribosomal assembly, RPL6 and RPL23A, were among those that became activated by enhancers. Quantitative expression analysis of these two genes also corroborated the results on activating-tagging. The high up-regulation of RPL6 and RPL23A in various stress treatments and the presence of significant cis-regulatory elements in their promoter regions along with the high up-regulation of several of RPL genes in various stress treatments indicate that they are potential targets for manipulating WUE/abiotic stress tolerance. © 2016 John Wiley & Sons Ltd.
Hah, Nasun; Danko, Charles G.; Core, Leighton; Waterfall, Joshua J.; Siepel, Adam; Lis, John T.; Kraus, W. Lee
2011-01-01
Summary We report the immediate effects of estrogen signaling on the transcriptome of breast cancer cells using Global Run-On and sequencing (GRO-seq). The data were analyzed using a new bioinformatic approach that allowed us to identify transcripts directly from the GRO-seq data. We found that estrogen signaling directly regulates a strikingly large fraction of the transcriptome in a rapid, robust, and unexpectedly transient manner. In addition to protein coding genes, estrogen regulates the distribution and activity of all three RNA polymerases, and virtually every class of non-coding RNA that has been described to date. We also identified a large number of previously undetected estrogen-regulated intergenic transcripts, many of which are found proximal to estrogen receptor binding sites. Collectively, our results provide the most comprehensive measurement of the primary and immediate estrogen effects to date and a resource for understanding rapid signal-dependent transcription in other systems. PMID:21549415
Spatial regulation of a common precursor from two distinct genes generates metabolite diversity
Guo, Chun -Jun; Sun, Wei -Wen; Bruno, Kenneth S.; ...
2015-07-13
In secondary metabolite biosynthesis, core synthetic genes such as polyketide synthase genes usually encode proteins that generate various backbone precursors. These precursors are modified by other tailoring enzymes to yield a large variety of different secondary metabolites. The number of core synthesis genes in a given species correlates, therefore, with the number of types of secondary metabolites the organism can produce. In our study, heterologous expression of all the A. terreus NRPSlike genes showed that two NRPS-like proteins, encoded by atmelA and apvA, release the same natural product, aspulvinone E. In hyphae this compound is converted to aspulvinones whereas inmore » conidia it is converted to melanin. The genes are expressed in different tissues and this spatial control is probably regulated by their own specific promoters. Comparative genomics indicates that atmelA and apvA might share a same ancestral gene and the gene apvA is located in a highly conserved region in Aspergillus species that contains genes coding for life-essential proteins. Our data reveal the first case in secondary metabolite biosynthesis in which the tissue specific production of a single compound directs it into two separate pathways, producing distinct compounds with different functions. Our data also reveal that a single trans-prenyltransferase, AbpB, prenylates two substrates, aspulvinones and butyrolactones, revealing that genes outside of contiguous secondary metabolism gene clusters can modify more than one compound thereby expanding metabolite diversity. Our study raises the possibility of incorporation of spatial, cell-type specificity in expression of secondary metabolites of biological interest and provides new insight into designing and reconstituting their biosynthetic pathways.« less
EZH2 in Cancer Progression and Potential Application in Cancer Therapy: A Friend or Foe?
Yan, Ke-Sin; Lin, Chia-Yuan; Liao, Tan-Wei; Peng, Cheng-Ming; Lee, Shou-Chun; Liu, Yi-Jui; Chan, Wing P.; Chou, Ruey-Hwang
2017-01-01
Enhancer of zeste homolog 2 (EZH2), a histone methyltransferase, catalyzes tri-methylation of histone H3 at Lys 27 (H3K27me3) to regulate gene expression through epigenetic machinery. EZH2 functions as a double-facet molecule in regulation of gene expression via repression or activation mechanisms, depending on the different cellular contexts. EZH2 interacts with both histone and non-histone proteins to modulate diverse physiological functions including cancer progression and malignancy. In this review article, we focused on the updated information regarding microRNAs (miRNAs) and long non coding RNAs (lncRNAs) in regulation of EZH2, the oncogenic and tumor suppressive roles of EZH2 in cancer progression and malignancy, as well as current pre-clinical and clinical trials of EZH2 inhibitors. PMID:28561778
Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan
2017-10-03
Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes.
Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan
2017-01-01
Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes. PMID:29108274
Pdsg1 and Pdsg2, Novel Proteins Involved in Developmental Genome Remodelling in Paramecium
Hoehener, Cristina; Singh, Aditi; Swart, Estienne C.; Nowacki, Mariusz
2014-01-01
The epigenetic influence of maternal cells on the development of their progeny has long been studied in various eukaryotes. Multicellular organisms usually provide their zygotes not only with nutrients but also with functional elements required for proper development, such as coding and non-coding RNAs. These maternally deposited RNAs exhibit a variety of functions, from regulating gene expression to assuring genome integrity. In ciliates, such as Paramecium these RNAs participate in the programming of large-scale genome reorganization during development, distinguishing germline-limited DNA, which is excised, from somatic-destined DNA. Only a handful of proteins playing roles in this process have been identified so far, including typical RNAi-derived factors such as Dicer-like and Piwi proteins. Here we report and characterize two novel proteins, Pdsg1 and Pdsg2 (Paramecium protein involved in Development of the Somatic Genome 1 and 2), involved in Paramecium genome reorganization. We show that these proteins are necessary for the excision of germline-limited DNA during development and the survival of sexual progeny. Knockdown of PDSG1 and PDSG2 genes affects the populations of small RNAs known to be involved in the programming of DNA elimination (scanRNAs and iesRNAs) and chromatin modification patterns during development. Our results suggest an association between RNA-mediated trans-generational epigenetic signal and chromatin modifications in the process of Paramecium genome reorganization. PMID:25397898
Pdsg1 and Pdsg2, novel proteins involved in developmental genome remodelling in Paramecium.
Arambasic, Miroslav; Sandoval, Pamela Y; Hoehener, Cristina; Singh, Aditi; Swart, Estienne C; Nowacki, Mariusz
2014-01-01
The epigenetic influence of maternal cells on the development of their progeny has long been studied in various eukaryotes. Multicellular organisms usually provide their zygotes not only with nutrients but also with functional elements required for proper development, such as coding and non-coding RNAs. These maternally deposited RNAs exhibit a variety of functions, from regulating gene expression to assuring genome integrity. In ciliates, such as Paramecium these RNAs participate in the programming of large-scale genome reorganization during development, distinguishing germline-limited DNA, which is excised, from somatic-destined DNA. Only a handful of proteins playing roles in this process have been identified so far, including typical RNAi-derived factors such as Dicer-like and Piwi proteins. Here we report and characterize two novel proteins, Pdsg1 and Pdsg2 (Paramecium protein involved in Development of the Somatic Genome 1 and 2), involved in Paramecium genome reorganization. We show that these proteins are necessary for the excision of germline-limited DNA during development and the survival of sexual progeny. Knockdown of PDSG1 and PDSG2 genes affects the populations of small RNAs known to be involved in the programming of DNA elimination (scanRNAs and iesRNAs) and chromatin modification patterns during development. Our results suggest an association between RNA-mediated trans-generational epigenetic signal and chromatin modifications in the process of Paramecium genome reorganization.
Ribosome profiling reveals the what, when, where and how of protein synthesis.
Brar, Gloria A; Weissman, Jonathan S
2015-11-01
Ribosome profiling, which involves the deep sequencing of ribosome-protected mRNA fragments, is a powerful tool for globally monitoring translation in vivo. The method has facilitated discovery of the regulation of gene expression underlying diverse and complex biological processes, of important aspects of the mechanism of protein synthesis, and even of new proteins, by providing a systematic approach for experimental annotation of coding regions. Here, we introduce the methodology of ribosome profiling and discuss examples in which this approach has been a key factor in guiding biological discovery, including its prominent role in identifying thousands of novel translated short open reading frames and alternative translation products.
Wheeler, Bayly S
2013-12-01
Transposons are mobile genetic elements that are a major constituent of most genomes. Organisms regulate transposable element expression, transposition, and insertion site preference, mitigating the genome instability caused by uncontrolled transposition. A recent burst of research has demonstrated the critical role of small non-coding RNAs in regulating transposition in fungi, plants, and animals. While mechanistically distinct, these pathways work through a conserved paradigm. The presence of a transposon is communicated by the presence of its RNA or by its integration into specific genomic loci. These signals are then translated into small non-coding RNAs that guide epigenetic modifications and gene silencing back to the transposon. In addition to being regulated by the host, transposable elements are themselves capable of influencing host gene expression. Transposon expression is responsive to environmental signals, and many transposons are activated by various cellular stresses. TEs can confer local gene regulation by acting as enhancers and can also confer global gene regulation through their non-coding RNAs. Thus, transposable elements can act as stress-responsive regulators that control host gene expression in cis and trans.
Noncoding RNA Shows Context-Dependent Function | Center for Cancer Research
In addition to well-studied protein coding sequences, it is known that the genomes of higher organisms produce numerous noncoding RNAs (ncRNAs). Important roles for some ncRNAs in cell function have been demonstrated, though usually on a case-by-case basis, leading some scientists to argue that the majority of ncRNA production is just “noise” that results from the imperfect transcription machinery. The fact that many ncRNAs overlap with coding genes has hampered studies of their activities. Thus, a general understanding of whether ncRNA production is functional or not is lacking. To address this issue, Daniel Larson, Ph.D., of CCR’s Laboratory of Receptor Biology and Gene Expression, and his colleagues developed a new approach using single-molecule imaging in living cells. The researchers specifically labeled coding and ncRNAs from the GAL locus in yeast, which regulates the galactose response. Glucose is the preferred source of carbon for yeast, but when it is scarce, genes within the GAL locus, including GAL10 and GAL1, are activated to allow the metabolism of galactose.
Progressive changes in non-coding RNA profile in leucocytes with age
Muñoz-Culla, Maider; Irizar, Haritz; Gorostidi, Ana; Alberro, Ainhoa; Osorio-Querejeta, Iñaki; Ruiz-Martínez, Javier; Olascoaga, Javier; de Munain, Adolfo López; Otaegui, David
2017-01-01
It has been observed that immune cell deterioration occurs in the elderly, as well as a chronic low-grade inflammation called inflammaging. These cellular changes must be driven by numerous changes in gene expression and in fact, both protein-coding and non-coding RNA expression alterations have been observed in peripheral blood mononuclear cells from elder people. In the present work we have studied the expression of small non-coding RNA (microRNA and small nucleolar RNA -snoRNA-) from healthy individuals from 24 to 79 years old. We have observed that the expression of 69 non-coding RNAs (56 microRNAs and 13 snoRNAs) changes progressively with chronological age. According to our results, the age range from 47 to 54 is critical given that it is the period when the expression trend (increasing or decreasing) of age-related small non-coding RNAs is more pronounced. Furthermore, age-related miRNAs regulate genes that are involved in immune, cell cycle and cancer-related processes, which had already been associated to human aging. Therefore, human aging could be studied as a result of progressive molecular changes, and different age ranges should be analysed to cover the whole aging process. PMID:28448962
Etebari, Kayvan; Furlong, Michael J.; Asgari, Sassan
2015-01-01
Long non-coding RNAs (lncRNAs) play important roles in genomic imprinting, cancer, differentiation and regulation of gene expression. Here, we identified 3844 long intergenic ncRNAs (lincRNA) in Plutella xylostella, which is a notorious pest of cruciferous plants that has developed field resistance to all classes of insecticides, including Bacillus thuringiensis (Bt) endotoxins. Further, we found that some of those lincRNAs may potentially serve as precursors for the production of small ncRNAs. We found 280 and 350 lincRNAs that are differentially expressed in Chlorpyrifos and Fipronil resistant larvae. A survey on P. xylostella midgut transcriptome data from Bt-resistant populations revealed 59 altered lincRNA in two resistant strains compared with the susceptible population. We validated the transcript levels of a number of putative lincRNAs in deltamethrin-resistant larvae that were exposed to deltamethrin, which indicated that this group of lincRNAs might be involved in the response to xenobiotics in this insect. To functionally characterize DBM lincRNAs, gene ontology (GO) enrichment of their associated protein-coding genes was extracted and showed over representation of protein, DNA and RNA binding GO terms. The data presented here will facilitate future studies to unravel the function of lincRNAs in insecticide resistance or the response to xenobiotics of eukaryotic cells. PMID:26411386
Pietan, Lucas L.; Spradling, Theresa A.
2016-01-01
In animals, mitochondrial DNA (mtDNA) typically occurs as a single circular chromosome with 13 protein-coding genes and 22 tRNA genes. The various species of lice examined previously, however, have shown mitochondrial genome rearrangements with a range of chromosome sizes and numbers. Our research demonstrates that the mitochondrial genomes of two species of chewing lice found on pocket gophers, Geomydoecus aurei and Thomomydoecus minor, are fragmented with the 1,536 base-pair (bp) cytochrome-oxidase subunit I (cox1) gene occurring as the only protein-coding gene on a 1,916–1,964 bp minicircular chromosome in the two species, respectively. The cox1 gene of T. minor begins with an atypical start codon, while that of G. aurei does not. Components of the non-protein coding sequence of G. aurei and T. minor include a tRNA (isoleucine) gene, inverted repeat sequences consistent with origins of replication, and an additional non-coding region that is smaller than the non-coding sequence of other lice with such fragmented mitochondrial genomes. Sequences of cox1 minichromosome clones for each species reveal extensive length and sequence heteroplasmy in both coding and noncoding regions. The highly variable non-gene regions of G. aurei and T. minor have little sequence similarity with one another except for a 19-bp region of phylogenetically conserved sequence with unknown function. PMID:27589589
mRNA N6-methyladenosine methylation of postnatal liver development in pig.
He, Shen; Wang, Hong; Liu, Rui; He, Mengnan; Che, Tiandong; Jin, Long; Deng, Lamei; Tian, Shilin; Li, Yan; Lu, Hongfeng; Li, Xuewei; Jiang, Zhi; Li, Diyan; Li, Mingzhou
2017-01-01
N6-methyladenosine (m6A) is a ubiquitous reversible epigenetic RNA modification that plays an important role in the regulation of post-transcriptional protein coding gene expression. Liver is a vital organ and plays a major role in metabolism with numerous functions. Information concerning the dynamic patterns of mRNA m6A methylation during postnatal development of liver has been long overdue and elucidation of this information will benefit for further deciphering a multitude of functional outcomes of mRNA m6A methylation. Here, we profile transcriptome-wide m6A in porcine liver at three developmental stages: newborn (0 day), suckling (21 days) and adult (2 years). About 33% of transcribed genes were modified by m6A, with 1.33 to 1.42 m6A peaks per modified gene. m6A was distributed predominantly around stop codons. The consensus motif sequence RRm6ACH was observed in 78.90% of m6A peaks. A negative correlation (average Pearson's r = -0.45, P < 10-16) was found between levels of m6A methylation and gene expression. Functional enrichment analysis of genes consistently modified by m6A methylation at all three stages showed genes relevant to important functions, including regulation of growth and development, regulation of metabolic processes and protein catabolic processes. Genes with higher m6A methylation and lower expression levels at any particular stage were associated with the biological processes required for or unique to that stage. We suggest that differential m6A methylation may be important for the regulation of nutrient metabolism in porcine liver.
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa
2015-01-01
Background Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. Results One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. Conclusions These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity. PMID:26693737
Zhang, Huijuan; Bartley, Glenn E; Mitchell, Cheryl R; Zhang, Hui; Yokoyama, Wallace
2011-10-26
The physiological effects of the hydrolysates of white rice protein (WRP), brown rice protein (BRP), and soy protein (SP) hydrolyzed by the food grade enzyme, alcalase2.4 L, were compared to the original protein source. Male Syrian Golden hamsters were fed high-fat diets containing either 20% casein (control) or 20% extracted proteins or their hydrolysates as the protein source for 3 weeks. The brown rice protein hydrolysate (BRPH) diet group reduced weight gain 76% compared with the control. Animals fed the BRPH supplemented diet also had lower final body weight, liver weight, very low density lipoprotein cholesterol (VLDL-C), and liver cholesterol, and higher fecal fat and bile acid excretion than the control. Expression levels of hepatic genes for lipid oxidation, PPARα, ACOX1, and CPT1, were highest for hamsters fed the BRPH supplemented diet. Expression of CYP7A1, the gene regulating bile acid synthesis, was higher in all test groups. Expression of CYP51, a gene coding for an enzyme involved in cholesterol synthesis, was highest in the BRPH diet group. The results suggest that BRPH includes unique peptides that reduce weight gain and hepatic cholesterol synthesis.
Lie, Kai K; Tørresen, Ole K; Solbakken, Monica Hongrø; Rønnestad, Ivar; Tooming-Klunderud, Ave; Nederbragt, Alexander J; Jentoft, Sissel; Sæle, Øystein
2018-03-06
The ballan wrasse (Labrus bergylta) belongs to a large teleost family containing more than 600 species showing several unique evolutionary traits such as lack of stomach and hermaphroditism. Agastric fish are found throughout the teleost phylogeny, in quite diverse and unrelated lineages, indicating stomach loss has occurred independently multiple times in the course of evolution. By assembling the ballan wrasse genome and transcriptome we aimed to determine the genetic basis for its digestive system function and appetite regulation. Among other, this knowledge will aid the formulation of aquaculture diets that meet the nutritional needs of agastric species. Long and short read sequencing technologies were combined to generate a ballan wrasse genome of 805 Mbp. Analysis of the genome and transcriptome assemblies confirmed the absence of genes that code for proteins involved in gastric function. The gene coding for the appetite stimulating protein ghrelin was also absent in wrasse. Gene synteny mapping identified several appetite-controlling genes and their paralogs previously undescribed in fish. Transcriptome profiling along the length of the intestine found a declining expression gradient from the anterior to the posterior, and a distinct expression profile in the hind gut. We showed gene loss has occurred for all known genes related to stomach function in the ballan wrasse, while the remaining functions of the digestive tract appear intact. The results also show appetite control in ballan wrasse has undergone substantial changes. The loss of ghrelin suggests that other genes, such as motilin, may play a ghrelin like role. The wrasse genome offers novel insight in to the evolutionary traits of this large family. As the stomach plays a major role in protein digestion, the lack of genes related to stomach digestion in wrasse suggests it requires formulated diets with higher levels of readily digestible protein than those for gastric species.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Guo, Chun-Jun; Sun, Wei-Wen; Bruno, Kenneth S.
In secondary metabolite biosynthesis, core synthetic genes such as polyketide synthase genes or non-ribosomal peptide synthase genes usually encode proteins that generate various backbone precursors. These precursors are modified by other tailoring enzymes to yield a large variety of different secondary metabolites. The number of core synthesis genes in a given species correlates, therefore, with the number of types of secondary metabolites the organism can produce. In our study, heterologous expression of all the A. terreus NRPS-like genes showed that two NRPS-like proteins, encoded by atmelA and apvA, release the same natural product, aspulvinone E. More interestingly, further experiments revealedmore » that the aspulvinone E produced by two different genes accumulates in different fungal compartments. And this spatial control of aspulvinone E production is likely to be regulated by their own specific promoters. Comparative genomics indicates that atmelA and apvA might share a same ancestral gene and the gene apvA is inserted in a highly conserved region in Aspergillus species that contains genes coding for life-essential proteins. The study also identified one trans-prenyltransferase AbpB which is capable of prenylating two different substrates aspulvinones and butyrolactones. In total, our study shows the first example in which the locally distribution of the same natural product could lead to its incorporation into different SM pathways.« less
Polymorphisms in miRNA genes and their involvement in autoimmune diseases susceptibility.
Latini, Andrea; Ciccacci, Cinzia; Novelli, Giuseppe; Borgiani, Paola
2017-08-01
MicroRNAs (miRNAs) are small non-coding RNA molecules that negatively regulate the expression of multiple protein-encoding genes at the post-transcriptional level. MicroRNAs are involved in different pathways, such as cellular proliferation and differentiation, signal transduction and inflammation, and play crucial roles in the development of several diseases, such as cancer, diabetes, and cardiovascular diseases. They have recently been recognized to play a role also in the pathogenesis of autoimmune diseases. Although the majority of studies are focused on miRNA expression profiles investigation, a growing number of studies have been investigating the role of polymorphisms in miRNA genes in the autoimmune diseases development. Indeed, polymorphisms affecting the miRNA genes can modify the set of targets they regulate or the maturation efficiency. This review is aimed to give an overview about the available studies that have investigated the association of miRNA gene polymorphisms with the susceptibility to various autoimmune diseases and to their clinical phenotypes.
Dynamic and Widespread lncRNA Expression in a Sponge and the Origin of Animal Complexity
Gaiti, Federico; Fernandez-Valverde, Selene L.; Nakanishi, Nagayasu; Calcino, Andrew D.; Yanai, Itai; Tanurdzic, Milos; Degnan, Bernard M.
2015-01-01
Long noncoding RNAs (lncRNAs) are important developmental regulators in bilaterian animals. A correlation has been claimed between the lncRNA repertoire expansion and morphological complexity in vertebrate evolution. However, this claim has not been tested by examining morphologically simple animals. Here, we undertake a systematic investigation of lncRNAs in the demosponge Amphimedon queenslandica, a morphologically simple, early-branching metazoan. We combine RNA-Seq data across multiple developmental stages of Amphimedon with a filtering pipeline to conservatively predict 2,935 lncRNAs. These include intronic overlapping lncRNAs, exonic antisense overlapping lncRNAs, long intergenic nonprotein coding RNAs, and precursors for small RNAs. Sponge lncRNAs are remarkably similar to their bilaterian counterparts in being relatively short with few exons and having low primary sequence conservation relative to protein-coding genes. As in bilaterians, a majority of sponge lncRNAs exhibit typical hallmarks of regulatory molecules, including high temporal specificity and dynamic developmental expression. Specific lncRNA expression profiles correlate tightly with conserved protein-coding genes likely involved in a range of developmental and physiological processes, such as the Wnt signaling pathway. Although the majority of Amphimedon lncRNAs appears to be taxonomically restricted with no identifiable orthologs, we find a few cases of conservation between demosponges in lncRNAs that are antisense to coding sequences. Based on the high similarity in the structure, organization, and dynamic expression of sponge lncRNAs to their bilaterian counterparts, we propose that these noncoding RNAs are an ancient feature of the metazoan genome. These results are consistent with lncRNAs regulating the development of animals, regardless of their level of morphological complexity. PMID:25976353
Venkatachalam, Ananda B; Fontenot, Quenton; Farrara, Allyse; Wright, Jonathan M
2018-03-01
With the advent of high-throughput DNA sequencing technology, the genomic sequence of many disparate species has led to the relatively new discipline of genomics, the study of genome structure, function and evolution. Much work has been focused on the role of whole genome duplications (WGD) in the architecture of extant vertebrate genomes, particularly those of teleost fishes which underwent a WGD early in the teleost radiation >230 million years ago (mya). Our past work has focused on the fate of duplicated copies of a multigene family coding for the intracellular lipid-binding protein (iLBP) genes in the teleost fishes. To define the evolutionary processes that determined the fate of duplicated genes and generated the structure of extant fish genomes, however, requires comparative genomic analysis with a fish lineage that diverged before the teleost WGD, such as the spotted gar (Lepisosteus oculatus), an ancient, air-breathing, ray-finned fish. Here, we describe the genomic organization, chromosomal location and tissue-specific expression of a subfamily of the iLBP genes that code for fatty acid-binding proteins (Fabps) in spotted gar. Based on this work, we have defined the minimum suite of fabp genes prior to their duplication in the teleost lineages ~230-400 mya. Spotted gar, therefore, serves as an appropriate outgroup, or ancestral/ancient fish, that did not undergo the teleost-specific WGD. As such, analyses of the spatio-temporal regulation of spotted gar genes provides a foundation to determine whether the duplicated fabp genes have been retained in teleost genomes owing to either sub- or neofunctionalization. Copyright © 2017 Elsevier Inc. All rights reserved.
Novel exon 1 protein-coding regions N-terminally extend human KCNE3 and KCNE4.
Abbott, Geoffrey W
2016-08-01
The 5 human (h)KCNE β subunits each regulate various cation channels and are linked to inherited cardiac arrhythmias. Reported here are previously undiscovered protein-coding regions in exon 1 of hKCNE3 and hKCNE4 that extend their encoded extracellular domains by 44 and 51 residues, which yields full-length proteins of 147 and 221 residues, respectively. Full-length hKCNE3 and hKCNE4 transcript and protein are expressed in multiple human tissues; for hKCNE4, only the longer protein isoform is detectable. Two-electrode voltage-clamp electrophysiology revealed that, when coexpressed in Xenopus laevis oocytes with various potassium channels, the newly discovered segment preserved conversion of KCNQ1 by hKCNE3 to a constitutively open channel, but prevented its inhibition of Kv4.2 and KCNQ4. hKCNE4 slowing of Kv4.2 inactivation and positive-shifted steady-state inactivation were also preserved in the longer form. In contrast, full-length hKCNE4 inhibition of KCNQ1 was limited to 40% at +40 mV vs. 80% inhibition by the shorter form, and augmentation of KCNQ4 activity by hKCNE4 was entirely abolished by the additional segment. Among the genome databases analyzed, the longer KCNE3 is confined to primates; full-length KCNE4 is widespread in vertebrates but is notably absent from Mus musculus Findings highlight unexpected KCNE gene diversity, raise the possibility of dynamic regulation of KCNE partner modulation via splice variation, and suggest that the longer hKCNE3 and hKCNE4 proteins should be adopted in future mechanistic and genetic screening studies.-Abbott, G. W. Novel exon 1 protein-coding regions N-terminally extend human KCNE3 and KCNE4. © FASEB.
Yu, Hong; Kong, Lingfeng; Li, Qi
2016-01-01
In this study, we evaluated the efficacy of 12 mitochondrial protein-coding genes from 238 mitochondrial genomes of 140 molluscan species as potential DNA barcodes for mollusks. Three barcoding methods (distance, monophyly and character-based methods) were used in species identification. The species recovery rates based on genetic distances for the 12 genes ranged from 70.83 to 83.33%. There were no significant differences in intra- or interspecific variability among the 12 genes. The monophyly and character-based methods provided higher resolution than the distance-based method in species delimitation. Especially in closely related taxa, the character-based method showed some advantages. The results suggested that besides the standard COI barcode, other 11 mitochondrial protein-coding genes could also be potentially used as a molecular diagnostic for molluscan species discrimination. Our results also showed that the combination of mitochondrial genes did not enhance the efficacy for species identification and a single mitochondrial gene would be fully competent.
Lang, Patrick Y; Gershon, Timothy R
2018-05-01
New targets for brain tumor therapies may be identified by mutations that cause hereditary microcephaly. Brain growth depends on the repeated proliferation of stem and progenitor cells. Microcephaly syndromes result from mutations that specifically impair the ability of brain progenitor or stem cells to proliferate, by inducing either premature differentiation or apoptosis. Brain tumors that derive from brain progenitor or stem cells may share many of the specific requirements of their cells of origin. These tumors may therefore be susceptible to disruptions of the protein products of genes that are mutated in microcephaly. The potential for the products of microcephaly genes to be therapeutic targets in brain tumors are highlighted hereby reviewing research on EG5, KIF14, ASPM, CDK6, and ATR. Treatments that disrupt these proteins may open new avenues for brain tumor therapy that have increased efficacy and decreased toxicity. © 2018 WILEY Periodicals, Inc.
The Argonaute CSR-1 and its 22G-RNA cofactors are required for holocentric chromosome segregation.
Claycomb, Julie M; Batista, Pedro J; Pang, Ka Ming; Gu, Weifeng; Vasale, Jessica J; van Wolfswinkel, Josien C; Chaves, Daniel A; Shirayama, Masaki; Mitani, Shohei; Ketting, René F; Conte, Darryl; Mello, Craig C
2009-10-02
RNAi-related pathways regulate diverse processes, from developmental timing to transposon silencing. Here, we show that in C. elegans the Argonaute CSR-1, the RNA-dependent RNA polymerase EGO-1, the Dicer-related helicase DRH-3, and the Tudor-domain protein EKL-1 localize to chromosomes and are required for proper chromosome segregation. In the absence of these factors chromosomes fail to align at the metaphase plate and kinetochores do not orient to opposing spindle poles. Surprisingly, the CSR-1-interacting small RNAs (22G-RNAs) are antisense to thousands of germline-expressed protein-coding genes. Nematodes assemble holocentric chromosomes in which continuous kinetochores must span the expressed domains of the genome. We show that CSR-1 interacts with chromatin at target loci but does not downregulate target mRNA or protein levels. Instead, our findings support a model in which CSR-1 complexes target protein-coding domains to promote their proper organization within the holocentric chromosomes of C. elegans.
Modeling the Activity of Single Genes
NASA Technical Reports Server (NTRS)
Mjolsness, Eric; Gibson, Michael
1999-01-01
The central dogma of molecular biology states that information is stored in DNA, transcribed to messenger RNA (mRNA) and then translated into proteins. This picture is significantly augmentated when we consider the action of certain proteins in regulating transcription. These transcription factors provide a feedback pathway by which genes can regulate one another's expression as mRNA and then as protein. To review: DNA, RNA and proteins have different functions. DNA is the molecular storehouse of genetic information. When cells divide, the DNA is replicated, so that each daughter cell maintains the same genetic information as the mother cell. RNA acts as a go-between from DNA to proteins. Only a single copy of DNA is present, but multiple copies of the same piece of RNA may be present, allowing cells to make huge amounts of protein. In eukaryotes (organisms with a nucleus), DNA is found in the nucleus only. RNA is copied in the nucleus then translocates(moves) outside the nucleus, where it is transcribed into proteins. Along the way, the RNA may be spliced, i.e., may have pieces cut out. RNA then attaches to ribosomes and is translated to proteins. Proteins are the machinery of the cell other than DNA and RNA, all the complex molecules of the cell are proteins. Proteins are specialized machines, each of which fulfills its own task, which may be transporting oxygen, catalyzing reactions, or responding to extracellular signals, just to name a few. One of the more interesting functions a protein may have is binding directly or indirectly to DNA to perform transcriptional regulation, thus forming a closed feedback loop of gene regulation. The structure of DNA and the central dogma were understood in the 50s; in the early 80s it became possible to make arbitrary modifications to DNA and use cellular machinery to transcribe and translate the resulting genes; more recently, genomes (i.e., the complete DNA sequence) of many organisms have been sequenced. This large-scale sequencing began with simple organisms, viruses and bacteria, progressed to eukaryotes such as yeast, and more recently (1998) progressed to a multi-cellular animal, the nematode Caenorhabditis elegans. Sequencers have now moved on to the fruit fly Drosophila melanogaster, whose sequence is slated for completion by the end of 1999. The human genome project is expected to determine the complete sequence of all 3 billion bases of human DNA within the next five years. In the wake of genome-scale sequencing, further instrumentation is being developed to assay gene expression and function on a comparably large scale. Much of the work in computational biology focuses on computational tools used in sequencing, finding genes that are related to a particular gene, finding which parts of the DNA code for proteins and which do not, understanding what proteins will be formed from a given length of DNA, predicting how the proteins will fold from a one-dimensional structure into a three dimensional structure, and so on. Much less computational work has been done regarding the function of proteins. One reason for this is that different proteins function very differently, and so work on protein function is very specific to certain classes of proteins. There are, for example, proteins such enzymes that catalyze various intracellular reactions, receptors that respond to extracellular signals and ion channels that regulate the flow of charged particles into and out of the cell. In this chapter, we will consider a particular class of proteins called transcription factors(TFs), which are responsible for regulating when a certain gene is expressed in a certain cell, which cells it is express in, and how much is expressed. Understanding these processes will involve developing a deeper understanding of transcription, translation, and the cellular processes that control those processes. All of these elements fall under the aegis of gene regulation or more narrowly transcriptional regulation. Some of the key questions in gene regulation are: What genes are expressed in a certain cell at a certain time? How does gene expression differ from cell to cell in a multicellular organism? Which proteins act as transcription factors, i.e., are important in regulating gene expression? From questions like these, we hope to understand which genes are important for various macroscopic processes. Nearly all of the cells of a multicellular organism contain the same DNA. Yet this same genetic information yields a large number of different cell types. The fundamental difference between a neuron and a liver cell, for example, is which genes are expressed. Thus understanding gene regulation is an important step in understanding development. Furthermore, understanding the usual genes that are expressed in cells may give important clues about various diseases. Some diseases, such as sickle cell anemia and cystic fibrosis, are caused by defects in single, non-regulatory genes; others, such as certain cancers, are caused when the cellular control circuitry malfunctions - an understanding of these diseases will involve pathways of multiple interacting gene products. There are numerous challenges in the area of understanding and modeling gene regulation. First and foremost, biologists would like to develop a deeper understanding of the processes involved, including which genes and families of genes are important, how they interact, etc. From a computation point of view, there has been embarrassingly little work done. In this chapter there are many areas in which we can phrase meaningful, non-trivial computational questions, but questions that have not been addressed. Some of these are purely computational (what is a good algorithm for dealing with a model of type X) and others are more mathematical (given a system with certain characteristics, what sort of model can one use? How does one find biochemical parameters from system-level behavior using as few experiments as possible?). In addition to biological and algorithmic problems, there is also the ever-present issue of theoretical biology - what general principles can be derived from these systems, what can one do with models other than just simulate time-courses, what can be deduced about a class of systems without knowing all the details? The fundamental challenge to computationalists and theorists is to add value to the biology - to use models, modeling techniques and algorithms to understand the biology in new ways.
Wheat CBF gene family: identification of polymorphisms in the CBF coding sequence.
Mohseni, Sara; Che, Hua; Djillali, Zakia; Dumont, Estelle; Nankeu, Joseph; Danyluk, Jean
2012-12-01
Expression of cold-regulated genes needed for protection against freezing stress is mediated, in part, by the CBF transcription factor family. Previous studies with temperate cereals suggested that the CBF gene family in wheat was large, and that CBF genes were at the base of an important low temperature tolerance trait. Therefore, the goal of our study was to identify the CBF repertoire in the freezing-tolerant hexaploid wheat cultivar Norstar, and then to examine if the coding region of CBF genes in two spring cultivars contain polymorphisms that could affect the protein sequence and structure. Our analyses reveal that hexaploid wheat contains a complex CBF family consisting of at least 65 CBF genes of which 60 are known to be expressed in the cultivar Norstar. They represent 27 paralogous genes with 1-3 homeologous copies for the A, B, and D genomes. The cultivar Norstar contains two pseudogenes and at least 24 additional proteins having sequences and (or) structures that deviate from the consensus in the conserved AP2 DNA-binding and (or) C-terminal activation-domains. This suggests that in cultivars such as Norstar, low temperature tolerance may be increased through breeding of additional optimal alleles. The examination of the CBF repertoire present in the two spring cultivars, Chinese Spring and Manitou, reveals that they have additional polymorphisms affecting conserved positions in these domains. Understanding the effects of these polymorphisms will provide additional information for the selection of optimum CBF alleles in Triticeae breeding programs.
Keeping abreast with long non-coding RNAs in mammary gland development and breast cancer
Hansji, Herah; Leung, Euphemia Y.; Baguley, Bruce C.; Finlay, Graeme J.; Askarian-Amiri, Marjan E.
2014-01-01
The majority of the human genome is transcribed, even though only 2% of transcripts encode proteins. Non-coding transcripts were originally dismissed as evolutionary junk or transcriptional noise, but with the development of whole genome technologies, these non-coding RNAs (ncRNAs) are emerging as molecules with vital roles in regulating gene expression. While shorter ncRNAs have been extensively studied, the functional roles of long ncRNAs (lncRNAs) are still being elucidated. Studies over the last decade show that lncRNAs are emerging as new players in a number of diseases including cancer. Potential roles in both oncogenic and tumor suppressive pathways in cancer have been elucidated, but the biological functions of the majority of lncRNAs remain to be identified. Accumulated data are identifying the molecular mechanisms by which lncRNA mediates both structural and functional roles. LncRNA can regulate gene expression at both transcriptional and post-transcriptional levels, including splicing and regulating mRNA processing, transport, and translation. Much current research is aimed at elucidating the function of lncRNAs in breast cancer and mammary gland development, and at identifying the cellular processes influenced by lncRNAs. In this paper we review current knowledge of lncRNAs contributing to these processes and present lncRNA as a new paradigm in breast cancer development. PMID:25400658
Nayidu, Naghabushana K.; Kagale, Sateesh; Taheri, Ali; Withana-Gamage, Thushan S.; Parkin, Isobel A. P.; Sharpe, Andrew G.; Gruber, Margaret Y.
2014-01-01
Coding sequences for major trichome regulatory genes, including the positive regulators GLABRA 1(GL1), GLABRA 2 (GL2), ENHANCER OF GLABRA 3 (EGL3), and TRANSPARENT TESTA GLABRA 1 (TTG1) and the negative regulator TRIPTYCHON (TRY), were cloned from wild Brassica villosa, which is characterized by dense trichome coverage over most of the plant. Transcript (FPKM) levels from RNA sequencing indicated much higher expression of the GL2 and TTG1 regulatory genes in B. villosa leaves compared with expression levels of GL1 and EGL3 genes in either B. villosa or the reference genome species, glabrous B. oleracea; however, cotyledon TTG1 expression was high in both species. RNA sequencing and Q-PCR also revealed an unusual expression pattern for the negative regulators TRY and CPC, which were much more highly expressed in trichome-rich B. villosa leaves than in glabrous B. oleracea leaves and in glabrous cotyledons from both species. The B. villosa TRY expression pattern also contrasted with TRY expression patterns in two diploid Brassica species, and with the Arabidopsis model for expression of negative regulators of trichome development. Further unique sequence polymorphisms, protein characteristics, and gene evolution studies highlighted specific amino acids in GL1 and GL2 coding sequences that distinguished glabrous species from hairy species and several variants that were specific for each B. villosa gene. Positive selection was observed for GL1 between hairy and non-hairy plants, and as expected the origin of the four expressed positive trichome regulatory genes in B. villosa was predicted to be from B. oleracea. In particular the unpredicted expression patterns for TRY and CPC in B. villosa suggest additional characterization is needed to determine the function of the expanded families of trichome regulatory genes in more complex polyploid species within the Brassicaceae. PMID:24755905
Using a Euclid distance discriminant method to find protein coding genes in the yeast genome.
Zhang, Chun-Ting; Wang, Ju; Zhang, Ren
2002-02-01
The Euclid distance discriminant method is used to find protein coding genes in the yeast genome, based on the single nucleotide frequencies at three codon positions in the ORFs. The method is extremely simple and may be extended to find genes in prokaryotic genomes or eukaryotic genomes with less introns. Six-fold cross-validation tests have demonstrated that the accuracy of the algorithm is better than 93%. Based on this, it is found that the total number of protein coding genes in the yeast genome is less than or equal to 5579 only, about 3.8-7.0% less than 5800-6000, which is currently widely accepted. The base compositions at three codon positions are analyzed in details using a graphic method. The result shows that the preference codons adopted by yeast genes are of the RGW type, where R, G and W indicate the bases of purine, non-G and A/T, whereas the 'codons' in the intergenic sequences are of the form NNN, where N denotes any base. This fact constitutes the basis of the algorithm to distinguish between coding and non-coding ORFs in the yeast genome. The names of putative non-coding ORFs are listed here in detail.
Abernathy, Jason; Overturf, Ken
2018-01-04
Reformulation of aquafeeds in salmonid diets to include more plant proteins is critical for sustainable aquaculture. However, increasing plant proteins can lead to stunted growth and enteritis. Toward an understanding of the regulatory mechanisms behind plant protein utilization, directional RNA sequencing of liver tissues from a rainbow trout strain selected for growth on an all plant-protein diet and a control strain, both fed a plant diet for 12 weeks, were utilized to construct long noncoding RNAs. Antisense long noncoding RNAs were selected for differential expression and functional analyses since they have been shown to have regulatory actions within a genome. A total of 142 unique antisense long noncoding RNAs were differentially expressed between strains, 60 of which could be mapped to a gene. Genes underlying these noncoding RNAs are indicated in lipid metabolism and immunity. Six noncoding transcripts were also found to overlap with differentially expressed protein-coding genes, all of which were co-expressed. Associating variation in regulatory elements between rainbow trout strains with differing tolerance to plant-protein diets will assist in future studies toward increased gains throughout carnivorous aquaculture.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wada, Takeyoshi; Asahi, Toru; Research Organization for Nano & Life Innovation, Waseda University #03C309, TWIns, 2-2 Wakamatsu, Shinjuku, Tokyo, 162-8480
2016-08-26
The gene coding cereblon (CRBN) was originally identified in genetic linkage analysis of mild autosomal recessive nonsyndromic intellectual disability. CRBN has broad localization in both the cytoplasm and nucleus. However, the significance of nuclear CRBN remains unknown. In the present study, we aimed to elucidate the role of CRBN in the nucleus. First, we generated a series of CRBN deletion mutants and determined the regions responsible for the nuclear localization. Only CRBN protein lacking the N-terminal region was localized outside of the nucleus, suggesting that the N-terminal region is important for its nuclear localization. CRBN was also identified as amore » thalidomide-binding protein and component of the cullin-4-containing E3 ubiquitin ligase complex. Thalidomide has been reported to be involved in the regulation of the transcription factor Ikaros by CRBN-mediated degradation. To investigate the nuclear functions of CRBN, we performed co-immunoprecipitation experiments and evaluated the binding of CRBN to Ikaros. As a result, we found that CRBN was associated with Ikaros protein, and the N-terminal region of CRBN was required for Ikaros binding. In luciferase reporter gene experiments, CRBN modulated transcriptional activity of Ikaros. Furthermore, we found that CRBN modulated Ikaros-mediated transcriptional repression of the proenkephalin gene by binding to its promoter region. These results suggest that CRBN binds to Ikaros via its N-terminal region and regulates transcriptional activities of Ikaros and its downstream target, enkephalin. - Highlights: • We found that CRBN is a nucleocytoplasmic shutting protein and identified the key domain for nucleocytoplasmic shuttling. • CRBN associates with the transcription factor Ikaros via the N-terminal domain. • CRBN modulates Ikaros-mediated transcriptional regulation and its downstream target, enkephalin.« less
Hutchins, James R. A.
2014-01-01
The genomic era has enabled research projects that use approaches including genome-scale screens, microarray analysis, next-generation sequencing, and mass spectrometry–based proteomics to discover genes and proteins involved in biological processes. Such methods generate data sets of gene, transcript, or protein hits that researchers wish to explore to understand their properties and functions and thus their possible roles in biological systems of interest. Recent years have seen a profusion of Internet-based resources to aid this process. This review takes the viewpoint of the curious biologist wishing to explore the properties of protein-coding genes and their products, identified using genome-based technologies. Ten key questions are asked about each hit, addressing functions, phenotypes, expression, evolutionary conservation, disease association, protein structure, interactors, posttranslational modifications, and inhibitors. Answers are provided by presenting the latest publicly available resources, together with methods for hit-specific and data set–wide information retrieval, suited to any genome-based analytical technique and experimental species. The utility of these resources is demonstrated for 20 factors regulating cell proliferation. Results obtained using some of these are discussed in more depth using the p53 tumor suppressor as an example. This flexible and universally applicable approach for characterizing experimental hits helps researchers to maximize the potential of their projects for biological discovery. PMID:24723265
Transcriptional regulation of the human mitochondrial peptide deformylase (PDF).
Pereira-Castro, Isabel; Costa, Luís Teixeira da; Amorim, António; Azevedo, Luisa
2012-05-18
The last years of research have been particularly dynamic in establishing the importance of peptide deformylase (PDF), a protein of the N-terminal methionine excision (NME) pathway that removes formyl-methionine from mitochondrial-encoded proteins. The genomic sequence of the human PDF gene is shared with the COG8 gene, which encodes a component of the oligomeric golgi complex, a very unusual case in Eukaryotic genomes. Since PDF is crucial in maintaining mitochondrial function and given the atypical short distance between the end of COG8 coding sequence and the PDF initiation codon, we investigated whether the regulation of the human PDF is affected by the COG8 overlapping partner. Our data reveals that PDF has several transcription start sites, the most important of which only 18 bp from the initiation codon. Furthermore, luciferase-activation assays using differently-sized fragments defined a 97 bp minimal promoter region for human PDF, which is capable of very strong transcriptional activity. This fragment contains a potential Sp1 binding site highly conserved in mammalian species. We show that this binding site, whose mutation significantly reduces transcription activation, is a target for the Sp1 transcription factor, and possibly of other members of the Sp family. Importantly, the entire minimal promoter region is located after the end of COG8's coding region, strongly suggesting that the human PDF preserves an independent regulation from its overlapping partner. Copyright © 2012 Elsevier Inc. All rights reserved.
Cheng, Yang; Wang, Xue-yang; Du, Chang; Gao, Juan; Xu, Jia-ping
2014-01-01
Abstract Bombyx mori L. (Lepidoptera: Bombycidae) nucleopolyhedrovirus (BmNPV) is a highly pathogenic virus in the sericultural industry, often causing severe damage leading to large economic losses. The immune mechanisms of B. mori against this virus remain obscure. Previous studies had demonstrated Bmlipase-1, BmNox and Bmserine protease-2 showing antiviral activity in vitro , but data on the transcription levels of these proteins in different resistant strains were not reported. In order to determine the resistance level of the four different strains (P50, A35, A40, A53) and gain a better understanding of the mechanism of resistance to BmNPV in B. mori , the relative expression level of the genes coding the three antiviral proteins in larval haemolymph and midgut of different B. mori strains resistant to BmNPV was determined. The results showed that these genes expressed significantly higher in the resistant strains compared to the susceptible strain, and the differential expression levels were consistent with the LC50 values in different strains. The transcription level of the target genes almost all up-regulated in the larvae midgut and down-regulated in the haemolymph. The results indicate the correlation of these genes to BmNPV resistance in B. mori. PMID:25373223
Tsapara, Anna; Matter, Karl; Balda, Maria S
2006-03-01
The tight junction adaptor protein ZO-1 regulates intracellular signaling and cell proliferation. Its Src homology 3 (SH3) domain is required for the regulation of proliferation and binds to the Y-box transcription factor ZO-1-associated nucleic acid binding protein (ZONAB). Binding of ZO-1 to ZONAB results in cytoplasmic sequestration and hence inhibition of ZONAB's transcriptional activity. Here, we identify a new binding partner of the SH3 domain that modulates ZO-1-ZONAB signaling. Expression screening of a cDNA library with a fusion protein containing the SH3 domain yielded a cDNA coding for Apg-2, a member of the heat-shock protein 110 (Hsp 110) subfamily of Hsp70 heat-shock proteins, which is overexpressed in carcinomas. Regulated depletion of Apg-2 in Madin-Darby canine kidney cells inhibits G(1)/S phase progression. Apg-2 coimmunoprecipitates with ZO-1 and partially localizes to intercellular junctions. Junctional recruitment and coimmunoprecipitation with ZO-1 are stimulated by heat shock. Apg-2 competes with ZONAB for binding to the SH3 domain in vitro and regulates ZONAB's transcriptional activity in reporter gene assays. Our data hence support a model in which Apg-2 regulates ZONAB function by competing for binding to the SH3 domain of ZO-1 and suggest that Apg-2 functions as a regulator of ZO-1-ZONAB signaling in epithelial cells in response to cellular stress.
Tsapara, Anna; Matter, Karl; Balda, Maria S.
2006-01-01
The tight junction adaptor protein ZO-1 regulates intracellular signaling and cell proliferation. Its Src homology 3 (SH3) domain is required for the regulation of proliferation and binds to the Y-box transcription factor ZO-1-associated nucleic acid binding protein (ZONAB). Binding of ZO-1 to ZONAB results in cytoplasmic sequestration and hence inhibition of ZONAB's transcriptional activity. Here, we identify a new binding partner of the SH3 domain that modulates ZO-1–ZONAB signaling. Expression screening of a cDNA library with a fusion protein containing the SH3 domain yielded a cDNA coding for Apg-2, a member of the heat-shock protein 110 (Hsp 110) subfamily of Hsp70 heat-shock proteins, which is overexpressed in carcinomas. Regulated depletion of Apg-2 in Madin-Darby canine kidney cells inhibits G1/S phase progression. Apg-2 coimmunoprecipitates with ZO-1 and partially localizes to intercellular junctions. Junctional recruitment and coimmunoprecipitation with ZO-1 are stimulated by heat shock. Apg-2 competes with ZONAB for binding to the SH3 domain in vitro and regulates ZONAB's transcriptional activity in reporter gene assays. Our data hence support a model in which Apg-2 regulates ZONAB function by competing for binding to the SH3 domain of ZO-1 and suggest that Apg-2 functions as a regulator of ZO-1–ZONAB signaling in epithelial cells in response to cellular stress. PMID:16407410
Saavedra, Carlos; Milan, Massimo; Leite, Ricardo B.; Cordero, David; Patarnello, Tomaso; Cancela, M. Leonor; Bargelloni, Luca
2017-01-01
Growth rate is one of the most important traits from the point of view of individual fitness and commercial production in mollusks, but its molecular and physiological basis is poorly known. We have studied differential gene expression related to differences in growth rate in adult individuals of the commercial marine clam Ruditapes decussatus. Gene expression in the gills and the digestive gland was analyzed in 5 fast-growing and five slow-growing animals by means of an oligonucleotide microarray containing 14,003 probes. A total of 356 differentially expressed genes (DEG) were found. We tested the hypothesis that differential expression might be concentrated at the growth control gene core (GCGC), i.e., the set of genes that underlie the molecular mechanisms of genetic control of tissue and organ growth and body size, as demonstrated in model organisms. The GCGC includes the genes coding for enzymes of the insulin/insulin-like growth factor signaling pathway (IIS), enzymes of four additional signaling pathways (Raf/Ras/Mapk, Jnk, TOR, and Hippo), and transcription factors acting at the end of those pathways. Only two out of 97 GCGC genes present in the microarray showed differential expression, indicating a very little contribution of GCGC genes to growth-related differential gene expression. Forty eight DEGs were shared by both organs, with gene ontology (GO) annotations corresponding to transcription regulation, RNA splicing, sugar metabolism, protein catabolism, immunity, defense against pathogens, and fatty acid biosynthesis. GO term enrichment tests indicated that genes related to growth regulation, development and morphogenesis, extracellular matrix proteins, and proteolysis were overrepresented in the gills. In the digestive gland overrepresented GO terms referred to gene expression control through chromatin rearrangement, RAS-related small GTPases, glucolysis, and energy metabolism. These analyses suggest a relevant role of, among others, some genes related to the IIS, such as the ParaHox gene Xlox, CCAR and the CCN family of secreted proteins, in the regulation of growth in bivalves. PMID:29234285
Global Analysis of the Burkholderia thailandensis Quorum Sensing-Controlled Regulon
Majerczyk, Charlotte; Brittnacher, Mitchell; Jacobs, Michael; Armour, Christopher D.; Radey, Mathew; Schneider, Emily; Phattarasokul, Somsak; Bunt, Richard
2014-01-01
Burkholderia thailandensis contains three acyl-homoserine lactone quorum sensing circuits and has two additional LuxR homologs. To identify B. thailandensis quorum sensing-controlled genes, we carried out transcriptome sequencing (RNA-seq) analyses of quorum sensing mutants and their parent. The analyses were grounded in the fact that we identified genes coding for factors shown previously to be regulated by quorum sensing among a larger set of quorum-controlled genes. We also found that genes coding for contact-dependent inhibition were induced by quorum sensing and confirmed that specific quorum sensing mutants had a contact-dependent inhibition defect. Additional quorum-controlled genes included those for the production of numerous secondary metabolites, an uncharacterized exopolysaccharide, and a predicted chitin-binding protein. This study provides insights into the roles of the three quorum sensing circuits in the saprophytic lifestyle of B. thailandensis, and it provides a foundation on which to build an understanding of the roles of quorum sensing in the biology of B. thailandensis and the closely related pathogenic Burkholderia pseudomallei and Burkholderia mallei. PMID:24464461
Posttranscriptional regulation of lipid metabolism by non-coding RNAs and RNA binding proteins.
Singh, Abhishek K; Aryal, Binod; Zhang, Xinbo; Fan, Yuhua; Price, Nathan L; Suárez, Yajaira; Fernández-Hernando, Carlos
2017-11-29
Alterations in lipoprotein metabolism enhance the risk of cardiometabolic disorders including type-2 diabetes and atherosclerosis, the leading cause of death in Western societies. While the transcriptional regulation of lipid metabolism has been well characterized, recent studies have uncovered the importance of microRNAs (miRNAs), long-non-coding RNAs (lncRNAs) and RNA binding proteins (RBP) in regulating the expression of lipid-related genes at the posttranscriptional level. Work from several groups has identified a number of miRNAs, including miR-33, miR-122 and miR-148a, that play a prominent role in controlling cholesterol homeostasis and lipoprotein metabolism. Importantly, dysregulation of miRNA expression has been associated with dyslipidemia, suggesting that manipulating the expression of these miRNAs could be a useful therapeutic approach to ameliorate cardiovascular disease (CVD). The role of lncRNAs in regulating lipid metabolism has recently emerged and several groups have demonstrated their regulation of lipoprotein metabolism. However, given the high abundance of lncRNAs and the poor-genetic conservation between species, much work will be needed to elucidate the specific role of lncRNAs in controlling lipoprotein metabolism. In this review article, we summarize recent findings in the field and highlight the specific contribution of lncRNAs and RBPs in regulating lipid metabolism. Copyright © 2017 Elsevier Ltd. All rights reserved.
Pervasive transcription: detecting functional RNAs in bacteria.
Lybecker, Meghan; Bilusic, Ivana; Raghavan, Rahul
2014-01-01
Pervasive, or genome-wide, transcription has been reported in all domains of life. In bacteria, most pervasive transcription occurs antisense to protein-coding transcripts, although recently a new class of pervasive RNAs was identified that originates from within annotated genes. Initially considered to be non-functional transcriptional noise, pervasive transcription is increasingly being recognized as important in regulating gene expression. The function of pervasive transcription is an extensively debated question in the field of transcriptomics and regulatory RNA biology. Here, we highlight the most recent contributions addressing the purpose of pervasive transcription in bacteria and discuss their implications.
[Neuromuscular system and aging: involutions and implications].
Paillard, Thierry
2013-12-01
In aged human, the number of muscle fibers and motor units decreases. The remaining motor units lose their functionality (decrease of the discharge frequency, greater fluctuation of the discharge) particularly those which contain type II fibers. The renewal of intracellular proteins declines which creates a negative balance between the daily protein losses and the capacities to renew them. The activity of the protein kinase (Akt) that stimulates the synthesis of regulation proteins (mTOR, p70S6, IGFBP-5) declines whereas the factors of degradation of proteins (NF-kappa B) are activated. Besides, the process of activation and proliferation of satellite cells is affected and the production of anabolic hormones and local factors is decreased. After a strength training program, muscle hypertrophy is linked to the protein synthesis at the level of myosin heavy chain (MHC) isoforms in older subjects. However, the transcription of the genes that code the MHC-I (slow form) increases and the transcription of the genes that code the MHC-II (fast form) decreases. Thus, the transition of the phenotype towards a slower form cannot be inverted by strength training during the advanced in age. Moreover, strength training enables to decrease the proportion of fibers containing MHC of hybrid form in the process of evolution. Hence, strength training can engender a stabilization of the muscular phenotype i.e. different isoforms of MHC. In addition, strength training counteracts the noxious effects mentioned above by generating muscular hypertrophy thanks to a reactive increase in the production of anabolic hormones. A program of aerobic training can induce an increase in the synthesis of ARN messengers coding isoforms related to the oxidative metabolism (MHC-I and to a lesser extent MHC-IIa) while the transcribed for the type MHC-IIx decrease.
Barbosa, Angela S.; Monaris, Denize; Silva, Ludmila B.; Morais, Zenaide M.; Vasconcellos, Sílvio A.; Cianciarullo, Aurora M.; Isaac, Lourdes; Abreu, Patricia A. E.
2010-01-01
We have previously shown that pathogenic leptospiral strains are able to bind C4b binding protein (C4BP). Surface-bound C4BP retains its cofactor activity, indicating that acquisition of this complement regulator may contribute to leptospiral serum resistance. In the present study, the abilities of seven recombinant putative leptospiral outer membrane proteins to interact with C4BP were evaluated. The protein encoded by LIC11947 interacted with this human complement regulator in a dose-dependent manner. The cofactor activity of C4BP bound to immobilized recombinant LIC11947 (rLIC11947) was confirmed by detecting factor I-mediated cleavage of C4b. rLIC11947 was therefore named LcpA (for leptospiral complement regulator-acquiring protein A). LcpA was shown to be an outer membrane protein by using immunoelectron microscopy, cell surface proteolysis, and Triton X-114 fractionation. The gene coding for LcpA is conserved among pathogenic leptospiral strains. This is the first characterization of a Leptospira surface protein that binds to the human complement regulator C4BP in a manner that allows this important regulator to control complement system activation mediated either by the classical pathway or by the lectin pathway. This newly identified protein may play a role in immune evasion by Leptospira spp. and may therefore represent a target for the development of a human vaccine against leptospirosis. PMID:20404075
Molecular mechanisms of pathogenesis in hepatocellular carcinoma revealed by RNA‑sequencing.
Liu, Yao; Yang, Zhe; Du, Feng; Yang, Qiao; Hou, Jie; Yan, Xiaohong; Geng, Yi; Zhao, Yaning; Wang, Hua
2017-11-01
The present study aimed to explore the underlying molecular mechanisms of hepatocellular carcinoma (HCC). RNA‑sequencing profiles GSM629264 and GSM629265, from the GSE25599 data set, were downloaded from the Gene Expression Omnibus database and processed by quality evaluation. GSM629264 and GSM629265 were from HCC and adjacent non‑cancerous tissues, respectively. TopHat software was used for alignment analysis, followed by the detection of novel splicing sites. In addition, the Cufflinks software package was used to analyze gene expressions, and the Cuffdiff program was used to screen for differently expressed genes (DEGs) and differentially expressed splicing variants. Gene ontology functional enrichment and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses of DEGs were also performed. Transcription factors (TFs) and microRNAs (miRNAs) that regulate DEGs were identified, and a protein‑protein interaction (PPI) network was constructed. The hub node in the PPI network was obtained, and the TFs and miRNAs that regulated the hub node were further predicted. The quality of the sequencing data met the standards for analysis, and the clean reads were ~65%. Most sequencing reads mapped into coding sequence exons (CDS_exons), whereas other reads mapped into exon 3' untranslated regions (UTR_Exons), 5'UTR_Exons and Introns. Upregulated and downregulated DEGs between HCC and adjacent non‑cancerous tissues were screened. Genes of differentially expressed splicing variants were identified, including vesicle‑associated membrane protein 4, phosphatidylinositol glycan anchor biosynthesis class C, protein disulfide isomerase family A member 4 and growth arrest specific 5. Screened DEGs were enriched in the complement pathway. In the PPI network, ubiquitin C (UBC) was the hub node. UBC was predicted to be regulated by several TFs, including specificity protein 1 (SP1), FBJ murine osteosarcoma viral oncogene homolog (FOS), proto‑oncogene c‑JUN (JUN), FOS‑like antigen 2 (FOSL2) and SWI/SNF‑related, matrix‑associated, actin‑dependent regulator of chromatin, subfamily A, member 4 (SMARCA4), and several miRNAs, including miR‑30 and miR‑181. Results from the present study demonstrated that UBC, SP1, FOS, JUN, FOSL2, SMARCA4, miR‑30 and miR‑181 may participate in the development of HCC.
Long Non-Coding RNAs: A Novel Paradigm for Toxicology
Dempsey, Joseph L.; Cui, Julia Yue
2017-01-01
Long non-coding RNAs (lncRNAs) are over 200 nucleotides in length and are transcribed from the mammalian genome in a tissue-specific and developmentally regulated pattern. There is growing recognition that lncRNAs are novel biomarkers and/or key regulators of toxicological responses in humans and animal models. Lacking protein-coding capacity, the numerous types of lncRNAs possess a myriad of transcriptional regulatory functions that include cis and trans gene expression, transcription factor activity, chromatin remodeling, imprinting, and enhancer up-regulation. LncRNAs also influence mRNA processing, post-transcriptional regulation, and protein trafficking. Dysregulation of lncRNAs has been implicated in various human health outcomes such as various cancers, Alzheimer’s disease, cardiovascular disease, autoimmune diseases, as well as intermediary metabolism such as glucose, lipid, and bile acid homeostasis. Interestingly, emerging evidence in the literature over the past five years has shown that lncRNA regulation is impacted by exposures to various chemicals such as polycyclic aromatic hydrocarbons, benzene, cadmium, chlorpyrifos-methyl, bisphenol A, phthalates, phenols, and bile acids. Recent technological advancements, including next-generation sequencing technologies and novel computational algorithms, have enabled the profiling and functional characterizations of lncRNAs on a genomic scale. In this review, we summarize the biogenesis and general biological functions of lncRNAs, highlight the important roles of lncRNAs in human diseases and especially during the toxicological responses to various xenobiotics, evaluate current methods for identifying aberrant lncRNA expression and molecular target interactions, and discuss the potential to implement these tools to address fundamental questions in toxicology. PMID:27864543
Miyazaki, Haruko; Miyazaki, Yoshitsugu; Geber, Antonia; Parkinson, Tanya; Hitchcock, Christopher; Falconer, Derek J.; Ward, Douglas J.; Marsden, Katherine; Bennett, John E.
1998-01-01
Sequential Candida glabrata isolates were obtained from the mouth of a patient infected with human immunodeficiency virus type 1 who was receiving high doses of fluconazole for oropharyngeal thrush. Fluconazole-susceptible colonies were replaced by resistant colonies that exhibited both increased fluconazole efflux and increased transcripts of a gene which codes for a protein with 72.5% identity to Pdr5p, an ABC multidrug transporter in Saccharomyces cerevisiae. The deduced protein had a molecular mass of 175 kDa and was composed of two homologous halves, each with six putative transmembrane domains and highly conserved sequences of ATP-binding domains. When the earliest and most azole-susceptible isolate of C. glabrata from this patient was exposed to fluconazole, increased transcripts of the PDR5 homolog appeared, linking azole exposure to regulation of this gene. PMID:9661006
Kim, Seungill; Kim, Myung-Shin; Kim, Yong-Min; Yeom, Seon-In; Cheong, Kyeongchae; Kim, Ki-Tae; Jeon, Jongbum; Kim, Sunggil; Kim, Do-Sun; Sohn, Seong-Han; Lee, Yong-Hwan; Choi, Doil
2015-02-01
The onion (Allium cepa L.) is one of the most widely cultivated and consumed vegetable crops in the world. Although a considerable amount of onion transcriptome data has been deposited into public databases, the sequences of the protein-coding genes are not accurate enough to be used, owing to non-coding sequences intermixed with the coding sequences. We generated a high-quality, annotated onion transcriptome from de novo sequence assembly and intensive structural annotation using the integrated structural gene annotation pipeline (ISGAP), which identified 54,165 protein-coding genes among 165,179 assembled transcripts totalling 203.0 Mb by eliminating the intron sequences. ISGAP performed reliable annotation, recognizing accurate gene structures based on reference proteins, and ab initio gene models of the assembled transcripts. Integrative functional annotation and gene-based SNP analysis revealed a whole biological repertoire of genes and transcriptomic variation in the onion. The method developed in this study provides a powerful tool for the construction of reference gene sets for organisms based solely on de novo transcriptome data. Furthermore, the reference genes and their variation described here for the onion represent essential tools for molecular breeding and gene cloning in Allium spp. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
A Novel Kinesin-Like Protein with a Calmodulin-Binding Domain
NASA Technical Reports Server (NTRS)
Wang, W.; Takezawa, D.; Narasimhulu, S. B.; Reddy, A. S. N.; Poovaiah, B. W.
1996-01-01
Calcium regulates diverse developmental processes in plants through the action of calmodulin. A cDNA expression library from developing anthers of tobacco was screened with S-35-labeled calmodulin to isolate cDNAs encoding calmodulin-binding proteins. Among several clones isolated, a kinesin-like gene (TCK1) that encodes a calmodulin-binding kinesin-like protein was obtained. The TCK1 cDNA encodes a protein with 1265 amino acid residues. Its structural features are very similar to those of known kinesin heavy chains and kinesin-like proteins from plants and animals, with one distinct exception. Unlike other known kinesin-like proteins, TCK1 contains a calmodulin-binding domain which distinguishes it from all other known kinesin genes. Escherichia coli-expressed TCK1 binds calmodulin in a Ca(2+)-dependent manner. In addition to the presence of a calmodulin-binding domain at the carboxyl terminal, it also has a leucine zipper motif in the stalk region. The amino acid sequence at the carboxyl terminal of TCK1 has striking homology with the mechanochemical motor domain of kinesins. The motor domain has ATPase activity that is stimulated by microtubules. Southern blot analysis revealed that TCK1 is coded by a single gene. Expression studies indicated that TCKI is expressed in all of the tissues tested. Its expression is highest in the stigma and anther, especially during the early stages of anther development. Our results suggest that Ca(2+)/calmodulin may play an important role in the function of this microtubule-associated motor protein and may be involved in the regulation of microtubule-based intracellular transport.
dos Reis, Sávio Pinho; Tavares, Liliane de Souza Conceição; Costa, Carinne de Nazaré Monteiro; Brígida, Aílton Borges Santa; de Souza, Cláudia Regina Batista
2012-06-01
Cassava (Manihot esculenta Crantz) is one of the world's most important food crops. It is cultivated mainly in developing countries of tropics, since its root is a major source of calories for low-income people due to its high productivity and resistance to many abiotic and biotic factors. A previous study has identified a partial cDNA sequence coding for a putative RING zinc finger in cassava storage root. The RING zinc finger protein is a specialized type of zinc finger protein found in many organisms. Here, we isolated the full-length cDNA sequence coding for M. esculenta RZF (MeRZF) protein by a combination of 5' and 3' RACE assays. BLAST analysis showed that its deduced amino acid sequence has a high level of similarity to plant proteins of RZF family. MeRZF protein contains a signature sequence motif for a RING zinc finger at its C-terminal region. In addition, this protein showed a histidine residue at the fifth coordination site, likely belonging to the RING-H2 subgroup, as confirmed by our phylogenetic analysis. There is also a transmembrane domain in its N-terminal region. Finally, semi-quantitative RT-PCR assays showed that MeRZF expression is increased in detached leaves treated with sodium chloride. Here, we report the first evidence of a RING zinc finger gene of cassava showing potential role in response to salt stress.
Stotz, Henrik U; Harvey, Pascoe J; Haddadi, Parham; Mashanova, Alla; Kukol, Andreas; Larkan, Nicholas J; Borhan, M Hossein; Fitt, Bruce D L
2018-01-01
Genes coding for nucleotide-binding leucine-rich repeat (LRR) receptors (NLRs) control resistance against intracellular (cell-penetrating) pathogens. However, evidence for a role of genes coding for proteins with LRR domains in resistance against extracellular (apoplastic) fungal pathogens is limited. Here, the distribution of genes coding for proteins with eLRR domains but lacking kinase domains was determined for the Brassica napus genome. Predictions of signal peptide and transmembrane regions divided these genes into 184 coding for receptor-like proteins (RLPs) and 121 coding for secreted proteins (SPs). Together with previously annotated NLRs, a total of 720 LRR genes were found. Leptosphaeria maculans-induced expression during a compatible interaction with cultivar Topas differed between RLP, SP and NLR gene families; NLR genes were induced relatively late, during the necrotrophic phase of pathogen colonization. Seven RLP, one SP and two NLR genes were found in Rlm1 and Rlm3/Rlm4/Rlm7/Rlm9 loci for resistance against L. maculans on chromosome A07 of B. napus. One NLR gene at the Rlm9 locus was positively selected, as was the RLP gene on chromosome A10 with LepR3 and Rlm2 alleles conferring resistance against L. maculans races with corresponding effectors AvrLm1 and AvrLm2, respectively. Known loci for resistance against L. maculans (extracellular hemi-biotrophic fungus), Sclerotinia sclerotiorum (necrotrophic fungus) and Plasmodiophora brassicae (intracellular, obligate biotrophic protist) were examined for presence of RLPs, SPs and NLRs in these regions. Whereas loci for resistance against P. brassicae were enriched for NLRs, no such signature was observed for the other pathogens. These findings demonstrate involvement of (i) NLR genes in resistance against the intracellular pathogen P. brassicae and a putative NLR gene in Rlm9-mediated resistance against the extracellular pathogen L. maculans.
Proliferating cell nuclear antigen (Pcna) as a direct downstream target gene of Hoxc8
DOE Office of Scientific and Technical Information (OSTI.GOV)
Min, Hyehyun; Lee, Ji-Yeon; Bok, Jinwoong
2010-02-19
Hoxc8 is a member of Hox family transcription factors that play crucial roles in spatiotemporal body patterning during embryogenesis. Hox proteins contain a conserved 61 amino acid homeodomain, which is responsible for recognition and binding of the proteins onto Hox-specific DNA binding motifs and regulates expression of their target genes. Previously, using proteome analysis, we identified Proliferating cell nuclear antigen (Pcna) as one of the putative target genes of Hoxc8. Here, we asked whether Hoxc8 regulates Pcna expression by directly binding to the regulatory sequence of Pcna. In mouse embryos at embryonic day 11.5, the expression pattern of Pcna wasmore » similar to that of Hoxc8 along the anteroposterior body axis. Moreover, Pcna transcript levels as well as cell proliferation rate were increased by overexpression of Hoxc8 in C3H10T1/2 mouse embryonic fibroblast cells. Characterization of 2.3 kb genomic sequence upstream of Pcna coding region revealed that the upstream sequence contains several Hox core binding sequences and one Hox-Pbx binding sequence. Direct binding of Hoxc8 proteins to the Pcna regulatory sequence was verified by chromatin immunoprecipitation assay. Taken together, our data suggest that Pcna is a direct downstream target of Hoxc8.« less
Zhao, Yujia; Fan, Jingjing; Li, Jinlin; Li, Jun; Zhou, Xiaohong; Li, Chun
2016-12-01
Small non-coding RNAs (sRNAs) have received much attention in recent years due to their unique biological properties, which can efficiently and specifically tune target gene expressions in bacteria. Inspired by natural sRNAs, recent works have proposed the use of artificial sRNAs (asRNAs) as genetic tools to regulate desired gene that has been applied in several fields, such as metabolic engineering and bacterial physiology studies. However, the rational design of asRNAs is still a challenge. In this study, we proposed structure and length as two criteria to implement rational visualized and precise design of asRNAs. T7 expression system was one of the most useful recombinant protein expression systems. However, it was deeply limited by the formation of inclusion body. To settle this problem, we designed a series of asRNAs to inhibit the T7 RNA polymerase (Gene1) expression to balance the rate between transcription and folding of recombinant protein. Based on the heterologous expression of Aspergillus oryzae Li-3 glucuronidase in E. coli , the asRNA-antigene1-17bp can effectively decrease the inclusion body and increase the enzyme activity by 169.9%.
Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.
Mayer, K; Schüller, C; Wambutt, R; Murphy, G; Volckaert, G; Pohl, T; Düsterhöft, A; Stiekema, W; Entian, K D; Terryn, N; Harris, B; Ansorge, W; Brandt, P; Grivell, L; Rieger, M; Weichselgartner, M; de Simone, V; Obermaier, B; Mache, R; Müller, M; Kreis, M; Delseny, M; Puigdomenech, P; Watson, M; Schmidtheini, T; Reichert, B; Portatelle, D; Perez-Alonso, M; Boutry, M; Bancroft, I; Vos, P; Hoheisel, J; Zimmermann, W; Wedler, H; Ridley, P; Langham, S A; McCullagh, B; Bilham, L; Robben, J; Van der Schueren, J; Grymonprez, B; Chuang, Y J; Vandenbussche, F; Braeken, M; Weltjens, I; Voet, M; Bastiaens, I; Aert, R; Defoor, E; Weitzenegger, T; Bothe, G; Ramsperger, U; Hilbert, H; Braun, M; Holzer, E; Brandt, A; Peters, S; van Staveren, M; Dirske, W; Mooijman, P; Klein Lankhorst, R; Rose, M; Hauf, J; Kötter, P; Berneiser, S; Hempel, S; Feldpausch, M; Lamberth, S; Van den Daele, H; De Keyser, A; Buysshaert, C; Gielen, J; Villarroel, R; De Clercq, R; Van Montagu, M; Rogers, J; Cronin, A; Quail, M; Bray-Allen, S; Clark, L; Doggett, J; Hall, S; Kay, M; Lennard, N; McLay, K; Mayes, R; Pettett, A; Rajandream, M A; Lyne, M; Benes, V; Rechmann, S; Borkova, D; Blöcker, H; Scharfe, M; Grimm, M; Löhnert, T H; Dose, S; de Haan, M; Maarse, A; Schäfer, M; Müller-Auer, S; Gabel, C; Fuchs, M; Fartmann, B; Granderath, K; Dauner, D; Herzl, A; Neumann, S; Argiriou, A; Vitale, D; Liguori, R; Piravandi, E; Massenet, O; Quigley, F; Clabauld, G; Mündlein, A; Felber, R; Schnabl, S; Hiller, R; Schmidt, W; Lecharny, A; Aubourg, S; Chefdor, F; Cooke, R; Berger, C; Montfort, A; Casacuberta, E; Gibbons, T; Weber, N; Vandenbol, M; Bargues, M; Terol, J; Torres, A; Perez-Perez, A; Purnelle, B; Bent, E; Johnson, S; Tacon, D; Jesse, T; Heijnen, L; Schwarz, S; Scholler, P; Heber, S; Francs, P; Bielke, C; Frishman, D; Haase, D; Lemcke, K; Mewes, H W; Stocker, S; Zaccaria, P; Bevan, M; Wilson, R K; de la Bastide, M; Habermann, K; Parnell, L; Dedhia, N; Gnoj, L; Schutz, K; Huang, E; Spiegel, L; Sehkon, M; Murray, J; Sheet, P; Cordes, M; Abu-Threideh, J; Stoneking, T; Kalicki, J; Graves, T; Harmon, G; Edwards, J; Latreille, P; Courtney, L; Cloud, J; Abbott, A; Scott, K; Johnson, D; Minx, P; Bentley, D; Fulton, B; Miller, N; Greco, T; Kemp, K; Kramer, J; Fulton, L; Mardis, E; Dante, M; Pepin, K; Hillier, L; Nelson, J; Spieth, J; Ryan, E; Andrews, S; Geisel, C; Layman, D; Du, H; Ali, J; Berghoff, A; Jones, K; Drone, K; Cotton, M; Joshu, C; Antonoiu, B; Zidanic, M; Strong, C; Sun, H; Lamar, B; Yordan, C; Ma, P; Zhong, J; Preston, R; Vil, D; Shekher, M; Matero, A; Shah, R; Swaby, I K; O'Shaughnessy, A; Rodriguez, M; Hoffmann, J; Till, S; Granat, S; Shohdy, N; Hasegawa, A; Hameed, A; Lodhi, M; Johnson, A; Chen, E; Marra, M; Martienssen, R; McCombie, W R
1999-12-16
The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.
Complete Mitochondrial Genome of Echinostoma hortense (Digenea: Echinostomatidae).
Liu, Ze-Xuan; Zhang, Yan; Liu, Yu-Ting; Chang, Qiao-Cheng; Su, Xin; Fu, Xue; Yue, Dong-Mei; Gao, Yuan; Wang, Chun-Ren
2016-04-01
Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not been known yet. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum gathered together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans.
Complete Mitochondrial Genome of Echinostoma hortense (Digenea: Echinostomatidae)
Liu, Ze-Xuan; Zhang, Yan; Liu, Yu-Ting; Chang, Qiao-Cheng; Su, Xin; Fu, Xue; Yue, Dong-Mei; Gao, Yuan; Wang, Chun-Ren
2016-01-01
Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not been known yet. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum gathered together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans. PMID:27180575
Zhang, Pengpeng; Xu, Haixia; Li, Rui; Wu, Wei; Chao, Zhe; Li, Cencen; Xia, Wei; Wang, Lei; Yang, Jinzeng; Xu, Yongjie
2018-06-01
Myoblast differentiation is a highly complex process that is regulated by proteins as well as by non-coding RNAs. Circular RNAs have been identified as an emerging new class of non-coding RNA in the modulation of skeletal muscle development, whereas their expression profiles and functional regulation in myoblast differentiation remain unknown. In the present study, we performed deep RNA-sequencing of C2C12 myoblasts during cell differentiation and uncovered 37,751 unique circular RNAs derived from 6943 hosting genes. The ensuing qRT-PCR and RNA fluorescence in situ hybridization verification were carried out to confirm the RNA-sequencing results. An unbiased analysis demonstrated dynamic circular RNA expression changes in the process of myoblast differentiation, and the circular RNA abundances were independent from their cognate linear RNAs. Gene ontology analysis showed that many down-regulated circular RNAs were exclusive to cell division and the cell cycle, whereas up-regulated circular RNAs were related to the cell development process. Furthermore, interaction networks of circular RNA-microRNA were constructed. Several microRNAs well-known for myoblast regulation, such as miR-133, miR-24 and miR-23a, were in this network. In summary, this study showed that circular RNA expression dynamics changed during myoblast differentiation. Circular RNAs play a role in regulating the myoblast cell cycle and development by acting as microRNA binding sites to facilitate their regulation of gene expression during myoblast differentiation. These findings open a new avenue for future investigation of this emerging RNA class in skeletal muscle growth and development. Copyright © 2018 Elsevier Ltd. All rights reserved.
Does CTCF mediate between nuclear organization and gene expression?
Ohlsson, Rolf; Lobanenkov, Victor; Klenova, Elena
2010-01-01
The multifunctional zinc-finger protein CCCTC-binding factor (CTCF) is a very strong candidate for the role of coordinating the expression level of coding sequences with their three-dimensional position in the nucleus, apparently responding to a "code" in the DNA itself. Dynamic interactions between chromatin fibers in the context of nuclear architecture have been implicated in various aspects of genome functions. However, the molecular basis of these interactions still remains elusive and is a subject of intense debate. Here we discuss the nature of CTCF-DNA interactions, the CTCF-binding specificity to its binding sites and the relationship between CTCF and chromatin, and we examine data linking CTCF with gene regulation in the three-dimensional nuclear space. We discuss why these features render CTCF a very strong candidate for the role and propose a unifying model, the "CTCF code," explaining the mechanistic basis of how the information encrypted in DNA may be interpreted by CTCF into diverse nuclear functions.
Bogdanov, Yuri F; Dadashev, Sergei Y; Grishaeva, Tatiana M
2003-01-01
Evolutionarily distant organisms have not only orthologs, but also nonhomologous proteins that build functionally similar subcellular structures. For instance, this is true with protein components of the synaptonemal complex (SC), a universal ultrastructure that ensures the successful pairing and recombination of homologous chromosomes during meiosis. We aimed at developing a method to search databases for genes that code for such nonhomologous but functionally analogous proteins. Advantage was taken of the ultrastructural parameters of SC and the conformation of SC proteins responsible for these. Proteins involved in SC central space are known to be similar in secondary structure. Using published data, we found a highly significant correlation between the width of the SC central space and the length of rod-shaped central domain of mammalian and yeast intermediate proteins forming transversal filaments in the SC central space. Basing on this, we suggested a method for searching genome databases of distant organisms for genes whose virtual proteins meet the above correlation requirement. Our recent finding of the Drosophila melanogaster CG17604 gene coding for synaptonemal complex transversal filament protein received experimental support from another lab. With the same strategy, we showed that the Arabidopsis thaliana and Caenorhabditis elegans genomes contain unique genes coding for such proteins.
Yi, Ruirong; Mukaiyama, Hiroyuki; Tachikawa, Takashi; Shimomura, Norihiro; Aimi, Tadanori
2010-01-01
In the bipolar basidiomycete Pholiota microspora, a pair of homeodomain protein genes located at the A-mating-type locus regulates mating compatibility. In the present study, we used a DNA-mediated transformation system in P. microspora to investigate the homeodomain proteins that control the clamp formation. When a single homeodomain protein gene (A3-hox1 or A3-hox2) from the A3 monokaryon strain was transformed into the A4 monokaryon strain, the transformants produced many pseudoclamps but very few clamps. When two homeodomain protein genes (A3-hox1 and A3-hox2) were transformed either separately or together into the A4 monokaryon, the ratio of clamps to the clamplike cells in the transformants was significantly increased to ca. 50%. We therefore concluded that the gene dosage of homeodomain protein genes is important for clamp formation. When the sip promoter was connected to the coding region of A3-hox1 and A3-hox2 and the fused fragments were introduced into NGW19-6 (A4), the transformants achieved more than 85% clamp formation and exhibited two nuclei per cell, similar to the dikaryon (NGW12-163 × NGW19-6). The results of real-time reverse transcription-PCR confirmed that sip promoter activity is greater than that of the native promoter of homeodomain protein genes in P. microspora. Thus, we concluded that nearly 100% clamp formation requires high expression levels of homeodomain protein genes and that altered expression of the A-mating-type genes alone is sufficient to drive true clamp formation. PMID:20453073
Energy metabolism in Desulfovibrio vulgaris Hildenborough: insights from transcriptome analysis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pereira, Patricia M.; He, Qiang; Valente, Filipa M.A.
2007-11-01
Sulphate-reducing bacteria are important players in the global sulphur and carbon cycles, with considerable economical and ecological impact. However, the process of sulphate respiration is still incompletely understood. Several mechanisms of energy conservation have been proposed, but it is unclear how the different strategies contribute to the overall process. In order to obtain a deeper insight into the energy metabolism of sulphate-reducers whole-genome microarrays were used to compare the transcriptional response of Desulfovibrio vulgaris Hildenborough grown with hydrogen/sulphate, pyruvate/sulphate, pyruvate with limiting sulphate, and lactate/thiosulphate, relative to growth in lactate/sulphate. Growth with hydrogen/sulphate showed the largest number of differentially expressedmore » genes and the largest changes in transcript levels. In this condition the most up-regulated energy metabolism genes were those coding for the periplasmic [NiFeSe]hydrogenase, followed by the Ech hydrogenase. The results also provide evidence for the involvement of formate cycling and the recently proposed ethanol pathway during growth in hydrogen. The pathway involving CO cycling is relevant during growth on lactate and pyruvate, but not during growth in hydrogen as the most down-regulated genes were those coding for the CO-induced hydrogenase. Growth on lactate/thiosulphate reveals a down-regulation of several energymetabolism genes similar to what was observed in the presence of nitrite. This study identifies the role of several proteins involved in the energy metabolism of D. vulgaris and highlights several novel genes related to this process, revealing a more complex bioenergetic metabolism than previously considered.« less
An Integrated Encyclopedia of DNA Elements in the Human Genome
2012-01-01
Summary The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure, and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall the project provides new insights into the organization and regulation of our genes and genome, and an expansive resource of functional annotations for biomedical research. PMID:22955616
Kaltner, H; Gabius, H-J
2012-04-01
Lectin histochemistry has revealed cell-type-selective glycosylation. It is under dynamic and spatially controlled regulation. Since their chemical properties allow carbohydrates to reach unsurpassed structural diversity in oligomers, they are ideal for high density information coding. Consequently, the concept of the sugar code assigns a functional dimension to the glycans of cellular glycoconjugates. Indeed, multifarious cell processes depend on specific recognition of glycans by their receptors (lectins), which translate the sugar-encoded information into effects. Duplication of ancestral genes and the following divergence of sequences account for the evolutionary dynamics in lectin families. Differences in gene number can even appear among closely related species. The adhesion/growth-regulatory galectins are selected as an instructive example to trace the phylogenetic diversification in several animals, most of them popular models in developmental and tumor biology. Chicken galectins are identified as a low-level-complexity set, thus singled out for further detailed analysis. The various operative means for establishing protein diversity among the chicken galectins are delineated, and individual characteristics in expression profiles discerned. To apply this galectin-fingerprinting approach in histopathology has potential for refining differential diagnosis and for obtaining prognostic assessments. On the grounds of in vitro work with tumor cells a strategically orchestrated co-regulation of galectin expression with presentation of cognate glycans is detected. This coordination epitomizes the far-reaching physiological significance of sugar coding.
microRNA in Cerebral Spinal Fluid as Biomarkers of Alzheimer’s Disease Risk After Brain Injury
2016-08-01
protein processing is a key feature of AD. MiRNAs are small non- coding RNA that regulate mRNA transcription, and may be a significant cause of protein...non- coding RNA that regulate mRNA transcription, and may be a significant cause of protein dysregulation. Our investigative team has generated
Gawin, Agnieszka; Valla, Svein; Brautaset, Trygve
2017-07-01
The XylS/Pm regulator/promoter system originating from the Pseudomonas putida TOL plasmid pWW0 is widely used for regulated low- and high-level recombinant expression of genes and gene clusters in Escherichia coli and other bacteria. Induction of this system can be graded by using different cheap benzoic acid derivatives, which enter cells by passive diffusion, operate in a dose-dependent manner and are typically not metabolized by the host cells. Combinatorial mutagenesis and selection using the bla gene encoding β-lactamase as a reporter have demonstrated that the Pm promoter, the DNA sequence corresponding to the 5' untranslated end of its cognate mRNA and the xylS coding region can be modified and improved relative to various types of applications. By combining such mutant genetic elements, altered and extended expression profiles were achieved. Due to their unique properties, obtained systems serve as a genetic toolbox valuable for heterologous protein production and metabolic engineering, as well as for basic studies aiming at understanding fundamental parameters affecting bacterial gene expression. The approaches used to modify XylS/Pm should be adaptable for similar improvements also of other microbial expression systems. In this review, we summarize constructions, characteristics, refinements and applications of expression tools using the XylS/Pm system. © 2017 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Recognition of Protein-coding Genes Based on Z-curve Algorithms
-Biao Guo, Feng; Lin, Yan; -Ling Chen, Ling
2014-01-01
Recognition of protein-coding genes, a classical bioinformatics issue, is an absolutely needed step for annotating newly sequenced genomes. The Z-curve algorithm, as one of the most effective methods on this issue, has been successfully applied in annotating or re-annotating many genomes, including those of bacteria, archaea and viruses. Two Z-curve based ab initio gene-finding programs have been developed: ZCURVE (for bacteria and archaea) and ZCURVE_V (for viruses and phages). ZCURVE_C (for 57 bacteria) and Zfisher (for any bacterium) are web servers for re-annotation of bacterial and archaeal genomes. The above four tools can be used for genome annotation or re-annotation, either independently or combined with the other gene-finding programs. In addition to recognizing protein-coding genes and exons, Z-curve algorithms are also effective in recognizing promoters and translation start sites. Here, we summarize the applications of Z-curve algorithms in gene finding and genome annotation. PMID:24822027
Hsu, Jack C-C; Reid, David W; Hoffman, Alyson M; Sarkar, Devanand; Nicchitta, Christopher V
2018-05-01
Astrocyte elevated gene-1 (AEG-1), an oncogene whose overexpression promotes tumor cell proliferation, angiogenesis, invasion, and enhanced chemoresistance, is thought to function primarily as a scaffolding protein, regulating PI3K/Akt and Wnt/β-catenin signaling pathways. Here we report that AEG-1 is an endoplasmic reticulum (ER) resident integral membrane RNA-binding protein (RBP). Examination of the AEG-1 RNA interactome by HITS-CLIP and PAR-CLIP methodologies revealed a high enrichment for endomembrane organelle-encoding transcripts, most prominently those encoding ER resident proteins, and within this cohort, for integral membrane protein-encoding RNAs. Cluster mapping of the AEG-1/RNA interaction sites demonstrated a normalized rank order interaction of coding sequence >5' untranslated region, with 3' untranslated region interactions only weakly represented. Intriguingly, AEG-1/membrane protein mRNA interaction sites clustered downstream from encoded transmembrane domains, suggestive of a role in membrane protein biogenesis. Secretory and cytosolic protein-encoding mRNAs were also represented in the AEG-1 RNA interactome, with the latter category notably enriched in genes functioning in mRNA localization, translational regulation, and RNA quality control. Bioinformatic analyses of RNA-binding motifs and predicted secondary structure characteristics indicate that AEG-1 lacks established RNA-binding sites though shares the property of high intrinsic disorder commonly seen in RBPs. These data implicate AEG-1 in the localization and regulation of secretory and membrane protein-encoding mRNAs and provide a framework for understanding AEG-1 function in health and disease. © 2018 Hsu et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Li, Shicheng; Sun, Xiao; Miao, Shuncheng; Liu, Jia; Jiao, Wenjie
2017-11-01
Cigarette smoking is one of the greatest preventable risk factors for developing cancer, and most cases of lung squamous cell carcinoma (lung SCC) are associated with smoking. The pathogenesis mechanism of tumor progress is unclear. This study aimed to identify biomarkers in smoking-related lung cancer, including protein-coding gene, long noncoding RNA, and transcription factors. We selected and obtained messenger RNA microarray datasets and clinical data from the Gene Expression Omnibus database to identify gene expression altered by cigarette smoking. Integrated bioinformatic analysis was used to clarify biological functions of the identified genes, including Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway, the construction of a protein-protein interaction network, transcription factor, and statistical analyses. Subsequent quantitative real-time PCR was utilized to verify these bioinformatic analyses. Five hundred and ninety-eight differentially expressed genes and 21 long noncoding RNA were identified in smoking-related lung SCC. GO and KEGG pathway analysis showed that identified genes were enriched in the cancer-related functions and pathways. The protein-protein interaction network revealed seven hub genes identified in lung SCC. Several transcription factors and their binding sites were predicted. The results of real-time quantitative PCR revealed that AURKA and BIRC5 were significantly upregulated and LINC00094 was downregulated in the tumor tissues of smoking patients. Further statistical analysis indicated that dysregulation of AURKA, BIRC5, and LINC00094 indicated poor prognosis in lung SCC. Protein-coding genes AURKA, BIRC5, and LINC00094 could be biomarkers or therapeutic targets for smoking-related lung SCC. © 2017 The Authors. Thoracic Cancer published by China Lung Oncology Group and John Wiley & Sons Australia, Ltd.
López-Ribera, Ignacio; La Paz, José Luis; Repiso, Carlos; García, Nora; Miquel, Mercè; Hernández, María Luisa; Martínez-Rivas, José Manuel; Vicient, Carlos M.
2014-01-01
A transcriptomic approach has been used to identify genes predominantly expressed in maize (Zea mays) scutellum during maturation. One of the identified genes is oil body associated protein1 (obap1), which is transcribed during seed maturation predominantly in the scutellum, and its expression decreases rapidly after germination. Proteins similar to OBAP1 are present in all plants, including primitive plants and mosses, and in some fungi and bacteria. In plants, obap genes are divided in two subfamilies. Arabidopsis (Arabidopsis thaliana) genome contains five genes coding for OBAP proteins. Arabidopsis OBAP1a protein is accumulated during seed maturation and disappears after germination. Agroinfiltration of tobacco (Nicotiana benthamiana) epidermal leaf cells with fusions of OBAP1 to yellow fluorescent protein and immunogold labeling of embryo transmission electron microscopy sections showed that OBAP1 protein is mainly localized in the surface of the oil bodies. OBAP1 protein was detected in the oil body cellular fraction of Arabidopsis embryos. Deletion analyses demonstrate that the most hydrophilic part of the protein is responsible for the oil body localization, which suggests an indirect interaction of OBAP1 with other proteins in the oil body surface. An Arabidopsis mutant with a transfer DNA inserted in the second exon of the obap1a gene and an RNA interference line against the same gene showed a decrease in the germination rate, a decrease in seed oil content, and changes in fatty acid composition, and their embryos have few, big, and irregular oil bodies compared with the wild type. Taken together, our findings suggest that OBAP1 protein is involved in the stability of oil bodies. PMID:24406791
Schnable, James C.; Pedersen, Brent S.; Subramaniam, Sabarinath; Freeling, Michael
2011-01-01
Whole genome duplications, or tetraploidies, are an important source of increased gene content. Following whole genome duplication, duplicate copies of many genes are lost from the genome. This loss of genes is biased both in the classes of genes deleted and the subgenome from which they are lost. Many or all classes are genes preferentially retained as duplicate copies are engaged in dose sensitive protein–protein interactions, such that deletion of any one duplicate upsets the status quo of subunit concentrations, and presumably lowers fitness as a result. Transcription factors are also preferentially retained following every whole genome duplications studied. This has been explained as a consequence of protein–protein interactions, just as for other highly retained classes of genes. We show that the quantity of conserved noncoding sequences (CNSs) associated with genes predicts the likelihood of their retention as duplicate pairs following whole genome duplication. As many CNSs likely represent binding sites for transcriptional regulators, we propose that the likelihood of gene retention following tetraploidy may also be influenced by dose–sensitive protein–DNA interactions between the regulatory regions of CNS-rich genes – nicknamed bigfoot genes – and the proteins that bind to them. Using grass genomes, we show that differential loss of CNSs from one member of a pair following the pre-grass tetraploidy reduces its chance of retention in the subsequent maize lineage tetraploidy. PMID:22645525
Mascarenhas, Roshan; Pietrzak, Maciej; Smith, Ryan M; Webb, Amy; Wang, Danxin; Papp, Audrey C; Pinsonneault, Julia K; Seweryn, Michal; Rempala, Grzegorz; Sadee, Wolfgang
2015-01-01
mRNA translation into proteins is highly regulated, but the role of mRNA isoforms, noncoding RNAs (ncRNAs), and genetic variants remains poorly understood. mRNA levels on polysomes have been shown to correlate well with expressed protein levels, pointing to polysomal loading as a critical factor. To study regulation and genetic factors of protein translation we measured levels and allelic ratios of mRNAs and ncRNAs (including microRNAs) in lymphoblast cell lines (LCL) and in polysomal fractions. We first used targeted assays to measure polysomal loading of mRNA alleles, confirming reported genetic effects on translation of OPRM1 and NAT1, and detecting no effect of rs1045642 (3435C>T) in ABCB1 (MDR1) on polysomal loading while supporting previous results showing increased mRNA turnover of the 3435T allele. Use of high-throughput sequencing of complete transcript profiles (RNA-Seq) in three LCLs revealed significant differences in polysomal loading of individual RNA classes and isoforms. Correlated polysomal distribution between protein-coding and non-coding RNAs suggests interactions between them. Allele-selective polysome recruitment revealed strong genetic influence for multiple RNAs, attributable either to differential expression of RNA isoforms or to differential loading onto polysomes, the latter defining a direct genetic effect on translation. Genes identified by different allelic RNA ratios between cytosol and polysomes were enriched with published expression quantitative trait loci (eQTLs) affecting RNA functions, and associations with clinical phenotypes. Polysomal RNA-Seq combined with allelic ratio analysis provides a powerful approach to study polysomal RNA recruitment and regulatory variants affecting protein translation.
Kim, Yoonhee; Zhang, Yinhua; Pang, Kaifang; Kang, Hyojin; Park, Heejoo; Lee, Yeunkum; Lee, Bokyoung; Lee, Heon-Jeong; Kim, Won-Ki; Geum, Dongho
2016-01-01
Bipolar disorder (BD), characterized by recurrent mood swings between depression and mania, is a highly heritable and devastating mental illness with poorly defined pathophysiology. Recent genome-wide molecular genetic studies have identified several protein-coding genes and microRNAs (miRNAs) significantly associated with BD. Notably, some of the proteins expressed from BD-associated genes function in neuronal synapses, suggesting that abnormalities in synaptic function could be one of the key pathogenic mechanisms of BD. In contrast, however, the role of BD-associated miRNAs in disease pathogenesis remains largely unknown, mainly because of a lack of understanding about their target mRNAs and pathways in neurons. To address this problem, in this study, we focused on a recently identified BD-associated but uncharacterized miRNA, miR-1908-5p. We identified and validated its novel target genes including DLGAP4, GRIN1, STX1A, CLSTN1 and GRM4, which all function in neuronal glutamatergic synapses. Moreover, bioinformatic analyses of human brain expression profiles revealed that the expression levels of miR-1908-5p and its synaptic target genes show an inverse-correlation in many brain regions. In our preliminary experiments, the expression of miR-1908-5p was increased after chronic treatment with valproate but not lithium in control human neural progenitor cells. In contrast, it was decreased by valproate in neural progenitor cells derived from dermal fibroblasts of a BD subject. Together, our results provide new insights into the potential role of miR-1908-5p in the pathogenesis of BD and also propose a hypothesis that neuronal synapses could be a key converging pathway of some BD-associated protein-coding genes and miRNAs. PMID:28035180
Valenzuela-Muñoz, Valentina; Valenzuela-Miranda, Diego; Gallardo-Escárate, Cristian
2018-05-24
The increasing capacity of transcriptomic analysis by high throughput sequencing has highlighted the presence of a large proportion of transcripts that do not encode proteins. In particular, long non-coding RNAs (lncRNAs) are sequences with low coding potential and conservation among species. Moreover, cumulative evidence has revealed important roles in post-transcriptional gene modulation in several taxa. In fish, the role of lncRNAs has been scarcely studied and even less so during the immune response against sea lice. In the present study we mined for lncRNAs in Atlantic salmon (Salmo salar) and Coho salmon (Oncorhynkus kisutch), which are affected by the sea louse Caligus rogercresseyi, evaluating the degree of sequence conservation between these two fish species and their putative roles during the infection process. Herein, Atlantic and Coho salmon were infected with 35 lice/fish and evaluated after 7 and 14 days post-infestation (dpi). For RNA sequencing, samples from skin and head kidney were collected. A total of 5658/4140 and 3678/2123 lncRNAs were identified in uninfected/infected Atlantic and Coho salmon transcriptomes, respectively. Species-specific transcription patterns were observed in exclusive lncRNAs according to the tissue analyzed. Furthermore, neighbor gene GO enrichment analysis of the top 100 highly regulated lncRNAs in Atlantic salmon showed that lncRNAs were localized near genes related to the immune response. On the other hand, in Coho salmon the highly regulated lncRNAs were localized near genes involved in tissue repair processes. This study revealed high regulation of lncRNAs closely localized to immune and tissue repair-related genes in Atlantic and Coho salmon, respectively, suggesting putative roles for lncRNAs in salmon against sea lice infestation. Copyright © 2018 Elsevier Ltd. All rights reserved.
GENCODE: the reference human genome annotation for The ENCODE Project.
Harrow, Jennifer; Frankish, Adam; Gonzalez, Jose M; Tapanari, Electra; Diekhans, Mark; Kokocinski, Felix; Aken, Bronwen L; Barrell, Daniel; Zadissa, Amonida; Searle, Stephen; Barnes, If; Bignell, Alexandra; Boychenko, Veronika; Hunt, Toby; Kay, Mike; Mukherjee, Gaurab; Rajan, Jeena; Despacio-Reyes, Gloria; Saunders, Gary; Steward, Charles; Harte, Rachel; Lin, Michael; Howald, Cédric; Tanzer, Andrea; Derrien, Thomas; Chrast, Jacqueline; Walters, Nathalie; Balasubramanian, Suganthi; Pei, Baikang; Tress, Michael; Rodriguez, Jose Manuel; Ezkurdia, Iakes; van Baren, Jeltje; Brent, Michael; Haussler, David; Kellis, Manolis; Valencia, Alfonso; Reymond, Alexandre; Gerstein, Mark; Guigó, Roderic; Hubbard, Tim J
2012-09-01
The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.
Singhal, Dinesh K; Singhal, Raxita; Malik, Hruda N; Kumar, Surender; Kumar, Sudarshan; Mohanty, Ashok K; Kaushik, Jai K; Malakar, Dhruba
2014-01-01
Nanog is a homeodomain containing protein which plays important roles in regulation of signaling pathways for maintenance and induction of pluripotency in stem cells. Because of its unique expression in stem cells it is also regarded as pluripotency marker. In this study goat Nanog (gNanog) gene has been amplified, cloned and characterized at sequence level with successful over-expression in CHO-K1 cell line using a lentiviral based system. gNanog ORF is 903 bp long which codes for Nanog protein of size 300 amino acids (aas). Complete nucleotide sequence shows some evolutionary mutation in goat in comparision to other species. Protein sequence of goat is highly similar to other species. Overall, gNanog nucleotide sequence and predicted protein sequence showed high similarity and minimum divergence with cattle (96 % identity/4 % divergence) and buffalo (94/5 %) while low similarity and high divergence with pig (84/15 %), human (81/23 %) and mouse (69/40 %) indicating evolutionary closeness of gNanog to cattle and buffalo. gNanog lentiviral expression construct was prepared for over-expression of Nanog gene in adult goat fibroblast cells. Lentiviral expression construct of Nanog enabled continuous protein expression for induction and maintenance of pluripotency. Western blotting revealed the expression of Nanog gene at protein level which supported that the lentiviral expression system is highly promising for Nanog protein expression in differentiated goat cell.
Hoe, Nicholas; Huang, Chung M.; Landis, Gary; Verhage, Marian; Ford, Daniel; Yang, Junsheng; van Leeuwen, Fred W.; Tower, John
2011-01-01
Molecular Misreading (MM) is the inaccurate conversion of genomic information into aberrant proteins. For example, when RNA polymerase II transcribes a GAGAG motif it synthesizes at low frequency RNA with a two-base deletion. If the deletion occurs in a coding region, translation will result in production of misframed proteins. During mammalian aging, misframed versions of human amyloid precursor protein (hApp) and ubiquitin (hUbb) accumulate in the aggregates characteristic of neurodegenerative diseases, suggesting dysfunctional degradation or clearance. Here cDNA clones encoding wild-type hUbb and the frame-shifted version hUbb+1 were expressed in transgenic Drosophila using the doxycycline-regulated system. Misframed proteins were abundantly produced, both from the transgenes and from endogenous Drosophila ubiquitin-encoding genes, and their abundance increased during aging in whole-fly extracts. Over-expression of wild-type hUbb, but not hUbb+1, was toxic during fly development. In contrast, when over-expressed specifically in adult flies, hUbb+1 caused small decreases in life span, whereas hUbb was associated with small increases, preferentially in males. The data suggest that MM occurs in Drosophila and that the resultant misframed proteins accumulate with age. MM of the ubiquitin gene can produce alternative ubiquitin gene products with different and sometimes opposing phenotypic effects. PMID:21415465
Comparison and correlation of Simple Sequence Repeats distribution in genomes of Brucella species
Kiran, Jangampalli Adi Pradeep; Chakravarthi, Veeraraghavulu Praveen; Kumar, Yellapu Nanda; Rekha, Somesula Swapna; Kruti, Srinivasan Shanthi; Bhaskar, Matcha
2011-01-01
Computational genomics is one of the important tools to understand the distribution of closely related genomes including simple sequence repeats (SSRs) in an organism, which gives valuable information regarding genetic variations. The central objective of the present study was to screen the SSRs distributed in coding and non-coding regions among different human Brucella species which are involved in a range of pathological disorders. Computational analysis of the SSRs in the Brucella indicates few deviations from expected random models. Statistical analysis also reveals that tri-nucleotide SSRs are overrepresented and tetranucleotide SSRs underrepresented in Brucella genomes. From the data, it can be suggested that over expressed tri-nucleotide SSRs in genomic and coding regions might be responsible in the generation of functional variation of proteins expressed which in turn may lead to different pathogenicity, virulence determinants, stress response genes, transcription regulators and host adaptation proteins of Brucella genomes. Abbreviations SSRs - Simple Sequence Repeats, ORFs - Open Reading Frames. PMID:21738309
Zhao, Yi; Tang, Liang; Li, Zhe; Jin, Jinpu; Luo, Jingchu; Gao, Ge
2015-04-18
Long-established protein-coding genes may lose their coding potential during evolution ("unitary gene loss"). Members of the Poaceae family are a major food source and represent an ideal model clade for plant evolution research. However, the global pattern of unitary gene loss in Poaceae genomes as well as the evolutionary fate of lost genes are still less-investigated and remain largely elusive. Using a locally developed pipeline, we identified 129 unitary gene loss events for long-established protein-coding genes from four representative species of Poaceae, i.e. brachypodium, rice, sorghum and maize. Functional annotation suggested that the lost genes in all or most of Poaceae species are enriched for genes involved in development and response to endogenous stimulus. We also found that 44 mutated genomic loci of lost genes, which we referred as relics, were still actively transcribed, and of which 84% (37 of 44) showed significantly differential expression across different tissues. More interestingly, we found that there were totally five expressed relics may function as competitive endogenous RNA in brachypodium, rice and sorghum genome. Based on comparative genomics and transcriptome data, we firstly compiled a comprehensive catalogue of unitary gene loss events in Poaceae species and characterized a statistically significant functional preference for these lost genes as well showed the potential of relics functioning as competitive endogenous RNAs in Poaceae genomes.
Singh, Vineet K; Ring, Robert P; Aswani, Vijay; Stemper, Mary E; Kislow, Jennifer; Ye, Zhan; Shukla, Sanjay K
2017-12-01
Staphylococcus aureus is an opportunistic human pathogen that can cause serious infections in humans. A plethora of known and putative virulence factors are produced by staphylococci that collectively orchestrate pathogenesis. Ear protein (Escherichia coli ampicillin resistance) in S. aureus is an exoprotein in COL strain, predicted to be a superantigen, and speculated to play roles in antibiotic resistance and virulence. The goal of this study was to determine if expression of ear is modulated by single nucleotide polymorphisms in its promoter and coding sequences and whether this gene plays roles in antibiotic resistance and virulence. Promoter, coding sequences and expression of the ear gene in clinical and carriage S. aureus strains with distinct genetic backgrounds were analysed. The JE2 strain and its isogenic ear mutant were used in a systemic infection mouse model to determine the competiveness of the ear mutant.Results/Key findings. The ear gene showed a variable expression, with USA300FPR3757 showing a high-level expression compared to many of the other strains tested including some showing negligible expression. Higher expression was associated with agr type 1 but not correlated with phylogenetic relatedness of the ear gene based upon single nucleotide polymorphisms in the promoter or coding regions suggesting a complex regulation. An isogenic JE2 (USA300 background) ear mutant showed no significant difference in its growth, antibiotic susceptibility or virulence in a mouse model. Our data suggests that despite being highly expressed in a USA300 genetic background, Ear is not a significant contributor to virulence in that strain.
Ren, Jindong; Du, Xue; Zeng, Tao; Chen, Li; Shen, Junda; Lu, Lizhi; Hu, Jianhong
2017-10-01
Long noncoding RNAs (lncRNAs) and divergently expressed genes exist widely in different tissues of mammals and birds, in which they are involved in various biological processes. However, there is limited information on their role in the regulation of normal biological processes during differentiation, development, and reproduction in birds. In this study, whole transcriptome strand-specific RNA sequencing of the ovary from young ducks (60days), first-laying ducks (160days), and old ducks, i.e., ducks that stopped laying eggs (490days) was performed. The lncRNAs and mRNAs from these ducks were systematically analyzed and identified by duck genome sequencing in the three study groups. The transcriptome from the duck ovary comprised 15,011 protein-coding genes and 2905 lncRNAs; all the lncRNAs were identified as novel long noncoding transcripts. The comparison of transcriptome data from different study groups identified 2240 divergent transcription genes and 135 divergently expressed lncRNAs, which differed among the groups; most of them were significantly downregulated with age. Among the divergent genes, 38 genes were related to the reproductive process and 6 genes were upregulated. Further prediction analysis revealed that 52 lncRNAs were closely correlated with divergent reproductive mRNAs. More importantly, 6 remarkable lncRNAs were correlated significantly with the conversion of the ovary in different phases. Our results aid in the understanding of the divergent transcriptome of duck ovary in different phases and the underlying mechanisms that drive the specificity of protein-coding genes and lncRNAs in duck ovary. Copyright © 2017. Published by Elsevier B.V.
Mutant phenotypes for thousands of bacterial genes of unknown function
Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan; ...
2018-05-16
One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less
Mutant phenotypes for thousands of bacterial genes of unknown function
DOE Office of Scientific and Technical Information (OSTI.GOV)
Price, Morgan N.; Wetmore, Kelly M.; Waters, R. Jordan
One-third of all protein-coding genes from bacterial genomes cannot be annotated with a function. Here, to investigate the functions of these genes, we present genome-wide mutant fitness data from 32 diverse bacteria across dozens of growth conditions. We identified mutant phenotypes for 11,779 protein-coding genes that had not been annotated with a specific function. Many genes could be associated with a specific condition because the gene affected fitness only in that condition, or with another gene in the same bacterium because they had similar mutant phenotypes. Of the poorly annotated genes, 2,316 had associations that have high confidence because theymore » are conserved in other bacteria. By combining these conserved associations with comparative genomics, we identified putative DNA repair proteins; in addition, we propose specific functions for poorly annotated enzymes and transporters and for uncharacterized protein families. Lastly, our study demonstrates the scalability of microbial genetics and its utility for improving gene annotations.« less
Huanca-Mamani, Wilson; Arias-Carrasco, Raúl; Cárdenas-Ninasivincha, Steffany; Rojas-Herrera, Marcelo; Sepúlveda-Hermosilla, Gonzalo; Caris-Maldonado, José Carlos; Bastías, Elizabeth; Maracaja-Coutinho, Vinicius
2018-03-20
Long non-coding RNAs (lncRNAs) have been defined as transcripts longer than 200 nucleotides, which lack significant protein coding potential and possess critical roles in diverse cellular processes. Long non-coding RNAs have recently been functionally characterized in plant stress-response mechanisms. In the present study, we perform a comprehensive identification of lncRNAs in response to combined stress induced by salinity and excess of boron in the Lluteño maize, a tolerant maize landrace from Atacama Desert, Chile. We use deep RNA sequencing to identify a set of 48,345 different lncRNAs, of which 28,012 (58.1%) are conserved with other maize (B73, Mo17 or Palomero), with the remaining 41.9% belonging to potentially Lluteño exclusive lncRNA transcripts. According to B73 maize reference genome sequence, most Lluteño lncRNAs correspond to intergenic transcripts. Interestingly, Lluteño lncRNAs presents an unusual overall higher expression compared to protein coding genes under exposure to stressed conditions. In total, we identified 1710 putatively responsive to the combined stressed conditions of salt and boron exposure. We also identified a set of 848 stress responsive potential trans natural antisense transcripts ( trans -NAT) lncRNAs, which seems to be regulating genes associated with regulation of transcription, response to stress, response to abiotic stimulus and participating of the nicotianamine metabolic process. Reverse transcription-quantitative PCR (RT-qPCR) experiments were performed in a subset of lncRNAs, validating their existence and expression patterns. Our results suggest that a diverse set of maize lncRNAs from leaves and roots is responsive to combined salt and boron stress, being the first effort to identify lncRNAs from a maize landrace adapted to extreme conditions such as the Atacama Desert. The information generated is a starting point to understand the genomic adaptabilities suffered by this maize to surpass this extremely stressed environment.
Huanca-Mamani, Wilson; Arias-Carrasco, Raúl; Cárdenas-Ninasivincha, Steffany; Rojas-Herrera, Marcelo; Sepúlveda-Hermosilla, Gonzalo; Caris-Maldonado, José Carlos; Bastías, Elizabeth; Maracaja-Coutinho, Vinicius
2018-01-01
Long non-coding RNAs (lncRNAs) have been defined as transcripts longer than 200 nucleotides, which lack significant protein coding potential and possess critical roles in diverse cellular processes. Long non-coding RNAs have recently been functionally characterized in plant stress–response mechanisms. In the present study, we perform a comprehensive identification of lncRNAs in response to combined stress induced by salinity and excess of boron in the Lluteño maize, a tolerant maize landrace from Atacama Desert, Chile. We use deep RNA sequencing to identify a set of 48,345 different lncRNAs, of which 28,012 (58.1%) are conserved with other maize (B73, Mo17 or Palomero), with the remaining 41.9% belonging to potentially Lluteño exclusive lncRNA transcripts. According to B73 maize reference genome sequence, most Lluteño lncRNAs correspond to intergenic transcripts. Interestingly, Lluteño lncRNAs presents an unusual overall higher expression compared to protein coding genes under exposure to stressed conditions. In total, we identified 1710 putatively responsive to the combined stressed conditions of salt and boron exposure. We also identified a set of 848 stress responsive potential trans natural antisense transcripts (trans-NAT) lncRNAs, which seems to be regulating genes associated with regulation of transcription, response to stress, response to abiotic stimulus and participating of the nicotianamine metabolic process. Reverse transcription-quantitative PCR (RT-qPCR) experiments were performed in a subset of lncRNAs, validating their existence and expression patterns. Our results suggest that a diverse set of maize lncRNAs from leaves and roots is responsive to combined salt and boron stress, being the first effort to identify lncRNAs from a maize landrace adapted to extreme conditions such as the Atacama Desert. The information generated is a starting point to understand the genomic adaptabilities suffered by this maize to surpass this extremely stressed environment. PMID:29558449
Circular RNAs: Unexpected outputs of many protein-coding genes
Wilusz, Jeremy E.
2017-01-01
ABSTRACT Pre-mRNAs from thousands of eukaryotic genes can be non-canonically spliced to generate circular RNAs, some of which accumulate to higher levels than their associated linear mRNA. Recent work has revealed widespread mechanisms that dictate whether the spliceosome generates a linear or circular RNA. For most genes, circular RNA biogenesis via backsplicing is far less efficient than canonical splicing, but circular RNAs can accumulate due to their long half-lives. Backsplicing is often initiated when complementary sequences from different introns base pair and bring the intervening splice sites close together. This process is further regulated by the combinatorial action of RNA binding proteins, which allow circular RNAs to be expressed in unique patterns. Some genes do not require complementary sequences to generate RNA circles and instead take advantage of exon skipping events. It is still unclear what most mature circular RNAs do, but future investigations into their functions will be facilitated by recently described methods to modulate circular RNA levels. PMID:27571848
Analysis of protein-coding genetic variation in 60,706 humans.
Lek, Monkol; Karczewski, Konrad J; Minikel, Eric V; Samocha, Kaitlin E; Banks, Eric; Fennell, Timothy; O'Donnell-Luria, Anne H; Ware, James S; Hill, Andrew J; Cummings, Beryl B; Tukiainen, Taru; Birnbaum, Daniel P; Kosmicki, Jack A; Duncan, Laramie E; Estrada, Karol; Zhao, Fengmei; Zou, James; Pierce-Hoffman, Emma; Berghout, Joanne; Cooper, David N; Deflaux, Nicole; DePristo, Mark; Do, Ron; Flannick, Jason; Fromer, Menachem; Gauthier, Laura; Goldstein, Jackie; Gupta, Namrata; Howrigan, Daniel; Kiezun, Adam; Kurki, Mitja I; Moonshine, Ami Levy; Natarajan, Pradeep; Orozco, Lorena; Peloso, Gina M; Poplin, Ryan; Rivas, Manuel A; Ruano-Rubio, Valentin; Rose, Samuel A; Ruderfer, Douglas M; Shakir, Khalid; Stenson, Peter D; Stevens, Christine; Thomas, Brett P; Tiao, Grace; Tusie-Luna, Maria T; Weisburd, Ben; Won, Hong-Hee; Yu, Dongmei; Altshuler, David M; Ardissino, Diego; Boehnke, Michael; Danesh, John; Donnelly, Stacey; Elosua, Roberto; Florez, Jose C; Gabriel, Stacey B; Getz, Gad; Glatt, Stephen J; Hultman, Christina M; Kathiresan, Sekar; Laakso, Markku; McCarroll, Steven; McCarthy, Mark I; McGovern, Dermot; McPherson, Ruth; Neale, Benjamin M; Palotie, Aarno; Purcell, Shaun M; Saleheen, Danish; Scharf, Jeremiah M; Sklar, Pamela; Sullivan, Patrick F; Tuomilehto, Jaakko; Tsuang, Ming T; Watkins, Hugh C; Wilson, James G; Daly, Mark J; MacArthur, Daniel G
2016-08-18
Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.
Bitrián, Marta; Roodbarkelari, Farshad; Horváth, Mihály; Koncz, Csaba
2011-03-01
Recombineering, permitting precise modification of genes within bacterial artificial chromosomes (BACs) through homologous recombination mediated by lambda phage-encoded Red proteins, is a widely used powerful tool in mouse, Caenorhabditis and Drosophila genetics. As Agrobacterium-mediated transfer of large DNA inserts from binary BACs and TACs into plants occurs at low frequency, recombineering is so far seldom exploited in the analysis of plant gene functions. We have constructed binary plant transformation vectors, which are suitable for gap-repair cloning of genes from BACs using recombineering methods previously developed for other organisms. Here we show that recombineering facilitates PCR-based generation of precise translational fusions between coding sequences of fluorescent reporter and plant proteins using galK-based exchange recombination. The modified target genes alone or as part of a larger gene cluster can be transferred by high-frequency gap-repair into plant transformation vectors, stably maintained in Agrobacterium and transformed without alteration into plants. Versatile application of plant BAC-recombineering is illustrated by the analysis of developmental regulation and cellular localization of interacting AKIN10 catalytic and SNF4 activating subunits of Arabidopsis Snf1-related (SnRK1) protein kinase using in vivo imaging. To validate full functionality and in vivo interaction of tagged SnRK1 subunits, it is demonstrated that immunoprecipitated SNF4-YFP is bound to a kinase that phosphorylates SnRK1 candidate substrates, and that the GFP- and YFP-tagged kinase subunits co-immunoprecipitate with endogenous wild type AKIN10 and SNF4. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.